Showing 1–50 of 111 results for author: Chi, H

  1. arXiv:2407.10153

    cs.CL cs.AI

    Look Within, Why LLMs Hallucinate: A Causal Perspective

    Authors: He Li, Haoang Chi, Mingyu Liu, Wenjing Yang

    Abstract: The emergence of large language models (LLMs) is a milestone in generative artificial intelligence, achieving significant success in text comprehension and generation tasks. Despite the tremendous success of LLMs in many downstream tasks, they suffer from severe hallucination problems, posing significant challenges to the practical applications of LLMs. Most of the works about LLMs' hallucinations…

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: 15 pages, 7 figures

  2. arXiv:2406.04520

    cs.CL cs.AI

    NATURAL PLAN: Benchmarking LLMs on Natural Language Planning

    Authors: Huaixiu Steven Zheng, Swaroop Mishra, Hugh Zhang, Xinyun Chen, Minmin Chen, Azade Nova, Le Hou, Heng-Tze Cheng, Quoc V. Le, Ed H. Chi, Denny Zhou

    Abstract: We introduce NATURAL PLAN, a realistic planning benchmark in natural language containing 3 key tasks: Trip Planning, Meeting Planning, and Calendar Scheduling. We focus our evaluation on the planning capabilities of LLMs with full information on the task, by providing outputs from tools such as Google Flights, Google Maps, and Google Calendar as contexts to the models. This eliminates the need for…

    Submitted 6 June, 2024; originally announced June 2024.

  3. arXiv:2405.14677

    cs.CV cs.LG

    RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance

    Authors: Zhicheng Sun, Zhenhao Yang, Yang Jin, Haozhe Chi, Kun Xu, Kun Xu, Liwei Chen, Hao Jiang, Di Zhang, Yang Song, Kun Gai, Yadong Mu

    Abstract: Customizing diffusion models to generate identity-preserving images from user-provided reference images is an intriguing new problem. The prevalent approaches typically require training on extensive domain-specific images to achieve identity preservation, which lacks flexibility across different use cases. To address this issue, we exploit classifier guidance, a training-free technique that steers…

    Submitted 23 May, 2024; originally announced May 2024.

  4. arXiv:2405.12327

    cs.IR cs.LG

    Diversifying by Intent in Recommender Systems

    Authors: Yuyan Wang, Cheenar Banerjee, Samer Chucri, Fabio Soldo, Sriraj Badam, Ed H. Chi, Minmin Chen

    Abstract: It has become increasingly clear that recommender systems overly focusing on short-term engagement can inadvertently hurt long-term user experience. However, it is challenging to optimize long-term user experience directly as the desired signal is sparse, noisy and manifests over a long horizon. In this work, we show the benefits of incorporating higher-level user understanding, specifically user…

    Submitted 20 May, 2024; originally announced May 2024.

  5. arXiv:2405.00378

    cs.CV

    Adaptive Bidirectional Displacement for Semi-Supervised Medical Image Segmentation

    Authors: Hanyang Chi, Jian Pang, Bingfeng Zhang, Weifeng Liu

    Abstract: Consistency learning is a central strategy to tackle unlabeled data in semi-supervised medical image segmentation (SSMIS), which enforces the model to produce consistent predictions under the perturbation. However, most current approaches solely focus on utilizing a specific single perturbation, which can only cope with limited cases, while employing multiple perturbations simultaneously is hard t…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: Accepted to CVPR 2024

  6. arXiv:2404.00245

    cs.IR

    Aligning Large Language Models with Recommendation Knowledge

    Authors: Yuwei Cao, Nikhil Mehta, Xinyang Yi, Raghunandan Keshavan, Lukasz Heldt, Lichan Hong, Ed H. Chi, Maheswaran Sathiamoorthy

    Abstract: Large language models (LLMs) have recently been used as backbones for recommender systems. However, their performance often lags behind conventional methods in standard tasks like retrieval. We attribute this to a mismatch between LLMs' knowledge and the knowledge crucial for effective recommendations. While LLMs excel at natural language reasoning, they cannot model complex user-item interactions…

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: Accepted to the NAACL 2024 Findings

  7. arXiv:2402.15680

    cs.LG

    Overcoming Pitfalls in Graph Contrastive Learning Evaluation: Toward Comprehensive Benchmarks

    Authors: Qian Ma, Hongliang Chi, Hengrui Zhang, Kay Liu, Zhiwei Zhang, Lu Cheng, Suhang Wang, Philip S. Yu, Yao Ma

    Abstract: The rise of self-supervised learning, which operates without the need for labeled data, has garnered significant interest within the graph learning community. This enthusiasm has led to the development of numerous Graph Contrastive Learning (GCL) techniques, all aiming to create a versatile graph encoder that leverages the wealth of unlabeled data for various downstream tasks. However, the current…

    Submitted 23 February, 2024; originally announced February 2024.

  8. arXiv:2402.14035

    cs.LG cs.AI

    Wisdom of Committee: Distilling from Foundation Model to Specialized Application Model

    Authors: Zichang Liu, Qingyun Liu, Yuening Li, Liang Liu, Anshumali Shrivastava, Shuchao Bi, Lichan Hong, Ed H. Chi, Zhe Zhao

    Abstract: Recent advancements in foundation models have yielded impressive performance across a wide range of tasks. Meanwhile, for specific applications, practitioners have been developing specialized application models. To enjoy the benefits of both kinds of models, one natural path is to transfer the knowledge in foundation models into specialized application models, which are generally more efficient fo…

    Submitted 15 May, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  9. arXiv:2402.09668

    cs.LG cs.AI cs.CL

    How to Train Data-Efficient LLMs

    Authors: Noveen Sachdeva, Benjamin Coleman, Wang-Cheng Kang, Jianmo Ni, Lichan Hong, Ed H. Chi, James Caverlee, Julian McAuley, Derek Zhiyuan Cheng

    Abstract: The training of large language models (LLMs) is expensive. In this paper, we study data-efficient approaches for pre-training LLMs, i.e., techniques that aim to optimize the Pareto frontier of model quality and training resource/data consumption. We seek to understand the tradeoffs associated with data selection routines based on (i) expensive-to-compute data-quality estimates, and (ii) maximizati…

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: Under review. 44 pages, 30 figures

  10. arXiv:2402.04644

    cs.LG cs.AI

    LEVI: Generalizable Fine-tuning via Layer-wise Ensemble of Different Views

    Authors: Yuji Roh, Qingyun Liu, Huan Gui, Zhe Yuan, Yujin Tang, Steven Euijong Whang, Liang Liu, Shuchao Bi, Lichan Hong, Ed H. Chi, Zhe Zhao

    Abstract: Fine-tuning is becoming widely used for leveraging the power of pre-trained foundation models in new downstream tasks. While there are many successes of fine-tuning on various tasks, recent studies have observed challenges in the generalization of fine-tuned models to unseen distributions (i.e., out-of-distribution; OOD). To improve OOD generalization, some previous studies identify the limitation…

    Submitted 18 June, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: In Proceedings of the 41st International Conference on Machine Learning (ICML), 2024

  11. arXiv:2402.03620

    cs.AI cs.CL

    Self-Discover: Large Language Models Self-Compose Reasoning Structures

    Authors: Pei Zhou, Jay Pujara, Xiang Ren, Xinyun Chen, Heng-Tze Cheng, Quoc V. Le, Ed H. Chi, Denny Zhou, Swaroop Mishra, Huaixiu Steven Zheng

    Abstract: We introduce SELF-DISCOVER, a general framework for LLMs to self-discover the task-intrinsic reasoning structures to tackle complex reasoning problems that are challenging for typical prompting methods. Core to the framework is a self-discovery process where LLMs select multiple atomic reasoning modules such as critical thinking and step-by-step thinking, and compose them into an explicit reasonin…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 17 pages, 11 figures, 5 tables

  12. arXiv:2402.02321

    cs.LG

    Active Learning for Graphs with Noisy Structures

    Authors: Hongliang Chi, Cong Qi, Suhang Wang, Yao Ma

    Abstract: Graph Neural Networks (GNNs) have seen significant success in tasks such as node classification, largely contingent upon the availability of sufficient labeled nodes. Yet, the excessive cost of labeling large-scale graphs led to a focus on active learning on graphs, which aims for effective data selection to maximize downstream model performance. Notably, most existing methods assume reliable grap…

    Submitted 3 February, 2024; originally announced February 2024.

  13. arXiv:2402.01943

    cs.LG

    Precedence-Constrained Winter Value for Effective Graph Data Valuation

    Authors: Hongliang Chi, Wei Jin, Charu Aggarwal, Yao Ma

    Abstract: Data valuation is essential for quantifying data's worth, aiding in assessing data quality and determining fair compensation. While existing data valuation methods have proven effective in evaluating the value of Euclidean data, they face limitations when applied to the increasingly popular graph-structured data. Particularly, graph data valuation introduces unique challenges, primarily stemming f…

    Submitted 8 March, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: 17 pages in total

  14. arXiv:2402.00977

    cs.CV eess.IV

    Enhanced fringe-to-phase framework using deep learning

    Authors: Won-Hoe Kim, Bongjoong Kim, Hyung-Gun Chi, Jae-Sang Hyun

    Abstract: In Fringe Projection Profilometry (FPP), achieving robust and accurate 3D reconstruction with a limited number of fringe patterns remains a challenge in structured light 3D imaging. Conventional methods require a set of fringe images, but using only one or two patterns complicates phase recovery and unwrapping. In this study, we introduce SFNet, a symmetric fusion network that transforms two fring…

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 35 pages, 13 figures, 6 tables

  15. arXiv:2312.16243

    cs.LG

    Mixture Data for Training Cannot Ensure Out-of-distribution Generalization

    Authors: Songming Zhang, Yuxiao Luo, Qizhou Wang, Haoang Chi, Xiaofeng Chen, Bo Han, Jinyan Li

    Abstract: Deep neural networks often face generalization problems to handle out-of-distribution (OOD) data, and there remains a notable theoretical gap between the contributing factors and their respective impacts. Literature evidence from in-distribution data has suggested that generalization error can shrink if the size of mixture data for training increases. However, when it comes to OOD samples, this co…

    Submitted 23 April, 2024; v1 submitted 25 December, 2023; originally announced December 2023.

    Comments: 13 pages, 9 figures

  16. arXiv:2311.05884

    cs.IR cs.LG

    Hiformer: Heterogeneous Feature Interactions Learning with Transformers for Recommender Systems

    Authors: Huan Gui, Ruoxi Wang, Ke Yin, Long Jin, Maciej Kula, Taibai Xu, Lichan Hong, Ed H. Chi

    Abstract: Learning feature interaction is the critical backbone to building recommender systems. In web-scale applications, learning feature interaction is extremely challenging due to the sparse and large input feature space; meanwhile, manually crafting effective feature interactions is infeasible because of the exponential solution space. We propose to leverage a Transformer-based architecture with atten…

    Submitted 10 November, 2023; originally announced November 2023.

  17. arXiv:2310.10547

    cs.CV

    InfoGCN++: Learning Representation by Predicting the Future for Online Human Skeleton-based Action Recognition

    Authors: Seunggeun Chi, Hyung-gun Chi, Qixing Huang, Karthik Ramani

    Abstract: Skeleton-based action recognition has made significant advancements recently, with models like InfoGCN showcasing remarkable accuracy. However, these models exhibit a key limitation: they necessitate complete action observation prior to classification, which constrains their applicability in real-time situations such as surveillance and robotic systems. To overcome this barrier, we introduce InfoG…

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  18. arXiv:2310.06117

    cs.LG cs.AI cs.CL

    Take a Step Back: Evoking Reasoning via Abstraction in Large Language Models

    Authors: Huaixiu Steven Zheng, Swaroop Mishra, Xinyun Chen, Heng-Tze Cheng, Ed H. Chi, Quoc V. Le, Denny Zhou

    Abstract: We present Step-Back Prompting, a simple prompting technique that enables LLMs to do abstractions to derive high-level concepts and first principles from instances containing specific details. Using the concepts and principles to guide reasoning, LLMs significantly improve their abilities in following a correct reasoning path towards the solution. We conduct experiments of Step-Back Prompting with…

    Submitted 12 March, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: ICLR 2024

  19. arXiv:2310.03188

    cs.AI

    Talking Models: Distill Pre-trained Knowledge to Downstream Models via Interactive Communication

    Authors: Zhe Zhao, Qingyun Liu, Huan Gui, Bang An, Lichan Hong, Ed H. Chi

    Abstract: Many recent breakthroughs in machine learning have been enabled by the pre-trained foundation models. By scaling up model parameters, training data, and computation resources, foundation models have significantly advanced the state-of-the-art in many applications. However, it is still an open question of how to use these models to perform downstream tasks efficiently. Knowledge distillation (KD) h…

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: 19 pages, 3 figures

  20. arXiv:2310.01714

    cs.LG

    Large Language Models as Analogical Reasoners

    Authors: Michihiro Yasunaga, Xinyun Chen, Yujia Li, Panupong Pasupat, Jure Leskovec, Percy Liang, Ed H. Chi, Denny Zhou

    Abstract: Chain-of-thought (CoT) prompting for language models demonstrates impressive performance across reasoning tasks, but typically needs labeled exemplars of the reasoning process. In this work, we introduce a new prompting approach, analogical prompting, designed to automatically guide the reasoning process of large language models. Inspired by analogical reasoning, a cognitive process in which human…

    Submitted 9 March, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Published at ICLR 2024

  21. arXiv:2309.12565

    cs.IR

    Modeling Spatiotemporal Periodicity and Collaborative Signal for Local-Life Service Recommendation

    Authors: Huixuan Chi, Hao Xu, Mengya Liu, Yuanchen Bei, Sheng Zhou, Danyang Liu, Mengdi Zhang

    Abstract: Online local-life service platforms provide services like nearby daily essentials and food delivery for hundreds of millions of users. Different from other types of recommender systems, local-life service recommendation has the following characteristics: (1) spatiotemporal periodicity, which means a user's preferences for items vary from different locations at different times. (2) spatiotemporal c…

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: KDAH CIKM'23 Workshop

  22. arXiv:2308.01563

    cs.IR

    Density Weighting for Multi-Interest Personalized Recommendation

    Authors: Nikhil Mehta, Anima Singh, Xinyang Yi, Sagar Jain, Lichan Hong, Ed H. Chi

    Abstract: Using multiple user representations (MUR) to model user behavior instead of a single user representation (SUR) has been shown to improve personalization in recommendation systems. However, the performance gains observed with MUR can be sensitive to the skewness in the item and/or user interest distribution. When the data distribution is highly skewed, the gains observed by learning multiple repres…

    Submitted 3 August, 2023; originally announced August 2023.

  23. arXiv:2307.16449

    cs.CV

    MovieChat: From Dense Token to Sparse Memory for Long Video Understanding

    Authors: Enxin Song, Wenhao Chai, Guanhong Wang, Yucheng Zhang, Haoyang Zhou, Feiyang Wu, Haozhe Chi, Xun Guo, Tian Ye, Yanting Zhang, Yan Lu, Jenq-Neng Hwang, Gaoang Wang

    Abstract: Recently, integrating video foundation models and large language models to build a video understanding system can overcome the limitations of specific pre-defined vision tasks. Yet, existing systems can only handle videos with very few frames. For long videos, the computation complexity, memory cost, and long-term temporal connection impose additional challenges. Taking advantage of the Atkinson-S…

    Submitted 9 March, 2024; v1 submitted 31 July, 2023; originally announced July 2023.

    Comments: CVPR 2024. First three authors contribute equally to this work. Project Website https://rese1f.github.io/MovieChat/

  24. arXiv:2307.15893

    cs.LG

    Online Matching: A Real-time Bandit System for Large-scale Recommendations

    Authors: Xinyang Yi, Shao-Chuan Wang, Ruining He, Hariharan Chandrasekaran, Charles Wu, Lukasz Heldt, Lichan Hong, Minmin Chen, Ed H. Chi

    Abstract: The last decade has witnessed many successes of deep learning-based models for industry-scale recommender systems. These models are typically trained offline in a batch manner. While being effective in capturing users' past interactions with recommendation platforms, batch learning suffers from long model-update latency and is vulnerable to system biases, making it hard to adapt to distribution sh…

    Submitted 29 July, 2023; originally announced July 2023.

    Comments: RecSys 2023

  25. arXiv:2307.05948

    cs.LG

    Diversity-enhancing Generative Network for Few-shot Hypothesis Adaptation

    Authors: Ruijiang Dong, Feng Liu, Haoang Chi, Tongliang Liu, Mingming Gong, Gang Niu, Masashi Sugiyama, Bo Han

    Abstract: Generating unlabeled data has been recently shown to help address the few-shot hypothesis adaptation (FHA) problem, where we aim to train a classifier for the target domain with a few labeled target-domain data and a well-trained source-domain classifier (i.e., a source hypothesis), for the additional information of the highly-compatible unlabeled data. However, the generated data of the existing…

    Submitted 12 July, 2023; originally announced July 2023.

  26. arXiv:2307.02813

    cs.LG cs.SI

    CPDG: A Contrastive Pre-Training Method for Dynamic Graph Neural Networks

    Authors: Yuanchen Bei, Hao Xu, Sheng Zhou, Huixuan Chi, Haishuai Wang, Mengdi Zhang, Zhao Li, Jiajun Bu

    Abstract: Dynamic graph data mining has gained popularity in recent years due to the rich information contained in dynamic graphs and their widespread use in the real world. Despite the advances in dynamic graph neural networks (DGNNs), the rich information and diverse downstream tasks have posed significant difficulties for the practical application of DGNNs in industrial scenarios. To this end, in this pa…

    Submitted 24 December, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: 14 pages, 8 figures, accepted by ICDE2024

  27. arXiv:2306.08121

    cs.IR cs.LG

    Better Generalization with Semantic IDs: A Case Study in Ranking for Recommendations

    Authors: Anima Singh, Trung Vu, Nikhil Mehta, Raghunandan Keshavan, Maheswaran Sathiamoorthy, Yilin Zheng, Lichan Hong, Lukasz Heldt, Li Wei, Devansh Tandon, Ed H. Chi, Xinyang Yi

    Abstract: Randomly-hashed item ids are used ubiquitously in recommendation models. However, the learned representations from random hashing prevent generalization across similar items, causing problems of learning unseen and long-tail items, especially when item corpus is large, power-law distributed, and evolving dynamically. In this paper, we propose using content-derived features as a replacement for ra…

    Submitted 30 May, 2024; v1 submitted 13 June, 2023; originally announced June 2023.

  28. arXiv:2306.02544

    cs.CV

    Fourier Test-time Adaptation with Multi-level Consistency for Robust Classification

    Authors: Yuhao Huang, Xin Yang, Xiaoqiong Huang, Xinrui Zhou, Haozhe Chi, Haoran Dou, Xindi Hu, Jian Wang, Xuedong Deng, Dong Ni

    Abstract: Deep classifiers may encounter significant performance degradation when processing unseen testing data from varying centers, vendors, and protocols. Ensuring the robustness of deep models against these domain shifts is crucial for their widespread clinical application. In this study, we propose a novel approach called Fourier Test-time Adaptation (FTTA), which employs a dual-adaptation design to i…

    Submitted 4 June, 2023; originally announced June 2023.

    Comments: Accepted by MICCAI 2023

  29. arXiv:2306.01720

    cs.IR

    Fresh Content Needs More Attention: Multi-funnel Fresh Content Recommendation

    Authors: Jianling Wang, Haokai Lu, Sai Zhang, Bart Locanthi, Haoting Wang, Dylan Greaves, Benjamin Lipshitz, Sriraj Badam, Ed H. Chi, Cristos Goodrow, Su-Lin Wu, Lexi Baugher, Minmin Chen

    Abstract: A recommendation system serves as a conduit connecting users to an incredibly large, diverse and ever growing collection of contents. In practice, missing information on fresh (and tail) contents needs to be filled in order for them to be exposed and discovered by their audience. We here share our success stories in building a dedicated fresh content recommendation stack on a large commercial platfo…

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: Accepted by KDD 2023

  30. arXiv:2306.01476

    cs.IR cs.LG

    Hierarchical Reinforcement Learning for Modeling User Novelty-Seeking Intent in Recommender Systems

    Authors: Pan Li, Yuyan Wang, Ed H. Chi, Minmin Chen

    Abstract: Recommending novel content, which expands user horizons by introducing them to new interests, has been shown to improve users' long-term experience on recommendation platforms \cite{chen2021values}. Users however are not constantly looking to explore novel content. It is therefore crucial to understand their novelty-seeking intent and adjust the recommendation policy accordingly. Most existing lit…

    Submitted 2 June, 2023; originally announced June 2023.

  31. arXiv:2306.01475

    cs.IR cs.LG

    Prompt Tuning Large Language Models on Personalized Aspect Extraction for Recommendations

    Authors: Pan Li, Yuyan Wang, Ed H. Chi, Minmin Chen

    Abstract: Existing aspect extraction methods mostly rely on explicit or ground truth aspect information, or using data mining or machine learning approaches to extract aspects from implicit user feedback such as user reviews. It however remains under-explored how the extracted aspects can help generate more meaningful recommendations to the users. Meanwhile, existing research on aspect-based recommendations…

    Submitted 2 June, 2023; originally announced June 2023.

  32. arXiv:2305.17386

    cs.IR cs.LG

    HyperFormer: Learning Expressive Sparse Feature Representations via Hypergraph Transformer

    Authors: Kaize Ding, Albert Jiongqian Liang, Bryan Perozzi, Ting Chen, Ruoxi Wang, Lichan Hong, Ed H. Chi, Huan Liu, Derek Zhiyuan Cheng

    Abstract: Learning expressive representations for high-dimensional yet sparse features has been a longstanding problem in information retrieval. Though recent deep learning methods can partially solve the problem, they often fail to handle the numerous sparse features, particularly those tail feature values with infrequent occurrences in the training data. Worse still, existing methods cannot explicitly lev…

    Submitted 27 May, 2023; originally announced May 2023.

    Comments: Accepted by SIGIR 2023

  33. arXiv:2305.15498

    cs.CL cs.AI cs.IR

    Large Language Models for User Interest Journeys

    Authors: Konstantina Christakopoulou, Alberto Lalama, Cj Adams, Iris Qu, Yifat Amir, Samer Chucri, Pierce Vollucci, Fabio Soldo, Dina Bseiso, Sarah Scodel, Lucas Dixon, Ed H. Chi, Minmin Chen

    Abstract: Large language models (LLMs) have shown impressive capabilities in natural language understanding and generation. Their potential for deeper user understanding and improved personalized user experience on recommendation platforms is, however, largely untapped. This paper aims to address this gap. Recommender systems today capture users' interests through encoding their historical activities on the…

    Submitted 24 May, 2023; originally announced May 2023.

  34. arXiv:2305.13535

    cs.CL cs.LG

    Improving Classifier Robustness through Active Generation of Pairwise Counterfactuals

    Authors: Ananth Balashankar, Xuezhi Wang, Yao Qin, Ben Packer, Nithum Thain, Jilin Chen, Ed H. Chi, Alex Beutel

    Abstract: Counterfactual Data Augmentation (CDA) is a commonly used technique for improving robustness in natural language classifiers. However, one fundamental challenge is how to discover meaningful counterfactuals and efficiently label them, with minimal human labeling cost. Most existing methods either completely rely on human-annotated labels, an expensive process which limits the scale of counterfactu…

    Submitted 22 May, 2023; originally announced May 2023.

  35. arXiv:2305.12102

    cs.LG cs.IR

    Unified Embedding: Battle-Tested Feature Representations for Web-Scale ML Systems

    Authors: Benjamin Coleman, Wang-Cheng Kang, Matthew Fahrbach, Ruoxi Wang, Lichan Hong, Ed H. Chi, Derek Zhiyuan Cheng

    Abstract: Learning high-quality feature embeddings efficiently and effectively is critical for the performance of web-scale machine learning systems. A typical model ingests hundreds of features with vocabularies on the order of millions to billions of tokens. The standard approach is to represent each feature value as a d-dimensional embedding, introducing hundreds of billions of parameters for extremely h…

    Submitted 14 November, 2023; v1 submitted 20 May, 2023; originally announced May 2023.

    Comments: NeurIPS'23 Spotlight

    Journal ref: Proceedings of the 37th Annual Conference on Neural Information Processing Systems (NeurIPS 2023) 56234-56255

  36. arXiv:2305.05065

    cs.IR cs.LG

    Recommender Systems with Generative Retrieval

    Authors: Shashank Rajput, Nikhil Mehta, Anima Singh, Raghunandan H. Keshavan, Trung Vu, Lukasz Heldt, Lichan Hong, Yi Tay, Vinh Q. Tran, Jonah Samost, Maciej Kula, Ed H. Chi, Maheswaran Sathiamoorthy

    Abstract: Modern recommender systems perform large-scale retrieval by first embedding queries and item candidates in the same unified space, followed by approximate nearest neighbor search to select top candidates given a query embedding. In this paper, we propose a novel generative retrieval approach, where the retrieval model autoregressively decodes the identifiers of the target candidates. To that end,…

    Submitted 3 November, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: To appear in The 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  37. arXiv:2304.14660

    eess.IV cs.CV cs.LG

    Segment Anything Model for Medical Images?

    Authors: Yuhao Huang, Xin Yang, Lian Liu, Han Zhou, Ao Chang, Xinrui Zhou, Rusi Chen, Junxuan Yu, Jiongquan Chen, Chaoyu Chen, Sijing Liu, Haozhe Chi, Xindi Hu, Kejuan Yue, Lei Li, Vicente Grau, Deng-Ping Fan, Fajin Dong, Dong Ni

    Abstract: The Segment Anything Model (SAM) is the first foundation model for general image segmentation. It has achieved impressive results on various natural image segmentation tasks. However, medical image segmentation (MIS) is more challenging because of the complex modalities, fine anatomical structures, uncertain and complex object boundaries, and wide-range object scales. To fully validate SAM's perfo…

    Submitted 17 January, 2024; v1 submitted 28 April, 2023; originally announced April 2023.

    Comments: Accepted by Medical Image Analysis. 23 pages, 18 figures, 8 tables

  38. arXiv:2303.16005

    cs.CV

    Uncovering the Missing Pattern: Unified Framework Towards Trajectory Imputation and Prediction

    Authors: Yi Xu, Armin Bazarjani, Hyung-gun Chi, Chiho Choi, Yun Fu

    Abstract: Trajectory prediction is a crucial undertaking in understanding entity movement or human behavior from observed sequences. However, current methods often assume that the observed sequences are complete while ignoring the potential for missing values caused by object occlusion, scope limitation, sensor failure, etc. This limitation inevitably hinders the accuracy of trajectory prediction. To addres…

    Submitted 28 March, 2023; originally announced March 2023.

    Comments: Accepted by CVPR 2023, Supplementary Material at https://github.com/colorfulfuture/GC-VRNN

  39. arXiv:2302.11188

    cs.LG

    What Are Effective Labels for Augmented Data? Improving Calibration and Robustness with AutoLabel

    Authors: Yao Qin, Xuezhi Wang, Balaji Lakshminarayanan, Ed H. Chi, Alex Beutel

    Abstract: A wide breadth of research has devised data augmentation approaches that can improve both accuracy and generalization performance for neural networks. However, augmented data can end up being far from the clean training data and what is the appropriate label is less clear. Despite this, most existing work simply uses one-hot labels for augmented data. In this paper, we show re-using one-hot labels…

    Submitted 22 February, 2023; originally announced February 2023.

    Comments: Accepted to SaTML-2023

  40. Improving Training Stability for Multitask Ranking Models in Recommender Systems

    Authors: Jiaxi Tang, Yoel Drori, Daryl Chang, Maheswaran Sathiamoorthy, Justin Gilmer, Li Wei, Xinyang Yi, Lichan Hong, Ed H. Chi

    Abstract: Recommender systems play an important role in many content platforms. While most recommendation research is dedicated to designing better models to improve user experience, we found that research on stabilizing the training for such models is severely under-explored. As recommendation models become larger and more sophisticated, they are more susceptible to training instability issues, i.e., loss…

    Submitted 15 June, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

    Comments: Accepted at KDD 2023; 12 pages

  41. arXiv:2301.03281

    eess.IV cs.CV

    The state-of-the-art 3D anisotropic intracranial hemorrhage segmentation on non-contrast head CT: The INSTANCE challenge

    Authors: Xiangyu Li, Gongning Luo, Kuanquan Wang, Hongyu Wang, Jun Liu, Xinjie Liang, Jie Jiang, Zhenghao Song, Chunyue Zheng, Haokai Chi, Mingwang Xu, Yingte He, Xinghua Ma, Jingwen Guo, Yifan Liu, Chuanpu Li, Zeli Chen, Md Mahfuzur Rahman Siddiquee, Andriy Myronenko, Antoine P. Sanner, Anirban Mukhopadhyay, Ahmed E. Othman, Xingyu Zhao, Weiping Liu, Jinhuang Zhang , et al. (9 additional authors not shown)

    Abstract: Automatic intracranial hemorrhage segmentation in 3D non-contrast head CT (NCCT) scans is significant in clinical practice. Existing hemorrhage segmentation methods usually ignore the anisotropic nature of the NCCT, and are evaluated on different in-house datasets with distinct metrics, making it highly challenging to improve segmentation performance and perform objective comparisons among differ…

    Submitted 12 January, 2023; v1 submitted 9 January, 2023; originally announced January 2023.

    Comments: Summarized paper for the MICCAI INSTANCE 2022 Challenge

  42. Latent User Intent Modeling for Sequential Recommenders

    Authors: Bo Chang, Alexandros Karatzoglou, Yuyan Wang, Can Xu, Ed H. Chi, Minmin Chen

    Abstract: Sequential recommender models are essential components of modern industrial recommender systems. These models learn to predict the next items a user is likely to interact with based on his/her interaction history on the platform. Most sequential recommenders however lack a higher-level understanding of user intents, which often drive user behaviors online. Intent modeling is thus critical for unde…

    Submitted 27 March, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

    Comments: The Web Conference 2023, Industry Track

  43. arXiv:2210.14309  [pdf, other]

    cs.IR

    Empowering Long-tail Item Recommendation through Cross Decoupling Network (CDN)

    Authors: Yin Zhang, Ruoxi Wang, Tiansheng Yao, Xinyang Yi, Lichan Hong, James Caverlee, Ed H. Chi, Derek Zhiyuan Cheng

    Abstract: Industry recommender systems usually suffer from highly-skewed long-tail item distributions where a small fraction of the items receives most of the user feedback. This skew hurts recommender quality especially for the item slices without much user feedback. While there have been many research advances made in academia, deploying these methods in production is very difficult and very few improveme…

    Submitted 3 September, 2023; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: Accepted by KDD 2023 Applied Data Science (ADS) track

  44. BC-IoDT: Blockchain-based Framework for Authentication in Internet of Drone Things

    Authors: Junaid Akram, Awais Akram, Rutvij H. Jhaveri, Mamoun Alazab, Haoran Chi

    Abstract: We leverage blockchain technology for drone node authentication in internet of drone things (IoDT). During the authentication procedure, the credentials of drone nodes are examined to remove malicious nodes from the system. In IoDT, drones are responsible for gathering data and transmitting it to cluster heads (CHs) for further processing. The CH collects and organizes data. Due to computational l…

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: 6 pages, 3 figures, ACM MobiCom Workshop on Drone Assisted Wireless Communications for 5G and Beyond

    Journal ref: 2022 ACM MobiCom Workshop on Drone Assisted Wireless Communications for 5G and Beyond

  45. arXiv:2210.11416  [pdf, other]

    cs.LG cs.CL

    Scaling Instruction-Finetuned Language Models

    Authors: Hyung Won Chung, Le Hou, Shayne Longpre, Barret Zoph, Yi Tay, William Fedus, Yunxuan Li, Xuezhi Wang, Mostafa Dehghani, Siddhartha Brahma, Albert Webson, Shixiang Shane Gu, Zhuyun Dai, Mirac Suzgun, Xinyun Chen, Aakanksha Chowdhery, Alex Castro-Ros, Marie Pellat, Kevin Robinson, Dasha Valter, Sharan Narang, Gaurav Mishra, Adams Yu, Vincent Zhao, Yanping Huang, et al. (10 additional authors not shown)

    Abstract: Finetuning language models on a collection of datasets phrased as instructions has been shown to improve model performance and generalization to unseen tasks. In this paper we explore instruction finetuning with a particular focus on (1) scaling the number of tasks, (2) scaling the model size, and (3) finetuning on chain-of-thought data. We find that instruction finetuning with the above aspects d…

    Submitted 6 December, 2022; v1 submitted 20 October, 2022; originally announced October 2022.

    Comments: Public checkpoints: https://huggingface.co/docs/transformers/model_doc/flan-t5

  46. arXiv:2210.09261  [pdf, other]

    cs.CL cs.AI

    Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them

    Authors: Mirac Suzgun, Nathan Scales, Nathanael Schärli, Sebastian Gehrmann, Yi Tay, Hyung Won Chung, Aakanksha Chowdhery, Quoc V. Le, Ed H. Chi, Denny Zhou, Jason Wei

    Abstract: BIG-Bench (Srivastava et al., 2022) is a diverse evaluation suite that focuses on tasks believed to be beyond the capabilities of current language models. Language models have already made good progress on this benchmark, with the best model in the BIG-Bench paper outperforming average reported human-rater results on 65% of the BIG-Bench tasks via few-shot prompting. But on what tasks do language…

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: GitHub repository: https://github.com/suzgunmirac/BIG-Bench-Hard

  47. arXiv:2210.07755  [pdf, other]

    cs.IR cs.AI cs.LG

    Simpson's Paradox in Recommender Fairness: Reconciling differences between per-user and aggregated evaluations

    Authors: Flavien Prost, Ben Packer, Jilin Chen, Li Wei, Pierre Kremp, Nicholas Blumm, Susan Wang, Tulsee Doshi, Tonia Osadebe, Lukasz Heldt, Ed H. Chi, Alex Beutel

    Abstract: There has been a flurry of research in recent years on notions of fairness in ranking and recommender systems, particularly on how to evaluate if a recommender allocates exposure equally across groups of relevant items (also known as provider fairness). While this research has laid an important foundation, it gave rise to different approaches depending on whether relevant items are compared per-us…

    Submitted 14 October, 2022; originally announced October 2022.

  48. arXiv:2210.03428  [pdf, other]

    cs.CV

    Missing Modality meets Meta Sampling (M3S): An Efficient Universal Approach for Multimodal Sentiment Analysis with Missing Modality

    Authors: Haozhe Chi, Minghua Yang, Junhao Zhu, Guanhong Wang, Gaoang Wang

    Abstract: Multimodal sentiment analysis (MSA) is an important way of observing mental activities with the help of data captured from multiple modalities. However, due to the recording or transmission error, some modalities may include incomplete data. Most existing works that address missing modalities usually assume a particular modality is completely missing and seldom consider a mixture of missing across…

    Submitted 7 October, 2022; originally announced October 2022.

  49. arXiv:2209.15166  [pdf, other]

    cs.IR cs.AI cs.LG

    Reward Shaping for User Satisfaction in a REINFORCE Recommender

    Authors: Konstantina Christakopoulou, Can Xu, Sai Zhang, Sriraj Badam, Trevor Potter, Daniel Li, Hao Wan, Xinyang Yi, Ya Le, Chris Berg, Eric Bencomo Dixon, Ed H. Chi, Minmin Chen

    Abstract: How might we design Reinforcement Learning (RL)-based recommenders that encourage aligning user trajectories with the underlying user satisfaction? Three research questions are key: (1) measuring user satisfaction, (2) combatting sparsity of satisfaction signals, and (3) adapting the training of the recommender agent to maximize satisfaction. For measurement, it has been found that surveys explici…

    Submitted 29 September, 2022; originally announced September 2022.

    Comments: Accepted in Reinforcement Learning for Real Life (RL4RealLife) Workshop in the 38th International Conference on Machine Learning, 2021

  50. arXiv:2208.08149  [pdf, ps, other]

    cs.AI

    A Concept and Argumentation based Interpretable Model in High Risk Domains

    Authors: Haixiao Chi, Dawei Wang, Gaojie Cui, Feng Mao, Beishui Liao

    Abstract: Interpretability has become an essential topic for artificial intelligence in some high-risk domains such as healthcare, bank and security. For commonly-used tabular data, traditional methods trained end-to-end machine learning models with numerical and categorical data only, and did not leverage human understandable knowledge such as data descriptions. Yet mining human-level knowledge from tabula…

    Submitted 17 August, 2022; originally announced August 2022.