Skip to main content

Showing 1–7 of 7 results for author: Tajwar, F

  1. arXiv:2404.14367  [pdf, other

    cs.LG

    Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data

    Authors: Fahim Tajwar, Anikait Singh, Archit Sharma, Rafael Rafailov, Jeff Schneider, Tengyang Xie, Stefano Ermon, Chelsea Finn, Aviral Kumar

    Abstract: Learning from preference labels plays a crucial role in fine-tuning large language models. There are several distinct approaches for preference fine-tuning, including supervised learning, on-policy reinforcement learning (RL), and contrastive learning. Different methods come with different implementation tradeoffs and performance differences, and existing empirical findings present different concl… ▽ More

    Submitted 2 June, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: International Conference on Machine Learning (ICML), 2024

  2. arXiv:2310.08558  [pdf, other

    cs.LG cs.AI cs.RO

    Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias

    Authors: Max Sobol Mark, Archit Sharma, Fahim Tajwar, Rafael Rafailov, Sergey Levine, Chelsea Finn

    Abstract: It is desirable for policies to optimistically explore new states and behaviors during online reinforcement learning (RL) or fine-tuning, especially when prior offline data does not provide enough state coverage. However, exploration bonuses can bias the learned policy, and our experiments find that naive, yet standard use of such bonuses can fail to recover a performant policy. Concurrently, pess… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

  3. arXiv:2306.04974  [pdf, other

    cs.LG cs.AI

    Conservative Prediction via Data-Driven Confidence Minimization

    Authors: Caroline Choi, Fahim Tajwar, Yoonho Lee, Huaxiu Yao, Ananya Kumar, Chelsea Finn

    Abstract: In safety-critical applications of machine learning, it is often desirable for a model to be conservative, abstaining from making predictions on unknown inputs which are not well-represented in the training data. However, detecting unknown examples is challenging, as it is impossible to anticipate all potential inputs at test time. To address this, prior work (Hendrycks et al., 2018) minimizes mod… ▽ More

    Submitted 3 June, 2024; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: Transactions on Machine Learning Research (TMLR), 2024

  4. arXiv:2210.11466  [pdf, other

    cs.LG cs.AI

    Surgical Fine-Tuning Improves Adaptation to Distribution Shifts

    Authors: Yoonho Lee, Annie S. Chen, Fahim Tajwar, Ananya Kumar, Huaxiu Yao, Percy Liang, Chelsea Finn

    Abstract: A common approach to transfer learning under distribution shift is to fine-tune the last few layers of a pre-trained model, preserving learned features while also adapting to the new task. This paper shows that in such settings, selectively fine-tuning a subset of layers (which we term surgical fine-tuning) matches or outperforms commonly used fine-tuning approaches. Moreover, the type of distribu… ▽ More

    Submitted 6 June, 2023; v1 submitted 20 October, 2022; originally announced October 2022.

    Comments: ICLR 2023

  5. arXiv:2210.10765  [pdf, other

    cs.LG

    When to Ask for Help: Proactive Interventions in Autonomous Reinforcement Learning

    Authors: Annie Xie, Fahim Tajwar, Archit Sharma, Chelsea Finn

    Abstract: A long-term goal of reinforcement learning is to design agents that can autonomously interact and learn in the world. A critical challenge to such autonomy is the presence of irreversible states which require external assistance to recover from, such as when a robot arm has pushed an object off of a table. While standard agents require constant monitoring to decide when to intervene, we aim to des… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

  6. arXiv:2203.09739  [pdf, other

    cs.CV cs.LG

    Do Deep Networks Transfer Invariances Across Classes?

    Authors: Allan Zhou, Fahim Tajwar, Alexander Robey, Tom Knowles, George J. Pappas, Hamed Hassani, Chelsea Finn

    Abstract: To generalize well, classifiers must learn to be invariant to nuisance transformations that do not alter an input's class. Many problems have "class-agnostic" nuisance transformations that apply similarly to all classes, such as lighting and background changes for image classification. Neural networks can learn these invariances given sufficient data, but many real-world datasets are heavily class… ▽ More

    Submitted 18 March, 2022; originally announced March 2022.

  7. arXiv:2109.05554  [pdf, other

    cs.LG

    No True State-of-the-Art? OOD Detection Methods are Inconsistent across Datasets

    Authors: Fahim Tajwar, Ananya Kumar, Sang Michael Xie, Percy Liang

    Abstract: Out-of-distribution detection is an important component of reliable ML systems. Prior literature has proposed various methods (e.g., MSP (Hendrycks & Gimpel, 2017), ODIN (Liang et al., 2018), Mahalanobis (Lee et al., 2018)), claiming they are state-of-the-art by showing they outperform previous methods on a selected set of in-distribution (ID) and out-of-distribution (OOD) datasets. In this work,… ▽ More

    Submitted 12 September, 2021; originally announced September 2021.

    Comments: ICML Workshop on Uncertainty & Robustness in Deep Learning, 2021