Showing 1–11 of 11 results for author: Feuer, B

  1. arXiv:2406.19314  [pdf, other]

    cs.CL cs.AI cs.LG

    LiveBench: A Challenging, Contamination-Free LLM Benchmark

    Authors: Colin White, Samuel Dooley, Manley Roberts, Arka Pal, Ben Feuer, Siddhartha Jain, Ravid Shwartz-Ziv, Neel Jain, Khalid Saifullah, Siddartha Naidu, Chinmay Hegde, Yann LeCun, Tom Goldstein, Willie Neiswanger, Micah Goldblum

    Abstract: Test set contamination, wherein test data from a benchmark ends up in a newer model's training set, is a well-documented obstacle for fair LLM evaluation and can quickly render benchmarks obsolete. To mitigate this, many recent benchmarks crowdsource new prompts and evaluations from human or LLM judges; however, these can introduce significant biases, and break down when scoring hard questions. In…

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2406.17720  [pdf, other]

    cs.CV

    Arboretum: A Large Multimodal Dataset Enabling AI for Biodiversity

    Authors: Chih-Hsuan Yang, Benjamin Feuer, Zaki Jubery, Zi K. Deng, Andre Nakkab, Md Zahid Hasan, Shivani Chiranjeevi, Kelly Marshall, Nirmal Baishnab, Asheesh K Singh, Arti Singh, Soumik Sarkar, Nirav Merchant, Chinmay Hegde, Baskar Ganapathysubramanian

    Abstract: We introduce Arboretum, the largest publicly accessible dataset designed to advance AI for biodiversity applications. This dataset, curated from the iNaturalist community science platform and vetted by domain experts to ensure accuracy, includes 134.6 million images, surpassing existing datasets in scale by an order of magnitude. The dataset encompasses image-language paired data for a diverse set…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Preprint under review

  3. arXiv:2402.11137  [pdf, other]

    cs.LG

    TuneTables: Context Optimization for Scalable Prior-Data Fitted Networks

    Authors: Benjamin Feuer, Robin Tibor Schirrmeister, Valeriia Cherepanova, Chinmay Hegde, Frank Hutter, Micah Goldblum, Niv Cohen, Colin White

    Abstract: While tabular classification has traditionally relied on from-scratch training, a recent breakthrough called prior-data fitted networks (PFNs) challenges this approach. Similar to large language models, PFNs make use of pretraining and in-context learning to achieve strong performance on new tasks in a single forward pass. However, current PFNs have limitations that prohibit their widespread adopt…

    Submitted 18 March, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

  4. arXiv:2311.10609  [pdf, other]

    cs.LG cs.DB

    Scaling TabPFN: Sketching and Feature Selection for Tabular Prior-Data Fitted Networks

    Authors: Benjamin Feuer, Chinmay Hegde, Niv Cohen

    Abstract: Tabular classification has traditionally relied on supervised algorithms, which estimate the parameters of a prediction model using its training data. Recently, Prior-Data Fitted Networks (PFNs) such as TabPFN have successfully learned to classify tabular data in-context: the model parameters are designed to classify new samples based on labelled training samples given after the model training. Wh…

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: 2nd Table Representation Learning Workshop: 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  5. arXiv:2311.04016  [pdf, ps, other]

    cs.CV cs.LG

    Exploring Dataset-Scale Indicators of Data Quality

    Authors: Benjamin Feuer, Chinmay Hegde

    Abstract: Modern computer vision foundation models are trained on massive amounts of data, incurring large economic and environmental costs. Recent research has suggested that improving data quality can significantly reduce the need for data quantity. But what constitutes data quality in computer vision? We posit that the quality of a given dataset can be decomposed into distinct sample-level and dataset-le…

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: 1st Workshop on Attributing Model Behavior at Scale: 37th Conference on Neural Information Processing Systems (NeurIPS 2023). 7 pages, 1 figure

  6. arXiv:2310.18208  [pdf, other]

    cs.CL cs.LG

    ArcheType: A Novel Framework for Open-Source Column Type Annotation using Large Language Models

    Authors: Benjamin Feuer, Yurong Liu, Chinmay Hegde, Juliana Freire

    Abstract: Existing deep-learning approaches to semantic column type annotation (CTA) have important shortcomings: they rely on semantic types which are fixed at training time; require a large number of training samples per type and incur large run-time inference costs; and their performance can degrade when evaluated on novel datasets, even when types remain constant. Large language models have exhibited st…

    Submitted 6 November, 2023; v1 submitted 27 October, 2023; originally announced October 2023.

    Comments: 17 pages, 8 figures

  7. arXiv:2308.03821  [pdf, other]

    cs.CV cs.LG

    Distributionally Robust Classification on a Data Budget

    Authors: Benjamin Feuer, Ameya Joshi, Minh Pham, Chinmay Hegde

    Abstract: Real world uses of deep learning require predictable model behavior under distribution shifts. Models such as CLIP show emergent natural distributional robustness comparable to humans, but may require hundreds of millions of training samples. Can we train robust learners in a domain where data is limited? To rigorously address this question, we introduce JANuS (Joint Annotations and Names Set), a…

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: TMLR 2023; openreview link: https://openreview.net/forum?id=D5Z2E8CNsD

  8. arXiv:2305.02997  [pdf, other]

    cs.LG cs.AI stat.ML

    When Do Neural Nets Outperform Boosted Trees on Tabular Data?

    Authors: Duncan McElfresh, Sujay Khandagale, Jonathan Valverde, Vishak Prasad C, Benjamin Feuer, Chinmay Hegde, Ganesh Ramakrishnan, Micah Goldblum, Colin White

    Abstract: Tabular data is one of the most commonly used types of data in machine learning. Despite recent advances in neural nets (NNs) for tabular data, there is still an active discussion on whether or not NNs generally outperform gradient-boosted decision trees (GBDTs) on tabular data, with several recent works arguing either that GBDTs consistently outperform NNs on tabular data, or vice versa. In this…

    Submitted 15 July, 2024; v1 submitted 4 May, 2023; originally announced May 2023.

    Comments: NeurIPS Datasets and Benchmarks Track 2023

  9. arXiv:2302.10281  [pdf, other]

    cs.CV cs.AI cs.CL

    LiT Tuned Models for Efficient Species Detection

    Authors: Andre Nakkab, Benjamin Feuer, Chinmay Hegde

    Abstract: Recent advances in training vision-language models have demonstrated unprecedented robustness and transfer learning effectiveness; however, standard computer vision datasets are image-only, and therefore not well adapted to such training methods. Our paper introduces a simple methodology for adapting any fine-grained image classification dataset for distributed vision-language pretraining. We impl…

    Submitted 12 February, 2023; originally announced February 2023.

    Comments: 5 pages, 5 figures, 1 table, presented at AAAI 2023 conference for the AIAFS workshop

  10. arXiv:2210.07396  [pdf, other]

    cs.CV

    Caption supervision enables robust learners

    Authors: Benjamin Feuer, Ameya Joshi, Chinmay Hegde

    Abstract: Vision language (VL) models like CLIP are robust to natural distribution shifts, in part because CLIP learns on unstructured data using a technique called caption supervision; the model interprets image-linked texts as ground-truth labels. In a carefully controlled comparison study, we show that caption-supervised CNNs trained on a standard cross-entropy loss (with image labels assigned by scanning…

    Submitted 8 December, 2022; v1 submitted 13 October, 2022; originally announced October 2022.

    ACM Class: I.4.9

  11. arXiv:2206.07565  [pdf, other]

    cs.CV cs.LG

    A Meta-Analysis of Distributionally-Robust Models

    Authors: Benjamin Feuer, Ameya Joshi, Chinmay Hegde

    Abstract: State-of-the-art image classifiers trained on massive datasets (such as ImageNet) have been shown to be vulnerable to a range of both intentional and incidental distribution shifts. On the other hand, several recent classifiers with favorable out-of-distribution (OOD) robustness properties have emerged, achieving high accuracy on their target tasks while maintaining their in-distribution accuracy…

    Submitted 15 June, 2022; originally announced June 2022.

    Comments: To be presented at ICML Workshop on Principles of Distribution Shift 2022. Copyright 2022 by the author(s)