Skip to main content

Showing 1–18 of 18 results for author: Ben-David, E

  1. arXiv:2406.03618  [pdf, other

    cs.CL

    TACT: Advancing Complex Aggregative Reasoning with Information Extraction Tools

    Authors: Avi Caciularu, Alon Jacovi, Eyal Ben-David, Sasha Goldshtein, Tal Schuster, Jonathan Herzig, Gal Elidan, Amir Globerson

    Abstract: Large Language Models (LLMs) often do not perform well on queries that require the aggregation of information across texts. To better evaluate this setting and facilitate modeling efforts, we introduce TACT - Text And Calculations through Tables, a dataset crafted to evaluate LLMs' reasoning and computational abilities using complex instructions. TACT contains challenging instructions that demand… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Website (https://tact-benchmark.github.io), Huggingface (https://huggingface.co/datasets/google/TACT)

  2. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  3. arXiv:2310.01929  [pdf, other

    cs.CL cs.AI cs.LG

    Navigating Cultural Chasms: Exploring and Unlocking the Cultural POV of Text-To-Image Models

    Authors: Mor Ventura, Eyal Ben-David, Anna Korhonen, Roi Reichart

    Abstract: Text-To-Image (TTI) models, such as DALL-E and StableDiffusion, have demonstrated remarkable prompt-based image generation capabilities. Multilingual encoders may have a substantial impact on the cultural agency of these models, as language is a conduit of culture. In this study, we explore the cultural perception embedded in TTI models by characterizing culture across three hierarchical tiers: cu… ▽ More

    Submitted 29 November, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

  4. arXiv:2306.00168  [pdf, other

    cs.CL

    Measuring the Robustness of NLP Models to Domain Shifts

    Authors: Nitay Calderon, Naveh Porat, Eyal Ben-David, Alexander Chapanin, Zorik Gekhman, Nadav Oved, Vitaly Shalumov, Roi Reichart

    Abstract: Existing research on Domain Robustness (DR) suffers from disparate setups, limited task variety, and scarce research on recent capabilities such as in-context learning. Furthermore, the common practice of measuring DR might not be fully accurate. Current research focuses on challenge sets and relies solely on the Source Drop (SD): Using the source in-domain performance as a reference point for deg… ▽ More

    Submitted 20 April, 2024; v1 submitted 31 May, 2023; originally announced June 2023.

  5. arXiv:2210.15182  [pdf, other

    cs.CV cs.LG

    Text2Model: Text-based Model Induction for Zero-shot Image Classification

    Authors: Ohad Amosy, Tomer Volk, Eilam Shapira, Eyal Ben-David, Roi Reichart, Gal Chechik

    Abstract: We address the challenge of building task-agnostic classifiers using only text descriptions, demonstrating a unified approach to image classification, 3D point cloud classification, and action recognition from scenes. Unlike approaches that learn a fixed representation of the output classes, we generate at inference time a model tailored to a query classification task. To generate task-based zero-… ▽ More

    Submitted 9 March, 2024; v1 submitted 27 October, 2022; originally announced October 2022.

  6. arXiv:2209.00830  [pdf, other

    cs.CL cs.AI cs.LG

    Domain Adaptation from Scratch

    Authors: Eyal Ben-David, Yftah Ziser, Roi Reichart

    Abstract: Natural language processing (NLP) algorithms are rapidly improving but often struggle when applied to out-of-distribution examples. A prominent approach to mitigate the domain gap is domain adaptation, where a model trained on a source domain is adapted to a new target domain. We present a new learning setup, ``domain adaptation from scratch'', which we believe to be crucial for extending the reac… ▽ More

    Submitted 2 September, 2022; originally announced September 2022.

  7. arXiv:2206.00259  [pdf, other

    cs.CL cs.AI cs.LG

    IDANI: Inference-time Domain Adaptation via Neuron-level Interventions

    Authors: Omer Antverg, Eyal Ben-David, Yonatan Belinkov

    Abstract: Large pre-trained models are usually fine-tuned on downstream task data, and tested on unseen data. When the train and test data come from different domains, the model is likely to struggle, as it is not adapted to the test domain. We propose a new approach for domain adaptation (DA), using neuron-level interventions: We modify the representation of each test example in specific neurons, resulting… ▽ More

    Submitted 1 June, 2022; originally announced June 2022.

    Comments: Our code is available at https://github.com/technion-cs-nlp/idani

  8. arXiv:2203.14276  [pdf, other

    cs.CL cs.AI cs.LG

    Example-based Hypernetworks for Out-of-Distribution Generalization

    Authors: Tomer Volk, Eyal Ben-David, Ohad Amosy, Gal Chechik, Roi Reichart

    Abstract: As Natural Language Processing (NLP) algorithms continually achieve new milestones, out-of-distribution generalization remains a significant challenge. This paper addresses the issue of multi-source adaptation for unfamiliar domains: We leverage labeled data from multiple source domains to generalize to unknown target domains at training. Our innovative framework employs example-based Hypernetwork… ▽ More

    Submitted 18 October, 2023; v1 submitted 27 March, 2022; originally announced March 2022.

    Comments: First two authors contributed equally to this work. Our code and data are available at: https://github.com/TomerVolk/Hyper-PADA

  9. arXiv:2202.12350  [pdf, other

    cs.CL cs.AI

    DoCoGen: Domain Counterfactual Generation for Low Resource Domain Adaptation

    Authors: Nitay Calderon, Eyal Ben-David, Amir Feder, Roi Reichart

    Abstract: Natural language processing (NLP) algorithms have become very successful, but they still struggle when applied to out-of-distribution examples. In this paper we propose a controllable generation approach in order to deal with this domain adaptation (DA) challenge. Given an input text example, our DoCoGen algorithm generates a domain-counterfactual textual example (D-con) - that is similar to the o… ▽ More

    Submitted 5 March, 2022; v1 submitted 24 February, 2022; originally announced February 2022.

    Comments: Our code and data are available at https://github.com/nitaytech/DoCoGen

    ACM Class: I.2.7

  10. arXiv:2111.01767  [pdf, other

    stat.ML cs.LG stat.ME

    Regularization for Shuffled Data Problems via Exponential Family Priors on the Permutation Group

    Authors: Zhenbang Wang, Emanuel Ben-David, Martin Slawski

    Abstract: In the analysis of data sets consisting of (X, Y)-pairs, a tacit assumption is that each pair corresponds to the same observation unit. If, however, such pairs are obtained via record linkage of two files, this assumption can be violated as a result of mismatch error rooting, for example, in the lack of reliable identifiers in the two files. Recently, there has been a surge of interest in this set… ▽ More

    Submitted 2 November, 2021; originally announced November 2021.

    Comments: 25 pages, 5 figures

  11. arXiv:2110.12412  [pdf, other

    cs.CL cs.AI cs.LG

    Improved Goal Oriented Dialogue via Utterance Generation and Look Ahead

    Authors: Eyal Ben-David, Boaz Carmeli, Ateret Anaby-Tavor

    Abstract: Goal oriented dialogue systems have become a prominent customer-care interaction channel for most businesses. However, not all interactions are smooth, and customer intent misunderstanding is a major cause of dialogue failure. We show that intent prediction can be improved by training a deep text-to-text neural model to generate successive user utterances from unlabeled dialogue data. For that, we… ▽ More

    Submitted 24 October, 2021; originally announced October 2021.

  12. arXiv:2108.11328  [pdf, other

    stat.ML cs.LG stat.AP stat.CO

    Predicting Census Survey Response Rates With Parsimonious Additive Models and Structured Interactions

    Authors: Shibal Ibrahim, Peter Radchenko, Emanuel Ben-David, Rahul Mazumder

    Abstract: In this paper we consider the problem of predicting survey response rates using a family of flexible and interpretable nonparametric models. The study is motivated by the US Census Bureau's well-known ROAM application which uses a linear regression model trained on the US Census Planning Database data to identify hard-to-survey areas. A crowdsourcing competition (Erdman and Bates, 2016) organized… ▽ More

    Submitted 7 December, 2023; v1 submitted 24 August, 2021; originally announced August 2021.

    Comments: 40 pages, 7 figures

  13. arXiv:2102.12206  [pdf, other

    cs.CL cs.AI cs.LG

    PADA: Example-based Prompt Learning for on-the-fly Adaptation to Unseen Domains

    Authors: Eyal Ben-David, Nadav Oved, Roi Reichart

    Abstract: Natural Language Processing algorithms have made incredible progress, but they still struggle when applied to out-of-distribution examples. We address a challenging and underexplored version of this domain adaptation problem, where an algorithm is trained on several source domains, and then applied to examples from unseen domains that are unknown at training time. Particularly, no examples, labele… ▽ More

    Submitted 27 January, 2022; v1 submitted 24 February, 2021; originally announced February 2021.

    Comments: Accepted for publication at TACL in January 2022. First two authors contributed equally to this work. Our code and data are available at: https://github.com/eyalbd2/PADA

  14. arXiv:2010.02592  [pdf, other

    cs.CL cs.AI cs.LG

    Semantically Driven Sentence Fusion: Modeling and Evaluation

    Authors: Eyal Ben-David, Orgad Keller, Eric Malmi, Idan Szpektor, Roi Reichart

    Abstract: Sentence fusion is the task of joining related sentences into coherent text. Current training and evaluation schemes for this task are based on single reference ground-truths and do not account for valid fusion variants. We show that this hinders models from robustly capturing the semantic relationship between input sentences. To alleviate this, we present an approach in which ground-truth solutio… ▽ More

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: This paper was accepted to Findings of EMNLP 2020

  15. arXiv:2006.09075  [pdf, other

    cs.CL cs.LG

    PERL: Pivot-based Domain Adaptation for Pre-trained Deep Contextualized Embedding Models

    Authors: Eyal Ben-David, Carmel Rabinovitz, Roi Reichart

    Abstract: Pivot-based neural representation models have lead to significant progress in domain adaptation for NLP. However, previous works that follow this approach utilize only labeled data from the source domain and unlabeled data from the source and target domains, but neglect to incorporate massive unlabeled corpora that are not necessarily drawn from these domains. To alleviate this, we propose PERL: A… ▽ More

    Submitted 16 June, 2020; originally announced June 2020.

    Comments: Accepted to TACL in June 2020

  16. arXiv:1910.01623  [pdf, other

    stat.ME cs.LG stat.ML

    A Pseudo-Likelihood Approach to Linear Regression with Partially Shuffled Data

    Authors: Martin Slawski, Guoqing Diao, Emanuel Ben-David

    Abstract: Recently, there has been significant interest in linear regression in the situation where predictors and responses are not observed in matching pairs corresponding to the same statistical unit as a consequence of separate data collection and uncertainty in data integration. Mismatched pairs can considerably impact the model fit and disrupt the estimation of regression parameters. In this paper, we… ▽ More

    Submitted 3 October, 2019; originally announced October 2019.

    Comments: 31 pages

  17. arXiv:1907.07148  [pdf, other

    stat.ML cs.IT cs.LG stat.ME

    A Two-Stage Approach to Multivariate Linear Regression with Sparsely Mismatched Data

    Authors: Martin Slawski, Emanuel Ben-David, Ping Li

    Abstract: A tacit assumption in linear regression is that (response, predictor)-pairs correspond to identical observational units. A series of recent works have studied scenarios in which this assumption is violated under terms such as ``Unlabeled Sensing and ``Regression with Unknown Permutation''. In this paper, we study the setup of multiple response variables and a notion of mismatches that generalizes… ▽ More

    Submitted 28 June, 2020; v1 submitted 16 July, 2019; originally announced July 2019.

  18. arXiv:1901.11252  [pdf

    cs.CV cs.LG physics.app-ph physics.optics

    Three-dimensional virtual refocusing of fluorescence microscopy images using deep learning

    Authors: Yichen Wu, Yair Rivenson, Hongda Wang, Yilin Luo, Eyal Ben-David, Laurent A. Bentolila, Christian Pritz, Aydogan Ozcan

    Abstract: Three-dimensional (3D) fluorescence microscopy in general requires axial scanning to capture images of a sample at different planes. Here we demonstrate that a deep convolutional neural network can be trained to virtually refocus a 2D fluorescence image onto user-defined 3D surfaces within the sample volume. With this data-driven computational microscopy framework, we imaged the neuron activity of… ▽ More

    Submitted 2 November, 2019; v1 submitted 31 January, 2019; originally announced January 2019.

    Comments: 47 pages, 5 figures (main text)

    Journal ref: Nature Methods (2019)