Skip to main content

Showing 1–9 of 9 results for author: Cohen-Karlik, E

  1. arXiv:2402.07875  [pdf, other

    cs.LG cs.AI eess.SY stat.ML

    Implicit Bias of Policy Gradient in Linear Quadratic Control: Extrapolation to Unseen Initial States

    Authors: Noam Razin, Yotam Alexander, Edo Cohen-Karlik, Raja Giryes, Amir Globerson, Nadav Cohen

    Abstract: In modern machine learning, models can often fit training data in numerous ways, some of which perform well on unseen (test) data, while others do not. Remarkably, in such cases gradient descent frequently exhibits an implicit bias that leads to excellent performance on unseen data. This implicit bias was extensively studied in supervised learning, but is far less understood in optimal control (re… ▽ More

    Submitted 1 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: Accepted to ICML 2024

  2. arXiv:2402.03387  [pdf, other

    cs.SI cs.LG

    Overcoming Order in Autoregressive Graph Generation

    Authors: Edo Cohen-Karlik, Eyal Rozenberg, Daniel Freedman

    Abstract: Graph generation is a fundamental problem in various domains, including chemistry and social networks. Recent work has shown that molecular graph generation using recurrent neural networks (RNNs) is advantageous compared to traditional generative approaches which require converting continuous latent representations into graphs. One issue which arises when treating graph generation as sequential ge… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: 16 pages, 3 figures

  3. arXiv:2210.14064  [pdf, other

    cs.LG

    Learning Low Dimensional State Spaces with Overparameterized Recurrent Neural Nets

    Authors: Edo Cohen-Karlik, Itamar Menuhin-Gruman, Raja Giryes, Nadav Cohen, Amir Globerson

    Abstract: Overparameterization in deep learning typically refers to settings where a trained neural network (NN) has representational capacity to fit the training data in many ways, some of which generalize well, while others do not. In the case of Recurrent Neural Networks (RNNs), there exists an additional layer of overparameterization, in the sense that a model may exhibit many solutions that generalize… ▽ More

    Submitted 23 March, 2023; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: Accepted to ICLR 2023, 9 pages, 2 figures plus supplementary

  4. arXiv:2202.04302  [pdf, other

    cs.LG

    On the Implicit Bias of Gradient Descent for Temporal Extrapolation

    Authors: Edo Cohen-Karlik, Avichai Ben David, Nadav Cohen, Amir Globerson

    Abstract: When using recurrent neural networks (RNNs) it is common practice to apply trained models to sequences longer than those seen in training. This "extrapolating" usage deviates from the traditional statistical learning setup where guarantees are provided under the assumption that train and test distributions are identical. Here we set out to understand when RNNs can extrapolate, focusing on a simple… ▽ More

    Submitted 24 March, 2022; v1 submitted 9 February, 2022; originally announced February 2022.

    Comments: 8 pages, 5 figures (plus appendix), AISTATS2022

  5. arXiv:2010.13055  [pdf, other

    cs.LG stat.ML

    Regularizing Towards Permutation Invariance in Recurrent Models

    Authors: Edo Cohen-Karlik, Avichai Ben David, Amir Globerson

    Abstract: In many machine learning problems the output should not depend on the order of the input. Such "permutation invariant" functions have been studied extensively recently. Here we argue that temporal architectures such as RNNs are highly relevant for such problems, despite the inherent dependence of RNNs on order. We show that RNNs can be regularized towards permutation invariance, and that this can… ▽ More

    Submitted 25 October, 2020; originally announced October 2020.

    Comments: 9 pages, 5 figures, NeurIPS 2020

  6. arXiv:2010.06185  [pdf, other

    cs.CL

    The workweek is the best time to start a family -- A Study of GPT-2 Based Claim Generation

    Authors: Shai Gretz, Yonatan Bilu, Edo Cohen-Karlik, Noam Slonim

    Abstract: Argument generation is a challenging task whose research is timely considering its potential impact on social media and the dissemination of information. Here we suggest a pipeline based on GPT-2 for generating coherent claims, and explore the types of claims that it produces, and their veracity, using an array of manual and automatic assessments. In addition, we explore the interplay between this… ▽ More

    Submitted 13 October, 2020; originally announced October 2020.

    Comments: Accepted to Findings of EMNLP 2020

  7. arXiv:1911.11408  [pdf, other

    cs.CL

    A Large-scale Dataset for Argument Quality Ranking: Construction and Analysis

    Authors: Shai Gretz, Roni Friedman, Edo Cohen-Karlik, Assaf Toledo, Dan Lahav, Ranit Aharonov, Noam Slonim

    Abstract: Identifying the quality of free-text arguments has become an important task in the rapidly expanding field of computational argumentation. In this work, we explore the challenging task of argument quality ranking. To this end, we created a corpus of 30,497 arguments carefully annotated for point-wise quality, released as part of this work. To the best of our knowledge, this is the largest dataset… ▽ More

    Submitted 26 November, 2019; originally announced November 2019.

    Comments: Accepted to AAAI 2020

  8. arXiv:1909.01007  [pdf, other

    cs.CL

    Automatic Argument Quality Assessment -- New Datasets and Methods

    Authors: Assaf Toledo, Shai Gretz, Edo Cohen-Karlik, Roni Friedman, Elad Venezian, Dan Lahav, Michal Jacovi, Ranit Aharonov, Noam Slonim

    Abstract: We explore the task of automatic assessment of argument quality. To that end, we actively collected 6.3k arguments, more than a factor of five compared to previously examined data. Each argument was explicitly and carefully annotated for its quality. In addition, 14k pairs of arguments were annotated independently, identifying the higher quality argument in each pair. In spite of the inherent subj… ▽ More

    Submitted 3 September, 2019; originally announced September 2019.

    Comments: Published at EMNLP 2019

  9. arXiv:1906.03897  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Learning to combine Grammatical Error Corrections

    Authors: Yoav Kantor, Yoav Katz, Leshem Choshen, Edo Cohen-Karlik, Naftali Liberman, Assaf Toledo, Amir Menczel, Noam Slonim

    Abstract: The field of Grammatical Error Correction (GEC) has produced various systems to deal with focused phenomena or general text editing. We propose an automatic way to combine black-box systems. Our method automatically detects the strength of a system or the combination of several systems per error type, improving precision and recall while optimizing $F$ score directly. We show consistent improvemen… ▽ More

    Submitted 10 June, 2019; originally announced June 2019.

    Comments: BEA 2019