Skip to main content

Showing 51–79 of 79 results for author: Globerson, A

  1. arXiv:1902.10200  [pdf, other

    cs.CV

    Differentiable Scene Graphs

    Authors: Moshiko Raboh, Roei Herzig, Gal Chechik, Jonathan Berant, Amir Globerson

    Abstract: Reasoning about complex visual scenes involves perception of entities and their relations. Scene graphs provide a natural representation for reasoning tasks, by assigning labels to both entities (nodes) and relations (edges). Unfortunately, reasoning systems based on SGs are typically trained in a two-step procedure: First, training a model to predict SGs from images; Then, a separate model is cre… ▽ More

    Submitted 14 March, 2020; v1 submitted 26 February, 2019; originally announced February 2019.

    Comments: Winter Conference on Applications of Computer Vision (WACV), 2020

  2. arXiv:1902.09492  [pdf, other

    cs.CL cs.LG

    Cross-Lingual Alignment of Contextual Word Embeddings, with Applications to Zero-shot Dependency Parsing

    Authors: Tal Schuster, Ori Ram, Regina Barzilay, Amir Globerson

    Abstract: We introduce a novel method for multilingual transfer that utilizes deep contextual embeddings, pretrained in an unsupervised fashion. While contextual embeddings have been shown to yield richer representations of meaning compared to their static counterparts, aligning them poses a challenge due to their dynamic nature. To this end, we construct context-independent variants of the original monolin… ▽ More

    Submitted 3 April, 2019; v1 submitted 25 February, 2019; originally announced February 2019.

    Comments: NAACL 2019

  3. arXiv:1812.01233  [pdf, other

    cs.CV

    Spatio-Temporal Action Graph Networks

    Authors: Roei Herzig, Elad Levi, Huijuan Xu, Hang Gao, Eli Brosh, Xiaolong Wang, Amir Globerson, Trevor Darrell

    Abstract: Events defined by the interaction of objects in a scene are often of critical importance; yet important events may have insufficient labeled examples to train a conventional deep model to generalize to future object appearance. Activity recognition models that represent object interactions explicitly have the potential to learn in a more efficient manner than those that represent scenes with globa… ▽ More

    Submitted 29 September, 2019; v1 submitted 4 December, 2018; originally announced December 2018.

    Comments: IEEE/CVF International Conference on Computer Vision Workshop (ICCVW), 2019

  4. arXiv:1810.03037  [pdf, other

    cs.LG stat.ML

    Why do Larger Models Generalize Better? A Theoretical Perspective via the XOR Problem

    Authors: Alon Brutzkus, Amir Globerson

    Abstract: Empirical evidence suggests that neural networks with ReLU activations generalize better with over-parameterization. However, there is currently no theoretical analysis that explains this observation. In this work, we provide theoretical and empirical evidence that, in certain cases, overparameterized convolutional networks generalize better than small networks because of an interplay between weig… ▽ More

    Submitted 29 January, 2019; v1 submitted 6 October, 2018; originally announced October 2018.

  5. arXiv:1808.04614  [pdf, other

    cs.CL cs.AI

    Explaining Queries over Web Tables to Non-Experts

    Authors: Jonathan Berant, Daniel Deutch, Amir Globerson, Tova Milo, Tomer Wolfson

    Abstract: Designing a reliable natural language (NL) interface for querying tables has been a longtime goal of researchers in both the data management and natural language processing (NLP) communities. Such an interface receives as input an NL question, translates it into a formal query, executes the query and returns the results. Errors in the translation process are not uncommon, and users typically strug… ▽ More

    Submitted 14 August, 2018; originally announced August 2018.

    Comments: Short paper version to appear in ICDE 2019

  6. arXiv:1803.03155  [pdf, other

    cs.LG

    Learning Rules-First Classifiers

    Authors: Deborah Cohen, Amit Daniely, Amir Globerson, Gal Elidan

    Abstract: Complex classifiers may exhibit "embarassing" failures in cases where humans can easily provide a justified classification. Avoiding such failures is obviously of key importance. In this work, we focus on one such setting, where a label is perfectly predictable if the input contains certain features, or rules, and otherwise it is predictable by a linear classifier. We define a hypothesis class tha… ▽ More

    Submitted 13 June, 2019; v1 submitted 8 March, 2018; originally announced March 2018.

  7. arXiv:1802.05451  [pdf, other

    stat.ML cs.CV cs.LG

    Mapping Images to Scene Graphs with Permutation-Invariant Structured Prediction

    Authors: Roei Herzig, Moshiko Raboh, Gal Chechik, Jonathan Berant, Amir Globerson

    Abstract: Machine understanding of complex images is a key goal of artificial intelligence. One challenge underlying this task is that visual scenes contain multiple inter-related objects, and that global context plays an important role in interpreting the scene. A natural modeling framework for capturing such effects is structured prediction, which optimizes over complex labels, while modeling within-label… ▽ More

    Submitted 1 November, 2018; v1 submitted 15 February, 2018; originally announced February 2018.

    Comments: Paper is accepted for NIPS 2018 conference

  8. arXiv:1802.04721  [pdf, other

    cs.LG

    Predict and Constrain: Modeling Cardinality in Deep Structured Prediction

    Authors: Nataly Brukhim, Amir Globerson

    Abstract: Many machine learning problems require the prediction of multi-dimensional labels. Such structured prediction models can benefit from modeling dependencies between labels. Recently, several deep learning approaches to structured prediction have been proposed. Here we focus on capturing cardinality constraints in such models. Namely, constraining the number of non-zero labels that the model outputs… ▽ More

    Submitted 13 February, 2018; originally announced February 2018.

  9. arXiv:1711.05240  [pdf, other

    cs.CL cs.AI cs.LG

    Weakly-supervised Semantic Parsing with Abstract Examples

    Authors: Omer Goldman, Veronica Latcinnik, Udi Naveh, Amir Globerson, Jonathan Berant

    Abstract: Training semantic parsers from weak supervision (denotations) rather than strong supervision (programs) complicates training in two ways. First, a large search space of potential programs needs to be explored at training time to find a correct program. Second, spurious programs that accidentally lead to a correct denotation add noise to training. In this work we propose that in closed worlds with… ▽ More

    Submitted 13 March, 2019; v1 submitted 14 November, 2017; originally announced November 2017.

    Comments: CNLVR,NLVR. Accepted to ACL 2018

  10. arXiv:1710.10174  [pdf, other

    cs.LG

    SGD Learns Over-parameterized Networks that Provably Generalize on Linearly Separable Data

    Authors: Alon Brutzkus, Amir Globerson, Eran Malach, Shai Shalev-Shwartz

    Abstract: Neural networks exhibit good generalization behavior in the over-parameterized regime, where the number of network parameters exceeds the number of observations. Nonetheless, current generalization bounds for neural networks fail to explain this phenomenon. In an attempt to bridge this gap, we study the problem of learning a two-layer over-parameterized neural network, when the data is generated b… ▽ More

    Submitted 27 October, 2017; originally announced October 2017.

  11. arXiv:1708.02406  [pdf, other

    cs.LG

    Robust Conditional Probabilities

    Authors: Yoav Wald, Amir Globerson

    Abstract: Conditional probabilities are a core concept in machine learning. For example, optimal prediction of a label $Y$ given an input $X$ corresponds to maximizing the conditional probability of $Y$ given $X$. A common approach to inference tasks is learning a model of conditional probabilities. However, these models are often based on strong assumptions (e.g., log-linear models), and hence their estima… ▽ More

    Submitted 8 August, 2017; originally announced August 2017.

    Comments: 24 pages, 1 figure

  12. arXiv:1703.06426  [pdf, ps, other

    cs.LG

    Semi-Supervised Learning with Competitive Infection Models

    Authors: Nir Rosenfeld, Amir Globerson

    Abstract: The goal in semi-supervised learning is to effectively combine labeled and unlabeled data. One way to do this is by encouraging smoothness across edges in a graph whose nodes correspond to input examples. In many graph-based methods, labels can be thought of as propagating over the graph, where the underlying propagation mechanism is based on random walks or on averaging dynamics. While theoretica… ▽ More

    Submitted 27 February, 2018; v1 submitted 19 March, 2017; originally announced March 2017.

  13. arXiv:1702.07966  [pdf, other

    cs.LG math.OC stat.ML

    Globally Optimal Gradient Descent for a ConvNet with Gaussian Inputs

    Authors: Alon Brutzkus, Amir Globerson

    Abstract: Deep learning models are often successfully trained using gradient descent, despite the worst case hardness of the underlying non-convex optimization problem. The key question is then under what conditions can one prove that optimization will succeed. Here we provide a strong result of this kind. We consider a neural net with one hidden layer and a convolutional structure with no overlap and a ReL… ▽ More

    Submitted 25 February, 2017; originally announced February 2017.

  14. arXiv:1608.07639  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Learning to generalize to new compositions in image understanding

    Authors: Yuval Atzmon, Jonathan Berant, Vahid Kezami, Amir Globerson, Gal Chechik

    Abstract: Recurrent neural networks have recently been used for learning to describe images using natural language. However, it has been observed that these models generalize poorly to scenes that were not observed during training, possibly depending too strongly on the statistics of the text in the training data. Here we propose to describe images using short structured representations, aiming to capture t… ▽ More

    Submitted 26 August, 2016; originally announced August 2016.

  15. arXiv:1606.05316  [pdf, other

    cs.LG

    Learning Infinite-Layer Networks: Without the Kernel Trick

    Authors: Roi Livni, Daniel Carmon, Amir Globerson

    Abstract: Infinite--Layer Networks (ILN) have recently been proposed as an architecture that mimics neural networks while enjoying some of the advantages of kernel methods. ILN are networks that integrate over infinitely many nodes within a single hidden layer. It has been demonstrated by several authors that the problem of learning ILN can be reduced to the kernel trick, implying that whenever a certain in… ▽ More

    Submitted 28 July, 2017; v1 submitted 16 June, 2016; originally announced June 2016.

  16. arXiv:1605.04719  [pdf, ps, other

    cs.SI

    Optimal Tagging with Markov Chain Optimization

    Authors: Nir Rosenfeld, Amir Globerson

    Abstract: Many information systems use tags and keywords to describe and annotate content. These allow for efficient organization and categorization of items, as well as facilitate relevant search queries. As such, the selected set of tags for an item can have a considerable effect on the volume of traffic that eventually reaches an item. In settings where tags are chosen by an item's creator, who in turn i… ▽ More

    Submitted 19 May, 2016; v1 submitted 16 May, 2016; originally announced May 2016.

  17. arXiv:1409.5834  [pdf, other

    cs.LG cs.DS stat.ML

    Tight Error Bounds for Structured Prediction

    Authors: Amir Globerson, Tim Roughgarden, David Sontag, Cafer Yildirim

    Abstract: Structured prediction tasks in machine learning involve the simultaneous prediction of multiple labels. This is typically done by maximizing a score function on the space of labels, which decomposes as a sum of pairwise elements, each depending on two specific labels. Intuitively, the more pairwise terms are used, the better the expected accuracy. However, there is currently no theoretical account… ▽ More

    Submitted 19 September, 2014; originally announced September 2014.

  18. arXiv:1309.6848  [pdf

    cs.AI

    Tighter Linear Program Relaxations for High Order Graphical Models

    Authors: Elad Mezuman, Daniel Tarlow, Amir Globerson, Yair Weiss

    Abstract: Graphical models with High Order Potentials (HOPs) have received considerable interest in recent years. While there are a variety of approaches to inference in these models, nearly all of them amount to solving a linear program (LP) relaxation with unary consistency constraints between the HOP and the individual variables. In many cases, the resulting relaxations are loose, and in these cases the… ▽ More

    Submitted 26 September, 2013; originally announced September 2013.

    Comments: Appears in Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence (UAI2013)

    Report number: UAI-P-2013-PG-421-430

  19. arXiv:1309.6847  [pdf

    cs.LG stat.ML

    Learning Max-Margin Tree Predictors

    Authors: Ofer Meshi, Elad Eban, Gal Elidan, Amir Globerson

    Abstract: Structured prediction is a powerful framework for coping with joint prediction of interacting outputs. A central difficulty in using this framework is that often the correct label dependence structure is unknown. At the same time, we would like to avoid an overly complex structure that will lead to intractable prediction. In this work we address the challenge of learning tree structured predictive… ▽ More

    Submitted 26 September, 2013; originally announced September 2013.

    Comments: Appears in Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence (UAI2013)

    Report number: UAI-P-2013-PG-411-420

  20. arXiv:1212.2483  [pdf

    cs.LG stat.ML

    Sufficient Dimensionality Reduction with Irrelevant Statistics

    Authors: Amir Globerson, Gal Chechik, Naftali Tishby

    Abstract: The problem of finding a reduced dimensionality representation of categorical variables while preserving their most relevant characteristics is fundamental for the analysis of complex data. Specifically, given a co-occurrence matrix of two variables, one often seeks a compact representation of one variable which preserves information about the other variable. We have recently intro… ▽ More

    Submitted 19 October, 2012; originally announced December 2012.

    Comments: Appears in Proceedings of the Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI2003)

    Report number: UAI-P-2003-PG-281-288

  21. arXiv:1207.4110  [pdf

    cs.LG stat.ML

    The Minimum Information Principle for Discriminative Learning

    Authors: Amir Globerson, Naftali Tishby

    Abstract: Exponential models of distributions are widely used in machine learning for classiffication and modelling. It is well known that they can be interpreted as maximum entropy models under empirical expectation constraints. In this work, we argue that for classiffication tasks, mutual information is a more suitable information theoretic measure to be optimized. We show how the principle of minimum mut… ▽ More

    Submitted 11 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twentieth Conference on Uncertainty in Artificial Intelligence (UAI2004)

    Report number: UAI-P-2004-PG-193-200

  22. arXiv:1206.6815  [pdf

    cs.LG stat.ML

    Discriminative Learning via Semidefinite Probabilistic Models

    Authors: Koby Crammer, Amir Globerson

    Abstract: Discriminative linear models are a popular tool in machine learning. These can be generally divided into two types: The first is linear classifiers, such as support vector machines, which are well studied and provide state-of-the-art results. One shortcoming of these models is that their output (known as the 'margin') is not calibrated, and cannot be translated naturally into a distribution over t… ▽ More

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Second Conference on Uncertainty in Artificial Intelligence (UAI2006)

    Report number: UAI-P-2006-PG-98-105

  23. arXiv:1206.5243  [pdf

    cs.LG stat.ML

    Convergent Propagation Algorithms via Oriented Trees

    Authors: Amir Globerson, Tommi S. Jaakkola

    Abstract: Inference problems in graphical models are often approximated by casting them as constrained optimization problems. Message passing algorithms, such as belief propagation, have previously been suggested as methods for solving these optimization problems. However, there are few convergence guarantees for such algorithms, and the algorithms are therefore not guaranteed to solve the corresponding opt… ▽ More

    Submitted 20 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Third Conference on Uncertainty in Artificial Intelligence (UAI2007)

    Report number: UAI-P-2007-PG-133-140

  24. arXiv:1206.4604  [pdf

    cs.LG cs.AI

    Learning the Experts for Online Sequence Prediction

    Authors: Elad Eban, Aharon Birnbaum, Shai Shalev-Shwartz, Amir Globerson

    Abstract: Online sequence prediction is the problem of predicting the next element of a sequence given previous elements. This problem has been extensively studied in the context of individual sequence prediction, where no prior assumptions are made on the origin of the sequence. Individual sequence prediction algorithms work quite well for long sequences, where the algorithm has enough time to learn the te… ▽ More

    Submitted 18 June, 2012; originally announced June 2012.

    Comments: ICML2012

  25. arXiv:1206.3288  [pdf

    cs.DS cs.AI cs.CE

    Tightening LP Relaxations for MAP using Message Passing

    Authors: David Sontag, Talya Meltzer, Amir Globerson, Tommi S. Jaakkola, Yair Weiss

    Abstract: Linear Programming (LP) relaxations have become powerful tools for finding the most probable (MAP) configuration in graphical models. These relaxations can be solved efficiently using message-passing algorithms such as belief propagation and, when the relaxation is tight, provably find the MAP configuration. The standard LP relaxation is not tight enough in many real-world problems, however, and t… ▽ More

    Submitted 13 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Fourth Conference on Uncertainty in Artificial Intelligence (UAI2008)

    Report number: UAI-P-2008-PG-503-510

  26. arXiv:1205.2625  [pdf

    cs.AI cs.LG

    Convergent message passing algorithms - a unifying view

    Authors: Talya Meltzer, Amir Globerson, Yair Weiss

    Abstract: Message-passing algorithms have emerged as powerful techniques for approximate inference in graphical models. When these algorithms converge, they can be shown to find local (or sometimes even global) optima of variational formulations to the inference problem. But many of the most popular algorithms are not guaranteed to converge. This has lead to recent interest in convergent message-passing alg… ▽ More

    Submitted 9 May, 2012; originally announced May 2012.

    Comments: Appears in Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence (UAI2009)

    Report number: UAI-P-2009-PG-393-401

  27. arXiv:1205.2624  [pdf

    cs.AI cs.LG

    Convexifying the Bethe Free Energy

    Authors: Ofer Meshi, Ariel Jaimovich, Amir Globerson, Nir Friedman

    Abstract: The introduction of loopy belief propagation (LBP) revitalized the application of graphical models in many domains. Many recent works present improvements on the basic LBP algorithm in an attempt to overcome convergence and local optima problems. Notable among these are convexified free energy approximations that lead to inference procedures with provable convergence and quality properties. Howeve… ▽ More

    Submitted 9 May, 2012; originally announced May 2012.

    Comments: Appears in Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence (UAI2009)

    Report number: UAI-P-2009-PG-402-410

  28. arXiv:1202.3731  [pdf

    cs.LG stat.ML

    What Cannot be Learned with Bethe Approximations

    Authors: Uri Heinemann, Amir Globerson

    Abstract: We address the problem of learning the parameters in graphical models when inference is intractable. A common strategy in this case is to replace the partition function with its Bethe approximation. We show that there exists a regime of empirical marginals where such Bethe learning will fail. By failure we mean that the empirical marginals cannot be recovered from the approximated maximum likeliho… ▽ More

    Submitted 14 February, 2012; originally announced February 2012.

    Report number: UAI-P-2011-PG-319-326

  29. arXiv:1104.0235  [pdf, other

    cs.LG

    Gaussian Robust Classification

    Authors: Ido Ginodi, Amir Globerson

    Abstract: Supervised learning is all about the ability to generalize knowledge. Specifically, the goal of the learning is to train a classifier using training data, in such a way that it will be capable of classifying new unseen data correctly. In order to acheive this goal, it is important to carefully design the learner, so it will not overfit the training data. The later can is done usually by adding a r… ▽ More

    Submitted 1 April, 2011; originally announced April 2011.

    Comments: Master's dissertation of the first author, carried out under the supervision of the second author