Skip to main content

Showing 1–50 of 116 results for author: Moran, S

  1. arXiv:2407.07765  [pdf, other

    cs.LG cs.CR cs.DS math.CO stat.ML

    Ramsey Theorems for Trees and a General 'Private Learning Implies Online Learning' Theorem

    Authors: Simone Fioravanti, Steve Hanneke, Shay Moran, Hilla Schefler, Iska Tsubari

    Abstract: This work continues to investigate the link between differentially private (DP) and online learning. Alon, Livni, Malliaris, and Moran (2019) showed that for binary concept classes, DP learnability of a given class implies that it has a finite Littlestone dimension (equivalently, that it is online learnable). Their proof relies on a model-theoretic result by Hodges (1997), which demonstrates that… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  2. arXiv:2406.15916  [pdf, ps, other

    cs.LG cs.CR stat.ML

    Credit Attribution and Stable Compression

    Authors: Roi Livni, Shay Moran, Kobbi Nissim, Chirag Pabbaraju

    Abstract: Credit attribution is crucial across various fields. In academic research, proper citation acknowledges prior work and establishes original contributions. Similarly, in generative models, such as those trained on existing artworks or music, it is important to ensure that any generated content influenced by these works appropriately credits the original creators. We study credit attribution by ma… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: 15 pages, 1 figure

  3. arXiv:2406.12406  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Fast Rates for Bandit PAC Multiclass Classification

    Authors: Liad Erez, Alon Cohen, Tomer Koren, Yishay Mansour, Shay Moran

    Abstract: We study multiclass PAC learning with bandit feedback, where inputs are classified into one of $K$ possible labels and feedback is limited to whether or not the predicted labels are correct. Our main contribution is in designing a novel learning algorithm for the agnostic $(\varepsilon,δ)$-PAC version of the problem, with sample complexity of… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  4. arXiv:2406.11946  [pdf, ps, other

    cs.HC

    Expedient Assistance and Consequential Misunderstanding: Envisioning an Operationalized Mutual Theory of Mind

    Authors: Justin D. Weisz, Michael Muller, Arielle Goldberg, Dario Andres Silva Moran

    Abstract: Design fictions allow us to prototype the future. They enable us to interrogate emerging or non-existent technologies and examine their implications. We present three design fictions that probe the potential consequences of operationalizing a mutual theory of mind (MToM) between human users and one (or more) AI agents. We use these fictions to explore many aspects of MToM, including how models of… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 11 pages. Published in Proceedings of Workshop on Theory of Mind in Human-AI Interaction at CHI 2024

  5. arXiv:2406.10529  [pdf, ps, other

    cs.LG cs.AI stat.ML

    A Theory of Interpretable Approximations

    Authors: Marco Bressan, Nicolò Cesa-Bianchi, Emmanuel Esposito, Yishay Mansour, Shay Moran, Maximilian Thiessen

    Abstract: Can a deep neural network be approximated by a small decision tree based on simple features? This question and its variants are behind the growing demand for machine learning models that are *interpretable* by humans. In this work we study such questions by introducing *interpretable approximations*, a notion that captures the idea of approximating a target concept $c$ by a small aggregation of co… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: To appear at COLT 2024

  6. arXiv:2405.17120  [pdf, ps, other

    cs.DM cs.LG

    Dual VC Dimension Obstructs Sample Compression by Embeddings

    Authors: Zachary Chase, Bogdan Chornomaz, Steve Hanneke, Shay Moran, Amir Yehudayoff

    Abstract: This work studies embedding of arbitrary VC classes in well-behaved VC classes, focusing particularly on extremal classes. Our main result expresses an impossibility: such embeddings necessarily require a significant increase in dimension. In particular, we prove that for every $d$ there is a class with VC dimension $d$ that cannot be embedded in any extremal class of VC dimension smaller than exp… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    ACM Class: I.2.6; G.2.1

  7. arXiv:2405.15753  [pdf, ps, other

    cs.CR

    Data Reconstruction: When You See It and When You Don't

    Authors: Edith Cohen, Haim Kaplan, Yishay Mansour, Shay Moran, Kobbi Nissim, Uri Stemmer, Eliad Tsfadia

    Abstract: We revisit the fundamental question of formally defining what constitutes a reconstruction attack. While often clear from the context, our exploration reveals that a precise definition is much more nuanced than it appears, to the extent that a single all-encompassing definition may not exist. Thus, we employ a different strategy and aim to "sandwich" the concept of reconstruction attacks by addres… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  8. arXiv:2405.15062  [pdf, other

    cs.LG

    Model-Agnostic Utility-Preserving Biometric Information Anonymization

    Authors: Chun-Fu Chen, Bill Moriarty, Shaohan Hu, Sean Moran, Marco Pistoia, Vincenzo Piuri, Pierangela Samarati

    Abstract: The recent rapid advancements in both sensing and machine learning technologies have given rise to the universal collection and utilization of people's biometrics, such as fingerprints, voices, retina/facial scans, or gait/motion/gestures data, enabling a wide range of applications including authentication, health monitoring, or much more sophisticated analytics. While providing better user experi… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: Preprint of IJIS version, https://link.springer.com/article/10.1007/s10207-024-00862-8

  9. arXiv:2405.10027  [pdf, ps, other

    cs.LG cs.AI stat.ML

    The Real Price of Bandit Information in Multiclass Classification

    Authors: Liad Erez, Alon Cohen, Tomer Koren, Yishay Mansour, Shay Moran

    Abstract: We revisit the classical problem of multiclass classification with bandit feedback (Kakade, Shalev-Shwartz and Tewari, 2008), where each input classifies to one of $K$ possible labels and feedback is restricted to whether the predicted label is correct or not. Our primary inquiry is with regard to the dependency on the number of labels $K$, and whether $T$-step regret bounds in this setting can be… ▽ More

    Submitted 19 June, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

  10. arXiv:2403.19509  [pdf

    cs.CL cs.SD eess.AS

    Phonetic Segmentation of the UCLA Phonetics Lab Archive

    Authors: Eleanor Chodroff, Blaž Pažon, Annie Baker, Steven Moran

    Abstract: Research in speech technologies and comparative linguistics depends on access to diverse and accessible speech data. The UCLA Phonetics Lab Archive is one of the earliest multilingual speech corpora, with long-form audio recordings and phonetic transcriptions for 314 languages (Ladefoged et al., 2009). Recently, 95 of these languages were time-aligned with word-level phonetic transcriptions (Li et… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: Accepted at LREC-COLING 2024

  11. arXiv:2403.10889  [pdf, other

    cs.LG stat.ML

    List Sample Compression and Uniform Convergence

    Authors: Steve Hanneke, Shay Moran, Tom Waknine

    Abstract: List learning is a variant of supervised classification where the learner outputs multiple plausible labels for each instance rather than just one. We investigate classical principles related to generalization within the context of list learning. Our primary goal is to determine whether classical principles in the PAC setting retain their applicability in the domain of list PAC learning. We focus… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

  12. arXiv:2403.07413  [pdf, ps, other

    cs.LG cs.DS

    Learning-Augmented Algorithms with Explicit Predictors

    Authors: Marek Elias, Haim Kaplan, Yishay Mansour, Shay Moran

    Abstract: Recent advances in algorithmic design show how to utilize predictions obtained by machine learning models from past and present data. These approaches have demonstrated an enhancement in performance when the predictions are accurate, while also ensuring robustness by providing worst-case guarantees when predictions fail. In this paper we focus on online problems; prior research in this context was… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  13. arXiv:2403.03909  [pdf, other

    cs.CL

    A Measure for Transparent Comparison of Linguistic Diversity in Multilingual NLP Data Sets

    Authors: Tanja Samardzic, Ximena Gutierrez, Christian Bentz, Steven Moran, Olga Pelloni

    Abstract: Typologically diverse benchmarks are increasingly created to track the progress achieved in multilingual NLP. Linguistic diversity of these data sets is typically measured as the number of languages or language families included in the sample, but such measures do not consider structural properties of the included languages. In this paper, we propose assessing linguistic diversity of a data set ag… ▽ More

    Submitted 16 April, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    Comments: Accepted to NAACL 2024 Findings

  14. arXiv:2402.19303  [pdf, ps, other

    cs.LG cs.GT

    Learnability Gaps of Strategic Classification

    Authors: Lee Cohen, Yishay Mansour, Shay Moran, Han Shao

    Abstract: In contrast with standard classification tasks, strategic classification involves agents strategically modifying their features in an effort to receive favorable predictions. For instance, given a classifier determining loan approval based on credit scores, applicants may open or close their credit cards to fool the classifier. The learning goal is to find a classifier robust against strategic man… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

  15. arXiv:2402.07453  [pdf, ps, other

    cs.LG stat.ML

    Bandit-Feedback Online Multiclass Classification: Variants and Tradeoffs

    Authors: Yuval Filmus, Steve Hanneke, Idan Mehalel, Shay Moran

    Abstract: Consider the domain of multiclass classification within the adversarial online setting. What is the price of relying on bandit feedback as opposed to full information? To what extent can an adaptive adversary amplify the loss compared to an oblivious one? To what extent can a randomized learner reduce the loss compared to a deterministic one? We study these questions in the mistake bound model and… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

  16. arXiv:2401.01754  [pdf, other

    cs.SE cs.AI

    Using AI/ML to Find and Remediate Enterprise Secrets in Code & Document Sharing Platforms

    Authors: Gregor Kerr, David Algorry, Senad Ibraimoski, Peter Maciver, Sean Moran

    Abstract: We introduce a new challenge to the software development community: 1) leveraging AI to accurately detect and flag up secrets in code and on popular document sharing platforms that frequently used by developers, such as Confluence and 2) automatically remediating the detections (e.g. by suggesting password vault functionality). This is a challenging, and mostly unaddressed task. Existing methods l… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

  17. arXiv:2401.01753   

    cs.AI

    A Generative AI Assistant to Accelerate Cloud Migration

    Authors: Amal Vaidya, Mohan Krishna Vankayalapati, Jacky Chan, Senad Ibraimoski, Sean Moran

    Abstract: We present a tool that leverages generative AI to accelerate the migration of on-premises applications to the cloud. The Cloud Migration LLM accepts input from the user specifying the parameters of their migration, and outputs a migration strategy with an architecture diagram. A user study suggests that the migration LLM can assist inexperienced users in finding the right cloud migration profile,… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

    Comments: arXiv admin comment: This version has been removed by arXiv administrators as the submitter did not have the rights to agree to the license at the time of submission

  18. arXiv:2311.10448  [pdf, other

    cs.LG cs.CR cs.CV

    DeepClean: Machine Unlearning on the Cheap by Resetting Privacy Sensitive Weights using the Fisher Diagonal

    Authors: Jiaeli Shi, Najah Ghalyan, Kostis Gourgoulias, John Buford, Sean Moran

    Abstract: Machine learning models trained on sensitive or private data can inadvertently memorize and leak that information. Machine unlearning seeks to retroactively remove such details from model weights to protect privacy. We contribute a lightweight unlearning algorithm that leverages the Fisher Information Matrix (FIM) for selective forgetting. Prior work in this area requires full retraining or large… ▽ More

    Submitted 26 April, 2024; v1 submitted 17 November, 2023; originally announced November 2023.

  19. arXiv:2311.06428  [pdf, other

    cs.LG

    A Trichotomy for Transductive Online Learning

    Authors: Steve Hanneke, Shay Moran, Jonathan Shafer

    Abstract: We present new upper and lower bounds on the number of learner mistakes in the `transductive' online learning setting of Ben-David, Kushilevitz and Mansour (1997). This setting is similar to standard online learning, except that the adversary fixes a sequence of instances $x_1,\dots,x_n$ to be labeled at the start of the game, and this sequence is known to the learner. Qualitatively, we prove a tr… ▽ More

    Submitted 29 November, 2023; v1 submitted 10 November, 2023; originally announced November 2023.

  20. arXiv:2311.01599  [pdf, ps, other

    cs.LG cs.DS

    Local Borsuk-Ulam, Stability, and Replicability

    Authors: Zachary Chase, Bogdan Chornomaz, Shay Moran, Amir Yehudayoff

    Abstract: We use and adapt the Borsuk-Ulam Theorem from topology to derive limitations on list-replicable and globally stable learning algorithms. We further demonstrate the applicability of our methods in combinatorics and topology. We show that, besides trivial cases, both list-replicable and globally stable learning are impossible in the agnostic PAC setting. This is in contrast with the realizable cas… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    ACM Class: I.2.6

  21. arXiv:2310.18428  [pdf, ps, other

    cs.LG

    The Bayesian Stability Zoo

    Authors: Shay Moran, Hilla Schefler, Jonathan Shafer

    Abstract: We show that many definitions of stability found in the learning theory literature are equivalent to one another. We distinguish between two families of definitions of stability: distribution-dependent and distribution-independent Bayesian stability. Within each family, we establish equivalences between various definitions, encompassing approximate differential privacy, pure differential privacy,… ▽ More

    Submitted 5 December, 2023; v1 submitted 27 October, 2023; originally announced October 2023.

    Comments: v2, minor typo fix

  22. arXiv:2310.04442  [pdf, other

    physics.ins-det cs.LG hep-ex hep-ph nucl-ex

    The Optimal use of Segmentation for Sampling Calorimeters

    Authors: Fernando Torales Acosta, Bishnu Karki, Piyush Karande, Aaron Angerami, Miguel Arratia, Kenneth Barish, Ryan Milton, Sebastián Morán, Benjamin Nachman, Anshuman Sinha

    Abstract: One of the key design choices of any sampling calorimeter is how fine to make the longitudinal and transverse segmentation. To inform this choice, we study the impact of calorimeter segmentation on energy reconstruction. To ensure that the trends are due entirely to hardware and not to a sub-optimal use of segmentation, we deploy deep neural networks to perform the reconstruction. These networks m… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

  23. arXiv:2307.02066  [pdf, ps, other

    cs.LG stat.ML

    Universal Rates for Multiclass Learning

    Authors: Steve Hanneke, Shay Moran, Qian Zhang

    Abstract: We study universal rates for multiclass classification, establishing the optimal rates (up to log factors) for all hypothesis classes. This generalizes previous results on binary classification (Bousquet, Hanneke, Moran, van Handel, and Yehudayoff, 2021), and resolves an open question studied by Kalavasis, Velegkas, and Karbasi (2022) who handled the multiclass setting with a bounded number of cla… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: 67 pages, accepted to the 36th Annual Conference on Learning Theory (COLT 2023)

  24. arXiv:2307.00642  [pdf, ps, other

    cs.LG

    Multiclass Boosting: Simple and Intuitive Weak Learning Criteria

    Authors: Nataly Brukhim, Amit Daniely, Yishay Mansour, Shay Moran

    Abstract: We study a generalization of boosting to the multiclass setting. We introduce a weak learning condition for multiclass classification that captures the original notion of weak learnability as being "slightly better than random guessing". We give a simple and efficient boosting algorithm, that does not require realizability assumptions and its sample and oracle complexity bounds are independent of… ▽ More

    Submitted 2 July, 2023; originally announced July 2023.

  25. arXiv:2306.13119  [pdf, ps, other

    cs.LG cs.DS stat.ML

    Adversarial Resilience in Sequential Prediction via Abstention

    Authors: Surbhi Goel, Steve Hanneke, Shay Moran, Abhishek Shetty

    Abstract: We study the problem of sequential prediction in the stochastic setting with an adversary that is allowed to inject clean-label adversarial (or out-of-distribution) examples. Algorithms designed to handle purely stochastic data tend to fail in the presence of such adversarial examples, often leading to erroneous predictions. This is undesirable in many high-stakes applications such as medical reco… ▽ More

    Submitted 24 January, 2024; v1 submitted 22 June, 2023; originally announced June 2023.

  26. arXiv:2305.15016  [pdf, other

    cs.LG

    Estimating class separability of text embeddings with persistent homology

    Authors: Kostis Gourgoulias, Najah Ghalyan, Maxime Labonne, Yash Satsangi, Sean Moran, Joseph Sabelja

    Abstract: This paper introduces an unsupervised method to estimate the class separability of text datasets from a topological point of view. Using persistent homology, we demonstrate how tracking the evolution of embedding manifolds during training can inform about class separability. More specifically, we show how this technique can be applied to detect when the training process stops improving the separab… ▽ More

    Submitted 18 June, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Updated version of the article; pre-print of the version published at Transactions of Machine Learning Research, https://openreview.net/forum?id=8DWrIMuLya

  27. arXiv:2305.14822  [pdf, other

    cs.LG cs.CR

    Can Copyright be Reduced to Privacy?

    Authors: Niva Elkin-Koren, Uri Hacohen, Roi Livni, Shay Moran

    Abstract: There is a growing concern that generative AI models will generate outputs closely resembling the copyrighted materials for which they are trained. This worry has intensified as the quality and complexity of generative models have immensely improved, and the availability of extensive datasets containing copyrighted material has expanded. Researchers are actively exploring strategies to mitigate th… ▽ More

    Submitted 24 March, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

  28. arXiv:2305.14311  [pdf, other

    cs.LG cs.DS stat.ML

    Statistical Indistinguishability of Learning Algorithms

    Authors: Alkis Kalavasis, Amin Karbasi, Shay Moran, Grigoris Velegkas

    Abstract: When two different parties use the same learning rule on their own data, how can we test whether the distributions of the two outcomes are similar? In this paper, we study the similarity of outcomes of learning rules through the lens of the Total Variation (TV) distance of distributions. We say that a learning rule is TV indistinguishable if the expected TV distance between the posterior distribut… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

  29. arXiv:2304.03996  [pdf, ps, other

    cs.LG

    A Unified Characterization of Private Learnability via Graph Theory

    Authors: Noga Alon, Shay Moran, Hilla Schefler, Amir Yehudayoff

    Abstract: We provide a unified framework for characterizing pure and approximate differentially private (DP) learnability. The framework uses the language of graph theory: for a concept class $\mathcal{H}$, we define the contradiction graph $G$ of $\mathcal{H}$. Its vertices are realizable datasets, and two datasets $S,S'$ are connected by an edge if they contradict each other (i.e., there is a point $x$ th… ▽ More

    Submitted 12 June, 2024; v1 submitted 8 April, 2023; originally announced April 2023.

  30. arXiv:2304.03757  [pdf, ps, other

    cs.LG math.CO stat.ML

    Replicability and stability in learning

    Authors: Zachary Chase, Shay Moran, Amir Yehudayoff

    Abstract: Replicability is essential in science as it allows us to validate and verify research findings. Impagliazzo, Lei, Pitassi and Sorrell (`22) recently initiated the study of replicability in machine learning. A learning algorithm is replicable if it typically produces the same output when applied on two i.i.d. inputs using the same internal randomness. We study a variant of replicability that does n… ▽ More

    Submitted 12 April, 2023; v1 submitted 7 April, 2023; originally announced April 2023.

    Comments: 17 pages

  31. arXiv:2304.01238  [pdf, other

    cs.CL cs.AI

    Spam-T5: Benchmarking Large Language Models for Few-Shot Email Spam Detection

    Authors: Maxime Labonne, Sean Moran

    Abstract: This paper investigates the effectiveness of large language models (LLMs) in email spam detection by comparing prominent models from three distinct families: BERT-like, Sentence Transformers, and Seq2Seq. Additionally, we examine well-established machine learning techniques for spam detection, such as Naïve Bayes and LightGBM, as baseline methods. We assess the performance of these models across f… ▽ More

    Submitted 7 May, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

  32. A Benchmark Generative Probabilistic Model for Weak Supervised Learning

    Authors: Georgios Papadopoulos, Fran Silavong, Sean Moran

    Abstract: Finding relevant and high-quality datasets to train machine learning models is a major bottleneck for practitioners. Furthermore, to address ambitious real-world use-cases there is usually the requirement that the data come labelled with high-quality annotations that can facilitate the training of a supervised model. Manually labelling data with high-quality labels is generally a time-consuming an… ▽ More

    Submitted 4 October, 2023; v1 submitted 31 March, 2023; originally announced March 2023.

    Journal ref: Lecture Notes in Computer Science 2023; vol 14174; Springer; p 36

  33. arXiv:2303.17716  [pdf, ps, other

    cs.LG stat.ML

    Multiclass Online Learning and Uniform Convergence

    Authors: Steve Hanneke, Shay Moran, Vinod Raman, Unique Subedi, Ambuj Tewari

    Abstract: We study multiclass classification in the agnostic adversarial online learning setting. As our main result, we prove that any multiclass concept class is agnostically learnable if and only if its Littlestone dimension is finite. This solves an open problem studied by Daniely, Sabato, Ben-David, and Shalev-Shwartz (2011,2015) who handled the case when the number of classes (or labels) is bounded. W… ▽ More

    Submitted 7 July, 2023; v1 submitted 30 March, 2023; originally announced March 2023.

    Comments: COLT Camera-Ready, 15 pages

  34. arXiv:2303.15383  [pdf, other

    cs.LG stat.ML

    List Online Classification

    Authors: Shay Moran, Ohad Sharon, Iska Tsubari, Sivan Yosebashvili

    Abstract: We study multiclass online prediction where the learner can predict using a list of multiple labels (as opposed to just one label in the traditional setting). We characterize learnability in this model using the $b$-ary Littlestone dimension. This dimension is a variation of the classical Littlestone dimension with the difference that binary mistake trees are replaced with $(k+1)$-ary mistake tree… ▽ More

    Submitted 18 May, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

  35. arXiv:2302.14099  [pdf, ps, other

    cs.LG cs.CR cs.DS

    On Differentially Private Online Predictions

    Authors: Haim Kaplan, Yishay Mansour, Shay Moran, Kobbi Nissim, Uri Stemmer

    Abstract: In this work we introduce an interactive variant of joint differential privacy towards handling online processes in which existing privacy definitions seem too restrictive. We study basic properties of this definition and demonstrate that it satisfies (suitable variants) of group privacy, composition, and post processing. We then study the cost of interactive joint privacy in the basic setting of… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

  36. arXiv:2302.13849  [pdf, ps, other

    cs.LG

    Optimal Prediction Using Expert Advice and Randomized Littlestone Dimension

    Authors: Yuval Filmus, Steve Hanneke, Idan Mehalel, Shay Moran

    Abstract: A classical result in online learning characterizes the optimal mistake bound achievable by deterministic learners using the Littlestone dimension (Littlestone '88). We prove an analogous result for randomized learners: we show that the optimal expected mistake bound in learning a class $\mathcal{H}$ equals its randomized Littlestone dimension, which is the largest $d$ for which there exists a tre… ▽ More

    Submitted 17 August, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

  37. arXiv:2302.10798  [pdf, other

    cs.LG cs.CV

    Learning a Consensus Sub-Network with Polarization Regularization and One Pass Training

    Authors: Xiaoying Zhi, Varun Babbar, Pheobe Sun, Fran Silavong, Ruibo Shi, Sean Moran

    Abstract: The subject of green AI has been gaining attention within the deep learning community given the recent trend of ever larger and more complex neural network models. Existing solutions for reducing the computational load of training at inference time usually involve pruning the network parameters. Pruning schemes often create extra overhead either by iterative training and fine-tuning for static pru… ▽ More

    Submitted 4 November, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

  38. arXiv:2301.02214  [pdf, other

    eess.AS cs.SD

    Automatic Sound Event Detection and Classification of Great Ape Calls Using Neural Networks

    Authors: Zifan Jiang, Adrian Soldati, Isaac Schamberg, Adriano R. Lameira, Steven Moran

    Abstract: We present a novel approach to automatically detect and classify great ape calls from continuous raw audio recordings collected during field research. Our method leverages deep pretrained and sequential neural networks, including wav2vec 2.0 and LSTM, and is validated on three data sets from three different great ape lineages (orangutans, chimpanzees, and bonobos). The recordings were collected by… ▽ More

    Submitted 21 June, 2024; v1 submitted 5 January, 2023; originally announced January 2023.

    Comments: This paper is published as: Jiang, Zifan, Adrian Soldati, Isaac Schamberg, Adriano R. Lameira and Steven Moran. Automatic Sound Event Detection and Classification of Great Ape Calls Using Neural Networks. In Proceedings of the 20th International Congress of Phonetic Sciences (ICPhS 2023), 3100-3104, Prague, Czech Republic (https://guarant.cz/icphs2023/508.pdf)

  39. arXiv:2301.01924  [pdf, other

    math.CO cs.CC cs.DM math.LO

    Diagonalization Games

    Authors: Noga Alon, Olivier Bousquet, Kasper Green Larsen, Shay Moran, Shlomo Moran

    Abstract: We study several variants of a combinatorial game which is based on Cantor's diagonal argument. The game is between two players called Kronecker and Cantor. The names of the players are motivated by the known fact that Leopold Kronecker did not appreciate Georg Cantor's arguments about the infinite, and even referred to him as a "scientific charlatan". In the game Kronecker maintains a list of m… ▽ More

    Submitted 22 January, 2023; v1 submitted 5 January, 2023; originally announced January 2023.

    Comments: 11 pages, added acknowledgements

  40. arXiv:2212.07253  [pdf, other

    cs.SE cs.AI

    API-Miner: an API-to-API Specification Recommendation Engine

    Authors: Sae Young Moon, Gregor Kerr, Fran Silavong, Sean Moran

    Abstract: When designing a new API for a large project, developers need to make smart design choices so that their code base can grow sustainably. To ensure that new API components are well designed, developers can learn from existing API components. However, the lack of standardized methods for comparing API designs makes this learning process time-consuming and difficult. To address this gap we developed… ▽ More

    Submitted 19 July, 2023; v1 submitted 14 December, 2022; originally announced December 2022.

  41. arXiv:2212.05050  [pdf, other

    math.LO cs.DM cs.LG cs.LO math.CO

    The unstable formula theorem revisited via algorithms

    Authors: Maryanthe Malliaris, Shay Moran

    Abstract: This paper is about the surprising interaction of a foundational result from model theory about stability of theories, which seems to be inherently about the infinite, with algorithmic stability in learning. Specifically, we develop a complete algorithmic analogue of Shelah's celebrated Unstable Formula Theorem, with algorithmic properties taking the place of the infinite. This draws on several ne… ▽ More

    Submitted 17 April, 2023; v1 submitted 9 December, 2022; originally announced December 2022.

  42. arXiv:2212.04216  [pdf, ps, other

    cs.LG cs.CR stat.ML

    Differentially-Private Bayes Consistency

    Authors: Olivier Bousquet, Haim Kaplan, Aryeh Kontorovich, Yishay Mansour, Shay Moran, Menachem Sadigurschi, Uri Stemmer

    Abstract: We construct a universally Bayes consistent learning rule that satisfies differential privacy (DP). We first handle the setting of binary classification and then extend our rule to the more general setting of density estimation (with respect to the total variation metric). The existence of a universally consistent DP learner reveals a stark difference with the distribution-free PAC model. Indeed,… ▽ More

    Submitted 8 December, 2022; originally announced December 2022.

  43. arXiv:2210.05406  [pdf, other

    cs.SE cs.AI

    Code Librarian: A Software Package Recommendation System

    Authors: Lili Tao, Alexandru-Petre Cazan, Senad Ibraimoski, Sean Moran

    Abstract: The use of packaged libraries can significantly shorten the software development cycle by improving the quality and readability of code. In this paper, we present a recommendation engine called Librarian for open source libraries. A candidate library package is recommended for a given context if: 1) it has been frequently used with the imported libraries in the program; 2) it has similar functiona… ▽ More

    Submitted 7 February, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

  44. arXiv:2210.02713  [pdf, ps, other

    cs.LG cs.CR

    On Optimal Learning Under Targeted Data Poisoning

    Authors: Steve Hanneke, Amin Karbasi, Mohammad Mahmoody, Idan Mehalel, Shay Moran

    Abstract: Consider the task of learning a hypothesis class $\mathcal{H}$ in the presence of an adversary that can replace up to an $η$ fraction of the examples in the training set with arbitrary adversarial examples. The adversary aims to fail the learner on a particular target test point $x$ which is known to the adversary but not to the learner. In this work we aim to characterize the smallest achievable… ▽ More

    Submitted 12 October, 2022; v1 submitted 6 October, 2022; originally announced October 2022.

  45. arXiv:2208.14615  [pdf, other

    cs.LG cs.CC stat.ML

    Fine-Grained Distribution-Dependent Learning Curves

    Authors: Olivier Bousquet, Steve Hanneke, Shay Moran, Jonathan Shafer, Ilya Tolstikhin

    Abstract: Learning curves plot the expected error of a learning algorithm as a function of the number of labeled samples it receives from a target distribution. They are widely used as a measure of an algorithm's performance, but classic PAC learning theory cannot explain their behavior. As observed by Antos and Lugosi (1996 , 1998), the classic `No Free Lunch' lower bounds only trace the upper envelope a… ▽ More

    Submitted 10 November, 2022; v1 submitted 30 August, 2022; originally announced August 2022.

  46. arXiv:2208.09495  [pdf, other

    cs.SE cs.AI

    Topical: Learning Repository Embeddings from Source Code using Attention

    Authors: Agathe Lherondelle, Varun Babbar, Yash Satsangi, Fran Silavong, Shaltiel Eloul, Sean Moran

    Abstract: This paper presents Topical, a novel deep neural network for repository level embeddings. Existing methods, reliant on natural language documentation or naive aggregation techniques, are outperformed by Topical's utilization of an attention mechanism. This mechanism generates repository-level representations from source code, full dependency graphs, and script level textual data. Trained on public… ▽ More

    Submitted 4 November, 2023; v1 submitted 19 August, 2022; originally announced August 2022.

    Comments: Pre-print, under review

  47. arXiv:2207.00614  [pdf, other

    stat.ML cs.LG

    Integral Probability Metrics PAC-Bayes Bounds

    Authors: Ron Amit, Baruch Epstein, Shay Moran, Ron Meir

    Abstract: We present a PAC-Bayes-style generalization bound which enables the replacement of the KL-divergence with a variety of Integral Probability Metrics (IPM). We provide instances of this bound with the IPM being the total variation metric and the Wasserstein distance. A notable feature of the obtained bounds is that they naturally interpolate between classical uniform convergence bounds in the worst… ▽ More

    Submitted 25 December, 2022; v1 submitted 1 July, 2022; originally announced July 2022.

    Comments: Accepted to NeurIPS 2022

  48. arXiv:2206.14800  [pdf, other

    cs.LG

    Understanding Generalization via Leave-One-Out Conditional Mutual Information

    Authors: Mahdi Haghifam, Shay Moran, Daniel M. Roy, Gintare Karolina Dziugaite

    Abstract: We study the mutual information between (certain summaries of) the output of a learning algorithm and its $n$ training data, conditional on a supersample of $n+1$ i.i.d. data from which the training data is chosen at random without replacement. These leave-one-out variants of the conditional mutual information (CMI) of an algorithm (Steinke and Zakynthinou, 2020) are also seen to control the mean… ▽ More

    Submitted 29 June, 2022; originally announced June 2022.

    Comments: 18 pages

  49. arXiv:2206.04713  [pdf, ps, other

    cs.LG cs.DC

    A Resilient Distributed Boosting Algorithm

    Authors: Yuval Filmus, Idan Mehalel, Shay Moran

    Abstract: Given a learning task where the data is distributed among several parties, communication is one of the fundamental resources which the parties would like to minimize. We present a distributed boosting algorithm which is resilient to a limited amount of noise. Our algorithm is similar to classical boosting algorithms, although it is equipped with a new component, inspired by Impagliazzo's hard-core… ▽ More

    Submitted 13 June, 2022; v1 submitted 9 June, 2022; originally announced June 2022.

  50. arXiv:2205.08585  [pdf, other

    cs.SE cs.AI cs.CV cs.LG

    CV4Code: Sourcecode Understanding via Visual Code Representations

    Authors: Ruibo Shi, Lili Tao, Rohan Saphal, Fran Silavong, Sean J. Moran

    Abstract: We present CV4Code, a compact and effective computer vision method for sourcecode understanding. Our method leverages the contextual and the structural information available from the code snippet by treating each snippet as a two-dimensional image, which naturally encodes the context and retains the underlying structural information through an explicit spatial representation. To codify snippets as… ▽ More

    Submitted 11 May, 2022; originally announced May 2022.