Skip to main content

Showing 1–50 of 128 results for author: Sutton, C

  1. arXiv:2407.09522  [pdf, other

    cs.DB cs.AI cs.LG stat.ML

    UQE: A Query Engine for Unstructured Databases

    Authors: Hanjun Dai, Bethany Yixin Wang, Xingchen Wan, Bo Dai, Sherry Yang, Azade Nova, Pengcheng Yin, Phitchaya Mangpo Phothilimthana, Charles Sutton, Dale Schuurmans

    Abstract: Analytics on structured data is a mature field with many successful methods. However, most real world data exists in unstructured form, such as images and conversations. We investigate the potential of Large Language Models (LLMs) to enable unstructured data analytics. In particular, we propose a new Universal Query Engine (UQE) that directly interrogates and draws insights from unstructured data… ▽ More

    Submitted 23 June, 2024; originally announced July 2024.

  2. arXiv:2405.20270  [pdf, other

    physics.chem-ph cond-mat.mtrl-sci physics.comp-ph

    Bridging electronic and classical density-functional theory using universal machine-learned functional approximations

    Authors: Michelle M. Kelley, Joshua Quinton, Kamron Fazel, Nima Karimitari, Christopher Sutton, Ravishankar Sundararaman

    Abstract: The accuracy of density-functional theory (DFT) is determined by the quality of the approximate functionals, such as exchange-correlation in electronic DFT and the excess functional in the classical DFT formalism of fluids. The exact functional is highly nonlocal for both electrons and fluids, yet most approximate functionals are semi-local or nonlocal in a limited weighted-density form. Machine-l… ▽ More

    Submitted 17 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

    Comments: 11 pages, 7 figures

  3. arXiv:2405.20239  [pdf

    cond-mat.mtrl-sci physics.chem-ph

    BEAST DB: Grand-Canonical Database of Electrocatalyst Properties

    Authors: Cooper Tezak, Jacob Clary, Sophie Gerits, Joshua Quinton, Benjamin Rich, Nicholas Singstock, Abdulaziz Alherz, Taylor Aubry, Struan Clark, Rachel Hurst, Mauro Del Ben, Christopher Sutton, Ravishankar Sundararaman, Charles Musgrave, Derek Vigil-Fowler

    Abstract: We present BEAST DB, an open-source database comprised of ab initio electrochemical data computed using grand-canonical density functional theory in implicit solvent at consistent calculation parameters. The database contains over 20,000 surface calculations and covers a broad set of heterogeneous catalyst materials and electrochemical reactions. Calculations were performed at self-consistent fixe… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 24 pages, 8 figures

  4. arXiv:2404.14662  [pdf, other

    cs.LG cs.CL cs.PL cs.SE

    NExT: Teaching Large Language Models to Reason about Code Execution

    Authors: Ansong Ni, Miltiadis Allamanis, Arman Cohan, Yinlin Deng, Kensen Shi, Charles Sutton, Pengcheng Yin

    Abstract: A fundamental skill among human developers is the ability to understand and reason about program execution. As an example, a programmer can mentally simulate code execution in natural language to debug and repair code (aka. rubber duck debugging). However, large language models (LLMs) of code are typically trained on the surface textual form of programs, thus may lack a semantic understanding of h… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 35 pages

  5. arXiv:2403.06955  [pdf, other

    cond-mat.mtrl-sci cs.LG

    Accurate Crystal Structure Prediction of New 2D Hybrid Organic Inorganic Perovskites

    Authors: Nima Karimitari, William J. Baldwin, Evan W. Muller, Zachary J. L. Bare, W. Joshua Kennedy, Gábor Csányi, Christopher Sutton

    Abstract: Low dimensional hybrid organic-inorganic perovskites (HOIPs) represent a promising class of electronically active materials for both light absorption and emission. The design space of HOIPs is extremely large, since a diverse space of organic cations can be combined with different inorganic frameworks. This immense design space allows for tunable electronic and mechanical properties, but also nece… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: 14 pages and 9 figures in the main text. Supplementary included in pdf

  6. arXiv:2401.10998  [pdf, other

    cond-mat.mtrl-sci

    Leveraging Domain Adaptation for Accurate Machine Learning Predictions of New Halide Perovskites

    Authors: Dipannoy Das Gupta, Zachary J. L. Bare, Suxuen Yew, Santosh Adhikari, Brian DeCost, Qi Zhang, Charles Musgrave, Christopher Sutton

    Abstract: We combine graph neural networks (GNN) with an inexpensive and reliable structure generation approach based on the bond-valence method (BVM) to train accurate machine learning models for screening 222,960 halide perovskites using statistical estimates of the DFT/PBE formation energy (Ef), and the PBE and HSE band gaps (Eg). The GNNs were fined tuned using domain adaptation (DA) from a source model… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

  7. arXiv:2401.00096  [pdf, other

    physics.chem-ph cond-mat.mtrl-sci

    A foundation model for atomistic materials chemistry

    Authors: Ilyes Batatia, Philipp Benner, Yuan Chiang, Alin M. Elena, Dávid P. Kovács, Janosh Riebesell, Xavier R. Advincula, Mark Asta, Matthew Avaylon, William J. Baldwin, Fabian Berger, Noam Bernstein, Arghya Bhowmik, Samuel M. Blau, Vlad Cărare, James P. Darby, Sandip De, Flaviano Della Pia, Volker L. Deringer, Rokas Elijošius, Zakariya El-Machachi, Fabio Falcioni, Edvin Fako, Andrea C. Ferrari, Annalena Genreith-Schriever , et al. (51 additional authors not shown)

    Abstract: Machine-learned force fields have transformed the atomistic modelling of materials by enabling simulations of ab initio quality on unprecedented time and length scales. However, they are currently limited by: (i) the significant computational and human effort that must go into development and validation of potentials for each particular system of interest; and (ii) a general lack of transferabilit… ▽ More

    Submitted 1 March, 2024; v1 submitted 29 December, 2023; originally announced January 2024.

    Comments: 119 pages, 63 figures, 37MB PDF

  8. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  9. arXiv:2312.02179  [pdf, other

    cs.LG cs.AI cs.CL

    Training Chain-of-Thought via Latent-Variable Inference

    Authors: Du Phan, Matthew D. Hoffman, David Dohan, Sholto Douglas, Tuan Anh Le, Aaron Parisi, Pavel Sountsov, Charles Sutton, Sharad Vikram, Rif A. Saurous

    Abstract: Large language models (LLMs) solve problems more accurately and interpretably when instructed to work out the answer step by step using a ``chain-of-thought'' (CoT) prompt. One can also improve LLMs' performance on a specific task by supervised fine-tuning, i.e., by using gradient ascent on some tunable parameters to maximize the average log-likelihood of correct answers from a labeled training se… ▽ More

    Submitted 28 November, 2023; originally announced December 2023.

    Comments: 23 pages, 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  10. arXiv:2311.17311  [pdf, other

    cs.CL cs.AI

    Universal Self-Consistency for Large Language Model Generation

    Authors: Xinyun Chen, Renat Aksitov, Uri Alon, Jie Ren, Kefan Xiao, Pengcheng Yin, Sushant Prakash, Charles Sutton, Xuezhi Wang, Denny Zhou

    Abstract: Self-consistency with chain-of-thought prompting (CoT) has demonstrated remarkable performance gains on various challenging tasks, by utilizing multiple reasoning paths sampled from large language models (LLMs). However, self-consistency relies on the answer extraction process to aggregate multiple solutions, which is not applicable to free-form answers. In this work, we propose Universal Self-Con… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

  11. arXiv:2307.13883  [pdf, other

    cs.LG cs.PL

    ExeDec: Execution Decomposition for Compositional Generalization in Neural Program Synthesis

    Authors: Kensen Shi, Joey Hong, Yinlin Deng, Pengcheng Yin, Manzil Zaheer, Charles Sutton

    Abstract: When writing programs, people have the ability to tackle a new complex task by decomposing it into smaller and more familiar subtasks. While it is difficult to measure whether neural program synthesis methods have similar capabilities, we can measure whether they compositionally generalize, that is, whether a model that has been trained on the simpler subtasks is subsequently able to solve more co… ▽ More

    Submitted 6 May, 2024; v1 submitted 25 July, 2023; originally announced July 2023.

    Comments: ICLR 2024

  12. arXiv:2307.07609  [pdf, other

    cond-mat.mtrl-sci

    Interpretable machine learning to understand the performance of semi local density functionals for materials thermochemistry

    Authors: Santosh Adhikari, Christopher J. Bartel, Christopher Sutton

    Abstract: This study investigates the use of machine learning (ML) to correct the enthalpy of formation (Hf) from two separate DFT functionals, PBE and SCAN, to the experimental Hf across 1011 solid-state compounds. The ML model uses a set of 25 properties that characterize the electronic structure as calculated using PBE and SCAN. The ML model significantly decreases the error in PBE-calculated Hf values f… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.

  13. arXiv:2306.12272  [pdf, other

    cond-mat.mtrl-sci cs.CE cs.LG math.CO

    From structure mining to unsupervised exploration of atomic octahedral networks

    Authors: R. Patrick Xian, Ryan J. Morelock, Ido Hadar, Charles B. Musgrave, Christopher Sutton

    Abstract: Networks of atom-centered coordination octahedra commonly occur in inorganic and hybrid solid-state materials. Characterizing their spatial arrangements and characteristics is crucial for relating structures to properties for many materials families. The traditional method using case-by-case inspection becomes prohibitive for discovering trends and similarities in large datasets. Here, we operatio… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: 56 pages

  14. arXiv:2306.06545  [pdf, other

    cs.LG stat.ML

    A Probabilistic Framework for Modular Continual Learning

    Authors: Lazar Valkov, Akash Srivastava, Swarat Chaudhuri, Charles Sutton

    Abstract: Modular approaches that use a different composition of modules for each problem are a promising direction in continual learning (CL). However, searching through the large, discrete space of module compositions is challenging, especially because evaluating a composition's performance requires a round of neural network training. We address this challenge through a modular CL framework, PICLE, that u… ▽ More

    Submitted 2 May, 2024; v1 submitted 10 June, 2023; originally announced June 2023.

  15. arXiv:2306.02049  [pdf, other

    cs.LG cs.PL

    LambdaBeam: Neural Program Search with Higher-Order Functions and Lambdas

    Authors: Kensen Shi, Hanjun Dai, Wen-Ding Li, Kevin Ellis, Charles Sutton

    Abstract: Search is an important technique in program synthesis that allows for adaptive strategies such as focusing on particular search directions based on execution results. Several prior works have demonstrated that neural models are effective at guiding program synthesis searches. However, a common drawback of those approaches is the inability to handle iterative loops, higher-order functions, or lambd… ▽ More

    Submitted 28 October, 2023; v1 submitted 3 June, 2023; originally announced June 2023.

  16. arXiv:2306.00970  [pdf, other

    cond-mat.mtrl-sci

    Improving the reliability of machine learned potentials for modeling inhomogenous liquids

    Authors: Kamron Fazel, Nima Karimitari, Tanooj Shah, Christopher Sutton, Ravishankar Sundararaman

    Abstract: The atomic-scale response of inhomogeneous fluids at interfaces and surrounding solute particles plays a critical role in governing chemical, electrochemical and biological processes at such interfaces. Classical molecular dynamics simulations have been applied extensively to simulate the response of inhomogeneous fluids directly, and as inputs to classical density functional theory, but are limit… ▽ More

    Submitted 27 November, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: 8 pages, 4 figures

  17. arXiv:2304.04714  [pdf, other

    cond-mat.mtrl-sci

    Dynamic Local Structure in Caesium Lead Iodide: Spatial Correlation and Transient Domains

    Authors: William Baldwin, Xia Liang, Johan Klarbring, Milos Dubajic, David Dell'Angelo, Christopher Sutton, Claudia Caddeo, Samuel D. Stranks, Alessandro Mattoni, Aron Walsh, Gábor Csányi

    Abstract: Metal halide perovskites are multifunctional semiconductors with tunable structures and properties. They are highly dynamic crystals with complex octahedral tilting patterns and strongly anharmonic atomic behaviour. In the higher temperature, higher symmetry phases of these materials, several complex structural features have been observed. The local structure can differ greatly from the average st… ▽ More

    Submitted 11 April, 2023; v1 submitted 10 April, 2023; originally announced April 2023.

  18. arXiv:2212.09248  [pdf, other

    cs.CL cs.SE

    Natural Language to Code Generation in Interactive Data Science Notebooks

    Authors: Pengcheng Yin, Wen-Ding Li, Kefan Xiao, Abhishek Rao, Yeming Wen, Kensen Shi, Joshua Howland, Paige Bailey, Michele Catasta, Henryk Michalewski, Alex Polozov, Charles Sutton

    Abstract: Computational notebooks, such as Jupyter notebooks, are interactive computing environments that are ubiquitous among data scientists to perform data wrangling and analytic tasks. To measure the performance of AI pair programmers that automatically synthesize programs for those tasks given natural language (NL) intents from users, we build ARCADE, a benchmark of 1082 code generation problems using… ▽ More

    Submitted 19 December, 2022; originally announced December 2022.

    Comments: 46 pages. 32 figures

  19. arXiv:2208.07461  [pdf, other

    cs.LG cs.PL cs.SE

    A Library for Representing Python Programs as Graphs for Machine Learning

    Authors: David Bieber, Kensen Shi, Petros Maniatis, Charles Sutton, Vincent Hellendoorn, Daniel Johnson, Daniel Tarlow

    Abstract: Graph representations of programs are commonly a central element of machine learning for code research. We introduce an open source Python library python_graphs that applies static analysis to construct graph representations of Python programs suitable for training machine learning models. Our library admits the construction of control-flow graphs, data-flow graphs, and composite ``program graphs'… ▽ More

    Submitted 15 August, 2022; originally announced August 2022.

    Comments: 21 pages, 14 figures

  20. arXiv:2207.14405  [pdf, ps, other

    math.DG math.AP math.SP

    Spectral multiplicity and nodal sets for generic torus-invariant metrics

    Authors: Donato Cianci, Chris Judge, Samuel Lin, Craig Sutton

    Abstract: Let a torus $T$ act freely on a closed manifold $M$ of dimension at least two. We demonstrate that, for a generic $T$-invariant Riemannian metric $g$ on $M$, each real $Δ_g$-eigenspace is an irreducible real representation of $T$ and, therefore, has dimension at most two. We also show that, for the generic $T$-invariant metric on $M$, if $u$ is a non-invariant real-valued $Δ_g$-eigenfunction that… ▽ More

    Submitted 28 July, 2022; originally announced July 2022.

    Comments: 18 pages

    MSC Class: 58J50 (Primary) 35P05; 81Q10 (Secondary)

  21. arXiv:2207.10342  [pdf, ps, other

    cs.CL cs.AI

    Language Model Cascades

    Authors: David Dohan, Winnie Xu, Aitor Lewkowycz, Jacob Austin, David Bieber, Raphael Gontijo Lopes, Yuhuai Wu, Henryk Michalewski, Rif A. Saurous, Jascha Sohl-dickstein, Kevin Murphy, Charles Sutton

    Abstract: Prompted models have demonstrated impressive few-shot learning abilities. Repeated interactions at test-time with a single model, or the composition of multiple models together, further expands capabilities. These compositions are probabilistic models, and may be expressed in the language of graphical models with random variables whose values are complex data types such as strings. Cases with cont… ▽ More

    Submitted 28 July, 2022; v1 submitted 21 July, 2022; originally announced July 2022.

    Comments: Presented as spotlight at the Beyond Bases workshop at ICML 2022 (https://beyond-bayes.github.io)

  22. arXiv:2207.08050  [pdf, other

    cs.LG stat.ML

    Repairing Systematic Outliers by Learning Clean Subspaces in VAEs

    Authors: Simao Eduardo, Kai Xu, Alfredo Nazabal, Charles Sutton

    Abstract: Data cleaning often comprises outlier detection and data repair. Systematic errors result from nearly deterministic transformations that occur repeatedly in the data, e.g. specific image pixels being set to default values or watermarks. Consequently, models with enough capacity easily overfit to these errors, making detection and repair difficult. Seeing as a systematic outlier is a combination of… ▽ More

    Submitted 16 July, 2022; originally announced July 2022.

    Comments: Submitted for review in ICLR 2022

  23. arXiv:2204.03758  [pdf, other

    cs.LG cs.PL stat.ML

    Compositional Generalization and Decomposition in Neural Program Synthesis

    Authors: Kensen Shi, Joey Hong, Manzil Zaheer, Pengcheng Yin, Charles Sutton

    Abstract: When writing programs, people have the ability to tackle a new complex task by decomposing it into smaller and more familiar subtasks. While it is difficult to measure whether neural program synthesis methods have similar capabilities, what we can measure is whether they compositionally generalize, that is, whether a model that has been trained on the simpler subtasks is subsequently able to solve… ▽ More

    Submitted 7 April, 2022; originally announced April 2022.

    Comments: Published at the Deep Learning for Code (DL4C) Workshop at ICLR 2022

  24. arXiv:2204.02311  [pdf, other

    cs.CL

    PaLM: Scaling Language Modeling with Pathways

    Authors: Aakanksha Chowdhery, Sharan Narang, Jacob Devlin, Maarten Bosma, Gaurav Mishra, Adam Roberts, Paul Barham, Hyung Won Chung, Charles Sutton, Sebastian Gehrmann, Parker Schuh, Kensen Shi, Sasha Tsvyashchenko, Joshua Maynez, Abhishek Rao, Parker Barnes, Yi Tay, Noam Shazeer, Vinodkumar Prabhakaran, Emily Reif, Nan Du, Ben Hutchinson, Reiner Pope, James Bradbury, Jacob Austin , et al. (42 additional authors not shown)

    Abstract: Large language models have been shown to achieve remarkable performance across a variety of natural language tasks using few-shot learning, which drastically reduces the number of task-specific training examples needed to adapt the model to a particular application. To further our understanding of the impact of scale on few-shot learning, we trained a 540-billion parameter, densely activated, Tran… ▽ More

    Submitted 5 October, 2022; v1 submitted 5 April, 2022; originally announced April 2022.

  25. arXiv:2203.10452  [pdf, other

    cs.LG cs.PL stat.ML

    CrossBeam: Learning to Search in Bottom-Up Program Synthesis

    Authors: Kensen Shi, Hanjun Dai, Kevin Ellis, Charles Sutton

    Abstract: Many approaches to program synthesis perform a search within an enormous space of programs to find one that satisfies a given specification. Prior works have used neural models to guide combinatorial search algorithms, but such approaches still explore a huge portion of the search space and quickly become intractable as the size of the desired program increases. To tame the search space blowup, we… ▽ More

    Submitted 20 March, 2022; originally announced March 2022.

    Comments: Published at ICLR 2022

  26. arXiv:2112.00114  [pdf, other

    cs.LG cs.NE

    Show Your Work: Scratchpads for Intermediate Computation with Language Models

    Authors: Maxwell Nye, Anders Johan Andreassen, Guy Gur-Ari, Henryk Michalewski, Jacob Austin, David Bieber, David Dohan, Aitor Lewkowycz, Maarten Bosma, David Luan, Charles Sutton, Augustus Odena

    Abstract: Large pre-trained language models perform remarkably well on tasks that can be done "in one pass", such as generating realistic text or synthesizing computer programs. However, they struggle with tasks that require unbounded multi-step computation, such as adding integers or executing programs. Surprisingly, we find that these same models are able to perform complex multi-step computations -- even… ▽ More

    Submitted 30 November, 2021; originally announced December 2021.

  27. arXiv:2108.07732  [pdf, other

    cs.PL cs.LG

    Program Synthesis with Large Language Models

    Authors: Jacob Austin, Augustus Odena, Maxwell Nye, Maarten Bosma, Henryk Michalewski, David Dohan, Ellen Jiang, Carrie Cai, Michael Terry, Quoc Le, Charles Sutton

    Abstract: This paper explores the limits of the current generation of large language models for program synthesis in general purpose programming languages. We evaluate a collection of such models (with between 244M and 137B parameters) on two new benchmarks, MBPP and MathQA-Python, in both the few-shot and fine-tuning regimes. Our benchmarks are designed to measure the ability of these models to synthesize… ▽ More

    Submitted 15 August, 2021; originally announced August 2021.

    Comments: Jacob and Augustus contributed equally

  28. arXiv:2106.15339  [pdf, other

    cs.SE cs.LG cs.PL

    SpreadsheetCoder: Formula Prediction from Semi-structured Context

    Authors: Xinyun Chen, Petros Maniatis, Rishabh Singh, Charles Sutton, Hanjun Dai, Max Lin, Denny Zhou

    Abstract: Spreadsheet formula prediction has been an important program synthesis problem with many real-world applications. Previous works typically utilize input-output examples as the specification for spreadsheet formula synthesis, where each input-output pair simulates a separate row in the spreadsheet. However, this formulation does not fully capture the rich context in real-world spreadsheets. First,… ▽ More

    Submitted 26 June, 2021; originally announced June 2021.

    Comments: Published in ICML 2021

  29. arXiv:2104.05134  [pdf, other

    stat.ME stat.CO

    Couplings for Multinomial Hamiltonian Monte Carlo

    Authors: Kai Xu, Tor Erlend Fjelde, Charles Sutton, Hong Ge

    Abstract: Hamiltonian Monte Carlo (HMC) is a popular sampling method in Bayesian inference. Recently, Heng & Jacob (2019) studied Metropolis HMC with couplings for unbiased Monte Carlo estimation, establishing a generic parallelizable scheme for HMC. However, in practice a different HMC method, multinomial HMC, is considered as the go-to method, e.g. as part of the no-U-turn sampler. In multinomial HMC, pro… ▽ More

    Submitted 11 April, 2021; originally announced April 2021.

    Comments: Published in AISTATS 2021

  30. Measurement of the distribution of $^{207}$Bi depositions on calibration sources for SuperNEMO

    Authors: R. Arnold, C. Augier, A. S. Barabash, A. Basharina-Freshville, E. Birdsall, S. Blondel, M. Bongrand, D. Boursette, R. Breier, V. Brudanin, J. Busto, S. Calvez, C. Cerna, J. P. Cesar, M. Ceschia, A. Chapon, E. Chauveau, A. Chopra, L. Dawson, S. De Capua, D. Duchesneau, D. Durand, G. Eurin, J. J. Evans, D. Filosofov , et al. (75 additional authors not shown)

    Abstract: The SuperNEMO experiment will search for neutrinoless double-beta decay ($0νββ$), and study the Standard-Model double-beta decay process ($2νββ$). The SuperNEMO technology can measure the energy of each of the electrons produced in a double-beta ($ββ$) decay, and can reconstruct the topology of their individual tracks. The study of the double-beta decay spectrum requires very accurate energy calib… ▽ More

    Submitted 20 May, 2021; v1 submitted 26 March, 2021; originally announced March 2021.

    Comments: 16 pages, 12 figures, submitted to JINST, response to reviewer comments

  31. arXiv:2012.00377  [pdf, other

    cs.LG cs.AI

    Latent Programmer: Discrete Latent Codes for Program Synthesis

    Authors: Joey Hong, David Dohan, Rishabh Singh, Charles Sutton, Manzil Zaheer

    Abstract: In many sequence learning tasks, such as program synthesis and document summarization, a key problem is searching over a large space of possible output sequences. We propose to learn representations of the outputs that are specifically meant for search: rich enough to specify the desired output but compact enough to make search more efficient. Discrete latent codes are appealing for this purpose,… ▽ More

    Submitted 5 August, 2021; v1 submitted 1 December, 2020; originally announced December 2020.

    Comments: ICML 2021; 15 pages, 9 figures

  32. arXiv:2011.07657  [pdf, other

    nucl-ex hep-ex

    Search for Periodic Modulations of the Rate of Double-Beta Decay of $^{100}$Mo in the NEMO-3 Detector

    Authors: NEMO-3 Collaboration, :, R. Arnold, C. Augier, A. S. Barabash, A. Basharina-Freshville, S. Blondel, S. Blot, M. Bongrand, D. Boursette, R. Breier, V. Brudanin, J. Busto, A. J. Caffrey, S. Calvez, C. Cerna, J. P. Cesar, M. Ceschia, A. Chapon, E. Chauveau, A. Chopra, L. Dawson, D. Duchesneau, D. Durand, G. Eurin , et al. (84 additional authors not shown)

    Abstract: Double-beta decays of $^{100}$Mo from the 6.0195-year exposure of a 6.914 kg high-purity sample were recorded by the NEMO-3 experiment that searched for neutrinoless double-beta decays. These ultra-rare transitions to $^{100}$Ru have a half-life of approximately $7\times10^{18}$ years, and have been used to conduct the first ever search for periodic variations of this decay mode. The Lomb-Scargle… ▽ More

    Submitted 15 November, 2020; originally announced November 2020.

  33. arXiv:2011.05363  [pdf, other

    cs.LG

    Learning Discrete Energy-based Models via Auxiliary-variable Local Exploration

    Authors: Hanjun Dai, Rishabh Singh, Bo Dai, Charles Sutton, Dale Schuurmans

    Abstract: Discrete structures play an important role in applications like program language modeling and software engineering. Current approaches to predicting complex structures typically consider autoregressive models for their tractability, with some sacrifice in flexibility. Energy-based models (EBMs) on the other hand offer a more flexible and thus more powerful approach to modeling such distributions,… ▽ More

    Submitted 10 November, 2020; originally announced November 2020.

    Comments: NeurIPS 2020

  34. arXiv:2010.12621  [pdf, other

    cs.LG

    Learning to Execute Programs with Instruction Pointer Attention Graph Neural Networks

    Authors: David Bieber, Charles Sutton, Hugo Larochelle, Daniel Tarlow

    Abstract: Graph neural networks (GNNs) have emerged as a powerful tool for learning software engineering tasks including code completion, bug finding, and program repair. They benefit from leveraging program structure like control flow graphs, but they are not well-suited to tasks like program execution that require far more sequential reasoning steps than number of GNN propagation steps. Recurrent neural n… ▽ More

    Submitted 23 October, 2020; originally announced October 2020.

    Comments: Accepted at NeurIPS 2020

  35. arXiv:2010.11887  [pdf, other

    cs.PL cs.LG stat.ML

    Conditional independence by typing

    Authors: Maria I. Gorinova, Andrew D. Gordon, Charles Sutton, Matthijs Vákár

    Abstract: A central goal of probabilistic programming languages (PPLs) is to separate modelling from inference. However, this goal is hard to achieve in practice. Users are often forced to re-write their models in order to improve efficiency of inference or meet restrictions imposed by the PPL. Conditional independence (CI) relationships among parameters are a crucial aspect of probabilistic models that cap… ▽ More

    Submitted 18 February, 2022; v1 submitted 22 October, 2020; originally announced October 2020.

    Journal ref: ACM Transactions on Programming Languages and Systems, Volume 44, Issue 1, March 2022, Article No 4, pp 1-54

  36. Investigating the ranges of (meta)stable phase formation in (InxGa1-x)2O3: Impact of the cation coordination

    Authors: C. Wouters, C. Sutton, L. M. Ghiringhelli, T. Markurt, R. Schewski, A. Hassa, H. von Wenckstern, M. Grundmann, M. Scheffler, M. Albrecht

    Abstract: We investigate the phase diagram of the heterostructural solid solution (InxGa1-x)2O3 both computationally, by combining cluster expansion and density functional theory, and experimentally, by means of TEM measurements of pulsed laser deposited (PLD) heteroepitaxial thin films. The shapes of the Gibbs free energy curves for the monoclinic, hexagonal and cubic bixbyite alloy as a function of compos… ▽ More

    Submitted 11 August, 2020; originally announced August 2020.

    Comments: 16 pages, 7 figures

    Journal ref: Phys. Rev. Materials 4, 125001 (2020)

  37. arXiv:2007.14381  [pdf, other

    cs.PL cs.LG stat.ML

    BUSTLE: Bottom-Up Program Synthesis Through Learning-Guided Exploration

    Authors: Augustus Odena, Kensen Shi, David Bieber, Rishabh Singh, Charles Sutton, Hanjun Dai

    Abstract: Program synthesis is challenging largely because of the difficulty of search in a large space of programs. Human programmers routinely tackle the task of writing complex programs by writing sub-programs and then analyzing their intermediate results to compose them in appropriate ways. Motivated by this intuition, we present a new synthesis approach that leverages learning to guide a bottom-up sear… ▽ More

    Submitted 30 September, 2021; v1 submitted 28 July, 2020; originally announced July 2020.

  38. arXiv:2006.10924  [pdf, other

    stat.ML cs.LG

    Neural Program Synthesis with a Differentiable Fixer

    Authors: Matej Balog, Rishabh Singh, Petros Maniatis, Charles Sutton

    Abstract: We present a new program synthesis approach that combines an encoder-decoder based synthesis architecture with a differentiable program fixer. Our approach is inspired from the fact that human developers seldom get their program correct on the first attempt, and perform iterative testing-based program fixing to get to the desired program functionality. Similarly, our approach first learns a distri… ▽ More

    Submitted 18 June, 2020; originally announced June 2020.

  39. arXiv:2004.13214  [pdf, ps, other

    cs.SE cs.LG

    SCELMo: Source Code Embeddings from Language Models

    Authors: Rafael - Michael Karampatsis, Charles Sutton

    Abstract: Continuous embeddings of tokens in computer programs have been used to support a variety of software development tools, including readability, code search, and program repair. Contextual embeddings are common in natural language processing but have not been previously applied in software engineering. We introduce a new set of deep contextualized word representations for computer programs based on… ▽ More

    Submitted 27 April, 2020; originally announced April 2020.

    Comments: 12 pages

  40. arXiv:2004.00348  [pdf, other

    cs.PL cs.LG

    OptTyper: Probabilistic Type Inference by Optimising Logical and Natural Constraints

    Authors: Irene Vlassi Pandi, Earl T. Barr, Andrew D. Gordon, Charles Sutton

    Abstract: We present a new approach to the type inference problem for dynamic languages. Our goal is to combine \emph{logical} constraints, that is, deterministic information from a type system, with \emph{natural} constraints, that is, uncertain statistical information about types learnt from sources like identifier names. To this end, we introduce a framework for probabilistic type inference that combines… ▽ More

    Submitted 26 March, 2021; v1 submitted 1 April, 2020; originally announced April 2020.

    Comments: 29 pages, 5 figures, 2 tables

  41. Big Code != Big Vocabulary: Open-Vocabulary Models for Source Code

    Authors: Rafael-Michael Karampatsis, Hlib Babii, Romain Robbes, Charles Sutton, Andrea Janes

    Abstract: Statistical language modeling techniques have successfully been applied to large source code corpora, yielding a variety of new software development tools, such as tools for code suggestion, improving readability, and API migration. A major issue with these techniques is that code introduces new vocabulary at a far higher rate than natural language, as new identifier names proliferate. Both large… ▽ More

    Submitted 17 March, 2020; originally announced March 2020.

    Comments: 13 pages; to appear in Proceedings of ICSE 2020

  42. arXiv:2003.04227  [pdf, other

    cs.LG cs.AI

    Towards Modular Algorithm Induction

    Authors: Daniel A. Abolafia, Rishabh Singh, Manzil Zaheer, Charles Sutton

    Abstract: We present a modular neural network architecture Main that learns algorithms given a set of input-output examples. Main consists of a neural controller that interacts with a variable-length input tape and learns to compose modules together with their corresponding argument choices. Unlike previous approaches, Main uses a general domain-agnostic mechanism for selection of modules and their argument… ▽ More

    Submitted 27 February, 2020; originally announced March 2020.

    Comments: 10 pages, 4 figures, 2 tables

  43. arXiv:2002.09067  [pdf, other

    cs.LG cs.DS stat.ML

    Incremental Sampling Without Replacement for Sequence Models

    Authors: Kensen Shi, David Bieber, Charles Sutton

    Abstract: Sampling is a fundamental technique, and sampling without replacement is often desirable when duplicate samples are not beneficial. Within machine learning, sampling is useful for generating diverse outputs from a trained model. We present an elegant procedure for sampling without replacement from a broad class of randomized programs, including generative neural models that construct outputs seque… ▽ More

    Submitted 19 July, 2021; v1 submitted 20 February, 2020; originally announced February 2020.

  44. arXiv:2002.09030  [pdf, other

    cs.PL cs.LG

    Learning to Represent Programs with Property Signatures

    Authors: Augustus Odena, Charles Sutton

    Abstract: We introduce the notion of property signatures, a representation for programs and program specifications meant for consumption by machine learning algorithms. Given a function with input type $τ_{in}$ and output type $τ_{out}$, a property is a function of type: $(τ_{in}, τ_{out}) \rightarrow \texttt{Bool}$ that (informally) describes some simple property of the function under consideration. For in… ▽ More

    Submitted 12 February, 2020; originally announced February 2020.

    Comments: ICLR 2020

  45. arXiv:2001.06388  [pdf, other

    physics.ins-det hep-ex

    Search for the double-beta decay of 82Se to the excited states of 82Kr with NEMO-3

    Authors: The NEMO-3 collaboration R. Arnold, C. Augier, A. S. Barabash, A. Basharina-Freshville, S. Blondel, S. Blot, M. Bongrand, D. Boursette, R. Breier, V. Brudanin, J. Busto, A. J. Caffrey, S. Calvez, M. Cascella, C. Cerna, J. P. Cesar, A. Chapon, E. Chauveau, A. Chopra, L. Dawson, D. Duchesneau, D. Durand, V. Egorov, G. Eurin, J. J. Evans , et al. (82 additional authors not shown)

    Abstract: The double-beta decay of 82Se to the 0+1 excited state of 82Kr has been studied with the NEMO-3 detector using 0.93 kg of enriched 82Se measured for 4.75 y, corresponding to an exposure of 4.42 kg y. A dedicated analysis to reconstruct the gamma-rays has been performed to search for events in the 2e2g channel. No evidence of a 2nbb decay to the 0+1 state has been observed and a limit of T2n 1/2(82… ▽ More

    Submitted 17 January, 2020; originally announced January 2020.

    Journal ref: Nuclear Physics A Volume 996, April 2020, 121701

  46. arXiv:1911.01205  [pdf, other

    cs.LG cs.AI cs.SE stat.ML

    Learning to Fix Build Errors with Graph2Diff Neural Networks

    Authors: Daniel Tarlow, Subhodeep Moitra, Andrew Rice, Zimin Chen, Pierre-Antoine Manzagol, Charles Sutton, Edward Aftandilian

    Abstract: Professional software developers spend a significant amount of time fixing builds, but this has received little attention as a problem in automatic program repair. We present a new deep learning architecture, called Graph2Diff, for automatically localizing and fixing build errors. We represent source code, build configuration files, and compiler diagnostic messages as a graph, and then use a Graph… ▽ More

    Submitted 4 November, 2019; originally announced November 2019.

    Comments: Submitted for review on Aug 23, 2019

  47. arXiv:1910.14118  [pdf, ps, other

    math.DG math.SP

    Geometric structures and the Laplace spectrum, part II

    Authors: Samuel Lin, Benjamin Schmidt, Craig Sutton

    Abstract: We continue our exploration of the extent to which the spectrum encodes the local geometry of a locally homogeneous three-manifold and find that if $(M,g)$ and $(N,h)$ are a pair of locally homogeneous, locally non-isometric isospectral three-manifolds, where $M$ is an elliptic three-manifold, then $(1)$ $N$ is also an elliptic three-manifold, $(2)$ $M$ and $N$ have fundamental groups of different… ▽ More

    Submitted 30 October, 2019; originally announced October 2019.

    Comments: 42 pages, 1 Figure

    MSC Class: 53C20; 58J50

  48. Effects of green revolution led agricultural expansion on net ecosystem service values in India

    Authors: Srikanta Sannigrahi, Suman Chakraborti, Pawan Kumar Joshi, Saskia Keesstra, P. S. Roy, Paul. C. Sutton, Urs Kreuter, Saikat Kumar Paul, Somnath Sen, Sandeep Bhatt, Shahid Rahmat, Shouvik Jha, Qi Zhang, Laishram Kanta Singh

    Abstract: Ecosystem Services are a bundle of natural processes and functions that are essential for human well-being, subsistence, and livelihood. The expansion of cultivation and cropland, which is the backbone of the Indian economy, is one of the main drivers of rapid Land Use Land Cover changes in India. To assess the impact of the Green Revolution led agrarian expansion on the total ecosystem service va… ▽ More

    Submitted 15 November, 2020; v1 submitted 24 September, 2019; originally announced September 2019.

    Report number: Volume 277, 111381

    Journal ref: Journal of Environmental Management, 2020

  49. arXiv:1907.06671  [pdf, other

    cs.LG stat.ML

    Robust Variational Autoencoders for Outlier Detection and Repair of Mixed-Type Data

    Authors: Simão Eduardo, Alfredo Nazábal, Christopher K. I. Williams, Charles Sutton

    Abstract: We focus on the problem of unsupervised cell outlier detection and repair in mixed-type tabular data. Traditional methods are concerned only with detecting which rows in the dataset are outliers. However, identifying which cells are corrupted in a specific row is an important problem in practice, and the very first step towards repairing them. We introduce the Robust Variational Autoencoder (RVAE)… ▽ More

    Submitted 3 March, 2020; v1 submitted 15 July, 2019; originally announced July 2019.

    Comments: Accepted for publication at AISTATS 2020

  50. arXiv:1906.00781  [pdf, other

    cs.DB cs.IR cs.LG

    Learning Semantic Annotations for Tabular Data

    Authors: Jiaoyan Chen, Ernesto Jimenez-Ruiz, Ian Horrocks, Charles Sutton

    Abstract: The usefulness of tabular data such as web tables critically depends on understanding their semantics. This study focuses on column type prediction for tables without any meta data. Unlike traditional lexical matching-based methods, we propose a deep prediction model that can fully exploit a table's contextual semantics, including table locality features learned by a Hybrid Neural Network (HNN), a… ▽ More

    Submitted 30 May, 2019; originally announced June 2019.

    Comments: 7 pages

    Journal ref: IJCAI 2019