Showing 1–13 of 13 results for author: Brevdo, E

  1. arXiv:2311.10800  [pdf, other]

    cs.PL cs.LG cs.PF

    The Next 700 ML-Enabled Compiler Optimizations

    Authors: S. VenkataKeerthy, Siddharth Jain, Umesh Kalvakuntla, Pranav Sai Gorantla, Rajiv Shailesh Chitale, Eugene Brevdo, Albert Cohen, Mircea Trofin, Ramakrishna Upadrasta

    Abstract: There is a growing interest in enhancing compiler optimizations with ML models, yet interactions between compilers and ML frameworks remain challenging. Some optimizations require tightly coupled models and compiler internals, raising issues with modularity, performance and framework independence. Practical deployment and transparency for the end-user are also important concerns. We propose ML-Comp…

    Submitted 17 November, 2023; originally announced November 2023.

  2. arXiv:2306.06798  [pdf, other]

    cs.DB cs.LG

    Kepler: Robust Learning for Faster Parametric Query Optimization

    Authors: Lyric Doshi, Vincent Zhuang, Gaurav Jain, Ryan Marcus, Haoyu Huang, Deniz Altinbüken, Eugene Brevdo, Campbell Fraser

    Abstract: Most existing parametric query optimization (PQO) techniques rely on traditional query optimizer cost models, which are often inaccurate and result in suboptimal query performance. We propose Kepler, an end-to-end learning-based approach to PQO that demonstrates significant speedups in query latency over a traditional query optimizer. Central to our method is Row Count Evolution (RCE), a novel pla…

    Submitted 18 October, 2023; v1 submitted 11 June, 2023; originally announced June 2023.

    Comments: SIGMOD 2023

  3. arXiv:2112.04041  [pdf, other]

    cs.LG cs.AR

    A Transferable Approach for Partitioning Machine Learning Models on Multi-Chip-Modules

    Authors: Xinfeng Xie, Prakash Prabhu, Ulysse Beaugnon, Phitchaya Mangpo Phothilimthana, Sudip Roy, Azalia Mirhoseini, Eugene Brevdo, James Laudon, Yanqi Zhou

    Abstract: Multi-Chip-Modules (MCMs) reduce the design and fabrication cost of machine learning (ML) accelerators while delivering performance and energy efficiency on par with a monolithic large chip. However, ML compilers targeting MCMs need to solve complex optimization problems optimally and efficiently to achieve this high performance. One such problem is the multi-chip partitioning problem where compil…

    Submitted 7 December, 2021; originally announced December 2021.

  4. arXiv:2106.02229  [pdf, other]

    cs.LG cs.AI cs.CV

    Differentiable Architecture Search for Reinforcement Learning

    Authors: Yingjie Miao, Xingyou Song, John D. Co-Reyes, Daiyi Peng, Summer Yue, Eugene Brevdo, Aleksandra Faust

    Abstract: In this paper, we investigate the fundamental question: To what extent are gradient-based neural architecture search (NAS) techniques applicable to RL? Using the original DARTS as a convenient baseline, we discover that the discrete architectures found can achieve up to 250% performance compared to manual architecture designs on both discrete and continuous action space environments across off-pol…

    Submitted 15 November, 2022; v1 submitted 3 June, 2021; originally announced June 2021.

    Comments: Published as a conference paper at the first Automated Machine Learning Conference (AutoML-Conf) 2022. Code can be found at https://github.com/google/brain_autorl/tree/main/rl_darts

  5. arXiv:2102.04736  [pdf, other]

    cs.LG cs.AI cs.DC

    Reverb: A Framework For Experience Replay

    Authors: Albin Cassirer, Gabriel Barth-Maron, Eugene Brevdo, Sabela Ramos, Toby Boyd, Thibault Sottiaux, Manuel Kroiss

    Abstract: A central component of training in Reinforcement Learning (RL) is Experience: the data used for training. The mechanisms used to generate and consume this data have an important effect on the performance of RL algorithms. In this paper, we introduce Reverb: an efficient, extensible, and easy to use system designed specifically for experience replay in RL. Reverb is designed to work efficiently i…

    Submitted 9 February, 2021; originally announced February 2021.

    Comments: 11 pages

  6. arXiv:2101.04808  [pdf, other]

    cs.PL cs.LG

    MLGO: a Machine Learning Guided Compiler Optimizations Framework

    Authors: Mircea Trofin, Yundi Qian, Eugene Brevdo, Zinan Lin, Krzysztof Choromanski, David Li

    Abstract: Leveraging machine-learning (ML) techniques for compiler optimizations has been widely studied and explored in academia. However, the adoption of ML in general-purpose, industry strength compilers has yet to happen. We propose MLGO, a framework for integrating ML techniques systematically in an industrial compiler -- LLVM. As a case study, we present the details and results of replacing the heuris…

    Submitted 12 January, 2021; originally announced January 2021.

    Comments: First two authors are equal contributors

  7. arXiv:1807.01675  [pdf, other]

    cs.LG cs.AI stat.ML

    Sample-Efficient Reinforcement Learning with Stochastic Ensemble Value Expansion

    Authors: Jacob Buckman, Danijar Hafner, George Tucker, Eugene Brevdo, Honglak Lee

    Abstract: Integrating model-free and model-based approaches in reinforcement learning has the potential to achieve the high performance of model-free algorithms with low sample complexity. However, this is difficult because an imperfect dynamics model can degrade the performance of the learning algorithm, and in sufficiently complex environments, the dynamics model will almost always be imperfect. As a resu…

    Submitted 7 June, 2019; v1 submitted 4 July, 2018; originally announced July 2018.

    Journal ref: Advances in Neural Information Processing Systems, 2019 (pp. 8224-8234)

  8. Dynamic Control Flow in Large-Scale Machine Learning

    Authors: Yuan Yu, Martín Abadi, Paul Barham, Eugene Brevdo, Mike Burrows, Andy Davis, Jeff Dean, Sanjay Ghemawat, Tim Harley, Peter Hawkins, Michael Isard, Manjunath Kudlur, Rajat Monga, Derek Murray, Xiaoqiang Zheng

    Abstract: Many recent machine learning models rely on fine-grained dynamic control flow for training and inference. In particular, models based on recurrent neural networks and on reinforcement learning depend on recurrence relations, data-dependent conditional execution, and other features that call for dynamic control flow. These applications benefit from the ability to make rapid control-flow decisions a…

    Submitted 4 May, 2018; originally announced May 2018.

    Comments: Appeared in EuroSys 2018. 14 pages, 16 figures

    Journal ref: EuroSys 2018: Thirteenth EuroSys Conference, April 23-26, 2018, Porto, Portugal. ACM, New York, NY, USA

  9. arXiv:1803.07416  [pdf, other]

    cs.LG cs.CL stat.ML

    Tensor2Tensor for Neural Machine Translation

    Authors: Ashish Vaswani, Samy Bengio, Eugene Brevdo, Francois Chollet, Aidan N. Gomez, Stephan Gouws, Llion Jones, Łukasz Kaiser, Nal Kalchbrenner, Niki Parmar, Ryan Sepassi, Noam Shazeer, Jakob Uszkoreit

    Abstract: Tensor2Tensor is a library for deep learning models that is well-suited for neural machine translation and includes the reference implementation of the state-of-the-art Transformer model.

    Submitted 16 March, 2018; originally announced March 2018.

    Comments: arXiv admin note: text overlap with arXiv:1706.03762

  10. arXiv:1711.10604  [pdf, ps, other]

    cs.LG cs.AI cs.PL stat.ML

    TensorFlow Distributions

    Authors: Joshua V. Dillon, Ian Langmore, Dustin Tran, Eugene Brevdo, Srinivas Vasudevan, Dave Moore, Brian Patton, Alex Alemi, Matt Hoffman, Rif A. Saurous

    Abstract: The TensorFlow Distributions library implements a vision of probability theory adapted to the modern deep-learning paradigm of end-to-end differentiable computation. Building on two basic abstractions, it offers flexible building blocks for probabilistic computation. Distributions provide fast, numerically stable methods for generating samples and computing statistics, e.g., log density. Bijectors…

    Submitted 28 November, 2017; originally announced November 2017.

  11. arXiv:1701.03757  [pdf, ps, other]

    stat.ML cs.AI cs.LG cs.PL stat.CO

    Deep Probabilistic Programming

    Authors: Dustin Tran, Matthew D. Hoffman, Rif A. Saurous, Eugene Brevdo, Kevin Murphy, David M. Blei

    Abstract: We propose Edward, a Turing-complete probabilistic programming language. Edward defines two compositional representations---random variables and inference. By treating inference as a first class citizen, on a par with modeling, we show that probabilistic programming can be as flexible and computationally efficient as traditional deep learning. For flexibility, Edward makes it easy to fit the same…

    Submitted 7 March, 2017; v1 submitted 13 January, 2017; originally announced January 2017.

    Comments: Appears in International Conference on Learning Representations, 2017. A companion webpage for this paper is available at http://edwardlib.org/iclr2017

  12. arXiv:1603.04467  [pdf, other]

    cs.DC cs.LG

    TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems

    Authors: Martín Abadi, Ashish Agarwal, Paul Barham, Eugene Brevdo, Zhifeng Chen, Craig Citro, Greg S. Corrado, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Ian Goodfellow, Andrew Harp, Geoffrey Irving, Michael Isard, Yangqing Jia, Rafal Jozefowicz, Lukasz Kaiser, Manjunath Kudlur, Josh Levenberg, Dan Mane, Rajat Monga, Sherry Moore, Derek Murray, Chris Olah, et al. (15 additional authors not shown)

    Abstract: TensorFlow is an interface for expressing machine learning algorithms, and an implementation for executing such algorithms. A computation expressed using TensorFlow can be executed with little or no change on a wide variety of heterogeneous systems, ranging from mobile devices such as phones and tablets up to large-scale distributed systems of hundreds of machines and thousands of computational de…

    Submitted 16 March, 2016; v1 submitted 14 March, 2016; originally announced March 2016.

    Comments: Version 2 updates only the metadata, to correct the formatting of Martín Abadi's name

  13. arXiv:1105.0010  [pdf, other]

    math.CA cs.CE math.NA physics.data-an

    The Synchrosqueezing algorithm for time-varying spectral analysis: robustness properties and new paleoclimate applications

    Authors: Gaurav Thakur, Eugene Brevdo, Neven S. Fučkar, Hau-Tieng Wu

    Abstract: We analyze the stability properties of the Synchrosqueezing transform, a time-frequency signal analysis method that can identify and extract oscillatory components with time-varying frequency and amplitude. We show that Synchrosqueezing is robust to bounded perturbations of the signal and to Gaussian white noise. These results justify its applicability to noisy or nonuniformly sampled data that is…

    Submitted 4 December, 2012; v1 submitted 29 April, 2011; originally announced May 2011.

    Comments: to appear in Signal Processing

    MSC Class: 42C40; 65T99; 62M15; 86A04

    Journal ref: Signal Processing 93:1079-1094, 2013