Skip to main content

Showing 1–50 of 54 results for author: Wright, S

  1. arXiv:2407.09693  [pdf, other

    cs.LG cs.AI

    A Mathematical Framework, a Taxonomy of Modeling Paradigms, and a Suite of Learning Techniques for Neural-Symbolic Systems

    Authors: Charles Dickens, Connor Pryor, Changyu Gao, Alon Albalak, Eriq Augustine, William Wang, Stephen Wright, Lise Getoor

    Abstract: The field of Neural-Symbolic (NeSy) systems is growing rapidly. Proposed approaches show great promise in achieving symbiotic unions of neural and symbolic methods. However, each NeSy system differs in fundamental ways. There is a pressing need for a unifying theory to illuminate the commonalities and differences in approaches and enable further progress. In this paper, we introduce Neural-Symboli… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

  2. arXiv:2407.09690  [pdf, other

    cs.LG cs.CR math.OC

    Private Heterogeneous Federated Learning Without a Trusted Server Revisited: Error-Optimal and Communication-Efficient Algorithms for Convex Losses

    Authors: Changyu Gao, Andrew Lowy, Xingyu Zhou, Stephen J. Wright

    Abstract: We revisit the problem of federated learning (FL) with private data from people who do not trust the server or other silos/clients. In this context, every silo (e.g. hospital) has data from several people (e.g. patients) and needs to protect the privacy of each person's data (e.g. health records), even if the server and/or other silos try to uncover this data. Inter-Silo Record-Level Differential… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: The 41st International Conference on Machine Learning (ICML 2024)

  3. arXiv:2407.04667  [pdf, other

    stat.ME cs.LG

    The diameter of a stochastic matrix: A new measure for sensitivity analysis in Bayesian networks

    Authors: Manuele Leonelli, Jim Q. Smith, Sophia K. Wright

    Abstract: Bayesian networks are one of the most widely used classes of probabilistic models for risk management and decision support because of their interpretability and flexibility in including heterogeneous pieces of information. In any applied modelling, it is critical to assess how robust the inferences on certain target variables are to changes in the model. In Bayesian networks, these analyses fall u… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  4. arXiv:2405.16969  [pdf, other

    cs.CL stat.AP

    The Multi-Range Theory of Translation Quality Measurement: MQM scoring models and Statistical Quality Control

    Authors: Arle Lommel, Serge Gladkoff, Alan Melby, Sue Ellen Wright, Ingemar Strandvik, Katerina Gasova, Angelika Vaasa, Andy Benzo, Romina Marazzato Sparano, Monica Foresi, Johani Innis, Lifeng Han, Goran Nenadic

    Abstract: The year 2024 marks the 10th anniversary of the Multidimensional Quality Metrics (MQM) framework for analytic translation quality evaluation. The MQM error typology has been widely used by practitioners in the translation and localization industry and has served as the basis for many derivative projects. The annual Conference on Machine Translation (WMT) shared tasks on both human and automatic tr… ▽ More

    Submitted 9 June, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

    Comments: working paper, 20 pages, under-review

  5. arXiv:2403.10547  [pdf, ps, other

    math.OC cs.AI cs.DS cs.LG

    Robust Second-Order Nonconvex Optimization and Its Application to Low Rank Matrix Sensing

    Authors: Shuyao Li, Yu Cheng, Ilias Diakonikolas, Jelena Diakonikolas, Rong Ge, Stephen J. Wright

    Abstract: Finding an approximate second-order stationary point (SOSP) is a well-studied and fundamental problem in stochastic nonconvex optimization with many applications in machine learning. However, this problem is poorly understood in the presence of outliers, limiting the use of existing nonconvex algorithms in adversarial settings. In this paper, we study the problem of finding SOSPs in the strong c… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  6. arXiv:2402.11173  [pdf, other

    cs.LG cs.CR math.OC

    How to Make the Gradients Small Privately: Improved Rates for Differentially Private Non-Convex Optimization

    Authors: Andrew Lowy, Jonathan Ullman, Stephen J. Wright

    Abstract: We provide a simple and flexible framework for designing differentially private algorithms to find approximate stationary points of non-convex loss functions. Our framework is based on using a private approximate risk minimizer to "warm start" another private algorithm for finding stationary points. We use this framework to obtain improved, and sometimes optimal, rates for several classes of non-c… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  7. arXiv:2402.05071  [pdf, other

    math.OC cs.LG stat.ML

    Extending the Reach of First-Order Algorithms for Nonconvex Min-Max Problems with Cohypomonotonicity

    Authors: Ahmet Alacaoglu, Donghwan Kim, Stephen J. Wright

    Abstract: We focus on constrained, $L$-smooth, nonconvex-nonconcave min-max problems either satisfying $ρ$-cohypomonotonicity or admitting a solution to the $ρ$-weakly Minty Variational Inequality (MVI), where larger values of the parameter $ρ>0$ correspond to a greater degree of nonconvexity. These problem classes include examples in two player reinforcement learning, interaction dominant min-max problems,… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  8. arXiv:2401.09651  [pdf, other

    cs.LG cs.AI math.OC

    Convex and Bilevel Optimization for Neuro-Symbolic Inference and Learning

    Authors: Charles Dickens, Changyu Gao, Connor Pryor, Stephen Wright, Lise Getoor

    Abstract: We leverage convex and bilevel optimization techniques to develop a general gradient-based parameter learning framework for neural-symbolic (NeSy) systems. We demonstrate our framework with NeuPSL, a state-of-the-art NeSy architecture. To achieve this, we propose a smooth primal and dual formulation of NeuPSL inference and show learning gradients are functions of the optimal dual variables. Additi… ▽ More

    Submitted 3 June, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

  9. arXiv:2311.15399  [pdf, other

    cs.LG cs.AI

    Optimally Teaching a Linear Behavior Cloning Agent

    Authors: Shubham Kumar Bharti, Stephen Wright, Adish Singla, Xiaojin Zhu

    Abstract: We study optimal teaching of Linear Behavior Cloning (LBC) learners. In this setup, the teacher can select which states to demonstrate to an LBC learner. The learner maintains a version space of infinite linear hypotheses consistent with the demonstration. The goal of the teacher is to teach a realizable target policy to the learner using minimum number of state demonstrations. This number is know… ▽ More

    Submitted 26 November, 2023; originally announced November 2023.

  10. arXiv:2311.00678  [pdf, other

    math.OC cs.LG stat.ML

    Complexity of Single Loop Algorithms for Nonlinear Programming with Stochastic Objective and Constraints

    Authors: Ahmet Alacaoglu, Stephen J. Wright

    Abstract: We analyze the complexity of single-loop quadratic penalty and augmented Lagrangian algorithms for solving nonconvex optimization problems with functional equality constraints. We consider three cases, in all of which the objective is stochastic and smooth, that is, an expectation over an unknown distribution that is accessed by sampling. The nature of the equality constraints differs among the th… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

  11. arXiv:2310.18841  [pdf, ps, other

    math.OC cs.LG

    A randomized algorithm for nonconvex minimization with inexact evaluations and complexity guarantees

    Authors: Shuyao Li, Stephen J. Wright

    Abstract: We consider minimization of a smooth nonconvex function with inexact oracle access to gradient and Hessian (without assuming access to the function value) to achieve approximate second-order optimality. A novel feature of our method is that if an approximate direction of negative curvature is chosen as the step, we choose its sense to be positive or negative with equal probability. We allow gradie… ▽ More

    Submitted 26 March, 2024; v1 submitted 28 October, 2023; originally announced October 2023.

  12. arXiv:2310.04006  [pdf, other

    math.OC cs.LG

    Accelerating optimization over the space of probability measures

    Authors: Shi Chen, Qin Li, Oliver Tse, Stephen J. Wright

    Abstract: The acceleration of gradient-based optimization methods is a subject of significant practical and theoretical importance, particularly within machine learning applications. While much attention has been directed towards optimizing within Euclidean space, the need to optimize over spaces of probability measures in machine learning motivates exploration of accelerated gradient methods in this contex… ▽ More

    Submitted 18 June, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

  13. arXiv:2309.01753  [pdf, other

    math.OC cs.LG

    On Penalty Methods for Nonconvex Bilevel Optimization and First-Order Stochastic Approximation

    Authors: Jeongyeol Kwon, Dohyun Kwon, Stephen Wright, Robert Nowak

    Abstract: In this work, we study first-order algorithms for solving Bilevel Optimization (BO) where the objective functions are smooth but possibly nonconvex in both levels and the variables are restricted to closed convex sets. As a first step, we study the landscape of BO through the lens of penalty methods, in which the upper- and lower-level objectives are combined in a weighted sum with penalty paramet… ▽ More

    Submitted 11 February, 2024; v1 submitted 4 September, 2023; originally announced September 2023.

    Comments: ICLR 2024

  14. arXiv:2308.08732  [pdf, ps, other

    eess.IV cond-mat.mtrl-sci cs.CV

    Recursive Detection and Analysis of Nanoparticles in Scanning Electron Microscopy Images

    Authors: Aidan S. Wright, Nathaniel P. Youmans, Enrique F. Valderrama Araya

    Abstract: In this study, we present a computational framework tailored for the precise detection and comprehensive analysis of nanoparticles within scanning electron microscopy (SEM) images. The primary objective of this framework revolves around the accurate localization of nanoparticle coordinates, accompanied by secondary objectives encompassing the extraction of pertinent morphological attributes includ… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

    Comments: 9 pages, 10 figures

    ACM Class: I.4.7

  15. arXiv:2306.02192  [pdf, other

    cs.LG math.NA

    Correcting auto-differentiation in neural-ODE training

    Authors: Yewei Xu, Shi Chen, Qin Li, Stephen J. Wright

    Abstract: Does the use of auto-differentiation yield reasonable updates to deep neural networks that represent neural ODEs? Through mathematical analysis and numerical evidence, we find that when the neural network employs high-order forms to approximate the underlying ODE flows (such as the Linear Multistep Method (LMM)), brute-force computation using auto-differentiation often produces non-converging arti… ▽ More

    Submitted 3 June, 2023; originally announced June 2023.

  16. arXiv:2302.04972  [pdf, ps, other

    cs.LG cs.CR math.OC stat.ML

    Differentially Private Optimization for Smooth Nonconvex ERM

    Authors: Changyu Gao, Stephen J. Wright

    Abstract: We develop simple differentially private optimization algorithms that move along directions of (expected) descent to find an approximate second-order solution for nonconvex ERM. We use line search, mini-batching, and a two-phase strategy to improve the speed and practicality of the algorithm. Numerical experiments demonstrate the effectiveness of these approaches.

    Submitted 9 June, 2023; v1 submitted 9 February, 2023; originally announced February 2023.

  17. arXiv:2302.03952  [pdf, other

    cs.LG stat.ML

    Cut your Losses with Squentropy

    Authors: Like Hui, Mikhail Belkin, Stephen Wright

    Abstract: Nearly all practical neural models for classification are trained using cross-entropy loss. Yet this ubiquitous choice is supported by little theoretical or empirical evidence. Recent work (Hui & Belkin, 2020) suggests that training using the (rescaled) square loss is often superior in terms of the classification accuracy. In this paper we propose the "squentropy" loss, which is the sum of two ter… ▽ More

    Submitted 8 February, 2023; originally announced February 2023.

    Comments: 18 pages, 16 figures, 6 tables

  18. arXiv:2301.10945  [pdf, other

    math.OC cs.AI cs.LG

    A Fully First-Order Method for Stochastic Bilevel Optimization

    Authors: Jeongyeol Kwon, Dohyun Kwon, Stephen Wright, Robert Nowak

    Abstract: We consider stochastic unconstrained bilevel optimization problems when only the first-order gradient oracles are available. While numerous optimization methods have been proposed for tackling bilevel problems, existing methods either tend to require possibly expensive calculations regarding Hessians of lower-level objectives, or lack rigorous finite-time performance guarantees. In this work, we p… ▽ More

    Submitted 26 January, 2023; originally announced January 2023.

  19. arXiv:2301.07831  [pdf, other

    math.NA cs.MS stat.CO

    Multi-output multilevel best linear unbiased estimators via semidefinite programming

    Authors: M. Croci, K. E. Willcox, S. J. Wright

    Abstract: Multifidelity forward uncertainty quantification (UQ) problems often involve multiple quantities of interest and heterogeneous models (e.g., different grids, equations, dimensions, physics, surrogate and reduced-order models). While computational efficiency is key in this context, multi-output strategies in multilevel/multifidelity methods are either sub-optimal or non-existent. In this paper we e… ▽ More

    Submitted 15 May, 2023; v1 submitted 18 January, 2023; originally announced January 2023.

    Comments: 22 pages, 5 figures, 3 tables

  20. arXiv:2212.05088  [pdf, other

    math.OC cs.LG

    Cyclic Block Coordinate Descent With Variance Reduction for Composite Nonconvex Optimization

    Authors: Xufeng Cai, Chaobing Song, Stephen J. Wright, Jelena Diakonikolas

    Abstract: Nonconvex optimization is central in solving many machine learning problems, in which block-wise structure is commonly encountered. In this work, we propose cyclic block coordinate methods for nonconvex optimization problems with non-asymptotic gradient norm guarantees. Our convergence analysis is based on a gradient Lipschitz condition with respect to a Mahalanobis norm, inspired by a recent prog… ▽ More

    Submitted 27 January, 2023; v1 submitted 9 December, 2022; originally announced December 2022.

  21. arXiv:2209.08709  [pdf, other

    cs.LG cs.AI math.OC

    BOME! Bilevel Optimization Made Easy: A Simple First-Order Approach

    Authors: Mao Ye, Bo Liu, Stephen Wright, Peter Stone, Qiang Liu

    Abstract: Bilevel optimization (BO) is useful for solving a variety of important machine learning problems including but not limited to hyperparameter optimization, meta-learning, continual learning, and reinforcement learning. Conventional BO methods need to differentiate through the low-level optimization process with implicit differentiation, which requires expensive calculations related to the Hessian m… ▽ More

    Submitted 18 September, 2022; originally announced September 2022.

  22. arXiv:2201.07684  [pdf, other

    math.OC cs.LG stat.ML

    On the Complexity of a Practical Primal-Dual Coordinate Method

    Authors: Ahmet Alacaoglu, Volkan Cevher, Stephen J. Wright

    Abstract: We prove complexity bounds for the primal-dual algorithm with random extrapolation and coordinate descent (PURE-CD), which has been shown to obtain good practical performance for solving convex-concave min-max problems with bilinear coupling. Our complexity bounds either match or improve the best-known results in the literature for both dense and sparse (strongly)-convex-(strongly)-concave problem… ▽ More

    Submitted 19 January, 2022; originally announced January 2022.

  23. arXiv:2111.01842  [pdf, other

    math.OC cs.LG

    Coordinate Linear Variance Reduction for Generalized Linear Programming

    Authors: Chaobing Song, Cheuk Yin Lin, Stephen J. Wright, Jelena Diakonikolas

    Abstract: We study a class of generalized linear programs (GLP) in a large-scale setting, which includes simple, possibly nonsmooth convex regularizer and simple convex set constraints. By reformulating (GLP) as an equivalent convex-concave min-max problem, we show that the linear structure in the problem can be used to design an efficient, scalable first-order algorithm, to which we give the name \emph{Coo… ▽ More

    Submitted 6 April, 2023; v1 submitted 2 November, 2021; originally announced November 2021.

    Comments: 39 pages, NeurIPS 2022

  24. arXiv:2110.02926  [pdf, ps, other

    cs.LG math.NA stat.ML

    On the Global Convergence of Gradient Descent for multi-layer ResNets in the mean-field regime

    Authors: Zhiyan Ding, Shi Chen, Qin Li, Stephen Wright

    Abstract: Finding the optimal configuration of parameters in ResNet is a nonconvex minimization problem, but first-order methods nevertheless find the global optimum in the overparameterized regime. We study this phenomenon with mean-field analysis, by translating the training process of ResNet to a gradient-flow partial differential equation (PDE) and examining the convergence properties of this limiting p… ▽ More

    Submitted 28 November, 2021; v1 submitted 6 October, 2021; originally announced October 2021.

    Comments: arXiv admin note: text overlap with arXiv:2105.14417

  25. arXiv:2105.14417  [pdf, ps, other

    cs.LG math.NA stat.ML

    Overparameterization of deep ResNet: zero loss and mean-field analysis

    Authors: Zhiyan Ding, Shi Chen, Qin Li, Stephen Wright

    Abstract: Finding parameters in a deep neural network (NN) that fit training data is a nonconvex optimization problem, but a basic first-order optimization method (gradient descent) finds a global optimizer with perfect fit (zero-loss) in many practical situations. We examine this phenomenon for the case of Residual Neural Networks (ResNet) with smooth activation functions in a limiting regime in which both… ▽ More

    Submitted 9 November, 2021; v1 submitted 29 May, 2021; originally announced May 2021.

  26. arXiv:2104.11079  [pdf, other

    cs.AI cs.CE

    Randomized Algorithms for Scientific Computing (RASC)

    Authors: Aydin Buluc, Tamara G. Kolda, Stefan M. Wild, Mihai Anitescu, Anthony DeGennaro, John Jakeman, Chandrika Kamath, Ramakrishnan Kannan, Miles E. Lopes, Per-Gunnar Martinsson, Kary Myers, Jelani Nelson, Juan M. Restrepo, C. Seshadhri, Draguna Vrabie, Brendt Wohlberg, Stephen J. Wright, Chao Yang, Peter Zwart

    Abstract: Randomized algorithms have propelled advances in artificial intelligence and represent a foundational research area in advancing AI for Science. Future advancements in DOE Office of Science priority areas such as climate science, astrophysics, fusion, advanced materials, combustion, and quantum computing all require randomized algorithms for surmounting challenges of complexity, robustness, and sc… ▽ More

    Submitted 21 March, 2022; v1 submitted 19 April, 2021; originally announced April 2021.

  27. arXiv:2102.13643  [pdf, other

    math.OC cs.LG math.NA

    Variance Reduction via Primal-Dual Accelerated Dual Averaging for Nonsmooth Convex Finite-Sums

    Authors: Chaobing Song, Stephen J. Wright, Jelena Diakonikolas

    Abstract: We study structured nonsmooth convex finite-sum optimization that appears widely in machine learning applications, including support vector machines and least absolute deviation. For the primal-dual formulation of this problem, we propose a novel algorithm called \emph{Variance Reduction via Primal-Dual Accelerated Dual Averaging (\vrpda)}. In the nonsmooth and general convex setting, \vrpda~has t… ▽ More

    Submitted 7 April, 2021; v1 submitted 26 February, 2021; originally announced February 2021.

    Comments: 33 pages, 18 figures

  28. arXiv:2010.11366  [pdf, ps, other

    stat.ML cs.LG

    Random Coordinate Underdamped Langevin Monte Carlo

    Authors: Zhiyan Ding, Qin Li, Jianfeng Lu, Stephen J. Wright

    Abstract: The Underdamped Langevin Monte Carlo (ULMC) is a popular Markov chain Monte Carlo sampling method. It requires the computation of the full gradient of the log-density at each iteration, an expensive operation if the dimension of the problem is high. We propose a sampling method called Random Coordinate ULMC (RC-ULMC), which selects a single coordinate at each iteration to be updated and leaves the… ▽ More

    Submitted 21 October, 2020; originally announced October 2020.

  29. arXiv:2010.01405  [pdf, ps, other

    stat.ML cs.LG

    Random Coordinate Langevin Monte Carlo

    Authors: Zhiyan Ding, Qin Li, Jianfeng Lu, Stephen J. Wright

    Abstract: Langevin Monte Carlo (LMC) is a popular Markov chain Monte Carlo sampling method. One drawback is that it requires the computation of the full gradient at each iteration, an expensive operation if the dimension of the problem is high. We propose a new sampling method: Random Coordinate LMC (RC-LMC). At each iteration, a single coordinate is randomly selected to be updated by a multiple of the part… ▽ More

    Submitted 3 October, 2020; originally announced October 2020.

  30. arXiv:2005.13815  [pdf, ps, other

    cs.LG math.OC stat.ML

    Adversarial Classification via Distributional Robustness with Wasserstein Ambiguity

    Authors: Nam Ho-Nguyen, Stephen J. Wright

    Abstract: We study a model for adversarial classification based on distributionally robust chance constraints. We show that under Wasserstein ambiguity, the model aims to minimize the conditional value-at-risk of the distance to misclassification, and we explore links to adversarial classification models proposed earlier and to maximum-margin classifiers. We also provide a reformulation of the distributiona… ▽ More

    Submitted 3 November, 2021; v1 submitted 28 May, 2020; originally announced May 2020.

    Comments: 32 pages

  31. arXiv:2005.06083  [pdf, other

    cs.LG stat.ML

    Stochastic Learning for Sparse Discrete Markov Random Fields with Controlled Gradient Approximation Error

    Authors: Sinong Geng, Zhaobin Kuang, Jie Liu, Stephen Wright, David Page

    Abstract: We study the $L_1$-regularized maximum likelihood estimator/estimation (MLE) problem for discrete Markov random fields (MRFs), where efficient and scalable learning requires both sparse regularization and approximate inference. To address these challenges, we consider a stochastic learning framework called stochastic proximal gradient (SPG; Honorio 2012a, Atchade et al. 2014,Miasojedow and Rejchel… ▽ More

    Submitted 12 May, 2020; originally announced May 2020.

  32. arXiv:1912.08756  [pdf, other

    cs.LG cs.IR stat.ML

    Interleaved Composite Quantization for High-Dimensional Similarity Search

    Authors: Soroosh Khoram, Stephen J Wright, Jing Li

    Abstract: Similarity search retrieves the nearest neighbors of a query vector from a dataset of high-dimensional vectors. As the size of the dataset grows, the cost of performing the distance computations needed to implement a query can become prohibitive. A method often used to reduce this computational cost is quantization of the vector space and location-based encoding of the dataset vectors. These encod… ▽ More

    Submitted 18 December, 2019; v1 submitted 18 December, 2019; originally announced December 2019.

  33. arXiv:1912.06508  [pdf, other

    cs.LG math.OC stat.ML

    A Distributed Quasi-Newton Algorithm for Primal and Dual Regularized Empirical Risk Minimization

    Authors: Ching-pei Lee, Cong Han Lim, Stephen J. Wright

    Abstract: We propose a communication- and computation-efficient distributed optimization algorithm using second-order information for solving empirical risk minimization (ERM) problems with a nonsmooth regularization term. Our algorithm is applicable to both the primal and the dual ERM problem. Current second-order and quasi-Newton methods for this problem either do not work well in the distributed setting… ▽ More

    Submitted 12 December, 2019; originally announced December 2019.

    Comments: arXiv admin note: text overlap with arXiv:1803.01370

  34. arXiv:1905.09209  [pdf, other

    cs.LG math.OC stat.ML

    Convergence and Margin of Adversarial Training on Separable Data

    Authors: Zachary Charles, Shashank Rajput, Stephen Wright, Dimitris Papailiopoulos

    Abstract: Adversarial training is a technique for training robust machine learning models. To encourage robustness, it iteratively computes adversarial examples for the model, and then re-trains on these examples via some update rule. This work analyzes the performance of adversarial training on linearly separable data, and provides bounds on the number of iterations required for large margin. We show that… ▽ More

    Submitted 22 May, 2019; originally announced May 2019.

  35. arXiv:1901.02470  [pdf, other

    cs.LG stat.ML

    Bilinear Bandits with Low-rank Structure

    Authors: Kwang-Sung Jun, Rebecca Willett, Stephen Wright, Robert Nowak

    Abstract: We introduce the bilinear bandit problem with low-rank structure in which an action takes the form of a pair of arms from two different entity types, and the reward is a bilinear function of the known feature vectors of the arms. The unknown in the problem is a $d_1$ by $d_2$ matrix $\mathbfΘ^*$ that defines the reward, and has low rank $r \ll \min\{d_1,d_2\}$. Determination of $\mathbfΘ^*$ with t… ▽ More

    Submitted 9 June, 2019; v1 submitted 8 January, 2019; originally announced January 2019.

    Comments: Accepted to ICML'19

  36. arXiv:1809.07678  [pdf, other

    physics.soc-ph cs.SI

    Modeling a Double-Spending Detection System for the Bitcoin Network

    Authors: Marco Alberto Javarone, Craig Steven Wright

    Abstract: The Bitcoin protocol prevents the occurrence of double-spending (DS), i.e. the utilization of the same currency unit more than once. At the same time a DS attack, where more conflicting transactions are generated, might be performed to defraud a user, e.g. a merchant. Therefore, in this work, we propose a model for detecting the presence of conflicting transactions by means of an 'oracle' that pol… ▽ More

    Submitted 20 September, 2018; originally announced September 2018.

    Comments: 14 pages, 4 figures, 1 table

  37. arXiv:1806.05154  [pdf, other

    cs.CV

    Automated Performance Assessment in Transoesophageal Echocardiography with Convolutional Neural Networks

    Authors: Evangelos B. Mazomenos, Kamakshi Bansal, Bruce Martin, Andrew Smith, Susan Wright, Danail Stoyanov

    Abstract: Transoesophageal echocardiography (TEE) is a valuable diagnostic and monitoring imaging modality. Proper image acquisition is essential for diagnosis, yet current assessment techniques are solely based on manual expert review. This paper presents a supervised deep learn ing framework for automatically evaluating and grading the quality of TEE images. To obtain the necessary dataset, 38 participant… ▽ More

    Submitted 13 June, 2018; originally announced June 2018.

    Comments: to be presented in MICCAI 2018, Granada, Spain, 16-20 Sep 2018

  38. arXiv:1806.04090  [pdf, other

    stat.ML cs.DC cs.LG

    ATOMO: Communication-efficient Learning via Atomic Sparsification

    Authors: Hongyi Wang, Scott Sievert, Zachary Charles, Shengchao Liu, Stephen Wright, Dimitris Papailiopoulos

    Abstract: Distributed model training suffers from communication overheads due to frequent gradient updates transmitted between compute nodes. To mitigate these overheads, several studies propose the use of sparsified stochastic gradients. We argue that these are facets of a general sparsification method that can operate on any possible atomic decomposition. Notable examples include element-wise, singular va… ▽ More

    Submitted 8 November, 2018; v1 submitted 11 June, 2018; originally announced June 2018.

  39. arXiv:1806.03677  [pdf, other

    math.OC cs.LG stat.ML

    Dissipativity Theory for Accelerating Stochastic Variance Reduction: A Unified Analysis of SVRG and Katyusha Using Semidefinite Programs

    Authors: Bin Hu, Stephen Wright, Laurent Lessard

    Abstract: Techniques for reducing the variance of gradient estimates used in stochastic programming algorithms for convex finite-sum problems have received a great deal of attention in recent years. By leveraging dissipativity theory from control, we provide a new perspective on two important variance-reduction algorithms: SVRG and its direct accelerated variant Katyusha. Our perspective provides a physical… ▽ More

    Submitted 10 June, 2018; originally announced June 2018.

    Comments: to appear at ICML 2018

  40. arXiv:1805.07311  [pdf, ps, other

    math.OC cs.CC cs.LG

    Blended Conditional Gradients: the unconditioning of conditional gradients

    Authors: Gábor Braun, Sebastian Pokutta, Dan Tu, Stephen Wright

    Abstract: We present a blended conditional gradient approach for minimizing a smooth convex function over a polytope P, combining the Frank--Wolfe algorithm (also called conditional gradient) with gradient-based steps, different from away steps and pairwise steps, but still achieving linear convergence for strongly convex functions, along with good practical performance. Our approach retains all favorable p… ▽ More

    Submitted 31 May, 2019; v1 submitted 18 May, 2018; originally announced May 2018.

    Comments: 33 pages + 12 figures

    MSC Class: 68Q32; 90C52

  41. arXiv:1804.02350  [pdf, other

    physics.soc-ph cs.SI q-fin.ST

    From Bitcoin to Bitcoin Cash: a network analysis

    Authors: Marco Alberto Javarone, Craig Steven Wright

    Abstract: Bitcoins and Blockchain technologies are attracting the attention of different scientific communities. In addition, their widespread industrial applications and the continuous introduction of cryptocurrencies are also stimulating the attention of the public opinion. The underlying structure of these technologies constitutes one of their core concepts. In particular, they are based on peer-to-peer… ▽ More

    Submitted 23 July, 2018; v1 submitted 6 April, 2018; originally announced April 2018.

    Comments: CryBlock 18, June 15, 2018, Munich, Germany 5 pages, 3 figures, 1 table, ACM Proceeding CryBlock'18 1st Workshop on Cryptocurrencies and Blockchains for Distributed Systems 2018

  42. arXiv:1803.01370  [pdf, other

    math.OC cs.LG stat.ML

    A Distributed Quasi-Newton Algorithm for Empirical Risk Minimization with Nonsmooth Regularization

    Authors: Ching-pei Lee, Cong Han Lim, Stephen J. Wright

    Abstract: We propose a communication- and computation-efficient distributed optimization algorithm using second-order information for solving ERM problems with a nonsmooth regularization term. Current second-order and quasi-Newton methods for this problem either do not work well in the distributed setting or work only for specific regularizers. Our algorithm uses successive quadratic approximations, and we… ▽ More

    Submitted 26 May, 2018; v1 submitted 4 March, 2018; originally announced March 2018.

    Comments: In the proceedings of The 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

  43. arXiv:1801.08019  [pdf, other

    cs.LG stat.ML

    Training Set Debugging Using Trusted Items

    Authors: Xuezhou Zhang, Xiaojin Zhu, Stephen J. Wright

    Abstract: Training set bugs are flaws in the data that adversely affect machine learning. The training set is usually too large for man- ual inspection, but one may have the resources to verify a few trusted items. The set of trusted items may not by itself be adequate for learning, so we propose an algorithm that uses these items to identify bugs in the training set and thus im- proves learning. Specifical… ▽ More

    Submitted 24 January, 2018; originally announced January 2018.

    Comments: AAAI 2018

  44. arXiv:1711.02545  [pdf, other

    stat.ML cs.LG

    Online Learning for Changing Environments using Coin Betting

    Authors: Kwang-Sung Jun, Francesco Orabona, Stephen Wright, Rebecca Willett

    Abstract: A key challenge in online learning is that classical algorithms can be slow to adapt to changing environments. Recent studies have proposed "meta" algorithms that convert any online learning algorithm to one that is adaptive to changing environments, where the adaptivity is analyzed in a quantity called the strongly-adaptive regret. This paper describes a new meta algorithm that has a strongly-ada… ▽ More

    Submitted 6 November, 2017; originally announced November 2017.

    Comments: submitted to a journal. arXiv admin note: substantial text overlap with arXiv:1610.04578

  45. arXiv:1710.05916  [pdf, other

    cs.CE

    Using Neural Networks to Detect Line Outages from PMU Data

    Authors: Ching-pei Lee, Stephen J. Wright

    Abstract: We propose an approach based on neural networks and the AC power flow equations to identify single- and double-line outages in a power grid using the information from phasor measurement unit sensors (PMUs) placed on only a subset of the buses. Rather than inferring the outage from the sensor data by inverting the physical model, our approach uses the AC model to simulate sensor responses to all ou… ▽ More

    Submitted 27 March, 2018; v1 submitted 16 October, 2017; originally announced October 2017.

  46. arXiv:1610.04578  [pdf, other

    stat.ML cs.LG

    Improved Strongly Adaptive Online Learning using Coin Betting

    Authors: Kwang-Sung Jun, Francesco Orabona, Rebecca Willett, Stephen Wright

    Abstract: This paper describes a new parameter-free online learning algorithm for changing environments. In comparing against algorithms with the same time complexity as ours, we obtain a strongly adaptive regret bound that is a factor of at least $\sqrt{\log(T)}$ better, where $T$ is the time horizon. Empirical results show that our algorithm outperforms state-of-the-art methods in learning with expert adv… ▽ More

    Submitted 7 August, 2017; v1 submitted 14 October, 2016; originally announced October 2016.

    Comments: fixed a few typos

  47. arXiv:1404.5692  [pdf, other

    cs.DS cs.LG math.OC stat.ML

    Forward - Backward Greedy Algorithms for Atomic Norm Regularization

    Authors: Nikhil Rao, Parikshit Shah, Stephen Wright

    Abstract: In many signal processing applications, the aim is to reconstruct a signal that has a simple representation with respect to a certain basis or frame. Fundamental elements of the basis known as "atoms" allow us to define "atomic norms" that can be used to formulate convex regularizations for the reconstruction problem. Efficient algorithms are available to solve these formulations in certain specia… ▽ More

    Submitted 22 July, 2015; v1 submitted 22 April, 2014; originally announced April 2014.

    Comments: To appear in IEEE Transactions on Signal Processing

  48. arXiv:1309.6964  [pdf, other

    cs.CV

    Online Algorithms for Factorization-Based Structure from Motion

    Authors: Ryan Kennedy, Laura Balzano, Stephen J. Wright, Camillo J. Taylor

    Abstract: We present a family of online algorithms for real-time factorization-based structure from motion, leveraging a relationship between incremental singular value decomposition and recently proposed methods for online matrix completion. Our methods are orders of magnitude faster than previous state of the art, can handle missing data and a variable number of feature points, and are robust to noise and… ▽ More

    Submitted 16 July, 2016; v1 submitted 26 September, 2013; originally announced September 2013.

  49. arXiv:1307.5494  [pdf, other

    math.NA cs.LG stat.ML

    On GROUSE and Incremental SVD

    Authors: Laura Balzano, Stephen J. Wright

    Abstract: GROUSE (Grassmannian Rank-One Update Subspace Estimation) is an incremental algorithm for identifying a subspace of Rn from a sequence of vectors in this subspace, where only a subset of components of each vector is revealed at each iteration. Recent analysis has shown that GROUSE converges locally at an expected linear rate, under certain assumptions. GROUSE has a similar flavor to the incrementa… ▽ More

    Submitted 20 July, 2013; originally announced July 2013.

  50. arXiv:1207.0577  [pdf, ps, other

    stat.ML cs.LG

    Robust Dequantized Compressive Sensing

    Authors: Ji Liu, Stephen J. Wright

    Abstract: We consider the reconstruction problem in compressed sensing in which the observations are recorded in a finite number of bits. They may thus contain quantization errors (from being rounded to the nearest representable value) and saturation errors (from being outside the range of representable values). Our formulation has an objective of weighted $\ell_2$-$\ell_1$ type, along with constraints that… ▽ More

    Submitted 10 October, 2013; v1 submitted 3 July, 2012; originally announced July 2012.