Skip to main content

Showing 1–50 of 50 results for author: Brown, G

  1. arXiv:2406.04058  [pdf, ps, other

    cs.HC

    Watching Popular Musicians Learn by Ear: A Hypothesis-Generating Study of Human-Recording Interactions in YouTube Videos

    Authors: Christopher Liscio, Daniel G. Brown

    Abstract: Popular musicians often learn music by ear. It is unclear what role technology plays for those with experience at this task. In search of opportunities for the development of novel human-recording interactions, we analyze 18 YouTube videos depicting real-world examples of by-ear learning, and discuss why, during this preliminary phase of research, online videos are appropriate data. From our obser… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  2. arXiv:2406.01855  [pdf, other

    cs.CL cs.AI

    TruthEval: A Dataset to Evaluate LLM Truthfulness and Reliability

    Authors: Aisha Khatun, Daniel G. Brown

    Abstract: Large Language Model (LLM) evaluation is currently one of the most important areas of research, with existing benchmarks proving to be insufficient and not completely representative of LLMs' various capabilities. We present a curated collection of challenging statements on sensitive topics for LLM benchmarking called TruthEval. These statements were curated by hand and contain known truth values.… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  3. arXiv:2404.15409  [pdf, ps, other

    cs.LG cs.CR stat.ML

    Insufficient Statistics Perturbation: Stable Estimators for Private Least Squares

    Authors: Gavin Brown, Jonathan Hayase, Samuel Hopkins, Weihao Kong, Xiyang Liu, Sewoong Oh, Juan C. Perdomo, Adam Smith

    Abstract: We present a sample- and time-efficient differentially private algorithm for ordinary least squares, with error that depends linearly on the dimension and is independent of the condition number of $X^\top X$, where $X$ is the design matrix. All prior private algorithms for this task require either $d^{3/2}$ examples, error growing polynomially with the condition number, or exponential time. Our ne… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 42 pages, 3 figures

  4. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  5. arXiv:2402.13531  [pdf, other

    cs.LG cs.CR

    Private Gradient Descent for Linear Regression: Tighter Error Bounds and Instance-Specific Uncertainty Estimation

    Authors: Gavin Brown, Krishnamurthy Dvijotham, Georgina Evans, Daogao Liu, Adam Smith, Abhradeep Thakurta

    Abstract: We provide an improved analysis of standard differentially private gradient descent for linear regression under the squared error loss. Under modest assumptions on the input, we characterize the distribution of the iterate at each time step. Our analysis leads to new results on the algorithm's accuracy: for a proper fixed choice of hyperparameters, the sample complexity depends only linearly on… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: 22 pages, 11 figures

  6. arXiv:2401.07955  [pdf, other

    cs.CL cs.AI cs.LG

    A Study on Large Language Models' Limitations in Multiple-Choice Question Answering

    Authors: Aisha Khatun, Daniel G. Brown

    Abstract: The widespread adoption of Large Language Models (LLMs) has become commonplace, particularly with the emergence of open-source models. More importantly, smaller models are well-suited for integration into consumer devices and are frequently employed either as standalone solutions or as subroutines in various AI tasks. Despite their ubiquitous use, there is no systematic analysis of their specific… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

  7. arXiv:2312.13978  [pdf, other

    cs.LG cs.DS

    Metalearning with Very Few Samples Per Task

    Authors: Maryam Aliakbarpour, Konstantina Bairaktari, Gavin Brown, Adam Smith, Nathan Srebro, Jonathan Ullman

    Abstract: Metalearning and multitask learning are two frameworks for solving a group of related learning tasks more efficiently than we could hope to solve each of the individual tasks on their own. In multitask learning, we are given a fixed set of related learning tasks and need to output one accurate model per task, whereas in metalearning we are given tasks that are drawn i.i.d. from a metadistribution… ▽ More

    Submitted 1 April, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

  8. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  9. arXiv:2310.12100  [pdf, other

    cs.CL cs.AI cs.CV cs.LG cs.MM

    Non-Intrusive Adaptation: Input-Centric Parameter-efficient Fine-Tuning for Versatile Multimodal Modeling

    Authors: Yaqing Wang, Jialin Wu, Tanmaya Dabral, Jiageng Zhang, Geoff Brown, Chun-Ta Lu, Frederick Liu, Yi Liang, Bo Pang, Michael Bendersky, Radu Soricut

    Abstract: Large language models (LLMs) and vision language models (VLMs) demonstrate excellent performance on a wide range of tasks by scaling up parameter counts from O(10^9) to O(10^{12}) levels and further beyond. These large scales make it impossible to adapt and deploy fully specialized models given a task of interest. Parameter-efficient fine-tuning (PEFT) emerges as a promising direction to tackle th… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

  10. arXiv:2307.09602  [pdf, other

    cs.LG cs.AI

    A max-affine spline approximation of neural networks using the Legendre transform of a convex-concave representation

    Authors: Adam Perrett, Danny Wood, Gavin Brown

    Abstract: This work presents a novel algorithm for transforming a neural network into a spline representation. Unlike previous work that required convex and piecewise-affine network operators to create a max-affine spline alternate form, this work relaxes this constraint. The only constraint is that the function be bounded and possess a well-define second derivative, although this was shown experimentally t… ▽ More

    Submitted 16 July, 2023; originally announced July 2023.

  11. arXiv:2306.06199  [pdf, other

    cs.CL cs.LG

    Reliability Check: An Analysis of GPT-3's Response to Sensitive Topics and Prompt Wording

    Authors: Aisha Khatun, Daniel G. Brown

    Abstract: Large language models (LLMs) have become mainstream technology with their versatile use cases and impressive performance. Despite the countless out-of-the-box applications, LLMs are still not reliable. A lot of work is being done to improve the factual accuracy, consistency, and ethical standards of these models through fine-tuning, prompting, and Reinforcement Learning with Human Feedback (RLHF),… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: Accepted in TrustNLP: Third Workshop on Trustworthy Natural Language Processing, co-located with ACL 2023

  12. arXiv:2305.05432  [pdf, other

    cs.CL cs.CV

    WikiWeb2M: A Page-Level Multimodal Wikipedia Dataset

    Authors: Andrea Burns, Krishna Srinivasan, Joshua Ainslie, Geoff Brown, Bryan A. Plummer, Kate Saenko, Jianmo Ni, Mandy Guo

    Abstract: Webpages have been a rich resource for language and vision-language tasks. Yet only pieces of webpages are kept: image-caption pairs, long text articles, or raw HTML, never all in one place. Webpage tasks have resultingly received little attention and structured image-text data underused. To study multimodal webpage understanding, we introduce the Wikipedia Webpage 2M (WikiWeb2M) suite; the first… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: Accepted at the WikiWorkshop 2023. Data is readily available at https://github.com/google-research-datasets/wit/blob/main/wikiweb2m.md. arXiv admin note: text overlap with arXiv:2305.03668

  13. arXiv:2305.03668  [pdf, other

    cs.CL cs.CV

    A Suite of Generative Tasks for Multi-Level Multimodal Webpage Understanding

    Authors: Andrea Burns, Krishna Srinivasan, Joshua Ainslie, Geoff Brown, Bryan A. Plummer, Kate Saenko, Jianmo Ni, Mandy Guo

    Abstract: Webpages have been a rich, scalable resource for vision-language and language only tasks. Yet only pieces of webpages are kept in existing datasets: image-caption pairs, long text articles, or raw HTML, never all in one place. Webpage tasks have resultingly received little attention and structured image-text data left underused. To study multimodal webpage understanding, we introduce the Wikipedia… ▽ More

    Submitted 20 October, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

    Comments: Accepted in EMNLP 2023, revision contains camera ready edits. Data can be downloaded at https://github.com/google-research-datasets/wit/blob/main/wikiweb2m.md

  14. arXiv:2303.00031  [pdf, other

    cs.AR cs.LG

    Tiny Classifier Circuits: Evolving Accelerators for Tabular Data

    Authors: Konstantinos Iordanou, Timothy Atkinson, Emre Ozer, Jedrzej Kufel, John Biggs, Gavin Brown, Mikel Lujan

    Abstract: A typical machine learning (ML) development cycle for edge computing is to maximise the performance during model training and then minimise the memory/area footprint of the trained model for deployment on edge devices targeting CPUs, GPUs, microcontrollers, or custom hardware accelerators. This paper proposes a methodology for automatically generating predictor circuits for classification of tabul… ▽ More

    Submitted 28 September, 2023; v1 submitted 28 February, 2023; originally announced March 2023.

    Comments: 14 pages, 16 figures

    ACM Class: B.5.1; C.3

  15. arXiv:2301.12250  [pdf, other

    cs.LG

    Fast, Sample-Efficient, Affine-Invariant Private Mean and Covariance Estimation for Subgaussian Distributions

    Authors: Gavin Brown, Samuel B. Hopkins, Adam Smith

    Abstract: We present a fast, differentially private algorithm for high-dimensional covariance-aware mean estimation with nearly optimal sample complexity. Only exponential-time estimators were previously known to achieve this guarantee. Given $n$ samples from a (sub-)Gaussian distribution with unknown mean $μ$ and covariance $Σ$, our $(\varepsilon,δ)$-differentially private estimator produces $\tildeμ$ such… ▽ More

    Submitted 25 April, 2023; v1 submitted 28 January, 2023; originally announced January 2023.

    Comments: 44 pages. New version fixes typos and includes additional exposition and discussion of related work

  16. arXiv:2301.03962  [pdf, other

    cs.LG cs.AI stat.ML

    A Unified Theory of Diversity in Ensemble Learning

    Authors: Danny Wood, Tingting Mu, Andrew Webb, Henry Reeve, Mikel Luján, Gavin Brown

    Abstract: We present a theory of ensemble diversity, explaining the nature of diversity for a wide range of supervised learning scenarios. This challenge has been referred to as the holy grail of ensemble learning, an open research issue for over 30 years. Our framework reveals that diversity is in fact a hidden dimension in the bias-variance decomposition of the ensemble loss. We prove a family of exact bi… ▽ More

    Submitted 7 February, 2024; v1 submitted 10 January, 2023; originally announced January 2023.

    Journal ref: Journal of Machine Learning Research, 24(359), 2023

  17. arXiv:2212.11214  [pdf, other

    cs.AI

    Crowd Score: A Method for the Evaluation of Jokes using Large Language Model AI Voters as Judges

    Authors: Fabricio Goes, Zisen Zhou, Piotr Sawicki, Marek Grzes, Daniel G. Brown

    Abstract: This paper presents the Crowd Score, a novel method to assess the funniness of jokes using large language models (LLMs) as AI judges. Our method relies on inducing different personalities into the LLM and aggregating the votes of the AI judges into a single score to rate jokes. We validate the votes using an auditing technique that checks if the explanation for a particular vote is reasonable usin… ▽ More

    Submitted 21 December, 2022; originally announced December 2022.

    Comments: 11 pages, 3 figures

  18. arXiv:2207.07572  [pdf, other

    cs.LG cs.HC eess.SP

    Outlier detection of vital sign trajectories from COVID-19 patients

    Authors: Sara Summerton, Ann Tivey, Rohan Shotton, Gavin Brown, Oliver C. Redfern, Rachel Oakley, John Radford, David C. Wong

    Abstract: In this work, we present a novel trajectory comparison algorithm to identify abnormal vital sign trends, with the aim of improving recognition of deteriorating health. There is growing interest in continuous wearable vital sign sensors for monitoring patients remotely at home. These monitors are usually coupled to an alerting system, which is triggered when vital sign measurements fall outside a… ▽ More

    Submitted 20 April, 2023; v1 submitted 15 July, 2022; originally announced July 2022.

    Comments: 4 pages, 4 figures, 1 table. Accepted to EMBC 2023, to be indexed in IEEE Xplore and PubMed Medline

  19. arXiv:2206.04743  [pdf, ps, other

    cs.LG

    Strong Memory Lower Bounds for Learning Natural Models

    Authors: Gavin Brown, Mark Bun, Adam Smith

    Abstract: We give lower bounds on the amount of memory required by one-pass streaming algorithms for solving several natural learning problems. In a setting where examples lie in $\{0,1\}^d$ and the optimal classifier can be encoded using $κ$ bits, we show that algorithms which learn using a near-minimal number of examples, $\tilde O(κ)$, must use $\tilde Ω( dκ)$ bits of space. Our space bounds match the di… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

    Comments: 39 Pages. To appear at COLT 2022

  20. arXiv:2204.12155  [pdf, other

    stat.ML cs.LG

    Bias-Variance Decompositions for Margin Losses

    Authors: Danny Wood, Tingting Mu, Gavin Brown

    Abstract: We introduce a novel bias-variance decomposition for a range of strictly convex margin losses, including the logistic loss (minimized by the classic LogitBoost algorithm), as well as the squared margin loss and canonical boosting loss. Furthermore, we show that, for all strictly convex margin losses, the expected risk decomposes into the risk of a "central" model and a term quantifying variation i… ▽ More

    Submitted 26 April, 2022; originally announced April 2022.

    Comments: Supplementary material included

    Journal ref: 25th International Conference on Artificial Intelligence and Statistics, 2022

  21. arXiv:2106.13329  [pdf, ps, other

    cs.LG

    Covariance-Aware Private Mean Estimation Without Private Covariance Estimation

    Authors: Gavin Brown, Marco Gaboardi, Adam Smith, Jonathan Ullman, Lydia Zakynthinou

    Abstract: We present two sample-efficient differentially private mean estimators for $d$-dimensional (sub)Gaussian distributions with unknown covariance. Informally, given $n \gtrsim d/α^2$ samples from such a distribution with mean $μ$ and covariance $Σ$, our estimators output $\tildeμ$ such that $\| \tildeμ- μ\|_Σ \leq α$, where $\| \cdot \|_Σ$ is the Mahalanobis distance. All previous estimators with the… ▽ More

    Submitted 25 March, 2024; v1 submitted 24 June, 2021; originally announced June 2021.

    Comments: 49 pages. Appeared in NeurIPS 2021. Updated version contains improved analysis of Tukey depth mechanism: robustness guarantees, tighter error analysis, and techniques for faster implementation

  22. arXiv:2104.11372  [pdf, other

    cs.RO

    Grasp Synthesis for Novel Objects Using Heuristic-based and Data-driven Active Vision Methods

    Authors: Sabhari Natarajan, Galen Brown, Berk Calli

    Abstract: In this work, we present several heuristic-based and data-driven active vision strategies for viewpoint optimization of an arm-mounted depth camera for the purpose of aiding robotic grasping. These strategies aim to efficiently collect data to boost the performance of an underlying grasp synthesis algorithm. We created an open-source benchmarking platform in simulation (https://github.com/galenbr/… ▽ More

    Submitted 22 April, 2021; originally announced April 2021.

    Comments: 18 pages, 13 figures, submitted to Frontiers in Robotics and AI 2021

  23. When is Memorization of Irrelevant Training Data Necessary for High-Accuracy Learning?

    Authors: Gavin Brown, Mark Bun, Vitaly Feldman, Adam Smith, Kunal Talwar

    Abstract: Modern machine learning models are complex and frequently encode surprising amounts of information about individual inputs. In extreme cases, complex models appear to memorize entire input examples, including seemingly irrelevant information (social security numbers from text, for example). In this paper, we aim to understand whether this sort of memorization is necessary for accurate learning. We… ▽ More

    Submitted 21 July, 2021; v1 submitted 11 December, 2020; originally announced December 2020.

    Journal ref: STOC 2021 Pages 123-132

  24. arXiv:2011.03885  [pdf, other

    cs.LG cs.GT

    Performative Prediction in a Stateful World

    Authors: Gavin Brown, Shlomi Hod, Iden Kalemaj

    Abstract: Deployed supervised machine learning models make predictions that interact with and influence the world. This phenomenon is called performative prediction by Perdomo et al. (ICML 2020). It is an ongoing challenge to understand the influence of such predictions as well as design tools so as to control that influence. We propose a theoretical framework where the response of a target population to th… ▽ More

    Submitted 22 February, 2022; v1 submitted 7 November, 2020; originally announced November 2020.

    Comments: Accepted paper to AISTATS 2022. An earlier version appeared at the Workshop on Consequential Decision Making in Dynamic Environments, NeurIPS 2020

  25. arXiv:2010.14619  [pdf, other

    cs.NE cs.AI

    Ensembles of Spiking Neural Networks

    Authors: Georgiana Neculae, Oliver Rhodes, Gavin Brown

    Abstract: This paper demonstrates how to construct ensembles of spiking neural networks producing state-of-the-art results, achieving classification accuracies of 98.71%, 100.0%, and 99.09%, on the MNIST, NMNIST and DVS Gesture datasets respectively. Furthermore, this performance is achieved using simplified individual models, with ensembles containing less than 50% of the parameters of published reference… ▽ More

    Submitted 6 September, 2021; v1 submitted 15 October, 2020; originally announced October 2020.

    Comments: 16 pages, 3 tables, 5 figures

    MSC Class: 68T07; 62J12

  26. arXiv:2002.12466  [pdf

    cs.RO cs.AI

    Piecewise linear regressions for approximating distance metrics

    Authors: Josiah Putman, Lisa Oh, Luyang Zhao, Evan Honnold, Galen Brown, Weifu Wang, Devin Balkcom

    Abstract: This paper presents a data structure that summarizes distances between configurations across a robot configuration space, using a binary space partition whose cells contain parameters used for a locally linear approximation of the distance function. Querying the data structure is extremely fast, particularly when compared to the graph search required for querying Probabilistic Roadmaps, and memory… ▽ More

    Submitted 27 February, 2020; originally announced February 2020.

  27. arXiv:2001.10318  [pdf, other

    cs.LG cs.IT stat.ML

    Margin Maximization as Lossless Maximal Compression

    Authors: Nikolaos Nikolaou, Henry Reeve, Gavin Brown

    Abstract: The ultimate goal of a supervised learning algorithm is to produce models constructed on the training data that can generalize well to new examples. In classification, functional margin maximization -- correctly classifying as many training examples as possible with maximal confidence --has been known to construct models with good generalization guarantees. This work gives an information-theoretic… ▽ More

    Submitted 28 January, 2020; originally announced January 2020.

    Comments: 19 pages Main Paper + 7 pages Supplementary Material, 7 Figures, Submitted to the Machine Learning journal (11/11/19)

  28. arXiv:2001.06105  [pdf, other

    cs.LG stat.ML

    Better Boosting with Bandits for Online Learning

    Authors: Nikolaos Nikolaou, Joseph Mellor, Nikunj C. Oza, Gavin Brown

    Abstract: Probability estimates generated by boosting ensembles are poorly calibrated because of the margin maximization nature of the algorithm. The outputs of the ensemble need to be properly calibrated before they can be used as probability estimates. In this work, we demonstrate that online boosting is also prone to producing distorted probability estimates. In batch learning, calibration is achieved by… ▽ More

    Submitted 16 January, 2020; originally announced January 2020.

    Comments: 44 pages, 6 figures

  29. arXiv:1910.00722  [pdf

    eess.IV cs.AI cs.CV cs.LG

    Comparing Deep Learning Models for Multi-cell Classification in Liquid-based Cervical Cytology Images

    Authors: Sudhir Sornapudi, G. T. Brown, Zhiyun Xue, Rodney Long, Lisa Allen, Sameer Antani

    Abstract: Liquid-based cytology (LBC) is a reliable automated technique for the screening of Papanicolaou (Pap) smear data. It is an effective technique for collecting a majority of the cervical cells and aiding cytopathologists in locating abnormal cells. Most methods published in the research literature rely on accurate cell segmentation as a prior, which remains challenging due to a variety of factors, e… ▽ More

    Submitted 1 October, 2019; originally announced October 2019.

    Comments: AMIA 2019 Annual Symposium, Washington DC

    Journal ref: AMIA Annu Symp Proc. 2019 (2019) 820-827

  30. Robust Binaural Localization of a Target Sound Source by Combining Spectral Source Models and Deep Neural Networks

    Authors: Ning Ma, Jose A. Gonzalez, Guy J. Brown

    Abstract: Despite there being clear evidence for top-down (e.g., attentional) effects in biological spatial hearing, relatively few machine hearing systems exploit top-down model-based knowledge in sound localisation. This paper addresses this issue by proposing a novel framework for binaural sound localisation that combines model-based information about the spectral characteristics of sound sources and dee… ▽ More

    Submitted 5 April, 2019; originally announced April 2019.

    Comments: 10 pages

    Journal ref: IEEE/ACM Transactions on Audio Speech and Language Processing, vol. 26, no. 11, pp. 2122-2131, 2018

  31. Exploiting Deep Neural Networks and Head Movements for Robust Binaural Localisation of Multiple Sources in Reverberant Environments

    Authors: Ning Ma, Tobias May, Guy J. Brown

    Abstract: This paper presents a novel machine-hearing system that exploits deep neural networks (DNNs) and head movements for robust binaural localisation of multiple sources in reverberant environments. DNNs are used to learn the relationship between the source azimuth and binaural cues, consisting of the complete cross-correlation function (CCF) and interaural level differences (ILDs). In contrast to many… ▽ More

    Submitted 5 April, 2019; originally announced April 2019.

    Comments: 10 pages

    Journal ref: IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 25, no. 12, pp. 2444-2453, 2017

  32. arXiv:1904.02992  [pdf, other

    eess.AS cs.SD

    Deep Learning Features for Robust Detection of Acoustic Events in Sleep-Disordered Breathing

    Authors: Hector E. Romero, Ning Ma, Guy J. Brown, Amy V. Beeston, Madina Hasan

    Abstract: Sleep-disordered breathing (SDB) is a serious and prevalent condition, and acoustic analysis via consumer devices (e.g. smartphones) offers a low-cost solution to screening for it. We present a novel approach for the acoustic identification of SDB sounds, such as snoring, using bottleneck features learned from a corpus of whole-night sound recordings. Two types of bottleneck features are described… ▽ More

    Submitted 5 April, 2019; originally announced April 2019.

    Comments: Accepted by IEEE ICASSP 2018

  33. arXiv:1904.01916  [pdf, other

    cs.SD eess.AS

    End-to-end Binaural Sound Localisation from the Raw Waveform

    Authors: Paolo Vecchiotti, Ning Ma, Stefano Squartini, Guy J. Brown

    Abstract: A novel end-to-end binaural sound localisation approach is proposed which estimates the azimuth of a sound source directly from the waveform. Instead of employing hand-crafted features commonly employed for binaural sound localisation, such as the interaural time and level difference, our end-to-end system approach uses a convolutional neural network (CNN) to extract specific features from the wav… ▽ More

    Submitted 3 April, 2019; originally announced April 2019.

    Comments: Accepted by ICASSP 2019

  34. arXiv:1902.04422  [pdf, other

    stat.ML cs.CV cs.LG

    To Ensemble or Not Ensemble: When does End-To-End Training Fail?

    Authors: Andrew M. Webb, Charles Reynolds, Wenlin Chen, Henry Reeve, Dan-Andrei Iliescu, Mikel Lujan, Gavin Brown

    Abstract: End-to-End training (E2E) is becoming more and more popular to train complex Deep Network architectures. An interesting question is whether this trend will continue-are there any clear failure cases for E2E training? We study this question in depth, for the specific case of E2E training an ensemble of networks. Our strategy is to blend the gradient smoothly in between two extremes: from independen… ▽ More

    Submitted 6 August, 2020; v1 submitted 12 February, 2019; originally announced February 2019.

    Comments: Code: https://github.com/grey-area/modular-loss-experiments. Preprint updated to reflect version accepted for publication at ECML

  35. arXiv:1804.07933  [pdf, other

    cs.LG cs.CR cs.GT stat.ML

    Is feature selection secure against training data poisoning?

    Authors: Huang Xiao, Battista Biggio, Gavin Brown, Giorgio Fumera, Claudia Eckert, Fabio Roli

    Abstract: Learning in adversarial settings is becoming an important task for application domains where attackers may inject malicious data into the training set to subvert normal operation of data-driven technologies. Feature selection has been widely used in machine learning for security applications to improve generalization and computational efficiency, although it is not clear whether its use may be ben… ▽ More

    Submitted 21 April, 2018; originally announced April 2018.

    Journal ref: Proc. of the 32nd ICML, Lille, France, 2015. JMLR: W&CP vol. 37

  36. arXiv:1803.00316  [pdf, other

    cs.LG stat.ML

    The K-Nearest Neighbour UCB algorithm for multi-armed bandits with covariates

    Authors: Henry WJ Reeve, Joe Mellor, Gavin Brown

    Abstract: In this paper we propose and explore the k-Nearest Neighbour UCB algorithm for multi-armed bandits with covariates. We focus on a setting where the covariates are supported on a metric space of low intrinsic dimension, such as a manifold embedded within a high dimensional ambient feature space. The algorithm is conceptually simple and straightforward to implement. The k-Nearest Neighbour UCB algor… ▽ More

    Submitted 1 March, 2018; originally announced March 2018.

    Comments: To be presented at ALT 2018

    Journal ref: Algorithmic Learning Theory 2018

  37. arXiv:1803.00314  [pdf, other

    cs.LG

    Diversity and degrees of freedom in regression ensembles

    Authors: Henry WJ Reeve, Gavin Brown

    Abstract: Ensemble methods are a cornerstone of modern machine learning. The performance of an ensemble depends crucially upon the level of diversity between its constituent learners. This paper establishes a connection between diversity and degrees of freedom (i.e. the capacity of the model), showing that diversity may be viewed as a form of inverse regularisation. This is achieved by focusing on a previou… ▽ More

    Submitted 1 March, 2018; originally announced March 2018.

    Comments: Neurocomputing 2018

    Journal ref: Neurocomputing 2018

  38. arXiv:1803.00310  [pdf, other

    cs.LG stat.ML

    Minimax rates for cost-sensitive learning on manifolds with approximate nearest neighbours

    Authors: Henry WJ Reeve, Gavin Brown

    Abstract: We study the approximate nearest neighbour method for cost-sensitive classification on low-dimensional manifolds embedded within a high-dimensional feature space. We determine the minimax learning rates for distributions on a smooth manifold, in a cost-sensitive setting. This generalises a classic result of Audibert and Tsybakov. Building upon recent work of Chaudhuri and Dasgupta we prove that th… ▽ More

    Submitted 1 March, 2018; originally announced March 2018.

    Comments: Published in ALT 2017

    Journal ref: Algorithmic Learning Theory 2017

  39. arXiv:1708.06939  [pdf, other

    cs.LG cs.RO stat.ML

    Is Deep Learning Safe for Robot Vision? Adversarial Examples against the iCub Humanoid

    Authors: Marco Melis, Ambra Demontis, Battista Biggio, Gavin Brown, Giorgio Fumera, Fabio Roli

    Abstract: Deep neural networks have been widely adopted in recent years, exhibiting impressive performances in several application domains. It has however been shown that they can be fooled by adversarial examples, i.e., images altered by a barely-perceivable adversarial noise, carefully crafted to mislead classification. In this work, we aim to evaluate the extent to which robot-vision systems embodying de… ▽ More

    Submitted 23 August, 2017; originally announced August 2017.

    Comments: Accepted for publication at the ICCV 2017 Workshop on Vision in Practice on Autonomous Robots (ViPAR)

  40. arXiv:1612.01316  [pdf, other

    stat.ML cs.LG stat.AP

    Ranking Biomarkers Through Mutual Information

    Authors: Konstantinos Sechidis, Emily Turner, Paul D. Metcalfe, James Weatherall, Gavin Brown

    Abstract: We study information theoretic methods for ranking biomarkers. In clinical trials there are two, closely related, types of biomarkers: predictive and prognostic, and disentangling them is a key challenge. Our first step is to phrase biomarker ranking in terms of optimizing an information theoretic quantity. This formalization of the problem will enable us to derive rankings of predictive/prognosti… ▽ More

    Submitted 5 December, 2016; originally announced December 2016.

    Comments: Accepted at NIPS 2016 Workshop on Machine Learning for Health

  41. Reversible Communicating Processes

    Authors: Geoffrey Brown, Amr Sabry

    Abstract: Reversible distributed programs have the ability to abort unproductive computation paths and backtrack, while unwinding communication that occurred in the aborted paths. While it is natural to assume that reversibility implies full state recovery (as with traditional roll-back recovery protocols), an interesting alternative is to separate backtracking from local state recovery. For example, such… ▽ More

    Submitted 10 February, 2016; originally announced February 2016.

    Comments: In Proceedings PLACES 2015, arXiv:1602.03254

    Journal ref: EPTCS 203, 2016, pp. 45-59

  42. arXiv:1511.07340  [pdf, other

    cs.LG

    Modular Autoencoders for Ensemble Feature Extraction

    Authors: Henry W J Reeve, Gavin Brown

    Abstract: We introduce the concept of a Modular Autoencoder (MAE), capable of learning a set of diverse but complementary representations from unlabelled data, that can later be used for supervised tasks. The learning of the representations is controlled by a trade off parameter, and we show on six benchmark datasets the optimum lies between two extremes: a set of smaller, independent autoencoders each with… ▽ More

    Submitted 23 November, 2015; originally announced November 2015.

    Comments: 18 pages, 8 figures, to appear in a special issue of The Journal Of Machine Learning Research (vol.44, Dec 2015)

  43. arXiv:1508.06791  [pdf, other

    cs.DC cs.PL

    Boosting Java Performance using GPGPUs

    Authors: James Clarkson, Christos Kotselidis, Gavin Brown, Mikel Luján

    Abstract: Heterogeneous programming has started becoming the norm in order to achieve better performance by running portions of code on the most appropriate hardware resource. Currently, significant engineering efforts are undertaken in order to enable existing programming languages to perform heterogeneous execution mainly on GPUs. In this paper we describe Jacc, an experimental framework which allows deve… ▽ More

    Submitted 27 August, 2015; originally announced August 2015.

  44. arXiv:1304.7942  [pdf, other

    cs.CL

    ManTIME: Temporal expression identification and normalization in the TempEval-3 challenge

    Authors: Michele Filannino, Gavin Brown, Goran Nenadic

    Abstract: This paper describes a temporal expression identification and normalization system, ManTIME, developed for the TempEval-3 challenge. The identification phase combines the use of conditional random fields along with a post-processing identification pipeline, whereas the normalization phase is carried out using NorMA, an open-source rule-based temporal normalizer. We investigate the performance vari… ▽ More

    Submitted 30 April, 2013; originally announced April 2013.

    Comments: 5 pages, 1 figure, 2 tables Second Joint Conference on Lexical and Computational Semantics (*SEM), Volume 2: Seventh International Workshop on Semantic Evaluation (SemEval 2013)

    ACM Class: I.2.7; I.2.4; I.2.6

  45. Seven new champion linear codes

    Authors: Gavin Brown, Alexander M. Kasprzyk

    Abstract: We exhibit seven linear codes exceeding the current best known minimum distance d for their dimension k and block length n. Each code is defined over F_8, and their invariants [n,k,d] are given by [49,13,27], [49,14,26], [49,16,24], [49,17,23], [49,19,21], [49,25,16] and [49,26,15]. Our method includes an exhaustive search of all monomial evaluation codes generated by points in the [0,5]x[0,5] lat… ▽ More

    Submitted 20 December, 2012; originally announced December 2012.

    Comments: 10 pages, 4 figures

    MSC Class: 14G50 (Primary); 52B20; 14M25 (Secondary)

    Journal ref: LMS J. Comput. Math. 16 (2013) 109-117

  46. Small polygons and toric codes

    Authors: Gavin Brown, Alexander M. Kasprzyk

    Abstract: We describe two different approaches to making systematic classifications of plane lattice polygons, and recover the toric codes they generate, over small fields, where these match or exceed the best known minimum distance. This includes a [36,19,12]-code over F_7 whose minimum distance 12 exceeds that of all previously known codes.

    Submitted 1 April, 2012; originally announced April 2012.

    Comments: 9 pages, 4 tables, 3 figures

    MSC Class: 14G50 (Primary) 52B20; 14M25 (Secondary)

    Journal ref: Journal of Symbolic Computation, 51 (2013), 55-62

  47. arXiv:1202.3716  [pdf

    cs.LG stat.ML

    Boosting as a Product of Experts

    Authors: Narayanan U. Edakunni, Gary Brown, Tim Kovacs

    Abstract: In this paper, we derive a novel probabilistic model of boosting as a Product of Experts. We re-derive the boosting algorithm as a greedy incremental model selection procedure which ensures that addition of new experts to the ensemble does not decrease the likelihood of the data. These learning rules lead to a generic boosting algorithm - POE- Boost which turns out to be similar to the AdaBoost al… ▽ More

    Submitted 14 February, 2012; originally announced February 2012.

    Report number: UAI-P-2011-PG-187-194

  48. arXiv:1111.0379  [pdf, other

    q-bio.PE cs.CE

    Fast reconstruction of phylogenetic trees using locality-sensitive hashing

    Authors: Daniel G. Brown, Jakub Truszkowski

    Abstract: We present the first sub-quadratic time algorithm that with high probability correctly reconstructs phylogenetic trees for short sequences generated by a Markov model of evolution. Due to rapid expansion in sequence databases, such very fast algorithms are becoming necessary. Other fast heuristics have been developed for building trees from very large alignments (Price et al, and Brown et al), but… ▽ More

    Submitted 31 May, 2012; v1 submitted 2 November, 2011; originally announced November 2011.

  49. arXiv:1010.1866  [pdf, other

    q-bio.PE cs.CE cs.DS

    Fast error-tolerant quartet phylogeny algorithms

    Authors: Daniel G. Brown, Jakub Truszkowski

    Abstract: We present an algorithm for phylogenetic reconstruction using quartets that returns the correct topology for $n$ taxa in $O(n \log n)$ time with high probability, in a probabilistic model where a quartet is not consistent with the true topology of the tree with constant probability, independent of other quartets. Our incremental algorithm relies upon a search tree structure for the phylogeny that… ▽ More

    Submitted 9 October, 2010; originally announced October 2010.

  50. arXiv:cs/0503074  [pdf, ps, other

    cs.NI cs.OS

    A File System Abstraction for Sense and Respond Systems

    Authors: Sameer Tilak, Bhanu Pisupati, Kenneth Chiu, Geoffrey Brown, Nael Abu-Ghazaleh

    Abstract: The heterogeneity and resource constraints of sense-and-respond systems pose significant challenges to system and application development. In this paper, we present a flexible, intuitive file system abstraction for organizing and managing sense-and-respond systems based on the Plan 9 design principles. A key feature of this abstraction is the ability to support multiple views of the system via f… ▽ More

    Submitted 28 March, 2005; originally announced March 2005.

    Comments: 6 pages, 3 figures Workshop on End-to-End, Sense-and-Respond Systems, Applications, and Services In conjunction with MobiSys '05

    ACM Class: D.4.3 Distributed file systems; C.2.1 Wireless communication