Skip to main content

Showing 1–31 of 31 results for author: Frank, E

  1. arXiv:2407.06544  [pdf, other

    cs.LG

    Multiple Instance Verification

    Authors: Xin Xu, Eibe Frank, Geoffrey Holmes

    Abstract: We explore multiple-instance verification, a problem setting where a query instance is verified against a bag of target instances with heterogeneous, unknown relevancy. We show that naive adaptations of attention-based multiple instance learning (MIL) methods and standard verification methods like Siamese neural networks are unsuitable for this setting: directly combining state-of-the-art (SOTA) M… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 30 pages

  2. arXiv:2209.13917  [pdf, other

    cs.LG cs.AI

    A simple but strong baseline for online continual learning: Repeated Augmented Rehearsal

    Authors: Yaqian Zhang, Bernhard Pfahringer, Eibe Frank, Albert Bifet, Nick Jin Sean Lim, Yunzhe Jia

    Abstract: Online continual learning (OCL) aims to train neural networks incrementally from a non-stationary data stream with a single pass through data. Rehearsal-based methods attempt to approximate the observed input distributions over time with a small memory and revisit them later to avoid forgetting. Despite its strong empirical performance, rehearsal methods still suffer from a poor approximation of t… ▽ More

    Submitted 13 November, 2022; v1 submitted 28 September, 2022; originally announced September 2022.

    Comments: NeurIPS 2022

  3. arXiv:2206.02127  [pdf, other

    cs.LG

    DeeprETA: An ETA Post-processing System at Scale

    Authors: Xinyu Hu, Tanmay Binaykiya, Eric Frank, Olcay Cirit

    Abstract: Estimated Time of Arrival (ETA) plays an important role in delivery and ride-hailing platforms. For example, Uber uses ETAs to calculate fares, estimate pickup times, match riders to drivers, plan deliveries, and more. Commonly used route planning algorithms predict an ETA conditioned on the best available route, but such ETA estimates can be unreliable when the actual route taken is not known in… ▽ More

    Submitted 5 June, 2022; originally announced June 2022.

  4. arXiv:2205.05831  [pdf, other

    cs.CV cs.LG

    Feature Extractor Stacking for Cross-domain Few-shot Learning

    Authors: Hongyu Wang, Eibe Frank, Bernhard Pfahringer, Michael Mayo, Geoffrey Holmes

    Abstract: Cross-domain few-shot learning (CDFSL) addresses learning problems where knowledge needs to be transferred from one or more source domains into an instance-scarce target domain with an explicitly different distribution. Recently published CDFSL methods generally construct a universal model that combines knowledge of multiple source domains into one feature extractor. This enables efficient inferen… ▽ More

    Submitted 24 October, 2023; v1 submitted 11 May, 2022; originally announced May 2022.

  5. Hitting the Target: Stopping Active Learning at the Cost-Based Optimum

    Authors: Zac Pullar-Strecker, Katharina Dost, Eibe Frank, Jörg Wicker

    Abstract: Active learning allows machine learning models to be trained using fewer labels while retaining similar performance to traditional supervised learning. An active learner selects the most informative data points, requests their labels, and retrains itself. While this approach is promising, it raises the question of how to determine when the model is `good enough' without the additional labels requi… ▽ More

    Submitted 2 September, 2022; v1 submitted 7 October, 2021; originally announced October 2021.

    Comments: 20 pages, 7 figures

  6. Semi-Supervised Learning using Siamese Networks

    Authors: Attaullah Sahito, Eibe Frank, Bernhard Pfahringer

    Abstract: Neural networks have been successfully used as classification models yielding state-of-the-art results when trained on a large number of labeled samples. These models, however, are more difficult to train successfully for semi-supervised problems where small amounts of labeled instances are available along with a large number of unlabeled instances. This work explores a new training method for sem… ▽ More

    Submitted 9 September, 2021; v1 submitted 2 September, 2021; originally announced September 2021.

    Comments: added link of GitHub repository

    Journal ref: In AI 2019: Advances in Artificial Intelligence. AI 2019 . Lecture Notes in Computer Science, vol 11919. Springer, Cham

  7. Transfer of Pretrained Model Weights Substantially Improves Semi-Supervised Image Classification

    Authors: Attaullah Sahito, Eibe Frank, Bernhard Pfahringer

    Abstract: Deep neural networks produce state-of-the-art results when trained on a large number of labeled examples but tend to overfit when small amounts of labeled examples are used for training. Creating a large number of labeled examples requires considerable resources, time, and effort. If labeling new data is not feasible, so-called semi-supervised learning can achieve better generalisation than purely… ▽ More

    Submitted 9 September, 2021; v1 submitted 2 September, 2021; originally announced September 2021.

    Comments: added link to code and data repo

    Journal ref: In: AI 2020: Advances in Artificial Intelligence. AI 2020 . Lecture Notes in Computer Science, vol 12576. Springer, Cham

  8. arXiv:2109.00778  [pdf, other

    cs.CV

    Better Self-training for Image Classification through Self-supervision

    Authors: Attaullah Sahito, Eibe Frank, Bernhard Pfahringer

    Abstract: Self-training is a simple semi-supervised learning approach: Unlabelled examples that attract high-confidence predictions are labelled with their predictions and added to the training set, with this process being repeated multiple times. Recently, self-supervision -- learning without manual supervision by solving an automatically-generated pretext task -- has gained prominence in deep learning. Th… ▽ More

    Submitted 9 September, 2021; v1 submitted 2 September, 2021; originally announced September 2021.

    Comments: added link to code repository

  9. arXiv:2106.06161  [pdf, other

    cs.DC

    Bandwidth-Optimal Random Shuffling for GPUs

    Authors: Rory Mitchell, Daniel Stokes, Eibe Frank, Geoffrey Holmes

    Abstract: Linear-time algorithms that are traditionally used to shuffle data on CPUs, such as the method of Fisher-Yates, are not well suited to implementation on GPUs due to inherent sequential dependencies, and existing parallel shuffling algorithms are unsuitable for GPU architectures because they incur a large number of read/write operations to high latency global memory. To address this, we provide a m… ▽ More

    Submitted 3 February, 2022; v1 submitted 11 June, 2021; originally announced June 2021.

  10. arXiv:2104.12199  [pdf, other

    stat.ML cs.LG math.CO

    Sampling Permutations for Shapley Value Estimation

    Authors: Rory Mitchell, Joshua Cooper, Eibe Frank, Geoffrey Holmes

    Abstract: Game-theoretic attribution techniques based on Shapley values are used to interpret black-box machine learning models, but their exact calculation is generally NP-hard, requiring approximation methods for non-trivial models. As the computation of Shapley values can be expressed as a summation over a set of permutations, a common approach is to sample a subset of these permutations for approximatio… ▽ More

    Submitted 3 February, 2022; v1 submitted 25 April, 2021; originally announced April 2021.

    Comments: 33 pages, 13 figures

    MSC Class: 05A05 (Primary) 65K10; 90C27 (Secondary) ACM Class: I.2.6; G.2.1

  11. arXiv:2010.13972  [pdf, other

    cs.LG cs.DC

    GPUTreeShap: Massively Parallel Exact Calculation of SHAP Scores for Tree Ensembles

    Authors: Rory Mitchell, Eibe Frank, Geoffrey Holmes

    Abstract: SHAP (SHapley Additive exPlanation) values provide a game theoretic interpretation of the predictions of machine learning models based on Shapley values. While exact calculation of SHAP values is computationally intractable in general, a recursive polynomial-time algorithm called TreeShap is available for decision tree models. However, despite its polynomial time complexity, TreeShap can become a… ▽ More

    Submitted 3 February, 2022; v1 submitted 26 October, 2020; originally announced October 2020.

  12. Deep Learning in Diabetic Foot Ulcers Detection: A Comprehensive Evaluation

    Authors: Moi Hoon Yap, Ryo Hachiuma, Azadeh Alavi, Raphael Brungel, Bill Cassidy, Manu Goyal, Hongtao Zhu, Johannes Ruckert, Moshe Olshansky, Xiao Huang, Hideo Saito, Saeed Hassanpour, Christoph M. Friedrich, David Ascher, Anping Song, Hiroki Kajita, David Gillespie, Neil D. Reeves, Joseph Pappachan, Claire O'Shea, Eibe Frank

    Abstract: There has been a substantial amount of research involving computer methods and technology for the detection and recognition of diabetic foot ulcers (DFUs), but there is a lack of systematic comparisons of state-of-the-art deep learning object detection frameworks applied to this problem. DFUC2020 provided participants with a comprehensive dataset consisting of 2,000 images for training and 2,000 i… ▽ More

    Submitted 24 May, 2021; v1 submitted 7 October, 2020; originally announced October 2020.

    Comments: 19 pages, 18 figures, 10 tables

    Journal ref: Computers in Biology and Medicine, Volume 135, 2021, 104596, ISSN 0010-4825,

  13. arXiv:2008.01527  [pdf, other

    physics.app-ph cs.LG

    Universal Battery Performance and Degradation Model for Electric Aircraft

    Authors: Alexander Bills, Shashank Sripad, William L. Fredericks, Matthew Guttenberg, Devin Charles, Evan Frank, Venkatasubramanian Viswanathan

    Abstract: Development of Urban Air Mobility (UAM) concepts has been primarily focused on electric vertical takeoff and landing aircraft (eVTOLs), small aircraft which can land and takeoff vertically, and which are powered by rechargeable (typically lithium-ion) batteries. Design, analysis, and operation of eVTOLs requires fast and accurate prediction of Li-ion battery performance throughout the lifetime of… ▽ More

    Submitted 16 March, 2021; v1 submitted 6 July, 2020; originally announced August 2020.

    Comments: 38 pages, 5 figures, 8 pages of Supplementary information

  14. arXiv:2005.07353  [pdf, ps, other

    cs.LG stat.ML

    Adaptive XGBoost for Evolving Data Streams

    Authors: Jacob Montiel, Rory Mitchell, Eibe Frank, Bernhard Pfahringer, Talel Abdessalem, Albert Bifet

    Abstract: Boosting is an ensemble method that combines base models in a sequential manner to achieve high predictive accuracy. A popular learning algorithm based on this ensemble method is eXtreme Gradient Boosting (XGB). We present an adaptation of XGB for classification of evolving data streams. In this setting, new data arrives over time and the relationship between the class and the features may change… ▽ More

    Submitted 15 May, 2020; originally announced May 2020.

    Comments: To be published in Proceedings of the International Joint Conference on Neural Networks (IJCNN) 2020, 8 pages

  15. DFUC2020: Analysis Towards Diabetic Foot Ulcer Detection

    Authors: Bill Cassidy, Neil D. Reeves, Pappachan Joseph, David Gillespie, Claire O'Shea, Satyan Rajbhandari, Arun G. Maiya, Eibe Frank, Andrew Boulton, David Armstrong, Bijan Najafi, Justina Wu, Moi Hoon Yap

    Abstract: Every 20 seconds, a limb is amputated somewhere in the world due to diabetes. This is a global health problem that requires a global solution. The MICCAI challenge discussed in this paper, which concerns the automated detection of diabetic foot ulcers using machine learning techniques, will accelerate the development of innovative healthcare technology to address this unmet medical need. In an eff… ▽ More

    Submitted 24 May, 2021; v1 submitted 24 April, 2020; originally announced April 2020.

    Comments: 16 pages, 8 figures

    Journal ref: touchREVIEWS in Endocrinology, 17(1):5-11 (2021)

  16. arXiv:2004.02942  [pdf

    cs.LG cs.PL cs.SE stat.ML

    Embedding Java Classes with code2vec: Improvements from Variable Obfuscation

    Authors: Rhys Compton, Eibe Frank, Panos Patros, Abigail Koay

    Abstract: Automatic source code analysis in key areas of software engineering, such as code security, can benefit from Machine Learning (ML). However, many standard ML approaches require a numeric representation of data and cannot be applied directly to source code. Thus, to enable ML, we need to embed source code into numeric feature vectors while maintaining the semantics of the code as much as possible.… ▽ More

    Submitted 6 April, 2020; originally announced April 2020.

    Comments: In 17th International Conference on Mining Software Repositories (MSR) 2020, Seoul, Republic of Korea. 11 pages

  17. arXiv:1912.13405  [pdf, other

    cs.LG cs.AI stat.ML

    Classifier Chains: A Review and Perspectives

    Authors: Jesse Read, Bernhard Pfahringer, Geoff Holmes, Eibe Frank

    Abstract: The family of methods collectively known as classifier chains has become a popular approach to multi-label learning problems. This approach involves linking together off-the-shelf binary classifiers in a chain structure, such that class label predictions become features for other classifiers. Such methods have proved flexible and effective and have obtained state-of-the-art empirical performance a… ▽ More

    Submitted 15 April, 2020; v1 submitted 26 December, 2019; originally announced December 2019.

    Journal ref: Journal of Artificial Intelligence Research 70 (2021) 683-718

  18. arXiv:1912.02164  [pdf, other

    cs.CL cs.AI cs.LG

    Plug and Play Language Models: A Simple Approach to Controlled Text Generation

    Authors: Sumanth Dathathri, Andrea Madotto, Janice Lan, Jane Hung, Eric Frank, Piero Molino, Jason Yosinski, Rosanne Liu

    Abstract: Large transformer-based language models (LMs) trained on huge text corpora have shown unparalleled generation capabilities. However, controlling attributes of the generated language (e.g. switching topic or sentiment) is difficult without modifying the model architecture or fine-tuning on attribute-specific data and entailing the significant cost of retraining. We propose a simple alternative: the… ▽ More

    Submitted 3 March, 2020; v1 submitted 4 December, 2019; originally announced December 2019.

    Comments: ICLR 2020 camera ready

  19. arXiv:1901.07777  [pdf, ps, other

    stat.ML cs.LG

    Stochastic Gradient Trees

    Authors: Henry Gouk, Bernhard Pfahringer, Eibe Frank

    Abstract: We present an algorithm for learning decision trees using stochastic gradient information as the source of supervision. In contrast to previous approaches to gradient-based tree learning, our method operates in the incremental learning setting rather than the batch learning setting, and does not make use of soft splits or require the construction of a new tree for every update. We demonstrate how… ▽ More

    Submitted 24 September, 2019; v1 submitted 23 January, 2019; originally announced January 2019.

    Comments: Accepted at ACML 2019

  20. arXiv:1811.11357  [pdf, other

    stat.ML cs.LG

    Metropolis-Hastings Generative Adversarial Networks

    Authors: Ryan Turner, Jane Hung, Eric Frank, Yunus Saatci, Jason Yosinski

    Abstract: We introduce the Metropolis-Hastings generative adversarial network (MH-GAN), which combines aspects of Markov chain Monte Carlo and GANs. The MH-GAN draws samples from the distribution implicitly defined by a GAN's discriminator-generator pair, as opposed to standard GANs which draw samples from the distribution defined only by the generator. It uses the discriminator from GAN training to build a… ▽ More

    Submitted 17 May, 2019; v1 submitted 27 November, 2018; originally announced November 2018.

  21. arXiv:1809.02744  [pdf, ps, other

    cs.LG stat.ML

    On the Calibration of Nested Dichotomies for Large Multiclass Tasks

    Authors: Tim Leathart, Eibe Frank, Bernhard Pfahringer, Geoffrey Holmes

    Abstract: Nested dichotomies are used as a method of transforming a multiclass classification problem into a series of binary problems. A tree structure is induced that recursively splits the set of classes into subsets, and a binary classification model learns to discriminate between the two subsets of classes at each node. In this paper, we demonstrate that these nested dichotomies typically exhibit poor… ▽ More

    Submitted 2 October, 2018; v1 submitted 7 September, 2018; originally announced September 2018.

  22. arXiv:1809.02740  [pdf, other

    cs.LG stat.ML

    Ensembles of Nested Dichotomies with Multiple Subset Evaluation

    Authors: Tim Leathart, Eibe Frank, Bernhard Pfahringer, Geoffrey Holmes

    Abstract: A system of nested dichotomies is a method of decomposing a multi-class problem into a collection of binary problems. Such a system recursively applies binary splits to divide the set of classes into two subsets, and trains a binary classifier for each split. Many methods have been proposed to perform this split, each with various advantages and disadvantages. In this paper, we present a simple, g… ▽ More

    Submitted 10 September, 2018; v1 submitted 7 September, 2018; originally announced September 2018.

  23. arXiv:1808.00111  [pdf, other

    cs.LG stat.ML

    Probability Calibration Trees

    Authors: Tim Leathart, Eibe Frank, Geoffrey Holmes, Bernhard Pfahringer

    Abstract: Obtaining accurate and well calibrated probability estimates from classifiers is useful in many applications, for example, when minimising the expected cost of classifications. Existing methods of calibrating probability estimates are applied globally, ignoring the potential for improvements by applying a more fine-grained model. We propose probability calibration trees, a modification of logistic… ▽ More

    Submitted 14 September, 2018; v1 submitted 31 July, 2018; originally announced August 2018.

    Comments: Proceedings of the 9th Asian Conference on Machine Learning

    Journal ref: Leathart, T., Frank, E., Holmes, G., & Pfahringer, B. (2017). Probability calibration trees. In Proceedings of the 9th Asian Conference on Machine Learning (pp. 145-160)

  24. arXiv:1807.03247  [pdf, other

    cs.CV cs.LG stat.ML

    An Intriguing Failing of Convolutional Neural Networks and the CoordConv Solution

    Authors: Rosanne Liu, Joel Lehman, Piero Molino, Felipe Petroski Such, Eric Frank, Alex Sergeev, Jason Yosinski

    Abstract: Few ideas have enjoyed as large an impact on deep learning as convolution. For any problem involving pixels or spatial representations, common intuition holds that convolutional neural networks may be appropriate. In this paper we show a striking counterexample to this intuition via the seemingly trivial coordinate transform problem, which simply requires learning a mapping between coordinates in… ▽ More

    Submitted 3 December, 2018; v1 submitted 9 July, 2018; originally announced July 2018.

    Comments: Published in NeurIPS 2018

  25. arXiv:1806.11248  [pdf, other

    cs.LG stat.ML

    XGBoost: Scalable GPU Accelerated Learning

    Authors: Rory Mitchell, Andrey Adinets, Thejaswi Rao, Eibe Frank

    Abstract: We describe the multi-GPU gradient boosting algorithm implemented in the XGBoost library (https://github.com/dmlc/xgboost). Our algorithm allows fast, scalable training on multi-GPU systems with all of the features of the XGBoost library. We employ data compression techniques to minimise the usage of scarce GPU memory while still allowing highly efficient implementation. Using our algorithm we sho… ▽ More

    Submitted 28 June, 2018; originally announced June 2018.

  26. arXiv:1804.05965  [pdf, ps, other

    stat.ML cs.LG

    MaxGain: Regularisation of Neural Networks by Constraining Activation Magnitudes

    Authors: Henry Gouk, Bernhard Pfahringer, Eibe Frank, Michael Cree

    Abstract: Effective regularisation of neural networks is essential to combat overfitting due to the large number of parameters involved. We present an empirical analogue to the Lipschitz constant of a feed-forward neural network, which we refer to as the maximum gain. We hypothesise that constraining the gain of a network will have a regularising effect, similar to how constraining the Lipschitz constant of… ▽ More

    Submitted 1 July, 2018; v1 submitted 16 April, 2018; originally announced April 2018.

    Comments: Accepted at ECML-PKDD 2018

  27. arXiv:1804.04368  [pdf, ps, other

    stat.ML cs.LG

    Regularisation of Neural Networks by Enforcing Lipschitz Continuity

    Authors: Henry Gouk, Eibe Frank, Bernhard Pfahringer, Michael J. Cree

    Abstract: We investigate the effect of explicitly enforcing the Lipschitz continuity of neural networks with respect to their inputs. To this end, we provide a simple technique for computing an upper bound to the Lipschitz constant---for multiple $p$-norms---of a feed forward neural network composed of commonly used layer types. Our technique is then used to formulate training a neural network with a bounde… ▽ More

    Submitted 9 August, 2020; v1 submitted 12 April, 2018; originally announced April 2018.

  28. arXiv:1707.04943  [pdf, other

    cs.AI

    Improving Naive Bayes for Regression with Optimised Artificial Surrogate Data

    Authors: Michael Mayo, Eibe Frank

    Abstract: Can we evolve better training data for machine learning algorithms? To investigate this question we use population-based optimisation algorithms to generate artificial surrogate training data for naive Bayes for regression. We demonstrate that the generalisation performance of naive Bayes for regression models is enhanced by training them on the artificial data as opposed to the real data. These r… ▽ More

    Submitted 27 November, 2018; v1 submitted 16 July, 2017; originally announced July 2017.

  29. arXiv:1604.01854  [pdf, other

    stat.ML cs.LG

    Building Ensembles of Adaptive Nested Dichotomies with Random-Pair Selection

    Authors: Tim Leathart, Bernhard Pfahringer, Eibe Frank

    Abstract: A system of nested dichotomies is a method of decomposing a multi-class problem into a collection of binary problems. Such a system recursively splits the set of classes into two subsets, and trains a binary classifier to distinguish between each subset. Even though ensembles of nested dichotomies with random structure have been shown to perform well in practice, using a more sophisticated class s… ▽ More

    Submitted 5 July, 2016; v1 submitted 6 April, 2016; originally announced April 2016.

    Comments: ECMLPKDD 2016

  30. arXiv:1212.2487  [pdf

    cs.LG stat.ML

    Locally Weighted Naive Bayes

    Authors: Eibe Frank, Mark Hall, Bernhard Pfahringer

    Abstract: Despite its simplicity, the naive Bayes classifier has surprised machine learning researchers by exhibiting good performance on a variety of learning problems. Encouraged by these results, researchers have looked to overcome naive Bayes primary weakness - attribute independence - and improve the performance of the algorithm. This paper presents a locally weighted version of nai… ▽ More

    Submitted 19 October, 2012; originally announced December 2012.

    Comments: Appears in Proceedings of the Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI2003)

    Report number: UAI-P-2003-PG-249-256

  31. arXiv:cs/9902007  [pdf

    cs.DL

    KEA: Practical Automatic Keyphrase Extraction

    Authors: Ian H. Witten, Gordon W. Paynter, Eibe Frank, Carl Gutwin, Craig G. Nevill-Manning

    Abstract: Keyphrases provide semantic metadata that summarize and characterize documents. This paper describes Kea, an algorithm for automatically extracting keyphrases from text. Kea identifies candidate keyphrases using lexical methods, calculates feature values for each candidate, and uses a machine-learning algorithm to predict which candidates are good keyphrases. The machine learning scheme first bu… ▽ More

    Submitted 4 February, 1999; originally announced February 1999.

    Comments: 9 pages

    ACM Class: H.3.7