Skip to main content

Showing 1–12 of 12 results for author: Lensen, A

  1. arXiv:2403.14139  [pdf, other

    cs.NE cs.LG

    Genetic Programming for Explainable Manifold Learning

    Authors: Ben Cravens, Andrew Lensen, Paula Maddigan, Bing Xue

    Abstract: Manifold learning techniques play a pivotal role in machine learning by revealing lower-dimensional embeddings within high-dimensional data, thus enhancing both the efficiency and interpretability of data analysis by transforming the data into a lower-dimensional representation. However, a notable challenge with current manifold learning methods is their lack of explicit functional mappings, cruci… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  2. arXiv:2403.03397  [pdf, other

    cs.NE

    Explaining Genetic Programming Trees using Large Language Models

    Authors: Paula Maddigan, Andrew Lensen, Bing Xue

    Abstract: Genetic programming (GP) has the potential to generate explainable results, especially when used for dimensionality reduction. In this research, we investigate the potential of leveraging eXplainable AI (XAI) and large language models (LLMs) like ChatGPT to improve the interpretability of GP-based non-linear dimensionality reduction. Our study introduces a novel XAI dashboard named GP4NLDR, the fi… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  3. arXiv:2304.08915  [pdf, other

    cs.NE cs.LG

    Differentiable Genetic Programming for High-dimensional Symbolic Regression

    Authors: Peng Zeng, Xiaotian Song, Andrew Lensen, Yuwei Ou, Yanan Sun, Mengjie Zhang, Jiancheng Lv

    Abstract: Symbolic regression (SR) is the process of discovering hidden relationships from data with mathematical expressions, which is considered an effective way to reach interpretable machine learning (ML). Genetic programming (GP) has been the dominator in solving SR problems. However, as the scale of SR problems increases, GP often poorly demonstrates and cannot effectively address the real-world high-… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

  4. arXiv:2301.06678  [pdf, other

    cs.CV cs.LG

    Feature-based Image Matching for Identifying Individual Kākā

    Authors: Fintan O'Sullivan, Kirita-Rose Escott, Rachael C. Shaw, Andrew Lensen

    Abstract: This report investigates an unsupervised, feature-based image matching pipeline for the novel application of identifying individual kākā. Applied with a similarity network for clustering, this addresses a weakness of current supervised approaches to identifying individual birds which struggle to handle the introduction of new individuals to the population. Our approach uses object localisation to… ▽ More

    Submitted 23 January, 2023; v1 submitted 16 January, 2023; originally announced January 2023.

    Comments: 42 pages, honour's report from Victoria University of Wellington

  5. arXiv:2208.06981  [pdf

    cs.LG cs.CL cs.CY cs.NE

    Explainable Artificial Intelligence for Assault Sentence Prediction in New Zealand

    Authors: Harry Rodger, Andrew Lensen, Marcin Betkier

    Abstract: The judiciary has historically been conservative in its use of Artificial Intelligence, but recent advances in machine learning have prompted scholars to reconsider such use in tasks like sentence prediction. This paper investigates by experimentation the potential use of explainable artificial intelligence for predicting imprisonment sentences in assault cases in New Zealand's courts. We propose… ▽ More

    Submitted 14 August, 2022; originally announced August 2022.

    Comments: 16 pages, 3 figures, accepted by the Journal of the Royal Society of New Zealand

  6. Genetic Programming for Manifold Learning: Preserving Local Topology

    Authors: Andrew Lensen, Bing Xue, Mengjie Zhang

    Abstract: Manifold learning methods are an invaluable tool in today's world of increasingly huge datasets. Manifold learning algorithms can discover a much lower-dimensional representation (embedding) of a high-dimensional dataset through non-linear transformations that preserve the most important structure of the original data. State-of-the-art manifold learning methods directly optimise an embedding witho… ▽ More

    Submitted 22 August, 2021; originally announced August 2021.

    Comments: Accepted by IEEE Transactions on Evolutionary Computation, 2021

  7. arXiv:2102.01355  [pdf, other

    cs.LG cs.AI cs.CV cs.NE

    Mining Feature Relationships in Data

    Authors: Andrew Lensen

    Abstract: When faced with a new dataset, most practitioners begin by performing exploratory data analysis to discover interesting patterns and characteristics within data. Techniques such as association rule mining are commonly applied to uncover relationships between features (attributes) of the data. However, association rules are primarily designed for use on binary or categorical data, due to their use… ▽ More

    Submitted 2 February, 2021; originally announced February 2021.

    Comments: 16 pages, accepted in EuroGP '21

  8. Genetic Programming for Evolving a Front of Interpretable Models for Data Visualisation

    Authors: Andrew Lensen, Bing Xue, Mengjie Zhang

    Abstract: Data visualisation is a key tool in data mining for understanding big datasets. Many visualisation methods have been proposed, including the well-regarded state-of-the-art method t-Distributed Stochastic Neighbour Embedding. However, the most powerful visualisation methods have a significant limitation: the manner in which they create their visualisation from the original features of the dataset i… ▽ More

    Submitted 26 January, 2020; originally announced January 2020.

    Comments: Accepted by IEEE Transactions on Cybernetics, 2020

  9. Multi-Objective Genetic Programming for Manifold Learning: Balancing Quality and Dimensionality

    Authors: Andrew Lensen, Mengjie Zhang, Bing Xue

    Abstract: Manifold learning techniques have become increasingly valuable as data continues to grow in size. By discovering a lower-dimensional representation (embedding) of the structure of a dataset, manifold learning algorithms can substantially reduce the dimensionality of a dataset while preserving as much information as possible. However, state-of-the-art manifold learning algorithms are opaque in how… ▽ More

    Submitted 5 January, 2020; originally announced January 2020.

    Comments: 31 pages, pre-print accepted by Genetic Programming and Evolvable Machines journal

  10. Genetic Programming for Evolving Similarity Functions for Clustering: Representations and Analysis

    Authors: Andrew Lensen, Bing Xue, Mengjie Zhang

    Abstract: Clustering is a difficult and widely-studied data mining task, with many varieties of clustering algorithms proposed in the literature. Nearly all algorithms use a similarity measure such as a distance metric (e.g. Euclidean distance) to decide which instances to assign to the same cluster. These similarity measures are generally pre-defined and cannot be easily tailored to the properties of a par… ▽ More

    Submitted 22 October, 2019; originally announced October 2019.

    Comments: 29 pages, accepted by Evolutionary Computation (Journal), MIT Press

  11. Can Genetic Programming Do Manifold Learning Too?

    Authors: Andrew Lensen, Bing Xue, Mengjie Zhang

    Abstract: Exploratory data analysis is a fundamental aspect of knowledge discovery that aims to find the main characteristics of a dataset. Dimensionality reduction, such as manifold learning, is often used to reduce the number of features in a dataset to a manageable level for human interpretation. Despite this, most manifold learning techniques do not explain anything about the original features nor the t… ▽ More

    Submitted 8 February, 2019; originally announced February 2019.

    Comments: 16 pages, accepted in EuroGP '19

  12. Generating Redundant Features with Unsupervised Multi-Tree Genetic Programming

    Authors: Andrew Lensen, Bing Xue, Mengjie Zhang

    Abstract: Recently, feature selection has become an increasingly important area of research due to the surge in high-dimensional datasets in all areas of modern life. A plethora of feature selection algorithms have been proposed, but it is difficult to truly analyse the quality of a given algorithm. Ideally, an algorithm would be evaluated by measuring how well it removes known bad features. Acquiring datas… ▽ More

    Submitted 20 March, 2018; v1 submitted 1 February, 2018; originally announced February 2018.

    Comments: 16 pages, preprint for EuroGP '18