Skip to main content

Showing 1–50 of 239 results for author: Jones, M

  1. arXiv:2407.01481  [pdf, other

    cs.DC cs.PF

    LLload: Simplifying Real-Time Job Monitoring for HPC Users

    Authors: Chansup Byun, Julia Mullen, Albert Reuther, William Arcand, William Bergeron, David Bestor, Daniel Burrill, Vijay Gadepally, Michael Houle, Matthew Hubbell, Hayden Jananthan, Michael Jones, Peter Michaleas, Guillermo Morales, Andrew Prout, Antonio Rosa, Charles Yee, Jeremy Kepner, Lauren Milechin

    Abstract: One of the more complex tasks for researchers using HPC systems is performance monitoring and tuning of their applications. Developing a practice of continuous performance improvement, both for speed-up and efficient use of resources is essential to the long term success of both the HPC practitioner and the research project. Profiling tools provide a nice view of the performance of an application… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2406.06899  [pdf, other

    cs.RO

    Developing, Analyzing, and Evaluating Vehicular Lane Keeping Algorithms Under Dynamic Lighting and Weather Conditions Using Electric Vehicles

    Authors: Michael Khalfin, Jack Volgren, Matthew Jones, Luke LeGoullon, Joshua Siegel, Chan-Jin Chung

    Abstract: Self-driving vehicles have the potential to reduce accidents and fatalities on the road. Many production vehicles already come equipped with basic self-driving capabilities, but have trouble following lanes in adverse lighting and weather conditions. Therefore, we develop, analyze, and evaluate two vehicular lane-keeping algorithms under dynamic weather conditions using a combined deep learning- a… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Supported by the National Science Foundation under Grants No. 2150292 and 2150096

  3. arXiv:2405.19681  [pdf, other

    stat.ML cs.LG stat.CO

    Bayesian Online Natural Gradient (BONG)

    Authors: Matt Jones, Peter Chang, Kevin Murphy

    Abstract: We propose a novel approach to sequential Bayesian inference based on variational Bayes. The key insight is that, in the online setting, we do not need to add the KL term to regularize to the prior (which comes from the posterior at the previous timestep); instead we can optimize just the expected log-likelihood, performing a single step of natural gradient descent starting at the prior predictive… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 41 pages, 11 figures

  4. arXiv:2405.05646  [pdf, other

    stat.ML cs.LG eess.SY

    Outlier-robust Kalman Filtering through Generalised Bayes

    Authors: Gerardo Duran-Martin, Matias Altamirano, Alexander Y. Shestopaloff, Leandro Sánchez-Betancourt, Jeremias Knoblauch, Matt Jones, François-Xavier Briol, Kevin Murphy

    Abstract: We derive a novel, provably robust, and closed-form Bayesian update rule for online filtering in state-space models in the presence of outliers and misspecified measurement models. Our method combines generalised Bayesian inference with filtering methods such as the extended and ensemble Kalman filter. We use the former to show robustness and the latter to ensure computational efficiency in the ca… ▽ More

    Submitted 28 May, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

    Comments: 41st International Conference on Machine Learning (ICML 2024)

  5. arXiv:2405.01536  [pdf, other

    cs.CV cs.GR cs.LG

    Customizing Text-to-Image Models with a Single Image Pair

    Authors: Maxwell Jones, Sheng-Yu Wang, Nupur Kumari, David Bau, Jun-Yan Zhu

    Abstract: Art reinterpretation is the practice of creating a variation of a reference work, making a paired artwork that exhibits a distinct artistic style. We ask if such an image pair can be used to customize a generative model to capture the demonstrated stylistic difference. We propose Pair Customization, a new customization method that learns stylistic difference from a single image pair and then appli… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: project page: https://paircustomization.github.io/

  6. arXiv:2405.01091  [pdf, other

    cs.CC

    Maximizing Network Phylogenetic Diversity

    Authors: Leo van Iersel, Mark Jones, Jannik Schestag, Celine Scornavacca, Mathias Weller

    Abstract: Network Phylogenetic Diversity (Network-PD) is a measure for the diversity of a set of species based on a rooted phylogenetic network (with branch lengths and inheritance probabilities on the reticulation edges) describing the evolution of those species. We consider the \textsc{Max-Network-PD} problem: given such a network, find~$k$ species with maximum Network-PD score. We show that this problem… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  7. arXiv:2404.14643  [pdf, other

    cs.CR cs.CY cs.GR cs.NI cs.SI

    Teaching Network Traffic Matrices in an Interactive Game Environment

    Authors: Chasen Milner, Hayden Jananthan, Jeremy Kepner, Vijay Gadepally, Michael Jones, Peter Michaleas, Ritesh Patel, Sandeep Pisharody, Gabriel Wachman, Alex Pentland

    Abstract: The Internet has become a critical domain for modern society that requires ongoing efforts for its improvement and protection. Network traffic matrices are a powerful tool for understanding and analyzing networks and are broadly taught in online graph theory educational resources. Network traffic matrix concepts are rarely available in online computer network and cybersecurity educational resource… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 9 pages, 10 figures, 52 references; accepted to IEEE GrAPL

  8. arXiv:2404.11764  [pdf, other

    cs.CV

    Multimodal 3D Object Detection on Unseen Domains

    Authors: Deepti Hegde, Suhas Lohit, Kuan-Chuan Peng, Michael J. Jones, Vishal M. Patel

    Abstract: LiDAR datasets for autonomous driving exhibit biases in properties such as point cloud density, range, and object dimensions. As a result, object detection networks trained and evaluated in different environments often experience performance degradation. Domain adaptation approaches assume access to unannotated samples from the test distribution to address this problem. However, in the real world,… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: technical report

  9. arXiv:2404.11737  [pdf, other

    cs.CV

    Equivariant Spatio-Temporal Self-Supervision for LiDAR Object Detection

    Authors: Deepti Hegde, Suhas Lohit, Kuan-Chuan Peng, Michael J. Jones, Vishal M. Patel

    Abstract: Popular representation learning methods encourage feature invariance under transformations applied at the input. However, in 3D perception tasks like object localization and segmentation, outputs are naturally equivariant to some transformations, such as rotation. Using pre-training loss functions that encourage equivariance of features under certain transformations provides a strong self-supervis… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: technical report

  10. arXiv:2403.14217  [pdf, ps, other

    cs.CC

    Maximizing Phylogenetic Diversity under Time Pressure: Planning with Extinctions Ahead

    Authors: Mark Jones, Jannik Schestag

    Abstract: Phylogenetic Diversity (PD) is a measure of the overall biodiversity of a set of present-day species (taxa) within a phylogenetic tree. In Maximize Phylogenetic Diversity (MPD) one is asked to find a set of taxa (of bounded size/cost) for which this measure is maximized. MPD is a relevant problem in conservation planning, where there are not enough resources to preserve all taxa and minimizing the… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  11. arXiv:2403.12734  [pdf, other

    cs.DS cs.DM math.CO

    Exact and Heuristic Computation of the Scanwidth of Directed Acyclic Graphs

    Authors: Niels Holtgrefe, Leo van Iersel, Mark Jones

    Abstract: To measure the tree-likeness of a directed acyclic graph (DAG), a new width parameter that considers the directions of the arcs was recently introduced: scanwidth. We present the first algorithm that efficiently computes the exact scanwidth of general DAGs. For DAGs with one root and scanwidth $k$ it runs in $O(k \cdot n^k \cdot m)$ time. The algorithm also functions as an FPT algorithm with compl… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: 32 pages, 15 figures

  12. arXiv:2403.09613  [pdf, other

    cs.LG cs.CL

    Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training

    Authors: Yanlai Yang, Matt Jones, Michael C. Mozer, Mengye Ren

    Abstract: We explore the training dynamics of neural networks in a structured non-IID setting where documents are presented cyclically in a fixed, repeated sequence. Typically, networks suffer from catastrophic interference when training on a sequence of documents; however, we discover a curious and remarkable property of LLMs fine-tuned sequentially in this setting: they exhibit anticipatory behavior, reco… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: 19 pages, 18 figures

  13. arXiv:2403.02697  [pdf, other

    stat.ML cs.LG

    Noise misleads rotation invariant algorithms on sparse targets

    Authors: Manfred K. Warmuth, Wojciech Kotłowski, Matt Jones, Ehsan Amid

    Abstract: It is well known that the class of rotation invariant algorithms are suboptimal even for learning sparse linear problems when the number of examples is below the "dimension" of the problem. This class includes any gradient descent trained neural net with a fully-connected input layer (initialized with a rotationally symmetric distribution). The simplest sparse problem is learning a single feature… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  14. arXiv:2402.18593  [pdf, other

    cs.AR cs.AI cs.DC

    Sustainable Supercomputing for AI: GPU Power Capping at HPC Scale

    Authors: Dan Zhao, Siddharth Samsi, Joseph McDonald, Baolin Li, David Bestor, Michael Jones, Devesh Tiwari, Vijay Gadepally

    Abstract: As research and deployment of AI grows, the computational burden to support and sustain its progress inevitably does too. To train or fine-tune state-of-the-art models in NLP, computer vision, etc., some form of AI hardware acceleration is virtually a requirement. Recent large language models require considerable resources to train and deploy, resulting in significant energy usage, potential carbo… ▽ More

    Submitted 24 February, 2024; originally announced February 2024.

  15. Dynamic nowcast of the New Zealand greenhouse gas inventory

    Authors: Malcolm Jones, Hannah Chorley, Flynn Owen, Tamsyn Hilder, Holly Trowland, Paul Bracewell

    Abstract: As efforts to mitigate the effects of climate change grow, reliable and thorough reporting of greenhouse gas emissions are essential for measuring progress towards international and domestic emissions reductions targets. New Zealand's national emissions inventories are currently reported between 15 to 27 months out-of-date. We present a machine learning approach to nowcast (dynamically estimate) n… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    Journal ref: Environmental Modelling & Software 167 (2023), 105745

  16. arXiv:2402.00588  [pdf, other

    cs.RO cs.AI cs.MA

    BrainSLAM: SLAM on Neural Population Activity Data

    Authors: Kipp Freud, Nathan Lepora, Matt W. Jones, Cian O'Donnell

    Abstract: Simultaneous localisation and mapping (SLAM) algorithms are commonly used in robotic systems for learning maps of novel environments. Brains also appear to learn maps, but the mechanisms are not known and it is unclear how to infer these maps from neural activity data. We present BrainSLAM; a method for performing SLAM using only population activity (local field potential, LFP) data simultaneously… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: Accepted to the 23rd International Conference on Autonomous Agents and Multiagent Systems. 2024

  17. arXiv:2401.08787  [pdf, other

    cs.CV

    Segment Anything Model Can Not Segment Anything: Assessing AI Foundation Model's Generalizability in Permafrost Mapping

    Authors: Wenwen Li, Chia-Yu Hsu, Sizhe Wang, Yezhou Yang, Hyunho Lee, Anna Liljedahl, Chandi Witharana, Yili Yang, Brendan M. Rogers, Samantha T. Arundel, Matthew B. Jones, Kenton McHenry, Patricia Solis

    Abstract: This paper assesses trending AI foundation models, especially emerging computer vision foundation models and their performance in natural landscape feature segmentation. While the term foundation model has quickly garnered interest from the geospatial domain, its definition remains vague. Hence, this paper will first introduce AI foundation models and their defining characteristics. Built upon the… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

  18. arXiv:2401.05406  [pdf, other

    eess.SP cs.AI cs.LG cs.NI

    RFRL Gym: A Reinforcement Learning Testbed for Cognitive Radio Applications

    Authors: Daniel Rosen, Illa Rochez, Caleb McIrvin, Joshua Lee, Kevin D'Alessandro, Max Wiecek, Nhan Hoang, Ramzy Saffarini, Sam Philips, Vanessa Jones, Will Ivey, Zavier Harris-Smart, Zavion Harris-Smart, Zayden Chin, Amos Johnson, Alyse M. Jones, William C. Headley

    Abstract: Radio Frequency Reinforcement Learning (RFRL) is anticipated to be a widely applicable technology in the next generation of wireless communication systems, particularly 6G and next-gen military communications. Given this, our research is focused on developing a tool to promote the development of RFRL techniques that leverage spectrum sensing. In particular, the tool was designed to address two cog… ▽ More

    Submitted 20 December, 2023; originally announced January 2024.

  19. arXiv:2312.06791  [pdf, ps, other

    math.OC cs.LG

    Learning Polynomial Representations of Physical Objects with Application to Certifying Correct Packing Configurations

    Authors: Morgan Jones

    Abstract: This paper introduces a novel approach for learning polynomial representations of physical objects. Given a point cloud data set associated with a physical object, we solve a one-class classification problem to bound the data points by a polynomial sublevel set while harnessing Sum-of-Squares (SOS) programming to enforce prior shape knowledge constraints. By representing objects as polynomial subl… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  20. Dataset for Investigating Anomalies in Compute Clusters

    Authors: Diana McSpadden, Yasir Alanazi, Bryan Hess, Laura Hild, Mark Jones, Yiyang Lub, Ahmed Mohammed, Wesley Moore, Jie Ren, Malachi Schram, Evgenia Smirni

    Abstract: The dataset was collected for 332 compute nodes throughout May 19 - 23, 2023. May 19 - 22 characterizes normal compute cluster behavior, while May 23 includes an anomalous event. The dataset includes eight CPU, 11 disk, 47 memory, and 22 Slurm metrics. It represents five distinct hardware configurations and contains over one million records, totaling more than 180GB of raw data.

    Submitted 31 October, 2023; originally announced November 2023.

    Comments: Work utilizing the dataset was presented in a Research track poster at the Super Computing 2023 conference

  21. DeepPatent2: A Large-Scale Benchmarking Corpus for Technical Drawing Understanding

    Authors: Kehinde Ajayi, Xin Wei, Martin Gryder, Winston Shields, Jian Wu, Shawn M. Jones, Michal Kucer, Diane Oyen

    Abstract: Recent advances in computer vision (CV) and natural language processing have been driven by exploiting big data on practical applications. However, these research fields are still limited by the sheer volume, versatility, and diversity of the available datasets. CV tasks, such as image captioning, which has primarily been carried out on natural images, still struggle to produce accurate and meanin… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

  22. arXiv:2310.18334  [pdf, other

    cs.AR cs.DC

    Hypersparse Traffic Matrix Construction using GraphBLAS on a DPU

    Authors: William Bergeron, Michael Jones, Chase Barber, Kale DeYoung, George Amariucai, Kaleb Ernst, Nathan Fleming, Peter Michaleas, Sandeep Pisharody, Nathan Wells, Antonio Rosa, Eugene Vasserman, Jeremy Kepner

    Abstract: Low-power small form factor data processing units (DPUs) enable offloading and acceleration of a broad range of networking and security services. DPUs have accelerated the transition to programmable networking by enabling the replacement of FPGAs/ASICs in a wide range of network oriented devices. The GraphBLAS sparse matrix graph open standard math library is well-suited for constructing anonymize… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

  23. arXiv:2310.09145  [pdf, other

    cs.AI cs.DC

    Lincoln AI Computing Survey (LAICS) Update

    Authors: Albert Reuther, Peter Michaleas, Michael Jones, Vijay Gadepally, Siddharth Samsi, Jeremy Kepner

    Abstract: This paper is an update of the survey of AI accelerators and processors from past four years, which is now called the Lincoln AI Computing Survey - LAICS (pronounced "lace"). As in past years, this paper collects and summarizes the current commercial accelerators that have been publicly announced with peak performance and peak power consumption numbers. The performance and power values are plotted… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: 7 pages, 6 figures, 2023 IEEE High Performance Extreme Computing (HPEC) conference, September 2023

    ACM Class: C.1.4; C.4

  24. arXiv:2310.03003  [pdf, other

    cs.CL cs.DC

    From Words to Watts: Benchmarking the Energy Costs of Large Language Model Inference

    Authors: Siddharth Samsi, Dan Zhao, Joseph McDonald, Baolin Li, Adam Michaleas, Michael Jones, William Bergeron, Jeremy Kepner, Devesh Tiwari, Vijay Gadepally

    Abstract: Large language models (LLMs) have exploded in popularity due to their new generative capabilities that go far beyond prior state-of-the-art. These technologies are increasingly being leveraged in various domains such as law, finance, and medicine. However, these models carry significant computational challenges, especially the compute and energy costs required for inference. Inference energy costs… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

  25. arXiv:2310.00522  [pdf, other

    cs.SI

    Mapping of Internet "Coastlines" via Large Scale Anonymized Network Source Correlations

    Authors: Hayden Jananthan, Jeremy Kepner, Michael Jones, William Arcand, David Bestor, William Bergeron, Chansup Byun, Timothy Davis, Vijay Gadepally, Daniel Grant, Michael Houle, Matthew Hubbell, Anna Klein, Lauren Milechin, Guillermo Morales, Andrew Morris, Julie Mullen, Ritesh Patel, Alex Pentland, Sandeep Pisharody, Andrew Prout, Albert Reuther, Antonio Rosa, Siddharth Samsi, Tyler Trigg , et al. (3 additional authors not shown)

    Abstract: Expanding the scientific tools available to protect computer networks can be aided by a deeper understanding of the underlying statistical distributions of network traffic and their potential geometric interpretations. Analyses of large scale network observations provide a unique window into studying those underlying statistics. Newly developed GraphBLAS hypersparse matrices and D4M associative ar… ▽ More

    Submitted 30 September, 2023; originally announced October 2023.

    Comments: 9 pages, 7 figures, IEEE HPEC 2023 (accepted)

  26. arXiv:2309.16592  [pdf, other

    cs.CV cs.LG

    Tensor Factorization for Leveraging Cross-Modal Knowledge in Data-Constrained Infrared Object Detection

    Authors: Manish Sharma, Moitreya Chatterjee, Kuan-Chuan Peng, Suhas Lohit, Michael Jones

    Abstract: The primary bottleneck towards obtaining good recognition performance in IR images is the lack of sufficient labeled training data, owing to the cost of acquiring such data. Realizing that object detection methods for the RGB modality are quite robust (at least for some commonplace classes, like person, car, etc.), thanks to the giant training sets that exist, in this work we seek to leverage cues… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: Accepted to ICCV 2023, LIMIT Workshop. The first two authors contributed equally

  27. arXiv:2309.14531  [pdf, other

    cs.CV

    Pixel-Grounded Prototypical Part Networks

    Authors: Zachariah Carmichael, Suhas Lohit, Anoop Cherian, Michael Jones, Walter Scheirer

    Abstract: Prototypical part neural networks (ProtoPartNNs), namely PROTOPNET and its derivatives, are an intrinsically interpretable approach to machine learning. Their prototype learning scheme enables intuitive explanations of the form, this (prototype) looks like that (testing image patch). But, does this actually look like that? In this work, we delve into why object part localization and associated hea… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: 21 pages

  28. arXiv:2309.10656  [pdf, other

    cs.LG

    A spectrum of physics-informed Gaussian processes for regression in engineering

    Authors: Elizabeth J Cross, Timothy J Rogers, Daniel J Pitchforth, Samuel J Gibson, Matthew R Jones

    Abstract: Despite the growing availability of sensing and data in general, we remain unable to fully characterise many in-service engineering systems and structures from a purely data-driven approach. The vast data and resources available to capture human activity are unmatched in our engineered world, and, even in cases where data could be referred to as ``big,'' they will rarely hold information across op… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

  29. arXiv:2309.08588  [pdf, other

    cs.CV cs.RO

    Robust Frame-to-Frame Camera Rotation Estimation in Crowded Scenes

    Authors: Fabien Delattre, David Dirnfeld, Phat Nguyen, Stephen Scarano, Michael J. Jones, Pedro Miraldo, Erik Learned-Miller

    Abstract: We present an approach to estimating camera rotation in crowded, real-world scenes from handheld monocular video. While camera rotation estimation is a well-studied problem, no previous methods exhibit both high accuracy and acceptable speed in this setting. Because the setting is not addressed well by other datasets, we provide a new dataset and benchmark, with high-accuracy, rigorously verified… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: Published at ICCV 2023

  30. arXiv:2309.04976  [pdf, other

    cs.LG cs.AI eess.SY

    AVARS -- Alleviating Unexpected Urban Road Traffic Congestion using UAVs

    Authors: Jiaying Guo, Michael R. Jones, Soufiene Djahel, Shen Wang

    Abstract: Reducing unexpected urban traffic congestion caused by en-route events (e.g., road closures, car crashes, etc.) often requires fast and accurate reactions to choose the best-fit traffic signals. Traditional traffic light control systems, such as SCATS and SCOOT, are not efficient as their traffic data provided by induction loops has a low update frequency (i.e., longer than 1 minute). Moreover, th… ▽ More

    Submitted 10 September, 2023; originally announced September 2023.

  31. pPython Performance Study

    Authors: Chansup Byun, William Arcand, David Bestor, Bill Bergeron, Vijay Gadepally, Michael Houle, Matthew Hubbell, Hayden Jananthan, Michael Jones, Anna Klein, Peter Michaleas, Lauren Milechin, Guillermo Morales, Julie Mullen, Andrew Prout, Albert Reuther, Antonio Rosa, Siddharth Samsi, Charles Yee, Jeremy Kepner

    Abstract: pPython seeks to provide a parallel capability that provides good speed-up without sacrificing the ease of programming in Python by implementing partitioned global array semantics (PGAS) on top of a simple file-based messaging library (PythonMPI) in pure Python. pPython follows a SPMD (single program multiple data) model of computation. pPython runs on a single-node (e.g., a laptop) running Window… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2208.14908

  32. Deployment of Real-Time Network Traffic Analysis using GraphBLAS Hypersparse Matrices and D4M Associative Arrays

    Authors: Michael Jones, Jeremy Kepner, Andrew Prout, Timothy Davis, William Arcand, David Bestor, William Bergeron, Chansup Byun, Vijay Gadepally, Micheal Houle, Matthew Hubbell, Hayden Jananthan, Anna Klein, Lauren Milechin, Guillermo Morales, Julie Mullen, Ritesh Patel, Sandeep Pisharody, Albert Reuther, Antonio Rosa, Siddharth Samsi, Charles Yee, Peter Michaleas

    Abstract: Matrix/array analysis of networks can provide significant insight into their behavior and aid in their operation and protection. Prior work has demonstrated the analytic, performance, and compression capabilities of GraphBLAS (graphblas.org) hypersparse matrices and D4M (d4m.mit.edu) associative arrays (a mathematical superset of matrices). Obtaining the benefits of these capabilities requires int… ▽ More

    Submitted 8 December, 2023; v1 submitted 4 September, 2023; originally announced September 2023.

    Comments: Accepted to IEEE HPEC, 8 pages, 8 figures, 1 table, 69 references. arXiv admin note: text overlap with arXiv:2203.13934. text overlap with arXiv:2309.01806

  33. Focusing and Calibration of Large Scale Network Sensors using GraphBLAS Anonymized Hypersparse Matrices

    Authors: Jeremy Kepner, Michael Jones, Phil Dykstra, Chansup Byun, Timothy Davis, Hayden Jananthan, William Arcand, David Bestor, William Bergeron, Vijay Gadepally, Micheal Houle, Matthew Hubbell, Anna Klein, Lauren Milechin, Guillermo Morales, Julie Mullen, Ritesh Patel, Alex Pentland, Sandeep Pisharody, Andrew Prout, Albert Reuther, Antonio Rosa, Siddharth Samsi, Tyler Trigg, Charles Yee , et al. (1 additional authors not shown)

    Abstract: Defending community-owned cyber space requires community-based efforts. Large-scale network observations that uphold the highest regard for privacy are key to protecting our shared cyberspace. Deployment of the necessary network sensors requires careful sensor placement, focusing, and calibration with significant volumes of network observations. This paper demonstrates novel focusing and calibrati… ▽ More

    Submitted 4 September, 2023; originally announced September 2023.

    Comments: Accepted to IEEE HPEC, 9 pages, 12 figures, 1 table, 63 references, 2 appendices

  34. arXiv:2308.07358  [pdf, other

    cs.GR cs.AI cs.LG

    Conformal Predictions Enhanced Expert-guided Meshing with Graph Neural Networks

    Authors: Amin Heyrani Nobari, Justin Rey, Suhas Kodali, Matthew Jones, Faez Ahmed

    Abstract: Computational Fluid Dynamics (CFD) is widely used in different engineering fields, but accurate simulations are dependent upon proper meshing of the simulation domain. While highly refined meshes may ensure precision, they come with high computational costs. Similarly, adaptive remeshing techniques require multiple simulations and come at a great computational cost. This means that the meshing pro… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

  35. arXiv:2308.01999  [pdf, other

    quant-ph cs.PF cs.SE

    cuQuantum SDK: A High-Performance Library for Accelerating Quantum Science

    Authors: Harun Bayraktar, Ali Charara, David Clark, Saul Cohen, Timothy Costa, Yao-Lung L. Fang, Yang Gao, Jack Guan, John Gunnels, Azzam Haidar, Andreas Hehn, Markus Hohnerbach, Matthew Jones, Tom Lubowe, Dmitry Lyakh, Shinya Morino, Paul Springer, Sam Stanwyck, Igor Terentyev, Satya Varadhan, Jonathan Wong, Takuma Yamaguchi

    Abstract: We present the NVIDIA cuQuantum SDK, a state-of-the-art library of composable primitives for GPU-accelerated quantum circuit simulations. As the size of quantum devices continues to increase, making their classical simulation progressively more difficult, the availability of fast and scalable quantum circuit simulators becomes vital for quantum algorithm developers, as well as quantum hardware eng… ▽ More

    Submitted 3 August, 2023; originally announced August 2023.

    Comments: paper accepted at QCE 2023, journal reference will be updated whenever available

    MSC Class: 68Q12; 68Q09; 81P68;

  36. arXiv:2307.12451  [pdf, other

    q-bio.BM cs.LG stat.ML

    DiAMoNDBack: Diffusion-denoising Autoregressive Model for Non-Deterministic Backmapping of Cα Protein Traces

    Authors: Michael S. Jones, Kirill Shmilovich, Andrew L. Ferguson

    Abstract: Coarse-grained molecular models of proteins permit access to length and time scales unattainable by all-atom models and the simulation of processes that occur on long-time scales such as aggregation and folding. The reduced resolution realizes computational accelerations but an atomistic representation can be vital for a complete understanding of mechanistic details. Backmapping is the process of… ▽ More

    Submitted 23 July, 2023; originally announced July 2023.

  37. arXiv:2307.06458  [pdf, other

    cs.SI cs.CV cs.DL

    Discovering Image Usage Online: A Case Study With "Flatten the Curve''

    Authors: Shawn M. Jones, Diane Oyen

    Abstract: Understanding the spread of images across the web helps us understand the reuse of scientific visualizations and their relationship with the public. The "Flatten the Curve" graphic was heavily used during the COVID-19 pandemic to convey a complex concept in a simple form. It displays two curves comparing the impact on case loads for medical facilities if the populace either adopts or fails to adop… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: 6 pages, 5 figures, Presented as poster at JCDL 2023

    ACM Class: I.4.9; H.3.3; H.4.3; H.3.7

  38. arXiv:2306.07098  [pdf, other

    cs.LG cs.AI

    Efficiently Learning the Graph for Semi-supervised Learning

    Authors: Dravyansh Sharma, Maxwell Jones

    Abstract: Computational efficiency is a major bottleneck in using classic graph-based approaches for semi-supervised learning on datasets with a large number of unlabeled examples. Known techniques to improve efficiency typically involve an approximation of the graph regularization objective, but suffer two major drawbacks - first the graph is assumed to be known or constructed with heuristic hyperparameter… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: 29 pages, 9 figures

  39. arXiv:2305.19535  [pdf, other

    stat.ML cs.LG

    Low-rank extended Kalman filtering for online learning of neural networks from streaming data

    Authors: Peter G. Chang, Gerardo Durán-Martín, Alexander Y Shestopaloff, Matt Jones, Kevin Murphy

    Abstract: We propose an efficient online approximate Bayesian inference algorithm for estimating the parameters of a nonlinear function from a potentially non-stationary data stream. The method is based on the extended Kalman filter (EKF), but uses a novel low-rank plus diagonal decomposition of the posterior precision matrix, which gives a cost per step which is linear in the number of model parameters. In… ▽ More

    Submitted 27 June, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

    Journal ref: COLLAS conference 2023

  40. arXiv:2305.15591  [pdf, other

    cs.LG

    Lightweight Learner for Shared Knowledge Lifelong Learning

    Authors: Yunhao Ge, Yuecheng Li, Di Wu, Ao Xu, Adam M. Jones, Amanda Sofie Rios, Iordanis Fostiropoulos, Shixian Wen, Po-Hsuan Huang, Zachary William Murdock, Gozde Sahin, Shuo Ni, Kiran Lekkala, Sumedh Anand Sontakke, Laurent Itti

    Abstract: In Lifelong Learning (LL), agents continually learn as they encounter new conditions and tasks. Most current LL is limited to a single agent that learns tasks sequentially. Dedicated LL machinery is then deployed to mitigate the forgetting of old tasks as new tasks are learned. This is inherently slow. We propose a new Shared Knowledge Lifelong Learning (SKILL) challenge, which deploys a decentral… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: Transactions on Machine Learning Research (TMLR) paper

  41. arXiv:2305.08657  [pdf, other

    stat.ML cs.LG stat.AP

    Encoding Domain Expertise into Multilevel Models for Source Location

    Authors: Lawrence A. Bull, Matthew R. Jones, Elizabeth J. Cross, Andrew Duncan, Mark Girolami

    Abstract: Data from populations of systems are prevalent in many industrial applications. Machines and infrastructure are increasingly instrumented with sensing systems, emitting streams of telemetry data with complex interdependencies. In practice, data-centric monitoring procedures tend to consider these assets (and respective models) as distinct -- operating in isolation and associated with independent d… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

  42. arXiv:2305.03106  [pdf, other

    math.CO cs.DM

    Making a Network Orchard by Adding Leaves

    Authors: Leo van Iersel, Mark Jones, Esther Julien, Yukihiro Murakami

    Abstract: Phylogenetic networks are used to represent the evolutionary history of species. Recently, the new class of orchard networks was introduced, which were later shown to be interpretable as trees with additional horizontal arcs. This makes the network class ideal for capturing evolutionary histories that involve horizontal gene transfers. Here, we study the minimum number of additional leaves needed… ▽ More

    Submitted 8 May, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

    Comments: 19 pages, 6 figures

  43. arXiv:2305.02783  [pdf, ps, other

    cs.SE cs.AI cs.CL cs.PL

    Automated Code generation for Information Technology Tasks in YAML through Large Language Models

    Authors: Saurabh Pujar, Luca Buratti, Xiaojie Guo, Nicolas Dupuis, Burn Lewis, Sahil Suneja, Atin Sood, Ganesh Nalawade, Matthew Jones, Alessandro Morari, Ruchir Puri

    Abstract: The recent improvement in code generation capabilities due to the use of large language models has mainly benefited general purpose programming languages. Domain specific languages, such as the ones used for IT Automation, have received far less attention, despite involving many active developers and being an essential component of modern cloud platforms. This work focuses on the generation of Ans… ▽ More

    Submitted 23 May, 2023; v1 submitted 2 May, 2023; originally announced May 2023.

  44. arXiv:2301.11581  [pdf, other

    cs.AI cs.CY cs.DC cs.LG

    A Green(er) World for A.I

    Authors: Dan Zhao, Nathan C. Frey, Joseph McDonald, Matthew Hubbell, David Bestor, Michael Jones, Andrew Prout, Vijay Gadepally, Siddharth Samsi

    Abstract: As research and practice in artificial intelligence (A.I.) grow in leaps and bounds, the resources necessary to sustain and support their operations also grow at an increasing pace. While innovations and applications from A.I. have brought significant advances, from applications to vision and natural language to improvements to fields like medical imaging and materials engineering, their costs sho… ▽ More

    Submitted 27 January, 2023; originally announced January 2023.

    Comments: 8 pages, published in 2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

    Journal ref: D. Zhao et al., "A Green(er) World for A.I.," 2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), Lyon, France, 2022, pp. 742-750

  45. arXiv:2301.09568  [pdf, other

    q-bio.NC cs.LG

    Interpretable Classification of Early Stage Parkinson's Disease from EEG

    Authors: Amarpal Sahota, Amber Roguski, Matthew W. Jones, Michal Rolinski, Alan Whone, Raul Santos-Rodriguez, Zahraa S. Abdallah

    Abstract: Detecting Parkinson's Disease in its early stages using EEG data presents a significant challenge. This paper introduces a novel approach, representing EEG data as a 15-variate series of bandpower and peak frequency values/coefficients. The hypothesis is that this representation captures essential information from the noisy EEG signal, improving disease detection. Statistical features extracted fr… ▽ More

    Submitted 8 December, 2023; v1 submitted 20 January, 2023; originally announced January 2023.

  46. arXiv:2212.07900  [pdf, other

    cs.CV

    EVAL: Explainable Video Anomaly Localization

    Authors: Ashish Singh, Michael J. Jones, Erik Learned-Miller

    Abstract: We develop a novel framework for single-scene video anomaly localization that allows for human-understandable reasons for the decisions the system makes. We first learn general representations of objects and their motions (using deep networks) and then use these representations to build a high-level, location-dependent model of any particular scene. This model can be used to detect anomalies in ne… ▽ More

    Submitted 15 December, 2022; originally announced December 2022.

  47. arXiv:2211.02115  [pdf, other

    cs.CV cs.IR

    Abstract Images Have Different Levels of Retrievability Per Reverse Image Search Engine

    Authors: Shawn M. Jones, Diane Oyen

    Abstract: Much computer vision research has focused on natural images, but technical documents typically consist of abstract images, such as charts, drawings, diagrams, and schematics. How well do general web search engines discover abstract images? Recent advancements in computer vision and machine learning have led to the rise of reverse image search engines. Where conventional search engines accept a tex… ▽ More

    Submitted 3 November, 2022; originally announced November 2022.

    Comments: 20 pages; 7 figures; to be published in the proceedings of the Drawings and abstract Imagery: Representation and Analysis (DIRA) Workshop from ECCV 2022

    ACM Class: H.3.3; H.3.7; H.3.5; I.4.9

  48. arXiv:2211.00378  [pdf, ps, other

    cs.DS

    A Near-Linear Kernel for Two-Parsimony Distance

    Authors: Elise Deen, Leo van Iersel, Remie Janssen, Mark Jones, Yuki Murakami, Norbert Zeh

    Abstract: The maximum parsimony distance $d_{\textrm{MP}}(T_1,T_2)$ and the bounded-state maximum parsimony distance $d_{\textrm{MP}}^t(T_1,T_2)$ measure the difference between two phylogenetic trees $T_1,T_2$ in terms of the maximum difference between their parsimony scores for any character (with $t$ a bound on the number of states in the character, in the case of $d_{\textrm{MP}}^t(T_1,T_2)$). While comp… ▽ More

    Submitted 1 November, 2022; originally announced November 2022.

  49. AI and ML Accelerator Survey and Trends

    Authors: Albert Reuther, Peter Michaleas, Michael Jones, Vijay Gadepally, Siddharth Samsi, Jeremy Kepner

    Abstract: This paper updates the survey of AI accelerators and processors from past three years. This paper collects and summarizes the current commercial accelerators that have been publicly announced with peak performance and power consumption numbers. The performance and power values are plotted on a scatter graph, and a number of dimensions and observations from the trends on this plot are again discuss… ▽ More

    Submitted 8 October, 2022; originally announced October 2022.

    Comments: 10 pages, 4 figures, 2022 IEEE High Performance Extreme Computing (HPEC) Conference. arXiv admin note: substantial text overlap with arXiv:2009.00993, arXiv:2109.08957

    ACM Class: C.1.4; C.4

  50. arXiv:2209.15579  [pdf, other

    cs.LG

    Physically Meaningful Uncertainty Quantification in Probabilistic Wind Turbine Power Curve Models as a Damage Sensitive Feature

    Authors: J. H. Mclean, M. R. Jones, B. J. O'Connell, A. E Maguire, T. J. Rogers

    Abstract: A wind turbines' power curve is easily accessible damage sensitive data, and as such is a key part of structural health monitoring in wind turbines. Power curve models can be constructed in a number of ways, but the authors argue that probabilistic methods carry inherent benefits in this use case, such as uncertainty quantification and allowing uncertainty propagation analysis. Many probabilistic… ▽ More

    Submitted 30 September, 2022; originally announced September 2022.