Showing 1–18 of 18 results for author: Zhang, M M

  1. arXiv:2404.01697  [pdf, other]

    stat.ML cs.LG

    Preventing Model Collapse in Gaussian Process Latent Variable Models

    Authors: Ying Li, Zhidi Lin, Feng Yin, Michael Minyi Zhang

    Abstract: Gaussian process latent variable models (GPLVMs) are a versatile family of unsupervised learning models commonly used for dimensionality reduction. However, common challenges in modeling data with GPLVMs include inadequate kernel flexibility and improper selection of the projection noise, leading to a type of model collapse characterized by vague latent representations that do not reflect the unde…

    Submitted 18 June, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: International Conference on Machine Learning (ICML), 2024

  2. From Concept to Field Tests: Accelerated Development of Multi-AUV Missions Using a High-Fidelity Faster-than-Real-Time Simulator

    Authors: Timothy R. Player, Arjo Chakravarty, Mabel M. Zhang, Ben Yair Raanan, Brian Kieft, Yanwu Zhang, Brett Hobson

    Abstract: We designed and validated a novel simulator for efficient development of multi-robot marine missions. To accelerate development of cooperative behaviors, the simulator models the robots' operating conditions with moderately high fidelity and runs significantly faster than real time, including acoustic communications, dynamic environmental data, and high-resolution bathymetry in large worlds. The s…

    Submitted 17 November, 2023; originally announced November 2023.

    Journal ref: IEEE International Conference on Robotics and Automation (ICRA), London, United Kingdom, 2023, pp. 3102-3108

  3. arXiv:2311.00564  [pdf, other]

    stat.ML cs.LG

    Online Student-$t$ Processes with an Overall-local Scale Structure for Modelling Non-stationary Data

    Authors: Taole Sha, Michael Minyi Zhang

    Abstract: Time-dependent data often exhibit characteristics, such as non-stationarity and heavy-tailed errors, that would be inappropriate to model with the typical assumptions used in popular models. Thus, more flexible approaches are required to be able to accommodate such issues. To this end, we propose a Bayesian mixture of student-$t$ processes with an overall-local scale structure for the covariance.…

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: 9 pages, 5 figures

    MSC Class: 62F15

  4. arXiv:2308.14048  [pdf, other]

    stat.ML cs.LG stat.AP stat.CO stat.ME

    A Bayesian Non-parametric Approach to Generative Models: Integrating Variational Autoencoder and Generative Adversarial Networks using Wasserstein and Maximum Mean Discrepancy

    Authors: Forough Fazeli-Asl, Michael Minyi Zhang

    Abstract: Generative models have emerged as a promising technique for producing high-quality images that are indistinguishable from real images. Generative adversarial networks (GANs) and variational autoencoders (VAEs) are two of the most prominent and widely studied generative models. GANs have demonstrated excellent performance in generating sharp realistic images and VAEs have shown strong abilities to…

    Submitted 27 August, 2023; originally announced August 2023.

  5. arXiv:2306.08352  [pdf, other]

    stat.ML cs.AI cs.LG

    Bayesian Non-linear Latent Variable Modeling via Random Fourier Features

    Authors: Michael Minyi Zhang, Gregory W. Gundersen, Barbara E. Engelhardt

    Abstract: The Gaussian process latent variable model (GPLVM) is a popular probabilistic method used for nonlinear dimension reduction, matrix factorization, and state-space modeling. Inference for GPLVMs is computationally tractable only when the data likelihood is Gaussian. Moreover, inference for GPLVMs has typically been restricted to obtaining maximum a posteriori point estimates, which can lead to over…

    Submitted 14 June, 2023; originally announced June 2023.

  6. arXiv:2303.02637  [pdf, other]

    stat.ML cs.LG stat.AP stat.ME

    A Semi-Bayesian Nonparametric Estimator of the Maximum Mean Discrepancy Measure: Applications in Goodness-of-Fit Testing and Generative Adversarial Networks

    Authors: Forough Fazeli-Asl, Michael Minyi Zhang, Lizhen Lin

    Abstract: A classic inferential statistical problem is the goodness-of-fit (GOF) test. Such a test can be challenging when the hypothesized parametric model has an intractable likelihood and its distributional form is not available. Bayesian methods for GOF can be appealing due to their ability to incorporate expert knowledge through prior distributions. However, standard Bayesian methods for this test of…

    Submitted 10 November, 2023; v1 submitted 5 March, 2023; originally announced March 2023.

    Comments: Typos corrected, Secondary (simulation and theoretical) results added, Additional discussion added, references added

  7. arXiv:2209.02862  [pdf, other]

    cs.RO

    DAVE Aquatic Virtual Environment: Toward a General Underwater Robotics Simulator

    Authors: Mabel M. Zhang, Woen-Sug Choi, Jessica Herman, Duane Davis, Carson Vogt, Michael McCarrin, Yadunund Vijay, Dharini Dutia, William Lew, Steven Peters, Brian Bingham

    Abstract: We present DAVE Aquatic Virtual Environment (DAVE), an open source simulation stack for underwater robots, sensors, and environments. Conventional robotics simulators are not designed to address unique challenges that come with the marine environment, including but not limited to environment conditions that vary spatially and temporally, impaired or challenging perception, and the unavailability o…

    Submitted 6 September, 2022; originally announced September 2022.

    Comments: Accepted to IEEE/OES Autonomous Underwater Vehicles Symposium (AUV) 2022

  8. arXiv:2205.09909  [pdf, other]

    stat.ML cs.AI cs.LG

    Sparse Infinite Random Feature Latent Variable Modeling

    Authors: Michael Minyi Zhang

    Abstract: We propose a non-linear, Bayesian non-parametric latent variable model where the latent space is assumed to be sparse and infinite dimensional a priori using an Indian buffet process prior. A posteriori, the number of instantiated dimensions in the latent space is guaranteed to be finite. The purpose of placing the Indian buffet process on the latent variables is to: 1.) Automatically and probabil…

    Submitted 26 May, 2022; v1 submitted 19 May, 2022; originally announced May 2022.

  9. arXiv:2110.13587  [pdf, other]

    cs.LG

    Arbitrary Distribution Modeling with Censorship in Real-Time Bidding Advertising

    Authors: Xu Li, Michelle Ma Zhang, Youjun Tong, Zhenya Wang

    Abstract: The purpose of Inventory Pricing is to bid the right prices to online ad opportunities, which is crucial for a Demand-Side Platform (DSP) to win advertising auctions in Real-Time Bidding (RTB). In the planning stage, advertisers need the forecast of probabilistic models to make bidding decisions. However, most of the previous works made strong assumptions on the distribution form of the winning pr…

    Submitted 26 October, 2021; originally announced October 2021.

  10. arXiv:2010.08908  [pdf, other]

    stat.CO cs.LG math.OC

    Accelerated Algorithms for Convex and Non-Convex Optimization on Manifolds

    Authors: Lizhen Lin, Bayan Saparbayeva, Michael Minyi Zhang, David B. Dunson

    Abstract: We propose a general scheme for solving convex and non-convex optimization problems on manifolds. The central idea is that, by adding a multiple of the squared retraction distance to the objective function in question, we "convexify" the objective function and solve a series of convex sub-problems in the optimization procedure. One of the key challenges for optimization on manifolds is the difficu…

    Submitted 17 October, 2020; originally announced October 2020.

  11. arXiv:2006.11145  [pdf, other]

    stat.ML cs.LG stat.ME

    Latent variable modeling with random features

    Authors: Gregory W. Gundersen, Michael Minyi Zhang, Barbara E. Engelhardt

    Abstract: Gaussian process-based latent variable models are flexible and theoretically grounded tools for nonlinear dimension reduction, but generalizing to non-Gaussian data likelihoods within this nonlinear framework is statistically challenging. Here, we use random features to develop a family of nonlinear dimension reduction models that are easily extensible to non-Gaussian data likelihoods; we call the…

    Submitted 19 June, 2020; originally announced June 2020.

    Comments: 21 pages, 7 figures

  12. arXiv:2001.05591  [pdf, other]

    stat.ML cs.LG

    Distributed, partially collapsed MCMC for Bayesian Nonparametrics

    Authors: Avinava Dubey, Michael Minyi Zhang, Eric P. Xing, Sinead A. Williamson

    Abstract: Bayesian nonparametric (BNP) models provide elegant methods for discovering underlying latent features within a data set, but inference in such models can be slow. We exploit the fact that completely random measures, which commonly used models like the Dirichlet process and the beta-Bernoulli process can be expressed as, are decomposable into independent sub-measures. We use this decomposition to…

    Submitted 4 March, 2020; v1 submitted 15 January, 2020; originally announced January 2020.

    Comments: To appear in the 23rd International Conference on Artificial Intelligence and Statistics

    Journal ref: Artificial Intelligence and Statistics, 108:3685-3695, 2020

  13. arXiv:1910.06569  [pdf, other]

    cs.LG eess.SP stat.ML

    Probabilistic Time of Arrival Localization

    Authors: Fernando Perez-Cruz, Pablo M. Olmos, Michael Minyi Zhang, Howard Huang

    Abstract: In this paper, we take a new approach for time of arrival geo-localization. We show that the main sources of error in metropolitan areas are due to environmental imperfections that bias our solutions, and that we can rely on a probabilistic model to learn and compensate for them. The resulting localization error is validated using measurements from a live LTE cellular network to be less than 10 me…

    Submitted 15 October, 2019; originally announced October 2019.

    Comments: IEEE Signal Processing Letters, 2019

  14. Sequential Gaussian Processes for Online Learning of Nonstationary Functions

    Authors: Michael Minyi Zhang, Bianca Dumitrascu, Sinead A. Williamson, Barbara E. Engelhardt

    Abstract: Many machine learning problems can be framed in the context of estimating functions, and often these are time-dependent functions that are estimated in real-time as observations arrive. Gaussian processes (GPs) are an attractive choice for modeling real-valued nonlinear functions due to their flexibility and uncertainty quantification. However, the typical GP regression model suffers from several…

    Submitted 6 May, 2023; v1 submitted 23 May, 2019; originally announced May 2019.

    Journal ref: IEEE Transactions on Signal Processing, vol. 71, pp. 1539-1550, 2023

  15. arXiv:1904.08548  [pdf, other]

    stat.ML cs.LG

    A New Class of Time Dependent Latent Factor Models with Applications

    Authors: Sinead A. Williamson, Michael Minyi Zhang, Paul Damien

    Abstract: In many applications, observed data are influenced by some combination of latent causes. For example, suppose sensors are placed inside a building to record responses such as temperature, humidity, power consumption and noise levels. These random, observed responses are typically affected by many unobserved, latent factors (or features) within the building such as the number of individuals, the tu…

    Submitted 17 April, 2019; originally announced April 2019.

    Journal ref: Journal of Machine Learning Research 21(27):1-24, 2020

  16. arXiv:1810.11155  [pdf, other]

    stat.ML cs.LG

    Communication Efficient Parallel Algorithms for Optimization on Manifolds

    Authors: Bayan Saparbayeva, Michael Minyi Zhang, Lizhen Lin

    Abstract: The last decade has witnessed an explosion in the development of models, theory and computational algorithms for "big data" analysis. In particular, distributed computing has served as a natural and dominating paradigm for statistical inference. However, the existing literature on parallel inference almost exclusively focuses on Euclidean data and parameters. While this assumption is valid for man…

    Submitted 1 November, 2018; v1 submitted 25 October, 2018; originally announced October 2018.

    Comments: 15 pages

  17. arXiv:1804.10899  [pdf, ps, other]

    cs.CV cs.AI

    Scalable Angular Discriminative Deep Metric Learning for Face Recognition

    Authors: Bowen Wu, Huaming Wu, Monica M. Y. Zhang

    Abstract: With the development of deep learning, Deep Metric Learning (DML) has achieved great improvements in face recognition. Specifically, the widely used softmax loss in the training process often bring large intra-class variations, and feature normalization is only exploited in the testing process to compute the pair similarities. To bridge the gap, we impose the intra-class cosine similarity between…

    Submitted 30 April, 2018; v1 submitted 29 April, 2018; originally announced April 2018.

  18. arXiv:1703.00095  [pdf, other]

    cs.RO

    Active End-Effector Pose Selection for Tactile Object Recognition through Monte Carlo Tree Search

    Authors: Mabel M. Zhang, Nikolay Atanasov, Kostas Daniilidis

    Abstract: This paper considers the problem of active object recognition using touch only. The focus is on adaptively selecting a sequence of wrist poses that achieves accurate recognition by enclosure grasps. It seeks to minimize the number of touches and maximize recognition confidence. The actions are formulated as wrist poses relative to each other, making the algorithm independent of absolute workspace…

    Submitted 29 July, 2017; v1 submitted 28 February, 2017; originally announced March 2017.

    Comments: Accepted to International Conference on Intelligent Robots and Systems (IROS) 2017