Skip to main content

Showing 1–30 of 30 results for author: Yue, M

  1. arXiv:2405.20124  [pdf, other

    stat.ML cs.LG math.OC

    A Geometric Unification of Distributionally Robust Covariance Estimators: Shrinking the Spectrum by Inflating the Ambiguity Set

    Authors: Man-Chung Yue, Yves Rychener, Daniel Kuhn, Viet Anh Nguyen

    Abstract: The state-of-the-art methods for estimating high-dimensional covariance matrices all shrink the eigenvalues of the sample covariance matrix towards a data-insensitive shrinkage target. The underlying shrinkage transformation is either chosen heuristically - without compelling theoretical justification - or optimally in view of restrictive distributional assumptions. In this paper, we propose a pri… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  2. arXiv:2404.06711  [pdf, other

    cs.CL cs.HC

    MathVC: An LLM-Simulated Multi-Character Virtual Classroom for Mathematics Education

    Authors: Murong Yue, Wijdane Mifdal, Yixuan Zhang, Jennifer Suh, Ziyu Yao

    Abstract: Mathematical modeling (MM) is considered a fundamental skill for students in STEM disciplines. Practicing the MM skill is often the most effective when students can engage in group discussion and collaborative problem-solving. However, due to unevenly distributed teachers and educational resources needed to monitor such group activities, students do not always receive equal opportunities for this… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: Work in progress

  3. arXiv:2401.14095  [pdf, other

    cs.HC

    Evaluating User Experience and Data Quality in a Gamified Data Collection for Appearance-Based Gaze Estimation

    Authors: Mingtao Yue, Tomomi Sayuda, Miles Pennington, Yusuke Sugano

    Abstract: Appearance-based gaze estimation, which uses only a regular camera to estimate human gaze, is important in various application fields. While the technique faces data bias issues, data collection protocol is often demanding, and collecting data from a wide range of participants is difficult. It is an important challenge to design opportunities that allow a diverse range of people to participate whi… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

  4. arXiv:2312.07763  [pdf, other

    cs.CL

    Can LLM find the green circle? Investigation and Human-guided tool manipulation for compositional generalization

    Authors: Min Zhang, Jianfeng He, Shuo Lei, Murong Yue, Linhang Wang, Chang-Tien Lu

    Abstract: The meaning of complex phrases in natural language is composed of their individual components. The task of compositional generalization evaluates a model's ability to understand new combinations of components. Previous studies trained smaller, task-specific models, which exhibited poor generalization. While large language models (LLMs) exhibit impressive generalization abilities on many tasks thro… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: Accepted by ICASSP 2024

  5. arXiv:2311.11349  [pdf, other

    cs.LG math.OC

    Coverage-Validity-Aware Algorithmic Recourse

    Authors: Ngoc Bui, Duy Nguyen, Man-Chung Yue, Viet Anh Nguyen

    Abstract: Algorithmic recourse emerges as a prominent technique to promote the explainability, transparency and hence ethics of machine learning models. Existing algorithmic recourse approaches often assume an invariant predictive model; however, the predictive model is usually updated upon the arrival of new data. Thus, a recourse that is valid respective to the present model may become invalid for the fut… ▽ More

    Submitted 19 November, 2023; originally announced November 2023.

  6. arXiv:2310.03094  [pdf, other

    cs.CL cs.AI cs.LG

    Large Language Model Cascades with Mixture of Thoughts Representations for Cost-efficient Reasoning

    Authors: Murong Yue, Jie Zhao, Min Zhang, Liang Du, Ziyu Yao

    Abstract: Large language models (LLMs) such as GPT-4 have exhibited remarkable performance in a variety of tasks, but this strong performance often comes with the high expense of using paid API services. In this paper, we are motivated to study building an LLM cascade to save the cost of using LLMs, particularly for performing reasoning (e.g., mathematical, causal) tasks. Our cascade pipeline follows the in… ▽ More

    Submitted 8 February, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: Accepted to ICLR 2024

  7. arXiv:2308.04030  [pdf, other

    cs.AI

    Gentopia: A Collaborative Platform for Tool-Augmented LLMs

    Authors: Binfeng Xu, Xukun Liu, Hua Shen, Zeyu Han, Yuhan Li, Murong Yue, Zhiyuan Peng, Yuchen Liu, Ziyu Yao, Dongkuan Xu

    Abstract: Augmented Language Models (ALMs) empower large language models with the ability to use tools, transforming them into intelligent agents for real-world interactions. However, most existing frameworks for ALMs, to varying degrees, are deficient in the following critical features: flexible customization, collaborative democratization, and holistic evaluation. We present gentopia, an ALM framework ena… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

  8. Toward Zero-shot Character Recognition: A Gold Standard Dataset with Radical-level Annotations

    Authors: Xiaolei Diao, Daqian Shi, Jian Li, Lida Shi, Mingzhe Yue, Ruihua Qi, Chuntao Li, Hao Xu

    Abstract: Optical character recognition (OCR) methods have been applied to diverse tasks, e.g., street view text recognition and document analysis. Recently, zero-shot OCR has piqued the interest of the research community because it considers a practical OCR scenario with unbalanced data distribution. However, there is a lack of benchmarks for evaluating such zero-shot methods that apply a divide-and-conque… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: Accepted by ACM MM 2023

  9. arXiv:2307.01090  [pdf, other

    astro-ph.GA astro-ph.CO astro-ph.IM cs.CV cs.LG

    Streamlined Lensed Quasar Identification in Multiband Images via Ensemble Networks

    Authors: Irham Taufik Andika, Sherry H. Suyu, Raoul Cañameras, Alejandra Melo, Stefan Schuldt, Yiping Shu, Anna-Christina Eilers, Anton Timur Jaelani, Minghao Yue

    Abstract: Quasars experiencing strong lensing offer unique viewpoints on subjects related to the cosmic expansion rate, the dark matter profile within the foreground deflectors, and the quasar host galaxies. Unfortunately, identifying them in astronomical images is challenging since they are overwhelmed by the abundance of non-lenses. To address this, we have developed a novel approach by ensembling cutting… ▽ More

    Submitted 18 August, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

    Comments: Accepted for publication in the Astronomy & Astrophysics journal. 28 pages, 11 figures, and 3 tables. We welcome comments from the reader

    Journal ref: A&A 678, A103 (2023)

  10. arXiv:2302.06089  [pdf, other

    cs.CV cs.LG q-bio.QM

    Federated attention consistent learning models for prostate cancer diagnosis and Gleason grading

    Authors: Fei Kong, Xiyue Wang, Jinxi Xiang, Sen Yang, Xinran Wang, Meng Yue, Jun Zhang, Junhan Zhao, Xiao Han, Yuhan Dong, Biyue Zhu, Fang Wang, Yueping Liu

    Abstract: Artificial intelligence (AI) holds significant promise in transforming medical imaging, enhancing diagnostics, and refining treatment strategies. However, the reliance on extensive multicenter datasets for training AI models poses challenges due to privacy concerns. Federated learning provides a solution by facilitating collaborative model training across multiple centers without sharing raw data.… ▽ More

    Submitted 28 March, 2024; v1 submitted 12 February, 2023; originally announced February 2023.

    Comments: 14 pages

  11. arXiv:2301.12538  [pdf, other

    cs.LG cs.AI math.DS

    On Approximating the Dynamic Response of Synchronous Generators via Operator Learning: A Step Towards Building Deep Operator-based Power Grid Simulators

    Authors: Christian Moya, Guang Lin, Tianqiao Zhao, Meng Yue

    Abstract: This paper designs an Operator Learning framework to approximate the dynamic response of synchronous generators. One can use such a framework to (i) design a neural-based generator model that can interact with a numerical simulator of the rest of the power grid or (ii) shadow the generator's transient response. To this end, we design a data-driven Deep Operator Network~(DeepONet) that approximates… ▽ More

    Submitted 29 January, 2023; originally announced January 2023.

  12. arXiv:2209.13268  [pdf, other

    math.OC cs.LG math.NA

    Approximate Secular Equations for the Cubic Regularization Subproblem

    Authors: Yihang Gao, Man-Chung Yue, Michael K. Ng

    Abstract: The cubic regularization method (CR) is a popular algorithm for unconstrained non-convex optimization. At each iteration, CR solves a cubically regularized quadratic problem, called the cubic regularization subproblem (CRS). One way to solve the CRS relies on solving the secular equation, whose computational bottleneck lies in the computation of all eigenvalues of the Hessian matrix. In this paper… ▽ More

    Submitted 27 September, 2022; originally announced September 2022.

    Comments: Accepted to NeurIPS 2022

  13. arXiv:2209.10622  [pdf, other

    cs.LG

    DeepGraphONet: A Deep Graph Operator Network to Learn and Zero-shot Transfer the Dynamic Response of Networked Systems

    Authors: Yixuan Sun, Christian Moya, Guang Lin, Meng Yue

    Abstract: This paper develops a Deep Graph Operator Network (DeepGraphONet) framework that learns to approximate the dynamics of a complex system (e.g. the power grid or traffic) with an underlying sub-graph structure. We build our DeepGraphONet by fusing the ability of (i) Graph Neural Networks (GNN) to exploit spatially correlated graph information and (ii) Deep Operator Networks~(DeepONet) to approximate… ▽ More

    Submitted 21 September, 2022; originally announced September 2022.

  14. arXiv:2208.14259  [pdf, other

    cs.IT eess.SP

    RIS-Aided Multiuser MIMO-OFDM with Linear Precoding and Iterative Detection: Analysis and Optimization

    Authors: Mingyang Yue, Lei Liu, Xiaojun Yuan

    Abstract: In this paper, we consider a reconfigurable intelligence surface (RIS) aided uplink multiuser multi-input multi-output (MIMO) orthogonal frequency division multiplexing (OFDM) system, where the receiver is assumed to conduct low-complexity iterative detection. We aim to minimize the total transmit power by jointly designing the precoder of the transmitter and the passive beamforming of the RIS. Th… ▽ More

    Submitted 30 August, 2022; originally announced August 2022.

  15. arXiv:2206.10833  [pdf, other

    cs.LG

    Robust Bayesian Recourse

    Authors: Tuan-Duy H. Nguyen, Ngoc Bui, Duy Nguyen, Man-Chung Yue, Viet Anh Nguyen

    Abstract: Algorithmic recourse aims to recommend an informative feedback to overturn an unfavorable machine learning decision. We introduce in this paper the Bayesian recourse, a model-agnostic recourse that minimizes the posterior probability odds ratio. Further, we present its min-max robust counterpart with the goal of hedging against future changes in the machine learning model parameters. The robust co… ▽ More

    Submitted 22 June, 2022; originally announced June 2022.

    Comments: Accepted to UAI'22

  16. arXiv:2202.07176  [pdf, other

    math.NA cs.LG

    DeepONet-Grid-UQ: A Trustworthy Deep Operator Framework for Predicting the Power Grid's Post-Fault Trajectories

    Authors: Christian Moya, Shiqi Zhang, Meng Yue, Guang Lin

    Abstract: This paper proposes a new data-driven method for the reliable prediction of power system post-fault trajectories. The proposed method is based on the fundamentally new concept of Deep Operator Networks (DeepONets). Compared to traditional neural networks that learn to approximate functions, DeepONets are designed to approximate nonlinear operators. Under this operator framework, we design a DeepON… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

  17. arXiv:2202.03071  [pdf, other

    cs.LG math.OC stat.ML

    Distributionally Robust Fair Principal Components via Geodesic Descents

    Authors: Hieu Vu, Toan Tran, Man-Chung Yue, Viet Anh Nguyen

    Abstract: Principal component analysis is a simple yet useful dimensionality reduction technique in modern machine learning pipelines. In consequential domains such as college admission, healthcare and credit approval, it is imperative to take into account emerging criteria such as the fairness and the robustness of the learned projection. In this paper, we propose a distributionally robust optimization pro… ▽ More

    Submitted 7 February, 2022; originally announced February 2022.

    Comments: International Conference on Learning Representations (ICLR) 2022

  18. arXiv:2201.09145  [pdf, other

    cs.LG eess.SP

    glassoformer: a query-sparse transformer for post-fault power grid voltage prediction

    Authors: Yunling Zheng, Carson Hu, Guang Lin, Meng Yue, Bao Wang, Jack Xin

    Abstract: We propose GLassoformer, a novel and efficient transformer architecture leveraging group Lasso regularization to reduce the number of queries of the standard self-attention mechanism. Due to the sparsified queries, GLassoformer is more computationally efficient than the standard transformers. On the power grid post-fault voltage prediction task, GLassoformer shows remarkably better prediction than… ▽ More

    Submitted 22 January, 2022; originally announced January 2022.

  19. arXiv:2109.12846  [pdf, other

    cs.LG cs.AI cs.CY

    HAGEN: Homophily-Aware Graph Convolutional Recurrent Network for Crime Forecasting

    Authors: Chenyu Wang, Zongyu Lin, Xiaochen Yang, Jiao Sun, Mingxuan Yue, Cyrus Shahabi

    Abstract: The crime forecasting is an important problem as it greatly contributes to urban safety. Typically, the goal of the problem is to predict different types of crimes for each geographical region (like a neighborhood or censor tract) in the near future. Since nearby regions usually have similar socioeconomic characteristics which indicate similar crime patterns, recent state-of-the-art solutions cons… ▽ More

    Submitted 27 September, 2021; originally announced September 2021.

  20. arXiv:2106.00322  [pdf, other

    cs.LG math.OC stat.ML

    Sequential Domain Adaptation by Synthesizing Distributionally Robust Experts

    Authors: Bahar Taskesen, Man-Chung Yue, Jose Blanchet, Daniel Kuhn, Viet Anh Nguyen

    Abstract: Least squares estimators, when trained on a few target domain samples, may predict poorly. Supervised domain adaptation aims to improve the predictive accuracy by exploiting additional labeled training samples from a source distribution that is close to the target distribution. Given available data, we investigate novel strategies to synthesize a family of least squares estimator experts that are… ▽ More

    Submitted 1 June, 2021; originally announced June 2021.

  21. MIMO-OFDM-Based Massive Connectivity With Frequency Selectivity Compensation

    Authors: Wenjun Jiang, Mingyang Yue, Xiaojun Yuan, Yong Zuo

    Abstract: In this paper, we study how to efficiently and reliably detect active devices and estimate their channels in a multiple-input multiple-output (MIMO) orthogonal frequency-division multiplexing (OFDM) based grant-free non-orthogonal multiple access (NOMA) system to enable massive machine-type communications (mMTC). First, by exploiting the correlation of the channel frequency responses in narrow-ban… ▽ More

    Submitted 11 April, 2021; originally announced April 2021.

    Journal ref: IEEE Transactions on Wireless Communications, vol. 21, no. 9, pp. 6920-6934, Sept. 2022

  22. arXiv:2101.10386  [pdf, other

    cs.CR

    ProbLock: Probability-based Logic Locking

    Authors: Michael Yue, Fatemeh Tehranipoor

    Abstract: Integrated circuit (IC) piracy and overproduction are serious issues that threaten the security and integrity of a system. Logic locking is a type of hardware obfuscation technique where additional key gates are inserted into the circuit. Only the correct key can unlock the functionality of that circuit otherwise the system produces the wrong output. In an effort to hinder these threats on ICs, we… ▽ More

    Submitted 25 January, 2021; originally announced January 2021.

  23. arXiv:2009.07514  [pdf, other

    math.OC cs.LG eess.SP

    A Unified Approach to Synchronization Problems over Subgroups of the Orthogonal Group

    Authors: Huikang Liu, Man-Chung Yue, Anthony Man-Cho So

    Abstract: The problem of synchronization over a group $\mathcal{G}$ aims to estimate a collection of group elements $G^*_1, \dots, G^*_n \in \mathcal{G}$ based on noisy observations of a subset of all pairwise ratios of the form $G^*_i {G^*_j}^{-1}$. Such a problem has gained much attention recently and finds many applications across a wide range of scientific and engineering areas. In this paper, we consid… ▽ More

    Submitted 16 June, 2023; v1 submitted 16 September, 2020; originally announced September 2020.

  24. arXiv:2004.07162  [pdf, ps, other

    math.OC cs.LG

    On Linear Optimization over Wasserstein Balls

    Authors: Man-Chung Yue, Daniel Kuhn, Wolfram Wiesemann

    Abstract: Wasserstein balls, which contain all probability measures within a pre-specified Wasserstein distance to a reference measure, have recently enjoyed wide popularity in the distributionally robust optimization and machine learning communities to formulate and solve data-driven optimization problems with rigorous statistical guarantees. In this technical note we prove that the Wasserstein ball is wea… ▽ More

    Submitted 6 June, 2021; v1 submitted 15 April, 2020; originally announced April 2020.

  25. arXiv:2003.01351  [pdf, other

    cs.LG stat.ML

    DETECT: Deep Trajectory Clustering for Mobility-Behavior Analysis

    Authors: Mingxuan Yue, Yaguang Li, Haoze Yang, Ritesh Ahuja, Yao-Yi Chiang, Cyrus Shahabi

    Abstract: Identifying mobility behaviors in rich trajectory data is of great economic and social interest to various applications including urban planning, marketing and intelligence. Existing work on trajectory clustering often relies on similarity measurements that utilize raw spatial and/or temporal information of trajectories. These measures are incapable of identifying similar moving behaviors that exh… ▽ More

    Submitted 3 March, 2020; originally announced March 2020.

    Comments: Published as a conference paper at BigData 2019

  26. arXiv:1910.10583  [pdf, other

    cs.LG math.OC stat.ML

    Optimistic Distributionally Robust Optimization for Nonparametric Likelihood Approximation

    Authors: Viet Anh Nguyen, Soroosh Shafieezadeh-Abadeh, Man-Chung Yue, Daniel Kuhn, Wolfram Wiesemann

    Abstract: The likelihood function is a fundamental component in Bayesian statistics. However, evaluating the likelihood of an observation is computationally intractable in many applications. In this paper, we propose a non-parametric approximation of the likelihood that identifies a probability measure which lies in the neighborhood of the nominal measure and that maximizes the probability of observing the… ▽ More

    Submitted 23 October, 2019; originally announced October 2019.

  27. arXiv:1910.07817  [pdf, other

    math.OC cs.LG stat.ML

    Calculating Optimistic Likelihoods Using (Geodesically) Convex Optimization

    Authors: Viet Anh Nguyen, Soroosh Shafieezadeh-Abadeh, Man-Chung Yue, Daniel Kuhn, Wolfram Wiesemann

    Abstract: A fundamental problem arising in many areas of machine learning is the evaluation of the likelihood of a given observation under different nominal distributions. Frequently, these nominal distributions are themselves estimated from data, which makes them susceptible to estimation errors. We thus propose to replace each nominal distribution with an ambiguity set containing all distributions in its… ▽ More

    Submitted 17 October, 2019; originally announced October 2019.

  28. A Robust Design for MISO Physical-Layer Multicasting over Line-of-Sight Channels

    Authors: Man-Chung Yue, Sissi Xiaoxiao Wu, Anthony Man-Cho So

    Abstract: This paper studies a robust design problem for far-field line-of-sight (LOS) channels where phase errors are present. Compared with the commonly used additive error model, the phase error model is more suitable for capturing the uncertainty in an LOS channel, as the dominant source of uncertainty lies in the phase. We consider a multiple-input single-output (MISO) multicast scenario, in which our… ▽ More

    Submitted 13 November, 2015; originally announced November 2015.

    Comments: This manuscript is submitted for possible journal publication on 13-Nov-2015

  29. arXiv:1405.4890  [pdf

    cs.CE

    A Revised Incremental Conductance MPPT Algorithm for Solar PV Generation Systems

    Authors: Meng Yue, Xiaoyu Wang

    Abstract: A revised Incremental Conductance (IncCond) maximum power point tracking (MPPT) algorithm for PV generation systems is proposed in this paper. The commonly adopted traditional IncCond method uses a constant step size for voltage adjustment and is difficult to achieve both a good tracking performance and quick elimination of the oscillations, especially under the dramatic changes of the environment… ▽ More

    Submitted 19 May, 2014; originally announced May 2014.

  30. arXiv:1209.0377  [pdf, ps, other

    math.OC cs.IT

    A Perturbation Inequality for the Schatten-$p$ Quasi-Norm and Its Applications to Low-Rank Matrix Recovery

    Authors: Man-Chung Yue, Anthony Man-Cho So

    Abstract: In this paper, we establish the following perturbation result concerning the singular values of a matrix: Let $A,B \in \mathbb{R}^{m\times n}$ be given matrices, and let $f:\mathbb{R}_+\rightarrow\mathbb{R}_+$ be a concave function satisfying $f(0)=0$. Then, we have $$ \sum_{i=1}^{\min\{m,n\}} \big| f(σ_i(A)) - f(σ_i(B)) \big| \le \sum_{i=1}^{\min\{m,n\}} f(σ_i(A-B)), $$ where $σ_i(\cdot)$ denotes… ▽ More

    Submitted 27 June, 2014; v1 submitted 3 September, 2012; originally announced September 2012.