Showing 1–50 of 92 results for author: Zou, S

  1. arXiv:2406.17795  [pdf, other]

    cs.CV cs.GR

    RACon: Retrieval-Augmented Simulated Character Locomotion Control

    Authors: Yuxuan Mu, Shihao Zou, Kangning Yin, Zheng Tian, Li Cheng, Weinan Zhang, Jun Wang

    Abstract: In computer animation, driving a simulated character with lifelike motion is challenging. Current generative models, though able to generalize to diverse motions, often pose challenges to the responsiveness of end-user control. To address these issues, we introduce RACon: Retrieval-Augmented Simulated Character Locomotion Control. Our end-to-end hierarchical reinforcement learning method utilizes… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted in ICME2024 for oral presentation

  2. arXiv:2406.17096  [pdf, other]

    cs.LG cs.AI stat.ML

    Model-Free Robust Reinforcement Learning with Sample Complexity Analysis

    Authors: Yudan Wang, Shaofeng Zou, Yue Wang

    Abstract: Distributionally Robust Reinforcement Learning (DR-RL) aims to derive a policy optimizing the worst-case performance within a predefined uncertainty set. Despite extensive research, previous DR-RL algorithms have predominantly favored model-based approaches, with limited availability of model-free methods offering convergence guarantees or sample complexities. This paper proposes a model-free DR-R… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: UAI 2024

  3. arXiv:2406.10534  [pdf, other]

    cs.LG cs.AI physics.flu-dyn

    A Finite Difference Informed Graph Network for Solving Steady-State Incompressible Flows on Block-Structured Grids

    Authors: Yiye Zou, Tianyu Li, Shufan Zou, Jingyu Wang, Laiping Zhang, Xiaogang Deng

    Abstract: Recently, advancements in deep learning have enabled physics-informed neural networks (PINNs) to solve partial differential equations (PDEs). Numerical differentiation (ND) using the finite difference (FD) method is efficient in physics-constrained designs, even in parameterized settings, often employing body-fitted block-structured grids for complex flow cases. However, convolution operators in C… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

  4. arXiv:2406.01762  [pdf, other]

    cs.LG cs.AI stat.ML

    Non-Asymptotic Analysis for Single-Loop (Natural) Actor-Critic with Compatible Function Approximation

    Authors: Yudan Wang, Yue Wang, Yi Zhou, Shaofeng Zou

    Abstract: Actor-critic (AC) is a powerful method for learning an optimal policy in reinforcement learning, where the critic uses algorithms, e.g., temporal difference (TD) learning with function approximation, to evaluate the current policy and the actor updates the policy along an approximate gradient direction using information from the critic. This paper provides the tightest non-asymptotic conv…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: ICML 2024

  5. arXiv:2405.19440  [pdf, other]

    cs.LG math.OC stat.ML

    On the Convergence of Multi-objective Optimization under Generalized Smoothness

    Authors: Qi Zhang, Peiyao Xiao, Kaiyi Ji, Shaofeng Zou

    Abstract: Multi-objective optimization (MOO) is receiving more attention in various fields such as multi-task learning. Recent works provide some effective algorithms with theoretical analysis but they are limited by the standard $L$-smooth or bounded-gradient assumptions, which are typically unsatisfactory for neural networks, such as recurrent neural networks (RNNs) and transformers. In this paper, we stu… ▽ More

    Submitted 1 July, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

  6. arXiv:2405.16077  [pdf, ps, other]

    cs.LG

    Finite-Time Analysis for Conflict-Avoidant Multi-Task Reinforcement Learning

    Authors: Yudan Wang, Peiyao Xiao, Hao Ban, Kaiyi Ji, Shaofeng Zou

    Abstract: Multi-task reinforcement learning (MTRL) has shown great promise in many real-world applications. Existing MTRL algorithms often aim to learn a policy that optimizes individual objective functions simultaneously with a given prior preference (or weights) on different tasks. However, these methods often suffer from the issue of gradient conflict such that the tasks with larger gradients do…

    Submitted 10 June, 2024; v1 submitted 25 May, 2024; originally announced May 2024.

    Comments: Initial submission at the 41st International Conference on Machine Learning

  7. arXiv:2405.01327  [pdf, other]

    cs.LG

    Constrained Reinforcement Learning Under Model Mismatch

    Authors: Zhongchang Sun, Sihong He, Fei Miao, Shaofeng Zou

    Abstract: Existing studies on constrained reinforcement learning (RL) may obtain a well-performing policy in the training environment. However, when deployed in a real environment, it may easily violate constraints that were originally satisfied during training because there might be model mismatch between the training and real environments. To address the above challenge, we formulate the problem as constr… ▽ More

    Submitted 3 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: ICML 2024

  8. arXiv:2405.00998  [pdf, other]

    cs.CV

    Part-aware Shape Generation with Latent 3D Diffusion of Neural Voxel Fields

    Authors: Yuhang Huang, Shilong Zou, Xinwang Liu, Kai Xu

    Abstract: This paper presents a novel latent 3D diffusion model for the generation of neural voxel fields, aiming to achieve accurate part-aware structures. Compared to existing methods, there are two key designs to ensure high-quality and accurate part-aware generation. On one hand, we introduce a latent 3D diffusion process for neural voxel fields, enabling generation at significantly higher resolutions t… ▽ More

    Submitted 20 June, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

  9. arXiv:2404.07779  [pdf, other]

    cs.SI physics.soc-ph

    Improving Network Degree Correlation by Degree-preserving Rewiring

    Authors: Shuo Zou, Bo Zhou, Qi Xuan

    Abstract: Degree correlation is a crucial measure in networks, significantly impacting network topology and dynamical behavior. The degree sequence of a network is a significant characteristic, and altering network degree correlation through degree-preserving rewiring poses an interesting problem. In this paper, we define the problem of maximizing network degree correlation through a finite number of rewiri… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  10. arXiv:2404.01436  [pdf, ps, other]

    stat.ML cs.LG math.OC

    Convergence Guarantees for RMSProp and Adam in Generalized-smooth Non-convex Optimization with Affine Noise Variance

    Authors: Qi Zhang, Yi Zhou, Shaofeng Zou

    Abstract: This paper provides the first tight convergence analyses for RMSProp and Adam in non-convex optimization under the most relaxed assumptions of coordinate-wise generalized smoothness and affine noise variance. We first analyze RMSProp, which is a special case of Adam with adaptive learning rates but without first-order momentum. Specifically, to solve the challenges due to dependence among adaptive… ▽ More

    Submitted 3 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

  11. arXiv:2404.01200  [pdf, other]

    stat.ML cs.LG

    Large-Scale Non-convex Stochastic Constrained Distributionally Robust Optimization

    Authors: Qi Zhang, Yi Zhou, Ashley Prater-Bennette, Lixin Shen, Shaofeng Zou

    Abstract: Distributionally robust optimization (DRO) is a powerful framework for training robust models against data distribution shifts. This paper focuses on constrained DRO, which has an explicit characterization of the robustness level. Existing studies on constrained DRO mostly focus on convex loss function, and exclude the practical and challenging case with non-convex loss function, e.g., neural netw… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: We have corrected Theorem 1 in Sec. 4 for the AAAI 2024 version, where the order of $n_z$ changes from $ε^{-k_*}$ to $ε^{-2k_*-2}$

  12. arXiv:2403.07257  [pdf, other]

    cs.AR cs.ET

    The Dawn of AI-Native EDA: Opportunities and Challenges of Large Circuit Models

    Authors: Lei Chen, Yiqi Chen, Zhufei Chu, Wenji Fang, Tsung-Yi Ho, Ru Huang, Yu Huang, Sadaf Khan, Min Li, Xingquan Li, Yu Li, Yun Liang, Jinwei Liu, Yi Liu, Yibo Lin, Guojie Luo, Zhengyuan Shi, Guangyu Sun, Dimitrios Tsaras, Runsheng Wang, Ziyi Wang, Xinming Wei, Zhiyao Xie, Qiang Xu, Chenhao Xue, et al. (14 additional authors not shown)

    Abstract: Within the Electronic Design Automation (EDA) domain, AI-driven solutions have emerged as formidable tools, yet they typically augment rather than redefine existing methodologies. These solutions often repurpose deep learning models from other domains, such as vision, text, and graph analytics, applying them to circuit design without tailoring to the unique complexities of electronic circuits. Suc… ▽ More

    Submitted 1 May, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: The authors are ordered alphabetically. Contact: qxu@cse[dot]cuhk[dot]edu[dot]hk, gluo@pku[dot]edu[dot]cn, yuan.mingxuan@huawei[dot]com

  13. arXiv:2403.00691  [pdf, other]

    cs.CV cs.AI

    Tri-Modal Motion Retrieval by Learning a Joint Embedding Space

    Authors: Kangning Yin, Shihao Zou, Yuxuan Ge, Zheng Tian

    Abstract: Information retrieval is an ever-evolving and crucial research domain. The substantial demand for high-quality human motion data especially in online acquirement has led to a surge in human motion research works. Prior works have mainly concentrated on dual-modality learning, such as text and motion tasks, but three-modality learning has been rarely explored. Intuitively, an extra introduced modal… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

  14. arXiv:2402.17570  [pdf, other]

    cs.LG stat.AP stat.ME

    Sparse Variational Contaminated Noise Gaussian Process Regression with Applications in Geomagnetic Perturbations Forecasting

    Authors: Daniel Iong, Matthew McAnear, Yuezhou Qu, Shasha Zou, Gabor Toth, Yang Chen

    Abstract: Gaussian Processes (GP) have become popular machine-learning methods for kernel-based learning on datasets with complicated covariance structures. In this paper, we present a novel extension to the GP framework using a contaminated normal likelihood function to better account for heteroscedastic variance and outlier noise. We propose a scalable inference algorithm based on the Sparse Variational G… ▽ More

    Submitted 2 July, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

  15. arXiv:2402.02700  [pdf, ps, other]

    cs.LG stat.ML

    Sample Complexity Characterization for Linear Contextual MDPs

    Authors: Junze Deng, Yuan Cheng, Shaofeng Zou, Yingbin Liang

    Abstract: Contextual Markov decision processes (CMDPs) describe a class of reinforcement learning problems in which the transition kernels and reward functions can change over time with different MDPs indexed by a context variable. While CMDPs serve as an important framework to model many real-world applications with time-varying environments, they are largely unexplored from theoretical perspective. In thi… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: Accepted to AISTATS 2024

  16. arXiv:2401.07709  [pdf, other]

    cs.CV cs.AI

    Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks

    Authors: Siyu Zou, Jiji Tang, Yiyi Zhou, Jing He, Chaoyi Zhao, Rongsheng Zhang, Zhipeng Hu, Xiaoshuai Sun

    Abstract: Diffusion-based Image Editing (DIE) is an emerging research hot-spot, which often applies a semantic mask to control the target area for diffusion-based editing. However, most existing solutions obtain these masks via manual operations or off-line processing, greatly reducing their efficiency. In this paper, we propose a novel and efficient image editing method for Text-to-Image (T2I) diffusion mo… ▽ More

    Submitted 23 January, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

    Comments: Accepted by AAAI2024

  17. arXiv:2312.14410  [pdf, other]

    cs.CV

    A Multi-Stage Adaptive Feature Fusion Neural Network for Multimodal Gait Recognition

    Authors: Shinan Zou, Jianbo Xiong, Chao Fan, Shiqi Yu, Jin Tang

    Abstract: Gait recognition is a biometric technology that has received extensive attention. Most existing gait recognition algorithms are unimodal, and a few multimodal gait recognition algorithms perform multimodal fusion only once. None of these algorithms may fully exploit the complementary advantages of the multiple modalities. In this paper, by considering the temporal and spatial characteristics of ga… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: This paper has been accepted by IJCB2023

    ACM Class: I.5

    Journal ref: IJCB2023

  18. arXiv:2312.14404  [pdf, other]

    cs.CV

    Cross-Covariate Gait Recognition: A Benchmark

    Authors: Shinan Zou, Chao Fan, Jianbo Xiong, Chuanfu Shen, Shiqi Yu, Jin Tang

    Abstract: Gait datasets are essential for gait research. However, this paper observes that present benchmarks, whether conventional constrained or emerging real-world datasets, fall short regarding covariate diversity. To bridge this gap, we undertake an arduous 20-month effort to collect a cross-covariate gait recognition (CCGR) dataset. The CCGR dataset has 970 subjects and about 1.6 million sequences; al… ▽ More

    Submitted 4 March, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: AAAI2024

    ACM Class: I.5

    Journal ref: AAAI2024

  19. arXiv:2311.16494  [pdf, other]

    cs.CV

    ArGue: Attribute-Guided Prompt Tuning for Vision-Language Models

    Authors: Xinyu Tian, Shu Zou, Zhaoyuan Yang, Jing Zhang

    Abstract: Although soft prompt tuning is effective in efficiently adapting Vision-Language (V&L) models for downstream tasks, it shows limitations in dealing with distribution shifts. We address this issue with Attribute-Guided Prompt Tuning (ArGue), making three key contributions. 1) In contrast to the conventional approach of directly appending soft prompts preceding class names, we align the model with p… ▽ More

    Submitted 12 March, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: Accepted to CVPR2024

  20. arXiv:2310.08924  [pdf, other]

    cs.SI physics.soc-ph

    Attacking The Assortativity Coefficient Under A Rewiring Strategy

    Authors: Shuo Zou, Bo Zhou, Qi Xuan

    Abstract: Degree correlation is an important characteristic of networks, which is usually quantified by the assortativity coefficient. However, concerns arise about changing the assortativity coefficient of a network when networks suffer from adversarial attacks. In this paper, we analyze the factors that affect the assortativity coefficient and study the optimization problem of maximizing or minimizing the… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

  21. arXiv:2310.08789  [pdf, other]

    eess.SP cs.IT

    Quickest Change Detection in Autoregressive Models

    Authors: Zhongchang Sun, Shaofeng Zou

    Abstract: The problem of quickest change detection (QCD) in autoregressive (AR) models is investigated. A system is being monitored with sequentially observed samples. At some unknown time, a disturbance signal occurs and changes the distribution of the observations. The disturbance signal follows an AR model, which is dependent over time. Before the change, observations only consist of measurement noise, a… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

  22. arXiv:2310.04456  [pdf, other]

    cs.CL cs.SD eess.AS

    Multimodal Prompt Transformer with Hybrid Contrastive Learning for Emotion Recognition in Conversation

    Authors: Shihao Zou, Xianying Huang, Xudong Shen

    Abstract: Emotion Recognition in Conversation (ERC) plays an important role in driving the development of human-machine interaction. Emotions can exist in multiple modalities, and multimodal ERC mainly faces two problems: (1) the noise problem in the cross-modal information fusion process, and (2) the prediction problem of less sample emotion labels that are semantically similar but different categories. To… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: Accepted to ACM MM 2023

  23. arXiv:2307.16212  [pdf, other]

    cs.LG cs.AI cs.GT cs.MA eess.SY

    Robust Multi-Agent Reinforcement Learning with State Uncertainty

    Authors: Sihong He, Songyang Han, Sanbao Su, Shuo Han, Shaofeng Zou, Fei Miao

    Abstract: In real-world multi-agent reinforcement learning (MARL) applications, agents may not have perfect state information (e.g., due to inaccurate measurement or malicious attacks), which challenges the robustness of agents' policies. Though robustness is getting important in MARL deployment, little prior work has studied state uncertainties in MARL, neither in problem formulation nor algorithm design.… ▽ More

    Submitted 30 July, 2023; originally announced July 2023.

    Comments: 50 pages, Published in TMLR, Transactions on Machine Learning Research (06/2023)

  24. arXiv:2305.13289  [pdf, other]

    cs.LG

    Achieving the Minimax Optimal Sample Complexity of Offline Reinforcement Learning: A DRO-Based Approach

    Authors: Yue Wang, Jinjun Xiong, Shaofeng Zou

    Abstract: Offline reinforcement learning aims to learn from pre-collected datasets without active exploration. This problem faces significant challenges, including limited data availability and distributional shifts. Existing approaches adopt a pessimistic stance towards uncertainty by penalizing rewards of under-explored state-action pairs to estimate value functions conservatively. In this paper, we show… ▽ More

    Submitted 3 December, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

  25. arXiv:2305.10504  [pdf, other]

    cs.LG cs.AI

    Model-Free Robust Average-Reward Reinforcement Learning

    Authors: Yue Wang, Alvaro Velasquez, George Atia, Ashley Prater-Bennette, Shaofeng Zou

    Abstract: Robust Markov decision processes (MDPs) address the challenge of model uncertainty by optimizing the worst-case performance over an uncertainty set of MDPs. In this paper, we focus on the robust average-reward MDPs under the model-free setting. We first theoretically characterize the structure of solutions to the robust average-reward Bellman equation, which is essential for our later convergence… ▽ More

    Submitted 17 May, 2023; originally announced May 2023.

    Comments: ICML 2023

  26. arXiv:2303.09681  [pdf, other]

    cs.CV

    Event-based Human Pose Tracking by Spiking Spatiotemporal Transformer

    Authors: Shihao Zou, Yuxuan Mu, Xinxin Zuo, Sen Wang, Li Cheng

    Abstract: Event camera, as an emerging biologically-inspired vision sensor for capturing motion dynamics, presents new potential for 3D human pose tracking, or video-based 3D human pose estimation. However, existing works in pose tracking either require the presence of additional gray-scale images to establish a solid starting pose, or ignore the temporal dependencies all together by collapsing segments of… ▽ More

    Submitted 6 September, 2023; v1 submitted 16 March, 2023; originally announced March 2023.

  27. arXiv:2301.00858  [pdf, other]

    cs.LG cs.AI

    Robust Average-Reward Markov Decision Processes

    Authors: Yue Wang, Alvaro Velasquez, George Atia, Ashley Prater-Bennette, Shaofeng Zou

    Abstract: In robust Markov decision processes (MDPs), the uncertainty in the transition kernel is addressed by finding a policy that optimizes the worst-case performance over an uncertainty set of MDPs. While much of the literature has focused on discounted MDPs, robust average-reward MDPs remain largely unexplored. In this paper, we focus on robust average-reward MDPs, where the goal is to find a policy th… ▽ More

    Submitted 1 March, 2023; v1 submitted 2 January, 2023; originally announced January 2023.

    Comments: AAAI 2023

  28. arXiv:2212.11123  [pdf, other]

    cs.CV cs.AI cs.RO

    THMA: Tencent HD Map AI System for Creating HD Map Annotations

    Authors: Kun Tang, Xu Cao, Zhipeng Cao, Tong Zhou, Erlong Li, Ao Liu, Shengtao Zou, Chang Liu, Shuqi Mei, Elena Sizikova, Chao Zheng

    Abstract: Nowadays, autonomous vehicle technology is becoming more and more mature. Critical to progress and safety, high-definition (HD) maps, a type of centimeter-level map collected using a laser sensor, provide accurate descriptions of the surrounding environment. The key challenge of HD map production is efficient, high-quality collection and annotation of large-volume datasets. Due to the demand for h… ▽ More

    Submitted 14 December, 2022; originally announced December 2022.

    Comments: IAAI 2023

  29. arXiv:2212.02705  [pdf, other]

    cs.AI cs.GT cs.MA

    What is the Solution for State-Adversarial Multi-Agent Reinforcement Learning?

    Authors: Songyang Han, Sanbao Su, Sihong He, Shuo Han, Haizhao Yang, Shaofeng Zou, Fei Miao

    Abstract: Various methods for Multi-Agent Reinforcement Learning (MARL) have been developed with the assumption that agents' policies are based on accurate state information. However, policies learned through Deep Reinforcement Learning (DRL) are susceptible to adversarial state perturbation attacks. In this work, we propose a State-Adversarial Markov Game (SAMG) and make the first attempt to investigate di… ▽ More

    Submitted 12 April, 2024; v1 submitted 5 December, 2022; originally announced December 2022.

    Comments: Accepted by Transactions on Machine Learning Research (TMLR)

  30. arXiv:2211.11209  [pdf, other]

    cs.RO

    A Novel Uncalibrated Visual Servoing Controller Based on Model-Free Adaptive Control Method with Neural Network

    Authors: Haibin Zeng, Yueyong Lyu, Jiaming Qi, Shuangquan Zou, Tanghao Qin, Wenyu Qin

    Abstract: Nowadays, with the continuous expansion of application scenarios for robotic arms, there are more and more scenarios where nonspecialists come into contact with robotic arms. However, in robotic arm visual servoing, traditional Position-based Visual Servoing (PBVS) requires a lot of calibration work, which is challenging for nonspecialists to cope with. To cope with this situation, Uncal…

    Submitted 21 November, 2022; originally announced November 2022.

    Comments: 16 pages, 8 figures

  31. arXiv:2211.11178  [pdf, other]

    cs.RO

    Adaptive Finite-Time Model Estimation and Control for Manipulator Visual Servoing using Sliding Mode Control and Neural Networks

    Authors: Haibin Zeng, Yueyong Lyu, Jiaming Qi, Shuangquan Zou, Tanghao Qin, Wenyu Qin

    Abstract: Image-based visual servoing without a system model is challenging, since it is hard to obtain an accurate estimate of the hand-eye relationship from visual measurements alone. Yet the accuracy of the estimated hand-eye relationship, expressed in local linear form with a Jacobian matrix, is important to the whole system's performance. In this article, we propose a finite-time controller as well as a J…

    Submitted 20 November, 2022; originally announced November 2022.

    Comments: 24 pages, 10 figures

  32. arXiv:2211.02256  [pdf]

    eess.IV cs.CV

    ISA-Net: Improved spatial attention network for PET-CT tumor segmentation

    Authors: Zhengyong Huang, Sijuan Zou, Guoshuai Wang, Zixiang Chen, Hao Shen, Haiyan Wang, Na Zhang, Lu Zhang, Fan Yang, Haining Wang, Dong Liang, Tianye Niu, Xiaohua Zhu, Zhanli Hu

    Abstract: Achieving accurate and automated tumor segmentation plays an important role in both clinical practice and radiomics research. Segmentation in medicine is now often performed manually by experts, which is a laborious, expensive and error-prone task. Manual annotation relies heavily on the experience and knowledge of these experts. In addition, there is much intra- and interobserver variation. There… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

  33. arXiv:2210.12869  [pdf, other]

    math.ST cs.IT

    Robust Multi-Hypothesis Testing with Moment Constrained Uncertainty Sets

    Authors: Akshayaa Magesh, Zhongchang Sun, Venugopal V. Veeravalli, Shaofeng Zou

    Abstract: The problem of robust binary hypothesis testing is studied. Under both hypotheses, the data-generating distributions are assumed to belong to uncertainty sets constructed through moments; in particular, the sets contain distributions whose moments are centered around the empirical moments obtained from training samples. The goal is to design a test that performs well under all distributions in the… ▽ More

    Submitted 6 January, 2024; v1 submitted 23 October, 2022; originally announced October 2022.

    Comments: arXiv admin note: text overlap with arXiv:2203.12777

  34. arXiv:2210.08794  [pdf]

    cs.LG cs.AI

    Break The Spell Of Total Correlation In betaTCVAE

    Authors: Zihao Chen, Wenyong Wang, Sai Zou

    Abstract: In the absence of artificial labels, the independent and dependent features in the data are cluttered. How to construct the inductive biases of the model to flexibly divide and effectively contain features with different complexity is the main focal point of unsupervised disentangled representation learning. This paper proposes a new iterative decomposition path of total correlation and explains t… ▽ More

    Submitted 27 April, 2023; v1 submitted 17 October, 2022; originally announced October 2022.

  35. arXiv:2209.10457  [pdf, other]

    cs.CR cs.IT

    Understanding Information Disclosure from Secure Computation Output: A Study of Average Salary Computation

    Authors: Alessandro Baccarini, Marina Blanton, Shaofeng Zou

    Abstract: Secure multi-party computation has seen substantial performance improvements in recent years and is being increasingly used in commercial products. While a significant amount of work was dedicated to improving its efficiency under standard security models, the threat models do not account for information leakage from the output of secure function evaluation. Quantifying information disclosure abou… ▽ More

    Submitted 20 March, 2024; v1 submitted 21 September, 2022; originally announced September 2022.

    Comments: This is the full version of our conference paper, appearing in the proceedings of the Fourteenth ACM Conference on Data and Application Security and Privacy (CODASPY), Porto, Portugal, 2024

  36. arXiv:2209.08230  [pdf, other]

    cs.MA cs.LG cs.RO eess.SY

    A Robust and Constrained Multi-Agent Reinforcement Learning Electric Vehicle Rebalancing Method in AMoD Systems

    Authors: Sihong He, Yue Wang, Shuo Han, Shaofeng Zou, Fei Miao

    Abstract: Electric vehicles (EVs) play critical roles in autonomous mobility-on-demand (AMoD) systems, but their unique charging patterns increase the model uncertainties in AMoD systems (e.g. state transition probability). Since there usually exists a mismatch between the training and test/true environments, incorporating model uncertainty into system design is of critical importance in real-world applicat… ▽ More

    Submitted 27 September, 2023; v1 submitted 16 September, 2022; originally announced September 2022.

    Comments: 8 pages, accepted to IROS2023

  37. arXiv:2209.06866  [pdf, other]

    cs.LG

    Robust Constrained Reinforcement Learning

    Authors: Yue Wang, Fei Miao, Shaofeng Zou

    Abstract: Constrained reinforcement learning is to maximize the expected reward subject to constraints on utilities/costs. However, the training environment may not be the same as the test one, due to, e.g., modeling error, adversarial attack, non-stationarity, resulting in severe performance degradation and more importantly constraint violation. We propose a framework of robust constrained reinforcement le… ▽ More

    Submitted 14 September, 2022; originally announced September 2022.

  38. arXiv:2209.02555  [pdf, other]

    cs.LG

    Finite-Time Error Bounds for Greedy-GQ

    Authors: Yue Wang, Yi Zhou, Shaofeng Zou

    Abstract: Greedy-GQ with linear function approximation, originally proposed by Maei et al. (2010), is a value-based off-policy algorithm for optimal control in reinforcement learning, and it has a non-linear two-timescale structure with a non-convex objective function. This paper develops its tightest finite-time error bounds. We show that the Greedy-GQ algorithm converges as fast as…

    Submitted 1 May, 2024; v1 submitted 6 September, 2022; originally announced September 2022.

    Comments: Accepted by Springer Machine Learning Journal

  39. arXiv:2207.04320  [pdf, other]

    cs.CV

    Snipper: A Spatiotemporal Transformer for Simultaneous Multi-Person 3D Pose Estimation Tracking and Forecasting on a Video Snippet

    Authors: Shihao Zou, Yuanlu Xu, Chao Li, Lingni Ma, Li Cheng, Minh Vo

    Abstract: Multi-person pose understanding from RGB videos involves three complex tasks: pose estimation, tracking and motion forecasting. Intuitively, accurate multi-person pose estimation facilitates robust tracking, and robust tracking builds crucial history for correct motion forecasting. Most existing works either focus on a single task or employ multi-stage approaches to solving multiple tasks separate… ▽ More

    Submitted 12 September, 2023; v1 submitted 9 July, 2022; originally announced July 2022.

  40. arXiv:2206.06426  [pdf, ps, other]

    cs.LG stat.ML

    Provably Efficient Offline Reinforcement Learning with Trajectory-Wise Reward

    Authors: Tengyu Xu, Yue Wang, Shaofeng Zou, Yingbin Liang

    Abstract: The remarkable success of reinforcement learning (RL) heavily relies on observing the reward of every visited state-action pair. In many real world applications, however, an agent can observe only a score that represents the quality of the whole trajectory, which is referred to as the trajectory-wise reward. In such a situation, it is difficult for standard RL methods to well utilize traject…

    Submitted 18 April, 2023; v1 submitted 13 June, 2022; originally announced June 2022.

    Comments: Submitted for IEEE Transactions on Information Theory

  41. arXiv:2205.07344  [pdf, other]

    cs.LG

    Policy Gradient Method For Robust Reinforcement Learning

    Authors: Yue Wang, Shaofeng Zou

    Abstract: This paper develops the first policy gradient method with global optimality guarantee and complexity analysis for robust reinforcement learning under model mismatch. Robust reinforcement learning is to learn a policy robust to model mismatch between simulator and real environment. We first develop the robust policy (sub-)gradient, which is applicable for any differentiable parametric policy class.… ▽ More

    Submitted 15 May, 2022; originally announced May 2022.

    Comments: Accepted by ICML 2022

  42. arXiv:2204.08166  [pdf, other]

    cs.CV cs.LG

    TOD-CNN: An Effective Convolutional Neural Network for Tiny Object Detection in Sperm Videos

    Authors: Shuojia Zou, Chen Li, Hongzan Sun, Peng Xu, Jiawei Zhang, Pingli Ma, Yudong Yao, Xinyu Huang, Marcin Grzegorzek

    Abstract: The detection of tiny objects in microscopic videos is a problematic point, especially in large-scale experiments. For tiny objects (such as sperms) in microscopic videos, current detection methods face challenges in fuzzy, irregular, and precise positioning of objects. In contrast, we present a convolutional neural network for tiny object detection (TOD-CNN) with an underlying data set of high-qu… ▽ More

    Submitted 18 April, 2022; originally announced April 2022.

    Comments: 17 pages, 12 figures

  43. arXiv:2203.12777

    eess.SP cs.IT cs.LG math.ST stat.ML

    Kernel Robust Hypothesis Testing

    Authors: Zhongchang Sun, Shaofeng Zou

    Abstract: The problem of robust hypothesis testing is studied, where under the null and the alternative hypotheses, the data-generating distributions are assumed to be in some uncertainty sets, and the goal is to design a test that performs well under the worst-case distributions over the uncertainty sets. In this paper, uncertainty sets are constructed in a data-driven manner using a kernel method, i.e., the…

    Submitted 5 August, 2023; v1 submitted 23 March, 2022; originally announced March 2022.

  44. LiDAR-guided Stereo Matching with a Spatial Consistency Constraint

    Authors: Yongjun Zhang, Siyuan Zou, Xinyi Liu, Xu Huang, Yi Wan, Yongxiang Yao

    Abstract: The complementary fusion of light detection and ranging (LiDAR) data and image data is a promising but challenging task for generating high-precision and high-density point clouds. This study proposes an innovative approach called LiDAR-guided stereo matching (LGSM), which considers the spatial consistency represented by continuous disparity or depth changes in the hom…

    Submitted 24 February, 2022; v1 submitted 20 February, 2022; originally announced February 2022.

    Comments: This version replaces the earlier one to add the journal reference, DOI, and report number.

    Journal ref: ISPRS Journal of Photogrammetry and Remote Sensing, Volume 183 (2021), 164-177

  45. arXiv:2202.07820

    eess.IV cs.CV

    A Survey of Semen Quality Evaluation in Microscopic Videos Using Computer Assisted Sperm Analysis

    Authors: Wenwei Zhao, Pingli Ma, Chen Li, Xiaoning Bu, Shuojia Zou, Tao Jiang, Marcin Grzegorzek

    Abstract: Computer Assisted Sperm Analysis (CASA) plays a crucial role in male reproductive health diagnosis and infertility treatment. With the development of the computer industry in recent years, a great number of accurate algorithms have been proposed. With the assistance of those novel algorithms, it is possible for CASA to achieve faster and higher-quality results. Since image processing is the technical basi…

    Submitted 17 February, 2022; v1 submitted 15 February, 2022; originally announced February 2022.

  46. arXiv:2111.06925

    cs.CV

    Action2video: Generating Videos of Human 3D Actions

    Authors: Chuan Guo, Xinxin Zuo, Sen Wang, Xinshuang Liu, Shihao Zou, Minglun Gong, Li Cheng

    Abstract: We aim to tackle the interesting yet challenging problem of generating videos of diverse and natural human motions from prescribed action categories. The key issue lies in the ability to synthesize multiple distinct motion sequences that are realistic in their visual appearances. This is achieved in this paper by a two-step process that maintains internal 3D pose and shape representations, action2mo…

    Submitted 18 December, 2021; v1 submitted 12 November, 2021; originally announced November 2021.

    Comments: Accepted by IJCV

  47. arXiv:2111.05460

    cs.LG cs.NI

    Cross-Layered Distributed Data-driven Framework For Enhanced Smart Grid Cyber-Physical Security

    Authors: Allen Starke, Keerthiraj Nagaraj, Cody Ruben, Nader Aljohani, Sheng Zou, Arturo Bretas, Janise McNair, Alina Zare

    Abstract: Smart Grid (SG) research and development has drawn much attention from academia, industry and government due to the great impact it will have on society, economics and the environment. Securing the SG is a significant challenge due to the increased dependency on communication networks to assist in physical process control, exposing them to various cyber-threats. In addition to attacks th…

    Submitted 9 November, 2021; originally announced November 2021.

  48. arXiv:2110.10351

    math.OC cs.LG

    Faster Algorithm and Sharper Analysis for Constrained Markov Decision Process

    Authors: Tianjiao Li, Ziwei Guan, Shaofeng Zou, Tengyu Xu, Yingbin Liang, Guanghui Lan

    Abstract: The problem of constrained Markov decision process (CMDP) is investigated, where an agent aims to maximize the expected accumulated discounted reward subject to multiple constraints on its utilities/costs. A new primal-dual approach is proposed with a novel integration of three ingredients: entropy-regularized policy optimizer, dual variable regularizer, and Nesterov's accelerated gradient descent…

    Submitted 19 October, 2021; originally announced October 2021.

    Comments: The paper was initially submitted for publication in January 2021

  49. arXiv:2109.14523

    cs.LG

    Online Robust Reinforcement Learning with Model Uncertainty

    Authors: Yue Wang, Shaofeng Zou

    Abstract: Robust reinforcement learning (RL) aims to find a policy that optimizes the worst-case performance over an uncertainty set of MDPs. In this paper, we focus on model-free robust RL, where the uncertainty set is defined to be centered at a misspecified MDP that generates a single sample trajectory sequentially and is assumed to be unknown. We develop a sample-based approach to estimate the unknown un…

    Submitted 27 October, 2021; v1 submitted 29 September, 2021; originally announced September 2021.

    Comments: Accepted by NeurIPS 2021

  50. arXiv:2109.03699

    cs.LG

    Sample and Communication-Efficient Decentralized Actor-Critic Algorithms with Finite-Time Analysis

    Authors: Ziyi Chen, Yi Zhou, Rongrong Chen, Shaofeng Zou

    Abstract: Actor-critic (AC) algorithms have been widely adopted in decentralized multi-agent systems to learn the optimal joint control policy. However, existing decentralized AC algorithms either do not preserve the privacy of agents or are not sample- and communication-efficient. In this work, we develop two decentralized AC and natural AC (NAC) algorithms that are private, and sample and communication-eff…

    Submitted 2 February, 2022; v1 submitted 8 September, 2021; originally announced September 2021.

    Comments: 40 pages, 2 figures