
Showing 1–50 of 94 results for author: Sha, M

  1. arXiv:2407.11079  [pdf, ps, other]

    eess.SP cs.IT

    One-Bit MIMO Detection: From Global Maximum-Likelihood Detector to Amplitude Retrieval Approach

    Authors: Mingjie Shao, Wei-Kun Chen, Cheng-Yang Yu, Ya-Feng Liu, Wing-Kin Ma

    Abstract: As communication systems advance towards the future 6G era, the incorporation of large-scale antenna arrays in base stations (BSs) presents challenges such as increased hardware costs and energy consumption. To address these issues, the use of one-bit analog-to-digital converters (ADCs)/digital-to-analog converters (DACs) has gained significant attentions. This paper focuses on one-bit multiple-in…

    Submitted 13 July, 2024; originally announced July 2024.

  2. arXiv:2407.07487  [pdf, other]

    cs.CL

    Review-LLM: Harnessing Large Language Models for Personalized Review Generation

    Authors: Qiyao Peng, Hongtao Liu, Hongyan Xu, Qing Yang, Minglai Shao, Wenjun Wang

    Abstract: Product review generation is an important task in recommender systems, which could provide explanation and persuasiveness for the recommendation. Recently, Large Language Models (LLMs, e.g., ChatGPT) have shown superior text modeling and generating ability, which could be applied in review generation. However, directly applying the LLMs for generating reviews might be troubled by the "polite" ph…

    Submitted 10 July, 2024; originally announced July 2024.

  3. arXiv:2407.04877  [pdf]

    cond-mat.mtrl-sci cs.LG physics.chem-ph

    Leveraging Data Mining, Active Learning, and Domain Adaptation in a Multi-Stage, Machine Learning-Driven Approach for the Efficient Discovery of Advanced Acidic Oxygen Evolution Electrocatalysts

    Authors: Rui Ding, Jianguo Liu, Kang Hua, Xuebin Wang, Xiaoben Zhang, Minhua Shao, Yuxin Chen, Junhong Chen

    Abstract: Developing advanced catalysts for acidic oxygen evolution reaction (OER) is crucial for sustainable hydrogen production. This study introduces a novel, multi-stage machine learning (ML) approach to streamline the discovery and optimization of complex multi-metallic catalysts. Our method integrates data mining, active learning, and domain adaptation throughout the materials discovery process. Unlik…

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 95 pages (main text 37 pages; supplementary materials 58 pages); 38 figures (main text 6 figures; supplementary materials 32 figures)

  4. arXiv:2406.10958  [pdf, other]

    math.OC cs.CL cs.MA

    City-LEO: Toward Transparent City Management Using LLM with End-to-End Optimization

    Authors: Zihao Jiao, Mengyi Sha, Haoyu Zhang, Xinyu Jiang, Wei Qi

    Abstract: Existing operations research (OR) models and tools play indispensable roles in smart-city operations, yet their practical implementation is limited by the complexity of modeling and deficiencies in optimization proficiency. To generate more relevant and accurate solutions to users' requirements, we propose a large language model (LLM)-based agent ("City-LEO") that enhances the efficiency and trans…

    Submitted 17 June, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

    Comments: 26 pages, 8 figures, 5 tables

  5. arXiv:2406.09495  [pdf, other]

    cs.LG cs.AI

    Fair Data Generation via Score-based Diffusion Model

    Authors: Yujie Lin, Dong Li, Chen Zhao, Minglai Shao

    Abstract: The fairness of AI decision-making has garnered increasing attention, leading to the proposal of numerous fairness algorithms. In this paper, we aim not to address this issue by directly introducing fair learning algorithms, but rather by generating entirely new, fair synthetic data from biased datasets for use in any downstream tasks. Additionally, the distribution of test data may differ from th…

    Submitted 13 June, 2024; originally announced June 2024.

  6. arXiv:2406.07693  [pdf]

    cs.CY cs.AI cs.CL cs.LG cs.SI

    A Labelled Dataset for Sentiment Analysis of Videos on YouTube, TikTok, and Other Sources about the 2024 Outbreak of Measles

    Authors: Nirmalya Thakur, Vanessa Su, Mingchen Shao, Kesha A. Patel, Hongseok Jeong, Victoria Knieling, Andrew Bian

    Abstract: The work of this paper presents a dataset that contains the data of 4011 videos about the ongoing outbreak of measles published on 264 websites on the internet between January 1, 2024, and May 31, 2024. The dataset is available at https://dx.doi.org/10.21227/40s8-xf63. These websites primarily include YouTube and TikTok, which account for 48.6% and 15.2% of the videos, respectively. The remainder…

    Submitted 16 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: 19 pages

    ACM Class: I.2.7; I.2.8; I.5.4; K.4.2; H.2.8; I.2.6

  7. arXiv:2406.05590  [pdf, other]

    cs.CR cs.AI cs.CY cs.LG

    NYU CTF Dataset: A Scalable Open-Source Benchmark Dataset for Evaluating LLMs in Offensive Security

    Authors: Minghao Shao, Sofija Jancheska, Meet Udeshi, Brendan Dolan-Gavitt, Haoran Xi, Kimberly Milner, Boyuan Chen, Max Yin, Siddharth Garg, Prashanth Krishnamurthy, Farshad Khorrami, Ramesh Karri, Muhammad Shafique

    Abstract: Large Language Models (LLMs) are being deployed across various domains today. However, their capacity to solve Capture the Flag (CTF) challenges in cybersecurity has not been thoroughly evaluated. To address this, we develop a novel method to assess LLMs in solving CTF challenges by creating a scalable, open-source benchmark database specifically designed for these applications. This database incl…

    Submitted 8 June, 2024; originally announced June 2024.

  8. arXiv:2406.01284  [pdf]

    physics.med-ph cs.HC

    Extraction of Weak Surface Diaphragmatic Electromyogram Using Modified Progressive FastICA Peel-Off

    Authors: Yao Li, Dongsheng Zhao, Haowen Zhao, Xu Zhang, Min Shao

    Abstract: Diaphragmatic electromyogram (EMGdi) contains crucial information about human respiration therefore can be used to monitor respiratory condition. Although it is practical to record EMGdi noninvasively and conveniently by placing surface electrodes over chest skin, extraction of such weak surface EMGdi (sEMGdi) from great noisy environment is a challenging task, limiting its clinical use compared w…

    Submitted 28 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  9. arXiv:2405.06516  [pdf, ps, other]

    cs.IT eess.SP

    An Efficient Algorithm for Sum-Rate Maximization in Fluid Antenna-Assisted ISAC System

    Authors: Qian Zhang, Mingjie Shao, Tong Zhang, Gaojie Chen, Ju Liu

    Abstract: In this letter, we investigate the fluid antenna (FA)-assisted integrated sensing and communication (ISAC) system, where communication and radar sensing employ the co-waveform design. Specifically, we focus on the beamformer design and antenna position configuration to realize a higher communication rate while guaranteeing the minimum radar probing power. Different from existing beamformer algorit…

    Submitted 10 May, 2024; originally announced May 2024.

  10. arXiv:2405.02132  [pdf, other]

    cs.SD cs.CL eess.AS

    Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets

    Authors: Xuelong Geng, Tianyi Xu, Kun Wei, Bingshen Mu, Hongfei Xue, He Wang, Yangze Li, Pengcheng Guo, Yuhang Dai, Longhao Li, Mingchen Shao, Lei Xie

    Abstract: Large Language Models (LLMs) have demonstrated unparalleled effectiveness in various NLP tasks, and integrating LLMs with automatic speech recognition (ASR) is becoming a mainstream paradigm. Building upon this momentum, our research delves into an in-depth examination of this paradigm on a large open-source Chinese dataset. Specifically, our research aims to evaluate the impact of various configu…

    Submitted 6 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

  11. arXiv:2404.19383  [pdf, other]

    cs.CV

    Cross-Block Fine-Grained Semantic Cascade for Skeleton-Based Sports Action Recognition

    Authors: Zhendong Liu, Haifeng Xia, Tong Guo, Libo Sun, Ming Shao, Siyu Xia

    Abstract: Human action video recognition has recently attracted more attention in applications such as video security and sports posture correction. Popular solutions, including graph convolutional networks (GCNs) that model the human skeleton as a spatiotemporal graph, have proven very effective. GCNs-based methods with stacked blocks usually utilize top-layer semantics for classification/annotation purpos…

    Submitted 30 April, 2024; originally announced April 2024.

  12. arXiv:2404.03386  [pdf, other]

    cs.RO cs.AI cs.LG

    SENSOR: Imitate Third-Person Expert's Behaviors via Active Sensoring

    Authors: Kaichen Huang, Minghao Shao, Shenghua Wan, Hai-Hang Sun, Shuai Feng, Le Gan, De-Chuan Zhan

    Abstract: In many real-world visual Imitation Learning (IL) scenarios, there is a misalignment between the agent's and the expert's perspectives, which might lead to the failure of imitation. Previous methods have generally solved this problem by domain alignment, which incurs extra computation and storage costs, and these methods fail to handle the hard cases where the viewpoint gap is too large.…

    Submitted 4 April, 2024; originally announced April 2024.

  13. arXiv:2404.03382  [pdf, other]

    cs.LG cs.AI

    DIDA: Denoised Imitation Learning based on Domain Adaptation

    Authors: Kaichen Huang, Hai-Hang Sun, Shenghua Wan, Minghao Shao, Shuai Feng, Le Gan, De-Chuan Zhan

    Abstract: Imitating skills from low-quality datasets, such as sub-optimal demonstrations and observations with distractors, is common in real-world applications. In this work, we focus on the problem of Learning from Noisy Demonstrations (LND), where the imitator is required to learn from data with noise that often occurs during the processes of data collection or transmission. Previous IL methods improve t…

    Submitted 4 April, 2024; originally announced April 2024.

  14. arXiv:2404.01843  [pdf, other]

    cs.CV

    Sketch3D: Style-Consistent Guidance for Sketch-to-3D Generation

    Authors: Wangguandong Zheng, Haifeng Xia, Rui Chen, Ming Shao, Siyu Xia, Zhengming Ding

    Abstract: Recently, image-to-3D approaches have achieved significant results with a natural image as input. However, it is not always possible to access these enriched color input samples in practical applications, where only sketches are available. Existing sketch-to-3D researches suffer from limitations in broad applications due to the challenges of lacking color information and multi-view content. To ove…

    Submitted 7 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

  15. arXiv:2403.18866  [pdf, other]

    cs.SI cs.LG

    Graph Bayesian Optimization for Multiplex Influence Maximization

    Authors: Zirui Yuan, Minglai Shao, Zhiqian Chen

    Abstract: Influence maximization (IM) is the problem of identifying a limited number of initial influential users within a social network to maximize the number of influenced users. However, previous research has mostly focused on individual information propagation, neglecting the simultaneous and interactive dissemination of multiple information items. In reality, when users encounter a piece of informatio…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Proceedings of the AAAI Conference on Artificial Intelligence, 2024

  16. arXiv:2403.16334  [pdf, other]

    cs.LG cs.AI

    Graphs Generalization under Distribution Shifts

    Authors: Qin Tian, Wenjun Wang, Chen Zhao, Minglai Shao, Wang Zhang, Dong Li

    Abstract: Traditional machine learning methods heavily rely on the independent and identically distribution assumption, which imposes limitations when the test distribution deviates from the training distribution. To address this crucial issue, out-of-distribution (OOD) generalization, which aims to achieve satisfactory generalization performance when faced with unknown distribution shifts, has made a signi…

    Submitted 24 March, 2024; originally announced March 2024.

  17. arXiv:2403.12839  [pdf, other]

    cs.CV

    Global-guided Focal Neural Radiance Field for Large-scale Scene Rendering

    Authors: Mingqi Shao, Feng Xiong, Hang Zhang, Shuang Yang, Mu Xu, Wei Bian, Xueqian Wang

    Abstract: Neural radiance fields (NeRF) have recently been applied to render large-scale scenes. However, their limited model capacity typically results in blurred rendering results. Existing large-scale NeRFs primarily address this limitation by partitioning the scene into blocks, which are subsequently handled by separate sub-NeRFs. These sub-NeRFs, trained from scratch and processed independently, lead t…

    Submitted 19 March, 2024; originally announced March 2024.

  18. arXiv:2402.11814  [pdf, other]

    cs.CR

    An Empirical Evaluation of LLMs for Solving Offensive Security Challenges

    Authors: Minghao Shao, Boyuan Chen, Sofija Jancheska, Brendan Dolan-Gavitt, Siddharth Garg, Ramesh Karri, Muhammad Shafique

    Abstract: Capture The Flag (CTF) challenges are puzzles related to computer security scenarios. With the advent of large language models (LLMs), more and more CTF participants are using LLMs to understand and solve the challenges. However, so far no work has evaluated the effectiveness of LLMs in solving CTF challenges with a fully automated workflow. We develop two CTF-solving workflows, human-in-the-loop…

    Submitted 18 February, 2024; originally announced February 2024.

  19. arXiv:2402.10434  [pdf, other]

    cs.LG

    Parametric Augmentation for Time Series Contrastive Learning

    Authors: Xu Zheng, Tianchun Wang, Wei Cheng, Aitian Ma, Haifeng Chen, Mo Sha, Dongsheng Luo

    Abstract: Modern techniques like contrastive learning have been effectively used in many areas, including computer vision, natural language processing, and graph-structured data. Creating positive examples that assist the model in learning robust and discriminative representations is a crucial stage in contrastive learning approaches. Usually, preset human intuition directs the selection of relevant data au…

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: Accepted by International Conference on Learning Representations (ICLR 2024)

  20. arXiv:2402.01327  [pdf, other]

    cs.LG cs.AI cs.CY

    Supervised Algorithmic Fairness in Distribution Shifts: A Survey

    Authors: Minglai Shao, Dong Li, Chen Zhao, Xintao Wu, Yujie Lin, Qin Tian

    Abstract: Supervised fairness-aware machine learning under distribution shifts is an emerging field that addresses the challenge of maintaining equitable and unbiased predictions when faced with changes in data distributions from source to target domains. In real-world applications, machine learning models are often trained on a specific dataset but deployed in environments where the data distribution may s…

    Submitted 4 May, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: IJCAI 2024

  21. arXiv:2312.08079  [pdf, other]

    cs.CL cs.SD eess.AS

    Extending Whisper with prompt tuning to target-speaker ASR

    Authors: Hao Ma, Zhiyuan Peng, Mingjie Shao, Jing Li, Ju Liu

    Abstract: Target-speaker automatic speech recognition (ASR) aims to transcribe the desired speech of a target speaker from multi-talker overlapped utterances. Most of the existing target-speaker ASR (TS-ASR) methods involve either training from scratch or fully fine-tuning a pre-trained model, leading to significant training costs and becoming inapplicable to large foundation models. This work leverages pro…

    Submitted 11 January, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: ICASSP 2024

  22. arXiv:2312.07941  [pdf, ps, other]

    cs.IT eess.SP

    An efficient algorithm for multiuser sum-rate maximization of large-scale active RIS-aided MIMO system

    Authors: Qian Zhang, Mingjie Shao, Qiang Li, Ju Liu

    Abstract: Active reconfigurable intelligent surface (RIS) is a new RIS architecture that can reflect and amplify communication signals. It can provide enhanced performance gain compared to the conventional passive RIS systems that can only reflect the signals. On the other hand, the design problem of active RIS-aided systems is more challenging than the passive RIS-aided systems and its efficient algorithms…

    Submitted 11 January, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: ICASSP 2024

  23. arXiv:2312.01315  [pdf, other]

    cs.CV

    Few-shot Shape Recognition by Learning Deep Shape-aware Features

    Authors: Wenlong Shi, Changsheng Lu, Ming Shao, Yinjie Zhang, Siyu Xia, Piotr Koniusz

    Abstract: Traditional shape descriptors have been gradually replaced by convolutional neural networks due to their superior performance in feature extraction and classification. The state-of-the-art methods recognize object shapes via image reconstruction or pixel classification. However, these methods are biased toward texture information and overlook the essential shape descriptions, thus, they fail to g…

    Submitted 3 December, 2023; originally announced December 2023.

    Comments: Accepted by WACV 2024; 8 pages for main paper

  24. arXiv:2310.15294  [pdf, other]

    cs.CL

    Adaptive End-to-End Metric Learning for Zero-Shot Cross-Domain Slot Filling

    Authors: Yuanjun Shi, Linzhi Wu, Minglai Shao

    Abstract: Recently slot filling has witnessed great development thanks to deep learning and the availability of large-scale annotated data. However, it poses a critical challenge to handle a novel domain whose samples are never seen during training. The recognition performance might be greatly degraded due to severe domain shifts. Most prior works deal with this problem in a two-pass pipeline manner based o…

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023 (Main, Long Paper)

  25. arXiv:2309.13005  [pdf, other]

    cs.LG cs.AI cs.CY

    Towards Counterfactual Fairness-aware Domain Generalization in Changing Environments

    Authors: Yujie Lin, Chen Zhao, Minglai Shao, Baoluo Meng, Xujiang Zhao, Haifeng Chen

    Abstract: Recognizing the prevalence of domain shift as a common challenge in machine learning, various domain generalization (DG) techniques have been developed to enhance the performance of machine learning systems when dealing with out-of-distribution (OOD) data. Furthermore, in real-world scenarios, data distributions can gradually change across a sequence of sequential domains. While current methodolog…

    Submitted 5 May, 2024; v1 submitted 22 September, 2023; originally announced September 2023.

    Comments: IJCAI 2024

  26. arXiv:2309.07380  [pdf]

    cs.SI

    Domain-adaptive Graph Attention-supervised Network for Cross-network Edge Classification

    Authors: Xiao Shen, Mengqiu Shao, Shirui Pan, Laurence T. Yang, Xi Zhou

    Abstract: Graph neural networks (GNNs) have shown great ability in modeling graphs, however, their performance would significantly degrade when there are noisy edges connecting nodes from different classes. To alleviate negative effect of noisy edges on neighborhood aggregation, some recent GNNs propose to predict the label agreement between node pairs within a single network. However, predicting the label…

    Submitted 13 September, 2023; originally announced September 2023.

    Comments: IEEE Transactions on Neural Networks and Learning Systems, 2023

  27. arXiv:2309.01627  [pdf, other]

    cs.CV

    Cross-Consistent Deep Unfolding Network for Adaptive All-In-One Video Restoration

    Authors: Yuanshuo Cheng, Mingwen Shao, Yecong Wan, Yuanjian Qiao, Wangmeng Zuo, Deyu Meng

    Abstract: Existing Video Restoration (VR) methods always necessitate the individual deployment of models for each adverse weather to remove diverse adverse weather degradations, lacking the capability for adaptive processing of degradations. Such limitation amplifies the complexity and deployment costs in practical applications. To overcome this deficiency, in this paper, we propose a Cross-consistent Deep…

    Submitted 10 December, 2023; v1 submitted 4 September, 2023; originally announced September 2023.

    Comments: 16 pages, 13 figures

  28. arXiv:2308.16879  [pdf, other]

    cs.AI

    Adaptation Speed Analysis for Fairness-aware Causal Models

    Authors: Yujie Lin, Chen Zhao, Minglai Shao, Xujiang Zhao, Haifeng Chen

    Abstract: For example, in machine translation tasks, to achieve bidirectional translation between two languages, the source corpus is often used as the target corpus, which involves the training of two models with opposite directions. The question of which one can adapt most quickly to a domain shift is of significant importance in many fields. Specifically, consider an original distribution p that changes…

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: CIKM 2023

  29. arXiv:2308.16441  [pdf, other]

    cs.AI

    Contrastive Representation Learning Based on Multiple Node-centered Subgraphs

    Authors: Dong Li, Wenjun Wang, Minglai Shao, Chen Zhao

    Abstract: As the basic element of graph-structured data, node has been recognized as the main object of study in graph representation learning. A single node intuitively has multiple node-centered subgraphs from the whole graph (e.g., one person in a social network has multiple social circles based on his different relationships). We study this intuition under the framework of graph contrastive learning, an…

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: CIKM 2023

  30. arXiv:2307.12872  [pdf, other]

    cs.CV cs.CR cs.LG

    Latent Code Augmentation Based on Stable Diffusion for Data-free Substitute Attacks

    Authors: Mingwen Shao, Lingzhuang Meng, Yuanjian Qiao, Lixu Zhang, Wangmeng Zuo

    Abstract: Since the training data of the target model is not available in the black-box substitute attack, most recent schemes utilize GANs to generate data for training the substitute model. However, these GANs-based schemes suffer from low training efficiency as the generator needs to be retrained for each target model during the substitute training process, as well as low generation quality. To overcome…

    Submitted 30 March, 2024; v1 submitted 24 July, 2023; originally announced July 2023.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  31. arXiv:2307.07688  [pdf, other]

    cs.CV eess.IV

    DRM-IR: Task-Adaptive Deep Unfolding Network for All-In-One Image Restoration

    Authors: Yuanshuo Cheng, Mingwen Shao, Yecong Wan, Chao Wang

    Abstract: Existing All-In-One image restoration (IR) methods usually lack flexible modeling on various types of degradation, thus impeding the restoration performance. To achieve All-In-One IR with higher task dexterity, this work proposes an efficient Dynamic Reference Modeling paradigm (DRM-IR), which consists of task-adaptive degradation modeling and model-based image restoring. Specifically, these two s…

    Submitted 30 November, 2023; v1 submitted 14 July, 2023; originally announced July 2023.

  32. arXiv:2307.04066  [pdf, other]

    cs.CV

    Random Position Adversarial Patch for Vision Transformers

    Authors: Mingzhen Shao

    Abstract: Previous studies have shown the vulnerability of vision transformers to adversarial patches, but these studies all rely on a critical assumption: the attack patches must be perfectly aligned with the patches used for linear projection in vision transformers. Due to this stringent requirement, deploying adversarial patches for vision transformers in the physical world becomes impractical, unlike th…

    Submitted 8 July, 2023; originally announced July 2023.

  33. arXiv:2307.00421  [pdf, other]

    cs.CV

    Brightness-Restricted Adversarial Attack Patch

    Authors: Mingzhen Shao

    Abstract: Adversarial attack patches have gained increasing attention due to their practical applicability in physical-world scenarios. However, the bright colors used in attack patches represent a significant drawback, as they can be easily identified by human observers. Moreover, even though these attacks have been highly successful in deceiving target networks, which specific features of the attack patch…

    Submitted 1 July, 2023; originally announced July 2023.

  34. arXiv:2306.15167  [pdf, other]

    cs.IT eess.SP

    An Efficient Global Algorithm for One-Bit Maximum-Likelihood MIMO Detection

    Authors: Cheng-Yang Yu, Mingjie Shao, Wei-Kun Chen, Ya-Feng Liu, Wing-Kin Ma

    Abstract: There has been growing interest in implementing massive MIMO systems by one-bit analog-to-digital converters (ADCs), which have the benefit of reducing the power consumption and hardware complexity. One-bit MIMO detection arises in such a scenario. It aims to detect the multiuser signals from the one-bit quantized received signals in an uplink channel. In this paper, we consider one-bit maximum-li…

    Submitted 3 July, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

  35. arXiv:2306.10695  [pdf, other]

    cs.LG cs.AI cs.CV

    SeMAIL: Eliminating Distractors in Visual Imitation via Separated Models

    Authors: Shenghua Wan, Yucen Wang, Minghao Shao, Ruying Chen, De-Chuan Zhan

    Abstract: Model-based imitation learning (MBIL) is a popular reinforcement learning method that improves sample efficiency on high-dimension input sources, such as images and videos. Following the convention of MBIL research, existing algorithms are highly deceptive by task-irrelevant information, especially moving distractors in videos. To tackle this problem, we propose a new algorithm - named Separated M…

    Submitted 19 June, 2023; originally announced June 2023.

    Comments: 18 pages, 7 figures

  36. arXiv:2305.11640  [pdf, other]

    cs.LG math.ST stat.ME stat.ML

    Distribution-Free Matrix Prediction Under Arbitrary Missing Pattern

    Authors: Meijia Shao, Yuan Zhang

    Abstract: This paper studies the open problem of conformalized entry prediction in a row/column-exchangeable matrix. The matrix setting presents novel and unique challenges, but there exists little work on this interesting topic. We meticulously define the problem, differentiate it from closely related problems, and rigorously delineate the boundary between achievable and impossible goals. We then propose t…

    Submitted 6 June, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: 12 pages, 4 figures

  37. arXiv:2305.09996  [pdf, other]

    cs.CV cs.AI

    Restoring Images Captured in Arbitrary Hybrid Adverse Weather Conditions in One Go

    Authors: Ye-Cong Wan, Ming-Wen Shao, Yuan-Shuo Cheng, Yue-Xian Liu, Zhi-Yuan Bao

    Abstract: Adverse conditions typically suffer from stochastic hybrid weather degradations (e.g., rainy and hazy night), while existing image restoration algorithms envisage that weather degradations occur independently, thus may fail to handle real-world complicated scenarios. Besides, supervised training is not feasible due to the lack of a comprehensive paired dataset to characterize hybrid conditions. To…

    Submitted 13 June, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: In submission

  38. arXiv:2304.09976  [pdf, other]

    cs.CV

    Analyzing the Domain Shift Immunity of Deep Homography Estimation

    Authors: Mingzhen Shao, Tolga Tasdizen, Sarang Joshi

    Abstract: Homography estimation serves as a fundamental technique for image alignment in a wide array of applications. The advent of convolutional neural networks has introduced learning-based methodologies that have exhibited remarkable efficacy in this realm. Yet, the generalizability of these approaches across distinct domains remains underexplored. Unlike other conventional tasks, CNN-driven homography…

    Submitted 29 November, 2023; v1 submitted 19 April, 2023; originally announced April 2023.

  39. On de novo Bridging Paired-end RNA-seq Data

    Authors: Xiang Li, Mingfu Shao

    Abstract: The high-throughput short-reads RNA-seq protocols often produce paired-end reads, with the middle portion of the fragments being unsequenced. We explore if the full-length fragments can be computationally reconstructed from the sequenced two ends in the absence of the reference genome - a problem here we refer to as de novo bridging. Solving this problem provides longer, more informative RNA-seq r…

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: 10 pages, 4 figures

    ACM Class: J.3

  40. arXiv:2303.10926  [pdf, other]

    cs.DS

    On the Maximal Independent Sets of $k$-mers with the Edit Distance

    Authors: Leran Ma, Ke Chen, Mingfu Shao

    Abstract: In computational biology, $k$-mers and edit distance are fundamental concepts. However, little is known about the metric space of all $k$-mers equipped with the edit distance. In this work, we explore the structure of the $k$-mer space by studying its maximal independent sets (MISs). An MIS is a sparse sketch of all $k$-mers with nice theoretical properties, and therefore admits critical applicati…

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: 9 pages, 1 figure

  41. arXiv:2302.13096  [pdf]

    cs.HC

    Real-Time Recognition of In-Place Body Actions and Head Gestures using Only a Head-Mounted Display

    Authors: Jingbo Zhao, Mingjun Shao, Yaojun Wang, Ruolin Xu

    Abstract: Body actions and head gestures are natural interfaces for interaction in virtual environments. Existing methods for in-place body action recognition often require hardware more than a head-mounted display (HMD), making body action interfaces difficult to be introduced to ordinary virtual reality (VR) users as they usually only possess an HMD. In addition, there lacks a unified solution to recogniz…

    Submitted 25 February, 2023; originally announced February 2023.

    Comments: 2023 IEEE International Conference on Virtual Reality and 3D User Interfaces, Shanghai, Mar 25-29, 2023

  42. arXiv:2302.11707  [pdf]

    cs.LG cs.AI

    A Deep Neural Network Based Approach to Building Budget-Constrained Models for Big Data Analysis

    Authors: Rui Ming, Haiping Xu, Shannon E. Gibbs, Donghui Yan, Ming Shao

    Abstract: Deep learning approaches require collection of data on many different input features or variables for accurate model training and prediction. Since data collection on input features could be costly, it is crucial to reduce the cost by selecting a subset of features and developing a budget-constrained model (BCM). In this paper, we introduce an approach to eliminating less important features for bi…

    Submitted 22 February, 2023; originally announced February 2023.

    Comments: 8 pages

  43. arXiv:2302.11430  [pdf]

    physics.comp-ph cs.AI physics.bio-ph

    Differentiable Rotamer Sampling with Molecular Force Fields

    Authors: Congzhou M. Sha, Jian Wang, Nikolay V. Dokholyan

    Abstract: Molecular dynamics is the primary computational method by which modern structural biology explores macromolecule structure and function. Boltzmann generators have been proposed as an alternative to molecular dynamics, by replacing the integration of molecular systems over time with the training of generative neural networks. This neural network approach to MD samples rare events at a higher rate t…

    Submitted 22 February, 2023; originally announced February 2023.

    Comments: 41 pages, 1 graphical abstract, 5 figures

  44. arXiv:2301.07234  [pdf, other]

    eess.IV cs.CV

    DRIMET: Deep Registration for 3D Incompressible Motion Estimation in Tagged-MRI with Application to the Tongue

    Authors: Zhangxing Bian, Fangxu Xing, Jinglun Yu, Muhan Shao, Yihao Liu, Aaron Carass, Jiachen Zhuo, Jonghye Woo, Jerry L. Prince

    Abstract: Tagged magnetic resonance imaging (MRI) has been used for decades to observe and quantify the detailed motion of deforming tissue. However, this technique faces several challenges such as tag fading, large motion, long computation times, and difficulties in obtaining diffeomorphic incompressible flow fields. To address these issues, this paper presents a novel unsupervised phase-based 3D motion es…

    Submitted 30 April, 2023; v1 submitted 17 January, 2023; originally announced January 2023.

    Comments: Accepted to MIDL 2023 (oral)

  45. arXiv:2301.06114  [pdf, other]

    eess.IV cs.LG

    Segmenting thalamic nuclei from manifold projections of multi-contrast MRI

    Authors: Chang Yan, Muhan Shao, Zhangxing Bian, Anqi Feng, Yuan Xue, Jiachen Zhuo, Rao P. Gullapalli, Aaron Carass, Jerry L. Prince

    Abstract: The thalamus is a subcortical gray matter structure that plays a key role in relaying sensory and motor signals within the brain. Its nuclei can atrophy or otherwise be affected by neurological disease and injuries including mild traumatic brain injury. Segmenting both the thalamus and its nuclei is challenging because of the relatively low contrast within and around the thalamus in conventional m…

    Submitted 31 January, 2023; v1 submitted 15 January, 2023; originally announced January 2023.

    Comments: 8 pages, 3 figures, 2023 SPIE-MI Image Processing

  46. arXiv:2212.03039  [pdf, ps, other]

    cs.SD eess.AS

    Covariance Regularization for Probabilistic Linear Discriminant Analysis

    Authors: Zhiyuan Peng, Mingjie Shao, Xuanji He, Xu Li, Tan Lee, Ke Ding, Guanglu Wan

    Abstract: Probabilistic linear discriminant analysis (PLDA) is commonly used in speaker verification systems to score the similarity of speaker embeddings. Recent studies improved the performance of PLDA in domain-matched conditions by diagonalizing its covariance. We suspect such brutal pruning approach could eliminate its capacity in modeling dimension correlation of speaker embeddings, leading to inadequ…

    Submitted 6 December, 2022; originally announced December 2022.

  47. arXiv:2211.16092  [pdf, other]

    cs.CV

    Unsupervised Visual Defect Detection with Score-Based Generative Model

    Authors: Yapeng Teng, Haoyang Li, Fuzhen Cai, Ming Shao, Siyu Xia

    Abstract: Anomaly Detection (AD), as a critical problem, has been widely discussed. In this paper, we specialize in one specific problem, Visual Defect Detection (VDD), in many industrial applications. And in practice, defect image samples are very rare and difficult to collect. Thus, we focus on the unsupervised visual defect detection and localization tasks and propose a novel framework based on the recen…

    Submitted 29 November, 2022; originally announced November 2022.

  48. arXiv:2210.03888  [pdf, ps, other]

    eess.SP cs.IT

    Accelerated and Deep Expectation Maximization for One-Bit MIMO-OFDM Detection

    Authors: Mingjie Shao, Wing-Kin Ma, Junbin Liu, Zihao Huang

    Abstract: In this paper we study the expectation maximization (EM) technique for one-bit MIMO-OFDM detection (OMOD). Arising from the recent interest in massive MIMO with one-bit analog-to-digital converters, OMOD is a massive-scale problem. EM is an iterative method that can exploit the OFDM structure to process the problem in a per-iteration efficient fashion. In this study we analyze the convergence rate…

    Submitted 26 January, 2024; v1 submitted 7 October, 2022; originally announced October 2022.

  49. arXiv:2208.11836  [pdf, other]

    cs.CV

    Polarimetric Inverse Rendering for Transparent Shapes Reconstruction

    Authors: Mingqi Shao, Chongkun Xia, Dongxu Duan, Xueqian Wang

    Abstract: In this work, we propose a novel method for the detailed reconstruction of transparent objects by exploiting polarimetric cues. Most of the existing methods usually lack sufficient constraints and suffer from the over-smooth problem. Hence, we introduce polarization information as a complementary cue. We implicitly represent the object's geometry as a neural network, while the polarization render…

    Submitted 24 August, 2022; originally announced August 2022.

  50. arXiv:2206.07465  [pdf]

    math.OC cs.IT physics.optics

    High-fidelity quantitative differential phase contrast deconvolution using dark-field sparse prior

    Authors: Shuhe Zhang, Tao Peng, Zeyu Ke, Meng Shao, Tos T. J. M. Berendschot, Jinhua Zhou

    Abstract: Differential phase contrast (DPC) imaging plays an important role in the family of quantitative phase measurement. However, the reconstruction algorithm for quantitative DPC (qDPC) imaging is not yet optimized, as it does not incorporate the inborn properties of qDPC imaging. In this research, we propose a simple but effective image prior, the dark-field sparse prior (DSP), to facilitate the phase…

    Submitted 15 June, 2022; originally announced June 2022.