Showing 1–50 of 238 results for author: Wei, P

  1. arXiv:2407.11699  [pdf, other]

    cs.CV

    Relation DETR: Exploring Explicit Position Relation Prior for Object Detection

    Authors: Xiuquan Hou, Meiqin Liu, Senlin Zhang, Ping Wei, Badong Chen, Xuguang Lan

    Abstract: This paper presents a general scheme for enhancing the convergence and performance of DETR (DEtection TRansformer). We investigate the slow convergence problem in transformers from a new perspective, suggesting that it arises from the self-attention that introduces no structural bias over inputs. To address this issue, we explore incorporating position relation prior as attention bias to augment o…

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: Accepted to ECCV 2024

  2. Sampling and active learning methods for network reliability estimation using K-terminal spanning tree

    Authors: Chen Ding, Pengfei Wei, Yan Shi, Jinxing Liu, Matteo Broggi, Michael Beer

    Abstract: Network reliability analysis remains a challenge due to the increasing size and complexity of networks. This paper presents a novel sampling method and an active learning method for efficient and accurate network reliability estimation under node failure and edge failure scenarios. The proposed sampling method adopts Monte Carlo technique to sample component lifetimes and the K-terminal spanning t…

    Submitted 9 July, 2024; originally announced July 2024.

    Journal ref: Reliability Engineering & System Safety (2024) 110309

  3. arXiv:2406.16564  [pdf, other]

    cs.CV

    FASTC: A Fast Attentional Framework for Semantic Traversability Classification Using Point Cloud

    Authors: Yirui Chen, Pengjin Wei, Zhenhuan Liu, Bingchao Wang, Jie Yang, Wei Liu

    Abstract: Producing traversability maps and understanding the surroundings are crucial prerequisites for autonomous navigation. In this paper, we address the problem of traversability assessment using point clouds. We propose a novel pillar feature extraction module that utilizes PointNet to capture features from point clouds organized in vertical volume and a 2D encoder-decoder structure to conduct travers…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: Accepted to ECAI 2023. Our code is publicly available at https://github.com/chenyirui/FASTC

  4. arXiv:2406.14282  [pdf, other]

    cs.CL cs.AI

    Learning to Plan for Retrieval-Augmented Large Language Models from Knowledge Graphs

    Authors: Junjie Wang, Mingyang Chen, Binbin Hu, Dan Yang, Ziqi Liu, Yue Shen, Peng Wei, Zhiqiang Zhang, Jinjie Gu, Jun Zhou, Jeff Z. Pan, Wen Zhang, Huajun Chen

    Abstract: Improving the performance of large language models (LLMs) in complex question-answering (QA) scenarios has always been a research focal point. Recent studies have attempted to enhance LLMs' performance by combining step-wise planning with external retrieval. While effective for advanced models like GPT-3.5, smaller LLMs face challenges in decomposing complex questions, necessitating supervised fin…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Work in progress

  5. arXiv:2406.12012  [pdf, other]

    cond-mat.supr-con

    Highly Efficient Superconducting Diodes and Rectifiers for Quantum Circuitry

    Authors: Josep Ingla-Aynés, Yasen Hou, Sarah Wang, En-De Chu, Oleg A. Mukhanov, Peng Wei, Jagadeesh S. Moodera

    Abstract: Superconducting electronics is essential for energy-efficient quantum and classical high-end computing applications. Towards this goal, non-reciprocal superconducting circuit elements, such as superconducting diodes (SDs) can fulfill many critical needs. SDs have been the subject of multiple studies, but integrating several SDs in a superconducting circuit remains a challenge. Here we implement th…

    Submitted 21 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: 6 pages, 3 figures

  6. arXiv:2406.03712  [pdf, other]

    cs.CL cs.LG

    A Survey on Medical Large Language Models: Technology, Application, Trustworthiness, and Future Directions

    Authors: Lei Liu, Xiaoyan Yang, Junchi Lei, Xiaoyang Liu, Yue Shen, Zhiqiang Zhang, Peng Wei, Jinjie Gu, Zhixuan Chu, Zhan Qin, Kui Ren

    Abstract: Large language models (LLMs), such as GPT series models, have received substantial attention due to their impressive capabilities for generating and understanding human-level language. More recently, LLMs have emerged as an innovative and powerful adjunct in the medical field, transforming traditional practices and heralding a new era of enhanced healthcare services. This survey provides a compreh…

    Submitted 5 June, 2024; originally announced June 2024.

  7. arXiv:2406.00632  [pdf, other]

    cs.CV

    Diff-Mosaic: Augmenting Realistic Representations in Infrared Small Target Detection via Diffusion Prior

    Authors: Yukai Shi, Yupei Lin, Pengxu Wei, Xiaoyu Xian, Tianshui Chen, Liang Lin

    Abstract: Recently, researchers have proposed various deep learning methods to accurately detect infrared targets with the characteristics of indistinct shape and texture. Due to the limited variety of infrared datasets, training deep learning models with good generalization poses a challenge. To augment the infrared dataset, researchers employ data augmentation techniques, which often involve generating ne…

    Submitted 2 June, 2024; originally announced June 2024.

  8. arXiv:2405.11459  [pdf, other]

    eess.SP cs.CL q-bio.NC

    Du-IN: Discrete units-guided mask modeling for decoding speech from Intracranial Neural signals

    Authors: Hui Zheng, Hai-Teng Wang, Wei-Bang Jiang, Zhong-Tao Chen, Li He, Pei-Yang Lin, Peng-Hu Wei, Guo-Guang Zhao, Yun-Zhe Liu

    Abstract: Invasive brain-computer interfaces have garnered significant attention due to their high performance. The current intracranial stereoElectroEncephaloGraphy (sEEG) foundation models typically build univariate representations based on a single channel. Some of them further use Transformer to model the relationship among channels. However, due to the locality and specificity of brain computation, the…

    Submitted 19 May, 2024; originally announced May 2024.

  9. arXiv:2405.11392  [pdf, ps, other]

    q-fin.MF q-fin.CP

    Deep Penalty Methods: A Class of Deep Learning Algorithms for Solving High Dimensional Optimal Stopping Problems

    Authors: Yunfei Peng, Pengyu Wei, Wei Wei

    Abstract: We propose a deep learning algorithm for high dimensional optimal stopping problems. Our method is inspired by the penalty method for solving free boundary PDEs. Within our approach, the penalized PDE is approximated using the Deep BSDE framework proposed by \cite{weinan2017deep}, which leads us to coin the term "Deep Penalty Method (DPM)" to refer to our algorithm. We show that the error of the D…

    Submitted 18 May, 2024; originally announced May 2024.

  10. arXiv:2405.04336  [pdf, other]

    cs.AI

    Temporal and Heterogeneous Graph Neural Network for Remaining Useful Life Prediction

    Authors: Zhihao Wen, Yuan Fang, Pengcheng Wei, Fayao Liu, Zhenghua Chen, Min Wu

    Abstract: Predicting Remaining Useful Life (RUL) plays a crucial role in the prognostics and health management of industrial systems that involve a variety of interrelated sensors. Given a constant stream of time series sensory data from such systems, deep learning models have risen to prominence at identifying complex, nonlinear temporal dependencies in these data. In addition to the temporal dependencies…

    Submitted 1 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

    Comments: 12 pages

  11. arXiv:2405.03967  [pdf, other]

    cs.LG cs.AI cs.AR

    SwiftRL: Towards Efficient Reinforcement Learning on Real Processing-In-Memory Systems

    Authors: Kailash Gogineni, Sai Santosh Dayapule, Juan Gómez-Luna, Karthikeya Gogineni, Peng Wei, Tian Lan, Mohammad Sadrosadati, Onur Mutlu, Guru Venkataramani

    Abstract: Reinforcement Learning (RL) trains agents to learn optimal behavior by maximizing reward signals from experience datasets. However, RL training often faces memory limitations, leading to execution latencies and prolonged training times. To overcome this, SwiftRL explores Processing-In-Memory (PIM) architectures to accelerate RL workloads. We achieve near-linear performance scaling by implementing…

    Submitted 6 May, 2024; originally announced May 2024.

  12. arXiv:2405.00542  [pdf, other]

    eess.IV cs.CV

    UWAFA-GAN: Ultra-Wide-Angle Fluorescein Angiography Transformation via Multi-scale Generation and Registration Enhancement

    Authors: Ruiquan Ge, Zhaojie Fang, Pengxue Wei, Zhanghao Chen, Hongyang Jiang, Ahmed Elazab, Wangting Li, Xiang Wan, Shaochong Zhang, Changmiao Wang

    Abstract: Fundus photography, in combination with the ultra-wide-angle fundus (UWF) techniques, becomes an indispensable diagnostic tool in clinical settings by offering a more comprehensive view of the retina. Nonetheless, UWF fluorescein angiography (UWF-FA) necessitates the administration of a fluorescent dye via injection into the patient's hand or elbow unlike UWF scanning laser ophthalmoscopy (UWF-SLO…

    Submitted 1 May, 2024; originally announced May 2024.

  13. arXiv:2404.14309  [pdf, other]

    cs.CV

    Towards Better Adversarial Purification via Adversarial Denoising Diffusion Training

    Authors: Yiming Liu, Kezhao Liu, Yao Xiao, Ziyi Dong, Xiaogang Xu, Pengxu Wei, Liang Lin

    Abstract: Recently, diffusion-based purification (DBP) has emerged as a promising approach for defending against adversarial attacks. However, previous studies have used questionable methods to evaluate the robustness of DBP models, their explanations of DBP robustness also lack experimental support. We re-examine DBP robustness using precise gradient, and discuss the impact of stochasticity on DBP robustne…

    Submitted 22 April, 2024; originally announced April 2024.

  14. arXiv:2404.10264  [pdf, other]

    hep-ex hep-ph quant-ph

    Calibration of the Cryogenic Measurement System of a Resonant Haloscope Cavity

    Authors: Dong He, Jie Fan, Xin Gao, Yu Gao, Nick Houston, Zhongqing Ji, Yirong Jin, Chuang Li, Jinmian Li, Tianjun Li, Shi-hang Liu, Jia-Shu Niu, Zhihui Peng, Liang Sun, Zheng Sun, Jia Wang, Puxian Wei, Lina Wu, Zhongchen Xiang, Qiaoli Yang, Chi Zhang, Wenxing Zhang, Xin Zhang, Dongning Zheng, Ruifeng Zheng, et al. (1 additional author not shown)

    Abstract: Possible light bosonic dark matter interactions with the Standard Model photon have been searched by microwave resonant cavities. In this paper, we demonstrate the cryogenic readout system calibration of a 7.138 GHz copper cavity with a loaded quality factor $Q_l=10^4$, operated at 22 mK temperature based on a dilution refrigerator. Our readout system consists of High Electron Mobility Transistors…

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 7 pages, 5 figures, version to appear in CPC

  15. arXiv:2404.09790  [pdf, other]

    cs.CV

    NTIRE 2024 Challenge on Image Super-Resolution ($\times$4): Methods and Results

    Authors: Zheng Chen, Zongwei Wu, Eduard Zamfir, Kai Zhang, Yulun Zhang, Radu Timofte, Xiaokang Yang, Hongyuan Yu, Cheng Wan, Yuxin Hong, Zhijuan Huang, Yajun Zou, Yuan Huang, Jiamin Lin, Bingnan Han, Xianyu Guan, Yongsheng Yu, Daoan Zhang, Xuanwu Yin, Kunlong Zuo, Jinhua Hao, Kai Zhao, Kun Yuan, Ming Sun, Chao Zhou, et al. (63 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2024 challenge on image super-resolution ($\times$4), highlighting the solutions proposed and the outcomes obtained. The challenge involves generating corresponding high-resolution (HR) images, magnified by a factor of four, from low-resolution (LR) inputs using prior information. The LR images originate from bicubic downsampling degradation. The aim of the challenge i…

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: NTIRE 2024 webpage: https://cvlai.net/ntire/2024. Code: https://github.com/zhengchen1999/NTIRE2024_ImageSR_x4

  16. arXiv:2404.09263  [pdf, other]

    cs.CV cs.AI

    Task-Driven Exploration: Decoupling and Inter-Task Feedback for Joint Moment Retrieval and Highlight Detection

    Authors: Jin Yang, Ping Wei, Huan Li, Ziyang Ren

    Abstract: Video moment retrieval and highlight detection are two highly valuable tasks in video understanding, but until recently they have been jointly studied. Although existing studies have made impressive advancement recently, they predominantly follow the data-driven bottom-up paradigm. Such paradigm overlooks task-specific and inter-task effects, resulting in poor model performance. In this paper, we…

    Submitted 14 April, 2024; originally announced April 2024.

  17. arXiv:2404.00908  [pdf, other]

    hep-ex hep-ph

    Dark photon constraints from a 7.139 GHz cavity haloscope experiment

    Authors: Dong He, Jie Fan, Xin Gao, Yu Gao, Nick Houston, Zhongqing Ji, Yirong Jin, Chuang Li, Jinmian Li, Tianjun Li, Shi-hang Liu, Jia-Shu Niu, Zhihui Peng, Liang Sun, Zheng Sun, Jia Wang, Puxian Wei, Lina Wu, Zhongchen Xiang, Qiaoli Yang, Chi Zhang, Wenxing Zhang, Xin Zhang, Dongning Zheng, Ruifeng Zheng, et al. (1 additional author not shown)

    Abstract: The dark photon is a promising candidate for the dark matter which comprises most of the matter in our visible Universe. Via kinetic mixing with the Standard Model it can also be resonantly converted to photons in an electromagnetic cavity, offering novel experimental possibilities for the discovery and study of dark matter. We report the results of a pathfinder dark photon dark matter cavity sear…

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: 5 pages, 4 figures

  18. arXiv:2403.16131  [pdf, other]

    cs.CV

    Salience DETR: Enhancing Detection Transformer with Hierarchical Salience Filtering Refinement

    Authors: Xiuquan Hou, Meiqin Liu, Senlin Zhang, Ping Wei, Badong Chen

    Abstract: DETR-like methods have significantly increased detection performance in an end-to-end manner. The mainstream two-stage frameworks of them perform dense self-attention and select a fraction of queries for sparse cross-attention, which is proven effective for improving performance but also introduces a heavy computational burden and high dependence on stable query selection. This paper demonstrates…

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024

  19. IDF-CR: Iterative Diffusion Process for Divide-and-Conquer Cloud Removal in Remote-sensing Images

    Authors: Meilin Wang, Yexing Song, Pengxu Wei, Xiaoyu Xian, Yukai Shi, Liang Lin

    Abstract: Deep learning technologies have demonstrated their effectiveness in removing cloud cover from optical remote-sensing images. Convolutional Neural Networks (CNNs) exert dominance in the cloud removal tasks. However, constrained by the inherent limitations of convolutional operations, CNNs can address only a modest fraction of cloud occlusion. In recent years, diffusion models have achieved state-of…

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted by IEEE TGRS, we first present an iterative diffusion process for cloud removal, the code is available at: https://github.com/SongYxing/IDF-CR

  20. arXiv:2403.11852  [pdf, other]

    cs.RO cs.AI

    Reinforcement Learning with Latent State Inference for Autonomous On-ramp Merging under Observation Delay

    Authors: Amin Tabrizian, Zhitong Huang, Peng Wei

    Abstract: This paper presents a novel approach to address the challenging problem of autonomous on-ramp merging, where a self-driving vehicle needs to seamlessly integrate into a flow of vehicles on a multi-lane highway. We introduce the Lane-keeping, Lane-changing with Latent-state Inference and Safety Controller (L3IS) agent, designed to perform the on-ramp merging task safely without comprehensive knowle…

    Submitted 21 June, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

  21. arXiv:2403.06579  [pdf, other]

    eess.SY

    Edge Information Hub: Orchestrating Satellites, UAVs, MEC, Sensing and Communications for 6G Closed-Loop Controls

    Authors: Chengleyang Lei, Wei Feng, Peng Wei, Yunfei Chen, Ning Ge, Shiwen Mao

    Abstract: An increasing number of field robots would be used for mission-critical tasks in remote or post-disaster areas. Due to usually-limited individual abilities, these robots require an edge information hub (EIH), which is capable of not only communications but also sensing and computing. Such EIH could be deployed on a flexibly-dispatched unmanned aerial vehicle (UAV). Different from traditional aeria…

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: 13 pages, 9 figures

  22. arXiv:2402.13769  [pdf, other]

    cs.IR

    General Debiasing for Graph-based Collaborative Filtering via Adversarial Graph Dropout

    Authors: An Zhang, Wenchang Ma, Pengbo Wei, Leheng Sheng, Xiang Wang

    Abstract: Graph neural networks (GNNs) have shown impressive performance in recommender systems, particularly in collaborative filtering (CF). The key lies in aggregating neighborhood information on a user-item interaction graph to enhance user/item representations. However, we have discovered that this aggregation mechanism comes with a drawback, which amplifies biases present in the interaction graph. For…

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: Accepted to WWW 2024

  23. arXiv:2401.06992  [pdf, other]

    cs.CV cs.AI

    Progressive Feature Fusion Network for Enhancing Image Quality Assessment

    Authors: Kaiqun Wu, Xiaoling Jiang, Rui Yu, Yonggang Luo, Tian Jiang, Xi Wu, Peng Wei

    Abstract: Image compression has been applied in the fields of image storage and video broadcasting. However, it's formidably tough to distinguish the subtle quality differences between those distorted images generated by different algorithms. In this paper, we propose a new image quality assessment framework to decide which image is better in an image group. To capture the subtle differences, a fine-grained…

    Submitted 13 January, 2024; originally announced January 2024.

    Comments: Data Compression Conference

  24. arXiv:2312.10299  [pdf, other]

    cs.CV cs.AI cs.LG

    Image Restoration Through Generalized Ornstein-Uhlenbeck Bridge

    Authors: Conghan Yue, Zhengwei Peng, Junlong Ma, Shiyan Du, Pengxu Wei, Dongyu Zhang

    Abstract: Diffusion models exhibit powerful generative capabilities enabling noise mapping to data via reverse stochastic differential equations. However, in image restoration, the focus is on the mapping relationship from low-quality to high-quality images. Regarding this issue, we introduce the Generalized Ornstein-Uhlenbeck Bridge (GOUB) model. By leveraging the natural mean-reverting property of the gen…

    Submitted 17 May, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: ICML 2024

  25. arXiv:2311.05374  [pdf, other]

    cs.CL cs.AI

    TencentLLMEval: A Hierarchical Evaluation of Real-World Capabilities for Human-Aligned LLMs

    Authors: Shuyi Xie, Wenlin Yao, Yong Dai, Shaobo Wang, Donlin Zhou, Lifeng Jin, Xinhua Feng, Pengzhi Wei, Yujie Lin, Zhichao Hu, Dong Yu, Zhengyou Zhang, Jing Nie, Yuhong Liu

    Abstract: Large language models (LLMs) have shown impressive capabilities across various natural language tasks. However, evaluating their alignment with human preferences remains a challenge. To this end, we propose a comprehensive human evaluation framework to assess LLMs' proficiency in following instructions on diverse real-world tasks. We construct a hierarchical task tree encompassing 7 major areas co…

    Submitted 9 November, 2023; originally announced November 2023.

  26. arXiv:2310.18670  [pdf, other]

    eess.SP

    Two-stage space construction for real-time modeling of distributed parameter systems under sparse sensing

    Authors: Peng Wei

    Abstract: Numerous industrial processes can be defined using distributed parameter systems (DPSs). This study introduces a two-stage spatial construction approach for real-time modeling of DPSs in cases of limited sensors. Initially, a discrete space-completion approach is created to recuperate the spatiotemporal patterns of non-monitored locations under sparse sensing. The high-dimensional space constructi…

    Submitted 28 October, 2023; originally announced October 2023.

  27. arXiv:2310.15138  [pdf, other]

    cs.RO cs.CV

    Fusion-Driven Tree Reconstruction and Fruit Localization: Advancing Precision in Agriculture

    Authors: Kaiming Fu, Peng Wei, Juan Villacres, Zhaodan Kong, Stavros G. Vougioukas, Brian N. Bailey

    Abstract: Fruit distribution is pivotal in shaping the future of both agriculture and agricultural robotics, paving the way for a streamlined supply chain. This study introduces an innovative methodology that harnesses the synergy of RGB imagery, LiDAR, and IMU data, to achieve intricate tree reconstructions and the pinpoint localization of fruits. Such integration not only offers insights into the fruit di…

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: This work was presented at IEEE/RSI International Conference on Intelligent Robots and Systems (IROS) Workshop

  28. arXiv:2310.08606  [pdf, other]

    eess.SP

    Multiscale Fusion for Abnormality Detection and Localization of Distributed Parameter Systems

    Authors: Peng Wei, Han-Xiong Li

    Abstract: Numerous industrial thermal processes and fluid processes can be described by distributed parameter systems (DPSs), wherein many process parameters and variables vary in space and time. Early internal abnormalities in the DPS may develop into uncontrollable thermal failures, causing serious safety incidents. In this study, the multiscale information fusion is proposed for internal abnormality dete…

    Submitted 1 December, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

  29. arXiv:2309.04803  [pdf, other]

    cs.CV cs.AI

    Towards Real-World Burst Image Super-Resolution: Benchmark and Method

    Authors: Pengxu Wei, Yujing Sun, Xingbei Guo, Chang Liu, Jie Chen, Xiangyang Ji, Liang Lin

    Abstract: Despite substantial advances, single-image super-resolution (SISR) is always in a dilemma to reconstruct high-quality images with limited information from one input image, especially in realistic scenarios. In this paper, we establish a large-scale real-world burst super-resolution dataset, i.e., RealBSR, to explore the faithful reconstruction of image details from multiple frames. Furthermore, we…

    Submitted 9 September, 2023; originally announced September 2023.

    Comments: Accepted by ICCV2023

  30. arXiv:2309.02906  [pdf, other]

    math.PR

    Well-posedness and averaging principle for Lévy-type McKean-Vlasov stochastic differential equations under local Lipschitz conditions

    Authors: Ying Chao, Jinqiao Duan, Ting Gao, Pingyuan Wei

    Abstract: In this paper, we investigate a class of McKean-Vlasov stochastic differential equations under Lévy-type perturbations. We first establish the existence and uniqueness theorem for solutions of the McKean-Vlasov stochastic differential equations by utilizing the Euler-like approximation. Then under some suitable conditions, we show that the solutions of McKean-Vlasov stochastic differential equatio…

    Submitted 6 September, 2023; originally announced September 2023.

    Comments: 29 pages, 7 figures

    MSC Class: 60H10; 60G51; 34C29; 35Q83

  31. arXiv:2308.15016  [pdf, other]

    cs.CV

    C2G2: Controllable Co-speech Gesture Generation with Latent Diffusion Model

    Authors: Longbin Ji, Pengfei Wei, Yi Ren, Jinglin Liu, Chen Zhang, Xiang Yin

    Abstract: Co-speech gesture generation is crucial for automatic digital avatar animation. However, existing methods suffer from issues such as unstable training and temporal inconsistency, particularly in generating high-fidelity and comprehensive gestures. Additionally, these methods lack effective control over speaker identity and temporal editing of the generated gestures. Focusing on capturing temporal…

    Submitted 29 August, 2023; originally announced August 2023.

    Comments: 12 pages, 6 figures, 7 tables

  32. arXiv:2308.02263  [pdf, other]

    cs.SD cs.CL eess.AS

    Efficient Monaural Speech Enhancement using Spectrum Attention Fusion

    Authors: Jinyu Long, Jetic Gū, Binhao Bai, Zhibo Yang, Ping Wei, Junli Li

    Abstract: Speech enhancement is a demanding task in automated speech processing pipelines, focusing on separating clean speech from noisy channels. Transformer based models have recently bested RNN and CNN models in speech enhancement, however at the same time they are much more computationally expensive and require much more high quality training data, which is always hard to come by. In this paper, we pre…

    Submitted 4 August, 2023; originally announced August 2023.

  33. arXiv:2308.01117  [pdf]

    cs.RO eess.SY

    Optimization-Based Motion Planning for Autonomous Agricultural Vehicles Turning in Constrained Headlands

    Authors: Chen Peng, Peng Wei, Zhenghao Fei, Yuankai Zhu, Stavros G. Vougioukas

    Abstract: Headland maneuvering is a crucial aspect of unmanned field operations for autonomous agricultural vehicles (AAVs). While motion planning for headland turning in open fields has been extensively studied and integrated into commercial auto-guidance systems, the existing methods primarily address scenarios with ample headland space and thus may not work in more constrained headland geometries. Commer…

    Submitted 11 June, 2024; v1 submitted 2 August, 2023; originally announced August 2023.

  34. arXiv:2307.16242  [pdf, other]

    cs.CV

    SR-R$^2$KAC: Improving Single Image Defocus Deblurring

    Authors: Peng Tang, Zhiqiang Xu, Pengfei Wei, Xiaobin Hu, Peilin Zhao, Xin Cao, Chunlai Zhou, Tobias Lasser

    Abstract: We propose an efficient deep learning method for single image defocus deblurring (SIDD) by further exploring inverse kernel properties. Although the current inverse kernel method, i.e., kernel-sharing parallel atrous convolution (KPAC), can address spatially varying defocus blurs, it has difficulty in handling large blurs of this kind. To tackle this issue, we propose a Residual and Recursive Ke…

    Submitted 30 July, 2023; originally announced July 2023.

    Comments: Submitted to IEEE Transactions on Cybernetics on 2023-July-24

  35. arXiv:2307.11530  [pdf, other]

    eess.IV cs.CV

    UWAT-GAN: Fundus Fluorescein Angiography Synthesis via Ultra-wide-angle Transformation Multi-scale GAN

    Authors: Zhaojie Fang, Zhanghao Chen, Pengxue Wei, Wangting Li, Shaochong Zhang, Ahmed Elazab, Gangyong Jia, Ruiquan Ge, Changmiao Wang

    Abstract: Fundus photography is an essential examination for clinical and differential diagnosis of fundus diseases. Recently, Ultra-Wide-angle Fundus (UWF) techniques, UWF Fluorescein Angiography (UWF-FA) and UWF Scanning Laser Ophthalmoscopy (UWF-SLO) have been gradually put into use. However, Fluorescein Angiography (FA) and UWF-FA require injecting sodium fluorescein which may have detrimental influence…

    Submitted 21 July, 2023; originally announced July 2023.

    Comments: 26th International Conference on Medical Image Computing and Computer Assisted Intervention

  36. arXiv:2307.07218  [pdf, other]

    eess.AS cs.SD

    Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis

    Authors: Ziyue Jiang, Jinglin Liu, Yi Ren, Jinzheng He, Zhenhui Ye, Shengpeng Ji, Qian Yang, Chen Zhang, Pengfei Wei, Chunfeng Wang, Xiang Yin, Zejun Ma, Zhou Zhao

    Abstract: Zero-shot text-to-speech (TTS) aims to synthesize voices with unseen speech prompts, which significantly reduces the data and computation requirements for voice cloning by skipping the fine-tuning process. However, the prompting mechanisms of zero-shot TTS still face challenges in the following aspects: 1) previous works of zero-shot TTS are typically trained with single-sentence prompts, which si…

    Submitted 10 April, 2024; v1 submitted 14 July, 2023; originally announced July 2023.

    Comments: Accepted by ICLR 2024

  37. arXiv:2306.11647  [pdf, ps, other]

    cs.RO eess.SY

    Safe and Scalable Real-Time Trajectory Planning Framework for Urban Air Mobility

    Authors: Abenezer Taye, Roberto Valenti, Akshay Rajhans, Anastasia Mavrommati, Pieter J. Mosterman, Peng Wei

    Abstract: This paper presents a real-time trajectory planning framework for Urban Air Mobility (UAM) that is both safe and scalable. The proposed framework employs a decentralized, free-flight concept of operation in which each aircraft independently performs separation assurance and conflict resolution, generating safe trajectories by accounting for the future states of nearby aircraft. The framework consi…

    Submitted 20 June, 2023; originally announced June 2023.

  38. arXiv:2306.00187  [pdf, other]

    cs.MA

    AccMER: Accelerating Multi-Agent Experience Replay with Cache Locality-aware Prioritization

    Authors: Kailash Gogineni, Yongsheng Mei, Peng Wei, Tian Lan, Guru Venkataramani

    Abstract: Multi-Agent Experience Replay (MER) is a key component of off-policy reinforcement learning (RL) algorithms. By remembering and reusing experiences from the past, experience replay significantly improves the stability of RL algorithms and their learning efficiency. In many scenarios, multiple agents interact in a shared environment during online training under centralized training and decentralize…

    Submitted 31 May, 2023; originally announced June 2023.

    Comments: Accepted to ASAP'23

  39. arXiv:2305.13996  [pdf, other]

    cs.RO

    One-Shot Strategically Deconflicted Route and Operational Volume Generation for Urban Air Mobility Operations

    Authors: Ellis L Thompson, Yan Xu, Peng Wei

    Abstract: In the UAM space, strategic deconfliction provides an all-essential layer to airspace automation by providing safe, pre-emptive deconfliction or assignment of airspace resources to airspace users pre-flight. Strategic deconfliction approaches provide an elegant solution to pre-flight deconfliction operations. This overall creates safer and more efficient airspace and reduces the workload on contro…

    Submitted 15 August, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: 8 pages, 7 Figures

  40. arXiv:2305.13411  [pdf, other]

    cs.MA

    Towards Efficient Multi-Agent Learning Systems

    Authors: Kailash Gogineni, Peng Wei, Tian Lan, Guru Venkataramani

    Abstract: Multi-Agent Reinforcement Learning (MARL) is an increasingly important research field that can model and control multiple large-scale autonomous systems. Despite its achievements, existing multi-agent learning methods typically involve expensive computations in terms of training time and power arising from large observation-action space and a huge number of training steps. Therefore, a key challen…

    Submitted 23 May, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted at MLArchSys, ISCA 2023. Compared to arXiv:2302.05007, we explore a neighbor sampling strategy to improve the locality of data access within the mini-batch sampling phase. Our preliminary experiments provide performance improvement ranging from 26.66% (3 agents) to 27.39% (12 agents) in the sampling phase training run-time

  41. arXiv:2305.10556  [pdf, other]

    cs.AI

    Integrated Conflict Management for UAM with Strategic Demand Capacity Balancing and Learning-based Tactical Deconfliction

    Authors: Shulu Chen, Antony Evans, Marc Brittain, Peng Wei

    Abstract: Urban air mobility (UAM) has the potential to revolutionize our daily transportation, offering rapid and efficient deliveries of passengers and cargo between dedicated locations within and around the urban environment. Before the commercialization and adoption of this emerging transportation mode, however, aviation safety must be guaranteed, i.e., all the aircraft have to be safely separated by st…

    Submitted 17 May, 2023; originally announced May 2023.

  42. arXiv:2305.08328  [pdf, other]

    cs.IR cs.LG

    FedAds: A Benchmark for Privacy-Preserving CVR Estimation with Vertical Federated Learning

    Authors: Penghui Wei, Hongjian Dou, Shaoguo Liu, Rongjun Tang, Li Liu, Liang Wang, Bo Zheng

    Abstract: Conversion rate (CVR) estimation aims to predict the probability of a conversion event after a user has clicked an ad. Typically, the online publisher has user browsing interests and click feedback, while the demand-side advertising platform collects users' post-click behaviors such as dwell time and conversion decisions. To estimate CVR accurately and protect data privacy better, vertical federated learni…

    Submitted 14 May, 2023; originally announced May 2023.

    Comments: SIGIR 2023, Resource Track

  43. arXiv:2305.08293  [pdf, other]

    cs.CV cs.MM

    Identity-Preserving Talking Face Generation with Landmark and Appearance Priors

    Authors: Weizhi Zhong, Chaowei Fang, Yinqi Cai, Pengxu Wei, Gangming Zhao, Liang Lin, Guanbin Li

    Abstract: Generating talking face videos from audio attracts lots of research interest. A few person-specific methods can generate vivid videos but require the target speaker's videos for training or fine-tuning. Existing person-generic methods have difficulty in generating realistic and lip-synced videos while preserving identity information. To tackle this problem, we propose a two-stage framework consist…

    Submitted 14 May, 2023; originally announced May 2023.

    Comments: CVPR2023, Code: https://github.com/Weizhi-Zhong/IP_LAP

  44. arXiv:2305.05838  [pdf, other]

    cs.CV cs.MM

    Generative Steganographic Flow

    Authors: Ping Wei, Ge Luo, Qi Song, Xinpeng Zhang, Zhenxing Qian, Sheng Li

    Abstract: Generative steganography (GS) is a new data hiding manner, featuring direct generation of stego media from secret data. Existing GS methods are generally criticized for their poor performances. In this paper, we propose a novel flow based GS approach -- Generative Steganographic Flow (GSF), which provides direct generation of stego images without cover image. We take the stego image generation and…

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: The accepted paper in ICME 2022

  45. arXiv:2305.03472  [pdf, other]

    cs.MM cs.AI

    Generative Steganography Diffusion

    Authors: Ping Wei, Qing Zhou, Zichi Wang, Zhenxing Qian, Xinpeng Zhang, Sheng Li

    Abstract: Generative steganography (GS) is an emerging technique that generates stego images directly from secret data. Various GS methods based on GANs or Flow have been developed recently. However, existing GAN-based GS methods cannot completely recover the hidden secret data due to the lack of network invertibility, while Flow-based methods produce poor image quality due to the stringent reversibility re…

    Submitted 6 September, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

    Comments: Shall not be reproduced without permission, rights reserved!

  46. arXiv:2304.00040  [pdf]

    cs.LG

    A robust deep learning-based damage identification approach for SHM considering missing data

    Authors: Fan Deng, Xiaoming Tao, Pengxiang Wei, Shiyin Wei

    Abstract: Data-driven methods for Structural Health Monitoring (SHM), which mine the hidden structural performance from the correlations among monitored time series data, have received wide attention recently. However, missing data significantly impacts the application of these methods. Missing data is a frequently encountered issue in time series data in SHM and many other real-world applications, that harms to…

    Submitted 31 March, 2023; originally announced April 2023.

  47. arXiv:2303.07693  [pdf, other]

    cs.LG cs.AI

    Adaptive Policy Learning for Offline-to-Online Reinforcement Learning

    Authors: Han Zheng, Xufang Luo, Pengfei Wei, Xuan Song, Dongsheng Li, Jing Jiang

    Abstract: Conventional reinforcement learning (RL) needs an environment to collect fresh data, which is impractical when online interactions are costly. Offline RL provides an alternative solution by directly learning from the previously collected dataset. However, it will yield unsatisfactory performance if the quality of the offline datasets is poor. In this paper, we consider an offline-to-online setting…

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: AAAI2023

  48. arXiv:2303.03052  [pdf, other]

    cs.CV cs.LG

    Masked Images Are Counterfactual Samples for Robust Fine-tuning

    Authors: Yao Xiao, Ziyi Tang, Pengxu Wei, Cong Liu, Liang Lin

    Abstract: Deep learning models are challenged by the distribution shift between the training data and test data. Recently, the large models pre-trained on diverse data have demonstrated unprecedented robustness to various distribution shifts. However, fine-tuning these models can lead to a trade-off between in-distribution (ID) performance and out-of-distribution (OOD) robustness. Existing methods for tackl…

    Submitted 2 April, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

    Comments: Accepted by CVPR 2023 (v2: improve the clarity; v3: camera ready version)

  49. arXiv:2302.10418  [pdf, other]

    cs.LG cs.AI cs.MA

    MAC-PO: Multi-Agent Experience Replay via Collective Priority Optimization

    Authors: Yongsheng Mei, Hanhan Zhou, Tian Lan, Guru Venkataramani, Peng Wei

    Abstract: Experience replay is crucial for off-policy reinforcement learning (RL) methods. By remembering and reusing the experiences from past different policies, experience replay significantly improves the training efficiency and stability of RL algorithms. Many decision-making problems in practice naturally involve multiple agents and require multi-agent reinforcement learning (MARL) under centralized t…

    Submitted 27 February, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: The 22nd International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2023). arXiv admin note: text overlap with arXiv:2302.05593

  50. Implementation and performances of the IPbus protocol for the JUNO Large-PMT readout electronics

    Authors: Riccardo Triozzi, Andrea Serafini, Marco Bellato, Antonio Bergnoli, Matteo Bolognesi, Riccardo Brugnera, Vanessa Cerrone, Chao Chen, Barbara Clerbaux, Alberto Coppi, Daniele Corti, Flavio dal Corso, Jianmeng Dong, Wei Dou, Lei Fan, Alberto Garfagnini, Arsenii Gavrikov, Guanghua Gong, Marco Grassi, Rosa Maria Guizzetti, Shuang Hang, Cong He, Jun Hu, Roberto Isocrate, Beatrice Jelmini, et al. (107 additional authors not shown)

    Abstract: The Jiangmen Underground Neutrino Observatory (JUNO) is a large neutrino detector currently under construction in China. Thanks to the tight requirements on its optical and radio-purity properties, it will be able to perform leading measurements detecting terrestrial and astrophysical neutrinos in a wide energy range from tens of keV to hundreds of MeV. A key requirement for the success of the exp…

    Submitted 20 February, 2023; originally announced February 2023.