Skip to main content

Showing 1–50 of 722 results for author: Zhang, H

  1. arXiv:2407.11529  [pdf, other

    eess.IV cs.AI cs.CV

    Cross-Phase Mutual Learning Framework for Pulmonary Embolism Identification on Non-Contrast CT Scans

    Authors: Bizhe Bai, Yan-Jie Zhou, Yujian Hu, Tony C. W. Mok, Yilang Xiang, Le Lu, Hongkun Zhang, Minfeng Xu

    Abstract: Pulmonary embolism (PE) is a life-threatening condition where rapid and accurate diagnosis is imperative yet difficult due to predominantly atypical symptomatology. Computed tomography pulmonary angiography (CTPA) is acknowledged as the gold standard imaging tool in clinics, yet it can be contraindicated for emergency department (ED) patients and represents an onerous procedure, thus necessitating… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: Early accept by MICCAI 2024

  2. arXiv:2407.11322  [pdf, ps, other

    eess.SP

    Reconfigurable-Intelligent-Surface Assisted Orbital-Angular-Momentum Secure Communications

    Authors: Minmin Wang, Liping Liang, Wenchi Cheng, Wei Zhang, Ruirui Chen, Hailin Zhang

    Abstract: As a kind of wavefront with helical phase, orbital angular momentum (OAM) shows the great potential to enhance the security results of wireless communications due to its unique orthogonality and central hollow electromagnetic wave structure. Therefore, in this paper we propose the reconfigurable-intelligent-surface (RIS) assisted OAM scheme, where RIS is deployed to weaken the information acquisit… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:2406.05799

  3. arXiv:2407.11087  [pdf, other

    eess.IV cs.CV

    Restore-RWKV: Efficient and Effective Medical Image Restoration with RWKV

    Authors: Zhiwen Yang, Hui Zhang, Dan Zhao, Bingzheng Wei, Yan Xu

    Abstract: Transformers have revolutionized medical image restoration, but the quadratic complexity still poses limitations for their application to high-resolution medical images. The recent advent of RWKV in the NLP field has attracted much attention as it can process long sequences efficiently. To leverage its advanced design, we propose Restore-RWKV, the first RWKV-based model for medical image restorati… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: This paper introduces the first RWKV-based model for image restoration

  4. arXiv:2407.09268  [pdf, other

    eess.IV cs.CV

    Region Attention Transformer for Medical Image Restoration

    Authors: Zhiwen Yang, Haowei Chen, Ziniu Qian, Yang Zhou, Hui Zhang, Dan Zhao, Bingzheng Wei, Yan Xu

    Abstract: Transformer-based methods have demonstrated impressive results in medical image restoration, attributed to the multi-head self-attention (MSA) mechanism in the spatial dimension. However, the majority of existing Transformers conduct attention within fixed and coarsely partitioned regions (\text{e.g.} the entire image or fixed patches), resulting in interference from irrelevant regions and fragmen… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: This paper has been accepted by MICCAI 2024

  5. arXiv:2407.08093  [pdf, other

    eess.IV cs.AI cs.CV eess.SP

    MemWarp: Discontinuity-Preserving Cardiac Registration with Memorized Anatomical Filters

    Authors: Hang Zhang, Xiang Chen, Renjiu Hu, Dongdong Liu, Gaolei Li, Rongguang Wang

    Abstract: Many existing learning-based deformable image registration methods impose constraints on deformation fields to ensure they are globally smooth and continuous. However, this assumption does not hold in cardiac image registration, where different anatomical regions exhibit asymmetric motions during respiration and movements due to sliding organs within the chest. Consequently, such global constraint… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: 11 pages, 2 figure, 2 tables

  6. arXiv:2407.07554  [pdf, other

    cs.GR cs.SD eess.AS

    Beat-It: Beat-Synchronized Multi-Condition 3D Dance Generation

    Authors: Zikai Huang, Xuemiao Xu, Cheng Xu, Huaidong Zhang, Chenxi Zheng, Jing Qin, Shengfeng He

    Abstract: Dance, as an art form, fundamentally hinges on the precise synchronization with musical beats. However, achieving aesthetically pleasing dance sequences from music is challenging, with existing methods often falling short in controllability and beat alignment. To address these shortcomings, this paper introduces Beat-It, a novel framework for beat-specific, key pose-guided dance generation. Unlike… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  7. arXiv:2407.06227  [pdf, ps, other

    eess.SY cs.AI

    Communication and Control Co-Design in 6G: Sequential Decision-Making with LLMs

    Authors: Xianfu Chen, Celimuge Wu, Yi Shen, Yusheng Ji, Tsutomu Yoshinaga, Qiang Ni, Charilaos C. Zarakovitis, Honggang Zhang

    Abstract: This article investigates a control system within the context of six-generation wireless networks. The control performance optimization confronts the technical challenges that arise from the intricate interactions between communication and control sub-systems, asking for a co-design. Accounting for the system dynamics, we formulate the sequential co-design decision-makings of communication and con… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

  8. arXiv:2407.04746  [pdf

    eess.SP

    Moving Target Detection Method Based on Range? Doppler Domain Compensation and Cancellation for UAV-Mounted Radar

    Authors: Xiaodong Qu, Xiaolong Sun, Feiyang Liu, Hao Zhang, Shichao Zhong, Xiaopeng Yang

    Abstract: Combining unmanned aerial vehicle (UAV) with through-the-wall radar can realize moving targets detection in complex building scenes. However, clutters generated by obstacles and static objects are always stronger and non-stationary, which results in heavy impacts on moving targets detection. To address this issue, this paper proposes a moving target detection method based on Range-Doppler domain c… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  9. arXiv:2407.04561  [pdf, other

    cs.NI eess.SP

    Wireless Spectrum in Rural Farmlands: Status, Challenges and Opportunities

    Authors: Mukaram Shahid, Kunal Das, Taimoor Ul Islam, Christ Somiah, Daji Qiao, Arsalan Ahmad, Jimming Song, Zhengyuan Zhu, Sarath Babu, Yong Guan, Tusher Chakraborty, Suraj Jog, Ranveer Chandra, Hongwei Zhang

    Abstract: Due to factors such as low population density and expansive geographical distances, network deployment falls behind in rural regions, leading to a broadband divide. Wireless spectrum serves as the blood and flesh of wireless communications. Shared white spaces such as those in the TVWS and CBRS spectrum bands offer opportunities to expand connectivity, innovate, and provide affordable access to hi… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  10. arXiv:2407.04518  [pdf, other

    eess.AS

    From Audio Encoders to Piano Judges: Benchmarking Performance Understanding for Solo Piano

    Authors: Huan Zhang, Jinhua Liang, Simon Dixon

    Abstract: Our study investigates an approach for understanding musical performances through the lens of audio encoding models, focusing on the domain of solo Western classical piano music. Compared to composition-level attribute understanding such as key or genre, we identify a knowledge gap in performance-level music understanding, and address three critical tasks: expertise ranking, difficulty estimation,… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: Accepted by the 25th International Society for Music Information Retrieval (ISMIR)

  11. arXiv:2407.03671  [pdf

    eess.SY

    Spatio-temporal cooperative control Method of Highway Ramp Merge Based on Vehicle-road Coordination

    Authors: Xiaoxue Xu, Maokai Lai, Haitao Zhang, Xiang Dong, Tao Li, Jie Wu, Yuan Li, Ting Peng

    Abstract: The merging area of highway ramps faces multiple challenges, including traffic congestion, collision risks, speed mismatches, driver behavior uncertainties, limited visibility, and bottleneck effects. However, autonomous vehicles engaging in depth coordination between vehicle and road in merging zones, by pre-planning and uploading travel trajectories, can significantly enhance the safety and effi… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  12. arXiv:2407.02182  [pdf, other

    cs.CV cs.RO eess.IV

    Occlusion-Aware Seamless Segmentation

    Authors: Yihong Cao, Jiaming Zhang, Hao Shi, Kunyu Peng, Yuhongxuan Zhang, Hui Zhang, Rainer Stiefelhagen, Kailun Yang

    Abstract: Panoramic images can broaden the Field of View (FoV), occlusion-aware prediction can deepen the understanding of the scene, and domain adaptation can transfer across viewing domains. In this work, we introduce a novel task, Occlusion-Aware Seamless Segmentation (OASS), which simultaneously tackles all these three challenges. For benchmarking OASS, we establish a new human-annotated dataset for Ble… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Accepted to ECCV 2024. The fresh dataset and the source code will be made publicly available at https://github.com/yihong-97/OASS

  13. arXiv:2407.00681  [pdf, other

    eess.SY

    Safe Reinforcement Learning for Power System Control: A Review

    Authors: Peipei Yu, Zhenyi Wang, Hongcai Zhang, Yonghua Song

    Abstract: The large-scale integration of intermittent renewable energy resources introduces increased uncertainty and volatility to the supply side of power systems, thereby complicating system operation and control. Recently, data-driven approaches, particularly reinforcement learning (RL), have shown significant promise in addressing complex control challenges in power systems, because RL can learn from i… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  14. arXiv:2406.19043  [pdf

    eess.IV cs.AI cs.CV cs.DB

    CMRxRecon2024: A Multi-Modality, Multi-View K-Space Dataset Boosting Universal Machine Learning for Accelerated Cardiac MRI

    Authors: Zi Wang, Fanwen Wang, Chen Qin, Jun Lyu, Ouyang Cheng, Shuo Wang, Yan Li, Mengyao Yu, Haoyu Zhang, Kunyuan Guo, Zhang Shi, Qirong Li, Ziqiang Xu, Yajing Zhang, Hao Li, Sha Hua, Binghua Chen, Longyu Sun, Mengting Sun, Qin Li, Ying-Hua Chu, Wenjia Bai, Jing Qin, Xiahai Zhuang, Claudia Prieto , et al. (7 additional authors not shown)

    Abstract: Cardiac magnetic resonance imaging (MRI) has emerged as a clinically gold-standard technique for diagnosing cardiac diseases, thanks to its ability to provide diverse information with multiple modalities and anatomical views. Accelerated cardiac MRI is highly expected to achieve time-efficient and patient-friendly imaging, and then advanced image reconstruction approaches are required to recover h… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 19 pages, 3 figures, 2 tables

  15. arXiv:2406.18914  [pdf, other

    eess.SY cs.RO

    Verification and Synthesis of Compatible Control Lyapunov and Control Barrier Functions

    Authors: Hongkai Dai, Chuanrui Jiang, Hongchao Zhang, Andrew Clark

    Abstract: Safety and stability are essential properties of control systems. Control Barrier Functions (CBFs) and Control Lyapunov Functions (CLFs) have been proposed to ensure safety and stability respectively. However, previous approaches typically verify and synthesize the CBFs and CLFs separately, satisfying their respective constraints, without proving that the CBFs and CLFs are compatible with each oth… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  16. arXiv:2406.18542  [pdf, other

    cs.CV eess.SP

    Generative AI Empowered LiDAR Point Cloud Generation with Multimodal Transformer

    Authors: Mohammad Farzanullah, Han Zhang, Akram Bin Sediq, Ali Afana, Melike Erol-Kantarci

    Abstract: Integrated sensing and communications is a key enabler for the 6G wireless communication systems. The multiple sensing modalities will allow the base station to have a more accurate representation of the environment, leading to context-aware communications. Some widely equipped sensors such as cameras and RADAR sensors can provide some environmental perceptions. However, they are not enough to gen… ▽ More

    Submitted 20 May, 2024; originally announced June 2024.

    Comments: 6 pages, 4 figures, conference

  17. arXiv:2406.18102  [pdf

    eess.IV cs.CV

    A Lung Nodule Dataset with Histopathology-based Cancer Type Annotation

    Authors: Muwei Jian, Hongyu Chen, Zaiyong Zhang, Nan Yang, Haorang Zhang, Lifu Ma, Wenjing Xu, Huixiang Zhi

    Abstract: Recently, Computer-Aided Diagnosis (CAD) systems have emerged as indispensable tools in clinical diagnostic workflows, significantly alleviating the burden on radiologists. Nevertheless, despite their integration into clinical settings, CAD systems encounter limitations. Specifically, while CAD systems can achieve high performance in the detection of lung nodules, they face challenges in accuratel… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  18. arXiv:2406.18018  [pdf, other

    eess.IV

    A Cross Spatio-Temporal Pathology-based Lung Nodule Dataset

    Authors: Muwei Jian, Haoran Zhang, Mingju Shao, Hongyu Chen, Huihui Huang, Yanjie Zhong, Changlei Zhang, Bin Wang, Penghui Gao

    Abstract: Recently, intelligent analysis of lung nodules with the assistant of computer aided detection (CAD) techniques can improve the accuracy rate of lung cancer diagnosis. However, existing CAD systems and pulmonary datasets mainly focus on Computed Tomography (CT) images from one single period, while ignoring the cross spatio-temporal features associated with the progression of nodules contained in im… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  19. arXiv:2406.16150  [pdf, other

    eess.IV cs.CV

    Intensity Confusion Matters: An Intensity-Distance Guided Loss for Bronchus Segmentation

    Authors: Haifan Gong, Wenhao Huang, Huan Zhang, Yu Wang, Xiang Wan, Hong Shen, Guanbin Li, Haofeng Li

    Abstract: Automatic segmentation of the bronchial tree from CT imaging is important, as it provides structural information for disease diagnosis. Despite the merits of previous automatic bronchus segmentation methods, they have paied less attention to the issue we term as \textit{Intensity Confusion}, wherein the intensity values of certain background voxels approach those of the foreground voxels within br… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: IEEE International Conference on Multimedia & Expo (ICME) 2024

  20. arXiv:2406.16026  [pdf

    physics.med-ph cs.LG eess.IV

    CEST-KAN: Kolmogorov-Arnold Networks for CEST MRI Data Analysis

    Authors: Jiawen Wang, Pei Cai, Ziyan Wang, Huabin Zhang, Jianpan Huang

    Abstract: Purpose: This study aims to propose and investigate the feasibility of using Kolmogorov-Arnold Network (KAN) for CEST MRI data analysis (CEST-KAN). Methods: CEST MRI data were acquired from twelve healthy volunteers at 3T. Data from ten subjects were used for training, while the remaining two were reserved for testing. The performance of multi-layer perceptron (MLP) and KAN models with the same ne… ▽ More

    Submitted 25 June, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

  21. arXiv:2406.15222  [pdf

    eess.IV cs.AI cs.CV

    Rapid and Accurate Diagnosis of Acute Aortic Syndrome using Non-contrast CT: A Large-scale, Retrospective, Multi-center and AI-based Study

    Authors: Yujian Hu, Yilang Xiang, Yan-Jie Zhou, Yangyan He, Shifeng Yang, Xiaolong Du, Chunlan Den, Youyao Xu, Gaofeng Wang, Zhengyao Ding, Jingyong Huang, Wenjun Zhao, Xuejun Wu, Donglin Li, Qianqian Zhu, Zhenjiang Li, Chenyang Qiu, Ziheng Wu, Yunjun He, Chen Tian, Yihui Qiu, Zuodong Lin, Xiaolong Zhang, Yuan He, Zhenpeng Yuan , et al. (15 additional authors not shown)

    Abstract: Chest pain symptoms are highly prevalent in emergency departments (EDs), where acute aortic syndrome (AAS) is a catastrophic cardiovascular emergency with a high fatality rate, especially when timely and accurate treatment is not administered. However, current triage practices in the ED can cause up to approximately half of patients with AAS to have an initially missed diagnosis or be misdiagnosed… ▽ More

    Submitted 24 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: under peer review

  22. arXiv:2406.14850  [pdf, other

    eess.AS

    DExter: Learning and Controlling Performance Expression with Diffusion Models

    Authors: Huan Zhang, Shreyan Chowdhury, Carlos Eduardo Cancino-Chacón, Jinhua Liang, Simon Dixon, Gerhard Widmer

    Abstract: In the pursuit of developing expressive music performance models using artificial intelligence, this paper introduces DExter, a new approach leveraging diffusion probabilistic models to render Western classical piano performances. In this approach, performance parameters are represented in a continuous expression space and a diffusion model is trained to predict these continuous parameters while b… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: in submission to appsci special session

  23. arXiv:2406.11175  [pdf, other

    cs.SD eess.AS

    SMRU: Split-and-Merge Recurrent-based UNet for Acoustic Echo Cancellation and Noise Suppression

    Authors: Zhihang Sun, Andong Li, Rilin Chen, Hao Zhang, Meng Yu, Yi Zhou, Dong Yu

    Abstract: The proliferation of deep neural networks has spawned the rapid development of acoustic echo cancellation and noise suppression, and plenty of prior arts have been proposed, which yield promising performance. Nevertheless, they rarely consider the deployment generality in different processing scenarios, such as edge devices, and cloud processing. To this end, this paper proposes a general model, t… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  24. arXiv:2406.11163  [pdf, other

    eess.SP

    Explainable Bayesian Recurrent Neural Smoother to Capture Global State Evolutionary Correlations

    Authors: Shi Yan, Yan Liang, Huayu Zhang, Le Zheng, Difan Zou, Binglu Wang

    Abstract: Through integrating the evolutionary correlations across global states in the bidirectional recursion, an explainable Bayesian recurrent neural smoother (EBRNS) is proposed for offline data-assisted fixed-interval state smoothing. At first, the proposed model, containing global states in the evolutionary interval, is transformed into an equivalent model with bidirectional memory. This transformati… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  25. arXiv:2406.09082  [pdf

    eess.SY cs.AI

    Data-driven modeling and supervisory control system optimization for plug-in hybrid electric vehicles

    Authors: Hao Zhang, Nuo Lei, Boli Chen, Bingbing Li, Rulong Li, Zhi Wang

    Abstract: Learning-based intelligent energy management systems for plug-in hybrid electric vehicles (PHEVs) are crucial for achieving efficient energy utilization. However, their application faces system reliability challenges in the real world, which prevents widespread acceptance by original equipment manufacturers (OEMs). This paper begins by establishing a PHEV model based on physical and data-driven mo… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  26. arXiv:2406.07162  [pdf, other

    cs.SD cs.AI cs.CL cs.MM eess.AS

    EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark

    Authors: Ziyang Ma, Mingjie Chen, Hezhao Zhang, Zhisheng Zheng, Wenxi Chen, Xiquan Li, Jiaxin Ye, Xie Chen, Thomas Hain

    Abstract: Speech emotion recognition (SER) is an important part of human-computer interaction, receiving extensive attention from both industry and academia. However, the current research field of SER has long suffered from the following problems: 1) There are few reasonable and universal splits of the datasets, making comparing different models and methods difficult. 2) No commonly used benchmark covers nu… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted by INTERSPEECH 2024. GitHub Repository: https://github.com/emo-box/EmoBox

  27. arXiv:2406.07077  [pdf, other

    eess.SY

    Meta-Backscatter: A New ISAC Paradigm for Battery-Free Internet of Things

    Authors: Xu Liu, Hongliang Zhang, Kaigui Bian, Xi Weng, Lingyang Song

    Abstract: The meta-material sensor has been regarded as a next-generation sensing technology for the battery-free Internet of Things (IoT) due to its battery-free characteristic and improved sensing performance. The meta-material sensors function as backscatter tags that change their reflection coefficients with the conditions of sensing targets such as temperature and gas concentration, allowing transceive… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  28. arXiv:2406.05780  [pdf, ps, other

    eess.SP

    Two-Stage Resource Allocation in Reconfigurable Intelligent Surface Assisted Hybrid Networks via Multi-Player Bandits

    Authors: Jingwen Tong, Hongliang Zhang, Liqun Fu, Amir Leshem, Zhu Han

    Abstract: This paper considers a resource allocation problem where several Internet-of-Things (IoT) devices send data to a base station (BS) with or without the help of the reconfigurable intelligent surface (RIS) assisted cellular network. The objective is to maximize the sum rate of all IoT devices by finding the optimal RIS and spreading factor (SF) for each device. Since these IoT devices lack prior inf… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: This paper was published in IEEE Transcation on Communications

  29. OAM-SWIPT for IoE-Driven 6G

    Authors: Runyu Lyu, Wenchi Cheng, Bazhong Shen, Zhiyuan Ren, Hailin Zhang

    Abstract: Simultaneous wireless information and power transfer (SWIPT), which achieves both wireless energy transfer (WET) and information transfer, is an attractive technique for future Internet of Everything (IoE) in the sixth-generation (6G) mobile communications. With SWIPT, battery-less IoE devices can be powered while communicating with other devices. Line-of-sight (LOS) RF transmission and near-field… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

    Comments: 7 pages, 6 figures

    Journal ref: in IEEE Communications Magazine, vol. 60, no. 3, pp. 19-25, March 2022

  30. arXiv:2406.04762  [pdf, other

    eess.SP

    Holographic Intelligence Surface Assisted Integrated Sensing and Communication

    Authors: Zhuoyang Liu, Yuchen Zhang, Haiyang Zhang, Feng Xu, Yonina C. Eldar

    Abstract: Traditional discrete-array-based systems fail to exploit interactions between closely spaced antennas, resulting in inadequate utilization of the aperture resource. In this paper, we propose a holographic intelligence surface (HIS) assisted integrated sensing and communication (HISAC) system, wherein both the transmitter and receiver are fabricated using a continuous-aperture array. A continuous-d… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  31. arXiv:2405.20064  [pdf, other

    eess.AS cs.SD

    1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem

    Authors: Mingjie Chen, Hezhao Zhang, Yuanchao Li, Jiachen Luo, Wen Wu, Ziyang Ma, Peter Bell, Catherine Lai, Joshua Reiss, Lin Wang, Philip C. Woodland, Xie Chen, Huy Phan, Thomas Hain

    Abstract: Speech emotion recognition is a challenging classification task with natural emotional speech, especially when the distribution of emotion types is imbalanced in the training and test data. In this case, it is more difficult for a model to learn to separate minority classes, resulting in those sometimes being ignored or frequently misclassified. Previous work has utilised class weighted loss for t… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  32. arXiv:2405.18205  [pdf, other

    eess.SY eess.SP

    Joint Radar Sensing, Location, and Communication Resources Optimization in 6G Network

    Authors: Haijun Zhang, Bowen Chen, Xiangnan Liu, Chao Ren

    Abstract: The possibility of jointly optimizing location sensing and communication resources, facilitated by the existence of communication and sensing spectrum sharing, is what promotes the system performance to a higher level. However, the rapid mobility of user equipment (UE) can result in inaccurate location estimation, which can severely degrade system performance. Therefore, the precise UE location se… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 12 pages,9 figures and 4 charts. This paper has been accepted for publication in the IEEE Journal on Selected Areas in Communications

  33. arXiv:2405.16664  [pdf

    eess.SP physics.med-ph

    Deep learning improved autofocus for motion artifact reduction and its application in quantitative susceptibility mapping

    Authors: Chao Li, Jinwei Zhang, Hang Zhang, Jiahao Li, Pascal Spincemaille, Thanh D. Nguyen, Yi Wang

    Abstract: Purpose: To develop a pipeline for motion artifact correction in mGRE and quantitative susceptibility mapping (QSM). Methods: Deep learning is integrated with autofocus to improve motion artifact suppression, which is applied QSM of patients with Parkinson's disease (PD). The estimation of affine motion parameters in the autofocus method depends on signal-to-noise ratio and lacks accuracy when dat… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  34. arXiv:2405.16184  [pdf, other

    eess.SY cs.AI cs.LG

    Safe Deep Model-Based Reinforcement Learning with Lyapunov Functions

    Authors: Harry Zhang

    Abstract: Model-based Reinforcement Learning (MBRL) has shown many desirable properties for intelligent control tasks. However, satisfying safety and stability constraints during training and rollout remains an open question. We propose a new Model-based RL framework to enable efficient policy learning with unknown dynamics based on learning model predictive control (LMPC) framework with mathematically prov… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  35. arXiv:2405.15153  [pdf, other

    eess.SP

    Optimal Reference Nodes Deployment for Positioning Seafloor Anchor Nodes

    Authors: Wei Huang, Pengfei Wu, Tianhe Xu, Hao Zhang, Kaitao Meng

    Abstract: Seafloor anchor nodes, which form a geodetic network, are designed to provide surface and underwater users with positioning, navigation and timing (PNT) services. Due to the non-uniform distribution of underwater sound speed, accurate positioning of underwater anchor nodes is a challenge work. Traditional anchor node positioning typically uses cross or circular shapes, however, how to optimize the… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  36. arXiv:2405.14210  [pdf, other

    cs.CV eess.IV

    Eidos: Efficient, Imperceptible Adversarial 3D Point Clouds

    Authors: Hanwei Zhang, Luo Cheng, Qisong He, Wei Huang, Renjue Li, Ronan Sicre, Xiaowei Huang, Holger Hermanns, Lijun Zhang

    Abstract: Classification of 3D point clouds is a challenging machine learning (ML) task with important real-world applications in a spectrum from autonomous driving and robot-assisted surgery to earth observation from low orbit. As with other ML tasks, classification models are notoriously brittle in the presence of adversarial attacks. These are rooted in imperceptible changes to inputs with the effect tha… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: Preprint

  37. arXiv:2405.13661  [pdf, ps, other

    cs.SD eess.AS

    Timbre Perception, Representation, and its Neuroscientific Exploration: A Comprehensive Review

    Authors: Hong Zhang, Jie Lin, Shengxuan Chen

    Abstract: Timbre, the sound's unique "color", is fundamental to how we perceive and appreciate music. This review explores the multifaceted world of timbre perception and representation. It begins by tracing the word's origin, offering an intuitive grasp of the concept. Building upon this foundation, the article delves into the complexities of defining and measuring timbre. It then explores the concept and… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  38. arXiv:2405.13476  [pdf, other

    eess.SY

    Restricting Voltage Deviation of DC Microgrids with Critical and Ordinary Nodes

    Authors: Handong Bai, Peng Li, Hongwei Zhang

    Abstract: Restricting bus voltage deviation is crucial for normal operation of multi-bus DC microgrids, yet it has received insufficient attention due to the conflict between two main control objectives in DC microgrids, i.e., voltage regulation and current sharing. By revealing a necessary and sufficient condition for achieving these two objectives, this paper proposes a compromised distributed control alg… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  39. arXiv:2405.13243  [pdf

    eess.SY

    A Novel Approach to Evaluating Battery Charger Controller Design with Nonlinear PID Controller in an Extendable CHIL Setup

    Authors: Shervin Salehi Rad, Micheal Muhlbaier, Oleg Fishman, Javad Chevinly, Elias Nadi, Hua Zhang, Fei Lu

    Abstract: The design and development of power electronics converters pose a multitude of challenges. The evaluation of power electronics converters, particularly when operating at high power levels, presents a significant task, offering designers a deeper understanding of the functionality. Several methodologies have been devised to conduct hardware-in-the-loop (HIL) tests, which are classified into two mai… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 2024 IEEE Transportation Electrification Conference & Expo (ITEC 2024) (5 pages, 11 figures)

  40. arXiv:2405.12359  [pdf

    eess.SY

    Design and Analysis of a Detuned Series-Series IPT System with Solenoid Coil Structure for Drone Charging Applications

    Authors: Elias Nadi, Hua Zhang

    Abstract: This paper proposes a new coil configuration that uses solenoid ferrites on the receiver side instead of planar ferrite coils that are employed in existing wireless charging systems.The solenoid ferrites are used in the drone legs to help mount the receiver on a moving truck while the two parts of the transmitter are placed on the truck.To validate this idea a detuned transmitter series-series (SS… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: ITEC 2024 (IEEE Transportation Electrification Conference & Expo)

  41. arXiv:2405.11131  [pdf

    eess.SY

    Gallium Nitride (GaN) based High-Power Multilevel H-Bridge Inverter for Wireless Power Transfer of Electric Vehicles

    Authors: Javad Chevinly, Shervin Salehi Rad, Elias Nadi, Bogdan Proca, John Wolgemuth, Anthony Calabro, Hua Zhang, Fei Lu

    Abstract: This paper presents a design and implementation of a high-power Gallium Nitride (GaN)-based multilevel Hbridge inverter to excite wireless charging coils for the wireless power transfer of electric vehicles (EVs). Compared to the traditional conductive charging, wireless charging technology offers a safer and more convenient way to charge EVs. Due to the increasing demand of fast charging, high-po… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

  42. arXiv:2405.10723  [pdf, other

    eess.IV cs.CV

    Eddeep: Fast eddy-current distortion correction for diffusion MRI with deep learning

    Authors: Antoine Legouhy, Ross Callaghan, Whitney Stee, Philippe Peigneux, Hojjat Azadbakht, Hui Zhang

    Abstract: Modern diffusion MRI sequences commonly acquire a large number of volumes with diffusion sensitization gradients of differing strengths or directions. Such sequences rely on echo-planar imaging (EPI) to achieve reasonable scan duration. However, EPI is vulnerable to off-resonance effects, leading to tissue susceptibility and eddy-current induced distortions. The latter is particularly problematic… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: submitted to MICCAI 2024

  43. arXiv:2405.10691  [pdf, other

    eess.IV cs.CV

    LoCI-DiffCom: Longitudinal Consistency-Informed Diffusion Model for 3D Infant Brain Image Completion

    Authors: Zihao Zhu, Tianli Tao, Yitian Tao, Haowen Deng, Xinyi Cai, Gaofeng Wu, Kaidong Wang, Haifeng Tang, Lixuan Zhu, Zhuoyang Gu, Jiawei Huang, Dinggang Shen, Han Zhang

    Abstract: The infant brain undergoes rapid development in the first few years after birth.Compared to cross-sectional studies, longitudinal studies can depict the trajectories of infants brain development with higher accuracy, statistical power and flexibility.However, the collection of infant longitudinal magnetic resonance (MR) data suffers a notorious dropout problem, resulting in incomplete datasets wit… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

  44. arXiv:2405.09569  [pdf, other

    eess.SP cs.LG

    GaitMotion: A Multitask Dataset for Pathological Gait Forecasting

    Authors: Wenwen Zhang, Hao Zhang, Zenan Jiang, Jing Wang, Amir Servati, Peyman Servati

    Abstract: Gait benchmark empowers uncounted encouraging research fields such as gait recognition, humanoid locomotion, etc. Despite the growing focus on gait analysis, the research community is hindered by the limitations of the currently available databases, which mostly consist of videos or images with limited labeling. In this paper, we introduce GaitMotion, a multitask dataset leveraging wearable sensor… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  45. arXiv:2405.08838  [pdf, other

    cs.SD cs.AI eess.AS

    PolyGlotFake: A Novel Multilingual and Multimodal DeepFake Dataset

    Authors: Yang Hou, Haitao Fu, Chuankai Chen, Zida Li, Haoyu Zhang, Jianjun Zhao

    Abstract: With the rapid advancement of generative AI, multimodal deepfakes, which manipulate both audio and visual modalities, have drawn increasing public concern. Currently, deepfake detection has emerged as a crucial strategy in countering these growing threats. However, as a key factor in training and validating deepfake detectors, most existing deepfake datasets primarily focus on the visual modal, an… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: 13 page, 4 figures

    MSC Class: 68T45 ACM Class: I.4.9

  46. arXiv:2405.07685  [pdf, other

    eess.SY

    Comprehensive Analysis of Access Control Models in Edge Computing: Challenges, Solutions, and Future Directions

    Authors: Tao Xue, Ying Zhang, Yanbin Wang, Wenbo Wang, Shuailou Li, Haibin Zhang

    Abstract: Many contemporary applications, including smart homes and autonomous vehicles, rely on the Internet of Things technology. While cloud computing provides a multitude of valuable services for these applications, it generally imposes constraints on latency-sensitive applications due to the significant propagation delays. As a complementary technique to cloud computing, edge computing situates computi… ▽ More

    Submitted 22 May, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

  47. arXiv:2405.06747  [pdf, other

    cs.SD cs.LG eess.AS

    Music Emotion Prediction Using Recurrent Neural Networks

    Authors: Xinyu Chang, Xiangyu Zhang, Haoruo Zhang, Yulu Ran

    Abstract: This study explores the application of recurrent neural networks to recognize emotions conveyed in music, aiming to enhance music recommendation systems and support therapeutic interventions by tailoring music to fit listeners' emotional states. We utilize Russell's Emotion Quadrant to categorize music into four distinct emotional regions and develop models capable of accurately predicting these c… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: 15 pages, 13 figures

  48. arXiv:2405.05133  [pdf, other

    cs.CV eess.IV

    Identifying every building's function in large-scale urban areas with multi-modality remote-sensing data

    Authors: Zhuohong Li, Wei He, Jiepan Li, Hongyan Zhang

    Abstract: Buildings, as fundamental man-made structures in urban environments, serve as crucial indicators for understanding various city function zones. Rapid urbanization has raised an urgent need for efficiently surveying building footprints and functions. In this study, we proposed a semi-supervised framework to identify every building's function in large-scale urban areas with multi-modality remote-sen… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: 5 pages, 7 figures, accepted by IGARSS 2024

  49. arXiv:2405.01515  [pdf, other

    cs.IT eess.SP

    Model-based Deep Learning for Rate Split Multiple Access in Vehicular Communications

    Authors: Hanwen Zhang, Mingzhe Chen, Alireza Vahid, Haijian Sun

    Abstract: Rate split multiple access (RSMA) has been proven as an effective communication scheme for 5G and beyond, especially in vehicular scenarios. However, RSMA requires complicated iterative algorithms for proper resource allocation, which cannot fulfill the stringent latency requirement in resource constrained vehicles. Although data driven approaches can alleviate this issue, they suffer from poor ge… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: submitted to IEEE conference

  50. arXiv:2405.01115  [pdf

    cs.RO eess.SY

    A New Self-Alignment Method without Solving Wahba Problem for SINS in Autonomous Vehicles

    Authors: Hongliang Zhang, Yilan Zhou, Lei Wang, Tengchao Huang

    Abstract: Initial alignment is one of the key technologies in strapdown inertial navigation system (SINS) to provide initial state information for vehicle attitude and navigation. For some situations, such as the attitude heading reference system, the position is not necessarily required or even available, then the self-alignment that does not rely on any external aid becomes very necessary. This study pres… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.