Skip to main content

Showing 1–50 of 160 results for author: Zhong, X

  1. arXiv:2407.04969  [pdf, other

    cs.CL

    EVA-Score: Evaluation of Long-form Summarization on Informativeness through Extraction and Validation

    Authors: Yuchen Fan, Xin Zhong, Chengsi Wang, Gaoche Wu, Bowen Zhou

    Abstract: Summarization is a fundamental task in natural language processing (NLP) and since large language models (LLMs), such as GPT-4 and Claude, come out, increasing attention has been paid to long-form summarization whose input sequences are much longer, indicating more information contained. The current evaluation metrics either use similarity-based metrics like ROUGE and BERTScore which rely on sim… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: 16 pages, 3 figures, submitted to EMNLP

  2. arXiv:2407.04948  [pdf, other

    cs.CV

    Zero-shot Object Counting with Good Exemplars

    Authors: Huilin Zhu, Jingling Yuan, Zhengwei Yang, Yu Guo, Zheng Wang, Xian Zhong, Shengfeng He

    Abstract: Zero-shot object counting (ZOC) aims to enumerate objects in images using only the names of object classes during testing, without the need for manual annotations. However, a critical challenge in current ZOC methods lies in their inability to identify high-quality exemplars effectively. This deficiency hampers scalability across diverse classes and undermines the development of strong visual asso… ▽ More

    Submitted 9 July, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

  3. arXiv:2406.10175  [pdf, other

    cs.CV

    Enhancing Incomplete Multi-modal Brain Tumor Segmentation with Intra-modal Asymmetry and Inter-modal Dependency

    Authors: Weide Liu, Jingwen Hou, Xiaoyang Zhong, Huijing Zhan, Jun Cheng, Yuming Fang, Guanghui Yue

    Abstract: Deep learning-based brain tumor segmentation (BTS) models for multi-modal MRI images have seen significant advancements in recent years. However, a common problem in practice is the unavailability of some modalities due to varying scanning protocols and patient conditions, making segmentation from incomplete MRI modalities a challenging issue. Previous methods have attempted to address this by fus… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  4. arXiv:2406.08098  [pdf, other

    cs.SE

    Scalable Defect Detection via Traversal on Code Graph

    Authors: Zhengyao Liu, Xitong Zhong, Xingjing Deng, Shuo Hong, Xiang Gao, Hailong Sun

    Abstract: Detecting defects and vulnerabilities in the early stage has long been a challenge in software engineering. Static analysis, a technique that inspects code without execution, has emerged as a key strategy to address this challenge. Among recent advancements, the use of graph-based representations, particularly Code Property Graph (CPG), has gained traction due to its comprehensive depiction of cod… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  5. arXiv:2406.07949  [pdf, other

    cs.CV

    Multi-Teacher Multi-Objective Meta-Learning for Zero-Shot Hyperspectral Band Selection

    Authors: Jie Feng, Xiaojian Zhong, Di Li, Weisheng Dong, Ronghua Shang, Licheng Jiao

    Abstract: Band selection plays a crucial role in hyperspectral image classification by removing redundant and noisy bands and retaining discriminative ones. However, most existing deep learning-based methods are aimed at dealing with a specific band selection dataset, and need to retrain parameters for new datasets, which significantly limits their generalizability.To address this issue, a novel multi-teach… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  6. arXiv:2406.05704  [pdf, other

    cs.CV

    Hierarchical Features Matter: A Deep Exploration of GAN Priors for Improved Dataset Distillation

    Authors: Xinhao Zhong, Hao Fang, Bin Chen, Xulin Gu, Tao Dai, Meikang Qiu, Shu-Tao Xia

    Abstract: Dataset distillation is an emerging dataset reduction method, which condenses large-scale datasets while maintaining task accuracy. Current methods have integrated parameterization techniques to boost synthetic dataset performance by shifting the optimization space from pixel to another informative feature domain. However, they limit themselves to a fixed optimization space for distillation, negle… ▽ More

    Submitted 12 June, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

  7. arXiv:2405.05925  [pdf, other

    cs.LG cs.AI physics.ao-ph

    FuXi-ENS: A machine learning model for medium-range ensemble weather forecasting

    Authors: Xiaohui Zhong, Lei Chen, Hao Li, Jun Liu, Xu Fan, Jie Feng, Kan Dai, Jing-Jia Luo, Jie Wu, Yuan Qi, Bo Lu

    Abstract: Ensemble forecasting is crucial for improving weather predictions, especially for forecasts of extreme events. Constructing an ensemble prediction system (EPS) based on conventional NWP models is highly computationally expensive. ML models have emerged as valuable tools for deterministic weather forecasts, providing forecasts with significantly reduced computational requirements and even surpassin… ▽ More

    Submitted 5 July, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

  8. arXiv:2405.05075  [pdf, other

    cs.LG

    Towards Efficient Training and Evaluation of Robust Models against $l_0$ Bounded Adversarial Perturbations

    Authors: Xuyang Zhong, Yixiao Huang, Chen Liu

    Abstract: This work studies sparse adversarial perturbations bounded by $l_0$ norm. We propose a white-box PGD-like attack method named sparse-PGD to effectively and efficiently generate such perturbations. Furthermore, we combine sparse-PGD with a black-box attack to comprehensively and more reliably evaluate the models' robustness against $l_0$ bounded adversarial perturbations. Moreover, the efficiency o… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  9. arXiv:2405.03388  [pdf, other

    cs.CV cs.RO

    3D LiDAR Mapping in Dynamic Environments Using a 4D Implicit Neural Representation

    Authors: Xingguang Zhong, Yue Pan, Cyrill Stachniss, Jens Behley

    Abstract: Building accurate maps is a key building block to enable reliable localization, planning, and navigation of autonomous vehicles. We propose a novel approach for building accurate maps of dynamic environments utilizing a sequence of LiDAR scans. To this end, we propose encoding the 4D scene into a novel spatio-temporal implicit neural map representation by fitting a time-dependent truncated signed… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 10 pages, CVPR 2024

  10. arXiv:2404.13134  [pdf, other

    cs.MM cs.CV cs.LG

    Deep Learning-based Text-in-Image Watermarking

    Authors: Bishwa Karki, Chun-Hua Tsai, Pei-Chi Huang, Xin Zhong

    Abstract: In this work, we introduce a novel deep learning-based approach to text-in-image watermarking, a method that embeds and extracts textual information within images to enhance data security and integrity. Leveraging the capabilities of deep learning, specifically through the use of Transformer-based architectures for text processing and Vision Transformers for image feature extraction, our method se… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  11. arXiv:2404.08522  [pdf, other

    cs.LG physics.ao-ph

    Fuxi-DA: A Generalized Deep Learning Data Assimilation Framework for Assimilating Satellite Observations

    Authors: Xiaoze Xu, Xiuyu Sun, Wei Han, Xiaohui Zhong, Lei Chen, Hao Li

    Abstract: Data assimilation (DA), as an indispensable component within contemporary Numerical Weather Prediction (NWP) systems, plays a crucial role in generating the analysis that significantly impacts forecast performance. Nevertheless, the development of an efficient DA system poses significant challenges, particularly in establishing intricate relationships between the background data and the vast amoun… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

  12. arXiv:2404.05832  [pdf, other

    cs.HC eess.SY

    Human-Machine Interaction in Automated Vehicles: Reducing Voluntary Driver Intervention

    Authors: Xinzhi Zhong, Yang Zhou, Varshini Kamaraj, Zhenhao Zhou, Wissam Kontar, Dan Negrut, John D. Lee, Soyoung Ahn

    Abstract: This paper develops a novel car-following control method to reduce voluntary driver interventions and improve traffic stability in Automated Vehicles (AVs). Through a combination of experimental and empirical analysis, we show how voluntary driver interventions can instigate substantial traffic disturbances that are amplified along the traffic upstream. Motivated by these findings, we present a fr… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  13. arXiv:2403.14690  [pdf

    cs.CY cs.AI cs.CL cs.LG

    Incorporating Graph Attention Mechanism into Geometric Problem Solving Based on Deep Reinforcement Learning

    Authors: Xiuqin Zhong, Shengyuan Yan, Gongqi Lin, Hongguang Fu, Liang Xu, Siwen Jiang, Lei Huang, Wei Fang

    Abstract: In the context of online education, designing an automatic solver for geometric problems has been considered a crucial step towards general math Artificial Intelligence (AI), empowered by natural language understanding and traditional logical inference. In most instances, problems are addressed by adding auxiliary components such as lines or points. However, adding auxiliary components automatical… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  14. arXiv:2403.11099  [pdf, other

    cs.DB

    Wait to be Faster: a Smart Pooling Framework for Dynamic Ridesharing

    Authors: Xiaoyao Zhong, Jiabao Jin, Peng Cheng, Wangze Ni, Libin Zheng, Lei Chen, Xuemin Lin

    Abstract: Ridesharing services, such as Uber or Didi, have attracted considerable attention in recent years due to their positive impact on environmental protection and the economy. Existing studies require quick responses to orders, which lack the flexibility to accommodate longer wait times for better grouping opportunities. In this paper, we address a NP-hard ridesharing problem, called Minimal Extra Tim… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: IEEE ICDE 2024

  15. arXiv:2402.04557  [pdf

    physics.chem-ph cs.LG

    An Artificial Intelligence (AI) workflow for catalyst design and optimization

    Authors: Nung Siong Lai, Yi Shen Tew, Xialin Zhong, Jun Yin, Jiali Li, Binhang Yan, Xiaonan Wang

    Abstract: In the pursuit of novel catalyst development to address pressing environmental concerns and energy demand, conventional design and optimization methods often fall short due to the complexity and vastness of the catalyst parameter space. The advent of Machine Learning (ML) has ushered in a new era in the field of catalyst optimization, offering potential solutions to the shortcomings of traditional… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: 31 pages, 7 figures

    Journal ref: Ind. Eng. Chem. Res. 2023, 62, 43, 17835-17848

  16. arXiv:2401.16669  [pdf

    cs.LG cs.AI physics.ao-ph physics.geo-ph

    Improving Global Weather and Ocean Wave Forecast with Large Artificial Intelligence Models

    Authors: Fenghua Ling, Lin Ouyang, Boufeniza Redouane Larbi, Jing-Jia Luo, Tao Han, Xiaohui Zhong, Lei Bai

    Abstract: The rapid advancement of artificial intelligence technologies, particularly in recent years, has led to the emergence of several large parameter artificial intelligence weather forecast models. These models represent a significant breakthrough, overcoming the limitations of traditional numerical weather prediction models and indicating the emergence of profound potential tools for atmosphere-ocean… ▽ More

    Submitted 18 April, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

  17. arXiv:2401.14620  [pdf, other

    cs.AR

    A RISC-V SOC for Terahertz IoT Devices: Implementation and design challenges

    Authors: Xinchao Zhong, Sean Longyu Ma, Hong-fu Chou

    Abstract: Terahertz (THz) communication is considered a viable approach to augmenting the communication capacity of prospective Internet-of-Things (IoT) resulting in enhanced spectral efficiency. This study first provides an outline of the design challenges encountered in developing THz transceivers. This paper introduces advanced approaches and a unique methodology known as Modified Pulse-width Modulation… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: 18 pages, 17 figures, journal

  18. arXiv:2401.12151  [pdf, other

    cs.IT cs.DC math.OC

    Uncoded Storage Coded Transmission Elastic Computing with Straggler Tolerance in Heterogeneous Systems

    Authors: Xi Zhong, Joerg Kliewer, Mingyue Ji

    Abstract: In 2018, Yang et al. introduced a novel and effective approach, using maximum distance separable (MDS) codes, to mitigate the impact of elasticity in cloud computing systems. This approach is referred to as coded elastic computing. Some limitations of this approach include that it assumes all virtual machines have the same computing speeds and storage capacities, and it cannot tolerate stragglers… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: 6 pages, 1 figure, accepted in ICC 2024

  19. PIN-SLAM: LiDAR SLAM Using a Point-Based Implicit Neural Representation for Achieving Global Map Consistency

    Authors: Yue Pan, Xingguang Zhong, Louis Wiesmann, Thorbjörn Posewsky, Jens Behley, Cyrill Stachniss

    Abstract: Accurate and robust localization and mapping are essential components for most autonomous robots. In this paper, we propose a SLAM system for building globally consistent maps, called PIN-SLAM, that is based on an elastic and compact point-based implicit neural map representation. Taking range measurements as input, our approach alternates between incremental learning of the local implicit signed… ▽ More

    Submitted 2 July, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

    Comments: 20 pages

  20. arXiv:2312.09926  [pdf, other

    physics.ao-ph cs.AI cs.LG

    FuXi-S2S: A machine learning model that outperforms conventional global subseasonal forecast models

    Authors: Lei Chen, Xiaohui Zhong, Hao Li, Jie Wu, Bo Lu, Deliang Chen, Shangping Xie, Qingchen Chao, Chensen Lin, Zixin Hu, Yuan Qi

    Abstract: Skillful subseasonal forecasts are crucial for various sectors of society but pose a grand scientific challenge. Recently, machine learning based weather forecasting models outperform the most successful numerical weather predictions generated by the European Centre for Medium-Range Weather Forecasts (ECMWF), but have not yet surpassed conventional models at subseasonal timescales. This paper intr… ▽ More

    Submitted 5 July, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

  21. arXiv:2312.01713  [pdf, other

    cs.CV

    Disentangled Interaction Representation for One-Stage Human-Object Interaction Detection

    Authors: Xubin Zhong, Changxing Ding, Yupeng Hu, Dacheng Tao

    Abstract: Human-Object Interaction (HOI) detection is a core task for human-centric image understanding. Recent one-stage methods adopt a transformer decoder to collect image-wide cues that are useful for interaction prediction; however, the interaction representations obtained using this method are entangled and lack interpretability. In contrast, traditional two-stage methods benefit significantly from th… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

  22. arXiv:2312.01554  [pdf, other

    cs.SD cs.RO eess.AS

    Building Ears for Robots: Machine Hearing in the Age of Autonomy

    Authors: Xuan Zhong

    Abstract: This study explores the significance of robot hearing systems, emphasizing their importance for robots operating in diverse and uncertain environments. It introduces the hardware design principles using robotaxis as an example, where exterior microphone arrays are employed to detect sound events such as sirens. The challenges, goals, and test methods are discussed, focusing on achieving a suitable… ▽ More

    Submitted 5 December, 2023; v1 submitted 3 December, 2023; originally announced December 2023.

    Comments: 11 pages, 6 figures. The materials covered in this article were presented and discussed at the Hearing Seminar at Stanford University organized by Malcolm Slaney in October, 2023

  23. arXiv:2311.16828  [pdf, other

    cs.CV

    SARA: Controllable Makeup Transfer with Spatial Alignment and Region-Adaptive Normalization

    Authors: Xiaojing Zhong, Xinyi Huang, Zhonghua Wu, Guosheng Lin, Qingyao Wu

    Abstract: Makeup transfer is a process of transferring the makeup style from a reference image to the source images, while preserving the source images' identities. This technique is highly desirable and finds many applications. However, existing methods lack fine-level control of the makeup style, making it challenging to achieve high-quality results when dealing with large spatial misalignments. To addres… ▽ More

    Submitted 21 May, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

  24. arXiv:2311.16818  [pdf, other

    cs.CV

    DI-Net : Decomposed Implicit Garment Transfer Network for Digital Clothed 3D Human

    Authors: Xiaojing Zhong, Yukun Su, Zhonghua Wu, Guosheng Lin, Qingyao Wu

    Abstract: 3D virtual try-on enjoys many potential applications and hence has attracted wide attention. However, it remains a challenging task that has not been adequately solved. Existing 2D virtual try-on methods cannot be directly extended to 3D since they lack the ability to perceive the depth of each pixel. Besides, 3D virtual try-on approaches are mostly built on the fixed topological structure and wit… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

  25. arXiv:2311.03652  [pdf, other

    physics.ao-ph cs.AI

    Machine Learning Parameterization of the Multi-scale Kain-Fritsch (MSKF) Convection Scheme

    Authors: Xiaohui Zhong, Xing Yu, Hao Li

    Abstract: Warm-sector heavy rainfall often occurs along the coast of South China, and it is usually localized and long-lasting, making it challenging to predict. High-resolution numerical weather prediction (NWP) models are increasingly used to better resolve topographic features and forecast such high-impact weather events. However, when the grid spacing becomes comparable to the length scales of convectio… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

  26. arXiv:2311.00979  [pdf

    cs.CV

    Overhead Line Defect Recognition Based on Unsupervised Semantic Segmentation

    Authors: Weixi Wang, Xichen Zhong, Xin Li, Sizhe Li, Xun Ma

    Abstract: Overhead line inspection greatly benefits from defect recognition using visible light imagery. Addressing the limitations of existing feature extraction techniques and the heavy data dependency of deep learning approaches, this paper introduces a novel defect recognition framework. This is built on the Faster RCNN network and complemented by unsupervised semantic segmentation. The approach involve… ▽ More

    Submitted 6 December, 2023; v1 submitted 1 November, 2023; originally announced November 2023.

  27. arXiv:2310.19822  [pdf, other

    cs.LG physics.ao-ph stat.AP

    FuXi-Extreme: Improving extreme rainfall and wind forecasts with diffusion model

    Authors: Xiaohui Zhong, Lei Chen, Jun Liu, Chensen Lin, Yuan Qi, Hao Li

    Abstract: Significant advancements in the development of machine learning (ML) models for weather forecasting have produced remarkable results. State-of-the-art ML-based weather forecast models, such as FuXi, have demonstrated superior statistical forecast performance in comparison to the high-resolution forecasts (HRES) of the European Centre for Medium-Range Weather Forecasts (ECMWF). However, ML models f… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

  28. arXiv:2310.19378  [pdf, other

    cs.CV cs.AI

    Few-shot Hybrid Domain Adaptation of Image Generators

    Authors: Hengjia Li, Yang Liu, Linxuan Xia, Yuqi Lin, Tu Zheng, Zheng Yang, Wenxiao Wang, Xiaohui Zhong, Xiaobo Ren, Xiaofei He

    Abstract: Can a pre-trained generator be adapted to the hybrid of multiple target domains and generate images with integrated attributes of them? In this work, we introduce a new task -- Few-shot Hybrid Domain Adaptation (HDA). Given a source generator and several target domains, HDA aims to acquire an adapted generator that preserves the integrated attributes of all target domains, without overriding the s… ▽ More

    Submitted 6 December, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

  29. arXiv:2310.09492  [pdf, other

    cs.CV

    Perception Reinforcement Using Auxiliary Learning Feature Fusion: A Modified Yolov8 for Head Detection

    Authors: Jiezhou Chen, Guankun Wang, Weixiang Liu, Xiaopin Zhong, Yibin Tian, ZongZe Wu

    Abstract: Head detection provides distribution information of pedestrian, which is crucial for scene statistical analysis, traffic management, and risk assessment and early warning. However, scene complexity and large-scale variation in the real world make accurate detection more difficult. Therefore, we present a modified Yolov8 which improves head detection performance through reinforcing target perceptio… ▽ More

    Submitted 14 October, 2023; originally announced October 2023.

  30. arXiv:2310.06448  [pdf, other

    cs.LG cs.DC

    Asynchronous Federated Learning with Incentive Mechanism Based on Contract Theory

    Authors: Danni Yang, Yun Ji, Zhoubin Kou, Xiaoxiong Zhong, Sheng Zhang

    Abstract: To address the challenges posed by the heterogeneity inherent in federated learning (FL) and to attract high-quality clients, various incentive mechanisms have been employed. However, existing incentive mechanisms are typically utilized in conventional synchronous aggregation, resulting in significant straggler issues. In this study, we propose a novel asynchronous FL framework that integrates an… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

  31. arXiv:2310.05395  [pdf, other

    cs.MM cs.LG

    Robust Image Watermarking based on Cross-Attention and Invariant Domain Learning

    Authors: Agnibh Dasgupta, Xin Zhong

    Abstract: Image watermarking involves embedding and extracting watermarks within a cover image, with deep learning approaches emerging to bolster generalization and robustness. Predominantly, current methods employ convolution and concatenation for watermark embedding, while also integrating conceivable augmentation in the training process. This paper explores a robust image watermarking methodology by harn… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

  32. arXiv:2310.01208  [pdf, other

    cs.CL

    Label Supervised LLaMA Finetuning

    Authors: Zongxi Li, Xianming Li, Yuzhang Liu, Haoran Xie, Jing Li, Fu-lee Wang, Qing Li, Xiaoqin Zhong

    Abstract: The recent success of Large Language Models (LLMs) has gained significant attention in both academia and industry. Substantial efforts have been made to enhance the zero- and few-shot generalization capabilities of open-source LLMs through finetuning. Currently, the prevailing approach is instruction-tuning, which trains LLMs to complete real-world tasks by generating responses guided by natural l… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

  33. arXiv:2310.01024  [pdf, other

    cs.IT cs.NI eess.SP

    Joint Source-Channel Coding System for 6G Communication: Design, Prototype and Future Directions

    Authors: Xinchao Zhong, Sean Longyu Ma, Hong-fu Chou, Arsham Mostaani, Thang X. Vu, Symeon Chatzinotas

    Abstract: The goal of semantic communication is to surpass optimal Shannon's criterion regarding a notable problem for future communication which lies in the integration of collaborative efforts between the intelligence of the transmission source and the joint design of source coding and channel coding. The convergence of scholarly investigation and applicable products in the field of semantic communication… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: 14 pages, 9 figures, Journal

  34. arXiv:2309.08159  [pdf, other

    cs.CV cs.IR cs.LG

    AdSEE: Investigating the Impact of Image Style Editing on Advertisement Attractiveness

    Authors: Liyao Jiang, Chenglin Li, Haolan Chen, Xiaodong Gao, Xinwang Zhong, Yang Qiu, Shani Ye, Di Niu

    Abstract: Online advertisements are important elements in e-commerce sites, social media platforms, and search engines. With the increasing popularity of mobile browsing, many online ads are displayed with visual information in the form of a cover image in addition to text descriptions to grab the attention of users. Various recent studies have focused on predicting the click rates of online advertisements… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: Accepted to KDD 2023 Applied Data Science Track

  35. arXiv:2309.04195  [pdf, other

    cs.LG cs.AI

    Towards Mitigating Architecture Overfitting in Dataset Distillation

    Authors: Xuyang Zhong, Chen Liu

    Abstract: Dataset distillation methods have demonstrated remarkable performance for neural networks trained with very limited training data. However, a significant challenge arises in the form of architecture overfitting: the distilled training data synthesized by a specific network architecture (i.e., training network) generates poor performance when trained by other network architectures (i.e., test netwo… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

  36. arXiv:2309.03360  [pdf, other

    cs.CV cs.LG

    ViewMix: Augmentation for Robust Representation in Self-Supervised Learning

    Authors: Arjon Das, Xin Zhong

    Abstract: Joint Embedding Architecture-based self-supervised learning methods have attributed the composition of data augmentations as a crucial factor for their strong representation learning capabilities. While regional dropout strategies have proven to guide models to focus on lesser indicative parts of the objects in supervised methods, it hasn't been adopted by self-supervised methods for generating po… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

  37. arXiv:2308.16870  [pdf, other

    cs.RO cs.AI eess.SY

    Learning Driver Models for Automated Vehicles via Knowledge Sharing and Personalization

    Authors: Wissam Kontar, Xinzhi Zhong, Soyoung Ahn

    Abstract: This paper describes a framework for learning Automated Vehicles (AVs) driver models via knowledge sharing between vehicles and personalization. The innate variability in the transportation system makes it exceptionally challenging to expose AVs to all possible driving scenarios during empirical experimentation or testing. Consequently, AVs could be blind to certain encounters that are deemed detr… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: 10 pages, 8 figures

  38. arXiv:2308.16552  [pdf, other

    cs.CV

    Prompt-enhanced Hierarchical Transformer Elevating Cardiopulmonary Resuscitation Instruction via Temporal Action Segmentation

    Authors: Yang Liu, Xiaoyun Zhong, Shiyao Zhai, Zhicheng Du, Zhenyuan Gao, Qiming Huang, Canyang Zhang, Bin Jiang, Vijay Kumar Pandey, Sanyang Han, Runming Wang, Yuxing Han, Peiwu Qin

    Abstract: The vast majority of people who suffer unexpected cardiac arrest are performed cardiopulmonary resuscitation (CPR) by passersby in a desperate attempt to restore life, but endeavors turn out to be fruitless on account of disqualification. Fortunately, many pieces of research manifest that disciplined training will help to elevate the success rate of resuscitation, which constantly desires a seamle… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: Transformer for Cardiopulmonary Resuscitation

  39. arXiv:2308.15344  [pdf, other

    cs.LG cs.CR cs.CV

    Imperceptible Adversarial Attack on Deep Neural Networks from Image Boundary

    Authors: Fahad Alrasheedi, Xin Zhong

    Abstract: Although Deep Neural Networks (DNNs), such as the convolutional neural networks (CNN) and Vision Transformers (ViTs), have been successfully applied in the field of computer vision, they are demonstrated to be vulnerable to well-sought Adversarial Examples (AEs) that can easily fool the DNNs. The research in AEs has been active, and many adversarial attacks and explanations have been proposed sinc… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

  40. arXiv:2308.09259  [pdf, other

    cs.LG

    FRGNN: Mitigating the Impact of Distribution Shift on Graph Neural Networks via Test-Time Feature Reconstruction

    Authors: Rui Ding, Jielong Yang, Feng Ji, Xionghu Zhong, Linbo Xie

    Abstract: Due to inappropriate sample selection and limited training data, a distribution shift often exists between the training and test sets. This shift can adversely affect the test performance of Graph Neural Networks (GNNs). Existing approaches mitigate this issue by either enhancing the robustness of GNNs to distribution shift or reducing the shift itself. However, both approaches necessitate retrain… ▽ More

    Submitted 13 October, 2023; v1 submitted 17 August, 2023; originally announced August 2023.

  41. DAOT: Domain-Agnostically Aligned Optimal Transport for Domain-Adaptive Crowd Counting

    Authors: Huilin Zhu, Jingling Yuan, Xian Zhong, Zhengwei Yang, Zheng Wang, Shengfeng He

    Abstract: Domain adaptation is commonly employed in crowd counting to bridge the domain gaps between different datasets. However, existing domain adaptation methods tend to focus on inter-dataset differences while overlooking the intra-differences within the same dataset, leading to additional learning ambiguities. These domain-agnostic factors, e.g., density, surveillance perspective, and scale, can cause… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: 11 pages, 12 figures, 5 tables

    ACM Class: I.4.m

  42. arXiv:2308.04603  [pdf, other

    cs.MM cs.CR cs.LG

    A Brief Yet In-Depth Survey of Deep Learning-Based Image Watermarking

    Authors: Xin Zhong, Arjon Das, Fahad Alrasheedi, Abdullah Tanvir

    Abstract: This paper presents a comprehensive survey on deep learning-based image watermarking, a technique that entails the invisible embedding and extraction of watermarks within a cover image, aiming to offer a seamless blend of robustness and adaptability. We navigate the complex landscape of this interdisciplinary domain, linking historical foundations, current innovations, and prospective developments… ▽ More

    Submitted 29 October, 2023; v1 submitted 8 August, 2023; originally announced August 2023.

    Comments: This paper was accepted for publication by the MDPI Applied Sciences journal

  43. arXiv:2306.16991  [pdf, other

    cs.CV

    Integrating Large Pre-trained Models into Multimodal Named Entity Recognition with Evidential Fusion

    Authors: Weide Liu, Xiaoyang Zhong, Jingwen Hou, Shaohua Li, Haozhe Huang, Yuming Fang

    Abstract: Multimodal Named Entity Recognition (MNER) is a crucial task for information extraction from social media platforms such as Twitter. Most current methods rely on attention weights to extract information from both text and images but are often unreliable and lack interpretability. To address this problem, we propose incorporating uncertainty estimation into the MNER task, producing trustworthy pred… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

  44. arXiv:2306.12873  [pdf, other

    physics.ao-ph cs.AI cs.LG

    FuXi: A cascade machine learning forecasting system for 15-day global weather forecast

    Authors: Lei Chen, Xiaohui Zhong, Feng Zhang, Yuan Cheng, Yinghui Xu, Yuan Qi, Hao Li

    Abstract: Over the past few years, due to the rapid development of machine learning (ML) models for weather forecasting, state-of-the-art ML models have shown superior performance compared to the European Centre for Medium-Range Weather Forecasts (ECMWF)'s high-resolution forecast (HRES) in 10-day forecasts at a spatial resolution of 0.25 degree. However, the challenge remains to perform comparably to the E… ▽ More

    Submitted 20 October, 2023; v1 submitted 22 June, 2023; originally announced June 2023.

  45. arXiv:2306.10056  [pdf, other

    cs.CL cs.IR

    Generate to Understand for Representation

    Authors: Changshang Xue, Xiande Zhong, Xiaoqing Liu

    Abstract: In recent years, a significant number of high-quality pretrained models have emerged, greatly impacting Natural Language Understanding (NLU), Natural Language Generation (NLG), and Text Representation tasks. Traditionally, these models are pretrained on custom domain corpora and finetuned for specific tasks, resulting in high costs related to GPU usage and labor. Unfortunately, recent trends in la… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

    MSC Class: 68T50 (Primary) 03B65; 91F20(Secondary) ACM Class: I.7

  46. arXiv:2305.14150  [pdf, other

    cs.CL cs.AI

    WYWEB: A NLP Evaluation Benchmark For Classical Chinese

    Authors: Bo Zhou, Qianglong Chen, Tianyu Wang, Xiaomi Zhong, Yin Zhang

    Abstract: To fully evaluate the overall performance of different NLP models in a given domain, many evaluation benchmarks are proposed, such as GLUE, SuperGLUE and CLUE. The fi eld of natural language understanding has traditionally focused on benchmarks for various tasks in languages such as Chinese, English, and multilingua, however, there has been a lack of attention given to the area of classical Chines… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted by ACL 2023

    Report number: 2023.findings-acl.204

    Journal ref: https://aclanthology.org/2023.findings-acl.204

  47. arXiv:2305.12691  [pdf, other

    cs.CV

    Hi-ResNet: A High-Resolution Remote Sensing Network for Semantic Segmentation

    Authors: Yuxia Chen, Pengcheng Fang, Jianhui Yu, Xiaoling Zhong, Xiaoming Zhang, Tianrui Li

    Abstract: High-resolution remote sensing (HRS) semantic segmentation extracts key objects from high-resolution coverage areas. However, objects of the same category within HRS images generally show significant differences in scale and shape across diverse geographical environments, making it difficult to fit the data distribution. Additionally, a complex background environment causes similar appearances of… ▽ More

    Submitted 23 May, 2023; v1 submitted 21 May, 2023; originally announced May 2023.

  48. arXiv:2305.09145  [pdf, other

    cs.LG cs.AI cs.CV cs.MM

    Deep ReLU Networks Have Surprisingly Simple Polytopes

    Authors: Feng-Lei Fan, Wei Huang, Xiangru Zhong, Lecheng Ruan, Tieyong Zeng, Huan Xiong, Fei Wang

    Abstract: A ReLU network is a piecewise linear function over polytopes. Figuring out the properties of such polytopes is of fundamental importance for the research and development of neural networks. So far, either theoretical or empirical studies on polytopes only stay at the level of counting their number, which is far from a complete characterization of polytopes. To upgrade the characterization to a new… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

  49. arXiv:2305.05773  [pdf, other

    cs.MM cs.LG

    DeepTextMark: A Deep Learning-Driven Text Watermarking Approach for Identifying Large Language Model Generated Text

    Authors: Travis Munyer, Abdullah Tanvir, Arjon Das, Xin Zhong

    Abstract: The rapid advancement of Large Language Models (LLMs) has significantly enhanced the capabilities of text generators. With the potential for misuse escalating, the importance of discerning whether texts are human-authored or generated by LLMs has become paramount. Several preceding studies have ventured to address this challenge by employing binary classifiers to differentiate between human-writte… ▽ More

    Submitted 11 March, 2024; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: The paper has been accpeted for publication by IEEE Access

  50. arXiv:2305.04066  [pdf, other

    cs.LG cs.NI eess.SP

    Semi-Asynchronous Federated Edge Learning Mechanism via Over-the-air Computation

    Authors: Zhoubin Kou, Yun Ji, Xiaoxiong Zhong, Sheng Zhang

    Abstract: Over-the-air Computation (AirComp) has been demonstrated as an effective transmission scheme to boost the efficiency of federated edge learning (FEEL). However, existing FEEL systems with AirComp scheme often employ traditional synchronous aggregation mechanisms for local model aggregation in each global round, which suffer from the stragglers issues. In this paper, we propose a semi-asynchronous… ▽ More

    Submitted 29 May, 2023; v1 submitted 6 May, 2023; originally announced May 2023.