Skip to main content

Showing 1–50 of 855 results for author: Ding, Z

  1. arXiv:2407.10940  [pdf, other

    quant-ph

    Quantum Control of an Oscillator with a Kerr-cat Qubit

    Authors: Andy Z. Ding, Benjamin L. Brock, Alec Eickbusch, Akshay Koottandavida, Nicholas E. Frattini, Rodrigo G. Cortinas, Vidul R. Joshi, Stijn J. de Graaf, Benjamin J. Chapman, Suhas Ganjam, Luigi Frunzio, Robert J. Schoelkopf, Michel H. Devoret

    Abstract: Bosonic codes offer a hardware-efficient strategy for quantum error correction by redundantly encoding quantum information in the large Hilbert space of a harmonic oscillator. However, experimental realizations of these codes are often limited by ancilla errors propagating to the encoded logical qubit during syndrome measurements. The Kerr-cat qubit has been proposed as an ancilla for these codes… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

  2. arXiv:2407.10078  [pdf, other

    cs.IR cs.AI

    Semantic Understanding and Data Imputation using Large Language Model to Accelerate Recommendation System

    Authors: Zhicheng Ding, Jiahao Tian, Zhenkai Wang, Jinman Zhao, Siyang Li

    Abstract: This paper aims to address the challenge of sparse and missing data in recommendation systems, a significant hurdle in the age of big data. Traditional imputation methods struggle to capture complex relationships within the data. We propose a novel approach that fine-tune Large Language Model (LLM) and use it impute missing data for recommendation systems. LLM which is trained on vast amounts of t… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

  3. arXiv:2407.09100  [pdf, other

    q-bio.NC

    Retrospective for the Dynamic Sensorium Competition for predicting large-scale mouse primary visual cortex activity from videos

    Authors: Polina Turishcheva, Paul G. Fahey, Michaela Vystrčilová, Laura Hansel, Rachel Froebe, Kayla Ponder, Yongrong Qiu, Konstantin F. Willeke, Mohammad Bashiri, Ruslan Baikulov, Yu Zhu, Lei Ma, Shan Yu, Tiejun Huang, Bryan M. Li, Wolf De Wulf, Nina Kudryashova, Matthias H. Hennig, Nathalie L. Rochefort, Arno Onken, Eric Wang, Zhiwei Ding, Andreas S. Tolias, Fabian H. Sinz, Alexander S Ecker

    Abstract: Understanding how biological visual systems process information is challenging because of the nonlinear relationship between visual input and neuronal responses. Artificial neural networks allow computational neuroscientists to create predictive models that connect biological and machine vision. Machine learning has benefited tremendously from benchmarks that compare different model on the same ta… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

  4. arXiv:2407.08678  [pdf, other

    cs.LG math.OC stat.CO stat.ML

    How to beat a Bayesian adversary

    Authors: Zihan Ding, Kexin Jin, Jonas Latz, Chenguang Liu

    Abstract: Deep neural networks and other modern machine learning models are often susceptible to adversarial attacks. Indeed, an adversary may often be able to change a model's prediction through a small, directed perturbation of the model's input - an issue in safety-critical applications. Adversarially robust machine learning is usually based on a minmax optimisation problem that minimises the machine lea… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    MSC Class: 90C15; 65C35; 68T07

  5. arXiv:2407.07690  [pdf

    physics.optics physics.app-ph

    High power GaSb-based distributed feedback laser with laterally coupled dielectric gratings at 1.95μm

    Authors: Zhengqing Ding, Juntian Cao, Kun Zhan, Yihang Chen, Lidan Zhou, Hao Tan, Chenao Yang, Ying Yu, Zhichuan Niu, Siyuan Yu

    Abstract: Traditional Distributed Feedback (DFB) or Distributed Bragg Reflector (DBR) lasers typically utilize buried gratings as frequency-selective optical feedback mechanisms. However, the fabrication of such gratings often necessitates regrowth processes, which can pose technical challenges for materials platforms such as GaAs and GaSb. Metal gratings were also used for GaSb lasers but they introduce ad… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: 9 pages, 7 figures, 1 table

    MSC Class: 78A60 ACM Class: J.2.6

  6. arXiv:2407.05233  [pdf, other

    cs.CL cs.AI

    Advancing Prompt Recovery in NLP: A Deep Dive into the Integration of Gemma-2b-it and Phi2 Models

    Authors: Jianlong Chen, Wei Xu, Zhicheng Ding, Jinxin Xu, Hao Yan, Xinyu Zhang

    Abstract: Prompt recovery, a crucial task in natural language processing, entails the reconstruction of prompts or instructions that language models use to convert input text into a specific output. Although pivotal, the design and effectiveness of prompts represent a challenging and relatively untapped field within NLP research. This paper delves into an exhaustive investigation of prompt recovery methodol… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

  7. arXiv:2407.04739  [pdf, other

    eess.SP

    Classification of Power Quality Disturbances Using Resnet with Channel Attention Mechanism

    Authors: Su Pan, Xingyang Nie, Xiaoyu Zhai, Biao Wang, Huilin Ge, Cheng He, Zhenping Ding

    Abstract: The detection and classification of power quality disturbances (PQDs) carries significant importance for power systems. In response to this imperative, numerous intelligent diagnostic methods have been developed. However, existing identification methods usually concentrate on single-type signals or on complex signals with two types, rendering them susceptible to noisy labels and environmental effe… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  8. arXiv:2407.04085  [pdf, other

    cs.CV

    FIPGNet:Pyramid grafting network with feature interaction strategies

    Authors: Ziyi Ding, Like Xin

    Abstract: Salient object detection is designed to identify the objects in an image that attract the most visual attention.Currently, the most advanced method of significance object detection adopts pyramid grafting network architecture.However, pyramid-graft network architecture still has the problem of failing to accurately locate significant targets.We observe that this is mainly due to the fact that curr… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:2309.08365 by other authors

  9. arXiv:2407.03900  [pdf, other

    cs.CV

    Oracle Bone Inscriptions Multi-modal Dataset

    Authors: Bang Li, Donghao Luo, Yujie Liang, Jing Yang, Zengmao Ding, Xu Peng, Boyuan Jiang, Shengwei Han, Dan Sui, Peichao Qin, Pian Wu, Chaoyang Wang, Yun Qi, Taisong Jin, Chengjie Wang, Xiaoming Huang, Zhan Shu, Rongrong Ji, Yongge Liu, Yunsheng Wu

    Abstract: Oracle bone inscriptions(OBI) is the earliest developed writing system in China, bearing invaluable written exemplifications of early Shang history and paleography. However, the task of deciphering OBI, in the current climate of the scholarship, can prove extremely challenging. Out of the 4,500 oracle bone characters excavated, only a third have been successfully identified. Therefore, leveraging… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  10. arXiv:2407.03899  [pdf, ps, other

    cs.IT eess.SP

    Hybrid NOMA Assisted OFDMA Uplink Transmission

    Authors: Zhiguo Ding, H. Vincent Poor

    Abstract: Hybrid non-orthogonal multiple access (NOMA) has recently received significant research interest due to its ability to efficiently use resources from different domains and also its compatibility with various orthogonal multiple access (OMA) based legacy networks. Unlike existing studies on hybrid NOMA that focus on combining NOMA with time-division multiple access (TDMA), this work considers hybri… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  11. arXiv:2407.01612  [pdf, other

    math.CO

    A Note on Improved bounds for the Oriented Radius of Mixed Multigraphs

    Authors: Hengzhe Li, Zhiwei Ding, Jianbing Liu, Yanhong Gao, Shuli Zhao

    Abstract: For a positive integer $r$, let $f(r)$ denote the smallest number such that any 2-edge connected mixed graph with radius $r$ has an oriented radius of at most $f(r)$. Recently, Babu, Benson, and Rajendraprasad significantly improved the upper bound of $f(r)$ by establishing that $f(r) \leq 1.5r^2 + r + 1$, see [Improved bounds for the oriented radius of mixed multigraphs, J. Graph Theory, 103 (202… ▽ More

    Submitted 27 June, 2024; originally announced July 2024.

    Comments: 7 pages, 1 figure

    MSC Class: 05C12; 05C40

  12. arXiv:2407.00647  [pdf, other

    cond-mat.mes-hall quant-ph

    Critical fluctuation and noise spectra in two-dimensional Fe$_{3}$GeTe$_{2}$ magnets

    Authors: Yuxin Li, Zhe Ding, Chen Wang, Haoyu Sun, Zhousheng Chen, Pengfei Wang, Ya Wang, Ming Gong, Hualing Zeng, Fazhan Shi, Jiangfeng Du

    Abstract: Critical fluctuations play a fundamental role in determining the spin orders for low-dimensional quantum materials, especially for recently discovered two-dimensional (2D) magnets. Here we employ the quantum decoherence imaging technique utilizing nitrogen-vacancy centers in diamond to explore the critical magnetic fluctuations and the associated temporal spin noise in van der Waals magnet… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  13. arXiv:2406.18375  [pdf, other

    cs.CV

    From Majority to Minority: A Diffusion-based Augmentation for Underrepresented Groups in Skin Lesion Analysis

    Authors: Janet Wang, Yunsung Chung, Zhengming Ding, Jihun Hamm

    Abstract: AI-based diagnoses have demonstrated dermatologist-level performance in classifying skin cancer. However, such systems are prone to under-performing when tested on data from minority groups that lack sufficient representation in the training sets. Although data collection and annotation offer the best means for promoting minority groups, these processes are costly and time-consuming. Prior works h… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  14. arXiv:2406.18284  [pdf, other

    cs.CV

    RealTalk: Real-time and Realistic Audio-driven Face Generation with 3D Facial Prior-guided Identity Alignment Network

    Authors: Xiaozhong Ji, Chuming Lin, Zhonggan Ding, Ying Tai, Jian Yang, Junwei Zhu, Xiaobin Hu, Jiangning Zhang, Donghao Luo, Chengjie Wang

    Abstract: Person-generic audio-driven face generation is a challenging task in computer vision. Previous methods have achieved remarkable progress in audio-visual synchronization, but there is still a significant gap between current results and practical applications. The challenges are two-fold: 1) Preserving unique individual traits for achieving high-precision lip synchronization. 2) Generating high-qual… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  15. arXiv:2406.18204  [pdf, other

    cs.NI

    Analysis of Channel Uncertainty in Trusted Wireless Services via Repeated Interactions

    Authors: Bingwen Chen, Xintong Ling, Weihang Cao, Jiaheng Wang, Zhi Ding

    Abstract: The coexistence of heterogeneous sub-networks in 6G poses new security and trust concerns and thus calls for a perimeterless-security model. Blockchain radio access network (B-RAN) provides a trust-building approach via repeated interactions rather than relying on pre-established trust or central authentication. Such a trust-building process naturally supports dynamic trusted services across vario… ▽ More

    Submitted 2 July, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

  16. arXiv:2406.17764  [pdf, other

    cs.CL cs.AI

    BMIKE-53: Investigating Cross-Lingual Knowledge Editing with In-Context Learning

    Authors: Ercong Nie, Bo Shao, Zifeng Ding, Mingyang Wang, Helmut Schmid, Hinrich Schütze

    Abstract: Large language models (LLMs) possess extensive parametric knowledge, but this knowledge is difficult to update with new information because retraining is very expensive and infeasible for closed-source models. Knowledge editing (KE) has emerged as a viable solution for updating the knowledge of LLMs without compromising their overall performance. On-the-fly KE methods, inspired by in-context learn… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 12 pages, 4 figures

  17. arXiv:2406.16700  [pdf, other

    cond-mat.str-el hep-th math-ph

    Anomalies in mirror symmetry enriched topological orders

    Authors: Zhaoyang Ding, Yang Qi

    Abstract: Two-dimensional mirror symmetry-enriched topological orders can be studied using the folding approach: it can be folded along the mirror axis and turned into a bilayer system on which the mirror symmetry acts as a $\mathbb Z_2$ layer-exchange symmetry. How mirror-symmetry enriches the topological order is then encoded at the mirror axis, which is a gapped boundary of the folded bilayer system. Bas… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 13 pages, 5 figures

  18. arXiv:2406.15222  [pdf

    eess.IV cs.AI cs.CV

    Rapid and Accurate Diagnosis of Acute Aortic Syndrome using Non-contrast CT: A Large-scale, Retrospective, Multi-center and AI-based Study

    Authors: Yujian Hu, Yilang Xiang, Yan-Jie Zhou, Yangyan He, Shifeng Yang, Xiaolong Du, Chunlan Den, Youyao Xu, Gaofeng Wang, Zhengyao Ding, Jingyong Huang, Wenjun Zhao, Xuejun Wu, Donglin Li, Qianqian Zhu, Zhenjiang Li, Chenyang Qiu, Ziheng Wu, Yunjun He, Chen Tian, Yihui Qiu, Zuodong Lin, Xiaolong Zhang, Yuan He, Zhenpeng Yuan , et al. (15 additional authors not shown)

    Abstract: Chest pain symptoms are highly prevalent in emergency departments (EDs), where acute aortic syndrome (AAS) is a catastrophic cardiovascular emergency with a high fatality rate, especially when timely and accurate treatment is not administered. However, current triage practices in the ED can cause up to approximately half of patients with AAS to have an initially missed diagnosis or be misdiagnosed… ▽ More

    Submitted 24 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: under peer review

  19. arXiv:2406.14556  [pdf, other

    cs.RO cs.CV

    Asynchronous Large Language Model Enhanced Planner for Autonomous Driving

    Authors: Yuan Chen, Zi-han Ding, Ziqin Wang, Yan Wang, Lijun Zhang, Si Liu

    Abstract: Despite real-time planners exhibiting remarkable performance in autonomous driving, the growing exploration of Large Language Models (LLMs) has opened avenues for enhancing the interpretability and controllability of motion planning. Nevertheless, LLM-based planners continue to encounter significant challenges, including elevated resource consumption and extended inference times, which pose substa… ▽ More

    Submitted 21 June, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

  20. arXiv:2406.11455  [pdf, other

    cs.CL cs.AI

    Adaptive Reinforcement Learning Planning: Harnessing Large Language Models for Complex Information Extraction

    Authors: Zepeng Ding, Ruiyang Ke, Wenhao Huang, Guochao Jiang, Yanda Li, Deqing Yang, Yanghua Xiao, Jiaqing Liang

    Abstract: Existing research on large language models (LLMs) shows that they can solve information extraction tasks through multi-step planning. However, their extraction behavior on complex sentences and tasks is unstable, emerging issues such as false positives and missing elements. We observe that decomposing complex extraction tasks and extracting them step by step can effectively improve LLMs' performan… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  21. arXiv:2406.09881  [pdf, other

    cs.CL

    A Unified Data Augmentation Framework for Low-Resource Multi-Domain Dialogue Generation

    Authors: Yongkang Liu, Ercong Nie, Shi Feng, Zheng Hua, Zifeng Ding, Daling Wang, Yifei Zhang, Hinrich Schütze

    Abstract: Current state-of-the-art dialogue systems heavily rely on extensive training datasets. However, challenges arise in domains where domain-specific training datasets are insufficient or entirely absent. To tackle this challenge, we propose a novel data \textbf{A}ugmentation framework for \textbf{M}ulti-\textbf{D}omain \textbf{D}ialogue \textbf{G}eneration, referred to as \textbf{AMD$^2$G}. The AMD… ▽ More

    Submitted 28 June, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

    Comments: 17pages,ECML-PKDD

    Journal ref: 2024 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases

  22. arXiv:2406.09606  [pdf, other

    cs.LG cs.AI cs.AR

    Cross-Modality Program Representation Learning for Electronic Design Automation with High-Level Synthesis

    Authors: Zongyue Qin, Yunsheng Bai, Atefeh Sohrabizadeh, Zijian Ding, Ziniu Hu, Yizhou Sun, Jason Cong

    Abstract: In recent years, domain-specific accelerators (DSAs) have gained popularity for applications such as deep learning and autonomous driving. To facilitate DSA designs, programmers use high-level synthesis (HLS) to compile a high-level description written in C/C++ into a design with low-level hardware description languages that eventually synthesize DSAs on circuits. However, creating a high-quality… ▽ More

    Submitted 27 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: 14 pages, 8 figures. arXiv admin note: text overlap with arXiv:2305.10838

  23. arXiv:2406.08882  [pdf, other

    quant-ph

    SA-DQAS: Self-attention Enhanced Differentiable Quantum Architecture Search

    Authors: Yize Sun, Jiarui Liu, Zixin Wu, Zifeng Ding, Yunpu Ma, Thomas Seidl, Volker Tresp

    Abstract: We introduce SA-DQAS in this paper, a novel framework that enhances the gradient-based Differentiable Quantum Architecture Search (DQAS) with a self-attention mechanism, aimed at optimizing circuit design for Quantum Machine Learning (QML) challenges. Analogous to a sequence of words in a sentence, a quantum circuit can be viewed as a sequence of placeholders containing quantum gates. Unlike DQAS,… ▽ More

    Submitted 11 July, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: 4 pages

  24. Practical, Automated Scenario-based Mobile App Testing

    Authors: Shengcheng Yu, Chunrong Fang, Mingzhe Du, Zimin Ding, Zhenyu Chen, Zhendong Su

    Abstract: The importance of mobile application (app) quality insurance is increasing with the rapid development of the mobile Internet. Automated test generation approaches, as a dominant direction of app quality insurance, follow specific models or strategies, targeting at optimizing the code coverage. Such approaches lead to a huge gap between testing execution and app business logic. Test scripts develop… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted by IEEE Transaction on Software Engineering in 2024

  25. arXiv:2406.06085  [pdf, other

    astro-ph.CO

    Fiducial-Cosmology-dependent systematics for the DESI 2024 BAO Analysis

    Authors: A. Pérez-Fernández, L. Medina-Varela, R. Ruggeri, M. Vargas-Magaña, H. Seo, N. Padmanabhan, M. Ishak, J. Aguilar, S. Ahlen, S. Alam, O. Alves, S. Brieden, D. Brooks, A. Carnero Rosell, X. Chen, T. Claybaugh, S. Cole, K. Dawson, A. de la Macorra, A. de Mattia, Arjun Dey, Z. Ding, P. Doel, K. Fanning, C. Garcia-Quintero , et al. (38 additional authors not shown)

    Abstract: When measuring the Baryon Acoustic Oscillations (BAO) scale from galaxy surveys, one typically assumes a fiducial cosmology when converting redshift measurements into comoving distances and also when defining input parameters for the reconstruction algorithm. A parameterised template for the model to be fitted is also created based on a (possibly different) fiducial cosmology. This model reliance… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Supporting publication of DESI 2024 III: Baryon Acoustic Oscillations from Galaxies and Quasars

  26. arXiv:2406.03508  [pdf, other

    cs.LG cs.AI cs.CR

    Mutual Information Guided Backdoor Mitigation for Pre-trained Encoders

    Authors: Tingxu Han, Weisong Sun, Ziqi Ding, Chunrong Fang, Hanwei Qian, Jiaxun Li, Zhenyu Chen, Xiangyu Zhang

    Abstract: Self-supervised learning (SSL) is increasingly attractive for pre-training encoders without requiring labeled data. Downstream tasks built on top of those pre-trained encoders can achieve nearly state-of-the-art performance. The pre-trained encoders by SSL, however, are vulnerable to backdoor attacks as demonstrated by existing studies. Numerous backdoor mitigation techniques are designed for down… ▽ More

    Submitted 11 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

  27. arXiv:2406.02081  [pdf, other

    cs.MA cs.AI cs.LG

    FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning

    Authors: Wenzhe Li, Zihan Ding, Seth Karten, Chi Jin

    Abstract: Recent advances in reinforcement learning (RL) heavily rely on a variety of well-designed benchmarks, which provide environmental platforms and consistent criteria to evaluate existing and novel algorithms. Specifically, in multi-agent RL (MARL), a plethora of benchmarks based on cooperative games have spurred the development of algorithms that improve the scalability of cooperative multi-agent sy… ▽ More

    Submitted 23 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: ICML 2024

  28. arXiv:2406.01956  [pdf, other

    cs.CV

    Enhance Image-to-Image Generation with LLaVA Prompt and Negative Prompt

    Authors: Zhicheng Ding, Panfeng Li, Qikai Yang, Siyang Li

    Abstract: This paper presents a novel approach to enhance image-to-image generation by leveraging the multimodal capabilities of the Large Language and Vision Assistant (LLaVA). We propose a framework where LLaVA analyzes input images and generates textual descriptions, hereinafter LLaVA-generated prompts. These prompts, along with the original image, are fed into the image-to-image generation pipeline. Thi… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted by 2024 5th International Conference on Information Science, Parallel and Distributed Systems

  29. arXiv:2406.01213  [pdf, other

    cs.CL cs.AI

    Improving Pseudo Labels with Global-Local Denoising Framework for Cross-lingual Named Entity Recognition

    Authors: Zhuojun Ding, Wei Wei, Xiaoye Qu, Dangyang Chen

    Abstract: Cross-lingual named entity recognition (NER) aims to train an NER model for the target language leveraging only labeled source language data and unlabeled target language data. Prior approaches either perform label projection on translated source language data or employ a source model to assign pseudo labels for target language data and train a target model on these pseudo-labeled data to generali… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted by IJCAI 2024

  30. arXiv:2406.00990  [pdf, other

    cs.LG cs.RO

    Constraint-Aware Diffusion Models for Trajectory Optimization

    Authors: Anjian Li, Zihan Ding, Adji Bousso Dieng, Ryne Beeson

    Abstract: The diffusion model has shown success in generating high-quality and diverse solutions to trajectory optimization problems. However, diffusion models with neural networks inevitably make prediction errors, which leads to constraint violations such as unmet goals or collisions. This paper presents a novel constraint-aware diffusion model for trajectory optimization. We introduce a novel hybrid loss… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  31. arXiv:2406.00233  [pdf, other

    eess.SP

    Plug-in UL-CSI-Assisted Precoder Upsampling Approach in Cellular FDD Systems

    Authors: Yu-Chien Lin, Yan Xin, Ta-Sung Lee, Charlie, Zhang, Yibo Ma, Zhi Ding

    Abstract: Acquiring downlink channel state information (CSI) is crucial for optimizing performance in massive Multiple Input Multiple Output (MIMO) systems operating under Frequency-Division Duplexing (FDD). Most cellular wireless communication systems employ codebook-based precoder designs, which offer advantages such as simpler, more efficient feedback mechanisms and reduced feedback overhead. Common code… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

  32. arXiv:2405.19730  [pdf

    cs.AI cs.CV cs.LG

    Research on Foundation Model for Spatial Data Intelligence: China's 2024 White Paper on Strategic Development of Spatial Data Intelligence

    Authors: Shaohua Wang, Xing Xie, Yong Li, Danhuai Guo, Zhi Cai, Yu Liu, Yang Yue, Xiao Pan, Feng Lu, Huayi Wu, Zhipeng Gui, Zhiming Ding, Bolong Zheng, Fuzheng Zhang, Tao Qin, Jingyuan Wang, Chuang Tao, Zhengchao Chen, Hao Lu, Jiayi Li, Hongyang Chen, Peng Yue, Wenhao Yu, Yao Yao, Leilei Sun , et al. (9 additional authors not shown)

    Abstract: This report focuses on spatial data intelligent large models, delving into the principles, methods, and cutting-edge applications of these models. It provides an in-depth discussion on the definition, development history, current status, and trends of spatial data intelligent large models, as well as the challenges they face. The report systematically elucidates the key technologies of spatial dat… ▽ More

    Submitted 29 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

    Comments: in Chinese language

  33. arXiv:2405.19027  [pdf, other

    cs.DC

    A Dual-functional Blockchain Framework for Solving Distributed Optimization

    Authors: Weihang Cao, Xintong Ling, Jiaheng Wang, Xiqi Gao, Zhi Ding

    Abstract: Proof of Work (PoW) has been extensively utilized as the foundation of blockchain's security, consistency, and tamper-resistance. However, long has it been criticized for its tremendous and inefficient utilization of computational power and energy. In this work, we design a dual-functional blockchain framework that uses solving optimization problems to reach consensus as an alternative to PoW, cha… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  34. arXiv:2405.17512  [pdf, other

    cs.LG cs.AI cs.CY

    On Fairness of Low-Rank Adaptation of Large Models

    Authors: Zhoujie Ding, Ken Ziyu Liu, Pura Peetathawatchai, Berivan Isik, Sanmi Koyejo

    Abstract: Low-rank adaptation of large models, particularly LoRA, has gained traction due to its computational efficiency. This efficiency, contrasted with the prohibitive costs of full-model fine-tuning, means that practitioners often turn to LoRA and sometimes without a complete understanding of its ramifications. In this study, we focus on fairness and ask whether LoRA has an unexamined impact on utility… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  35. arXiv:2405.17069  [pdf, other

    cs.CV cs.LG

    Training-free Editioning of Text-to-Image Models

    Authors: Jinqi Wang, Yunfei Fu, Zhangcan Ding, Bailin Deng, Yu-Kun Lai, Yipeng Qin

    Abstract: Inspired by the software industry's practice of offering different editions or versions of a product tailored to specific user groups or use cases, we propose a novel task, namely, training-free editioning, for text-to-image models. Specifically, we aim to create variations of a base text-to-image model without retraining, enabling the model to cater to the diverse needs of different user groups o… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  36. arXiv:2405.17067  [pdf, other

    cs.CL cs.AI

    Tokenization Matters! Degrading Large Language Models through Challenging Their Tokenization

    Authors: Dixuan Wang, Yanda Li, Junyuan Jiang, Zepeng Ding, Guochao Jiang, Jiaqing Liang, Deqing Yang

    Abstract: Large Language Models (LLMs) have shown remarkable capabilities in language understanding and generation. Nonetheless, it was also witnessed that LLMs tend to produce inaccurate responses to specific queries. This deficiency can be traced to the tokenization step LLMs must undergo, which is an inevitable limitation inherent to all LLMs. In fact, incorrect tokenization is the critical point that hi… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 17 pages, 3 figures, this paper is submitted to neurips 2024

  37. arXiv:2405.16283  [pdf, other

    cs.DC

    TURNIP: A "Nondeterministic" GPU Runtime with CPU RAM Offload

    Authors: Zhimin Ding, Jiawen Yao, Brianna Barrow, Tania Lorido Botran, Christopher Jermaine, Yuxin Tang, Jiehui Li, Xinyu Yao, Sleem Mahmoud Abdelghafar, Daniel Bourgeois

    Abstract: An obvious way to alleviate memory difficulties in GPU-based AI computing is via CPU offload, where data are moved between GPU and CPU RAM, so inexpensive CPU RAM is used to increase the amount of storage available. While CPU offload is an obvious idea, it can greatly slow down a computation, due to the relatively slow transfer rate between CPU RAM and GPU RAM. Thus, any system for CPU offload nee… ▽ More

    Submitted 27 May, 2024; v1 submitted 25 May, 2024; originally announced May 2024.

  38. arXiv:2405.16085  [pdf, other

    cs.CV

    Deep-PE: A Learning-Based Pose Evaluator for Point Cloud Registration

    Authors: Junjie Gao, Chongjian Wang, Zhongjun Ding, Shuangmin Chen, Shiqing Xin, Changhe Tu, Wenping Wang

    Abstract: In the realm of point cloud registration, the most prevalent pose evaluation approaches are statistics-based, identifying the optimal transformation by maximizing the number of consistent correspondences. However, registration recall decreases significantly when point clouds exhibit a low overlap rate, despite efforts in designing feature descriptors and establishing correspondences. In this paper… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 22 pages, 16 figures

  39. arXiv:2405.15978  [pdf, ps, other

    eess.SP

    Exploring Age-of-Information Weighting in Federated Learning under Data Heterogeneity

    Authors: Kaidi Wang, Zhiguo Ding, Daniel K. C. So, Zhi Ding

    Abstract: This paper investigates federated learning in a wireless communication system, where random device selection is employed with non-independent and identically distributed (non-IID) data. The analysis indicates that while training deep learning networks using federated stochastic gradient descent (FedSGD) on non-IID datasets, device selection can generate gradient errors that accumulate, leading to… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  40. arXiv:2405.15336  [pdf, other

    cs.RO eess.IV

    An iterative closest point algorithm for marker-free 3D shape registration of continuum robots

    Authors: Matthias K. Hoffmann, Julian Mühlenhoff, Zhaoheng Ding, Thomas Sattel, Kathrin Flaßkamp

    Abstract: Continuum robots have emerged as a promising technology in the medical field due to their potential of accessing deep sited locations of the human body with low surgical trauma. When deriving physics-based models for these robots, evaluating the models poses a significant challenge due to the difficulty in accurately measuring their intricate shapes. In this work, we present an optimization based… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 11 pages, 8 figures, 2 algorithms, journal

  41. arXiv:2405.15212  [pdf, ps, other

    math.DS

    Packing topological pressure for amenable group actions

    Authors: Ziqing Ding, Ercai Chen, Xiaoyao Zhou

    Abstract: In this paper, we first prove the variational principle for amenable packing topological pressure. Then we obtain an inequality concerning amenable packing pressure for factor maps. Finally, we show that the equality about packing topological pressure of the set of generic points when the system satisfies the almost specification property, or $μ$ is ergodic.

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 27 pages

  42. arXiv:2405.12751  [pdf, other

    cs.CR

    A Stealthy Backdoor Attack for Without-Label-Sharing Split Learning

    Authors: Yuwen Pu, Zhuoyuan Ding, Jiahao Chen, Chunyi Zhou, Qingming Li, Chunqiang Hu, Shouling Ji

    Abstract: As a novel privacy-preserving paradigm aimed at reducing client computational costs and achieving data utility, split learning has garnered extensive attention and proliferated widespread applications across various fields, including smart health and smart transportation, among others. While recent studies have primarily concentrated on addressing privacy leakage concerns in split learning, such a… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 15 pages

  43. arXiv:2405.12457  [pdf, other

    physics.ins-det

    A High Compression Ratio Channel Multiplexing Method for Micro-pattern Gaseous Detectors

    Authors: Yu Wang, Shubin Liu, Hao Zhuang, Zhengwu Ding, Zhihang Yao, Changqing Feng, Zhiyong Zhang

    Abstract: Micro-pattern gas detectors (MPGD) find wide-ranging applications in particle physics experiments, industry, and medical services, owing to their large area, fine spatial resolution, and relatively low material content within the sensitive region. However, the demand for a large number of readout channels poses a bottleneck, limiting the application of MPGD to achieve higher accuracy and more exte… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: This is the first submitted version to the IEEE-TNS

  44. arXiv:2405.10515  [pdf

    cs.LG

    Improved AdaBoost for Virtual Reality Experience Prediction Based on Long Short-Term Memory Network

    Authors: Wenhan Fan, Zhicheng Ding, Ruixin Huang, Chang Zhou, Xuyang Zhang

    Abstract: A classification prediction algorithm based on Long Short-Term Memory Network (LSTM) improved AdaBoost is used to predict virtual reality (VR) user experience. The dataset is randomly divided into training and test sets in the ratio of 7:3.During the training process, the model's loss value decreases from 0.65 to 0.31, which shows that the model gradually reduces the discrepancy between the predic… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  45. arXiv:2405.07977  [pdf, other

    q-bio.QM cs.LG q-bio.NC

    A Demographic-Conditioned Variational Autoencoder for fMRI Distribution Sampling and Removal of Confounds

    Authors: Anton Orlichenko, Gang Qu, Ziyu Zhou, Anqi Liu, Hong-Wen Deng, Zhengming Ding, Julia M. Stephen, Tony W. Wilson, Vince D. Calhoun, Yu-Ping Wang

    Abstract: Objective: fMRI and derived measures such as functional connectivity (FC) have been used to predict brain age, general fluid intelligence, psychiatric disease status, and preclinical neurodegenerative disease. However, it is not always clear that all demographic confounds, such as age, sex, and race, have been removed from fMRI data. Additionally, many fMRI datasets are restricted to authorized re… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 12 pages

  46. arXiv:2405.07057  [pdf, other

    eess.SP

    On the Reliability and Security of Ambient Backscatter Uplink NOMA Networks

    Authors: Athanasios P. Chrysologou, Nestor D. Chatzidiamantis, Alexandros-Apostolos A. Boulogeorgos, Zhiguo Ding

    Abstract: A fundamental objective of the forthcoming sixth-generation wireless networks is to concurrently serve a vast array of devices many of which, such as Internet-of-Things (IoT) sensors, are projected to have low power requirements or even operate in a battery-free manner. To achieve this goal, non-orthogonal multiple access (NOMA) and ambient backscatter communications (AmBC) are regarded as two piv… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

    Comments: 14 pages, 6 figures

  47. arXiv:2405.05512  [pdf, other

    cs.LG cs.AI math.NA math.ST

    Characteristic Learning for Provable One Step Generation

    Authors: Zhao Ding, Chenguang Duan, Yuling Jiao, Ruoxuan Li, Jerry Zhijian Yang, Pingwen Zhang

    Abstract: We propose the characteristic generator, a novel one-step generative model that combines the efficiency of sampling in Generative Adversarial Networks (GANs) with the stable performance of flow-based models. Our model is driven by characteristics, along which the probability density transport can be described by ordinary differential equations (ODEs). Specifically, We estimate the velocity field t… ▽ More

    Submitted 16 July, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

  48. arXiv:2405.04960  [pdf, other

    cs.CL

    P-ICL: Point In-Context Learning for Named Entity Recognition with Large Language Models

    Authors: Guochao Jiang, Zepeng Ding, Yuchen Shi, Deqing Yang

    Abstract: In recent years, the rise of large language models (LLMs) has made it possible to directly achieve named entity recognition (NER) without any demonstration samples or only using a few samples through in-context learning (ICL). However, standard ICL only helps LLMs understand task instructions, format and input-label mapping, but neglects the particularity of the NER task itself. In this paper, we… ▽ More

    Submitted 17 June, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

  49. arXiv:2405.03764  [pdf, other

    cs.CL cs.IR

    GOVERN: Gradient Orientation Vote Ensemble for Multi-Teacher Reinforced Distillation

    Authors: Wenjie Zhou, Zhenxin Ding, Xiaodong Zhang, Haibo Shi, Junfeng Wang, Dawei Yin

    Abstract: Pre-trained language models have become an integral component of question-answering systems, achieving remarkable performance. For practical deployment, it is critical to carry out knowledge distillation to preserve high performance under computational constraints. In this paper, we address a key question: given the importance of unsupervised distillation for student performance, how does one effe… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  50. arXiv:2405.02567  [pdf, other

    eess.SP

    TiRE-GAN: Task-Incentivized Generative Learning Models for Radiomap Estimation with Radio Propagation Model

    Authors: Yueling Zhou, Achintha Wijesinghe, Songyang Zhang, Zhi Ding

    Abstract: Enriching geometric information on radio frequency (RF) signal power distribution in wireless communication systems, the radiomap has become an essential tool for resource allocation and network management. Usually, a dense radiomap is reconstructed from sparse observations collected by deployed sensors or mobile devices, which makes the radiomap estimation an urgent challenge. To leverage both ph… ▽ More

    Submitted 4 May, 2024; originally announced May 2024.