Skip to main content

Showing 1–50 of 750 results for author: Ding, J

  1. arXiv:2407.11965  [pdf, other

    cs.CV

    UrbanWorld: An Urban World Model for 3D City Generation

    Authors: Yu Shang, Jiansheng Chen, Hangyu Fan, Jingtao Ding, Jie Feng, Yong Li

    Abstract: Cities, as the most fundamental environment of human life, encompass diverse physical elements such as buildings, roads and vegetation with complex interconnection. Crafting realistic, interactive 3D urban environments plays a crucial role in constructing AI agents capable of perceiving, decision-making, and acting like humans in real-world environments. However, creating high-fidelity 3D urban en… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 11 pages

  2. arXiv:2407.11094  [pdf, other

    stat.ME eess.SP stat.ML

    Robust Score-Based Quickest Change Detection

    Authors: Sean Moushegian, Suya Wu, Enmao Diao, Jie Ding, Taposh Banerjee, Vahid Tarokh

    Abstract: Methods in the field of quickest change detection rapidly detect in real-time a change in the data-generating distribution of an online data stream. Existing methods have been able to detect this change point when the densities of the pre- and post-change distributions are known. Recent work has extended these results to the case where the pre- and post-change distributions are known only by their… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:2306.05091

  3. arXiv:2407.10990  [pdf

    cs.CL cs.AI

    MedBench: A Comprehensive, Standardized, and Reliable Benchmarking System for Evaluating Chinese Medical Large Language Models

    Authors: Mianxin Liu, Jinru Ding, Jie Xu, Weiguo Hu, Xiaoyang Li, Lifeng Zhu, Zhian Bai, Xiaoming Shi, Benyou Wang, Haitao Song, Pengfei Liu, Xiaofan Zhang, Shanshan Wang, Kang Li, Haofen Wang, Tong Ruan, Xuanjing Huang, Xin Sun, Shaoting Zhang

    Abstract: Ensuring the general efficacy and goodness for human beings from medical large language models (LLM) before real-world deployment is crucial. However, a widely accepted and accessible evaluation process for medical LLM, especially in the Chinese context, remains to be established. In this work, we introduce "MedBench", a comprehensive, standardized, and reliable benchmarking system for Chinese med… ▽ More

    Submitted 23 June, 2024; originally announced July 2024.

    Comments: 25 pages.4 figures

  4. arXiv:2407.10393  [pdf, ps, other

    eess.SP

    New Paradigm for Secure Full-Duplex Transmission: Movable Antenna-Aided Multi-User Systems

    Authors: Jingze Ding, Zijian Zhou, Bingli Jiao

    Abstract: In this paper, we investigate physical layer security (PLS) for full-duplex (FD) multi-user systems. To simultaneously protect uplink (UL) and downlink (DL) transmissions and ensure efficient use of time-frequency resources, we consider a base station (BS) that operates in FD mode and enables to emit the artificial noise (AN). Conventional fixed-position antennas (FPAs) at the BS struggle to fully… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: 13 pages

  5. arXiv:2407.10050  [pdf, other

    math.NA

    Entropy Increasing Numerical Methods for Prediction of Non-isothermal Electrokinetics in Supercapacitors

    Authors: Jie Ding, Xiang Ji, Shenggao Zhou

    Abstract: Accurate characterization of entropy plays a pivotal role in capturing reversible and irreversible heating in supercapacitors during charging/discharging cycles. However, numerical methods that can faithfully capture entropy variation in supercapacitors are still in lack. This work proposes a novel second-order accurate finite-volume scheme for a Poisson--Nernst--Planck--Fourier model developed in… ▽ More

    Submitted 13 July, 2024; originally announced July 2024.

  6. arXiv:2407.02818  [pdf, other

    cs.SE cs.ET cs.PL

    WizardMerge -- Save Us From Merging Without Any Clues

    Authors: Qingyu Zhang, Junzhe Li, Jiayi Lin, Jie Ding, Lanteng Lin, Chenxiong Qian

    Abstract: Modern software development necessitates efficient version-oriented collaboration among developers. While Git is the most popular version control system, it generates unsatisfactory version merging results due to textual-based workflow, leading to potentially unexpected results in the merged version of the project. Although numerous merging tools have been proposed for improving merge results, dev… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 22 pages

    ACM Class: D.2; D.3

  7. arXiv:2407.00747  [pdf, other

    cs.CL cs.AI

    A Comparative Study of Quality Evaluation Methods for Text Summarization

    Authors: Huyen Nguyen, Haihua Chen, Lavanya Pobbathi, Junhua Ding

    Abstract: Evaluating text summarization has been a challenging task in natural language processing (NLP). Automatic metrics which heavily rely on reference summaries are not suitable in many situations, while human evaluation is time-consuming and labor-intensive. To bridge this gap, this paper proposes a novel method based on large language models (LLMs) for evaluating text summarization. We also conducts… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: The paper is under review at Empirical Methods in Natural Language Processing (EMNLP) 2024. It has 15 pages and 4 figures

  8. arXiv:2406.19875  [pdf, other

    cs.CV

    InfiniBench: A Comprehensive Benchmark for Large Multimodal Models in Very Long Video Understanding

    Authors: Kirolos Ataallah, Chenhui Gou, Eslam Abdelrahman, Khushbu Pahwa, Jian Ding, Mohamed Elhoseiny

    Abstract: Understanding long videos, ranging from tens of minutes to several hours, presents unique challenges in video comprehension. Despite the increasing importance of long-form video content, existing benchmarks primarily focus on shorter clips. To address this gap, we introduce InfiniBench a comprehensive benchmark for very long video understanding which presents 1)The longest video duration, averagin… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: 16 page ,17 figures

  9. arXiv:2406.19614  [pdf, other

    cs.LG cs.AI

    A Survey on Data Quality Dimensions and Tools for Machine Learning

    Authors: Yuhan Zhou, Fengjiao Tu, Kewei Sha, Junhua Ding, Haihua Chen

    Abstract: Machine learning (ML) technologies have become substantial in practically all aspects of our society, and data quality (DQ) is critical for the performance, fairness, robustness, safety, and scalability of ML models. With the large and complex data in data-centric AI, traditional methods like exploratory data analysis (EDA) and cross-validation (CV) face challenges, highlighting the importance of… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: This paper has been accepted by The 6th IEEE International Conference on Artificial Intelligence Testing (IEEE AITest 2024) as an invited paper

  10. arXiv:2406.18087  [pdf, other

    cs.SE cs.AI cs.CL

    EHR-Based Mobile and Web Platform for Chronic Disease Risk Prediction Using Large Language Multimodal Models

    Authors: Chun-Chieh Liao, Wei-Ting Kuo, I-Hsuan Hu, Yen-Chen Shih, Jun-En Ding, Feng Liu, Fang-Ming Hung

    Abstract: Traditional diagnosis of chronic diseases involves in-person consultations with physicians to identify the disease. However, there is a lack of research focused on predicting and developing application systems using clinical notes and blood test values. We collected five years of Electronic Health Records (EHRs) from Taiwan's hospital database between 2017 and 2021 as an AI database. Furthermore,… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  11. arXiv:2406.14289  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci cond-mat.str-el

    Electrical switching of chirality in rhombohedral graphene Chern insulators

    Authors: Jing Ding, Hanxiao Xiang, Jiannan Hua, Wenqiang Zhou, Naitian Liu, Le Zhang, Na Xin, Kenji Watanabe, Takashi Taniguchi, Wei Zhu, Shuigang Xu

    Abstract: A Chern insulator hosts topologically protected chiral edge currents with quantized conductance characterized by its Chern number. Switching the chirality of the Chern insulator, namely, the direction of the edge current, is highly challenging due to topologically forbidden backscattering but is of considerable importance for the design of topological devices. Nevertheless, this can be achieved by… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 21 pages, 4 figures in main text

  12. arXiv:2406.12384  [pdf, other

    cs.CV

    VRSBench: A Versatile Vision-Language Benchmark Dataset for Remote Sensing Image Understanding

    Authors: Xiang Li, Jian Ding, Mohamed Elhoseiny

    Abstract: We introduce a new benchmark designed to advance the development of general-purpose, large-scale vision-language models for remote sensing images. Although several vision-language datasets in remote sensing have been proposed to pursue this goal, existing datasets are typically tailored to single tasks, lack detailed object information, or suffer from inadequate quality control. Exploring these im… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Submitted for consideration at a conference

  13. arXiv:2406.07732  [pdf, other

    quant-ph cs.CR

    Experimenting with D-Wave Quantum Annealers on Prime Factorization problems

    Authors: Jingwen Ding, Giuseppe Spallitta, Roberto Sebastiani

    Abstract: This paper builds on top of a paper we have published very recently, in which we have proposed a novel approach to prime factorization (PF) by quantum annealing, where 8,219,999=32,749x251 was the highest prime product we were able to factorize -- which, to the best of our knowledge is the largest number which was ever factorized by means of a quantum device. The series of annealing experiments wh… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Published on Frontiers

  14. arXiv:2406.06211  [pdf, other

    cs.CV

    iMotion-LLM: Motion Prediction Instruction Tuning

    Authors: Abdulwahab Felemban, Eslam Mohamed Bakr, Xiaoqian Shen, Jian Ding, Abduallah Mohamed, Mohamed Elhoseiny

    Abstract: We introduce iMotion-LLM: a Multimodal Large Language Models (LLMs) with trajectory prediction, tailored to guide interactive multi-agent scenarios. Different from conventional motion prediction approaches, iMotion-LLM capitalizes on textual instructions as key inputs for generating contextually relevant trajectories. By enriching the real-world driving scenarios in the Waymo Open Dataset with tex… ▽ More

    Submitted 11 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  15. arXiv:2406.03761  [pdf, ps, other

    math.NA

    A second-order accurate, original energy dissipative numerical scheme for chemotaxis and its convergence analysis

    Authors: Jie Ding, Cheng Wang, Shenggao Zhou

    Abstract: This paper proposes a second-order accurate numerical scheme for the Patlak-Keller-Segel system with various mobilities for the description of chemotaxis. Formulated in a variational structure, the entropy part is novelly discretized by a modified Crank-Nicolson approach so that the solution to the proposed nonlinear scheme corresponds to a minimizer of a convex functional. A careful theoretical a… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  16. arXiv:2406.02614  [pdf, other

    cs.LG cs.AI

    Frequency Enhanced Pre-training for Cross-city Few-shot Traffic Forecasting

    Authors: Zhanyu Liu, Jianrong Ding, Guanjie Zheng

    Abstract: The field of Intelligent Transportation Systems (ITS) relies on accurate traffic forecasting to enable various downstream applications. However, developing cities often face challenges in collecting sufficient training traffic data due to limited resources and outdated infrastructure. Recognizing this obstacle, the concept of cross-city few-shot forecasting has emerged as a viable approach. While… ▽ More

    Submitted 5 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted by ECMLPKDD 2024 (Research Track)

  17. arXiv:2406.02397  [pdf, other

    math.PR

    One-arm Probabilities for Metric Graph Gaussian Free Fields below and at the Critical Dimension

    Authors: Zhenhao Cai, Jian Ding

    Abstract: For the critical level-set of the Gaussian free field on the metric graph of $\mathbb Z^d$, we consider the one-arm probability $θ_d(N)$, i.e., the probability that the boundary of a box of side length $2N$ is connected to the center. We prove that $θ_d(N)$ is $O(N^{-\frac{d}{2}+1})$ for $3\le d\le 5$, and is $N^{-2+o(1)}$ for $d=6$. Our upper bounds match the lower bounds in a previous work by Di… ▽ More

    Submitted 12 July, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  18. arXiv:2406.02131  [pdf, other

    cs.LG cs.AI

    CondTSF: One-line Plugin of Dataset Condensation for Time Series Forecasting

    Authors: Jianrong Ding, Zhanyu Liu, Guanjie Zheng, Haiming Jin, Linghe Kong

    Abstract: Dataset condensation is a newborn technique that generates a small dataset that can be used in training deep neural networks to lower training costs. The objective of dataset condensation is to ensure that the model trained with the synthetic dataset can perform comparably to the model trained with full datasets. However, existing methods predominantly concentrate on classification tasks, posing c… ▽ More

    Submitted 11 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: 23 pages, 13 figures

  19. arXiv:2405.20694  [pdf, other

    cs.NE

    Robust Stable Spiking Neural Networks

    Authors: Jianhao Ding, Zhiyu Pan, Yujia Liu, Zhaofei Yu, Tiejun Huang

    Abstract: Spiking neural networks (SNNs) are gaining popularity in deep learning due to their low energy budget on neuromorphic hardware. However, they still face challenges in lacking sufficient robustness to guard safety-critical applications such as autonomous driving. Many studies have been conducted to defend SNNs from the threat of adversarial attacks. This paper aims to uncover the robustness of SNN… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: Accepted by ICML2024

  20. arXiv:2405.20355  [pdf, other

    cs.NE cs.CR cs.CV cs.LG

    Enhancing Adversarial Robustness in SNNs with Sparse Gradients

    Authors: Yujia Liu, Tong Bu, Jianhao Ding, Zecheng Hao, Tiejun Huang, Zhaofei Yu

    Abstract: Spiking Neural Networks (SNNs) have attracted great attention for their energy-efficient operations and biologically inspired structures, offering potential advantages over Artificial Neural Networks (ANNs) in terms of energy efficiency and interpretability. Nonetheless, similar to ANNs, the robustness of SNNs remains a challenge, especially when facing adversarial attacks. Existing techniques, wh… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: accepted by ICML 2024

  21. arXiv:2405.19856  [pdf, other

    cs.CL cs.SE

    DevEval: A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories

    Authors: Jia Li, Ge Li, Yunfei Zhao, Yongmin Li, Huanyu Liu, Hao Zhu, Lecheng Wang, Kaibo Liu, Zheng Fang, Lanshen Wang, Jiazheng Ding, Xuanming Zhang, Yuqi Zhu, Yihong Dong, Zhi Jin, Binhua Li, Fei Huang, Yongbin Li

    Abstract: How to evaluate the coding abilities of Large Language Models (LLMs) remains an open question. We find that existing benchmarks are poorly aligned with real-world code repositories and are insufficient to evaluate the coding abilities of LLMs. To address the knowledge gap, we propose a new benchmark named DevEval, which has three advances. (1) DevEval aligns with real-world repositories in multi… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Accepted by the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024). arXiv admin note: substantial text overlap with arXiv:2404.00599, arXiv:2401.06401

  22. arXiv:2405.19524  [pdf, other

    cs.CR cs.AI

    AI Risk Management Should Incorporate Both Safety and Security

    Authors: Xiangyu Qi, Yangsibo Huang, Yi Zeng, Edoardo Debenedetti, Jonas Geiping, Luxi He, Kaixuan Huang, Udari Madhushani, Vikash Sehwag, Weijia Shi, Boyi Wei, Tinghao Xie, Danqi Chen, Pin-Yu Chen, Jeffrey Ding, Ruoxi Jia, Jiaqi Ma, Arvind Narayanan, Weijie J Su, Mengdi Wang, Chaowei Xiao, Bo Li, Dawn Song, Peter Henderson, Prateek Mittal

    Abstract: The exposure of security vulnerabilities in safety-aligned language models, e.g., susceptibility to adversarial attacks, has shed light on the intricate interplay between AI safety and AI security. Although the two disciplines now come together under the overarching goal of AI risk management, they have historically evolved separately, giving rise to differing perspectives. Therefore, in this pape… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  23. arXiv:2405.18937  [pdf, other

    cs.CV cs.CL

    Kestrel: Point Grounding Multimodal LLM for Part-Aware 3D Vision-Language Understanding

    Authors: Junjie Fei, Mahmoud Ahmed, Jian Ding, Eslam Mohamed Bakr, Mohamed Elhoseiny

    Abstract: While 3D MLLMs have achieved significant progress, they are restricted to object and scene understanding and struggle to understand 3D spatial structures at the part level. In this paper, we introduce Kestrel, representing a novel approach that empowers 3D MLLMs with part-aware understanding, enabling better interpretation and segmentation grounding of 3D objects at the part level. Despite its sig… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  24. arXiv:2405.18146  [pdf, other

    cs.IR cs.LG

    Unified Low-rank Compression Framework for Click-through Rate Prediction

    Authors: Hao Yu, Minghao Fu, Jiandong Ding, Yusheng Zhou, Jianxin Wu

    Abstract: Deep Click-Through Rate (CTR) prediction models play an important role in modern industrial recommendation scenarios. However, high memory overhead and computational costs limit their deployment in resource-constrained environments. Low-rank approximation is an effective method for computer vision and natural language processing models, but its application in compressing CTR prediction models has… ▽ More

    Submitted 11 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted by KDD2024 Applied Data Science (ADS) Track

  25. arXiv:2405.16663  [pdf, ps, other

    cs.DS cs.LG stat.ML

    Private Edge Density Estimation for Random Graphs: Optimal, Efficient and Robust

    Authors: Hongjie Chen, Jingqiu Ding, Yiding Hua, David Steurer

    Abstract: We give the first polynomial-time, differentially node-private, and robust algorithm for estimating the edge density of Erdős-Rényi random graphs and their generalization, inhomogeneous random graphs. We further prove information-theoretical lower bounds, showing that the error rate of our algorithm is optimal up to logarithmic factors. Previous algorithms incur either exponential running time or… ▽ More

    Submitted 3 June, 2024; v1 submitted 26 May, 2024; originally announced May 2024.

    Comments: fix minor typos; add missing references

  26. arXiv:2405.13403  [pdf, other

    eess.IV cs.MM

    Adaptive Wireless Image Semantic Transmission and Over-The-Air Testing

    Authors: Jiarun Ding, Peiwen Jiang, Chao-Kai Wen, Shi Jin

    Abstract: Semantic communication has undergone considerable evolution due to the recent rapid development of artificial intelligence (AI), significantly enhancing both communication robustness and efficiency. Despite these advancements, most current semantic communication methods for image transmission pay little attention to the differing importance of objects and backgrounds in images. To address this iss… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  27. arXiv:2405.13058  [pdf, other

    cs.SE cs.AI cs.CY cs.LG

    The AI Community Building the Future? A Quantitative Analysis of Development Activity on Hugging Face Hub

    Authors: Cailean Osborne, Jennifer Ding, Hannah Rose Kirk

    Abstract: Open model developers have emerged as key actors in the political economy of artificial intelligence (AI), but we still have a limited understanding of collaborative practices in the open AI ecosystem. This paper responds to this gap with a three-part quantitative analysis of development activity on the Hugging Face (HF) Hub, a popular platform for building, sharing, and demonstrating models. Firs… ▽ More

    Submitted 5 June, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: 27 pages, 5 figures, 9 tables

    ACM Class: K.4.1

  28. arXiv:2405.12811  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall

    Engineering band structures of two-dimensional materials with remote moire ferroelectricity

    Authors: Jing Ding, Hanxiao Xiang, Wenqiang Zhou, Naitian Liu, Xinjie Fang, Kangyu Wang, Linfeng Wu, Kenji Watanabe, Takashi Taniguchi, Shuigang Xu

    Abstract: The stacking order and twist angle provide abundant opportunities for engineering band structures of two-dimensional materials, including the formation of moire bands, flat bands, and topologically nontrivial bands. The inversion symmetry breaking in rhombohedral-stacked transitional metal dichalcogenides (TMDCs) endows them with an interfacial ferroelectricity associated with an out-of-plane elec… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  29. arXiv:2405.12687  [pdf, other

    cond-mat.mtrl-sci

    Large band-splitting in $g$-wave type altermagnet CrSb

    Authors: Jianyang Ding, Zhicheng Jiang, Xiuhua Chen, Zicheng Tao, Zhengtai Liu, Jishan Liu, Tongrui Li, Jiayu Liu, Yichen Yang, Runfeng Zhang, Liwei Deng, Wenchuan Jing, Yu Huang, Yuming Shi, Shan Qiao, Yilin Wang, Yanfeng Guo, Donglai Feng, Dawei Shen

    Abstract: Altermagnetism (AM), a newly discovered magnetic state, ingeniously integrates the properties of ferromagnetism and antiferromagnetism, representing a significant breakthrough in the field of magnetic materials. Despite experimental verification of some typical AM materials, such as MnTe and MnTe$_2$, the pursuit of AM materials that feature larger spin splitting and higher transition temperature… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  30. arXiv:2405.12107  [pdf, other

    cs.CV cs.CL

    Imp: Highly Capable Large Multimodal Models for Mobile Devices

    Authors: Zhenwei Shao, Zhou Yu, Jun Yu, Xuecheng Ouyang, Lihao Zheng, Zhenbiao Gai, Mingyang Wang, Jiajun Ding

    Abstract: By harnessing the capabilities of large language models (LLMs), recent large multimodal models (LMMs) have shown remarkable versatility in open-world multimodal understanding. Nevertheless, they are usually parameter-heavy and computation-intensive, thus hindering their applicability in resource-constrained scenarios. To this end, several lightweight LMMs have been proposed successively to maximiz… ▽ More

    Submitted 29 May, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: fix some typos and correct a few number in the tables

  31. arXiv:2405.11353  [pdf, other

    cs.CR cs.AR

    NTTSuite: Number Theoretic Transform Benchmarks for Accelerating Encrypted Computation

    Authors: Juran Ding, Yuanzhe Liu, Lingbin Sun, Brandon Reagen

    Abstract: Privacy concerns have thrust privacy-preserving computation into the spotlight. Homomorphic encryption (HE) is a cryptographic system that enables computation to occur directly on encrypted data, providing users with strong privacy (and security) guarantees while using the same services they enjoy today unprotected. While promising, HE has seen little adoption due to extremely high computational o… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

    Comments: 8 pages, 5 figures, and two tables. To download the source code, see https://github.com/Dragon201701/NTTSuite

  32. arXiv:2405.11155  [pdf, other

    eess.SY cs.CC

    Inner-approximate Reachability Computation via Zonotopic Boundary Analysis

    Authors: Dejin Ren, Zhen Liang, Chenyu Wu, Jianqiang Ding, Taoran Wu, Bai Xue

    Abstract: Inner-approximate reachability analysis involves calculating subsets of reachable sets, known as inner-approximations. This analysis is crucial in the fields of dynamic systems analysis and control theory as it provides a reliable estimation of the set of states that a system can reach from given initial states at a specific time instant. In this paper, we study the inner-approximate reachability… ▽ More

    Submitted 21 May, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

    Comments: the extended version of the paper accepted by CAV 2024

  33. arXiv:2405.10255  [pdf, other

    cs.CV cs.RO

    When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models

    Authors: Xianzheng Ma, Yash Bhalgat, Brandon Smart, Shuai Chen, Xinghui Li, Jian Ding, Jindong Gu, Dave Zhenyu Chen, Songyou Peng, Jia-Wang Bian, Philip H Torr, Marc Pollefeys, Matthias Nießner, Ian D Reid, Angel X. Chang, Iro Laina, Victor Adrian Prisacariu

    Abstract: As large language models (LLMs) evolve, their integration with 3D spatial data (3D-LLMs) has seen rapid progress, offering unprecedented capabilities for understanding and interacting with physical spaces. This survey provides a comprehensive overview of the methodologies enabling LLMs to process, understand, and generate 3D data. Highlighting the unique advantages of LLMs, such as in-context lear… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  34. arXiv:2405.10121  [pdf, other

    cs.CL cs.MM

    Distilling Implicit Multimodal Knowledge into LLMs for Zero-Resource Dialogue Generation

    Authors: Bo Zhang, Hui Ma, Jian Ding, Jian Wang, Bo Xu, Hongfei Lin

    Abstract: Integrating multimodal knowledge into large language models (LLMs) represents a significant advancement in dialogue generation capabilities. However, the effective incorporation of such knowledge in zero-resource scenarios remains a substantial challenge due to the scarcity of diverse, high-quality dialogue datasets. To address this, we propose the Visual Implicit Knowledge Distillation Framework… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: Under Review

  35. arXiv:2405.08235  [pdf, other

    stat.ML cs.LG

    Additive-Effect Assisted Learning

    Authors: Jiawei Zhang, Yuhong Yang, Jie Ding

    Abstract: It is quite popular nowadays for researchers and data analysts holding different datasets to seek assistance from each other to enhance their modeling performance. We consider a scenario where different learners hold datasets with potentially distinct variables, and their observations can be aligned by a nonprivate identifier. Their collaboration faces the following difficulties: First, learners m… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  36. arXiv:2405.07488  [pdf, other

    cs.LG cs.RO cs.SC

    Predictive Modeling of Flexible EHD Pumps using Kolmogorov-Arnold Networks

    Authors: Yanhong Peng, Miao He, Fangchao Hu, Zebing Mao, Xia Huang, Jun Ding

    Abstract: We present a novel approach to predicting the pressure and flow rate of flexible electrohydrodynamic pumps using the Kolmogorov-Arnold Network. Inspired by the Kolmogorov-Arnold representation theorem, KAN replaces fixed activation functions with learnable spline-based activation functions, enabling it to approximate complex nonlinear functions more effectively than traditional models like Multi-L… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  37. arXiv:2405.06937  [pdf, other

    math.NA eess.SP

    High-Order Synchrosqueezed Chirplet Transforms for Multicomponent Signal Analysis

    Authors: Yi-Ju Yen, De-Yan Lu, Sing-Yuan Yeh, Jian-Jiun Ding, Chun-Yen Shen

    Abstract: This study focuses on the analysis of signals containing multiple components with crossover instantaneous frequencies (IF). This problem was initially solved with the chirplet transform (CT). Also, it can be sharpened by adding the synchrosqueezing step, which is called the synchrosqueezed chirplet transform (SCT). However, we found that the SCT goes wrong with the high chirp modulation signal due… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

    MSC Class: 65T99; 42C99; 42a38

  38. arXiv:2405.03460  [pdf, other

    math.PR

    Polynomial lower bound on the effective resistance for the one-dimensional critical long-range percolation

    Authors: Jian Ding, Zherui Fan, Lu-Jing Huang

    Abstract: In this work, we study the critical long-range percolation on $\mathbb{Z}$, where an edge connects $i$ and $j$ independently with probability $1-\exp\{-β|i-j|^{-2}\}$ for some fixed $β>0$. Viewing this as a random electric network where each edge has a unit conductance, we show that with high probability the effective resistances from the origin 0 to $[-N, N]^c$ and from the interval $[-N,N]$ to… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 26 pages, 10 figures

    MSC Class: 60K35; 82B27; 82B43

  39. arXiv:2405.02550  [pdf, other

    physics.ins-det hep-ex

    Gain suppression study on LGADs at the CENPA tandem accelerator

    Authors: S. Braun, Q. Buat, J. Ding, P. Kammel, S. M. Mazza, F. McKinney-Martinez, A. Molnar, C. Lansdell, J. Ott, A. Seiden, B. Schumm, Y. Zhao

    Abstract: Low-Gain Avalanche Detectors (LGADs) are a type of thin silicon detector with a highly doped gain layer that provides moderate internal signal amplification. One recent challenge in the use of LGADs, studied by several research groups, is the gain suppression mechanism for large localized charge deposits. Using the CENPA Tandem accelerator at the University of Washington, the response of the LGADs… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  40. arXiv:2404.19563  [pdf, other

    cs.CL

    RepEval: Effective Text Evaluation with LLM Representation

    Authors: Shuqian Sheng, Yi Xu, Tianhang Zhang, Zanwei Shen, Luoyi Fu, Jiaxin Ding, Lei Zhou, Xinbing Wang, Chenghu Zhou

    Abstract: Automatic evaluation metrics for generated texts play an important role in the NLG field, especially with the rapid growth of LLMs. However, existing metrics are often limited to specific scenarios, making it challenging to meet the evaluation requirements of expanding LLM applications. Therefore, there is a demand for new, flexible, and effective metrics. In this study, we introduce RepEval, the… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

  41. arXiv:2404.17456  [pdf, other

    cs.NE

    Converting High-Performance and Low-Latency SNNs through Explicit Modelling of Residual Error in ANNs

    Authors: Zhipeng Huang, Jianhao Ding, Zhiyu Pan, Haoran Li, Ying Fang, Zhaofei Yu, Jian K. Liu

    Abstract: Spiking neural networks (SNNs) have garnered interest due to their energy efficiency and superior effectiveness on neuromorphic chips compared with traditional artificial neural networks (ANNs). One of the mainstream approaches to implementing deep SNNs is the ANN-SNN conversion, which integrates the efficient training strategy of ANNs with the energy-saving potential and fast inference capability… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  42. arXiv:2404.16439  [pdf, ps, other

    math.CV math.FA

    Toeplitz Operators on Weighted Bergman Spaces over Tubular Domains

    Authors: Lvchang Li, Jiaqing Ding, Haichou Li

    Abstract: In this paper, we mainly study the necessary and sufficient conditions for the boundedness and compactness of Toeplitz operators on weighted Bergman spaces over a tubular domains by using the Carlson measures on tubular domains. We also give some related results about Carlson measures.

    Submitted 25 April, 2024; originally announced April 2024.

  43. arXiv:2404.14598  [pdf

    cond-mat.mtrl-sci

    Dynamic Nanodomains Dictate Macroscopic Properties in Lead Halide Perovskites

    Authors: Milos Dubajic, James R. Neilson, Johan Klarbring, Xia Liang, Stephanie A. Boer, Kirrily C. Rule, Josie E. Auckett, Leilei Gu, Xuguang Jia, Andreas Pusch, Ganbaatar Tumen-Ulzii, Qiyuan Wu, Thomas A. Selby, Yang Lu, Julia C. Trowbridge, Eve M. Mozur, Arianna Minelli, Nikolaj Roth, Kieran W. P. Orr, Arman Mahboubi Soufiani, Simon Kahmann, Irina Kabakova, Jianning Ding, Tom Wu, Gavin J. Conibeer , et al. (4 additional authors not shown)

    Abstract: Empirical A-site cation substitution has advanced the stability and efficiency of hybrid organic-inorganic lead halide perovskites solar cells and the functionality of X-ray detectors. Yet, the fundamental mechanisms underpinning their unique performance remain elusive. This multi-modal study unveils the link between nanoscale structural dynamics and macroscopic optoelectronic properties in these… ▽ More

    Submitted 1 May, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: Main text and supplementary information. Main text 16 pages, 4 figures. Supplementary information 42 pages, 36 figures

  44. arXiv:2404.13844  [pdf, other

    cs.LG cs.AI

    ColA: Collaborative Adaptation with Gradient Learning

    Authors: Enmao Diao, Qi Le, Suya Wu, Xinran Wang, Ali Anwar, Jie Ding, Vahid Tarokh

    Abstract: A primary function of back-propagation is to compute both the gradient of hidden representations and parameters for optimization with gradient descent. Training large models requires high computational costs due to their vast parameter sizes. While Parameter-Efficient Fine-Tuning (PEFT) methods aim to train smaller auxiliary models to save computational space, they still present computational over… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

  45. arXiv:2404.10528  [pdf, other

    cs.MM

    AllTheDocks road safety dataset: A cyclist's perspective and experience

    Authors: Chia-Yen Chiang, Ruikang Zhong, Jennifer Ding, Joseph Wood, Stephen Bee, Mona Jaber

    Abstract: Active travel is an essential component in intelligent transportation systems. Cycling, as a form of active travel, shares the road space with motorised traffic which often affects the cyclists' safety and comfort and therefore peoples' propensity to uptake cycling instead of driving. This paper presents a unique dataset, collected by cyclists across London, that includes video footage, accelerome… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

  46. arXiv:2404.10318  [pdf, other

    cs.CV

    SRGS: Super-Resolution 3D Gaussian Splatting

    Authors: Xiang Feng, Yongbo He, Yubo Wang, Yan Yang, Wen Li, Yifei Chen, Zhenzhong Kuang, Jiajun ding, Jianping Fan, Yu Jun

    Abstract: Recently, 3D Gaussian Splatting (3DGS) has gained popularity as a novel explicit 3D representation. This approach relies on the representation power of Gaussian primitives to provide a high-quality rendering. However, primitives optimized at low resolution inevitably exhibit sparsity and texture deficiency, posing a challenge for achieving high-resolution novel view synthesis (HRNVS). To address t… ▽ More

    Submitted 18 June, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: The first to focus on the HRNVS of 3DGS

  47. arXiv:2404.09483  [pdf, other

    astro-ph.CO

    Deep Learning for Cosmological Parameter Inference from Dark Matter Halo Density Field

    Authors: Zhiwei Min, Xu Xiao, Jiacheng Ding, Liang Xiao, Jie Jiang, Donglin Wu, Qiufan Lin, Yin Li, Yang Wang, Shuai Liu, Zhixin Chen, Xiangru Li, Jinqu Zhang, Le Zhang, Xiao-Dong Li

    Abstract: We propose a lightweight deep convolutional neural network (lCNN) to estimate cosmological parameters from simulated three-dimensional DM halo distributions and associated statistics. The training dataset comprises 2000 realizations of a cubic box with a side length of 1000 $h^{-1}{\rm Mpc}$, and interpolated over a cubic grid of $300^3$ voxels, with each simulation produced using $512^3$ DM parti… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 10 pages,9 figures

  48. arXiv:2404.07493  [pdf, other

    cs.LG cs.AI

    Characterizing the Influence of Topology on Graph Learning Tasks

    Authors: Kailong Wu, Yule Xie, Jiaxin Ding, Yuxiang Ren, Luoyi Fu, Xinbing Wang, Chenghu Zhou

    Abstract: Graph neural networks (GNN) have achieved remarkable success in a wide range of tasks by encoding features combined with topology to create effective representations. However, the fundamental problem of understanding and analyzing how graph topology influences the performance of learning models on downstream tasks has not yet been well understood. In this paper, we propose a metric, TopoInf, which… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  49. Deep Reinforcement Learning Based Toolpath Generation for Thermal Uniformity in Laser Powder Bed Fusion Process

    Authors: Mian Qin, Junhao Ding, Shuo Qu, Xu Song, Charlie C. L. Wang, Wei-Hsin Liao

    Abstract: Laser powder bed fusion (LPBF) is a widely used metal additive manufacturing technology. However, the accumulation of internal residual stress during printing can cause significant distortion and potential failure. Although various scan patterns have been studied to reduce possible accumulated stress, such as zigzag scanning vectors with changing directions or a chessboard-based scan pattern with… ▽ More

    Submitted 16 February, 2024; originally announced April 2024.

    Journal ref: Additive Manufacturing, vol.79, 103937 (12 pages), January 2024

  50. arXiv:2404.04337  [pdf, ps, other

    hep-ph hep-ex

    Invisible and Semi-invisible Decays of Bottom Baryons

    Authors: Yong Zheng, Jian-Nan Ding, Dong-Hao Li, Lei-Yi Li, Cai-Dian Lü, Fu-Sheng Yu

    Abstract: The similar densities of dark matter and baryons in the universe imply that they might arise from the same ultraviolet model. The B-Mesogenesis, which assumes dark matter is charged under the baryon number, attempts to simultaneously explain the origin of baryon asymmetry and dark matter in the universe. In particular, the B-Mesogenesis might induce bottom-baryon decays into invisible or semi-invi… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: 24 pages, 7 figures