Skip to main content

Showing 1–50 of 769 results for author: Ding, H

  1. arXiv:2407.13761  [pdf, other

    cs.CV

    SegPoint: Segment Any Point Cloud via Large Language Model

    Authors: Shuting He, Henghui Ding, Xudong Jiang, Bihan Wen

    Abstract: Despite significant progress in 3D point cloud segmentation, existing methods primarily address specific tasks and depend on explicit instructions to identify targets, lacking the capability to infer and understand implicit user intentions in a unified framework. In this work, we propose a model, called SegPoint, that leverages the reasoning capabilities of a multi-modal Large Language Model (LLM)… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: ECCV 2024, Project Page: https://heshuting555.github.io/SegPoint

  2. arXiv:2407.13324  [pdf, other

    astro-ph.IM astro-ph.HE

    A millisecond pulsar position determined to 0.2 milliarcsecond precision with VLBI

    Authors: Hao Ding, Adam T. Deller, Paulo C. C. Freire, Leonid Petrov

    Abstract: Precise millisecond pulsar (MSP) positions determined with very long baseline interferometry (VLBI) hold the key to building the connection between the kinematic and dynamic reference frames respectively used by VLBI and pulsar timing. The frame connection would provide an important pathway to examining the planetary ephemerides used in pulsar timing, and potentially enhancing the sensitivities of… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: 12 pages, 3 figures, 6 tables, submitted

  3. arXiv:2407.11906  [pdf, other

    cs.CV cs.RO

    SegSTRONG-C: Segmenting Surgical Tools Robustly On Non-adversarial Generated Corruptions -- An EndoVis'24 Challenge

    Authors: Hao Ding, Tuxun Lu, Yuqian Zhang, Ruixing Liang, Hongchao Shu, Lalithkumar Seenivasan, Yonghao Long, Qi Dou, Cong Gao, Mathias Unberath

    Abstract: Accurate segmentation of tools in robot-assisted surgery is critical for machine perception, as it facilitates numerous downstream tasks including augmented reality feedback. While current feed-forward neural network-based methods exhibit excellent segmentation performance under ideal conditions, these models have proven susceptible to even minor corruptions, significantly impairing the model's pe… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

  4. arXiv:2407.09694  [pdf, other

    cs.CV

    Divide and Fuse: Body Part Mesh Recovery from Partially Visible Human Images

    Authors: Tianyu Luan, Zhongpai Gao, Luyuan Xie, Abhishek Sharma, Hao Ding, Benjamin Planche, Meng Zheng, Ange Lou, Terrence Chen, Junsong Yuan, Ziyan Wu

    Abstract: We introduce a novel bottom-up approach for human body mesh reconstruction, specifically designed to address the challenges posed by partial visibility and occlusion in input images. Traditional top-down methods, relying on whole-body parametric models like SMPL, falter when only a small part of the human is visible, as they require visibility of most of the human body for accurate mesh reconstruc… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: Accepted by ECCV2024

  5. arXiv:2407.09335  [pdf, other

    hep-lat hep-ph

    Strangeness-Correlations on the pseudo-critical line in (2+1)-flavor QCD

    Authors: D. Bollweg, H. -T. Ding, J. Goswami, F. Karsch, Swagato Mukherjee, P. Petreczky, C. Schmidt

    Abstract: We present some lattice QCD results on first ($χ_1^i$) and second ($χ_2^i$) cumulants of and correlations ($χ_{11}^{ij}$) among net baryon-number ($B$), strangeness ($S$) and electric charge ($Q$) along the pseudo-critical line ($T_{pc}(μ_B)$) in the temperature ($T$)--baryon chemical potential ($μ_B$) phase diagram of (2+1)-flavor QCD. We point out that violations of the isospin symmetric limit o… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 9 pages, 6 figures

  6. arXiv:2407.07231  [pdf, ps, other

    quant-ph

    Reproducing Kernel Hilbert Space Approach to Non-Markovian Quantum Stochastic Models

    Authors: John E. Gough, Haijin Ding, Nina H. Amini

    Abstract: We give a derivation of the non-Markovian quantum state diffusion equation of Di{ó}si and Strunz starting from a model of a quantum mechanical system coupled to a bosonic bath. We show that the complex trajectories arises as a consequence of using the Bargmann-Segal (complex wave) representation of the bath. In particular, we construct a reproducing kernel Hilbert space for the bath auto-correlati… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 16 pages

  7. arXiv:2407.03813  [pdf, other

    cs.CV

    PECTP: Parameter-Efficient Cross-Task Prompts for Incremental Vision Transformer

    Authors: Qian Feng, Hanbin Zhao, Chao Zhang, Jiahua Dong, Henghui Ding, Yu-Gang Jiang, Hui Qian

    Abstract: Incremental Learning (IL) aims to learn deep models on sequential tasks continually, where each new task includes a batch of new classes and deep models have no access to task-ID information at the inference time. Recent vast pre-trained models (PTMs) have achieved outstanding performance by prompt technique in practical IL without the old samples (rehearsal-free) and with a memory constraint (mem… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  8. arXiv:2407.03516  [pdf, other

    hep-lat hep-ph nucl-ex nucl-th

    Three-dimensional Imaging of Pion using Lattice QCD: Generalized Parton Distributions

    Authors: Heng-Tong Ding, Xiang Gao, Swagato Mukherjee, Peter Petreczky, Qi Shi, Sergey Syritsyn, Yong Zhao

    Abstract: In this work, we report a lattice calculation of $x$-dependent valence pion generalized parton distributions (GPDs) at zero skewness with multiple values of the momentum transfer $-t$. The calculations are based on an $N_f=2+1$ gauge ensemble of highly improved staggered quarks with Wilson-Clover valence fermion. The lattice spacing is 0.04 fm, and the pion valence mass is tuned to be 300 MeV. We… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 33 pages, 14 figures

  9. arXiv:2406.17005  [pdf, other

    cs.CV

    PVUW 2024 Challenge on Complex Video Understanding: Methods and Results

    Authors: Henghui Ding, Chang Liu, Yunchao Wei, Nikhila Ravi, Shuting He, Song Bai, Philip Torr, Deshui Miao, Xin Li, Zhenyu He, Yaowei Wang, Ming-Hsuan Yang, Zhensong Xu, Jiangtao Yao, Chengjing Wu, Ting Liu, Luoqi Liu, Xinyu Liu, Jing Zhang, Kexin Zhang, Yuting Yang, Licheng Jiao, Shuyuan Yang, Mingqi Gao, Jingnan Luo , et al. (12 additional authors not shown)

    Abstract: Pixel-level Video Understanding in the Wild Challenge (PVUW) focus on complex video understanding. In this CVPR 2024 workshop, we add two new tracks, Complex Video Object Segmentation Track based on MOSE dataset and Motion Expression guided Video Segmentation track based on MeViS dataset. In the two new tracks, we provide additional videos and annotations that feature challenging elements, such as… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: MOSE Challenge: https://henghuiding.github.io/MOSE/ChallengeCVPR2024, MeViS Challenge: https://henghuiding.github.io/MeViS/ChallengeCVPR2024

  10. arXiv:2406.15407  [pdf

    physics.ins-det

    Preliminary Design of a General Electronics Platform for Accelerator Facilities

    Authors: Jinfu Zhu, Hongli Ding, Haokui Li, Qiaoye Ran, Xiwen Dai, Wei Li, Jiawei Han, Yue Li, Zhiyuan Zhang, Weixin Qiu, Weiqing Zhang

    Abstract: Many accelerators require considerable electronic systems for tests, verification, and operation. In Shenzhen Superconducting Soft X-ray Free Electron Laser (S3FEL), to meet the early tests and verification of various systems, save development expenses, and improve the reusability of hardware, firmware, and software systems, we have considered the needs of each system and preliminarily designed a… ▽ More

    Submitted 11 May, 2024; originally announced June 2024.

    Comments: 3 pages, 4 figures, 2024 IEEE Real-Time Conference

  11. arXiv:2406.14555  [pdf, other

    cs.CV

    A Survey of Multimodal-Guided Image Editing with Text-to-Image Diffusion Models

    Authors: Xincheng Shuai, Henghui Ding, Xingjun Ma, Rongcheng Tu, Yu-Gang Jiang, Dacheng Tao

    Abstract: Image editing aims to edit the given synthetic or real image to meet the specific requirements from users. It is widely studied in recent years as a promising and challenging field of Artificial Intelligence Generative Content (AIGC). Recent significant advancement in this field is based on the development of text-to-image (T2I) diffusion models, which generate images according to text prompts. Th… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Project Page: https://github.com/xinchengshuai/Awesome-Image-Editing

  12. arXiv:2406.08750  [pdf

    eess.SY

    The expressway network design problem for multiple urban subregions based on the macroscopic fundamental diagram

    Authors: Yunran Di, Weihua Zhang, Haotian Shi, Heng Ding, Jinbiao Huo, Bin Ran

    Abstract: As urbanization advances, cities are expanding, leading to a more decentralized urban structure and longer average commuting durations. The construction of an urban expressway system emerges as a critical strategy to tackle this challenge. However, the traditional link-level network design method faces modeling and solution challenges when dealing with the large-scale expressway network design pro… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  13. arXiv:2406.04674  [pdf, other

    astro-ph.HE

    VLBA Astrometry of the Fastest-spinning Magnetar Swift J1818.0-1607: A Large Trigonometric Distance & A Small Transverse Velocity

    Authors: Hao Ding, Marcus E. Lower, Adam T. Deller, Ryan M. Shannon, Fernando Camilo, John Sarkissian

    Abstract: In addition to being the most magnetic objects in the known universe, magnetars are the only objects observed to generate fast-radio-burst-like emissions. The formation mechanism of magnetars is still highly debated, and may potentially be probed with the magnetar velocity distribution. We carried out a 3-year-long astrometric campaign on Swift J1818.0-1607 -- the fastest-spinning magnetar, using… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 13 pages, 4 figures, 4 tables, accepted for publication in ApJL

  14. arXiv:2406.01016  [pdf, ps, other

    eess.SY

    Sensing, Communication, and Control Co-design for Energy Efficient Satellite-UAV Networks

    Authors: Tianhao. Liang, Huahao. Ding, Yuqi. Ping, Bin. Cao, Tingting. Zhang, Qinyu. Zhang

    Abstract: Traditional terrestrial communication infrastructures often fail to collect the timely information from Internet of Thing (IoT) devices in remote areas. To address this challenge, we investigate a Satellite-unmanned aerial vehicles (UAV) integrated Non-terrestrial network (NTN), where the UAV is controlled by remote control center via UAV-to-Satellite connections. To maximize the energy efficiency… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  15. arXiv:2405.20282  [pdf, other

    cs.CV

    SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow

    Authors: Chaoyang Wang, Xiangtai Li, Lu Qi, Henghui Ding, Yunhai Tong, Ming-Hsuan Yang

    Abstract: Semantic segmentation and semantic image synthesis are two representative tasks in visual perception and generation. While existing methods consider them as two distinct tasks, we propose a unified diffusion-based framework (SemFlow) and model them as a pair of reverse problems. Specifically, motivated by rectified flow theory, we train an ordinary differential equation (ODE) model to transport be… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  16. arXiv:2405.16840  [pdf, ps, other

    math.OC

    Delay Performance Analysis of Delay-Deterministic Wireless Networks with Infinite and Finite Blocklength Transmission

    Authors: Hanxue Ding, Shaoyi Xu, Ziheng Xu, Rongtao Xu, Zonghui Li, Junhui Zhao

    Abstract: In order to achieve stable and reliable industrial manufacturing, wireless networks must meet the stringent communication requirements of industrial automation, particularly the need for deterministic low latency communication. The limited wireless resources and time-varying fading channel contribute to the random fluctuations of transmission delay, making it challenging to realize delay-determini… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  17. arXiv:2405.15349  [pdf, other

    cs.CL

    UnKE: Unstructured Knowledge Editing in Large Language Models

    Authors: Jingcheng Deng, Zihao Wei, Liang Pang, Hanxing Ding, Huawei Shen, Xueqi Cheng

    Abstract: Recent knowledge editing methods have primarily focused on modifying structured knowledge in large language models, heavily relying on the assumption that structured knowledge is stored as key-value pairs locally in MLP layers or specific neurons. However, this task setting overlooks the fact that a significant portion of real-world knowledge is stored in an unstructured format, characterized by l… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  18. arXiv:2405.11448  [pdf, other

    cs.CV

    Cross-Domain Knowledge Distillation for Low-Resolution Human Pose Estimation

    Authors: Zejun Gu, Zhong-Qiu Zhao, Henghui Ding, Hao Shen, Zhao Zhang, De-Shuang Huang

    Abstract: In practical applications of human pose estimation, low-resolution inputs frequently occur, and existing state-of-the-art models perform poorly with low-resolution images. This work focuses on boosting the performance of low-resolution models by distilling knowledge from a high-resolution model. However, we face the challenge of feature size mismatch and class number mismatch when applying knowled… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: 11 pages, 5 figures

  19. arXiv:2405.09796  [pdf

    physics.acc-ph

    Prototype Design of a Digital Low-level RF System for S-band Deflectors

    Authors: J. F. Zhu, H. L. Ding, H. K. Li, Y. Li, X. W. Dai, J. W. Han, W. Q. Zhang

    Abstract: S-band deflectors are generally operated on pulsed mode for beam diagnosis. We plan to deploy 5 S-band (2997 MHz) deflectors to accurately measure the longitudinal time distribution of ultra-short electron beam pulses in Shenzhen Superconducting Soft X-ray Free Electron Laser (S3FEL). A microwave system of one deflector consists of a low-level RF system (LLRF), a solid-state amplifier, waveguide c… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: 3 pages, 5 figures, IPAC'24 - 15th International Particle Accelerator Conference

  20. Towards Metric DBSCAN: Exact, Approximate, and Streaming Algorithms

    Authors: Guanlin Mo, Shihong Song, Hu Ding

    Abstract: DBSCAN is a popular density-based clustering algorithm that has many different applications in practice. However, the running time of DBSCAN in high-dimensional space or general metric space ({\em e.g.,} clustering a set of texts by using edit distance) can be as large as quadratic in the input size. Moreover, most of existing accelerating techniques for DBSCAN are only available for low-dimension… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

  21. arXiv:2405.06125  [pdf

    eess.SY

    Cooperative Route Guidance and Flow Control for Mixed Road Networks Comprising Expressway and Arterial Network

    Authors: Yunran Di, Haotian Shi, Weihua Zhang, Heng Ding, Xiaoyan Zheng, Bin Ran

    Abstract: Facing the congestion challenges of mixed road networks comprising expressways and arterial road networks, traditional control solutions fall short. To effectively alleviate traffic congestion in mixed road networks, it is crucial to clear the interaction between expressways and arterial networks and achieve orderly coordination between them. This study employs the multi-class cell transmission mo… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  22. arXiv:2405.04434  [pdf, other

    cs.CL cs.AI

    DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

    Authors: DeepSeek-AI, Aixin Liu, Bei Feng, Bin Wang, Bingxuan Wang, Bo Liu, Chenggang Zhao, Chengqi Dengr, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Hanwei Xu, Hao Yang, Haowei Zhang, Honghui Ding , et al. (132 additional authors not shown)

    Abstract: We present DeepSeek-V2, a strong Mixture-of-Experts (MoE) language model characterized by economical training and efficient inference. It comprises 236B total parameters, of which 21B are activated for each token, and supports a context length of 128K tokens. DeepSeek-V2 adopts innovative architectures including Multi-head Latent Attention (MLA) and DeepSeekMoE. MLA guarantees efficient inference… ▽ More

    Submitted 19 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

  23. arXiv:2405.03914  [pdf, other

    astro-ph.HE astro-ph.GA gr-qc

    VLBA Astrometry of the Galactic Double Neutron Stars PSR J0509+3801 and PSR J1930-1852: A Preliminary Transverse Velocity Distribution of Double Neutron Stars and Its Implications

    Authors: Hao Ding, Adam T. Deller, Joseph K. Swiggum, Ryan S. Lynch, Shami Chatterjee, Thomas M. Tauris

    Abstract: The mergers of double neutron stars (DNSs) systems are believed to drive the majority of short $γ$-ray bursts (SGRBs), while also serving as production sites of heavy r-process elements. Despite being key to i) confirming the nature of the extragalactic SGRBs, ii) addressing the poorly-understood r-process enrichment in the ultra-faint dwarf galaxies (UFDGs), and iii) probing the formation process… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 17 pages, 3 figures, 4 tables, accepted for publication in ApJ

  24. arXiv:2404.17879  [pdf, other

    quant-ph physics.atm-clus

    Trapping polar molecules by surface acoustic waves

    Authors: Haijin Ding, Re-Bing Wu, Yu-xi Liu

    Abstract: We propose a method to trap polar molecules with the electrical force induced by the surface acoustic wave (SAW) on piezoelectric materials. In this approach, the electrical force is perpendicular to the moving direction of the polar molecules, and is used to control the positions of trapped polar molecules in the direction orthogonal to the acoustic transmission. By virtue of an external electric… ▽ More

    Submitted 7 June, 2024; v1 submitted 27 April, 2024; originally announced April 2024.

    Comments: 18 pages, 10 figures

  25. arXiv:2404.17287  [pdf, other

    cs.CL

    When to Trust LLMs: Aligning Confidence with Response Quality

    Authors: Shuchang Tao, Liuyi Yao, Hanxing Ding, Yuexiang Xie, Qi Cao, Fei Sun, Jinyang Gao, Huawei Shen, Bolin Ding

    Abstract: Despite the success of large language models (LLMs) in natural language generation, much evidence shows that LLMs may produce incorrect or nonsensical text. This limitation highlights the importance of discerning when to trust LLMs, especially in safety-critical domains. Existing methods often express reliability by confidence level, however, their effectiveness is limited by the lack of objective… ▽ More

    Submitted 9 June, 2024; v1 submitted 26 April, 2024; originally announced April 2024.

    Comments: Accepted by ACL 2024

  26. arXiv:2404.15628  [pdf, other

    quant-ph

    Identifying non-Hermitian critical points with quantum metric

    Authors: Jun-Feng Ren, Jing Li, Hai-Tao Ding, Dan-Wei Zhang

    Abstract: The geometric properties of quantum states is fully encoded by the quantum geometric tensor. The real and imaginary parts of the quantum geometric tensor are the quantum metric and Berry curvature, which characterize the distance and phase difference between two nearby quantum states in Hilbert space, respectively. For conventional Hermitian quantum systems, the quantum metric corresponds to the f… ▽ More

    Submitted 1 May, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

    Comments: Under Review

  27. arXiv:2404.13401  [pdf, other

    cs.LG

    Approximate Algorithms For $k$-Sparse Wasserstein Barycenter With Outliers

    Authors: Qingyuan Yang, Hu Ding

    Abstract: Wasserstein Barycenter (WB) is one of the most fundamental optimization problems in optimal transportation. Given a set of distributions, the goal of WB is to find a new distribution that minimizes the average Wasserstein distance to them. The problem becomes even harder if we restrict the solution to be ``$k$-sparse''. In this paper, we study the $k$-sparse WB problem in the presence of outliers,… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

  28. arXiv:2404.10830  [pdf, other

    cs.CL cs.AI cs.LG

    Fewer Truncations Improve Language Modeling

    Authors: Hantian Ding, Zijian Wang, Giovanni Paolini, Varun Kumar, Anoop Deoras, Dan Roth, Stefano Soatto

    Abstract: In large language model training, input documents are typically concatenated together and then split into sequences of equal length to avoid padding tokens. Despite its efficiency, the concatenation approach compromises data integrity -- it inevitably breaks many documents into incomplete pieces, leading to excessive truncations that hinder the model from learning to compose logically coherent and… ▽ More

    Submitted 2 May, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: ICML 2024

  29. arXiv:2404.09586  [pdf, other

    cs.CV cs.LG

    Mitigating the Curse of Dimensionality for Certified Robustness via Dual Randomized Smoothing

    Authors: Song Xia, Yi Yu, Xudong Jiang, Henghui Ding

    Abstract: Randomized Smoothing (RS) has been proven a promising method for endowing an arbitrary image classifier with certified robustness. However, the substantial uncertainty inherent in the high-dimensional isotropic Gaussian noise imposes the curse of dimensionality on RS. Specifically, the upper bound of ${\ell_2}$ certified robustness radius provided by RS exhibits a diminishing trend with the expans… ▽ More

    Submitted 15 June, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

    Comments: This paper has been accepted to the International Conference on Learning Representations (ICLR), 2024

  30. arXiv:2404.09417  [pdf

    physics.geo-ph

    Satellite observations reveal shorter periodic inner core oscillation

    Authors: Yachong An, Hao Ding, Fred D. Richards, Weiping Jiang, Jiancheng Li, Wenbin Shen

    Abstract: Detecting the Earth's inner core motions relative to the mantle presents a considerable challenge due to their indirect accessibility. Seismological observations initially provided evidence for differential/super-rotation of the inner core, but recently demonstrated a possibly about 70-year periodic oscillation. The contrasting results underscore the ongoing enigma surrounding inner core motion, l… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: 32 pages, 8 figures

  31. arXiv:2404.08562  [pdf, other

    cs.CR cs.AI cs.LG

    Dynamic Neural Control Flow Execution: An Agent-Based Deep Equilibrium Approach for Binary Vulnerability Detection

    Authors: Litao Li, Steven H. H. Ding, Andrew Walenstein, Philippe Charland, Benjamin C. M. Fung

    Abstract: Software vulnerabilities are a challenge in cybersecurity. Manual security patches are often difficult and slow to be deployed, while new vulnerabilities are created. Binary code vulnerability detection is less studied and more complex compared to source code, and this has important practical implications. Deep learning has become an efficient and powerful tool in the security domain, where it pro… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

  32. arXiv:2404.04990  [pdf, other

    cs.CL

    MLaKE: Multilingual Knowledge Editing Benchmark for Large Language Models

    Authors: Zihao Wei, Jingcheng Deng, Liang Pang, Hanxing Ding, Huawei Shen, Xueqi Cheng

    Abstract: The extensive utilization of large language models (LLMs) underscores the crucial necessity for precise and contemporary knowledge embedded within their intrinsic parameters. Existing research on knowledge editing primarily concentrates on monolingual scenarios, neglecting the complexities presented by multilingual contexts and multi-hop reasoning. To address these challenges, our study introduces… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  33. arXiv:2404.04480  [pdf

    cond-mat.mtrl-sci

    Possible charge density wave induced lattice distortion in ferromagnetic FeGe film

    Authors: Guangdong Nie, Guanghui Han, Erfa S. Z., Shijian Chen, Hao Ding, Fangdong Tang, Licong Peng, Young Sun, Deshun Hong

    Abstract: Binary compound FeGe hosts multiple structures, where skyrmion lattice emerges in the chiral B20 phase and antiferromagnet with charge density wave shows up in the hexagonal phase. Here, we synthesized monoclinic FeGe films which are ferromagnetic with Curie temperature as high as 800 K. By low temperature transmission electron microscope, lattice reconstructions in both real and reciprocal space… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  34. arXiv:2404.04412  [pdf, other

    hep-lat hep-ph nucl-ex nucl-th

    QCD Predictions for Meson Electromagnetic Form Factors at High Momenta: Testing Factorization in Exclusive Processes

    Authors: Heng-Tong Ding, Xiang Gao, Andrew D. Hanlon, Swagato Mukherjee, Peter Petreczky, Qi Shi, Sergey Syritsyn, Rui Zhang, Yong Zhao

    Abstract: We report the first lattice QCD computation of pion and kaon electromagnetic form factors, $F_M(Q^2)$, at large momentum transfer up to 10 and 28 $\mathrm{GeV}^2$, respectively. Utilizing physical masses and two fine lattices, we achieve good agreement with JLab experimental results at $Q^2 \lesssim 4~\mathrm{GeV}^2$. For $Q^2 \gtrsim 4~\mathrm{GeV}^2$, our results provide $\textit{ab-initio}$ QCD… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: 25 pages, 9 figures

  35. arXiv:2404.03645  [pdf, other

    cs.CV

    Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation

    Authors: Shuting He, Henghui Ding

    Abstract: Referring video segmentation relies on natural language expressions to identify and segment objects, often emphasizing motion clues. Previous works treat a sentence as a whole and directly perform identification at the video-level, mixing up static image-level cues with temporal motion cues. However, image-level features cannot well comprehend motion cues in sentences, and static cues are not cruc… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: CVPR 2024, code: https://github.com/heshuting555/DsHmp

  36. arXiv:2404.02187  [pdf

    cs.LG cs.AI

    A Generative Deep Learning Approach for Crash Severity Modeling with Imbalanced Data

    Authors: Junlan Chen, Ziyuan Pu, Nan Zheng, Xiao Wen, Hongliang Ding, Xiucheng Guo

    Abstract: Crash data is often greatly imbalanced, with the majority of crashes being non-fatal crashes, and only a small number being fatal crashes due to their rarity. Such data imbalance issue poses a challenge for crash severity modeling since it struggles to fit and interpret fatal crash outcomes with very limited samples. Usually, such data imbalance issues are addressed by data resampling methods, suc… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

  37. arXiv:2404.00335  [pdf, other

    cs.CV

    Learning Trimaps via Clicks for Image Matting

    Authors: Chenyi Zhang, Yihan Hu, Henghui Ding, Humphrey Shi, Yao Zhao, Yunchao Wei

    Abstract: Despite significant advancements in image matting, existing models heavily depend on manually-drawn trimaps for accurate results in natural image scenarios. However, the process of obtaining trimaps is time-consuming, lacking user-friendliness and device compatibility. This reliance greatly limits the practical application of all trimap-based matting methods. To address this issue, we introduce Cl… ▽ More

    Submitted 6 April, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

  38. arXiv:2403.18811  [pdf, other

    cs.CV cs.GR cs.SD eess.AS

    Duolando: Follower GPT with Off-Policy Reinforcement Learning for Dance Accompaniment

    Authors: Li Siyao, Tianpei Gu, Zhitao Yang, Zhengyu Lin, Ziwei Liu, Henghui Ding, Lei Yang, Chen Change Loy

    Abstract: We introduce a novel task within the field of 3D dance generation, termed dance accompaniment, which necessitates the generation of responsive movements from a dance partner, the "follower", synchronized with the lead dancer's movements and the underlying musical rhythm. Unlike existing solo or group dance generation tasks, a duet dance scenario entails a heightened degree of interaction between t… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: ICLR 2024

  39. arXiv:2403.15484  [pdf, other

    cs.CL cs.LG

    RakutenAI-7B: Extending Large Language Models for Japanese

    Authors: Rakuten Group, Aaron Levine, Connie Huang, Chenguang Wang, Eduardo Batista, Ewa Szymanska, Hongyi Ding, Hou Wei Chou, Jean-François Pessiot, Johanes Effendi, Justin Chiu, Kai Torben Ohlhus, Karan Chopra, Keiji Shinzato, Koji Murakami, Lee Xiong, Lei Chen, Maki Kubota, Maksim Tkachenko, Miroku Lee, Naoki Takahashi, Prathyusha Jwalapuram, Ryutaro Tatsushima, Saurabh Jain, Sunil Kumar Yadav , et al. (5 additional authors not shown)

    Abstract: We introduce RakutenAI-7B, a suite of Japanese-oriented large language models that achieve the best performance on the Japanese LM Harness benchmarks among the open 7B models. Along with the foundation model, we release instruction- and chat-tuned models, RakutenAI-7B-instruct and RakutenAI-7B-chat respectively, under the Apache 2.0 license.

    Submitted 21 March, 2024; originally announced March 2024.

  40. arXiv:2403.14249  [pdf, other

    quant-ph cond-mat.mes-hall

    Direct Probe of Topology and Geometry of Quantum States on IBM Q

    Authors: Tianqi Chen, Hai-Tao Ding, Ruizhe Shen, Shi-Liang Zhu, Jiangbin Gong

    Abstract: The concepts of topology and geometry are of critical importance in exploring exotic phases of quantum matter. Though they have been investigated on various experimental platforms, to date a direct probe of topological and geometric properties on a universal quantum computer even for a minimum model is still in vain. In this work, we first show that a density matrix form of the quantum geometric t… ▽ More

    Submitted 6 June, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: 14 pages, 8 figures (updated main text and references)

  41. arXiv:2403.11122  [pdf, other

    cs.CV

    LERENet: Eliminating Intra-class Differences for Metal Surface Defect Few-shot Semantic Segmentation

    Authors: Hanze Ding, Zhangkai Wu, Jiyan Zhang, Ming Ping, Yanfang Liu

    Abstract: Few-shot segmentation models excel in metal defect detection due to their rapid generalization ability to new classes and pixel-level segmentation, rendering them ideal for addressing data scarcity issues and achieving refined object delineation in industrial applications. Existing works neglect the \textit{Intra-Class Differences}, inherent in metal surface defect data, which hinders the model fr… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  42. arXiv:2403.10468  [pdf, other

    cs.SE

    An Empirical Study on Developers Shared Conversations with ChatGPT in GitHub Pull Requests and Issues

    Authors: Huizi Hao, Kazi Amit Hasan, Hong Qin, Marcos Macedo, Yuan Tian, Steven H. H. Ding, Ahmed E. Hassan

    Abstract: ChatGPT has significantly impacted software development practices, providing substantial assistance to developers in a variety of tasks, including coding, testing, and debugging. Despite its widespread adoption, the impact of ChatGPT as an assistant in collaborative coding remains largely unexplored. In this paper, we analyze a dataset of 210 and 370 developers shared conversations with ChatGPT in… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  43. arXiv:2403.09616  [pdf, other

    cs.CV

    Explore In-Context Segmentation via Latent Diffusion Models

    Authors: Chaoyang Wang, Xiangtai Li, Henghui Ding, Lu Qi, Jiangning Zhang, Yunhai Tong, Chen Change Loy, Shuicheng Yan

    Abstract: In-context segmentation has drawn more attention with the introduction of vision foundation models. Most existing approaches adopt metric learning or masked image modeling to build the correlation between visual prompts and input image queries. In this work, we explore this problem from a new perspective, using one representative generation model, the latent diffusion model (LDM). We observe a tas… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  44. arXiv:2403.09390  [pdf, other

    hep-lat hep-ph nucl-th

    Curvature of the chiral phase transition line from the magnetic equation of state of (2+1)-flavor QCD

    Authors: H. -T. Ding, O. Kaczmarek, F. Karsch, P. Petreczky, Mugdha Sarkar, C. Schmidt, Sipaz Sharma

    Abstract: We analyze the dependence of the chiral phase transition temperature on baryon number and strangeness chemical potentials by calculating the leading order curvature coefficients in the light and strange quark flavor basis as well as in the conserved charge ($B, S$) basis. Making use of scaling properties of the magnetic equation of state (MEoS) and including diagonal as well as off-diagonal contri… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: 17 pages, 10 figures

  45. arXiv:2403.08845  [pdf, other

    cs.LG cs.AI

    Bifurcated Attention: Accelerating Massively Parallel Decoding with Shared Prefixes in LLMs

    Authors: Ben Athiwaratkun, Sujan Kumar Gonugondla, Sanjay Krishna Gouda, Haifeng Qian, Hantian Ding, Qing Sun, Jun Wang, Jiacheng Guo, Liangfu Chen, Parminder Bhatia, Ramesh Nallapati, Sudipta Sengupta, Bing Xiang

    Abstract: This study introduces bifurcated attention, a method designed to enhance language model inference in shared-context batch decoding scenarios. Our approach addresses the challenge of redundant memory IO costs, a critical factor contributing to latency in high batch sizes and extended context lengths. Bifurcated attention achieves this by strategically dividing the attention mechanism during increme… ▽ More

    Submitted 11 July, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

  46. arXiv:2403.08799  [pdf, other

    cs.SE cs.CR

    Automating SBOM Generation with Zero-Shot Semantic Similarity

    Authors: Devin Pereira, Christopher Molloy, Sudipta Acharya, Steven H. H. Ding

    Abstract: It is becoming increasingly important in the software industry, especially with the growing complexity of software ecosystems and the emphasis on security and compliance for manufacturers to inventory software used on their systems. A Software-Bill-of-Materials (SBOM) is a comprehensive inventory detailing a software application's components and dependencies. Current approaches rely on case-based… ▽ More

    Submitted 3 February, 2024; originally announced March 2024.

    Comments: 8 pages, 2 figures

  47. Detecting degenerate bands topological invariants in optical lattice

    Authors: Jing-Xin Liu, Jian-Te Wang, Hai-Tao Ding

    Abstract: In this paper, we present a novel experimental approach for simulating and detecting topological invariants using ultracold fermions confined in two-dimensional hexagonal optical lattices. We propose achieving two-fold degenerate four-band models with non-trivial topologies in both the AII and A classes by introducing additional inertial forces, Raman processes, or periodic driving. By implementin… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: 15 pages, 4 figures

  48. VLBI Astrometry of Radio Stars to Link Radio and Optical Celestial Reference Frames: Observing Strategies

    Authors: Jingdong Zhang, Bo Zhang, Shuangjing Xu, Niu Liu, Wen Chen, Hao Ding, Pengfei Jiang, Yan Sun, Jinqing Wang, Lang Cui, Shiming Wen, Xiaofeng Mai, Jinling Li, Fengchun Shu, Yidan Huang

    Abstract: The Gaia celestial reference frame (Gaia-CRF) will benefit from a close assessment with independent methods, such as Very Long Baseline Interferometry (VLBI) measurements of radio stars at bright magnitudes. However, obtaining full astrometric parameters for each radio star through VLBI measurements demands a significant amount of observation time. This study proposes an efficient observing strate… ▽ More

    Submitted 26 March, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    Comments: 9 pages, 4 figures, accepted for publication in the Monthly Notices of the Royal Astronomy Society (MNRAS)

  49. arXiv:2403.02265  [pdf, other

    cs.CV cs.GR

    DaReNeRF: Direction-aware Representation for Dynamic Scenes

    Authors: Ange Lou, Benjamin Planche, Zhongpai Gao, Yamin Li, Tianyu Luan, Hao Ding, Terrence Chen, Jack Noble, Ziyan Wu

    Abstract: Addressing the intricate challenge of modeling and re-rendering dynamic scenes, most recent approaches have sought to simplify these complexities using plane-based explicit representations, overcoming the slow training time issues associated with methods like Neural Radiance Fields (NeRF) and implicit representations. However, the straightforward decomposition of 4D dynamic scenes into multiple 2D… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: Accepted at CVPR 2024. Paper + supplementary material

  50. arXiv:2403.01560  [pdf, other

    cs.CV

    Rethinking CLIP-based Video Learners in Cross-Domain Open-Vocabulary Action Recognition

    Authors: Kun-Yu Lin, Henghui Ding, Jiaming Zhou, Yu-Ming Tang, Yi-Xing Peng, Zhilin Zhao, Chen Change Loy, Wei-Shi Zheng

    Abstract: Building upon the impressive success of CLIP (Contrastive Language-Image Pretraining), recent pioneer works have proposed to adapt the powerful CLIP to video data, leading to efficient and effective video learners for open-vocabulary action recognition. Inspired by that humans perform actions in diverse environments, our work delves into an intriguing question: Can CLIP-based video learners effect… ▽ More

    Submitted 24 May, 2024; v1 submitted 3 March, 2024; originally announced March 2024.