Showing 1–50 of 668 results for author: Dong, Z

  1. arXiv:2407.09667  [pdf]

    physics.med-ph eess.SP

    Real and imaginary part symmetries (RIPS) and artifacts removal in 1H magnetic resonance spectroscopy without water suppression

    Authors: Zhengchao Dong

    Abstract: Purpose: Proton MR spectroscopic imaging (1H MRSI) without water suppression (WS) possesses some distinct advantages over the conventionally used 1H MRSI with WS. However, the sideband artifacts in the non-water suppressed spectra hinder the applications of the 1H MRSI without WS. Although many hardware or software techniques to tackle the sidebands have been developed, they suffer from various shor…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 25 pages, 9 figures, 2 tables

  2. arXiv:2407.07147  [pdf, other]

    hep-ph hep-ex

    Analytical Insights on Hadronic Top Quark Polarimetry

    Authors: Zhongtian Dong, Dorival Gonçalves, Kyoungchul Kong, Andrew J. Larkoski, Alberto Navarro

    Abstract: Top quark polarization provides an important tool for studying its production mechanisms, spin correlations, top quark properties, and new physics searches. Unlike lighter quarks, the top quark's polarization remains intact until its decay, enabling precise spin measurements. While the down-type fermions from $W$ boson decay are known to be effective spin analyzers, charged leptons have typically…

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 14 pages and 3 figures

  3. arXiv:2407.05563  [pdf, other]

    cs.CL

    LLMBox: A Comprehensive Library for Large Language Models

    Authors: Tianyi Tang, Yiwen Hu, Bingqian Li, Wenyang Luo, Zijing Qin, Haoxiang Sun, Jiapeng Wang, Shiyi Xu, Xiaoxue Cheng, Geyang Guo, Han Peng, Bowen Zheng, Yiru Tang, Yingqian Min, Yushuo Chen, Jie Chen, Yuanqian Zhao, Luran Ding, Yuhao Wang, Zican Dong, Chunxuan Xia, Junyi Li, Kun Zhou, Wayne Xin Zhao, Ji-Rong Wen

    Abstract: To facilitate research on large language models (LLMs), this paper presents a comprehensive and unified library, LLMBox, to ease the development, use, and evaluation of LLMs. This library features three main merits: (1) a unified data interface that supports the flexible implementation of various training strategies, (2) a comprehensive evaluation that covers extensive tasks, datasets,…

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: Accepted by ACL 2024 Demo

  4. arXiv:2407.04404  [pdf]

    cs.AR

    Fixed and Movable Antenna Technology for 6G Integrated Sensing and Communication

    Authors: Yong Zeng, Zhenjun Dong, Huizhi Wang, Lipeng Zhu, Ziyao Hong, Qingji Jiang, Dongming Wang, Shi Jin, Rui Zhang

    Abstract: By deploying antenna arrays at the transmitter/receiver to provide additional spatial-domain degrees of freedom (DoFs), multi-antenna technology greatly improves the reliability and efficiency of wireless communication. Meanwhile, the application of multi-antenna technology in the radar field has achieved spatial angle resolution and improved sensing DoF, thus significantly enhancing wireless sens…

    Submitted 16 July, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

    Comments: in Chinese language

  5. arXiv:2407.03591  [pdf, other]

    physics.optics

    Controlling quasi-parametric amplifications: From multiple PT-symmetry phase transitions to non-Hermitian sensing

    Authors: Xiaoxiong Wu, Kai Bai, Penghong Yu, Zhaohui Dong, Yanyan He, Jingui Ma, Vladislav V. Yakovlev, Meng Xiao, Xianfeng Chen, Luqi Yuan

    Abstract: Quasi-parametric amplification (QPA) is a nonlinear interaction in which the idler wave is depleted through some loss mechanism. QPA plays an important role in signal amplification in ultrafast photonics and quantum light generation. The QPA process has a number of features characterized by the non-Hermitian parity-time ($\mathcal{PT}$) symmetry. In this report, we explore new interaction regimes…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 17 pages, 6 figures

  6. arXiv:2407.03531  [pdf, other]

    cs.RO

    OrbitGrasp: $SE(3)$-Equivariant Grasp Learning

    Authors: Boce Hu, Xupeng Zhu, Dian Wang, Zihao Dong, Haojie Huang, Chenghao Wang, Robin Walters, Robert Platt

    Abstract: While grasp detection is an important part of any robotic manipulation pipeline, reliable and accurate grasp detection in $SE(3)$ remains a research challenge. Many robotics applications in unstructured environments such as the home or warehouse would benefit greatly from better grasp performance. This paper proposes a novel framework for detecting $SE(3)$ grasp poses based on point cloud input. Our…

    Submitted 3 July, 2024; originally announced July 2024.

  7. arXiv:2407.03442  [pdf, other]

    cs.CV

    Fisher-aware Quantization for DETR Detectors with Critical-category Objectives

    Authors: Huanrui Yang, Yafeng Huang, Zhen Dong, Denis A Gudovskiy, Tomoyuki Okuno, Yohei Nakata, Yuan Du, Kurt Keutzer, Shanghang Zhang

    Abstract: The impact of quantization on the overall performance of deep learning models is a well-studied problem. However, understanding and mitigating its effects on a more fine-grained level is still lacking, especially for harder tasks such as object detection with both classification and regression objectives. This work defines the performance for a subset of task-critical categories, i.e. the critical…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Poster presentation at the 2nd Workshop on Advancing Neural Network Training: Computational Efficiency, Scalability, and Resource Optimization (WANT@ICML 2024)

  8. arXiv:2407.03014  [pdf]

    physics.optics physics.app-ph quant-ph

    Dielectric Fano Nanoantennas for Enabling Sub-Nanosecond Lifetimes in NV-based Single Photon Emitters

    Authors: Shu An, Dmitry Kalashnikov, Wenqiao Shi, Zackaria Mahfoud, Ah Bian Chew, Yan Liu, Jing Wu, Di Zhu, Weibo Gao, Cheng-Wei Qiu, Victor Leong, Zhaogang Dong

    Abstract: Solid-state quantum emitters are essential sources of single photons, and enhancing their emission rates is of paramount importance for applications in quantum communications, computing, and metrology. One approach is to couple quantum emitters with resonant photonic nanostructures, where the emission rate is enhanced due to the Purcell effect. Dielectric nanoantennas are promising as they provide…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 20 pages, 4 figures

  9. arXiv:2407.02887  [pdf, other]

    cs.CV

    Explicitly Guided Information Interaction Network for Cross-modal Point Cloud Completion

    Authors: Hang Xu, Chen Long, Wenxiao Zhang, Yuan Liu, Zhen Cao, Zhen Dong, Bisheng Yang

    Abstract: In this paper, we explore a novel framework, EGIInet (Explicitly Guided Information Interaction Network), a model for View-guided Point cloud Completion (ViPC) task, which aims to restore a complete point cloud from a partial one with a single view image. In comparison with previous methods that relied on the global semantics of input images, EGIInet efficiently combines the information from two m…

    Submitted 4 July, 2024; v1 submitted 3 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  10. arXiv:2407.01663  [pdf, other]

    hep-ph hep-ex

    Hadronic Top Quark Polarimetry with ParticleNet

    Authors: Zhongtian Dong, Dorival Gonçalves, Kyoungchul Kong, Andrew J. Larkoski, Alberto Navarro

    Abstract: Precision studies for top quark physics are a cornerstone of the Large Hadron Collider program. Polarization, probed through decay kinematics, provides a unique tool to scrutinize the top quark across its various production modes and to explore potential new physics effects. However, the top quark most often decays hadronically, for which unambiguous identification of its decay products sensitive…

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 7 pages and 3 figures

  11. arXiv:2407.01118  [pdf, other]

    astro-ph.IM

    Preliminary results of sky brightness measurements in near-infrared at Lenghu, China

    Authors: Jinji Li, Bin Ma, Zhongnan Dong, Haoran Zhang

    Abstract: Low sky brightness is crucial for ground-based astronomical observations, because it limits the observational capability to detect fainter sources. Lenghu, located on the Tibetan Plateau in China, has been identified as a high-quality astronomical site, with a dark sky in the optical band. In this work, we report the preliminary results of near-infrared sky brightness measurements at…

    Submitted 8 July, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

    Comments: 8 pages, 8 figures, presented at 2024 SPIE Astronomical Telescopes + Instrumentation conference

  12. arXiv:2407.00433  [pdf]

    cond-mat.mtrl-sci

    Screening of half-Heuslers with temperature-induced band convergence and enhanced thermoelectric properties

    Authors: Jinyang Xi, Zirui Dong, Menghan Gao, Jun Luo, Jiong Yang

    Abstract: Enhancing band convergence is an effective way to optimize the thermoelectric (TE) properties of materials. However, the temperature-induced band renormalization is commonly ignored. By employing the recently-developed electron-phonon renormalization (EPR) method, the nature of band renormalization in half-Heusler (HH) compounds TiCoSb and NbFeSb is revealed, and the key factors for temperature-in…

    Submitted 29 June, 2024; originally announced July 2024.

  13. arXiv:2406.19853  [pdf, other]

    cs.CL cs.AI

    YuLan: An Open-source Large Language Model

    Authors: Yutao Zhu, Kun Zhou, Kelong Mao, Wentong Chen, Yiding Sun, Zhipeng Chen, Qian Cao, Yihan Wu, Yushuo Chen, Feng Wang, Lei Zhang, Junyi Li, Xiaolei Wang, Lei Wang, Beichen Zhang, Zican Dong, Xiaoxue Cheng, Yuhan Chen, Xinyu Tang, Yupeng Hou, Qiangqiang Ren, Xincheng Pang, Shufang Xie, Wayne Xin Zhao, Zhicheng Dou, et al. (13 additional authors not shown)

    Abstract: Large language models (LLMs) have become the foundation of many applications, leveraging their extensive capabilities in processing and understanding natural language. While many open-source LLMs have been released with technical reports, the lack of training details hinders further research and development. This paper presents the development of YuLan, a series of open-source LLMs with $12$ billi…

    Submitted 28 June, 2024; originally announced June 2024.

  14. arXiv:2406.17507  [pdf, other]

    cs.IR

    ACE: A Generative Cross-Modal Retrieval Framework with Coarse-To-Fine Semantic Modeling

    Authors: Minghui Fang, Shengpeng Ji, Jialong Zuo, Hai Huang, Yan Xia, Jieming Zhu, Xize Cheng, Xiaoda Yang, Wenrui Liu, Gang Wang, Zhenhua Dong, Zhou Zhao

    Abstract: Generative retrieval, which has demonstrated effectiveness in text-to-text retrieval, utilizes a sequence-to-sequence model to directly generate candidate identifiers based on natural language queries. Without explicitly computing the similarity between queries and candidates, generative retrieval surpasses dual-tower models in both speed and accuracy on large-scale corpora, providing new insights…

    Submitted 25 June, 2024; originally announced June 2024.

  15. arXiv:2406.17036  [pdf, other]

    cond-mat.supr-con cond-mat.mes-hall cond-mat.str-el

    Superconductivity from spin-canting fluctuations in rhombohedral graphene

    Authors: Zhiyu Dong, Étienne Lantagne-Hurtubise, Jason Alicea

    Abstract: Rhombohedral graphene multilayers host various broken-symmetry metallic phases as well as superconductors whose pairing mechanism and order parameter symmetry remain unsettled. Strikingly, experiments have revealed prominent new superconducting regions in rhombohedral bilayer and trilayer graphene devices with proximity-induced Ising spin-orbit coupling. We propose that these superconductors desce…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 14 pages, 5 figures

  16. arXiv:2406.16864  [pdf, other]

    cs.CV cs.AI cs.GR

    StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal

    Authors: Chongjie Ye, Lingteng Qiu, Xiaodong Gu, Qi Zuo, Yushuang Wu, Zilong Dong, Liefeng Bo, Yuliang Xiu, Xiaoguang Han

    Abstract: This work addresses the challenge of high-quality surface normal estimation from monocular colored inputs (i.e., images and videos), a field which has recently been revolutionized by repurposing diffusion priors. However, previous attempts still struggle with stochastic inference, which conflicts with the deterministic nature of the Image2Normal task, and with a costly ensembling step, which slows down the e…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: HF Demo: hf.co/Stable-X, Video: https://www.youtube.com/watch?v=sylXTxG_U2U

  17. arXiv:2406.16003  [pdf]

    physics.optics

    Unidirectional Chiral Emission via Twisted Bi-layer Metasurfaces

    Authors: Dmitrii Gromyko, Shu An, Sergey Gorelik, Jiahui Xu, Li Jun Lim, Henry Yit Loong Lee, Febiana Tjiptoharsono, Zhi-Kuang Tan, Cheng-Wei Qiu, Zhaogang Dong, Lin Wu

    Abstract: Controlling and channelling light emissions from unpolarized quantum dots into specific directions with chiral polarization remains a key challenge in modern photonics. Stacked metasurface designs offer a potential compact solution for chirality and directionality engineering. However, experimental observations of directional chiral radiation from resonant metasurfaces with quantum emitters remain…

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: 16 pages, 4 figures

  18. arXiv:2406.15073  [pdf, other]

    cs.AI cs.DB

    KnobTree: Intelligent Database Parameter Configuration via Explainable Reinforcement Learning

    Authors: Jiahan Chen, Shuhan Qi, Yifan Li, Zeyu Dong, Mingfeng Ding, Yulin Wu, Xuan Wang

    Abstract: Databases are fundamental to contemporary information systems, yet traditional rule-based configuration methods struggle to manage the complexity of real-world applications with hundreds of tunable parameters. Deep reinforcement learning (DRL), which combines perception and decision-making, presents a potential solution for intelligent database configuration tuning. However, due to black-box prope…

    Submitted 21 June, 2024; originally announced June 2024.

  19. arXiv:2406.14927  [pdf, other]

    cs.CV cs.RO

    Gaussian-Informed Continuum for Physical Property Identification and Simulation

    Authors: Junhao Cai, Yuji Yang, Weihao Yuan, Yisheng He, Zilong Dong, Liefeng Bo, Hui Cheng, Qifeng Chen

    Abstract: This paper studies the problem of estimating physical properties (system identification) through visual observations. To facilitate geometry-aware guidance in physical property estimation, we introduce a novel hybrid framework that leverages 3D Gaussian representation to not only capture explicit shapes but also enable the simulated continuum to deduce implicit shapes during training. We propose a…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 19 pages, 8 figures

  20. arXiv:2406.14264  [pdf, other]

    eess.IV cs.CV

    Zero-Shot Image Denoising for High-Resolution Electron Microscopy

    Authors: Xuanyu Tian, Zhuoya Dong, Xiyue Lin, Yue Gao, Hongjiang Wei, Yanhang Ma, Jingyi Yu, Yuyao Zhang

    Abstract: High-resolution electron microscopy (HREM) imaging is a powerful tool for directly visualizing a broad range of materials in real space. However, it faces challenges in denoising due to ultra-low signal-to-noise ratio (SNR) and scarce data availability. In this work, we propose Noise2SR, a zero-shot self-supervised learning (ZS-SSL) denoising framework for HREM. Within our framework, we…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 12 pages, 12 figures

  21. arXiv:2406.14017  [pdf, other]

    cs.IR

    EAGER: Two-Stream Generative Recommender with Behavior-Semantic Collaboration

    Authors: Ye Wang, Jiahao Xun, Minjie Hong, Jieming Zhu, Tao Jin, Wang Lin, Haoyuan Li, Linjun Li, Yan Xia, Zhou Zhao, Zhenhua Dong

    Abstract: Generative retrieval has recently emerged as a promising approach to sequential recommendation, framing candidate item retrieval as an autoregressive sequence generation problem. However, existing generative methods typically focus solely on either behavioral or semantic aspects of item information, neglecting their complementary nature and thus resulting in limited effectiveness. To address this…

    Submitted 3 July, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: Accepted by KDD 2024. Code available at https://reczoo.github.io/EAGER

  22. arXiv:2406.09807  [pdf, other]

    cs.SE

    Same App, Different Behaviors: Uncovering Device-specific Behaviors in Android Apps

    Authors: Zikan Dong, Yanjie Zhao, Tianming Liu, Chao Wang, Guosheng Xu, Guoai Xu, Haoyu Wang

    Abstract: The Android ecosystem faces a notable challenge known as fragmentation, which denotes the extensive diversity within the system. This issue is mainly related to differences in system versions, device hardware specifications, and customizations introduced by manufacturers. The growing divergence among devices leads to marked variations in how a given app behaves across diverse devices. This is refe…

    Submitted 14 June, 2024; originally announced June 2024.

  23. arXiv:2406.09509  [pdf, other]

    cs.AI cs.LG cs.RO

    CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making

    Authors: Zibin Dong, Yifu Yuan, Jianye Hao, Fei Ni, Yi Ma, Pengyi Li, Yan Zheng

    Abstract: Leveraging the powerful generative capability of diffusion models (DMs) to build decision-making agents has achieved extensive success. However, there is still a demand for an easy-to-use and modularized open-source library that offers customized and efficient development for DM-based decision-making algorithms. In this work, we introduce CleanDiffuser, the first DM library specifically designed f…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: The first two authors contribute equally to this work. Code and documentation: https://github.com/CleanDiffuserTeam/CleanDiffuser

  24. arXiv:2406.07932  [pdf, other]

    cs.IR

    Counteracting Duration Bias in Video Recommendation via Counterfactual Watch Time

    Authors: Haiyuan Zhao, Guohao Cai, Jieming Zhu, Zhenhua Dong, Jun Xu, Ji-Rong Wen

    Abstract: In video recommendation, an ongoing effort is to satisfy users' personalized information needs by leveraging their logged watch time. However, watch time prediction suffers from duration bias, hindering its ability to reflect users' interests accurately. Existing label-correction approaches attempt to uncover user interests through grouping and normalizing observed watch time according to video du…

    Submitted 13 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted by KDD 2024

  25. arXiv:2406.04252  [pdf]

    cond-mat.mtrl-sci cond-mat.mes-hall

    Sub-nanometer depth resolution and single dopant visualization achieved by tilt-coupled multislice electron ptychography

    Authors: Zehao Dong, Yang Zhang, Chun-Chien Chiu, Sicheng Lu, Jianbing Zhang, Yu-Chen Liu, Suya Liu, Jan-Chi Yang, Pu Yu, Yayu Wang, Zhen Chen

    Abstract: Real-space imaging of three-dimensional atomic structures is a critical yet challenging task in materials science. Although scanning transmission electron microscopy has achieved sub-angstrom lateral resolution through techniques like electron ptychography1,2, depth resolution remains limited to only 2 to 3 nanometers with a single projection setup3,4. Attaining better depth resolution typically n…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 27 pages, 5 figures, 10 supplementary figures

  26. arXiv:2406.01238  [pdf, other]

    cs.CL

    EffiQA: Efficient Question-Answering with Strategic Multi-Model Collaboration on Knowledge Graphs

    Authors: Zixuan Dong, Baoyun Peng, Yufei Wang, Jia Fu, Xiaodong Wang, Yongxue Shan, Xin Zhou

    Abstract: While large language models (LLMs) have shown remarkable capabilities in natural language processing, they struggle with complex, multi-step reasoning tasks involving knowledge graphs (KGs). Existing approaches that integrate LLMs and KGs either underutilize the reasoning abilities of LLMs or suffer from prohibitive computational costs due to tight coupling. To address these limitations, we propos…

    Submitted 7 July, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: 10 pages, 4 figures, 3 tables

  27. arXiv:2406.00974  [pdf, other]

    eess.SY

    Large Language Model Assisted Optimal Bidding of BESS in FCAS Market: An AI-agent based Approach

    Authors: Borui Zhang, Chaojie Li, Guo Chen, Zhaoyang Dong

    Abstract: To incentivize flexible resources such as Battery Energy Storage Systems (BESSs) to offer Frequency Control Ancillary Services (FCAS), Australia's National Electricity Market (NEM) has implemented changes in recent years towards shorter-term bidding rules and faster service requirements. However, firstly, existing bidding optimization methods often overlook or oversimplify the key aspects of FCAS…

    Submitted 3 June, 2024; originally announced June 2024.

  28. arXiv:2405.20228  [pdf, other]

    gr-qc astro-ph.HE nucl-th

    Love-C relations for elastic hybrid stars

    Authors: Zoey Zhiyuan Dong, Joshua Cole Faggert, Shu Yan Lau, Kent Yagi

    Abstract: Neutron stars (NSs) provide a unique laboratory to study matter under extreme densities. Recent observations from gravitational and electromagnetic waves have enabled constraints on NS properties, such as tidal deformability (related to the tidal Love number) and stellar compactness. Although each of these two NS observables depends strongly on the stellar internal structure, the relation between…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 23 pages, 9 figures, submitted to GRG

  29. arXiv:2405.19262  [pdf, other]

    cs.CL cs.AI cs.LG

    Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models

    Authors: Zhanhui Zhou, Zhixuan Liu, Jie Liu, Zhichen Dong, Chao Yang, Yu Qiao

    Abstract: Large language models are usually fine-tuned to align with human preferences. However, fine-tuning a large language model can be challenging. In this work, we introduce $\textit{weak-to-strong search}$, framing the alignment of a large language model as a test-time greedy search to maximize the log-likelihood difference between small tuned and untuned models while sampling from the frozen large mo…

    Submitted 29 May, 2024; originally announced May 2024.

  30. arXiv:2405.18009  [pdf, other]

    cs.CL cs.LG

    Exploring Context Window of Large Language Models via Decomposed Positional Vectors

    Authors: Zican Dong, Junyi Li, Xin Men, Wayne Xin Zhao, Bingbing Wang, Zhen Tian, Weipeng Chen, Ji-Rong Wen

    Abstract: Transformer-based large language models (LLMs) typically have a limited context window, resulting in significant performance degradation when processing text beyond the length of the context window. Extensive studies have been proposed to extend the context window and achieve length extrapolation of LLMs, but there is still a lack of in-depth interpretation of these approaches. In this study, we e…

    Submitted 28 May, 2024; originally announced May 2024.

  31. arXiv:2405.17998  [pdf, other]

    cs.IR cs.AI cs.CL

    Source Echo Chamber: Exploring the Escalation of Source Bias in User, Data, and Recommender System Feedback Loop

    Authors: Yuqi Zhou, Sunhao Dai, Liang Pang, Gang Wang, Zhenhua Dong, Jun Xu, Ji-Rong Wen

    Abstract: Recently, researchers have uncovered that neural retrieval models prefer AI-generated content (AIGC), called source bias. Compared to active search behavior, recommendation represents another important means of information acquisition, where users are more prone to source bias. Furthermore, delving into the recommendation scenario, as AIGC becomes integrated within the feedback loop involving user…

    Submitted 28 May, 2024; originally announced May 2024.

  32. arXiv:2405.16546  [pdf, other]

    cs.IR cs.CL

    Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents Integration

    Authors: Sunhao Dai, Weihao Liu, Yuqi Zhou, Liang Pang, Rongju Ruan, Gang Wang, Zhenhua Dong, Jun Xu, Ji-Rong Wen

    Abstract: The proliferation of Large Language Models (LLMs) has led to an influx of AI-generated content (AIGC) on the internet, transforming the corpus of Information Retrieval (IR) systems from solely human-written to a coexistence with LLM-generated content. The impact of this surge in AIGC on IR systems remains an open question, with the primary challenge being the lack of a dedicated benchmark for rese…

    Submitted 2 July, 2024; v1 submitted 26 May, 2024; originally announced May 2024.

    Comments: Accepted by Findings of ACL 2024; Datasets Link: https://huggingface.co/IR-Cocktail

  33. arXiv:2405.16120  [pdf, other]

    cs.IR

    BankFair: Balancing Accuracy and Fairness under Varying User Traffic in Recommender System

    Authors: Xiaopeng Ye, Chen Xu, Jun Xu, Xuyang Xie, Gang Wang, Zhenhua Dong

    Abstract: Driven by sustainability and economic considerations, two-sided recommendation platforms are required to satisfy the needs of both users and providers. Previous studies often indicate that the two sides' needs differ in urgency: providers have relatively long-term exposure requirements, while users desire short-term, accurate services. However, our empirical study reveals that existing methods for…

    Submitted 25 May, 2024; originally announced May 2024.

  34. arXiv:2405.12954  [pdf, other]

    cs.LG cs.AI

    A Method on Searching Better Activation Functions

    Authors: Haoyuan Sun, Zihao Wu, Bo Xia, Pu Chang, Zibin Dong, Yifu Yuan, Yongzhe Chang, Xueqian Wang

    Abstract: The success of artificial neural networks (ANNs) hinges greatly on the judicious selection of an activation function, which introduces non-linearity into networks and enables them to model sophisticated relationships in data. However, the search for activation functions has largely relied on empirical knowledge in the past, lacking theoretical guidance, which has hindered the identification of more effe…

    Submitted 22 May, 2024; v1 submitted 18 May, 2024; originally announced May 2024.

    Comments: 16 pages, 3 figures

  35. arXiv:2405.12892  [pdf, other]

    cs.IR cs.LG

    Retrievable Domain-Sensitive Feature Memory for Multi-Domain Recommendation

    Authors: Yuang Zhao, Zhaocheng Du, Qinglin Jia, Linxuan Zhang, Zhenhua Dong, Ruiming Tang

    Abstract: With the increase in the business scale and number of domains in online advertising, multi-domain ad recommendation has become a mainstream solution in the industry. The core of multi-domain recommendation is effectively modeling the commonalities and distinctions among domains. Existing works are dedicated to designing model architectures for implicit multi-domain modeling while overlooking an in…

    Submitted 21 May, 2024; originally announced May 2024.

  36. arXiv:2405.10800  [pdf, other]

    cs.LG

    Heterogeneity-Informed Meta-Parameter Learning for Spatiotemporal Time Series Forecasting

    Authors: Zheng Dong, Renhe Jiang, Haotian Gao, Hangchen Liu, Jinliang Deng, Qingsong Wen, Xuan Song

    Abstract: Spatiotemporal time series forecasting plays a key role in a wide range of real-world applications. While significant progress has been made in this area, fully capturing and leveraging spatiotemporal heterogeneity remains a fundamental challenge. Therefore, we propose a novel Heterogeneity-Informed Meta-Parameter Learning scheme. Specifically, our approach implicitly captures spatiotemporal heter…

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: Accepted by KDD'24 Research Track

  37. arXiv:2405.10596  [pdf, other]

    cs.IR

    CELA: Cost-Efficient Language Model Alignment for CTR Prediction

    Authors: Xingmei Wang, Weiwen Liu, Xiaolong Chen, Qi Liu, Xu Huang, Defu Lian, Xiangyang Li, Yasheng Wang, Zhenhua Dong, Ruiming Tang

    Abstract: Click-Through Rate (CTR) prediction holds a paramount position in recommender systems. The prevailing ID-based paradigm underperforms in cold-start scenarios due to the skewed distribution of feature frequency. Additionally, the utilization of a single modality fails to exploit the knowledge contained within textual features. Recent efforts have sought to mitigate these challenges by integrating P…

    Submitted 17 June, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

    Comments: 10 pages, 5 figures

    MSC Class: 68T07

  38. arXiv:2405.10284  [pdf, other]

    quant-ph cs.LG hep-ph

    Quantum Vision Transformers for Quark-Gluon Classification

    Authors: Marçal Comajoan Cara, Gopal Ramesh Dahale, Zhongtian Dong, Roy T. Forestano, Sergei Gleyzer, Daniel Justice, Kyoungchul Kong, Tom Magorsch, Konstantin T. Matchev, Katia Matcheva, Eyup B. Unlu

    Abstract: We introduce a hybrid quantum-classical vision transformer architecture, notable for its integration of variational quantum circuits within both the attention mechanism and the multi-layer perceptrons. The research addresses the critical challenge of computational efficiency and resource constraints in analyzing data from the upcoming High Luminosity Large Hadron Collider, presenting the architect…

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 14 pages, 8 figures. Published in MDPI Axioms 2024, 13(5), 323

    MSC Class: 68Q12 (Primary) 81P68; 68T07 (Secondary)

    Journal ref: Axioms 2024, 13(5), 323

  39. arXiv:2405.08686  [pdf]

    cond-mat.mtrl-sci cond-mat.mes-hall quant-ph

    Antiferromagnetic Quantum Anomalous Hall Effect Modulated by Spin Flips and Flops

    Authors: Zichen Lian, Yongchao Wang, Yongqian Wang, Yang Feng, Zehao Dong, Shuai Yang, Liangcai Xu, Yaoxin Li, Bohan Fu, Yuetan Li, Wanjun Jiang, Chang Liu, Jinsong Zhang, Yayu Wang

    Abstract: The interplay between nontrivial band topology and layered antiferromagnetism in MnBi2Te4 has opened up a new avenue for exploring topological phases of matter. Representative examples include the quantum anomalous Hall effect and axion insulator state observed in odd and even number layers of MnBi2Te4, when the top and bottom surfaces have parallel and antiparallel spin alignments respectively. T…

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: 16 pages, 4 figures

  40. arXiv:2405.06999  [pdf, other]

    eess.SY

    Large Language Model-aided Edge Learning in Distribution System State Estimation

    Authors: Renyou Xie, Xin Yin, Chaojie Li, Nian Liu, Bo Zhao, Zhaoyang Dong

    Abstract: Distribution system state estimation (DSSE) plays a crucial role in the real-time monitoring, control, and operation of distribution networks. Besides intensive computational requirements, conventional DSSE methods need high-quality measurements to obtain accurate states, whereas missing values often occur due to sensor failures or communication delays. To address these challenging issues, a forec…

    Submitted 11 May, 2024; originally announced May 2024.

  41. arXiv:2405.06927  [pdf, ps, other]

    cs.IR

    Multimodal Pretraining and Generation for Recommendation: A Tutorial

    Authors: Jieming Zhu, Chuhan Wu, Rui Zhang, Zhenhua Dong

    Abstract: Personalized recommendation stands as a ubiquitous channel for users to explore information or items aligned with their interests. Nevertheless, prevailing recommendation models predominantly rely on unique IDs and categorical features for user-item matching. While this ID-centric approach has witnessed considerable success, it falls short in comprehensively grasping the essence of raw item conten…

    Submitted 11 May, 2024; originally announced May 2024.

    Comments: Published in WWW 2024 Tutorial. Find the tutorial materials at https://mmrec.github.io/tutorial/www2024/

  42. arXiv:2405.03952  [pdf, other]

    cs.SD cs.CL eess.AS

    HAFFormer: A Hierarchical Attention-Free Framework for Alzheimer's Disease Detection From Spontaneous Speech

    Authors: Zhongren Dong, Zixing Zhang, Weixiang Xu, Jing Han, Jianjun Ou, Björn W. Schuller

    Abstract: Automatically detecting Alzheimer's Disease (AD) from spontaneous speech plays an important role in its early diagnosis. Recent approaches rely heavily on Transformer architectures due to their efficiency in modelling long-range context dependencies. However, the quadratic increase in computational complexity associated with self-attention and the length of audio poses a challenge when deploying…

    Submitted 6 May, 2024; originally announced May 2024.

    Journal ref: Published at ICASSP 2024

  43. arXiv:2405.00742  [pdf, other]

    cs.CR cs.LG stat.ML

    Federated Graph Learning for EV Charging Demand Forecasting with Personalization Against Cyberattacks

    Authors: Yi Li, Renyou Xie, Chaojie Li, Yi Wang, Zhaoyang Dong

    Abstract: Mitigating cybersecurity risk in electric vehicle (EV) charging demand forecasting plays a crucial role in the safe operation of collective EV charging, the stability of the power grid, and cost-effective infrastructure expansion. However, existing methods either suffer from the data privacy issue and the susceptibility to cyberattacks or fail to consider the spatial correlation among differe…

    Submitted 30 April, 2024; originally announced May 2024.

    Comments: 11 pages, 4 figures

  44. arXiv:2404.18073  [pdf, other]

    cond-mat.str-el

    Charge and spin density wave orders in field-biased Bernal bilayer graphene

    Authors: Zhiyu Dong, Patrick A. Lee, Leonid Levitov

    Abstract: This paper aims to clarify the nature of a surprising ordered phase recently reported in biased Bernal bilayer graphene that occurs at the phase boundary between the isospin-polarized and unpolarized phases. Strong nonlinearity of transport at abnormally small currents, with $dI/dV$ vs. $I$ sharply rising and then falling back, is typical for a charge/spin-density-wave state (CDW or SDW) sliding t…

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: 11 pages, 7 figures

  45. arXiv:2404.16304  [pdf, other]

    cs.CV

    BezierFormer: A Unified Architecture for 2D and 3D Lane Detection

    Authors: Zhiwei Dong, Xi Zhu, Xiya Cao, Ran Ding, Wei Li, Caifa Zhou, Yongliang Wang, Qiangbo Liu

    Abstract: Lane detection has made significant progress in recent years, but there is no unified architecture for its two sub-tasks: 2D lane detection and 3D lane detection. To fill this gap, we introduce BézierFormer, a unified 2D and 3D lane detection architecture based on Bézier curve lane representation. BézierFormer formulates queries as Bézier control points and incorporates a novel Bézier curve atten…

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: ICME 2024, 11 pages, 8 figures

  46. arXiv:2404.15773  [pdf, other]

    cond-mat.str-el

    Possible gapless quantum spin liquid behavior in the triangular-lattice Ising antiferromagnet PrMgAl$_{11}$O$_{19}$

    Authors: Zhen Ma, Shuhan Zheng, Yingqi Chen, Ruokai Xu, Zhao-Yang Dong, Jinghui Wang, Hong Du, Jan Peter Embs, Shuaiwei Li, Yao Li, Yongjun Zhang, Meifeng Liu, Ruidan Zhong, Jun-Ming Liu, Jinsheng Wen

    Abstract: Quantum spin liquids (QSLs) represent a novel state where spins are highly entangled but do not order even at zero temperature due to strong quantum fluctuations. Such a state is mostly studied in Heisenberg models defined on geometrically frustrated lattices. Here, we turn to a new triangular-lattice antiferromagnet PrMgAl$_{11}$O$_{19}$, in which the interactions are believed to be of Ising type…

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: 11 pages, 5 figures

    Journal ref: Phys. Rev. B 109, 165143 (2024)

  47. arXiv:2404.14774  [pdf, other]

    cs.IR

    Contrastive Quantization based Semantic Code for Generative Recommendation

    Authors: Mengqun Jin, Zexuan Qiu, Jieming Zhu, Zhenhua Dong, Xiu Li

    Abstract: With the success of large language models, generative retrieval has emerged as a new retrieval technique for recommendation. It can be divided into two stages: the first stage involves constructing discrete Codes (i.e., codes), and the second stage involves decoding the code sequentially via the transformer architecture. Current methods often construct item semantic codes by reconstructing based q…

    Submitted 23 April, 2024; originally announced April 2024.

  48. arXiv:2404.14309  [pdf, other]

    cs.CV

    Towards Better Adversarial Purification via Adversarial Denoising Diffusion Training

    Authors: Yiming Liu, Kezhao Liu, Yao Xiao, Ziyi Dong, Xiaogang Xu, Pengxu Wei, Liang Lin

    Abstract: Recently, diffusion-based purification (DBP) has emerged as a promising approach for defending against adversarial attacks. However, previous studies have used questionable methods to evaluate the robustness of DBP models, and their explanations of DBP robustness also lack experimental support. We re-examine DBP robustness using precise gradients, and discuss the impact of stochasticity on DBP robustne…

    Submitted 22 April, 2024; originally announced April 2024.

  49. arXiv:2404.13655  [pdf, other]

    cs.LG cs.AI

    SPGNN: Recognizing Salient Subgraph Patterns via Enhanced Graph Convolution and Pooling

    Authors: Zehao Dong, Muhan Zhang, Yixin Chen

    Abstract: Graph neural networks (GNNs) have revolutionized the field of machine learning on non-Euclidean data such as graphs and networks. GNNs effectively implement node representation learning through neighborhood aggregation and achieve impressive results in many graph-related tasks. However, most neighborhood aggregation approaches are summation-based, which can be problematic as they may not be suffic…

    Submitted 29 April, 2024; v1 submitted 21 April, 2024; originally announced April 2024.

  50. arXiv:2404.13501  [pdf, other]

    cs.AI

    A Survey on the Memory Mechanism of Large Language Model based Agents

    Authors: Zeyu Zhang, Xiaohe Bo, Chen Ma, Rui Li, Xu Chen, Quanyu Dai, Jieming Zhu, Zhenhua Dong, Ji-Rong Wen

    Abstract: Large language model (LLM) based agents have recently attracted much attention from the research and industry communities. Compared with original LLMs, LLM-based agents are distinguished by their self-evolving capability, which is the basis for solving real-world problems that need long-term and complex agent-environment interactions. The key component to support agent-environment interactions is the m…

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: 39 pages, 5 figures, 4 tables