Showing 1–50 of 3,729 results for author: Wang, R

  1. arXiv:2407.11536  [pdf, other]

    cs.CL cs.AI

    Fine-Tuning Medical Language Models for Enhanced Long-Contextual Understanding and Domain Expertise

    Authors: Qimin Yang, Rongsheng Wang, Jiexin Chen, Runqi Su, Tao Tan

    Abstract: Large Language Models (LLMs) have been widely applied in various professional fields. By fine-tuning the models using domain specific question and answer datasets, the professional domain knowledge and Q&A abilities of these models have significantly improved, for example, medical professional LLMs that use fine-tuning of doctor-patient Q&A data exhibit extraordinary disease diagnostic abilities…

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 5 pages, 1 figure. Accepted by the Workshop on Long-Context Foundation Models (LCFM) at ICML 2024

  2. arXiv:2407.11474  [pdf, other]

    hep-ex

    Search for the rare $Λ_c^+ \to p μ^+ μ^-$ decay

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis, et al. (1062 additional authors not shown)

    Abstract: A search for the nonresonant $Λ_c^+ \to p μ^+ μ^-$ decay is performed using proton-proton collision data recorded at a centre-of-mass energy of 13 TeV by the LHCb experiment, corresponding to an integrated luminosity of 5.4 fb$^{-1}$. No evidence for the decay is found in the dimuon invariant-mass regions where the expected contributions of resonances is subdominant. The upper limit on the branchi…

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2024-005.html (LHCb public pages)

    Report number: LHCb-PAPER-2024-005, CERN-EP-2024-158

  3. arXiv:2407.11369  [pdf]

    cond-mat.supr-con

    High-Resolution Spectroscopy of the Intermediate Impurity States near a Quantum Phase Transition

    Authors: Yao Zhang, Tao Xie, Zhen-Yu Liu, Rui Wang, Wenhao Zhang, Chaofei Liu, Ying-Shuang Fu

    Abstract: The intermediate behavior near a quantum phase transition is crucial for understanding the quantum criticality of various competing phases and their separate origins, yet remains unexplored for the multiple Yu-Shiba-Rusinov (YSR) states. Here, we investigated the detailed spectroscopic change of the exchange coupling-dependent YSR states near a quantum phase transition. The initially developed one…

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 8 pages, 3 figures

  4. arXiv:2407.11266  [pdf, other]

    cs.CV

    Towards High-Quality 3D Motion Transfer with Realistic Apparel Animation

    Authors: Rong Wang, Wei Mao, Changsheng Lu, Hongdong Li

    Abstract: Animating stylized characters to match a reference motion sequence is a highly demanded task in film and gaming industries. Existing methods mostly focus on rigid deformations of characters' body, neglecting local deformations on the apparel driven by physical dynamics. They deform apparel the same way as the body, leading to results with limited details and unrealistic artifacts, e.g. body-appare…

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: Accepted to ECCV 2024

  5. arXiv:2407.11223  [pdf, other]

    eess.IV

    DD_RoTIR: Dual-Domain Image Registration via Image Translation and Hiearchical Feature-matching

    Authors: Ruixiong Wang, Stephen Cross, Alin Achim

    Abstract: Microscopy images acquired by multiple camera lenses or sensors in biological experiments offer a comprehensive understanding of the objects from diverse aspects. However, setups for multiple microscopes raise the possibility of misalignment of identical target features through different modalities. Thus, multimodal image registration is essential. In this work, we employed previous successes in b…

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 30 pages including supporting information; 15 figures for main context, 5 figures for supporting information; 5 tables; 5 equations in main, 12 in supporting information

  6. arXiv:2407.11031  [pdf, other]

    cs.LG eess.SP

    Purification Of Contaminated Convolutional Neural Networks Via Robust Recovery: An Approach with Theoretical Guarantee in One-Hidden-Layer Case

    Authors: Hanxiao Lu, Zeyu Huang, Ren Wang

    Abstract: Convolutional neural networks (CNNs), one of the key architectures of deep learning models, have achieved superior performance on many machine learning tasks such as image classification, video recognition, and power systems. Despite their success, CNNs can be easily contaminated by natural noises and artificially injected noises such as backdoor attacks. In this paper, we propose a robust recover…

    Submitted 3 July, 2024; originally announced July 2024.

  7. arXiv:2407.10969  [pdf, other]

    cs.CL cs.LG

    Q-Sparse: All Large Language Models can be Fully Sparsely-Activated

    Authors: Hongyu Wang, Shuming Ma, Ruiping Wang, Furu Wei

    Abstract: We introduce, Q-Sparse, a simple yet effective approach to training sparsely-activated large language models (LLMs). Q-Sparse enables full sparsity of activations in LLMs which can bring significant efficiency gains in inference. This is achieved by applying top-K sparsification to the activations and the straight-through-estimator to the training. The key results from this work are, (1) Q-Sparse…

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: Work in progress

  8. arXiv:2407.10172  [pdf, other]

    cs.CV

    Restoring Images in Adverse Weather Conditions via Histogram Transformer

    Authors: Shangquan Sun, Wenqi Ren, Xinwei Gao, Rui Wang, Xiaochun Cao

    Abstract: Transformer-based image restoration methods in adverse weather have achieved significant progress. Most of them use self-attention along the channel dimension or within spatially fixed-range blocks to reduce computational load. However, such a compromise results in limitations in capturing long-range spatial features. Inspired by the observation that the weather-induced degradation factors mainly…

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: 19 pages, 7 figures, 10MB

  9. arXiv:2407.09811  [pdf, other]

    cs.AI cs.HC q-bio.GN

    CellAgent: An LLM-driven Multi-Agent Framework for Automated Single-cell Data Analysis

    Authors: Yihang Xiao, Jinyi Liu, Yan Zheng, Xiaohan Xie, Jianye Hao, Mingzhi Li, Ruitao Wang, Fei Ni, Yuxiao Li, Jintian Luo, Shaoqing Jiao, Jiajie Peng

    Abstract: Single-cell RNA sequencing (scRNA-seq) data analysis is crucial for biological research, as it enables the precise characterization of cellular heterogeneity. However, manual manipulation of various tools to achieve desired outcomes can be labor-intensive for researchers. To address this, we introduce CellAgent (http://cell.agent4science.cn/), an LLM-driven multi-agent framework, specifically desi…

    Submitted 13 July, 2024; originally announced July 2024.

  10. arXiv:2407.09251  [pdf, other]

    cs.LG cs.AI eess.SP

    Deep Adversarial Defense Against Multilevel-Lp Attacks

    Authors: Ren Wang, Yuxuan Li, Alfred Hero

    Abstract: Deep learning models have shown considerable vulnerability to adversarial attacks, particularly as attacker strategies become more sophisticated. While traditional adversarial training (AT) techniques offer some resilience, they often focus on defending against a single type of attack, e.g., the $\ell_\infty$-norm attack, which can fail for other types. This paper introduces a computationally effi…

    Submitted 12 July, 2024; originally announced July 2024.

  11. arXiv:2407.09048  [pdf, other]

    cs.AI

    KUNPENG: An Embodied Large Model for Intelligent Maritime

    Authors: Naiyao Wang, Tongbang Jiang, Ye Wang, Shaoyang Qiu, Bo Zhang, Xinqiang Xie, Munan Li, Chunliu Wang, Yiyang Wang, Hongxiang Ren, Ruili Wang, Hongjun Shan, Hongbo Liu

    Abstract: Intelligent maritime, as an essential component of smart ocean construction, deeply integrates advanced artificial intelligence technology and data analysis methods, which covers multiple aspects such as smart vessels, route optimization, safe navigation, aiming to enhance the efficiency of ocean resource utilization and the intelligence of transportation networks. However, the complex and dynamic…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 9 pages, 3 figures

  12. arXiv:2407.08586  [pdf, other]

    nucl-ex

    Centrality dependence of Lévy-stable two-pion Bose-Einstein correlations in $\sqrt{s_{_{NN}}}=200$ GeV Au$+$Au collisions

    Authors: PHENIX Collaboration, N. J. Abdulameer, U. Acharya, A. Adare, C. Aidala, N. N. Ajitanand, Y. Akiba, R. Akimoto, H. Al-Ta'ani, J. Alexander, A. Angerami, K. Aoki, N. Apadula, Y. Aramaki, H. Asano, E. C. Aschenauer, E. T. Atomssa, T. C. Awes, B. Azmoun, V. Babintsev, M. Bai, B. Bannier, K. N. Barish, B. Bassalleck, S. Bathe, et al. (377 additional authors not shown)

    Abstract: The PHENIX experiment measured the centrality dependence of two-pion Bose-Einstein correlation functions in $\sqrt{s_{_{NN}}}=200$ GeV Au$+$Au collisions at the Relativistic Heavy Ion Collider at Brookhaven National Laboratory. The data are well represented by Lévy-stable source distributions. The extracted source parameters are the correlation-strength parameter $λ$, the Lévy index of stability…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 401 authors from 75 institutions, 20 pages, 15 figures, 2 tables. v1 is version submitted to Physical Review C. HEPdata tables for the points plotted in figures for this and previous PHENIX publications are (or will be) publicly available at http://www.phenix.bnl.gov/papers.html

  13. arXiv:2407.08213  [pdf, other]

    cs.RO

    PrefCLM: Enhancing Preference-based Reinforcement Learning with Crowdsourced Large Language Models

    Authors: Ruiqi Wang, Dezhong Zhao, Ziqin Yuan, Ike Obi, Byung-Cheol Min

    Abstract: Preference-based reinforcement learning (PbRL) is emerging as a promising approach to teaching robots through human comparative feedback, sidestepping the need for complex reward engineering. However, the substantial volume of feedback required in existing PbRL methods often lead to reliance on synthetic feedback generated by scripted teachers. This approach necessitates intricate reward engineeri…

    Submitted 11 July, 2024; originally announced July 2024.

  14. arXiv:2407.08093  [pdf, other]

    eess.IV cs.AI cs.CV eess.SP

    MemWarp: Discontinuity-Preserving Cardiac Registration with Memorized Anatomical Filters

    Authors: Hang Zhang, Xiang Chen, Renjiu Hu, Dongdong Liu, Gaolei Li, Rongguang Wang

    Abstract: Many existing learning-based deformable image registration methods impose constraints on deformation fields to ensure they are globally smooth and continuous. However, this assumption does not hold in cardiac image registration, where different anatomical regions exhibit asymmetric motions during respiration and movements due to sliding organs within the chest. Consequently, such global constraint…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: 11 pages, 2 figures, 2 tables

  15. arXiv:2407.07094  [pdf, other]

    cs.CL cs.AI

    AnyTaskTune: Advanced Domain-Specific Solutions through Task-Fine-Tuning

    Authors: Jiaxi Cui, Wentao Zhang, Jing Tang, Xudong Tong, Zhenwei Zhang, Amie, Jing Wen, Rongsheng Wang, Pengfei Wu

    Abstract: The pervasive deployment of Large Language Models-LLMs in various sectors often neglects the nuanced requirements of individuals and small organizations, who benefit more from models precisely tailored to their specific business contexts rather than those with broadly superior general capabilities. This work introduces AnyTaskTune, a novel fine-tuning methodology coined as Task-Fi…

    Submitted 9 July, 2024; originally announced July 2024.

  16. arXiv:2407.06838  [pdf, other]

    cs.CR cs.CV

    Event Trojan: Asynchronous Event-based Backdoor Attacks

    Authors: Ruofei Wang, Qing Guo, Haoliang Li, Renjie Wan

    Abstract: As asynchronous event data is more frequently engaged in various vision tasks, the risk of backdoor attacks becomes more evident. However, research into the potential risk associated with backdoor attacks in asynchronous event data has been scarce, leaving related tasks vulnerable to potential threats. This paper has uncovered the possibility of directly poisoning event data streams by proposing E…

    Submitted 14 July, 2024; v1 submitted 9 July, 2024; originally announced July 2024.

    Comments: Accepted by ECCV2024

  17. arXiv:2407.06494  [pdf, other]

    cs.LG cs.AI

    A Generative Approach to Control Complex Physical Systems

    Authors: Long Wei, Peiyan Hu, Ruiqi Feng, Haodong Feng, Yixuan Du, Tao Zhang, Rui Wang, Yue Wang, Zhi-Ming Ma, Tailin Wu

    Abstract: Controlling the evolution of complex physical systems is a fundamental task across science and engineering. Classical techniques suffer from limited applicability or huge computational costs. On the other hand, recent deep learning and reinforcement learning-based approaches often struggle to optimize long-term control sequences under the constraints of system dynamics. In this work, we introduce…

    Submitted 8 July, 2024; originally announced July 2024.

  18. arXiv:2407.06334  [pdf, other]

    cs.AI q-bio.QM

    Double-Ended Synthesis Planning with Goal-Constrained Bidirectional Search

    Authors: Kevin Yu, Jihye Roh, Ziang Li, Wenhao Gao, Runzhong Wang, Connor W. Coley

    Abstract: Computer-aided synthesis planning (CASP) algorithms have demonstrated expert-level abilities in planning retrosynthetic routes to molecules of low to moderate complexity. However, current search methods assume the sufficiency of reaching arbitrary building blocks, failing to address the common real-world constraint where using specific molecules is desired. To this end, we present a formulation of…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 10 pages main, 4 figures

  19. arXiv:2407.05349  [pdf]

    cond-mat.mtrl-sci

    Stable room-temperature multiferroic skyrmions in lithium niobate with enhanced Pockels effect

    Authors: Yalong Yu, Bo Xiong, Siqi Wu, Yekai Ren, Nuo Chen, Qingjiao Mi, Kangping Lou, Rui Wang, Tao Chu

    Abstract: Lithium Niobate (LN) is a ferroelectric material with exceptional electrical characteristics, including high piezoelectricity, high Pockels effect, etc. These properties make it a promising platform for numerous fields such as high-speed communication, optical computation, and quantum information processing. Besides these, the introduction of magnetic structures to LN holds significant potential t…

    Submitted 7 July, 2024; originally announced July 2024.

  20. arXiv:2407.05272  [pdf, other]

    gr-qc

    Quasinormal modes and greybody factor of Schwarzschild Black Hole in the Cold Dark Matter Halo

    Authors: Shi-Jie Ma, Rui-Bo Wang, Tian-Chi Ma, He-Xu Zhang, Jian-Bo Deng, Xian-Ru Hu

    Abstract: In this article, we firstly studied wave function in static spherically symmetric spacetime and obtained effective potential of perturbed fields with spin. Then we applied $6^{\rm{th}}$ order WKB approximation to analyze quasinormal modes of Schwarzschild black hole in the Cold Dark Matter halo in perturbed fields with different spins and derived quasinormal frequencies. Further, to study the rela…

    Submitted 10 July, 2024; v1 submitted 7 July, 2024; originally announced July 2024.

    Comments: 22 pages, 5 figures, 4 tables

  21. arXiv:2407.05169  [pdf, other]

    cs.CV

    DehazeDCT: Towards Effective Non-Homogeneous Dehazing via Deformable Convolutional Transformer

    Authors: Wei Dong, Han Zhou, Ruiyi Wang, Xiaohong Liu, Guangtao Zhai, Jun Chen

    Abstract: Image dehazing, a pivotal task in low-level vision, aims to restore the visibility and detail from hazy images. Many deep learning methods with powerful representation learning capability demonstrate advanced performance on non-homogeneous dehazing, however, these methods usually struggle with processing high-resolution images (e.g., $4000 \times 6000$) due to their heavy computational demands. To…

    Submitted 24 May, 2024; originally announced July 2024.

  22. arXiv:2407.05134  [pdf, other]

    cs.AI cs.CL cs.LG

    Solving for X and Beyond: Can Large Language Models Solve Complex Math Problems with More-Than-Two Unknowns?

    Authors: Kuei-Chun Kao, Ruochen Wang, Cho-Jui Hsieh

    Abstract: Large Language Models (LLMs) have demonstrated remarkable performance in solving math problems, a hallmark of human intelligence. Despite high success rates on current benchmarks; however, these often feature simple problems with only one or two unknowns, which do not sufficiently challenge their reasoning capacities. This paper introduces a novel benchmark, BeyondX, designed to address these limi…

    Submitted 6 July, 2024; originally announced July 2024.

  23. arXiv:2407.04845  [pdf, other]

    cs.NI

    Poster: Flexible Scheduling of Network and Computing Resources for Distributed AI Tasks

    Authors: Ruikun Wang, Jiawei Zhang, Qiaolun Zhang, Bojun Zhang, Zhiqun Gu, Aryanaz Attarpour, Yuefeng Ji, Massimo Tornatore

    Abstract: Many emerging Artificial Intelligence (AI) applications require on-demand provisioning of large-scale computing, which can only be enabled by leveraging distributed computing services interconnected through networking. To address such increasing demand for networking to serve AI tasks, we investigate new scheduling strategies to improve communication efficiency and test them on a programmable test…

    Submitted 5 July, 2024; originally announced July 2024.

  24. arXiv:2407.04077  [pdf, other]

    cs.NI

    Enhancing Physical Layer Security in LEO Satellite-Enabled IoT Network Communications

    Authors: Anna Talgat, Ruibo Wang, Mustafa A. Kishk, Mohamed-Slim Alouini

    Abstract: The extensive deployment of Low Earth Orbit (LEO) satellites introduces significant security challenges for communication security issues in Internet of Things (IoT) networks. With the rising number of satellites potentially acting as eavesdroppers, integrating Physical Layer Security (PLS) into satellite communications has become increasingly critical. However, these studies are facing challenges…

    Submitted 4 July, 2024; originally announced July 2024.

  25. arXiv:2407.03896  [pdf, other]

    eess.SY

    Specification-guided temporal logic control for stochastic systems: a multi-layered approach

    Authors: Birgit C. van Huijgevoort, Ruohan Wang, Sadegh Soudjani, Sofie Haesaert

    Abstract: Designing controllers to satisfy temporal requirements has proven to be challenging for dynamical systems that are affected by uncertainty. This is mainly due to the states evolving in a continuous uncountable space, the stochastic evolution of the states, and infinite-horizon temporal requirements on the system evolution, all of which makes closed-form solutions generally inaccessible. A promisin…

    Submitted 4 July, 2024; originally announced July 2024.

  26. arXiv:2407.03451  [pdf, other]

    cs.CR cs.HC

    The Role of Privacy Guarantees in Voluntary Donation of Private Data for Altruistic Goals

    Authors: Ruizhe Wang, Roberta De Viti, Aarushi Dubey, Elissa M. Redmiles

    Abstract: Voluntary donation of private information for altruistic purposes, such as advancing research, is common. However, concerns about data misuse and leakage may deter individuals from donating their information. While prior research has indicated that Privacy Enhancement Technologies (PETs) can alleviate these concerns, the extent to which these techniques influence willingness to donate data remains…

    Submitted 3 July, 2024; originally announced July 2024.

  27. arXiv:2407.03203  [pdf, other]

    cs.FL cs.AI

    TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts

    Authors: Ruida Wang, Jipeng Zhang, Yizhen Jia, Rui Pan, Shizhe Diao, Renjie Pi, Tong Zhang

    Abstract: Proving mathematical theorems using computer-verifiable formal languages like Lean significantly impacts mathematical reasoning. One approach to formal theorem proving involves generating complete proofs using Large Language Models (LLMs) based on Natural Language (NL) proofs. Similar methods have shown promising results in code generation. However, most modern LLMs exhibit suboptimal performance…

    Submitted 3 July, 2024; originally announced July 2024.

  28. arXiv:2407.02785  [pdf]

    cond-mat.mtrl-sci physics.comp-ph

    Identifying Direct Bandgap Silicon Structures with High-throughput Search and Machine Learning Methods

    Authors: Rui Wang, Hongyu Yu, Yang Zhong, Hongjun Xiang

    Abstract: Utilizations of silicon-based luminescent devices are restricted by the indirect-gap nature of diamond silicon. In this study, the high-throughput method is employed to expedite discoveries of direct-gap silicon crystals. The machine learning (ML) potential is utilized to construct a dataset comprising 2637 silicon allotropes, which is subsequently screened using an ML Hamiltonian model and densit…

    Submitted 2 July, 2024; originally announced July 2024.

  29. arXiv:2407.02404  [pdf, other]

    cs.NI

    Shared-Protected Backup Paths Assignment with Mode Group Division Multiplexing in Optical Networks

    Authors: Jiaheng Xiong, Qiaolun Zhang, Ruikun Wang, Alberto Gatto, Francesco Musumeci, Massimo Tornatore

    Abstract: We evaluate the resource efficiency of Mode Group Division Multiplexing (MGDM) with shared path protection (SPP) in optical networks. On our case studies, SPP with MGDM obtains significant savings in terms of both additional backup spectrum occupation and MIMO-computing resources compared to other few-mode-transmission scenarios.

    Submitted 2 July, 2024; originally announced July 2024.

  30. arXiv:2407.01882  [pdf, other]

    nucl-th

    A sign of three-nucleon short-range correlation from an analysis of nuclear mass and short-range correlation probability

    Authors: Na-Na Ma, Rong Wang

    Abstract: Three-nucleon short-range correlation ($3N$ SRC) represents a rare and intriguing part of the nuclear dynamics at short distance, beyond the two-nucleon short-range correlation ($2N$ SRC). To search its existence is a hot topic in the ongoing and future high-energy nuclear experiments and the developments of nuclear theory. In this study, we found a positive sign of $3N$ SRC in nuclei, by analyzin…

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 7 pages, 3 figures

  31. arXiv:2407.01649  [pdf, other]

    q-bio.QM cs.LG

    FAFE: Immune Complex Modeling with Geodesic Distance Loss on Noisy Group Frames

    Authors: Ruidong Wu, Ruihan Guo, Rui Wang, Shitong Luo, Yue Xu, Jiahan Li, Jianzhu Ma, Qiang Liu, Yunan Luo, Jian Peng

    Abstract: Despite the striking success of general protein folding models such as AlphaFold2(AF2, Jumper et al. (2021)), the accurate computational modeling of antibody-antigen complexes remains a challenging task. In this paper, we first analyze AF2's primary loss function, known as the Frame Aligned Point Error (FAPE), and raise a previously overlooked issue that FAPE tends to face gradient vanishing probl…

    Submitted 1 July, 2024; originally announced July 2024.

  32. arXiv:2407.01606  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV stat.ML

    On Discrete Prompt Optimization for Diffusion Models

    Authors: Ruochen Wang, Ting Liu, Cho-Jui Hsieh, Boqing Gong

    Abstract: This paper introduces the first gradient-based framework for prompt optimization in text-to-image diffusion models. We formulate prompt engineering as a discrete optimization problem over the language space. Two major challenges arise in efficiently finding a solution to this problem: (1) Enormous Domain Space: Setting the domain to the entire language space poses significant difficulty to the opt…

    Submitted 26 June, 2024; originally announced July 2024.

    Comments: ICML 2024. Code available at https://github.com/ruocwang/dpo-diffusion

    MSC Class: 68T01

    Journal ref: Proceedings of the 41st International Conference on Machine Learning (ICML 2024)

  33. arXiv:2407.01358  [pdf, other]

    cs.CL

    Evaluating Knowledge-based Cross-lingual Inconsistency in Large Language Models

    Authors: Xiaolin Xing, Zhiwei He, Haoyu Xu, Xing Wang, Rui Wang, Yu Hong

    Abstract: This paper investigates the cross-lingual inconsistencies observed in Large Language Models (LLMs), such as ChatGPT, Llama, and Baichuan, which have shown exceptional performance in various Natural Language Processing (NLP) tasks. Despite their successes, these models often exhibit significant inconsistencies when processing the same concepts across different languages. This study focuses on three…

    Submitted 1 July, 2024; originally announced July 2024.

  34. arXiv:2407.00874  [pdf, other]

    hep-ph

    A plan for a super $η$ factory at Huizhou accelerator complex

    Authors: Xu-Rong Chen, Xiong-Hong He, Qiang Hu, De-Xu Lin, Yang Liu, Hao Qiu, Xu Sun, Ye Tian, Rong Wang, Hong-Lin Zhang, Ya-Peng Zhang, Cheng-Xin Zhao

    Abstract: As a Goldstone boson with zero quantum number and zero SM charge, the decays of long-lived $η$ ($η^{\prime}$) meson provide a unique window to search new physics beyond the standard model and new sources of CP violation, to test the low-energy QCD theory, and to measure the fundamental parameters of light quarks. For such goals in the physics frontiers we discuss a plan of building a super $η$ fac…

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: 19 pages, 9 figures

  35. arXiv:2407.00372  [pdf, other]

    hep-ph

    Study of semileptonic $B\to DP\ell^+ν_\ell$ decays based on the SU(3) flavor symmetry

    Authors: Ru-Min Wang, Yi-Jie Zhang, Meng-Yuan Wan, Xiao-Dong Cheng, Yuan-Guo Xu

    Abstract: Decays $B\to DP\ell^+ν_\ell~(\ell=e,μ,τ)$ with the non-resonance, the charmed vector resonances, the charmed scalar resonances and the charmed tensor resonances are calculated by using the SU(3) flavor symmetry. Firstly, the decay amplitudes of different modes are related by the SU(3) flavor symmetry. Then, relevant experiential data are used to constrain nonperturbative coefficients in the non-re…

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: 16 pages. arXiv admin note: text overlap with arXiv:2403.14929

  36. arXiv:2407.00256  [pdf, other]

    cs.AI cs.CL cs.LG stat.ML

    One Prompt is not Enough: Automated Construction of a Mixture-of-Expert Prompts

    Authors: Ruochen Wang, Sohyun An, Minhao Cheng, Tianyi Zhou, Sung Ju Hwang, Cho-Jui Hsieh

    Abstract: Large Language Models (LLMs) exhibit strong generalization capabilities to novel tasks when prompted with language instructions and in-context demos. Since this ability sensitively depends on the quality of prompts, various methods have been explored to automate the instruction design. While these methods demonstrated promising results, they also restricted the searched prompt to one instruction.…

    Submitted 28 June, 2024; originally announced July 2024.

    Comments: ICML 2024. code available at https://github.com/ruocwang/mixture-of-prompts

    MSC Class: 68T01

    Journal ref: Proceedings of the 41st International Conference on Machine Learning (ICML), Vienna, Austria, 2024

  37. arXiv:2406.20030  [pdf, other]

    cs.CL

    LEMoE: Advanced Mixture of Experts Adaptor for Lifelong Model Editing of Large Language Models

    Authors: Renzhi Wang, Piji Li

    Abstract: Large language models (LLMs) require continual knowledge updates to stay abreast of the ever-changing world facts, prompting the formulation of lifelong model editing task. While recent years have witnessed the development of various techniques for single and batch editing, these methods either fail to apply or perform sub-optimally when faced with lifelong editing. In this paper, we introduce LEM…

    Submitted 28 June, 2024; originally announced June 2024.

  38. arXiv:2406.19804  [pdf, ps, other]

    cs.IR

    Rateless Stochastic Coding for Delay-constrained Semantic Communication

    Authors: Cheng Peng, Rulong Wang, Yong Xiao

    Abstract: We consider the problem of joint source-channel coding with distortion and perception constraints from a rateless perspective, the purpose of which is to settle the balance between reliability (distortion/perception) and effectiveness (rate) of transmission over uncertain channels. We find a new finite-blocklength bound for the achievable joint source-channel code rate with the above two constrain…

    Submitted 28 June, 2024; originally announced June 2024.

  39. arXiv:2406.19677  [pdf, other]

    cs.NI eess.SP

    End-to-End Uplink Performance Analysis of Satellite-Based IoT Networks: A Stochastic Geometry Approach

    Authors: Jiusi Zhou, Ruibo Wang, Basem Shihada, Mohamed-Slim Alouini

    Abstract: With the deployment of satellite constellations, Internet-of-Things (IoT) devices in remote areas have gained access to low-cost network connectivity. In this paper, we investigate the performance of IoT devices connecting in up-link through low Earth orbit (LEO) satellites to geosynchronous equatorial orbit (GEO) links. We model the dynamic LEO satellite constellation using the stochastic geometr…

    Submitted 28 June, 2024; originally announced June 2024.

  40. arXiv:2406.19605  [pdf, other]

    math.OC

    A Customized Augmented Lagrangian Method for Block-Structured Integer Programming

    Authors: Rui Wang, Chuwen Zhang, Shanwen Pu, Jianjun Gao, Zaiwen Wen

    Abstract: Integer programming with block structures has received considerable attention recently and is widely used in many practical applications such as train timetabling and vehicle routing problems. It is known to be NP-hard due to the presence of integer variables. We define a novel augmented Lagrangian function by directly penalizing the inequality constraints and establish the strong duality between…

    Submitted 27 June, 2024; originally announced June 2024.

  41. arXiv:2406.19532  [pdf, other]

    cs.DM cs.LG

    Dataless Quadratic Neural Networks for the Maximum Independent Set Problem

    Authors: Ismail Alkhouri, Cedric Le Denmat, Yingjie Li, Cunxi Yu, Jia Liu, Rongrong Wang, Alvaro Velasquez

    Abstract: Combinatorial Optimization (CO) plays a crucial role in addressing various significant problems, among them the challenging Maximum Independent Set (MIS) problem. In light of recent advancements in deep learning methods, efforts have been directed towards leveraging data-driven learning approaches, typically rooted in supervised learning and reinforcement learning, to tackle the NP-hard MIS proble…

    Submitted 27 June, 2024; originally announced June 2024.

  42. arXiv:2406.18915  [pdf, other]

    cs.RO cs.CV

    Manipulate-Anything: Automating Real-World Robots using Vision-Language Models

    Authors: Jiafei Duan, Wentao Yuan, Wilbert Pumacay, Yi Ru Wang, Kiana Ehsani, Dieter Fox, Ranjay Krishna

    Abstract: Large-scale endeavors like RT-1 and widespread community efforts such as Open-X-Embodiment have contributed to growing the scale of robot demonstration data. However, there is still an opportunity to improve the quality, quantity, and diversity of robot demonstration data. Although vision-language models have been shown to automatically generate demonstration data, their utility has been limited t…

    Submitted 27 June, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

    Comments: Project page: https://robot-ma.github.io/

  43. arXiv:2406.18873  [pdf, other]

    cs.AR

    LayoutCopilot: An LLM-powered Multi-agent Collaborative Framework for Interactive Analog Layout Design

    Authors: Bingyang Liu, Haoyi Zhang, Xiaohan Gao, Zichen Kong, Xiyuan Tang, Yibo Lin, Runsheng Wang, Ru Huang

    Abstract: Analog layout design heavily involves interactive processes between humans and design tools. The tools are usually designed to use scripting commands or visualized buttons for manipulation, especially for those interactive automation functionalities, which have a steep learning curve and cumbersome user experience, making a notable barrier to their adoption by designers. Aiming to address such a u…

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 8 pages, 8 figures

  44. Blockchain Based Zero-Knowledge Proof of Location in IoT

    Authors: Wei Wu, Erwu Liu, Xinglin Gong, Rui Wang

    Abstract: With the development of precise positioning technology, a growing number of location-based services (LBSs) facilitate people's life. Most LBSs require proof of location (PoL) to prove that the user satisfies the service requirement, which exposes the user's privacy. In this paper, we propose a zero-knowledge proof of location (zk-PoL) protocol to better protect the user's privacy. With the zk-PoL…

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Published in ICC 2020 - 2020 IEEE International Conference on Communications (ICC)

  45. arXiv:2406.18263  [pdf, other]

    physics.chem-ph

    A Pre-trained Deep Potential Model for Sulfide Solid Electrolytes with Broad Coverage and High Accuracy

    Authors: Ruoyu Wang, Mingyu Guo, Yuxiang Gao, Xiaoxu Wang, Yuzhi Zhang, Bin Deng, Xin Chen, Mengchao Shi, Linfeng Zhang, Zhicheng Zhong

    Abstract: Solid electrolytes with fast ion transport are one of the key challenges for solid state lithium metal batteries. To improve ion conductivity, chemical doping has been the most effective strategy, and atomistic simulation with machine-learning potential helps find optimized doping by predicting ion conductivity for arbitrary composition. Yet most existing machine-learning models are trained on nar…

    Submitted 26 June, 2024; originally announced June 2024.

  46. arXiv:2406.17806  [pdf, other]

    cs.CL cs.AI cs.CR cs.CV cs.LG

    MOSSBench: Is Your Multimodal Language Model Oversensitive to Safe Queries?

    Authors: Xirui Li, Hengguang Zhou, Ruochen Wang, Tianyi Zhou, Minhao Cheng, Cho-Jui Hsieh

    Abstract: Humans are prone to cognitive distortions -- biased thinking patterns that lead to exaggerated responses to specific stimuli, albeit in very different contexts. This paper demonstrates that advanced Multimodal Large Language Models (MLLMs) exhibit similar tendencies. While these models are designed to respond queries under safety mechanism, they sometimes reject harmless queries in the presence of…

    Submitted 22 June, 2024; originally announced June 2024.

  47. arXiv:2406.17224  [pdf, other]

    cs.AI cs.CL cs.CV cs.LG cs.SC

    Large Language Models are Interpretable Learners

    Authors: Ruochen Wang, Si Si, Felix Yu, Dorothea Wiesmann, Cho-Jui Hsieh, Inderjit Dhillon

    Abstract: The trade-off between expressiveness and interpretability remains a core challenge when building human-centric predictive models for classification and decision-making. While symbolic rules offer interpretability, they often lack expressiveness, whereas neural networks excel in performance but are known for being black boxes. In this paper, we show a combination of Large Language Models (LLMs) and…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: Preliminary Version, Code at https://github.com/ruocwang/llm-symbolic-program

    MSC Class: 68T05

  48. arXiv:2406.17006  [pdf, other]

    hep-ex

    Probing the nature of the $χ_{c1}(3872)$ state using radiative decays

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1094 additional authors not shown)

    Abstract: The radiative decays $χ_{c1}(3872)\rightarrow ψ(2S)γ$ and $χ_{c1}(3872)\rightarrow J/ψγ$ are used to probe the nature of the $χ_{c1}(3872)$ state using proton-proton collision data collected with the LHCb detector, corresponding to an integrated luminosity of 9 fb$^{-1}$. Using the $B^+\rightarrow χ_{c1}(3872)K^+$ decay, the $χ_{c1}(3872)\rightarrow ψ(2S)γ$ process is observed for the first time and…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 31 pages, 2 figures. All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2024-015.html (LHCb public pages)

    Report number: LHCb-PAPER-2024-015, CERN-EP-2024-157

  49. arXiv:2406.16968  [pdf, other]

    cs.LG cs.AI

    Multimodal Physiological Signals Representation Learning via Multiscale Contrasting for Depression Recognition

    Authors: Kai Shao, Rui Wang, Yixue Hao, Long Hu, Min Chen, Hans Arno Jacobsen

    Abstract: Depression recognition based on physiological signals such as functional near-infrared spectroscopy (fNIRS) and electroencephalogram (EEG) has made considerable progress. However, most existing studies ignore the complementarity and semantic consistency of multimodal physiological signals under the same stimulation task in complex spatio-temporal patterns. In this paper, we introduce a multimodal…

    Submitted 25 June, 2024; v1 submitted 22 June, 2024; originally announced June 2024.

  50. arXiv:2406.16830  [pdf, other]

    stat.ME stat.AP

    Adjusting for Selection Bias Due to Missing Eligibility Criteria in Emulated Target Trials

    Authors: Luke Benz, Rajarshi Mukherjee, Issa Dahabreh, Rui Wang, David Arterburn, Catherine Lee, Heidi Fischer, Susan Shortreed, Sebastien Haneuse

    Abstract: Target trial emulation (TTE) is a popular framework for observational studies based on electronic health records (EHR). A key component of this framework is determining the patient population eligible for inclusion in both a target trial of interest and its observational emulation. Missingness in variables that define eligibility criteria, however, presents a major challenge towards determining th…

    Submitted 24 June, 2024; originally announced June 2024.