
Showing 1–50 of 542 results for author: Jin, R

  1. arXiv:2407.07089  [pdf, other]

    cs.LG

    Fine-Tuning Linear Layers Only Is a Simple yet Effective Way for Task Arithmetic

    Authors: Ruochen Jin, Bojian Hou, Jiancong Xiao, Weijie Su, Li Shen

    Abstract: Task arithmetic has recently emerged as a cost-effective and scalable approach to edit pre-trained models directly in weight space, by adding the fine-tuned weights of different tasks. The performance has been further improved by a linear property which is illustrated by weight disentanglement. Yet, conventional linearization methods (e.g., NTK linearization) not only double the time and training…

    Submitted 9 July, 2024; originally announced July 2024.

  2. arXiv:2407.06612  [pdf]

    eess.IV cs.CV cs.LG

    AI-based Automatic Segmentation of Prostate on Multi-modality Images: A Review

    Authors: Rui Jin, Derun Li, Dehui Xiang, Lei Zhang, Hailing Zhou, Fei Shi, Weifang Zhu, Jing Cai, Tao Peng, Xinjian Chen

    Abstract: Prostate cancer represents a major threat to health. Early detection is vital in reducing the mortality rate among prostate cancer patients. One approach involves using multi-modality (CT, MRI, US, etc.) computer-aided diagnosis (CAD) systems for the prostate region. However, prostate segmentation is challenging due to imperfections in the images and the prostate's complex tissue structure. The ad…

    Submitted 9 July, 2024; originally announced July 2024.

  3. arXiv:2407.04303  [pdf, other]

    physics.atom-ph physics.chem-ph physics.optics

    Transmission spectroscopy of CF$_4$ molecules in intense x-ray fields

    Authors: Rui Jin, Adam Fouda, Alexander Magunia, Yeonsig Nam, Marc Rebholz, Alberto De Fanis, Kai Li, Gilles Doumy, Thomas M. Baumann, Michael Straub, Sergey Usenko, Yevheniy Ovcharenko, Tommaso Mazza, Jacobo Montaño, Marcus Agåker, Maria Novella Piancastelli, Marc Simon, Jan-Erik Rubensson, Michael Meyer, Linda Young, Christian Ott, Thomas Pfeifer

    Abstract: The nonlinear interaction of x-rays with matter is at the heart of understanding and controlling ultrafast molecular dynamics from an atom-specific viewpoint, providing new scientific and analytical opportunities to explore the structure and dynamics of small quantum systems. At increasingly high x-ray intensity, the sensitivity of ultrashort x-ray pulses to specific electronic states and emerging…

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 30 pages, with 7 figures, submitted to Phys. Rev. X

  4. arXiv:2407.02042  [pdf, other]

    cs.CL cs.AI

    Fake News Detection and Manipulation Reasoning via Large Vision-Language Models

    Authors: Ruihan Jin, Ruibo Fu, Zhengqi Wen, Shuai Zhang, Yukun Liu, Jianhua Tao

    Abstract: Fake news becomes a growing threat to information security and public opinion with the rapid sprawl of media manipulation. Therefore, fake news detection attracts widespread attention from academic community. Traditional fake news detection models demonstrate remarkable performance on authenticity binary classification but their ability to reason detailed faked traces based on the news content rem…

    Submitted 2 July, 2024; originally announced July 2024.

  5. arXiv:2407.00983  [pdf, other]

    cs.CV

    FairMedFM: Fairness Benchmarking for Medical Imaging Foundation Models

    Authors: Ruinan Jin, Zikang Xu, Yuan Zhong, Qiongsong Yao, Qi Dou, S. Kevin Zhou, Xiaoxiao Li

    Abstract: The advent of foundation models (FMs) in healthcare offers unprecedented opportunities to enhance medical diagnostics through automated classification and segmentation tasks. However, these models also raise significant concerns about their fairness, especially when applied to diverse and underrepresented populations in healthcare applications. Currently, there is a lack of comprehensive benchmark…

    Submitted 3 July, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

    Comments: 29 pages, 17 figures

  6. arXiv:2406.18406  [pdf, other]

    cs.CL cs.AI

    IRCAN: Mitigating Knowledge Conflicts in LLM Generation via Identifying and Reweighting Context-Aware Neurons

    Authors: Dan Shi, Renren Jin, Tianhao Shen, Weilong Dong, Xinwei Wu, Deyi Xiong

    Abstract: It is widely acknowledged that large language models (LLMs) encode a vast reservoir of knowledge after being trained on mass data. Recent studies disclose knowledge conflicts in LLM generation, wherein outdated or incorrect parametric knowledge (i.e., encoded knowledge) contradicts new knowledge provided in the context. To mitigate such knowledge conflicts, we propose a novel framework, IRCAN (Ide…

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 19 pages, 13 figures, 5 tables

  7. arXiv:2406.14422  [pdf, other]

    cs.CV cs.AI

    FutureNet-LOF: Joint Trajectory Prediction and Lane Occupancy Field Prediction with Future Context Encoding

    Authors: Mingkun Wang, Xiaoguang Ren, Ruochun Jin, Minglong Li, Xiaochuan Zhang, Changqian Yu, Mingxu Wang, Wenjing Yang

    Abstract: Most prior motion prediction endeavors in autonomous driving have inadequately encoded future scenarios, leading to predictions that may fail to accurately capture the diverse movements of agents (e.g., vehicles or pedestrians). To address this, we propose FutureNet, which explicitly integrates initially predicted trajectories into the future scenario and further encodes these future contexts to e…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 10 pages

  8. arXiv:2406.08765  [pdf, other]

    cs.LG

    LLM-based Knowledge Pruning for Time Series Data Analytics on Edge-computing Devices

    Authors: Ruibing Jin, Qing Xu, Min Wu, Yuecong Xu, Dan Li, Xiaoli Li, Zhenghua Chen

    Abstract: Limited by the scale and diversity of time series data, the neural networks trained on time series data often overfit and show unsatisfactory performances. In comparison, large language models (LLMs) recently exhibit impressive generalization in diverse fields. Although massive LLM based approaches are proposed for time series tasks, these methods require to load the whole LLM in both training and…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 12 pages, 5 figures

  9. arXiv:2406.08764  [pdf]

    physics.optics

    Numerical Insights into noise amplification of high-energy mid-infrared supercontinuum generation in normal dispersion multimode fibers

    Authors: Chaofan Yang, Dian Duan, Fan Zou, Kuo Liu, Ruibo Jin, Zechuan Liu, Haoyu Wu

    Abstract: We report on the noise properties of high-energy mid-infrared supercontinuum (MIR-SC) generation in normal dispersion multimode fibers from the numerical perspective. Noise amplification in multi-modes is primarily due to the stimulated Raman scattering (SRS) effect. This leads to the emergence of "incoherent cloud formation" and "incoherent optical wave breaking", similar to those observed in sin…

    Submitted 12 June, 2024; originally announced June 2024.

  10. arXiv:2406.08487  [pdf, other]

    cs.CV

    Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models

    Authors: Yi-Fan Zhang, Qingsong Wen, Chaoyou Fu, Xue Wang, Zhang Zhang, Liang Wang, Rong Jin

    Abstract: Seeing clearly with high resolution is a foundation of Large Multimodal Models (LMMs), which has been proven to be vital for visual perception and reasoning. Existing works usually employ a straightforward resolution upscaling method, where the image consists of global and local branches, with the latter being the sliced image patches but resized to the same resolution as the former. This means th…

    Submitted 13 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: Project page: https://github.com/yfzhang114/SliME

  11. arXiv:2406.05419  [pdf, ps, other]

    math.CO math.LO

    Foundations of iterated star maps and their use in combinatorics

    Authors: Mauro Di Nasso, Renling Jin

    Abstract: We develop a framework for nonstandard analysis that gives foundations to the interplay between external and internal iterations of the star map, and we present a few examples to show the strength and flexibility of such a nonstandard technique for applications in combinatorial number theory.

    Submitted 8 June, 2024; originally announced June 2024.

  12. arXiv:2405.20015  [pdf, other]

    cs.AI cs.CL

    Efficient LLM-Jailbreaking by Introducing Visual Modality

    Authors: Zhenxing Niu, Yuyao Sun, Haodong Ren, Haoxuan Ji, Quan Wang, Xiaoke Ma, Gang Hua, Rong Jin

    Abstract: This paper focuses on jailbreaking attacks against large language models (LLMs), eliciting them to generate objectionable content in response to harmful user queries. Unlike previous LLM-jailbreaks that directly orient to LLMs, our approach begins by constructing a multimodal large language model (MLLM) through the incorporation of a visual module into the target LLM. Subsequently, we conduct an e…

    Submitted 30 May, 2024; originally announced May 2024.

  13. arXiv:2405.19811  [pdf, ps, other]

    cs.LG cs.MA

    Approximate Global Convergence of Independent Learning in Multi-Agent Systems

    Authors: Ruiyang Jin, Zaiwei Chen, Yiheng Lin, Jie Song, Adam Wierman

    Abstract: Independent learning (IL), despite being a popular approach in practice to achieve scalability in large-scale multi-agent systems, usually lacks global convergence guarantees. In this paper, we study two representative algorithms, independent $Q$-learning and independent natural actor-critic, within value-based and policy-based frameworks, and provide the first finite-sample analysis for approxima…

    Submitted 30 May, 2024; originally announced May 2024.

  14. arXiv:2405.17929  [pdf, other]

    cs.CV

    Towards Unified Robustness Against Both Backdoor and Adversarial Attacks

    Authors: Zhenxing Niu, Yuyao Sun, Qiguang Miao, Rong Jin, Gang Hua

    Abstract: Deep Neural Networks (DNNs) are known to be vulnerable to both backdoor and adversarial attacks. In the literature, these two types of attacks are commonly treated as distinct robustness problems and solved separately, since they belong to training-time and inference-time attacks respectively. However, this paper revealed that there is an intriguing connection between them: (1) planting a backdoor…

    Submitted 28 May, 2024; originally announced May 2024.

  15. arXiv:2405.13578  [pdf, other]

    cs.CL

    ConTrans: Weak-to-Strong Alignment Engineering via Concept Transplantation

    Authors: Weilong Dong, Xinwei Wu, Renren Jin, Shaoyang Xu, Deyi Xiong

    Abstract: Ensuring large language models (LLM) behave consistently with human goals, values, and intentions is crucial for their safety but yet computationally expensive. To reduce the computational cost of alignment training of LLMs, especially for those with a huge number of parameters, and to reutilize learned value alignment, we propose ConTrans, a novel framework that enables weak-to-strong alignment t…

    Submitted 22 May, 2024; originally announced May 2024.

  16. arXiv:2405.12794  [pdf, other]

    quant-ph physics.optics

    Multiphoton Quantum Imaging using Natural Light

    Authors: Fatemeh Mostafavi, Mingyuan Hong, Riley B. Dawkins, Jannatul Ferdous, Rui-Bo Jin, Roberto de J. Leon-Montiel, Chenglong You, Omar S. Magana-Loaiza

    Abstract: It is thought that schemes for quantum imaging are fragile against realistic environments in which the background noise is often stronger than the nonclassical signal of the imaging photons. Unfortunately, it is unfeasible to produce brighter quantum light sources to alleviate this problem. Here, we overcome this paradigmatic limitation by developing a quantum imaging scheme that relies on the use…

    Submitted 21 May, 2024; originally announced May 2024.

  17. arXiv:2405.11441  [pdf, other]

    cs.IR cs.CL

    EmbSum: Leveraging the Summarization Capabilities of Large Language Models for Content-Based Recommendations

    Authors: Chiyu Zhang, Yifei Sun, Minghao Wu, Jun Chen, Jie Lei, Muhammad Abdul-Mageed, Rong Jin, Angli Liu, Ji Zhu, Sem Park, Ning Yao, Bo Long

    Abstract: Content-based recommendation systems play a crucial role in delivering personalized content to users in the digital world. In this work, we introduce EmbSum, a novel framework that enables offline pre-computations of users and candidate items while capturing the interactions within the user engagement history. By utilizing the pretrained encoder-decoder model and poly-attention layers, EmbSum deri…

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: Under review

  18. arXiv:2405.10142  [pdf, other]

    cs.RO

    GS-Planner: A Gaussian-Splatting-based Planning Framework for Active High-Fidelity Reconstruction

    Authors: Rui Jin, Yuman Gao, Yingjian Wang, Haojian Lu, Fei Gao

    Abstract: Active reconstruction technique enables robots to autonomously collect scene data for full coverage, relieving users from tedious and time-consuming data capturing process. However, designed based on unsuitable scene representations, existing methods show unrealistic reconstruction results or the inability of online quality evaluation. Due to the recent advancements in explicit radiance field tech…

    Submitted 24 May, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

  19. arXiv:2405.08929  [pdf, other]

    cond-mat.mtrl-sci

    Size and Shape Dependence of Hydrogen-Induced Phase Transformation and Sorption Hysteresis in Palladium Nanoparticles

    Authors: Xingsheng Sun, Rong Jin

    Abstract: We establish a computational framework to explore the atomic configuration of a metal-hydrogen (M-H) system when in equilibrium with a H environment. This approach combines Diffusive Molecular Dynamics with an iteration strategy, aiming to minimize the system's free energy and ensure uniform chemical potential across the system that matches that of the H environment. Applying this framework, we in…

    Submitted 14 May, 2024; originally announced May 2024.

  20. arXiv:2405.05741  [pdf, ps, other]

    cs.CL cs.AI

    Can large language models understand uncommon meanings of common words?

    Authors: Jinyang Wu, Feihu Che, Xinxin Zheng, Shuai Zhang, Ruihan Jin, Shuai Nie, Pengpeng Shao, Jianhua Tao

    Abstract: Large language models (LLMs) like ChatGPT have shown significant advancements across diverse natural language understanding (NLU) tasks, including intelligent dialogue and autonomous agents. Yet, lacking widely acknowledged testing mechanisms, answering `whether LLMs are stochastic parrots or genuinely comprehend the world' remains unclear, fostering numerous studies and sparking heated debates. P…

    Submitted 9 May, 2024; originally announced May 2024.

  21. arXiv:2405.04434  [pdf, other]

    cs.CL cs.AI

    DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

    Authors: DeepSeek-AI, Aixin Liu, Bei Feng, Bin Wang, Bingxuan Wang, Bo Liu, Chenggang Zhao, Chengqi Dengr, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Hanwei Xu, Hao Yang, Haowei Zhang, Honghui Ding, et al. (132 additional authors not shown)

    Abstract: We present DeepSeek-V2, a strong Mixture-of-Experts (MoE) language model characterized by economical training and efficient inference. It comprises 236B total parameters, of which 21B are activated for each token, and supports a context length of 128K tokens. DeepSeek-V2 adopts innovative architectures including Multi-head Latent Attention (MLA) and DeepSeekMoE. MLA guarantees efficient inference…

    Submitted 19 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

  22. arXiv:2405.00365  [pdf, other]

    cs.IT eess.SP

    Robust Continuous-Time Beam Tracking with Liquid Neural Network

    Authors: Fenghao Zhu, Xinquan Wang, Chongwen Huang, Richeng Jin, Qianqian Yang, Ahmed Alhammadi, Zhaoyang Zhang, Chau Yuen, Mérouane Debbah

    Abstract: Millimeter-wave (mmWave) technology is increasingly recognized as a pivotal technology of the sixth-generation communication networks due to the large amounts of available spectrum at high frequencies. However, the huge overhead associated with beam training imposes a significant challenge in mmWave communications, particularly in urban environments with high background noise. To reduce this high…

    Submitted 1 May, 2024; originally announced May 2024.

  23. arXiv:2404.11070  [pdf]

    cs.CV eess.SP

    Sky-GVIO: an enhanced GNSS/INS/Vision navigation with FCN-based sky-segmentation in urban canyon

    Authors: Jingrong Wang, Bo Xu, Ronghe Jin, Shoujian Zhang, Kefu Gao, Jingnan Liu

    Abstract: Accurate, continuous, and reliable positioning is a critical component of achieving autonomous driving. However, in complex urban canyon environments, the vulnerability of a stand-alone sensor and non-line-of-sight (NLOS) caused by high buildings, trees, and elevated structures seriously affect positioning results. To address these challenges, a sky-view images segmentation algorithm based on Full…

    Submitted 17 April, 2024; originally announced April 2024.

  24. arXiv:2404.07509   

    quant-ph

    Multiparameter cascaded quantum interferometer

    Authors: Baihong Li, Zhuo-zhuo Wang, Qi-qi Li, Changhua Chen, Boxin Yuan, Yiwei Zhai, Rui-Bo Jin, Xiaofei Zhang

    Abstract: We theoretically propose a multiparameter cascaded quantum interferometer in which a two-input and two-output setup is obtained by concatenating 50:50 beam splitters with n independent and adjustable time delays. A general method for deriving the coincidence probability of such an interferometer is given based on the linear transformation of the matrix of beam splitters. As examples, we analyze th…

    Submitted 8 May, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

    Comments: We have found a serious error in this version, which may mislead readers

  25. arXiv:2404.07421  [pdf, other]

    quant-ph

    Controllable transitions among phase-matching conditions in a single nonlinear crystal

    Authors: Zi-Qi Zeng, Shi-Xin You, Zi-Xiang Yang, Chenzhi Yuan, Chenglong You, Rui-Bo Jin

    Abstract: Entangled photon pairs are crucial resources for quantum information processing protocols. Via the process of spontaneous parametric down-conversion (SPDC), we can generate these photon pairs using bulk nonlinear crystals. Traditionally, the crystal is designed to satisfy specific type of phase-matching condition. Here, we report controllable transitions among different types of phase-matching in…

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 8 pages, 3 figures

    Journal ref: Chinese Optics Letters, 22(2), 021901(2024)

  26. arXiv:2404.07074  [pdf]

    cond-mat.mtrl-sci physics.app-ph

    Multiscale structure-property discovery via active learning in scanning tunneling microscopy

    Authors: Ganesh Narasimha, Dejia Kong, Paras Regmi, Rongying Jin, Zheng Gai, Rama Vasudevan, Maxim Ziatdinov

    Abstract: Atomic arrangements and local sub-structures fundamentally influence emergent material functionalities. The local structures are conventionally probed using spatially resolved studies and the property correlations are usually deciphered by a researcher based on sequential explorations and auxiliary information, thus limiting the throughput efficiency. Here we demonstrate a Bayesian deep learning b…

    Submitted 10 April, 2024; originally announced April 2024.

  27. arXiv:2403.19723  [pdf, other]

    cs.CL cs.AI cs.DB cs.MM

    HGT: Leveraging Heterogeneous Graph-enhanced Large Language Models for Few-shot Complex Table Understanding

    Authors: Rihui Jin, Yu Li, Guilin Qi, Nan Hu, Yuan-Fang Li, Jiaoyan Chen, Jianan Wang, Yongrui Chen, Dehai Min

    Abstract: Table understanding (TU) has achieved promising advancements, but it faces the challenges of the scarcity of manually labeled tables and the presence of complex table structures. To address these challenges, we propose HGT, a framework with a heterogeneous graph (HG)-enhanced large language model (LLM) to tackle few-shot TU tasks. It leverages the LLM by aligning the table semantics with the LLM's p…

    Submitted 27 March, 2024; originally announced March 2024.

  28. arXiv:2403.14949  [pdf, other]

    cs.LG

    Addressing Concept Shift in Online Time Series Forecasting: Detect-then-Adapt

    Authors: YiFan Zhang, Weiqi Chen, Zhaoyang Zhu, Dalin Qin, Liang Sun, Xue Wang, Qingsong Wen, Zhang Zhang, Liang Wang, Rong Jin

    Abstract: Online updating of time series forecasting models aims to tackle the challenge of concept drifting by adjusting forecasting models based on streaming data. While numerous algorithms have been developed, most of them focus on model design and updating. In practice, many of these methods struggle with continuous performance regression in the face of accumulated concept drifts over time. To address t…

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 7 figures, 14 pages. arXiv admin note: text overlap with arXiv:2309.12659

  29. arXiv:2403.12601  [pdf, other]

    cs.CL

    LHMKE: A Large-scale Holistic Multi-subject Knowledge Evaluation Benchmark for Chinese Large Language Models

    Authors: Chuang Liu, Renren Jin, Yuqi Ren, Deyi Xiong

    Abstract: Chinese Large Language Models (LLMs) have recently demonstrated impressive capabilities across various NLP benchmarks and real-world applications. However, the existing benchmarks for comprehensively evaluating these LLMs are still insufficient, particularly in terms of measuring knowledge that LLMs capture. Current datasets collect questions from Chinese examinations across different subjects and…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Accepted by LREC-COLING 2024

  30. arXiv:2403.12316  [pdf, other]

    cs.CL

    OpenEval: Benchmarking Chinese LLMs across Capability, Alignment and Safety

    Authors: Chuang Liu, Linhao Yu, Jiaxuan Li, Renren Jin, Yufei Huang, Ling Shi, Junhui Zhang, Xinmeng Ji, Tingting Cui, Tao Liu, Jinwang Song, Hongying Zan, Sun Li, Deyi Xiong

    Abstract: The rapid development of Chinese large language models (LLMs) poses big challenges for efficient LLM evaluation. While current initiatives have introduced new benchmarks or evaluation platforms for assessing Chinese LLMs, many of these focus primarily on capabilities, usually overlooking potential alignment and safety issues. To address this gap, we introduce OpenEval, an evaluation testbed that b…

    Submitted 18 March, 2024; originally announced March 2024.

  31. arXiv:2403.11693  [pdf, other]

    cs.IT eess.SP

    Beamforming Design for Semantic-Bit Coexisting Communication System

    Authors: Maojun Zhang, Guangxu Zhu, Richeng Jin, Xiaoming Chen, Qingjiang Shi, Caijun Zhong, Kaibin Huang

    Abstract: Semantic communication (SemCom) is emerging as a key technology for future sixth-generation (6G) systems. Unlike traditional bit-level communication (BitCom), SemCom directly optimizes performance at the semantic level, leading to superior communication efficiency. Nevertheless, the task-oriented nature of SemCom renders it challenging to completely replace BitCom. Consequently, it is desired to c…

    Submitted 22 March, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: Submitted to IEEE for possible publication

  32. arXiv:2403.07747  [pdf, other]

    cs.CL cs.AI

    FineMath: A Fine-Grained Mathematical Evaluation Benchmark for Chinese Large Language Models

    Authors: Yan Liu, Renren Jin, Lin Shi, Zheng Yao, Deyi Xiong

    Abstract: To thoroughly assess the mathematical reasoning abilities of Large Language Models (LLMs), we need to carefully curate evaluation datasets covering diverse mathematical concepts and mathematical problems at different difficulty levels. In pursuit of this objective, we propose FineMath in this paper, a fine-grained mathematical evaluation benchmark dataset for assessing Chinese LLMs. FineMath is cr…

    Submitted 12 March, 2024; originally announced March 2024.

  33. arXiv:2403.06104  [pdf, other]

    cs.CV

    Debiased Noise Editing on Foundation Models for Fair Medical Image Classification

    Authors: Ruinan Jin, Wenlong Deng, Minghui Chen, Xiaoxiao Li

    Abstract: In the era of Foundation Models' (FMs) rising prominence in AI, our study addresses the challenge of biases in medical images while the model operates in black-box (e.g., using FM API), particularly spurious correlations between pixels and sensitive attributes. Traditional methods for bias mitigation face limitations due to the restricted access to web-hosted FMs and difficulties in addressing the…

    Submitted 12 July, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

    Comments: 13 pages, 3 figures. Accepted by MICCAI 2024

  34. arXiv:2403.05935  [pdf, ps, other]

    math.NA math.OC

    Unique reconstruction for discretized inverse problems: a random sketching approach

    Authors: Ruhui Jin, Qin Li, Anjali Nair, Samuel Stechmann

    Abstract: Inverse problem theory is often studied in the ideal infinite-dimensional setting. Through the lens of the PDE-constrained optimization, the well-posedness PDE theory suggests unique reconstruction of the parameter function that attain the zero-loss property of the mismatch function, when infinite amount of data is provided. Unfortunately, this is not the case in practice, when we are limited to f…

    Submitted 9 March, 2024; originally announced March 2024.

    MSC Class: 65M32; 49M41; 65F35

  35. arXiv:2403.05478  [pdf, other]

    cs.RO

    HGIC: A Hand Gesture Based Interactive Control System for Efficient and Scalable Multi-UAV Operations

    Authors: Mengsha Hu, Jinzhou Li, Runxiang Jin, Chao Shi, Lei Xu, Rui Liu

    Abstract: As technological advancements continue to expand the capabilities of multi unmanned-aerial-vehicle systems (mUAV), human operators face challenges in scalability and efficiency due to the complex cognitive load and operations associated with motion adjustments and team coordination. Such cognitive demands limit the feasible size of mUAV teams and necessitate extensive operator training, impeding b…

    Submitted 8 March, 2024; originally announced March 2024.

  36. arXiv:2403.05472  [pdf, other]

    cs.RO

    Federated Joint Learning of Robot Networks in Stroke Rehabilitation

    Authors: Xinyu Jiang, Yibei Guo, Mengsha Hu, Ruoming Jin, Hai Phan, Jay Alberts, Rui Liu

    Abstract: Advanced by rich perception and precise execution, robots possess immense potential to provide professional and customized rehabilitation exercises for patients with mobility impairments caused by strokes. Autonomous robotic rehabilitation significantly reduces human workloads in the long and tedious rehabilitation process. However, training a rehabilitation robot is challenging due to the data sc…

    Submitted 8 March, 2024; originally announced March 2024.

  37. arXiv:2403.05262  [pdf, other]

    cs.CV

    Debiasing Multimodal Large Language Models

    Authors: Yi-Fan Zhang, Weichen Yu, Qingsong Wen, Xue Wang, Zhang Zhang, Liang Wang, Rong Jin, Tieniu Tan

    Abstract: In the realms of computer vision and natural language processing, Large Vision-Language Models (LVLMs) have become indispensable tools, proficient in generating textual descriptions based on visual inputs. Despite their advancements, our investigation reveals a noteworthy bias in the generated content, where the output is primarily influenced by the underlying Large Language Models (LLMs) prior ra…

    Submitted 27 March, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Comments: 38 pages, 17 figures

  38. arXiv:2403.04911  [pdf, ps, other]

    math.PR math-ph

    Fractional stochastic Landau-Lifshitz Navier-Stokes equations in dimension $d \geq 3$: Existence and (non-)triviality

    Authors: Ruhong Jin, Nicolas Perkowski

    Abstract: We investigate fractional stochastic Navier-Stokes equations in $d\ge 3$, driven by the random force $(-\Delta)^{\frac{\theta}{2}}\xi$ which, as we show, corresponds to a fractional version of the Landau-Lifshitz random force in the physics literature. We obtain the existence and uniqueness of martingale solutions on the torus $\mathbb T^d$ for $\theta > \frac{d}{2}$. For $\theta \le 1$ the equation is supercritical and we…

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 24 pages

  39. arXiv:2403.03645  [pdf, other]

    cs.AI

    K-Link: Knowledge-Link Graph from LLMs for Enhanced Representation Learning in Multivariate Time-Series Data

    Authors: Yucheng Wang, Ruibing Jin, Min Wu, Xiaoli Li, Lihua Xie, Zhenghua Chen

    Abstract: Sourced from various sensors and organized chronologically, Multivariate Time-Series (MTS) data involves crucial spatial-temporal dependencies, e.g., correlations among sensors. To capture these dependencies, Graph Neural Networks (GNNs) have emerged as powerful tools, yet their effectiveness is restricted by the quality of graph construction from MTS data. Typically, existing approaches construct…

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 12 pages, 7 figures

  40. arXiv:2403.02977  [pdf, other]

    cs.RO

    Fast Iterative Region Inflation for Computing Large 2-D/3-D Convex Regions of Obstacle-Free Space

    Authors: Qianhao Wang, Zhepei Wang, Mingyang Wang, Jialin Ji, Zhichao Han, Tianyue Wu, Rui Jin, Yuman Gao, Chao Xu, Fei Gao

    Abstract: Convex polytopes have compact representations and exhibit convexity, which makes them suitable for abstracting obstacle-free spaces from various environments. Existing methods for generating convex polytopes always struggle to strike a balance between two requirements, producing high-quality polytope and efficiency. Moreover, another crucial requirement for convex polytopes to accurately contain c…

    Submitted 6 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  41. arXiv:2403.00997  [pdf, other]

    cond-mat.mtrl-sci

    Thermoelectric Transport in Weyl Semimetal BaMnSb2: a First-Principles Study

    Authors: Yubi Chen, Rongying Jin, Bolin Liao, Sai Mu

    Abstract: Topological materials are often associated with exceptional thermoelectric properties. Orthorhombic BaMnSb2 is a topological semimetal consisting of alternating layers of Ba, Sb, and MnSb. A recent experiment demonstrates that BaMnSb2 has a low thermal conductivity and modest thermopower, promising as a thermoelectric material. Through first-principles calculations with Coulomb repulsion and spin-…

    Submitted 1 March, 2024; originally announced March 2024.

  42. arXiv:2402.18023  [pdf, other]

    cs.AI cs.CL

    Do Large Language Models Mirror Cognitive Language Processing?

    Authors: Yuqi Ren, Renren Jin, Tongxuan Zhang, Deyi Xiong

    Abstract: Large Language Models (LLMs) have demonstrated remarkable abilities in text comprehension and logical reasoning, indicating that the text representations learned by LLMs can facilitate their language processing capabilities. In cognitive science, brain cognitive processing signals are typically utilized to study human language processing. Therefore, it is natural to ask how well the text embedding…

    Submitted 28 May, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

  43. arXiv:2402.16775  [pdf, other]

    cs.CL cs.AI

    A Comprehensive Evaluation of Quantization Strategies for Large Language Models

    Authors: Renren Jin, Jiangcun Du, Wuwei Huang, Wei Liu, Jian Luan, Bin Wang, Deyi Xiong

    Abstract: Increasing the number of parameters in large language models (LLMs) usually improves performance in downstream tasks but raises compute and memory costs, making deployment difficult in resource-limited settings. Quantization techniques, which reduce the bits needed for model weights or activations with minimal performance loss, have become popular due to the rise of LLMs. However, most quantizatio…

    Submitted 6 June, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: ACL 2024 Findings

  44. arXiv:2402.12869  [pdf, other]

    cs.CL

    Exploring the Impact of Table-to-Text Methods on Augmenting LLM-based Question Answering with Domain Hybrid Data

    Authors: Dehai Min, Nan Hu, Rihui Jin, Nuo Lin, Jiaoyan Chen, Yongrui Chen, Yu Li, Guilin Qi, Yun Li, Nijun Li, Qianren Wang

    Abstract: Augmenting Large Language Models (LLMs) for Question Answering (QA) with domain specific data has attracted wide attention. However, domain data often exists in a hybrid format, including text and semi-structured tables, posing challenges for the seamless integration of information. Table-to-Text Generation is a promising solution by facilitating the transformation of hybrid data into a uniformly…

    Submitted 9 April, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: Accepted to NAACL 2024 Industry Track Paper

  45. arXiv:2402.10816  [pdf, other]

    cs.LG cs.CR cs.DC eess.SP

    TernaryVote: Differentially Private, Communication Efficient, and Byzantine Resilient Distributed Optimization on Heterogeneous Data

    Authors: Richeng Jin, Yujie Gu, Kai Yue, Xiaofan He, Zhaoyang Zhang, Huaiyu Dai

    Abstract: Distributed training of deep neural networks faces three critical challenges: privacy preservation, communication efficiency, and robustness to fault and adversarial behaviors. Although significant research efforts have been devoted to addressing these challenges independently, their synthesis remains less explored. In this paper, we propose TernaryVote, which combines a ternary compressor and the…

    Submitted 16 February, 2024; originally announced February 2024.

  46. arXiv:2402.10555  [pdf, other]

    cs.IR cs.CL

    SPAR: Personalized Content-Based Recommendation via Long Engagement Attention

    Authors: Chiyu Zhang, Yifei Sun, Jun Chen, Jie Lei, Muhammad Abdul-Mageed, Sinong Wang, Rong Jin, Sem Park, Ning Yao, Bo Long

    Abstract: Leveraging users' long engagement histories is essential for personalized content recommendations. The success of pretrained language models (PLMs) in NLP has led to their use in encoding user histories and candidate items, framing content recommendations as textual semantic matching tasks. However, existing works still struggle with processing very long user historical text and insufficient user-…

    Submitted 21 May, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: Under review

  47. arXiv:2402.05830  [pdf, other]

    cs.LG cs.AI

    Sparse-VQ Transformer: An FFN-Free Framework with Vector Quantization for Enhanced Time Series Forecasting

    Authors: Yanjun Zhao, Tian Zhou, Chao Chen, Liang Sun, Yi Qian, Rong Jin

    Abstract: Time series analysis is vital for numerous applications, and transformers have become increasingly prominent in this domain. Leading methods customize the transformer architecture from NLP and CV, utilizing a patching technique to convert continuous signals into segments. Yet, time series data are uniquely challenging due to significant distribution shifts and intrinsic noise levels. To address th…

    Submitted 8 February, 2024; originally announced February 2024.

  48. arXiv:2402.05823  [pdf, other]

    cs.LG cs.AI cs.CV

    FusionSF: Fuse Heterogeneous Modalities in a Vector Quantized Framework for Robust Solar Power Forecasting

    Authors: Ziqing Ma, Wenwei Wang, Tian Zhou, Chao Chen, Bingqing Peng, Liang Sun, Rong Jin

    Abstract: Accurate solar power forecasting is crucial to integrate photovoltaic plants into the electric grid, schedule and secure the power grid safety. This problem becomes more demanding for those newly installed solar plants which lack sufficient data. Current research predominantly relies on historical solar power data or numerical weather prediction in a single-modality format, ignoring the complement…

    Submitted 8 February, 2024; originally announced February 2024.

  49. arXiv:2402.05370  [pdf, other]

    cs.LG cs.AI

    Attention as Robust Representation for Time Series Forecasting

    Authors: PeiSong Niu, Tian Zhou, Xue Wang, Liang Sun, Rong Jin

    Abstract: Time series forecasting is essential for many practical applications, with the adoption of transformer-based models on the rise due to their impressive performance in NLP and CV. Transformers' key feature, the attention mechanism, dynamically fuses embeddings to enhance data representation, often relegating attention weights to a byproduct role. Yet, time series data, characterized by noise and n…

    Submitted 7 February, 2024; originally announced February 2024.

  50. arXiv:2402.02309  [pdf, other]

    cs.LG cs.CL cs.CR cs.CV

    Jailbreaking Attack against Multimodal Large Language Model

    Authors: Zhenxing Niu, Haodong Ren, Xinbo Gao, Gang Hua, Rong Jin

    Abstract: This paper focuses on jailbreaking attacks against multi-modal large language models (MLLMs), seeking to elicit MLLMs to generate objectionable responses to harmful user queries. A maximum likelihood-based algorithm is proposed to find an \emph{image Jailbreaking Prompt} (imgJP), enabling jailbreaks against MLLMs across multiple unseen prompts and images (i.e., data-universal property). Our approa…

    Submitted 3 February, 2024; originally announced February 2024.