Showing 1–50 of 742 results for author: Jiang, F

  1. arXiv:2407.08174  [pdf, other]

    cs.HC q-bio.NC

    An Adaptively Weighted Averaging Method for Regional Time Series Extraction of fMRI-based Brain Decoding

    Authors: Jianfei Zhu, Baichun Wei, Jiaru Tian, Feng Jiang, Chunzhi Yi

    Abstract: Brain decoding that classifies cognitive states using the functional fluctuations of the brain can provide insightful information for understanding the brain mechanisms of cognitive functions. Among the common procedures of decoding the brain cognitive states with functional magnetic resonance imaging (fMRI), extracting the time series of each brain region after brain parcellation traditionally av…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 17 pages, 4 figures

    ACM Class: J.3

  2. arXiv:2407.00339  [pdf, other]

    physics.optics physics.acc-ph

    On-chip high energy photon radiation source based on microwave-dielectric undulator

    Authors: Fuming Jiang, Xinyu Xie, Chengpu Liu, Ye Tian

    Abstract: A new on-chip light source configuration has been proposed, which utilizes the interaction between microwave and a dielectric nanopillar array to generate a periodic electromagnetic near field, and applies periodic transverse acceleration to relativistic electrons to generate high-energy photon radiation. Here the dielectric nanopillar array interacting with microwave acts as the electron undulato…

    Submitted 29 June, 2024; originally announced July 2024.

  3. arXiv:2407.00020  [pdf, other]

    cs.CV cs.AI cs.CL cs.IT cs.LG

    Visual Language Model based Cross-modal Semantic Communication Systems

    Authors: Feibo Jiang, Chuanguo Tang, Li Dong, Kezhi Wang, Kun Yang, Cunhua Pan

    Abstract: Semantic Communication (SC) has emerged as a novel communication paradigm in recent years, successfully transcending the Shannon physical capacity limits through innovative semantic transmission concepts. Nevertheless, extant Image Semantic Communication (ISC) systems face several challenges in dynamic environments, including low semantic density, catastrophic forgetting, and uncertain Signal-to-N…

    Submitted 6 May, 2024; originally announced July 2024.

    Comments: 12 pages, 10 figures

  4. arXiv:2406.18034  [pdf, other]

    cs.CL

    LLMs for Doctors: Leveraging Medical LLMs to Assist Doctors, Not Replace Them

    Authors: Wenya Xie, Qingying Xiao, Yu Zheng, Xidong Wang, Junying Chen, Ke Ji, Anningzhe Gao, Xiang Wan, Feng Jiang, Benyou Wang

    Abstract: The recent success of Large Language Models (LLMs) has had a significant impact on the healthcare field, providing patients with medical advice, diagnostic information, and more. However, due to a lack of professional medical knowledge, patients are easily misled by generated erroneous information from LLMs, which may result in serious medical problems. To address this issue, we focus on tuning th…

    Submitted 25 June, 2024; originally announced June 2024.

  5. arXiv:2406.14812  [pdf, other]

    hep-lat cond-mat.stat-mech

    Berezinskii--Kosterlitz--Thouless transition of the two-dimensional $XY$ model on the honeycomb lattice

    Authors: Fu-Jiun Jiang

    Abstract: The Berezinskii--Kosterlitz--Thouless (BKT) transition of the two-dimensional $XY$ model on the honeycomb lattice is investigated using both the techniques of Neural Network (NN) and Monte Carlo simulations. It is demonstrated in the literature that with certain plausible assumptions, the associated critical temperature $T_{\text{BKT,H}}$ is found to be $\frac{1}{\sqrt{2}}$ exactly. Surprisingly,…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 9 pages, 13 figures

  6. arXiv:2406.14115  [pdf, other]

    cs.CL

    Take the essence and discard the dross: A Rethinking on Data Selection for Fine-Tuning Large Language Models

    Authors: Ziche Liu, Rui Ke, Feng Jiang, Haizhou Li

    Abstract: Data selection for fine-tuning Large Language Models (LLMs) aims to select a high-quality subset from a given candidate dataset to train a Pending Fine-tune Model (PFM) into a Selective-Enhanced Model (SEM). It can improve the model performance and accelerate the training process. Although a few surveys have investigated related works of data selection, there is a lack of comprehensive comparison…

    Submitted 20 June, 2024; originally announced June 2024.

  7. arXiv:2406.12935  [pdf, other]

    cs.CR cs.AI cs.LG

    ChatBug: A Common Vulnerability of Aligned LLMs Induced by Chat Templates

    Authors: Fengqing Jiang, Zhangchen Xu, Luyao Niu, Bill Yuchen Lin, Radha Poovendran

    Abstract: Large language models (LLMs) are expected to follow instructions from users and engage in conversations. Techniques to enhance LLMs' instruction-following capabilities typically fine-tune them using data structured according to a predefined chat template. Although chat templates are shown to be effective in optimizing LLM performance, their impact on safety alignment of LLMs has been less understo…

    Submitted 16 June, 2024; originally announced June 2024.

  8. arXiv:2406.12257  [pdf, other]

    cs.AI cs.CR

    CleanGen: Mitigating Backdoor Attacks for Generation Tasks in Large Language Models

    Authors: Yuetai Li, Zhangchen Xu, Fengqing Jiang, Luyao Niu, Dinuka Sahabandu, Bhaskar Ramasubramanian, Radha Poovendran

    Abstract: The remarkable performance of large language models (LLMs) in generation tasks has enabled practitioners to leverage publicly available models to power custom applications, such as chatbots and virtual assistants. However, the data used to train or fine-tune these LLMs is often undisclosed, allowing an attacker to compromise the data and inject backdoors into the models. In this paper, we develop…

    Submitted 18 June, 2024; originally announced June 2024.

  9. arXiv:2406.08464  [pdf, other]

    cs.CL cs.AI

    Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing

    Authors: Zhangchen Xu, Fengqing Jiang, Luyao Niu, Yuntian Deng, Radha Poovendran, Yejin Choi, Bill Yuchen Lin

    Abstract: High-quality instruction data is critical for aligning large language models (LLMs). Although some models, such as Llama-3-Instruct, have open weights, their alignment data remain private, which hinders the democratization of AI. High human labor costs and a limited, predefined scope for prompting prevent existing open-source data creation methods from scaling effectively, potentially limiting the…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Link: https://magpie-align.github.io/

  10. arXiv:2406.04170  [pdf]

    cs.LG cs.AI cs.NE

    Element-wise Multiplication Based Physics-informed Neural Networks

    Authors: Feilong Jiang, Xiaonan Hou, Min Xia

    Abstract: As a promising framework for resolving partial differential equations (PDEs), physics-informed neural networks (PINNs) have received widespread attention from industrial and scientific fields. However, lack of expressive ability and initialization pathology issues are found to prevent the application of PINNs in complex PDEs. In this work, we propose Element-wise Multiplication Based Physics-infor…

    Submitted 16 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

  11. arXiv:2406.04150  [pdf, other]

    stat.ME stat.ML

    A novel robust meta-analysis model using the $t$ distribution for outlier accommodation and detection

    Authors: Yue Wang, Jianhua Zhao, Fen Jiang, Lei Shi, Jianxin Pan

    Abstract: Random effects meta-analysis model is an important tool for integrating results from multiple independent studies. However, the standard model is based on the assumption of normal distributions for both random effects and within-study errors, making it susceptible to outlying studies. Although robust modeling using the $t$ distribution is an appealing idea, the existing work, that explores the use…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 15 pages, 7 figures

    MSC Class: 62P10 ACM Class: I.2.6

  12. arXiv:2406.02424  [pdf, ps, other]

    cs.LG math.ST stat.ME

    Contextual Dynamic Pricing: Algorithms, Optimality, and Local Differential Privacy Constraints

    Authors: Zifeng Zhao, Feiyu Jiang, Yi Yu

    Abstract: We study the contextual dynamic pricing problem where a firm sells products to $T$ sequentially arriving consumers that behave according to an unknown demand model. The firm aims to maximize its revenue, i.e. minimize its regret over a clairvoyant that knows the model in advance. The demand model is a generalized linear model (GLM), allowing for a stochastic feature vector in $\mathbb R^d$ that en…

    Submitted 4 June, 2024; originally announced June 2024.

  13. arXiv:2405.20975  [pdf, other]

    cs.CR cs.AI cs.LG

    ACE: A Model Poisoning Attack on Contribution Evaluation Methods in Federated Learning

    Authors: Zhangchen Xu, Fengqing Jiang, Luyao Niu, Jinyuan Jia, Bo Li, Radha Poovendran

    Abstract: In Federated Learning (FL), a set of clients collaboratively train a machine learning model (called global model) without sharing their local training data. The local training data of clients is typically non-i.i.d. and heterogeneous, resulting in varying contributions from individual clients to the final performance of the global model. In response, many contribution evaluation methods were propo…

    Submitted 5 June, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

    Comments: To appear in the 33rd USENIX Security Symposium, 2024

  14. arXiv:2405.20215  [pdf, other]

    cs.CL

    TS-Align: A Teacher-Student Collaborative Framework for Scalable Iterative Finetuning of Large Language Models

    Authors: Chen Zhang, Chengguang Tang, Dading Chong, Ke Shi, Guohua Tang, Feng Jiang, Haizhou Li

    Abstract: Mainstream approaches to aligning large language models (LLMs) heavily rely on human preference data, particularly when models require periodic updates. The standard process for iterative alignment of LLMs involves collecting new human feedback for each update. However, the data collection process is costly and challenging to scale. To address this issue, we introduce the "TS-Align" framework, whi…

    Submitted 14 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

  15. arXiv:2405.19799  [pdf, other]

    cs.CL

    Unsupervised Mutual Learning of Dialogue Discourse Parsing and Topic Segmentation

    Authors: Jiahui Xu, Feng Jiang, Anningzhe Gao, Haizhou Li

    Abstract: The advancement of large language models (LLMs) has propelled the development of dialogue systems. Unlike the popular ChatGPT-like assistant model, which only satisfies the user's preferences, task-oriented dialogue systems have also faced new requirements and challenges in the broader business field. They are expected to provide correct responses at each dialogue turn, at the same time, achieve t…

    Submitted 3 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

  16. arXiv:2405.17306  [pdf, other]

    cs.CV

    Controllable Longer Image Animation with Diffusion Models

    Authors: Qiang Wang, Minghua Liu, Junjun Hu, Fan Jiang, Mu Xu

    Abstract: Generating realistic animated videos from static images is an important area of research in computer vision. Methods based on physical simulation and motion prediction have achieved notable advances, but they are often limited to specific object textures and motion trajectories, failing to exhibit highly complex environments and physical dynamics. In this paper, we introduce an open-domain control…

    Submitted 27 May, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

    Comments: https://wangqiang9.github.io/Controllable.github.io/

  17. arXiv:2405.12377  [pdf]

    eess.SY cs.LG

    Spatio-temporal Attention-based Hidden Physics-informed Neural Network for Remaining Useful Life Prediction

    Authors: Feilong Jiang, Xiaonan Hou, Min Xia

    Abstract: Predicting the Remaining Useful Life (RUL) is essential in Prognostic Health Management (PHM) for industrial systems. Although deep learning approaches have achieved considerable success in predicting RUL, challenges such as low prediction accuracy and interpretability pose significant challenges, hindering their practical implementation. In this work, we introduce a Spatio-temporal Attention-base…

    Submitted 20 May, 2024; originally announced May 2024.

  18. arXiv:2405.11300  [pdf, other]

    eess.SY cs.RO

    Ensuring Safety at Intelligent Intersections: Temporal Logic Meets Reachability Analysis

    Authors: Kaj Munhoz Arfvidsson, Frank J. Jiang, Karl H. Johansson, Jonas Mårtensson

    Abstract: In this work, we propose an approach for ensuring the safety of vehicles passing through an intelligent intersection. There are many proposals for the design of intelligent intersections that introduce central decision-makers to intersections for enhancing the efficiency and safety of the vehicles. To guarantee the safety of such designs, we develop a safety framework for intersections based on te…

    Submitted 18 May, 2024; originally announced May 2024.

  19. arXiv:2405.10461  [pdf, other]

    stat.ME

    Prediction in Measurement Error Models

    Authors: Fei Jiang, Yanyuan Ma

    Abstract: We study the well known difficult problem of prediction in measurement error models. By targeting directly at the prediction interval instead of the point prediction, we construct a prediction interval by providing estimators of both the center and the length of the interval which achieves a pre-determined prediction level. The constructing procedure requires a working model for the distribution o…

    Submitted 16 May, 2024; originally announced May 2024.

  20. arXiv:2405.08912  [pdf, other]

    stat.ME

    High dimensional test for functional covariates

    Authors: Huaqing Jin, Fei Jiang

    Abstract: As medical devices become more complex, they routinely collect extensive and complicated data. While classical regressions typically examine the relationship between an outcome and a vector of predictors, it becomes imperative to identify the relationship with predictors possessing functional structures. In this article, we introduce a novel inference procedure for examining the relationship betwe…

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: 35 pages, 4 figures, 4 tables

  21. arXiv:2405.05911  [pdf, other]

    eess.SY cs.ET cs.NI

    Small-Scale Testbed for Evaluating C-V2X Applications on 5G Cellular Networks

    Authors: Kaj Munhoz Arfvidsson, Kleio Fragkedaki, Frank J. Jiang, Vandana Narri, Hans-Cristian Lindh, Karl H. Johansson, Jonas Mårtensson

    Abstract: In this work, we present a small-scale testbed for evaluating the real-life performance of cellular V2X (C-V2X) applications on 5G cellular networks. Despite the growing interest and rapid technology development for V2X applications, researchers still struggle to prototype V2X applications with real wireless networks, hardware, and software in the loop in a controlled environment. To help alleviat…

    Submitted 9 May, 2024; originally announced May 2024.

  22. arXiv:2405.03217  [pdf, other]

    cs.CR cs.AR

    PCG: Mitigating Conflict-based Cache Side-channel Attacks with Prefetching

    Authors: Fang Jiang, Fei Tong, Hongyu Wang, Xiaoyu Cheng, Zhe Zhou, Ming Ling, Yuxing Mao

    Abstract: To defend against conflict-based cache side-channel attacks, cache partitioning or remapping techniques were proposed to prevent set conflicts between different security domains or obfuscate the locations of such conflicts. But such techniques complicate cache design and may result in significant performance penalties. Therefore, there have been lightweight prefetching-based schemes proposed to in…

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 12 pages, 9 figures, submitting to a journal

  23. arXiv:2404.17170  [pdf, other]

    cs.CV eess.IV

    S-IQA Image Quality Assessment With Compressive Sampling

    Authors: Ronghua Liao, Chen Hui, Lang Yuan, Feng Jiang

    Abstract: No-Reference Image Quality Assessment (IQA) aims at estimating image quality in accordance with subjective human perception. However, most existing NR-IQA methods focus on exploring increasingly complex networks or components to improve the final performance. Such practice imposes great limitations and complexity on IQA methods, especially when they are applied to high-resolution (HR) images in th…

    Submitted 26 April, 2024; originally announced April 2024.

  24. arXiv:2404.14709  [pdf, ps, other]

    cs.CV eess.IV

    SC-HVPPNet: Spatial and Channel Hybrid-Attention Video Post-Processing Network with CNN and Transformer

    Authors: Tong Zhang, Wenxue Cui, Shaohui Liu, Feng Jiang

    Abstract: Convolutional Neural Network (CNN) and Transformer have attracted much attention recently for video post-processing (VPP). However, the interaction between CNN and Transformer in existing VPP methods is not fully explored, leading to inefficient communication between the local and global extracted features. In this paper, we explore the interaction between CNN and Transformer in the task of VPP, a…

    Submitted 22 April, 2024; originally announced April 2024.

  25. arXiv:2404.13238  [pdf, other]

    cs.LG cs.AI cs.CL

    Personalized Wireless Federated Learning for Large Language Models

    Authors: Feibo Jiang, Li Dong, Siwei Tu, Yubo Peng, Kezhi Wang, Kun Yang, Cunhua Pan, Dusit Niyato

    Abstract: Large Language Models (LLMs) have revolutionized natural language processing tasks. However, their deployment in wireless networks still face challenges, i.e., a lack of privacy and security protection mechanisms. Federated Learning (FL) has emerged as a promising approach to address these challenges. Yet, it suffers from issues including inefficient handling with big and heterogeneous data, resou…

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: 8 pages, 5 figures

  26. arXiv:2404.13067  [pdf, other]

    cs.CL cs.AI cs.LG

    Towards Efficient Resume Understanding: A Multi-Granularity Multi-Modal Pre-Training Approach

    Authors: Feihu Jiang, Chuan Qin, Jingshuai Zhang, Kaichun Yao, Xi Chen, Dazhong Shen, Chen Zhu, Hengshu Zhu, Hui Xiong

    Abstract: In the contemporary era of widespread online recruitment, resume understanding has been widely acknowledged as a fundamental and crucial task, which aims to extract structured information from resume documents automatically. Compared to the traditional rule-based approaches, the utilization of recently proposed pre-trained document understanding models can greatly enhance the effectiveness of resu…

    Submitted 13 April, 2024; originally announced April 2024.

    Comments: ICME 2024 Accepted

  27. arXiv:2404.11092  [pdf, ps, other]

    econ.EM stat.ME

    Estimation for conditional moment models based on martingale difference divergence

    Authors: Kunyang Song, Feiyu Jiang, Ke Zhu

    Abstract: We provide a new estimation method for conditional moment models via the martingale difference divergence (MDD). Our MDD-based estimation method is formed in the framework of a continuum of unconditional moment restrictions. Unlike the existing estimation methods in this framework, the MDD-based estimation method adopts a non-integrable weighting function, which could grab more information from unc…

    Submitted 17 April, 2024; originally announced April 2024.

  28. arXiv:2404.08695  [pdf, other]

    cs.CL cs.AI cs.IR

    Enhancing Question Answering for Enterprise Knowledge Bases using Large Language Models

    Authors: Feihu Jiang, Chuan Qin, Kaichun Yao, Chuyu Fang, Fuzhen Zhuang, Hengshu Zhu, Hui Xiong

    Abstract: Efficient knowledge management plays a pivotal role in augmenting both the operational efficiency and the innovative capacity of businesses and organizations. By indexing knowledge through vectorization, a variety of knowledge retrieval methods have emerged, significantly enhancing the efficacy of knowledge management systems. Recently, the rapid advancements in generative natural language process…

    Submitted 20 April, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

    Comments: DASFAA 2024 Accepted

  29. arXiv:2404.08334  [pdf, other]

    eess.SY cs.RO

    Guaranteed Completion of Complex Tasks via Temporal Logic Trees and Hamilton-Jacobi Reachability

    Authors: Frank J. Jiang, Kaj Munhoz Arfvidsson, Chong He, Mo Chen, Karl H. Johansson

    Abstract: In this paper, we present an approach for guaranteeing the completion of complex tasks with cyber-physical systems (CPS). Specifically, we leverage temporal logic trees constructed using Hamilton-Jacobi reachability analysis to (1) check for the existence of control policies that complete a specified task and (2) develop a computationally-efficient approach to synthesize the full set of control in…

    Submitted 12 April, 2024; originally announced April 2024.

  30. arXiv:2404.07451  [pdf, other]

    stat.CO

    SNSeg: An R Package for Time Series Segmentation via Self-Normalization

    Authors: Shubo Sun, Zifeng Zhao, Feiyu Jiang, Xiaofeng Shao

    Abstract: Time series segmentation aims to identify potential change-points in a sequence of temporally dependent data, so that the original sequence can be partitioned into several homogeneous subsequences. It is useful for modeling and predicting non-stationary time series and is widely applied in natural and social sciences. Existing segmentation methods primarily focus on only one type of parameter chan…

    Submitted 10 April, 2024; originally announced April 2024.

  31. arXiv:2404.06995  [pdf, other]

    stat.ME

    Model-free Change-point Detection Using Modern Classifiers

    Authors: Rohit Kanrar, Feiyu Jiang, Zhanrui Cai

    Abstract: In contemporary data analysis, it is increasingly common to work with non-stationary complex datasets. These datasets typically extend beyond the classical low-dimensional Euclidean space, making it challenging to detect shifts in their distribution without relying on strong structural assumptions. This paper introduces a novel offline change-point detection method that leverages modern classifier…

    Submitted 10 April, 2024; originally announced April 2024.

  32. arXiv:2404.05192  [pdf, other]

    cs.LG

    ATFNet: Adaptive Time-Frequency Ensembled Network for Long-term Time Series Forecasting

    Authors: Hengyu Ye, Jiadong Chen, Shijin Gong, Fuxin Jiang, Tieying Zhang, Jianjun Chen, Xiaofeng Gao

    Abstract: The intricate nature of time series data analysis benefits greatly from the distinct advantages offered by time and frequency domain representations. While the time domain is superior in representing local dependencies, particularly in non-periodic series, the frequency domain excels in capturing global dependencies, making it ideal for series with evident periodic patterns. To capitalize on both…

    Submitted 8 April, 2024; originally announced April 2024.

  33. arXiv:2404.03308  [pdf, other]

    eess.SY cs.LO

    Formal Verification of Linear Temporal Logic Specifications Using Hybrid Zonotope-Based Reachability Analysis

    Authors: Loizos Hadjiloizou, Frank J. Jiang, Amr Alanwar, Karl H. Johansson

    Abstract: In this paper, we introduce a hybrid zonotope-based approach for formally verifying the behavior of autonomous systems operating under Linear Temporal Logic (LTL) specifications. In particular, we formally verify the LTL formula by constructing temporal logic trees (TLT)s via backward reachability analysis (BRA). In previous works, TLTs are predominantly constructed with either highly general and…

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: 6 pages, 3 figures, 1 table, 1 algorithm

  34. arXiv:2404.02544  [pdf, other]

    cs.CV

    Semi-Supervised Unconstrained Head Pose Estimation in the Wild

    Authors: Huayi Zhou, Fei Jiang, Hongtao Lu

    Abstract: Existing head pose estimation datasets are either composed of numerous samples by non-realistic synthesis or lab collection, or limited images by labor-intensive annotating. This makes deep supervised learning based solutions compromised due to the reliance on generous labeled data. To alleviate it, we propose the first semi-supervised unconstrained head pose estimation (SemiUHPE) method, which ca…

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: 14 pages. Semi-Supervised Unconstrained Head Pose Estimation

  35. Reachability Analysis Using Constrained Polynomial Logical Zonotopes

    Authors: Ahmad Hafez, Frank J. Jiang, Karl H. Johansson, Amr Alanwar

    Abstract: In this paper, we propose reachability analysis using constrained polynomial logical zonotopes. We perform reachability analysis to compute the set of states that could be reached. To do this, we utilize a recently introduced set representation called polynomial logical zonotopes for performing computationally efficient and exact reachability analysis on logical systems. Notably, polynomial logica…

    Submitted 19 June, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: IEEE Control Systems Letters (2024)

  36. arXiv:2403.14749  [pdf, other]

    astro-ph.GA astro-ph.CO

    Connection between galaxy morphology and dark-matter halo structure I: a running threshold for thin discs and size predictors from the dark sector

    Authors: Jinning Liang, Fangzhou Jiang, Houjun Mo, Andrew Benson, Avishai Dekel, Noa Tavron, Philip F. Hopkins, Luis C. Ho

    Abstract: We present a series of studies on the connection between galaxy morphology and the structure of host dark-matter (DM) haloes using cosmological simulations. In this work, we introduce a new kinematic decomposition scheme that features physical identification of morphological components, enabling robust separation of thin and thick discs; and measure a wide range of halo properties, including their…

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 20 pages, 17 figures, submitted to MNRAS

  37. arXiv:2403.11873  [pdf, other]

    cs.CL

    CO3: Low-resource Contrastive Co-training for Generative Conversational Query Rewrite

    Authors: Yifei Yuan, Chen Shi, Runze Wang, Liyi Chen, Renjun Hu, Zengming Zhang, Feijun Jiang, Wai Lam

    Abstract: Generative query rewrite generates reconstructed query rewrites using the conversation history while rely heavily on gold rewrite pairs that are expensive to obtain. Recently, few-shot learning is gaining increasing popularity for this task, whereas these methods are sensitive to the inherent noise due to limited data size. Besides, both attempts face performance degradation when there exists lang…

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted to COLING 2024

  38. arXiv:2403.09597  [pdf, other]

    astro-ph.GA astro-ph.CO

    Tidal evolution of cored and cuspy dark matter halos

    Authors: Xiaolong Du, Andrew Benson, Zhichao Carton Zeng, Tommaso Treu, Annika H. G. Peter, Charlie Mace, Fangzhou Jiang, Shengqi Yang, Charles Gannon, Daniel Gilman, Anna. M. Nierenberg, Ethan O. Nadler

    Abstract: The internal structure and abundance of dark matter halos and subhalos are powerful probes of the nature of dark matter. In order to compare observations with dark matter models, accurate theoretical predictions of these quantities are needed. We present a fast and accurate method to describe the tidal evolution of subhalos within their parent halo, based on a semi-analytic approach. We first cons…

    Submitted 15 July, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: 25 pages, 50 figures. Version 2: matches published version. NOTE that the definition of dynamical timescale is slightly changed in version 2. The best-fit tidal stripping parameter is updated accordingly

    Journal ref: Phys. Rev. D 110, 023019 (2024)

  39. Integrated Communications and Localization for Massive MIMO LEO Satellite Systems

    Authors: Li You, Xiaoyu Qiang, Yongxiang Zhu, Fan Jiang, Christos G. Tsinos, Wenjin Wang, Henk Wymeersch, Xiqi Gao, Björn Ottersten

    Abstract: Integrated communications and localization (ICAL) will play an important part in future sixth generation (6G) networks for the realization of Internet of Everything (IoE) to support both global communications and seamless localization. Massive multiple-input multiple-output (MIMO) low earth orbit (LEO) satellite systems have great potential in providing wide coverage with enhanced gains, and thus…

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 14 pages, 7 figures, to appear in IEEE Transactions on Wireless Communications

  40. arXiv:2403.06784  [pdf, ps, other]

    math.AP

    Uniqueness of the critical points of solutions to two kinds of semilinear elliptic equations in higher dimensional domains

    Authors: Haiyun Deng, Jingwen Ji, Feida Jiang, Jiabin Yin

    Abstract: In this paper, we provide an affirmative answer to the conjecture A for bounded simple rotationally symmetric domains $Ω\subset \mathbb{R}^n(n\geq 3)$ along $x_n$ axis. Precisely, we use a new simple argument to study the symmetry of positive solutions for two kinds of semilinear elliptic equations. To do this, when $f(\cdot,s)$ is strictly convex with respect to $s$, we show that the nonnegativit…

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: 18 pages

    MSC Class: 35B38; 35J05; 35J25

  41. arXiv:2403.05783  [pdf, other]

    cs.IT cs.LG

    Large Generative Model Assisted 3D Semantic Communication

    Authors: Feibo Jiang, Yubo Peng, Li Dong, Kezhi Wang, Kun Yang, Cunhua Pan, Xiaohu You

    Abstract: Semantic Communication (SC) is a novel paradigm for data transmission in 6G. However, there are several challenges posed when performing SC in 3D scenarios: 1) 3D semantic extraction; 2) Latent semantic redundancy; and 3) Uncertain channel estimation. To address these issues, we propose a Generative AI Model assisted 3D SC (GAM-3DSC) system. Firstly, we introduce a 3D Semantic Extractor (3DSE), wh…

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 13 pages, 13 figures, 1 table

  42. arXiv:2402.16508  [pdf, other]

    cs.CL cs.IR

    Pre-training Cross-lingual Open Domain Question Answering with Large-scale Synthetic Supervision

    Authors: Fan Jiang, Tom Drummond, Trevor Cohn

    Abstract: Cross-lingual open domain question answering (CLQA) is a complex problem, comprising cross-lingual retrieval from a multilingual knowledge base, followed by answer generation in the query language. Both steps are usually tackled by separate models, requiring substantial annotated datasets, and typically auxiliary resources, like machine translation systems to bridge between languages. In this pape…

    Submitted 16 June, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  43. arXiv:2402.14851  [pdf, other]

    cs.CL cs.AI cs.DB

    $R^3$: "This is My SQL, Are You With Me?" A Consensus-Based Multi-Agent System for Text-to-SQL Tasks

    Authors: Hanchen Xia, Feng Jiang, Naihao Deng, Cunxiang Wang, Guojiang Zhao, Rada Mihalcea, Yue Zhang

    Abstract: Large Language Models (LLMs) have demonstrated strong performance on various tasks. To unleash their power on the Text-to-SQL task, we propose $R^3$ (Review-Rebuttal-Revision), a consensus-based multi-agent system for Text-to-SQL tasks. $R^3$ outperforms the existing single LLM Text-to-SQL systems as well as the multi-agent Text-to-SQL systems by $1.3\%$ to $8.1\%$ on Spider and Bird. Surprisingly…

    Submitted 10 July, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: 12 pages, 2 figures, 8 tables

  44. arXiv:2402.12452  [pdf, other]

    astro-ph.CO astro-ph.GA hep-ph

    Numerical Challenges in Modeling Gravothermal Collapse in Self-Interacting Dark Matter Halos

    Authors: Igor Palubski, Oren Slone, Manoj Kaplinghat, Mariangela Lisanti, Fangzhou Jiang

    Abstract: When dark matter has a large cross section for self scattering, halos can undergo a process known as gravothermal core collapse, where the inner core rapidly increases in density and temperature. To date, several methods have been used to implement Self-Interacting Dark Matter (SIDM) in N-body codes, but there has been no systematic study of these different methods or their accuracy in the core-co…

    Submitted 19 February, 2024; originally announced February 2024.

  45. arXiv:2402.11753  [pdf, other]

    cs.CL cs.AI

    ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs

    Authors: Fengqing Jiang, Zhangchen Xu, Luyao Niu, Zhen Xiang, Bhaskar Ramasubramanian, Bo Li, Radha Poovendran

    Abstract: Safety is critical to the usage of large language models (LLMs). Multiple techniques such as data filtering and supervised fine-tuning have been developed to strengthen LLM safety. However, currently known techniques presume that corpora used for safety alignment of LLMs are solely interpreted by semantics. This assumption, however, does not hold in real-world applications, which leads to severe v…

    Submitted 7 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: To appear in ACL 2024

  46. arXiv:2402.11566  [pdf, other]

    cs.CV

    Boosting Semi-Supervised 2D Human Pose Estimation by Revisiting Data Augmentation and Consistency Training

    Authors: Huayi Zhou, Mukun Luo, Fei Jiang, Yue Ding, Hongtao Lu

    Abstract: 2D human pose estimation (HPE) is a basic visual problem. However, its supervised learning requires massive keypoint labels, which are labor-intensive to collect. Thus, we aim at boosting a pose estimator by excavating extra unlabeled data with semi-supervised learning (SSL). Most previous SSHPE methods are consistency-based and strive to maintain consistent outputs for differently augmented in…

    Submitted 7 March, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: 14 pages. Semi-Supervised 2D Human Pose Estimation

  47. arXiv:2402.10669  [pdf, other]

    cs.CL

    Humans or LLMs as the Judge? A Study on Judgement Biases

    Authors: Guiming Hardy Chen, Shunian Chen, Ziche Liu, Feng Jiang, Benyou Wang

    Abstract: Adopting humans and large language models (LLMs) as judges (a.k.a. human- and LLM-as-a-judge) for evaluating the performance of LLMs has recently gained attention. Nonetheless, this approach concurrently introduces potential biases from humans and LLMs, questioning the reliability of the evaluation results. In this paper, we propose a novel framework that is free from referencing groundtruth annotatio…

    Submitted 16 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: 25 pages

  48. arXiv:2402.08983  [pdf, other]

    cs.CR cs.AI cs.CL

    SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding

    Authors: Zhangchen Xu, Fengqing Jiang, Luyao Niu, Jinyuan Jia, Bill Yuchen Lin, Radha Poovendran

    Abstract: As large language models (LLMs) become increasingly integrated into real-world applications such as code generation and chatbot assistance, extensive efforts have been made to align LLM behavior with human values, including safety. Jailbreak attacks, aiming to provoke unintended and unsafe behaviors from LLMs, remain a significant/leading LLM safety threat. In this paper, we aim to defend LLMs aga…

    Submitted 7 June, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

    Comments: To appear in ACL 2024

  49. arXiv:2402.07201  [pdf, ps, other]

    math.AP

    On the Inhibition of Rayleigh Taylor Instability by Capillarity in the Navier Stokes Korteweg Model

    Authors: Fei Jiang, Yajie Zhang, Zhipeng Zhang

    Abstract: Bresch–Desjardins–Gisclon–Sart had derived that the capillarity slows down the growth rate of Rayleigh–Taylor (RT) instability in an inhomogeneous incompressible fluid endowed with internal capillarity based on a linearized incompressible Navier–Stokes–Korteweg (NSK) equations in 2008. Later Li–Zhang further obtained another result that the capillarity inhibits RT instability also based on…

    Submitted 11 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: text overlap with arXiv:2302.01013

    MSC Class: 35Q35; 76D03; 76E99

  50. arXiv:2402.04390  [pdf]

    cs.LG cs.AI

    Densely Multiplied Physics Informed Neural Networks

    Authors: Feilong Jiang, Xiaonan Hou, Min Xia

    Abstract: Although physics-informed neural networks (PINNs) have shown great potential in dealing with nonlinear partial differential equations (PDEs), it is common that PINNs will suffer from the problem of insufficient precision or obtaining incorrect outcomes. Unlike most of the existing solutions trying to enhance the ability of PINN by optimizing the training process, this paper improved the neural net…

    Submitted 12 February, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: 15 pages, 9 figures