
Showing 1–50 of 359 results for author: Qiu, L

  1. arXiv:2407.10862 [pdf, other]

    cs.CV

    R3D-AD: Reconstruction via Diffusion for 3D Anomaly Detection

    Authors: Zheyuan Zhou, Le Wang, Naiyu Fang, Zili Wang, Lemiao Qiu, Shuyou Zhang

    Abstract: 3D anomaly detection plays a crucial role in monitoring parts for localized inherent defects in precision manufacturing. Embedding-based and reconstruction-based approaches are among the most popular and successful methods. However, there are two major challenges to the practical application of the current approaches: 1) the embedded models suffer the prohibitive computational and storage due to t…

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  2. arXiv:2407.10234 [pdf, other]

    hep-ph

    Understanding the non-trivial isoscalar pseudoscalar structures in the $K_S K_Sπ^0$ spectra in the $J/ψ$ radiative decay

    Authors: Yin Cheng, Lin Qiu, Qiang Zhao

    Abstract: Initiated by the recent observation of a flattened lineshape of $IJ^{PC}=00^{-+}$ around $1.4\sim 1.5$ GeV in the $K_S K_S π^0$ invariant mass spectrum by BESIII, we make a systematic partial wave analysis of $J/ψ\toγη_X\to γK\bar{K}π$ based on an isobaric approach. We demonstrate that in the scenario of the first radial excitations of the isoscalar pseudoscalar from the $K\bar{K}π$ threshold to a…

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: Revtex, 13 pages, 6 figures

  3. arXiv:2407.07071 [pdf, other]

    cs.CL cs.AI cs.LG

    Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps

    Authors: Yung-Sung Chuang, Linlu Qiu, Cheng-Yu Hsieh, Ranjay Krishna, Yoon Kim, James Glass

    Abstract: When asked to summarize articles or answer questions given a passage, large language models (LLMs) can hallucinate details and respond with unsubstantiated answers that are inaccurate with respect to the input context. This paper describes a simple approach for detecting such contextual hallucinations. We hypothesize that contextual hallucinations are related to the extent to which an LLM attends…

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: The source code is available at https://github.com/voidism/Lookback-Lens

  4. arXiv:2407.06174 [pdf, other]

    cs.CV

    The Tug-of-War Between Deepfake Generation and Detection

    Authors: Hannah Lee, Changyeon Lee, Kevin Farhat, Lin Qiu, Steve Geluso, Aerin Kim, Oren Etzioni

    Abstract: Multimodal generative models are rapidly evolving, leading to a surge in the generation of realistic video and audio that offers exciting possibilities but also serious risks. Deepfake videos, which can convincingly impersonate individuals, have particularly garnered attention due to their potential misuse in spreading misinformation and creating fraudulent content. This survey paper examines the…

    Submitted 8 July, 2024; originally announced July 2024.

  5. arXiv:2407.02616 [pdf]

    eess.IV cs.CV

    Deep Learning Based Apparent Diffusion Coefficient Map Generation from Multi-parametric MR Images for Patients with Diffuse Gliomas

    Authors: Zach Eidex, Mojtaba Safari, Jacob Wynne, Richard L. J. Qiu, Tonghe Wang, David Viar Hernandez, Hui-Kuo Shu, Hui Mao, Xiaofeng Yang

    Abstract: Purpose: Apparent diffusion coefficient (ADC) maps derived from diffusion weighted (DWI) MRI provides functional measurements about the water molecules in tissues. However, DWI is time consuming and very susceptible to image artifacts, leading to inaccurate ADC measurements. This study aims to develop a deep learning framework to synthesize ADC maps from multi-parametric MR images. Methods: We pro…

    Submitted 4 July, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:2311.15044

  6. arXiv:2407.02490 [pdf, other]

    cs.CL cs.LG

    MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention

    Authors: Huiqiang Jiang, Yucheng Li, Chengruidong Zhang, Qianhui Wu, Xufang Luo, Surin Ahn, Zhenhua Han, Amir H. Abdi, Dongsheng Li, Chin-Yew Lin, Yuqing Yang, Lili Qiu

    Abstract: The computational challenges of Large Language Model (LLM) inference remain a significant barrier to their widespread deployment, especially as prompt lengths continue to increase. Due to the quadratic complexity of the attention computation, it takes 30 minutes for an 8B LLM to process a prompt of 1M tokens (i.e., the pre-filling stage) on a single A100 GPU. Existing methods for speeding up prefi…

    Submitted 2 July, 2024; originally announced July 2024.

  7. arXiv:2407.01491 [pdf, other]

    cs.CL cs.CV

    Expressive and Generalizable Low-rank Adaptation for Large Models via Slow Cascaded Learning

    Authors: Siwei Li, Yifan Yang, Yifei Shen, Fangyun Wei, Zongqing Lu, Lili Qiu, Yuqing Yang

    Abstract: Efficient fine-tuning plays a fundamental role in modern large models, with low-rank adaptation emerging as a particularly promising approach. However, the existing variants of LoRA are hampered by limited expressiveness, a tendency to overfit, and sensitivity to hyperparameter settings. This paper presents LoRA Slow Cascade Learning (LoRASC), an innovative technique designed to enhance LoRA's exp…

    Submitted 1 July, 2024; originally announced July 2024.

  8. arXiv:2406.16864 [pdf, other]

    cs.CV cs.AI cs.GR

    StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal

    Authors: Chongjie Ye, Lingteng Qiu, Xiaodong Gu, Qi Zuo, Yushuang Wu, Zilong Dong, Liefeng Bo, Yuliang Xiu, Xiaoguang Han

    Abstract: This work addresses the challenge of high-quality surface normal estimation from monocular colored inputs (i.e., images and videos), a field which has recently been revolutionized by repurposing diffusion priors. However, previous attempts still struggle with stochastic inference, conflicting with the deterministic nature of the Image2Normal task, and costly ensembling step, which slows down the e…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: HF Demo: hf.co/Stable-X, Video: https://www.youtube.com/watch?v=sylXTxG_U2U

  9. arXiv:2406.15656 [pdf, other]

    eess.IV cs.CV

    Adaptive Self-Supervised Consistency-Guided Diffusion Model for Accelerated MRI Reconstruction

    Authors: Mojtaba Safari, Zach Eidex, Shaoyan Pan, Richard L. J. Qiu, Xiaofeng Yang

    Abstract: Purpose: To propose a self-supervised deep learning-based compressed sensing MRI (DL-based CS-MRI) method named "Adaptive Self-Supervised Consistency Guided Diffusion Model (ASSCGD)" to accelerate data acquisition without requiring fully sampled datasets. Materials and Methods: We used the fastMRI multi-coil brain axial T2-weighted (T2-w) dataset from 1,376 cases and single-coil brain quantitative…

    Submitted 21 June, 2024; originally announced June 2024.

  10. arXiv:2406.11375 [pdf, other]

    cs.CL cs.AI

    Boosting Scientific Concepts Understanding: Can Analogy from Teacher Models Empower Student Models?

    Authors: Siyu Yuan, Cheng Jiayang, Lin Qiu, Deqing Yang

    Abstract: Analogical reasoning plays a critical role in human cognition, enabling us to understand new concepts by associating them with familiar ones. Previous research in the AI community has mainly focused on identifying and generating analogies and then examining their quality under human evaluation, which overlooks the practical application of these analogies in real-world settings. Inspired by the hum…

    Submitted 17 June, 2024; originally announced June 2024.

  11. arXiv:2406.09931 [pdf, other]

    eess.IV cs.CV cs.LG

    SCKansformer: Fine-Grained Classification of Bone Marrow Cells via Kansformer Backbone and Hierarchical Attention Mechanisms

    Authors: Yifei Chen, Zhu Zhu, Shenghao Zhu, Linwei Qiu, Binfeng Zou, Fan Jia, Yunpeng Zhu, Chenyan Zhang, Zhaojie Fang, Feiwei Qin, Jin Fan, Changmiao Wang, Yu Gao, Gang Yu

    Abstract: The incidence and mortality rates of malignant tumors, such as acute leukemia, have risen significantly. Clinically, hospitals rely on cytological examination of peripheral blood and bone marrow smears to diagnose malignant tumors, with accurate blood cell counting being crucial. Existing automated methods face challenges such as low feature expression capability, poor interpretability, and redund…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 15 pages, 6 figures

  12. arXiv:2406.05962 [pdf, other]

    cs.DC cs.DB

    Data Caching for Enterprise-Grade Petabyte-Scale OLAP

    Authors: Chunxu Tang, Bin Fan, Jing Zhao, Chen Liang, Yi Wang, Beinan Wang, Ziyue Qiu, Lu Qiu, Bowen Ding, Shouzhuo Sun, Saiguang Che, Jiaming Mai, Shouwei Chen, Yu Zhu, Jianjian Xie, Yutian Sun, Yao Li, Yangjun Zhang, Ke Wang, Mingmin Chen

    Abstract: With the exponential growth of data and evolving use cases, petabyte-scale OLAP data platforms are increasingly adopting a model that decouples compute from storage. This shift, evident in organizations like Uber and Meta, introduces operational challenges including massive, read-heavy I/O traffic with potential throttling, as well as skewed and fragmented data access patterns. Addressing these ch…

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Accepted to the USENIX Annual Technical Conference (USENIX ATC) 2024

  13. arXiv:2406.02756 [pdf, other]

    cs.CL cs.AI cs.LG

    Aligning Large Language Models via Fine-grained Supervision

    Authors: Dehong Xu, Liang Qiu, Minseok Kim, Faisal Ladhak, Jaeyoung Do

    Abstract: Pre-trained large-scale language models (LLMs) excel at producing coherent articles, yet their outputs may be untruthful, toxic, or fail to align with user expectations. Current approaches focus on using reinforcement learning with human feedback (RLHF) to improve model alignment, which works by transforming coarse human preferences of LLM outputs into a feedback signal that guides the model learn…

    Submitted 4 June, 2024; originally announced June 2024.

  14. arXiv:2406.02536 [pdf, other]

    cs.CL cs.LG

    Mitigate Position Bias in Large Language Models via Scaling a Single Dimension

    Authors: Yijiong Yu, Huiqiang Jiang, Xufang Luo, Qianhui Wu, Chin-Yew Lin, Dongsheng Li, Yuqing Yang, Yongfeng Huang, Lili Qiu

    Abstract: Large Language Models (LLMs) are increasingly applied in various real-world scenarios due to their excellent generalization capabilities and robust generative abilities. However, they exhibit position bias, also known as "lost in the middle", a phenomenon that is especially pronounced in long-context scenarios, which indicates the placement of the key information in different positions of a prompt…

    Submitted 4 June, 2024; originally announced June 2024.

  15. arXiv:2405.19888 [pdf, other]

    cs.LG cs.AI

    Parrot: Efficient Serving of LLM-based Applications with Semantic Variable

    Authors: Chaofan Lin, Zhenhua Han, Chengruidong Zhang, Yuqing Yang, Fan Yang, Chen Chen, Lili Qiu

    Abstract: The rise of large language models (LLMs) has enabled LLM-based applications (a.k.a. AI agents or co-pilots), a new software paradigm that combines the strength of LLM and conventional software. Diverse LLM applications from different tenants could design complex workflows using multiple LLM requests to accomplish one task. However, they have to use the over-simplified request-level API provided by…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: To appear on USENIX OSDI 2024

  16. arXiv:2405.16337 [pdf, other]

    cs.CL cs.AI

    Learning to Reason via Program Generation, Emulation, and Search

    Authors: Nathaniel Weir, Muhammad Khalifa, Linlu Qiu, Orion Weller, Peter Clark

    Abstract: Program synthesis with language models (LMs) has unlocked a large set of reasoning abilities; code-tuned LMs have proven adept at generating programs that solve a wide variety of algorithmic symbolic manipulation tasks (e.g. word concatenation). However, not all reasoning tasks are easily expressible as code, e.g. tasks involving commonsense reasoning, moral decision-making, and sarcasm understand…

    Submitted 28 May, 2024; v1 submitted 25 May, 2024; originally announced May 2024.

    Comments: 16 pages, 10 figures

  17. arXiv:2405.14486 [pdf, other]

    cs.CL

    RefChecker: Reference-based Fine-grained Hallucination Checker and Benchmark for Large Language Models

    Authors: Xiangkun Hu, Dongyu Ru, Lin Qiu, Qipeng Guo, Tianhang Zhang, Yang Xu, Yun Luo, Pengfei Liu, Yue Zhang, Zheng Zhang

    Abstract: Large Language Models (LLMs) have shown impressive capabilities but also a concerning tendency to hallucinate. This paper presents RefChecker, a framework that introduces claim-triplets to represent claims in LLM responses, aiming to detect fine-grained hallucinations. In RefChecker, an extractor generates claim-triplets from a response, which are then evaluated by a checker against a reference. W…

    Submitted 23 May, 2024; originally announced May 2024.

  18. arXiv:2405.13634 [pdf, other]

    eess.SP

    Secure Communications in Near-Field ISCAP Systems with Extremely Large-Scale Antenna Arrays

    Authors: Zixiang Ren, Siyao Zhang, Xinmin Li, Ling Qiu, Jie Xu, Derrick Wing Kwan Ng

    Abstract: This paper investigates secure communications in a near-field multi-functional integrated sensing, communication, and powering (ISCAP) system with an extremely large-scale antenna arrays (ELAA) equipped at the base station (BS). In this system, the BS sends confidential messages to a single communication user (CU), and at the same time wirelessly senses a point target and charges multiple energy r…

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 6 pages

  19. arXiv:2405.12470 [pdf, other]

    cond-mat.mes-hall cond-mat.other cond-mat.quant-gas

    Dynamical Geometry of the Haldane Model under a Quantum Quench

    Authors: Liwei Qiu, Lih-King Lim, Xin Wan

    Abstract: We explore the time evolution of a topological system when the system undergoes a sudden quantum quench within the same nontrivial phase. Using Haldane's honeycomb model as an example, we show that equilibrium states in a topological phase can be distinguished by geometrical features, such as the characteristic momentum at which the half-occupied edge modes cross, the associated edge-mode velocity…

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: 13 pages, 11 figures

  20. arXiv:2405.09891 [pdf]

    physics.med-ph

    Adaptive Proton Therapy Using CBCT-Guided Digital Twins

    Authors: Chih-Wei Chang, Zhen Tian, Richard L. J. Qiu, H. Scott McGinnis, Duncan Bohannon, Pretesh Patel, Yinan Wang, David S. Yu, Sagar A. Patel, Jun Zhou, Xiaofeng Yang

    Abstract: This study aims to develop a digital twin (DT) framework to enhance adaptive proton stereotactic body radiation therapy (SBRT) for prostate cancer. Prostate SBRT has emerged as a leading option for external beam radiotherapy due to its effectiveness and reduced treatment duration. However, interfractional anatomy variations can impact treatment outcomes. This study seeks to address these uncertain…

    Submitted 17 May, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

  21. arXiv:2405.05945 [pdf, other]

    cs.CV

    Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers

    Authors: Peng Gao, Le Zhuo, Dongyang Liu, Ruoyi Du, Xu Luo, Longtian Qiu, Yuhang Zhang, Chen Lin, Rongjie Huang, Shijie Geng, Renrui Zhang, Junlin Xi, Wenqi Shao, Zhengkai Jiang, Tianshuo Yang, Weicai Ye, He Tong, Jingwen He, Yu Qiao, Hongsheng Li

    Abstract: Sora unveils the potential of scaling Diffusion Transformer for generating photorealistic images and videos at arbitrary resolutions, aspect ratios, and durations, yet it still lacks sufficient implementation details. In this technical report, we introduce the Lumina-T2X family - a series of Flow-based Large Diffusion Transformers (Flag-DiT) equipped with zero-initialized attention, as a unified f…

    Submitted 13 June, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

    Comments: Technical Report; Code at: https://github.com/Alpha-VLLM/Lumina-T2X

  22. arXiv:2405.00241 [pdf]

    physics.med-ph

    Fast MRI Reconstruction Using Deep Learning-based Compressed Sensing: A Systematic Review

    Authors: Mojtaba Safari, Zach Eidex, Chih-Wei Chang, Richard L. J. Qiu, Xiaofeng Yang

    Abstract: Magnetic resonance imaging (MRI) has revolutionized medical imaging, providing a non-invasive and highly detailed look into the human body. However, the long acquisition times of MRI present challenges, causing patient discomfort, motion artifacts, and limiting real-time applications. To address these challenges, researchers are exploring various techniques to reduce acquisition time and improve t…

    Submitted 30 April, 2024; originally announced May 2024.

  23. arXiv:2404.18388 [pdf, other]

    cs.CR cs.DB

    SPECIAL: Synopsis Assisted Secure Collaborative Analytics

    Authors: Chenghong Wang, Lina Qiu, Johes Bater, Yukui Luo

    Abstract: Secure collaborative analytics (SCA) enable the processing of analytical SQL queries across multiple owners' data, even when direct data sharing is not feasible. Although essential for strong privacy, the large overhead from data-oblivious primitives in traditional SCA has hindered its practical adoption. Recent SCA variants that permit controlled leakages under differential privacy (DP) show a be…

    Submitted 28 April, 2024; originally announced April 2024.

  24. arXiv:2404.15946 [pdf]

    cs.CV cs.AI eess.IV

    Mammo-CLIP: Leveraging Contrastive Language-Image Pre-training (CLIP) for Enhanced Breast Cancer Diagnosis with Multi-view Mammography

    Authors: Xuxin Chen, Yuheng Li, Mingzhe Hu, Ella Salari, Xiaoqian Chen, Richard L. J. Qiu, Bin Zheng, Xiaofeng Yang

    Abstract: Although fusion of information from multiple views of mammograms plays an important role to increase accuracy of breast cancer detection, developing multi-view mammograms-based computer-aided diagnosis (CAD) schemes still faces challenges and no such CAD schemes have been used in clinical practice. To overcome the challenges, we investigate a new approach based on Contrastive Language-Image Pre-tr…

    Submitted 24 April, 2024; originally announced April 2024.

  25. arXiv:2404.11216 [pdf, other]

    cs.CL cs.AI cs.LG

    Position Engineering: Boosting Large Language Models through Positional Information Manipulation

    Authors: Zhiyuan He, Huiqiang Jiang, Zilong Wang, Yuqing Yang, Luna Qiu, Lili Qiu

    Abstract: The performance of large language models (LLMs) is significantly influenced by the quality of the prompts provided. In response, researchers have developed enormous prompt engineering strategies aimed at modifying the prompt text to enhance task performance. In this paper, we introduce a novel technique termed position engineering, which offers a more efficient way to guide large language models.…

    Submitted 17 April, 2024; originally announced April 2024.

  26. arXiv:2404.05609 [pdf, other]

    math.OC eess.SY

    Feedback Stability Under Mixed Gain and Phase Uncertainty

    Authors: Jiajin Liang, Di Zhao, Li Qiu

    Abstract: In this study, we investigate the robust feedback stability problem for multiple-input-multiple-output linear time-invariant systems involving sectored-disk uncertainty, namely, dynamic uncertainty subject to simultaneous gain and phase constraints. This problem is thereby called a sectored-disk problem. Employing a frequency-wise analysis approach, we derive a fundamental static matrix problem th…

    Submitted 8 April, 2024; originally announced April 2024.

  27. arXiv:2404.04286 [pdf, other]

    cs.CL cs.AI cs.LG

    Language Model Evolution: An Iterated Learning Perspective

    Authors: Yi Ren, Shangmin Guo, Linlu Qiu, Bailin Wang, Danica J. Sutherland

    Abstract: With the widespread adoption of Large Language Models (LLMs), the prevalence of iterative interactions among these models is anticipated to increase. Notably, recent advancements in multi-round self-improving methods allow LLMs to generate new examples for training subsequent models. At the same time, multi-agent LLM systems, involving automated interactions among agents, are also increasing in pr…

    Submitted 3 April, 2024; originally announced April 2024.

  28. arXiv:2404.01617 [pdf, other]

    cs.NI cs.LG cs.MM

    LLM-ABR: Designing Adaptive Bitrate Algorithms via Large Language Models

    Authors: Zhiyuan He, Aashish Gottipati, Lili Qiu, Francis Y. Yan, Xufang Luo, Kenuo Xu, Yuqing Yang

    Abstract: We present LLM-ABR, the first system that utilizes the generative capabilities of large language models (LLMs) to autonomously design adaptive bitrate (ABR) algorithms tailored for diverse network characteristics. Operating within a reinforcement learning framework, LLM-ABR empowers LLMs to design key components such as states and neural network architectures. We evaluate LLM-ABR across diverse ne…

    Submitted 1 April, 2024; originally announced April 2024.

  29. arXiv:2404.00998 [pdf, other]

    cs.CL cs.AI

    LLM-RadJudge: Achieving Radiologist-Level Evaluation for X-Ray Report Generation

    Authors: Zilong Wang, Xufang Luo, Xinyang Jiang, Dongsheng Li, Lili Qiu

    Abstract: Evaluating generated radiology reports is crucial for the development of radiology AI, but existing metrics fail to reflect the task's clinical requirements. This study proposes a novel evaluation framework using large language models (LLMs) to compare radiology reports for assessment. We compare the performance of various LLMs and demonstrate that, when using GPT-4, our proposed metric achieves e…

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: 11 pages, 6 figures

  30. arXiv:2404.00269 [pdf, other]

    cs.CV

    IPoD: Implicit Field Learning with Point Diffusion for Generalizable 3D Object Reconstruction from Single RGB-D Images

    Authors: Yushuang Wu, Luyue Shi, Junhao Cai, Weihao Yuan, Lingteng Qiu, Zilong Dong, Liefeng Bo, Shuguang Cui, Xiaoguang Han

    Abstract: Generalizable 3D object reconstruction from single-view RGB-D images remains a challenging task, particularly with real-world data. Current state-of-the-art methods develop Transformer-based implicit field learning, necessitating an intensive learning paradigm that requires dense query-supervision uniformly sampled throughout the entire space. We propose a novel approach, IPoD, which harmonizes im…

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: CVPR 2024

  31. arXiv:2404.00209 [pdf, other]

    cs.CL

    EventGround: Narrative Reasoning by Grounding to Eventuality-centric Knowledge Graphs

    Authors: Cheng Jiayang, Lin Qiu, Chunkit Chan, Xin Liu, Yangqiu Song, Zheng Zhang

    Abstract: Narrative reasoning relies on the understanding of eventualities in story contexts, which requires a wealth of background world knowledge. To help machines leverage such knowledge, existing solutions can be categorized into two groups. Some focus on implicitly modeling eventuality knowledge by pretraining language models (LMs) with eventuality-aware objectives. However, this approach breaks down k…

    Submitted 7 July, 2024; v1 submitted 29 March, 2024; originally announced April 2024.

  32. arXiv:2403.16353 [pdf, other]

    cs.IT eess.SP

    Energy-Efficient Hybrid Beamforming with Dynamic On-off Control for Integrated Sensing, Communications, and Powering

    Authors: Zeyu Hao, Yuan Fang, Xianghao Yu, Jie Xu, Ling Qiu, Lexi Xu, Shuguang Cui

    Abstract: This paper investigates the energy-efficient hybrid beamforming design for a multi-functional integrated sensing, communications, and powering (ISCAP) system. In this system, a base station (BS) with a hybrid analog-digital (HAD) architecture sends unified wireless signals to communicate with multiple information receivers (IRs), sense multiple point targets, and wirelessly charge multiple energy…

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: 13 pages, 6 figures, submitted to IEEE Transactions on Communications

  33. arXiv:2403.12968 [pdf, other]

    cs.CL cs.LG

    LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression

    Authors: Zhuoshi Pan, Qianhui Wu, Huiqiang Jiang, Menglin Xia, Xufang Luo, Jue Zhang, Qingwei Lin, Victor Rühle, Yuqing Yang, Chin-Yew Lin, H. Vicky Zhao, Lili Qiu, Dongmei Zhang

    Abstract: This paper focuses on task-agnostic prompt compression for better generalizability and efficiency. Considering the redundancy in natural language, existing approaches compress prompts by removing tokens or lexical units according to their information entropy obtained from a causal language model such as LLaMa-7B. The challenge is that information entropy may be a suboptimal compression metric: (i)…

    Submitted 19 March, 2024; originally announced March 2024.

  34. arXiv:2403.12370 [pdf, other]

    cs.CV

    XPose: eXplainable Human Pose Estimation

    Authors: Luyu Qiu, Jianing Li, Lei Wen, Chi Su, Fei Hao, Chen Jason Zhang, Lei Chen

    Abstract: Current approaches in pose estimation primarily concentrate on enhancing model architectures, often overlooking the importance of comprehensively understanding the rationale behind model decisions. In this paper, we propose XPose, a novel framework that incorporates Explainable AI (XAI) principles into pose estimation. This integration aims to elucidate the individual contribution of each keypoint…

    Submitted 18 March, 2024; originally announced March 2024.

  35. arXiv:2403.12010 [pdf, other]

    cs.CV cs.AI cs.GR

    VideoMV: Consistent Multi-View Generation Based on Large Video Generative Model

    Authors: Qi Zuo, Xiaodong Gu, Lingteng Qiu, Yuan Dong, Zhengyi Zhao, Weihao Yuan, Rui Peng, Siyu Zhu, Zilong Dong, Liefeng Bo, Qixing Huang

    Abstract: Generating multi-view images based on text or single-image prompts is a critical capability for the creation of 3D content. Two fundamental questions on this topic are what data we use for training and how to ensure multi-view consistency. This paper introduces a novel framework that makes fundamental contributions to both questions. Unlike leveraging images from 2D diffusion models for training,…

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Project page: aigc3d.github.io/VideoMV/

  36. arXiv:2403.11890 [pdf]

    physics.med-ph

    Dual-Energy Cone-Beam CT Using Two Complementary Limited-Angle Scans with A Projection-Consistent Diffusion Model

    Authors: Junbo Peng, Chih-Wei Chang, Richard L. J. Qiu, Tonghe Wang, Justin Roper, Beth Ghavidel, Xiangyang Tang, Xiaofeng Yang

    Abstract: Background: Dual-energy imaging on cone-beam CT (CBCT) scanners has great potential in different clinical applications, including image-guided surgery and adaptive proton therapy. However, the clinical practice of dual-energy CBCT (DE-CBCT) has been hindered by the requirement of sophisticated hardware components. Purpose: In this work, we aim to propose a practical solution for single-scan dual-e…

    Submitted 18 March, 2024; originally announced March 2024.

  37. arXiv:2403.11465 [pdf]

    cond-mat.mes-hall cond-mat.mtrl-sci

    Ultra-Long Homochiral Graphene Nanoribbons Grown Within h-BN Stacks for High-Performance Electronics

    Authors: Bosai Lyu, Jiajun Chen, Sen Wang, Shuo Lou, Peiyue Shen, Jingxu Xie, Lu Qiu, Izaac Mitchell, Can Li, Cheng Hu, Xianliang Zhou, Kenji Watanabe, Takashi Taniguchi, Xiaoqun Wang, Jinfeng Jia, Qi Liang, Guorui Chen, Tingxin Li, Shiyong Wang, Wengen Ouyang, Oded Hod, Feng Ding, Michael Urbakh, Zhiwen Shi

    Abstract: Van der Waals encapsulation of two-dimensional materials within hexagonal boron nitride (h-BN) stacks has proven to be a promising way to create ultrahigh-performance electronic devices. However, contemporary approaches for achieving van der Waals encapsulation, which involve artificial layer stacking using mechanical transfer techniques, are difficult to control, prone to contamination, and unsca…

    Submitted 18 March, 2024; originally announced March 2024.

  38. arXiv:2403.07858 [pdf, other]

    cs.DC

    Accelerating Biclique Counting on GPU

    Authors: Linshan Qiu, Zhonggen Li, Xiangyu Ke, Lu Chen, Yunjun Gao

    Abstract: Counting (p,q)-bicliques in bipartite graphs poses a foundational challenge with broad applications, from densest subgraph discovery in algorithmic research to personalized content recommendation in practical scenarios. Despite its significance, current leading (p,q)-biclique counting algorithms fall short, particularly when faced with larger graph sizes and clique scales. Fortunately, the problem…

    Submitted 20 March, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: This paper has been accepted by ICDE24

  39. Energy loss of a heavy fermion in a collisional QED plasma

    Authors: Yun Guo, Luhua Qiu, Ruizhe Zhao, Michael Strickland

    Abstract: We compute the energy loss of heavy fermions moving in a plasma, taking into account the modification of the photon collective modes induced by collisions using a Bhatnagar-Gross-Krook collisional kernel. We include contributions from both hard and soft scatterings of the heavy fermion using a collisionally modified hard-thermal-loop resummed propagator. Using this method, one does not need to int…

    Submitted 15 June, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: 25 pages, 7 figures, final version published in PRD

    Journal ref: Phys. Rev. D 109, 114025 (2024)

  40. Switching intrinsic magnetic skyrmions with controllable magnetic anisotropy in van der Waals multiferroic heterostructures

    Authors: Ze-quan Wang, Feng Xue, Liang Qiu, Zhe Wang, Ruqian Wu, Yusheng Hou

    Abstract: Magnetic skyrmions, topologically nontrivial whirling spin textures at nanometer scales, have emerged as potential information carriers for spintronic devices. The ability to efficiently create and erase magnetic skyrmions is vital yet challenging for such applications. Based on first-principles studies, we find that switching between intrinsic magnetic skyrmion and high-temperature ferromagnetic…

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 20 Pages, 3 figures, 1 table; accepted by Nano Letters

  41. arXiv:2402.18495 [pdf, other]

    cs.LG

    ROG$_{PL}$: Robust Open-Set Graph Learning via Region-Based Prototype Learning

    Authors: Qin Zhang, Xiaowei Li, Jiexin Lu, Liping Qiu, Shirui Pan, Xiaojun Chen, Junyang Chen

    Abstract: Open-set graph learning is a practical task that aims to classify the known class nodes and to identify unknown class samples as unknowns. Conventional node classification methods usually perform unsatisfactorily in open-set scenarios due to the complex data they encounter, such as out-of-distribution (OOD) data and in-distribution (IND) noise. OOD data are samples that do not belong to any known…

    Submitted 29 February, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: 9 pages, 5 figures

  42. arXiv:2402.17033 [pdf, other]

    cond-mat.soft physics.bio-ph

    Mechanics and wrinkling patterns of pressurized bent tubes

    Authors: Cesar L. Pastrana, Luyi Qiu, John W. Hutchinson, Ariel Amir, Ulrich Gerland

    Abstract: Take a drinking straw and bend it from its ends. After sufficient bending, the tube buckles forming a kink, where the curvature is localized in a very small area. This instability, known generally as the Brazier effect, is inherent to thin-walled cylindrical shells, which are particularly ubiquitous in living systems, such as rod-shaped bacteria. However, tubular biological structures are often pr…

    Submitted 10 April, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: 5 pages, 4 figures

  43. arXiv:2402.05935  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models

    Authors: Dongyang Liu, Renrui Zhang, Longtian Qiu, Siyuan Huang, Weifeng Lin, Shitian Zhao, Shijie Geng, Ziyi Lin, Peng Jin, Kaipeng Zhang, Wenqi Shao, Chao Xu, Conghui He, Junjun He, Hao Shao, Pan Lu, Hongsheng Li, Yu Qiao, Peng Gao

    Abstract: We propose SPHINX-X, an extensive Multimodality Large Language Model (MLLM) series developed upon SPHINX. To improve the architecture and training efficiency, we modify the SPHINX framework by removing redundant visual encoders, bypassing fully-padded sub-images with skip tokens, and simplifying multi-stage training into a one-stage all-in-one paradigm. To fully unleash the potential of MLLMs, we…

    Submitted 26 June, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: Accepted by ICML 2024. Code and models are released at https://github.com/Alpha-VLLM/LLaMA2-Accessory

  44. arXiv:2402.03259  [pdf, other]

    cs.HC

    Meeting Bridges: Designing Information Artifacts that Bridge from Synchronous Meetings to Asynchronous Collaboration

    Authors: Ruotong Wang, Lin Qiu, Justin Cranshaw, Amy X. Zhang

    Abstract: A recent surge in remote meetings has led to complaints of "Zoom fatigue" and "collaboration overload," negatively impacting worker productivity and well-being. One way to alleviate the burden of meetings is to de-emphasize their synchronous participation by shifting work to and enabling sensemaking during post-meeting asynchronous activities. Towards this goal, we propose the design concept o…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: accepted to CSCW 2024

  45. arXiv:2402.02349  [pdf]

    eess.IV cs.CV

    Vision Transformer-based Multimodal Feature Fusion Network for Lymphoma Segmentation on PET/CT Images

    Authors: Huan Huang, Liheng Qiu, Shenmiao Yang, Longxi Li, Jiaofen Nan, Yanting Li, Chuang Han, Fubao Zhu, Chen Zhao, Weihua Zhou

    Abstract: Background: Diffuse large B-cell lymphoma (DLBCL) segmentation is a challenge in medical image analysis. Traditional segmentation methods for lymphoma struggle with the complex patterns and the presence of DLBCL lesions. Objective: We aim to develop an accurate method for lymphoma segmentation with 18F-Fluorodeoxyglucose positron emission tomography (PET) and computed tomography (CT) images. Metho…

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: 14 pages, 6 figures; reference added

  46. arXiv:2401.17018  [pdf, other]

    cs.DC

    GPU-Accelerated Batch-Dynamic Subgraph Matching

    Authors: Linshan Qiu, Lu Chen, Hailiang Jie, Xiangyu Ke, Yunjun Gao, Yang Liu, Zetao Zhang

    Abstract: Subgraph matching has garnered increasing attention for its diverse real-world applications. Given the dynamic nature of real-world graphs, addressing evolving scenarios without incurring prohibitive overheads has been a focus of research. However, existing approaches for dynamic subgraph matching often proceed serially, retrieving incremental matches for each updated edge individually. This appro…

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: This paper has been accepted by ICDE 2024

  47. arXiv:2401.12433  [pdf, other]

    cs.CV

    A Novel Garment Transfer Method Supervised by Distilled Knowledge of Virtual Try-on Model

    Authors: Naiyu Fang, Lemiao Qiu, Shuyou Zhang, Zili Wang, Kerui Hu, Jianrong Tan

    Abstract: This paper proposes a novel garment transfer method supervised with knowledge distillation from virtual try-on. Our method first reasons the transfer parsing to provide shape prior to downstream tasks. We employ a multi-phase teaching strategy to supervise the training of the transfer parsing reasoning model, learning the response and feature knowledge from the try-on parsing reasoning model. To c…

    Submitted 4 April, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

  48. arXiv:2401.11740  [pdf, other]

    cs.CV cs.LG

    Multi-level Cross-modal Alignment for Image Clustering

    Authors: Liping Qiu, Qin Zhang, Xiaojun Chen, Shaotian Cai

    Abstract: Recently, the cross-modal pretraining model has been employed to produce meaningful pseudo-labels to supervise the training of an image clustering model. However, numerous erroneous alignments in a cross-modal pre-training model could produce poor-quality pseudo-labels and degrade clustering performance. To solve the aforementioned issue, we propose a novel Multi-level Cross-modal Alignmen…

    Submitted 22 January, 2024; originally announced January 2024.

  49. arXiv:2401.10600  [pdf, other]

    astro-ph.GA astro-ph.HE

    Changing-look NLS1 galaxies, their detection with SVOM, and the case of NGC 1566

    Authors: D. W. Xu, S. Komossa, D. Grupe, J. Wang, L. P. Xin, X. H. Han, J. Y. Wei, J. Y. Bai, E. Bon, F. Cangemi, B. Cordier, M. Dennefeld, L. C. Gallo, W. Kollatschny, De-Feng Kong, M. W. Ochmann, Y. L. Qiu, N. Schartel

    Abstract: We discuss applications of the study of the new and barely explored class of changing-look (CL) narrow-line Seyfert 1 (NLS1) galaxies and comment on their detection with the space mission SVOM (Space Variable Objects Monitor). We highlight the case of NGC 1566, which is outstanding in many respects, for instance as one of the nearest known CL AGN undergoing exceptional outbursts. Its NLS1 nature i…

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: 14 pages, 4 figures. Accepted for publication in the Universe Special Issue "A Multimessenger View of Supermassive Black Holes and the Quasar Main Sequence"

  50. arXiv:2401.10278  [pdf, other]

    eess.SP cs.AI cs.LG cs.MM q-bio.NC

    EEGFormer: Towards Transferable and Interpretable Large-Scale EEG Foundation Model

    Authors: Yuqi Chen, Kan Ren, Kaitao Song, Yansen Wang, Yifan Wang, Dongsheng Li, Lili Qiu

    Abstract: Self-supervised learning has emerged as a highly effective approach in the fields of natural language processing and computer vision. It is also applicable to brain signals such as electroencephalography (EEG) data, given the abundance of available unlabeled data that exist in a wide spectrum of real-world medical applications ranging from seizure detection to wave analysis. The existing works lev…

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: A preprint version of an ongoing work