Skip to main content

Showing 1–50 of 606 results for author: Xiao, W

  1. arXiv:2407.11208  [pdf

    cond-mat.mtrl-sci physics.comp-ph

    Machine learning accelerated prediction of Ce-based ternary compounds involving antagonistic pairs

    Authors: Weiyi Xia, Wei-Shen Tee, Paul C. Canfield, Fernando Assis Garcia, Raquel D Ribeiro, Yongbin Lee, Liqin Ke, Rebecca Flint, Cai-Zhuang Wang

    Abstract: The discovery of novel quantum materials within ternary phase spaces containing antagonistic pair such as Fe with Bi, Pb, In, and Ag, presents significant challenges yet holds great potential. In this work, we investigate the stabilization of these immiscible pairs through the integration of Cerium (Ce), an abundant rare-earth and cost-effective element. By employing a machine learning (ML)-guided… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

  2. arXiv:2407.07871  [pdf, other

    cs.IR

    Enhancing HNSW Index for Real-Time Updates: Addressing Unreachable Points and Performance Degradation

    Authors: Wentao Xiao, Yueyang Zhan, Rui Xi, Mengshu Hou, Jianming Liao

    Abstract: The approximate nearest neighbor search (ANNS) is a fundamental and essential component in data mining and information retrieval, with graph-based methodologies demonstrating superior performance compared to alternative approaches. Extensive research efforts have been dedicated to improving search efficiency by developing various graph-based indices, such as HNSW (Hierarchical Navigable Small Worl… ▽ More

    Submitted 15 July, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

  3. arXiv:2407.07614  [pdf, other

    cs.CV

    MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis

    Authors: Wanggui He, Siming Fu, Mushui Liu, Xierui Wang, Wenyi Xiao, Fangxun Shu, Yi Wang, Lei Zhang, Zhelun Yu, Haoyuan Li, Ziwei Huang, LeiLei Gan, Hao Jiang

    Abstract: Auto-regressive models have made significant progress in the realm of language generation, yet they do not perform on par with diffusion models in the domain of image synthesis. In this work, we introduce MARS, a novel framework for T2I generation that incorporates a specially designed Semantic Vision-Language Integration Expert (SemVIE). This innovative component integrates pre-trained LLMs by in… ▽ More

    Submitted 11 July, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

    Comments: 14 pages, 9 figures

  4. arXiv:2407.05984  [pdf, other

    eess.IV

    MBA-Net: SAM-driven Bidirectional Aggregation Network for Ovarian Tumor Segmentation

    Authors: Yifan Gao, Wei Xia, Wenkui Wang, Xin Gao

    Abstract: Accurate segmentation of ovarian tumors from medical images is crucial for early diagnosis, treatment planning, and patient management. However, the diverse morphological characteristics and heterogeneous appearances of ovarian tumors pose significant challenges to automated segmentation methods. In this paper, we propose MBA-Net, a novel architecture that integrates the powerful segmentation capa… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: MICCAI 2024

  5. arXiv:2407.02883  [pdf, other

    cs.IR cs.CL

    CoIR: A Comprehensive Benchmark for Code Information Retrieval Models

    Authors: Xiangyang Li, Kuicai Dong, Yi Quan Lee, Wei Xia, Yichun Yin, Hao Zhang, Yong Liu, Yasheng Wang, Ruiming Tang

    Abstract: Despite the substantial success of Information Retrieval (IR) in various NLP tasks, most IR systems predominantly handle queries and corpora in natural language, neglecting the domain of code retrieval. Code retrieval is critically important yet remains under-explored, with existing methods and benchmarks inadequately representing the diversity of code in various domains and tasks. Addressing this… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  6. arXiv:2407.02875  [pdf, ps, other

    math.DG

    The structure of deformed double complexes on the Iwasawa manifold

    Authors: Yan Hu, Wei Xia

    Abstract: The Kuranishi family of the Iwasawa manifold give rise naturally to a family of (deformed) double complexes. By using the structure theorem of double complexes due to Stelzig and Qi-Khovanov, we show there are exactly $3$ isomorphism types in this family and determine explicitly structures of these $3$ types. As an application, we computed the Frölicher spectral sequence for each fiber in the Kura… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 26 pages

    MSC Class: 57T15; 32Q99; 32C35; 18G40

  7. arXiv:2407.01245  [pdf, other

    cs.AI cs.CY

    SINKT: A Structure-Aware Inductive Knowledge Tracing Model with Large Language Model

    Authors: Lingyue Fu, Hao Guan, Kounianhua Du, Jianghao Lin, Wei Xia, Weinan Zhang, Ruiming Tang, Yasheng Wang, Yong Yu

    Abstract: Knowledge Tracing (KT) aims to determine whether students will respond correctly to the next question, which is a crucial task in intelligent tutoring systems (ITS). In educational KT scenarios, transductive ID-based methods often face severe data sparsity and cold start problems, where interactions between individual students and questions are sparse, and new questions and concepts consistently a… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  8. arXiv:2407.00772  [pdf, other

    cond-mat.str-el cond-mat.mtrl-sci physics.chem-ph

    Core-level signature of long-range density-wave order and short-range excitonic correlations probed by attosecond broadband spectroscopy

    Authors: Alfred Zong, Sheng-Chih Lin, Shunsuke A. Sato, Emma Berger, Bailey R. Nebgen, Marcus Hui, B. Q. Lv, Yun Cheng, Wei Xia, Yanfeng Guo, Dao Xiang, Michael W. Zuerch

    Abstract: Advances in attosecond core-level spectroscopies have successfully unlocked the fastest dynamics involving high-energy electrons. Yet, these techniques are not conventionally regarded as an appropriate probe for low-energy quasiparticle interactions that govern the ground state of quantum materials, nor for studying long-range order because of their limited sensitivity to local charge environments… ▽ More

    Submitted 16 July, 2024; v1 submitted 30 June, 2024; originally announced July 2024.

  9. arXiv:2406.19646  [pdf, other

    cs.RO

    Time-optimal Flight in Cluttered Environments via Safe Reinforcement Learning

    Authors: Wei Xiao, Zhaohan Feng, Ziyu Zhou, Jian Sun, Gang Wang, Jie Chen

    Abstract: This paper addresses the problem of guiding a quadrotor through a predefined sequence of waypoints in cluttered environments, aiming to minimize the flight time while avoiding collisions. Previous approaches either suffer from prolonged computational time caused by solving complex non-convex optimization problems or are limited by the inherent smoothness of polynomial trajectory representations, t… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: 7 pages, 3 figures,

  10. arXiv:2406.16935  [pdf, other

    eess.SP cs.AI

    Benchmarking Out-of-Distribution Generalization Capabilities of DNN-based Encoding Models for the Ventral Visual Cortex

    Authors: Spandan Madan, Will Xiao, Mingran Cao, Hanspeter Pfister, Margaret Livingstone, Gabriel Kreiman

    Abstract: We characterized the generalization capabilities of DNN-based encoding models when predicting neuronal responses from the visual cortex. We collected \textit{MacaqueITBench}, a large-scale dataset of neural population responses from the macaque inferior temporal (IT) cortex to over $300,000$ images, comprising $8,233$ unique natural images presented to seven monkeys over $109$ sessions. Using \tex… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  11. arXiv:2406.14024  [pdf, other

    cs.CL

    LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feedback

    Authors: Bofei Gao, Zefan Cai, Runxin Xu, Peiyi Wang, Ce Zheng, Runji Lin, Keming Lu, Dayiheng Liu, Chang Zhou, Wen Xiao, Junjie Hu, Tianyu Liu, Baobao Chang

    Abstract: Mathematical verfier achieves success in mathematical reasoning tasks by validating the correctness of solutions. However, existing verifiers are trained with binary classification labels, which are not informative enough for the model to accurately assess the solutions. To mitigate the aforementioned insufficiency of binary labels, we introduce step-wise natural language feedbacks as rationale la… ▽ More

    Submitted 8 July, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: 9 pages

  12. arXiv:2406.13702  [pdf

    cond-mat.str-el cond-mat.mtrl-sci

    Van-Hove annihilation and nematic instability on a Kagome lattice

    Authors: Yu-Xiao Jiang, Sen Shao, Wei Xia, M. Michael Denner, Julian Ingham, Md Shafayat Hossain, Qingzheng Qiu, Xiquan Zheng, Hongyu Chen, Zi-Jia Cheng, Xian P. Yang, Byunghoon Kim, Jia-Xin Yin, Songbo Zhang, Maksim Litskevich, Qi Zhang, Tyler A. Cochran, Yingying Peng, Guoqing Chang, Yanfeng Guo, Ronny Thomale, Titus Neupert, M. Zahid Hasan

    Abstract: Novel states of matter arise in quantum materials due to strong interactions among electrons. A nematic phase breaks the point group symmetry of the crystal lattice and is known to emerge in correlated materials. Here we report the observation of an intra-unit-cell nematic order and signatures of Pomeranchuk instability in the Kagome metal ScV6Sn6. Using scanning tunneling microscopy and spectrosc… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 19 pages, 5 figures, accepted for publication in Nature materials

    Journal ref: Nat. Mater. (2024)

  13. arXiv:2406.13025  [pdf, other

    cs.LG cs.RO eess.SY

    ABNet: Attention BarrierNet for Safe and Scalable Robot Learning

    Authors: Wei Xiao, Tsun-Hsuan Wang, Daniela Rus

    Abstract: Safe learning is central to AI-enabled robots where a single failure may lead to catastrophic results. Barrier-based method is one of the dominant approaches for safe robot learning. However, this method is not scalable, hard to train, and tends to generate unstable signals under noisy inputs that are challenging to be deployed for robots. To address these challenges, we propose a novel Attentio… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 18 pages

  14. arXiv:2406.12463  [pdf, other

    cs.CV eess.IV

    LFMamba: Light Field Image Super-Resolution with State Space Model

    Authors: Wang xia, Yao Lu, Shunzhou Wang, Ziqi Wang, Peiqi Xia, Tianfei Zhou

    Abstract: Recent years have witnessed significant advancements in light field image super-resolution (LFSR) owing to the progress of modern neural networks. However, these methods often face challenges in capturing long-range dependencies (CNN-based) or encounter quadratic computational complexities (Transformer-based), which limit their performance. Recently, the State Space Model (SSM) with selective scan… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  15. arXiv:2406.11162  [pdf, other

    cs.CL

    How Good are LLMs at Relation Extraction under Low-Resource Scenario? Comprehensive Evaluation

    Authors: Dawulie Jinensibieke, Mieradilijiang Maimaiti, Wentao Xiao, Yuanhang Zheng, Xiaobo Wang

    Abstract: Relation Extraction (RE) serves as a crucial technology for transforming unstructured text into structured information, especially within the framework of Knowledge Graph development. Its importance is emphasized by its essential role in various downstream tasks. Besides the conventional RE methods which are based on neural networks and pre-trained language models, large language models (LLMs) are… ▽ More

    Submitted 25 June, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

  16. arXiv:2406.10943  [pdf, other

    cs.CV

    Rectified Iterative Disparity for Stereo Matching

    Authors: Weiqing Xiao

    Abstract: Both uncertainty-assisted and iteration-based methods have achieved great success in stereo matching. However, existing uncertainty estimation methods take a single image and the corresponding disparity as input, which imposes higher demands on the estimation network. In this paper, we propose Cost volume-based disparity Uncertainty Estimation (UEC). Based on the rich similarity information in the… ▽ More

    Submitted 2 July, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

  17. arXiv:2406.08858  [pdf, other

    cs.RO cs.CV cs.LG eess.SY

    OmniH2O: Universal and Dexterous Human-to-Humanoid Whole-Body Teleoperation and Learning

    Authors: Tairan He, Zhengyi Luo, Xialin He, Wenli Xiao, Chong Zhang, Weinan Zhang, Kris Kitani, Changliu Liu, Guanya Shi

    Abstract: We present OmniH2O (Omni Human-to-Humanoid), a learning-based system for whole-body humanoid teleoperation and autonomy. Using kinematic pose as a universal control interface, OmniH2O enables various ways for a human to control a full-sized humanoid with dexterous hands, including using real-time teleoperation through VR headset, verbal instruction, and RGB camera. OmniH2O also enables full autono… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Project page: https://omni.human2humanoid.com/

  18. arXiv:2406.08839  [pdf, other

    cs.CV

    NeRF Director: Revisiting View Selection in Neural Volume Rendering

    Authors: Wenhui Xiao, Rodrigo Santa Cruz, David Ahmedt-Aristizabal, Olivier Salvado, Clinton Fookes, Leo Lebrat

    Abstract: Neural Rendering representations have significantly contributed to the field of 3D computer vision. Given their potential, considerable efforts have been invested to improve their performance. Nonetheless, the essential question of selecting training views is yet to be thoroughly investigated. This key aspect plays a vital role in achieving high-quality results and aligns with the well-known tenet… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: CVPR2024

  19. arXiv:2406.08313  [pdf, other

    hep-ph hep-ex

    Searching for bound states in the open strangeness systems

    Authors: C. W. Xiao, J. J. Wu

    Abstract: Inspired by the recent findings of $Z_{cs}$ and $P_{cs}$ states, we investigate the strong interactions of the systems with open strangeness(es) from the light sector to the heavy sector (no beauty quark), where the interaction potential is derived from the vector meson exchange mechanism in $t$- and $u$-channels. In the current work, we discuss all of single channel cases for the open strangeness… ▽ More

    Submitted 19 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: More comments added

  20. arXiv:2406.06953  [pdf, other

    cs.CV

    Stepwise Regression and Pre-trained Edge for Robust Stereo Matching

    Authors: Weiqing Xiao, Wei Zhao

    Abstract: Due to the difficulty in obtaining real samples and ground truth, the generalization performance and the fine-tuned performance are critical for the feasibility of stereo matching methods in real-world applications. However, the presence of substantial disparity distributions and density variations across different datasets presents significant challenges for the generalization and fine-tuning of… ▽ More

    Submitted 16 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

  21. arXiv:2406.06005  [pdf, other

    cs.RO cs.GR eess.SY

    WoCoCo: Learning Whole-Body Humanoid Control with Sequential Contacts

    Authors: Chong Zhang, Wenli Xiao, Tairan He, Guanya Shi

    Abstract: Humanoid activities involving sequential contacts are crucial for complex robotic interactions and operations in the real world and are traditionally solved by model-based motion planning, which is time-consuming and often relies on simplified dynamics models. Although model-free reinforcement learning (RL) has become a powerful tool for versatile and robust whole-body humanoid control, it still r… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Website and Videos: https://lecar-lab.github.io/wococo/

  22. arXiv:2406.04594  [pdf, other

    cs.DC cs.AI cs.LG

    Boosting Large-scale Parallel Training Efficiency with C4: A Communication-Driven Approach

    Authors: Jianbo Dong, Bin Luo, Jun Zhang, Pengcheng Zhang, Fei Feng, Yikai Zhu, Ang Liu, Zian Chen, Yi Shi, Hairong Jiao, Gang Lu, Yu Guan, Ennan Zhai, Wencong Xiao, Hanyu Zhao, Man Yuan, Siran Yang, Xiang Li, Jiamang Wang, Rui Men, Jianwei Zhang, Huang Zhong, Dennis Cai, Yuan Xie, Binzhang Fu

    Abstract: The emergence of Large Language Models (LLMs) has necessitated the adoption of parallel training techniques, involving the deployment of thousands of GPUs to train a single model. Unfortunately, we have found that the efficiency of current parallel training is often suboptimal, largely due to the following two main issues. Firstly, hardware failures are inevitable, leading to interruptions in the… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  23. arXiv:2406.03243  [pdf, other

    cs.AR cs.DC cs.LG

    Llumnix: Dynamic Scheduling for Large Language Model Serving

    Authors: Biao Sun, Ziming Huang, Hanyu Zhao, Wencong Xiao, Xinyi Zhang, Yong Li, Wei Lin

    Abstract: Inference serving for large language models (LLMs) is the key to unleashing their potential in people's daily lives. However, efficient LLM serving remains challenging today because the requests are inherently heterogeneous and unpredictable in terms of resource and latency requirements, as a result of the diverse applications and the dynamic execution nature of LLMs. Existing systems are fundamen… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: To appear at OSDI '24; open-source repo will be available in June 2024

  24. arXiv:2406.02069  [pdf, other

    cs.CL cs.AI

    PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling

    Authors: Zefan Cai., Yichi Zhang, Bofei Gao, Yuliang Liu, Tianyu Liu, Keming Lu, Wayne Xiong, Yue Dong, Baobao Chang, Junjie Hu, Wen Xiao

    Abstract: In this study, we investigate whether attention-based information flow inside large language models (LLMs) is aggregated through noticeable patterns for long context processing. Our observations reveal that LLMs aggregate information through Pyramidal Information Funneling where attention is scattering widely in lower layers, progressively consolidating within specific contexts, and ultimately foc… ▽ More

    Submitted 16 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  25. arXiv:2406.00439  [pdf, other

    cs.RO cs.CV

    Learning Manipulation by Predicting Interaction

    Authors: Jia Zeng, Qingwen Bu, Bangjun Wang, Wenke Xia, Li Chen, Hao Dong, Haoming Song, Dong Wang, Di Hu, Ping Luo, Heming Cui, Bin Zhao, Xuelong Li, Yu Qiao, Hongyang Li

    Abstract: Representation learning approaches for robotic manipulation have boomed in recent years. Due to the scarcity of in-domain robot data, prevailing methodologies tend to leverage large-scale human video datasets to extract generalizable features for visuomotor policy learning. Despite the progress achieved, prior endeavors disregard the interactive dynamics that capture behavior patterns and physical… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: Accepted to RSS 2024. Project page: https://github.com/OpenDriveLab/MPI

  26. arXiv:2405.19728  [pdf, ps, other

    math.NT

    Legendre symbols related to $D_p(b,1)$

    Authors: Xin-Qi Luo, Wei Xia

    Abstract: Let $p$ be an odd prime. For any $b,c\in\mathbb{Z}$, Z.-W. Sun introduced the new-type determinant $$D_p(b,c)=|(i^2+bij+cj^2)^{p-2}|_{1\leqslant i,j\leqslant p-1},$$ and studied its arithmetic properties. In this paper we mainly prove that $$\left(\frac{D_p(b,1)}{p}\right)=\left(\frac{2b}{p}\right)$$ when $(\frac{b^2-4}{p})=-1$ and $p\equiv1\pmod 4$. As an application of our result, we confirm sev… ▽ More

    Submitted 31 May, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

    Comments: 8pages

  27. arXiv:2405.19586  [pdf, other

    cs.CV cs.LG cs.RO

    SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation

    Authors: Junjie Zhang, Chenjia Bai, Haoran He, Wenke Xia, Zhigang Wang, Bin Zhao, Xiu Li, Xuelong Li

    Abstract: Acquiring a multi-task imitation policy in 3D manipulation poses challenges in terms of scene understanding and action prediction. Current methods employ both 3D representation and multi-view 2D representation to predict the poses of the robot's end-effector. However, they still require a considerable amount of high-quality robot trajectories, and suffer from limited generalization in unseen tasks… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: ICML 2024. Project page: https://sam-embodied.github.io

  28. arXiv:2405.19487  [pdf, other

    cs.CL

    A Full-duplex Speech Dialogue Scheme Based On Large Language Models

    Authors: Peng Wang, Songshuo Lu, Yaohua Tang, Sijie Yan, Yuanjun Xiong, Wei Xia

    Abstract: We present a generative dialogue system capable of operating in a full-duplex manner, allowing for seamless interaction. It is based on a large language model (LLM) carefully aligned to be aware of a perception module, a motor function module, and the concept of a simple finite state machine (called neural FSM) with two states. The perception and motor function modules operate simultaneously, allo… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  29. arXiv:2405.17627  [pdf, other

    cs.LG

    Salutary Labeling with Zero Human Annotation

    Authors: Wenxiao Xiao, Hongfu Liu

    Abstract: Active learning strategically selects informative unlabeled data points and queries their ground truth labels for model training. The prevailing assumption underlying this machine learning paradigm is that acquiring these ground truth labels will optimally enhance model performance. However, this assumption may not always hold true or maximize learning capacity, particularly considering the costly… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  30. arXiv:2405.16765  [pdf, ps, other

    cs.LG eess.SP

    Study of Robust Direction Finding Based on Joint Sparse Representation

    Authors: Y. Li, W. Xiao, L. Zhao, Z. Huang, Q. Li, L. Li, R. C. de Lamare

    Abstract: Standard Direction of Arrival (DOA) estimation methods are typically derived based on the Gaussian noise assumption, making them highly sensitive to outliers. Therefore, in the presence of impulsive noise, the performance of these methods may significantly deteriorate. In this paper, we model impulsive noise as Gaussian noise mixed with sparse outliers. By exploiting their statistical differences,… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: 6 pages, 4 figures

  31. arXiv:2405.15210  [pdf

    cond-mat.str-el

    Spin chirality engineering induced giant topological Hall effect in a kagome magnet

    Authors: Wei Xia, Shihao Zhang, Jian Yuan, Yurui Wei, Haonan Wang, Hong Du, Xiangqi Liu, Jiangteng Guo, Zicheng Tao, Ke Qu, Xia Wang, Xuerong Liu, Wenbo Wang, Jinguang Cheng, Yulin Chen, Jianpeng Liu, Ruidan Zhong, Xuewen Fu, Zhenzhong Yang, Yanfeng Guo

    Abstract: The ferrimagnet TbMn6Sn6 has attracted vast attention, because its pristine Mn kagome lattice with strong spin-orbit coupling and out-of-plane Tb-Mn exchange supports quantum-limit Chern topological magnetism which can be described by the simple spinless Haldane model. We unveil herein that engineering the pristine kagome lattice through partial replacement of Mn by nonmagnetic Cr which tends to c… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 33 pages,4 main figures and 16 SI figures

  32. arXiv:2405.15202  [pdf, other

    cs.CL cs.CR

    Cross-Task Defense: Instruction-Tuning LLMs for Content Safety

    Authors: Yu Fu, Wen Xiao, Jia Chen, Jiachen Li, Evangelos Papalexakis, Aichi Chien, Yue Dong

    Abstract: Recent studies reveal that Large Language Models (LLMs) face challenges in balancing safety with utility, particularly when processing long texts for NLP tasks like summarization and translation. Despite defenses against malicious short questions, the ability of LLMs to safely handle dangerous long content, such as manuals teaching illicit activities, remains unclear. Our work aims to develop robu… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: accepted to NAACL2024 TrustNLP workshop

  33. arXiv:2405.12442  [pdf, other

    cs.IR cs.AI

    Learning Structure and Knowledge Aware Representation with Large Language Models for Concept Recommendation

    Authors: Qingyao Li, Wei Xia, Kounianhua Du, Qiji Zhang, Weinan Zhang, Ruiming Tang, Yong Yu

    Abstract: Concept recommendation aims to suggest the next concept for learners to study based on their knowledge states and the human knowledge system. While knowledge states can be predicted using knowledge tracing models, previous approaches have not effectively integrated the human knowledge system into the process of designing these educational models. In the era of rapidly evolving Large Language Model… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: 11 pages, 8 figures

  34. arXiv:2405.11024  [pdf, other

    cs.LG cs.AI

    GraSS: Combining Graph Neural Networks with Expert Knowledge for SAT Solver Selection

    Authors: Zhanguang Zhang, Didier Chetelat, Joseph Cotnareanu, Amur Ghose, Wenyi Xiao, Hui-Ling Zhen, Yingxue Zhang, Jianye Hao, Mark Coates, Mingxuan Yuan

    Abstract: Boolean satisfiability (SAT) problems are routinely solved by SAT solvers in real-life applications, yet solving time can vary drastically between solvers for the same instance. This has motivated research into machine learning models that can predict, for a given SAT instance, which solver to select among several options. Existing SAT solver selection methods all rely on some hand-picked instance… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: Accepted by KDD 2024

  35. arXiv:2405.07156  [pdf

    cond-mat.str-el

    Direct visualization of the impurity occupancy roadmap in Ni-substituted van der Waals ferromagnet Fe3GaTe2

    Authors: Jian Yuan, Haonan Wang, Xiaofei Hou, Binshuo Zhang, Yurui Wei, Jiangteng Guo, Lu Sun, Zhenhai Yu, Zhikai Li, Xiangqi Liu, Wei Xia, Xia Wang, Xuerong Liu, Yulin Chen, Shihao Zhang, Xuewen Fu, Ke Qu, Zhenzhong Yang, Yanfeng Guo

    Abstract: Impurity substitution is a general strategy to study the intrinsic properties of a quantum material. However, when the target element has more than one Wyckoff position in the lattice, it is a big challenge but with extreme necessity to know the exact position and order of the occupancy of impurity atoms. Via comprehensive experimental and theoretical investigations, we establish herein the roadma… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: 24 pages,5 main figures+4 SI figures+2 SI tables

  36. arXiv:2405.05739  [pdf

    physics.plasm-ph

    Preliminary Exploration on the Low-Pressure Ar-O2 Plasma Generated by Low-Frequency Alternating Current (AC) Power Supply

    Authors: Niaz Wali, W. W. Xiao, Q. U. Din, N. U. Rehman, C. Y. Wang, J. T. Ma, W. J. Zhong, Q. W. Yang

    Abstract: This study reports a low-frequency alternating current (AC) power supply as a novel approach for generating low-pressure capacitively coupled Ar-O2 plasma, offering advantages in cost, compactness, and operational simplicity, which are crucial for both material science and biological applications. The effectiveness of low-frequency AC-generated plasma against traditional RF systems by examining ke… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: 16 pages, 7 figures

  37. arXiv:2405.04434  [pdf, other

    cs.CL cs.AI

    DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

    Authors: DeepSeek-AI, Aixin Liu, Bei Feng, Bin Wang, Bingxuan Wang, Bo Liu, Chenggang Zhao, Chengqi Dengr, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Hanwei Xu, Hao Yang, Haowei Zhang, Honghui Ding , et al. (132 additional authors not shown)

    Abstract: We present DeepSeek-V2, a strong Mixture-of-Experts (MoE) language model characterized by economical training and efficient inference. It comprises 236B total parameters, of which 21B are activated for each token, and supports a context length of 128K tokens. DeepSeek-V2 adopts innovative architectures including Multi-head Latent Attention (MLA) and DeepSeekMoE. MLA guarantees efficient inference… ▽ More

    Submitted 19 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

  38. arXiv:2405.02355  [pdf, other

    cs.SE cs.AI

    CodeGRAG: Extracting Composed Syntax Graphs for Retrieval Augmented Cross-Lingual Code Generation

    Authors: Kounianhua Du, Renting Rui, Huacan Chai, Lingyue Fu, Wei Xia, Yasheng Wang, Ruiming Tang, Yong Yu, Weinan Zhang

    Abstract: Utilizing large language models to generate codes has shown promising meaning in software development revolution. Despite the intelligence shown by the general large language models, their specificity in code generation can still be improved due to the syntactic gap and mismatched vocabulary existing among natural language and different programming languages. In addition, programming languages are… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  39. arXiv:2405.02180  [pdf, other

    cs.LG eess.SY

    A Flow-Based Model for Conditional and Probabilistic Electricity Consumption Profile Generation and Prediction

    Authors: Weijie Xia, Chenguang Wang, Peter Palensky, Pedro P. Vergara

    Abstract: Residential Load Profile (RLP) generation and prediction are critical for the operation and planning of distribution networks, especially as diverse low-carbon technologies (e.g., photovoltaic and electric vehicles) are increasingly adopted. This paper introduces a novel flow-based generative model, termed Full Convolutional Profile Flow (FCPFlow), which is uniquely designed for both conditional a… ▽ More

    Submitted 9 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

  40. arXiv:2404.16147  [pdf, other

    cs.RO

    Chat2Scenario: Scenario Extraction From Dataset Through Utilization of Large Language Model

    Authors: Yongqi Zhao, Wenbo Xiao, Tomislav Mihalj, Jia Hu, Arno Eichberger

    Abstract: The advent of Large Language Models (LLM) provides new insights to validate Automated Driving Systems (ADS). In the herein-introduced work, a novel approach to extracting scenarios from naturalistic driving datasets is presented. A framework called Chat2Scenario is proposed leveraging the advanced Natural Language Processing (NLP) capabilities of LLM to understand and identify different driving sc… ▽ More

    Submitted 26 April, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

    Comments: IEEE Intelligent Vehicles Symposium (IV 2024)

  41. arXiv:2404.14233  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback

    Authors: Wenyi Xiao, Ziwei Huang, Leilei Gan, Wanggui He, Haoyuan Li, Zhelun Yu, Hao Jiang, Fei Wu, Linchao Zhu

    Abstract: The rapidly developing Large Vision Language Models (LVLMs) have shown notable capabilities on a range of multi-modal tasks, but still face the hallucination phenomena where the generated texts do not align with the given contexts, significantly restricting the usages of LVLMs. Most previous work detects and mitigates hallucination at the coarse-grained level or requires expensive annotation (e.g.… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  42. arXiv:2404.13804  [pdf, other

    cs.DC cs.LG cs.NI eess.SY

    Adaptive Heterogeneous Client Sampling for Federated Learning over Wireless Networks

    Authors: Bing Luo, Wenli Xiao, Shiqiang Wang, Jianwei Huang, Leandros Tassiulas

    Abstract: Federated learning (FL) algorithms usually sample a fraction of clients in each round (partial participation) when the number of participants is large and the server's communication bandwidth is limited. Recent works on the convergence analysis of FL have focused on unbiased client sampling, e.g., sampling uniformly at random, which suffers from slow wall-clock time for convergence due to high deg… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: Published in IEEE Transactions on Mobile Computing (TMC). arXiv admin note: substantial text overlap with arXiv:2112.11256

  43. arXiv:2404.13033  [pdf, other

    cs.CL

    Sample Design Engineering: An Empirical Study of What Makes Good Downstream Fine-Tuning Samples for LLMs

    Authors: Biyang Guo, He Wang, Wenyilin Xiao, Hong Chen, Zhuxin Lee, Songqiao Han, Hailiang Huang

    Abstract: In the burgeoning field of Large Language Models (LLMs) like ChatGPT and LLaMA, Prompt Engineering (PE) is renowned for boosting zero-shot or in-context learning (ICL) through prompt modifications. Yet, the realm of the sample design for downstream fine-tuning, crucial for task-specific LLM adaptation, is largely unexplored. This paper introduces Sample Design Engineering (SDE), a methodical appro… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: 23 pages, 12 figures, 14 tables

  44. arXiv:2404.12728  [pdf, other

    cs.CL

    Relevant or Random: Can LLMs Truly Perform Analogical Reasoning?

    Authors: Chengwei Qin, Wenhan Xia, Tan Wang, Fangkai Jiao, Yuchen Hu, Bosheng Ding, Ruirui Chen, Shafiq Joty

    Abstract: Analogical reasoning is a unique ability of humans to address unfamiliar challenges by transferring strategies from relevant past experiences. One key finding in psychology is that compared with irrelevant past experiences, recalling relevant ones can help humans better handle new tasks. Coincidentally, the NLP community has also recently found that self-generating relevant examples in the context… ▽ More

    Submitted 23 June, 2024; v1 submitted 19 April, 2024; originally announced April 2024.

  45. arXiv:2404.08055  [pdf, other

    quant-ph

    Complexity enriched dynamical phases for fermions on graphs

    Authors: Wei Xia, Jie Zou, Xiaopeng Li

    Abstract: Dynamical quantum phase transitions, encompassing phenomena like many-body localization transitions and measurement-induced phase transitions, are often characterized and identified through the analysis of quantum entanglement. Here, we highlight that the dynamical phases defined by entanglement are further enriched by complexity. We investigate both the entanglement and Krylov complexity for ferm… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  46. arXiv:2404.07202  [pdf, other

    cs.CV cs.AI cs.CL

    UMBRAE: Unified Multimodal Decoding of Brain Signals

    Authors: Weihao Xia, Raoul de Charette, Cengiz Öztireli, Jing-Hao Xue

    Abstract: We address prevailing challenges of the brain-powered research, departing from the observation that the literature hardly recover accurate spatial information and require subject-specific models. To address these challenges, we propose UMBRAE, a unified multimodal decoding of brain signals. First, to extract instance-level conceptual and spatial details from neural signals, we introduce an efficie… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: Project Page: https://weihaox.github.io/UMBRAE

  47. arXiv:2404.02507  [pdf, other

    cs.CL

    Lifelong Event Detection with Embedding Space Separation and Compaction

    Authors: Chengwei Qin, Ruirui Chen, Ruochen Zhao, Wenhan Xia, Shafiq Joty

    Abstract: To mitigate forgetting, existing lifelong event detection methods typically maintain a memory module and replay the stored memory data during the learning of a new task. However, the simple combination of memory data and new-task samples can still result in substantial forgetting of previously acquired knowledge, which may occur due to the potential overlap between the feature distribution of new… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: NAACL 2024 main conference

  48. arXiv:2404.00881  [pdf, other

    math.OC

    Auxiliary-Variable Adaptive Control Lyapunov Barrier Functions for Spatio-Temporally Constrained Safety-Critical Applications

    Authors: Shuo Liu, Wei Xiao, Calin A. Belta

    Abstract: Recent work has shown that stabilizing an affine control system while optimizing a quadratic cost subject to state and control constraints can be mapped to a sequence of Quadratic Programs (QPs) using Control Barrier Functions (CBFs) and Control Lyapunov Functions (CLFs). One of the main challenges in this method is that the QPs could easily become infeasible under safety and spatio-temporal const… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: 8 pages, 4 figures. arXiv admin note: text overlap with arXiv:2310.00238

  49. arXiv:2403.16006  [pdf, other

    q-fin.PR q-fin.MF

    Crypto Inverse-Power Options and Fractional Stochastic Volatility

    Authors: Boyi Li, Weixuan Xia

    Abstract: Recent empirical evidence has highlighted the crucial role of jumps in both price and volatility within the cryptocurrency market. In this paper, we introduce an analytical model framework featuring fractional stochastic volatility, accommodating price--volatility co-jumps and volatility short-term dependency concurrently. We particularly focus on inverse options, including the emerging Quanto inv… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: 42 pages, 2 tables, 5 figures

    MSC Class: 60G22; 60G51; 60E10

  50. arXiv:2403.12959  [pdf, other

    cs.CV cs.AI cs.GR cs.LG cs.RO

    WHAC: World-grounded Humans and Cameras

    Authors: Wanqi Yin, Zhongang Cai, Ruisi Wang, Fanzhou Wang, Chen Wei, Haiyi Mei, Weiye Xiao, Zhitao Yang, Qingping Sun, Atsushi Yamashita, Ziwei Liu, Lei Yang

    Abstract: Estimating human and camera trajectories with accurate scale in the world coordinate system from a monocular video is a highly desirable yet challenging and ill-posed problem. In this study, we aim to recover expressive parametric human models (i.e., SMPL-X) and corresponding camera poses jointly, by leveraging the synergy between three critical players: the world, the human, and the camera. Our a… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Homepage: https://wqyin.github.io/projects/WHAC/