Skip to main content

Showing 1–50 of 352 results for author: Jia, W

  1. arXiv:2407.09121  [pdf, other

    cs.CL cs.AI

    Refuse Whenever You Feel Unsafe: Improving Safety in LLMs via Decoupled Refusal Training

    Authors: Youliang Yuan, Wenxiang Jiao, Wenxuan Wang, Jen-tse Huang, Jiahao Xu, Tian Liang, Pinjia He, Zhaopeng Tu

    Abstract: This study addresses a critical gap in safety tuning practices for Large Language Models (LLMs) by identifying and tackling a refusal position bias within safety tuning data, which compromises the models' ability to appropriately refuse generating unsafe content. We introduce a novel approach, Decoupled Refusal Training (DeRTa), designed to empower LLMs to refuse compliance to harmful prompts at a… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

  2. HAFormer: Unleashing the Power of Hierarchy-Aware Features for Lightweight Semantic Segmentation

    Authors: Guoan Xu, Wenjing Jia, Tao Wu, Ligeng Chen, Guangwei Gao

    Abstract: Both Convolutional Neural Networks (CNNs) and Transformers have shown great success in semantic segmentation tasks. Efforts have been made to integrate CNNs with Transformer models to capture both local and global context interactions. However, there is still room for enhancement, particularly when considering constraints on computational resources. In this paper, we introduce HAFormer, a model th… ▽ More

    Submitted 10 July, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

    Comments: 13 pages, 10 figures, 8 tables, IEEE Transactions on Image Processing

  3. arXiv:2407.02695  [pdf, other

    hep-th gr-qc math-ph math.DG

    Topics in Weyl Geometry and Quantum Anomalies

    Authors: Weizhen Jia

    Abstract: The first part of this thesis focuses on the Weyl-covariant nature of holography. We generalize the Fefferman-Graham (FG) ambient construction for conformal geometry to a corresponding construction for Weyl geometry. Through the Weyl-ambient construction, we investigate Weyl-covariant quantities on the Weyl manifold and define Weyl-obstruction tensors. We show that Weyl-obstruction tensors appear… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 178 pages, 3 figures; Ph.D. dissertation

  4. arXiv:2407.01026  [pdf, other

    cs.CL cs.AI

    Augmenting Document-level Relation Extraction with Efficient Multi-Supervision

    Authors: Xiangyu Lin, Weijia Jia, Zhiguo Gong

    Abstract: Despite its popularity in sentence-level relation extraction, distantly supervised data is rarely utilized by existing work in document-level relation extraction due to its noisy nature and low information density. Among its current applications, distantly supervised data is mostly used as a whole for pertaining, which is of low time efficiency. To fill in the gap of efficient and robust utilizati… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  5. arXiv:2406.14663  [pdf, other

    astro-ph.GA astro-ph.SR

    MagMar III -- Resisting the Pressure, Is the Magnetic Field Overwhelmed in NGC6334I?

    Authors: Paulo C. Cortes, Josep M. Girart, Patricio Sanhueza, Junhao Liu, Sergio Martin, Ian W. Stephens, Henrik Beuther, Patrick M. Koch, M. Fernandez-Lopez, Alvaro Sanchez-Monge, Jia-Wei Wang, Kaho Morii, Shanghuo Li, Piyali Saha, Qizhou Zhang, David Rebolledo, Luis A. Zapata, Ji-hyun Kang, Wenyu Jiao, Jongsoo Kim, Yu Cheng, Jihye Hwang, Eun Jung Chung, Spandan Choudhury, A-Ran Lyo , et al. (1 additional authors not shown)

    Abstract: We report on ALMA observations of polarized dust emission at 1.2 mm from NGC6334I, a source known for its significant flux outbursts. Between five months, our data show no substantial change in total intensity and a modest 8\% variation in linear polarization, suggesting a phase of stability or the conclusion of the outburst. The magnetic field, inferred from this polarized emission, displays a pr… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Accepted for Publication at the Astrophysical Journal

  6. arXiv:2406.13404  [pdf, other

    cs.DC

    Low-Latency Layer-Aware Proactive and Passive Container Migration in Meta Computing

    Authors: Mengjie Liu, Yihua Li, Fangyi Mou, Zhiqing Tang, Jiong Lou, Jianxiong Guo, Weijia Jia

    Abstract: Meta computing is a new computing paradigm that aims to efficiently utilize all network computing resources to provide fault-tolerant, personalized services with strong security and privacy guarantees. It also seeks to virtualize the Internet as many meta computers. In meta computing, tasks can be assigned to containers at edge nodes for processing, based on container images with multiple layers.… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: to be published in IEEE ICMC 2024

  7. arXiv:2406.13399  [pdf, other

    cs.AI

    VELO: A Vector Database-Assisted Cloud-Edge Collaborative LLM QoS Optimization Framework

    Authors: Zhi Yao, Zhiqing Tang, Jiong Lou, Ping Shen, Weijia Jia

    Abstract: The Large Language Model (LLM) has gained significant popularity and is extensively utilized across various domains. Most LLM deployments occur within cloud data centers, where they encounter substantial response delays and incur high costs, thereby impacting the Quality of Services (QoS) at the network edge. Leveraging vector database caching to store LLM request results at the edge can substanti… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: to be published in IEEE ICWS 2024

  8. arXiv:2406.13381  [pdf, other

    cs.CL

    CoAct: A Global-Local Hierarchy for Autonomous Agent Collaboration

    Authors: Xinming Hou, Mingming Yang, Wenxiang Jiao, Xing Wang, Zhaopeng Tu, Wayne Xin Zhao

    Abstract: Existing LLMs exhibit remarkable performance on various NLP tasks, but still struggle with complex real-world tasks, even equipped with advanced strategies like CoT and ReAct. In this work, we propose the CoAct framework, which transfers the hierarchical planning and collaboration patterns in human society to LLM systems. Specifically, our CoAct framework involves two agents: (1) A global planning… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 9 pages, 4 figures

  9. arXiv:2406.08308  [pdf, other

    cs.GR

    FSH: 3D Representation via Fibonacci Spherical Harmonics

    Authors: Zikuan Li, Anyi Huang, Wenru Jia, Qiaoyun Wu, Mingqiang Wei, Jun Wang

    Abstract: Spherical harmonics are a favorable technique for 3D representation, employing a frequency-based approach through the spherical harmonic transform (SHT). Typically, SHT is performed using equiangular sampling grids. However, these grids are non-uniform on spherical surfaces and exhibit local anisotropy, a common limitation in existing spherical harmonic decomposition methods. This paper proposes a… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  10. arXiv:2406.06561  [pdf, other

    cs.CL cs.AI

    Brainstorming Brings Power to Large Language Models of Knowledge Reasoning

    Authors: Zining Qin, Chenhao Wang, Huiling Qin, Weijia Jia

    Abstract: Large Language Models (LLMs) have demonstrated amazing capabilities in language generation, text comprehension, and knowledge reasoning. While a single powerful model can already handle multiple tasks, relying on a single perspective can lead to biased and unstable results. Recent studies have further improved the model's reasoning ability on a wide range of tasks by introducing multi-model collab… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  11. arXiv:2406.03246  [pdf, other

    physics.flu-dyn

    Intrinsic permeability of heterogeneous porous media

    Authors: Wenqiao Jiao, David Scheidweiler, Nolwenn Delouche, Alberto Guadagnini, Pietro de Anna

    Abstract: Providing a sound appraisal of the nature of the relationship between flow $(Q)$ and pressure drop $(ΔP)$ for porous media is a long-standing fundamental research challenge. A wide variety of environmental, societal and industrial issues, ranging, e.g., from water-soil system remediation to subsurface energy optimization, is affected by this critical issue. While such dependence is well represente… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 12 pages, 6 figures

  12. arXiv:2405.14312  [pdf, other

    cs.CV cs.CL cs.MM

    Improving Gloss-free Sign Language Translation by Reducing Representation Density

    Authors: Jinhui Ye, Xing Wang, Wenxiang Jiao, Junwei Liang, Hui Xiong

    Abstract: Gloss-free sign language translation (SLT) aims to develop well-performing SLT systems with no requirement for the costly gloss annotations, but currently still lags behind gloss-based approaches significantly. In this paper, we identify a representation density problem that could be a bottleneck in restricting the performance of gloss-free SLT. Specifically, the representation density problem des… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: Representation Density and Performance Drop

  13. arXiv:2405.12834  [pdf, other

    physics.flu-dyn

    Effect of Synthetic Jets Actuator Parameters on Deep Reinforcement Learning-Based Flow Control Performance in a Square Cylinder

    Authors: Wang Jia, Hang Xu

    Abstract: We utilize deep reinforcement learning (DRL) algorithms to precisely control the mass flow rates of synthetic jets located on the upper and lower surfaces of a square cylinder for active flow control. Through DRL-based active flow control (AFC) technology, we significantly reduce the lift and drag coefficients of the square cylinder at Reynolds number (Re) = 100 and Re=500, while completely suppre… ▽ More

    Submitted 22 May, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

  14. arXiv:2405.11758  [pdf, other

    cs.LG cs.AI

    Fed-Credit: Robust Federated Learning with Credibility Management

    Authors: Jiayan Chen, Zhirong Qian, Tianhui Meng, Xitong Gao, Tian Wang, Weijia Jia

    Abstract: Aiming at privacy preservation, Federated Learning (FL) is an emerging machine learning approach enabling model training on decentralized devices or data sources. The learning mechanism of FL relies on aggregating parameter updates from individual clients. However, this process may pose a potential security risk due to the presence of malicious devices. Existing solutions are either costly due to… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

  15. arXiv:2405.11586  [pdf

    physics.ed-ph

    The Bragg Diffraction Experiment Based on Ultrasonic Wave and Artificial Crystal Lattice

    Authors: Qiusong Chen, Wei Hou, Song Lin, GaoFu Liu, Weiyao Jia

    Abstract: The traditional Bragg crystal diffraction experiments use X-rays, harming the participants bodies. Therefore, many universities have not offered this basic experiment. Although microwave simulation Bragg experiments can reduce harm, there are still some potential dangers. To solve this dilemma, this article takes ultrasound as the experimental object and uses an artificial simulation of crystals t… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

  16. arXiv:2405.10987  [pdf, other

    cs.LG cs.AI

    Manifold-based Incomplete Multi-view Clustering via Bi-Consistency Guidance

    Authors: Huibing Wang, Mingze Yao, Yawei Chen, Yunqiu Xu, Haipeng Liu, Wei Jia, Xianping Fu, Yang Wang

    Abstract: Incomplete multi-view clustering primarily focuses on dividing unlabeled data into corresponding categories with missing instances, and has received intensive attention due to its superiority in real applications. Considering the influence of incomplete data, the existing methods mostly attempt to recover data by adding extra terms. However, for the unsupervised methods, a simple recovery strategy… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  17. arXiv:2405.08633  [pdf, other

    cond-mat.supr-con cond-mat.mtrl-sci cond-mat.str-el

    On the superconducting gap structure of the miassite Rh17S15: Nodal or nodeless?

    Authors: J. Y. Nie, C. C. Zhao, C. Q. Xu, B. Li, C. P. Tu, X. Zhang, D. Z. Dai, H. R. Wang, S. Xu, Wenhe Jiao, B. M. Wang, Zhu'an Xu, Xiaofeng Xu, S. Y. Li

    Abstract: Recent penetration depth measurement claimed the observation of unconventional superconductivity in the miassite Rh$_{17}$S$_{15}$ single crystals, evidenced by the linear-in-temperature penetration depth at low temperatures, thereby arguing for the presence of the lines of node in its superconducting gap structure. Here we measure the thermal conductivity of Rh$_{17}$S$_{15}$ single crystals down… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: 7 pages, 6 figures

  18. arXiv:2405.05505  [pdf, other

    cond-mat.mes-hall cond-mat.other cond-mat.quant-gas quant-ph

    Unveiling Higher-Order Topology via Polarized Topological Charges

    Authors: Wei Jia, Bao-Zong Wang, Ming-Jian Gao, Jun-Hong An

    Abstract: Real-space topological invariants were widely used to characterize chiral-symmetric higher-order topological phases (HOTPs). However, a momentum-space characterization to these HOTPs, which essentially reveals their intrinsic bulk-boundary correspondence and facilitates their detection in quantum simulation systems, is still lacking. Here, we propose an experimentally observable momentum-space cha… ▽ More

    Submitted 20 May, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: 8+8 pages, 4+3 figures. References are updated.Typos are corrected

  19. arXiv:2405.03493  [pdf

    physics.optics

    Polarization-entangled photon pair generation from an epsilon-near-zero metasurface

    Authors: Wenhe Jia, Grégoire Saerens, Ülle-Linda Talts, Helena Weigand, Robert J. Chapman, Liu Li, Rachel Grange, Yuanmu Yang

    Abstract: Polarization-entangled photon pair sources are essential for diverse quantum technologies, such as quantum communication, computation, and imaging. However, the generation of complex polarization-entangled quantum states has long been constrained by the available nonlinear susceptibility tensor of natural nonlinear crystals, necessitating a cumbersome and intricate setup for additional coherent su… ▽ More

    Submitted 13 June, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

  20. arXiv:2405.01882  [pdf, other

    cs.RO cs.AI eess.SP

    Millimeter Wave Radar-based Human Activity Recognition for Healthcare Monitoring Robot

    Authors: Zhanzhong Gu, Xiangjian He, Gengfa Fang, Chengpei Xu, Feng Xia, Wenjing Jia

    Abstract: Healthcare monitoring is crucial, especially for the daily care of elderly individuals living alone. It can detect dangerous occurrences, such as falls, and provide timely alerts to save lives. Non-invasive millimeter wave (mmWave) radar-based healthcare monitoring systems using advanced human activity recognition (HAR) models have recently gained significant attention. However, they encounter cha… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  21. arXiv:2404.17151  [pdf, other

    cs.MM cs.CV

    MorphText: Deep Morphology Regularized Arbitrary-shape Scene Text Detection

    Authors: Chengpei Xu, Wenjing Jia, Ruomei Wang, Xiaonan Luo, Xiangjian He

    Abstract: Bottom-up text detection methods play an important role in arbitrary-shape scene text detection but there are two restrictions preventing them from achieving their great potential, i.e., 1) the accumulation of false text segment detections, which affects subsequent processing, and 2) the difficulty of building reliable connections between text segments. Targeting these two problems, we propose a n… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: Accepted by Transaction on Multimedia

  22. arXiv:2404.16322  [pdf, other

    cs.DB

    Bridging Speed and Accuracy to Approximate $K$-Nearest Neighbor Search

    Authors: Mingyu Yang, Jiabao Jin, Xiangyu Wang, Zhitao Shen, Wei Jia, Wentao Li, Wei Wang

    Abstract: Approximate K-Nearest Neighbor (AKNN) search in high-dimensional spaces is a critical yet challenging problem. The efficiency of AKNN search largely depends on the computation of distances, a process that significantly affects the runtime. To improve computational efficiency, existing work often opts for estimating approximate distances rather than computing exact distances, at the cost of reduced… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 13 pages

  23. arXiv:2404.14569  [pdf, other

    gr-qc astro-ph.IM physics.ins-det quant-ph

    LIGO operates with quantum noise below the Standard Quantum Limit

    Authors: Wenxuan Jia, Victoria Xu, Kevin Kuns, Masayuki Nakano, Lisa Barsotti, Matthew Evans, Nergis Mavalvala, Rich Abbott, Ibrahim Abouelfettouh, Rana Adhikari, Alena Ananyeva, Stephen Appert, Koji Arai, Naoki Aritomi, Stuart Aston, Matthew Ball, Stefan Ballmer, David Barker, Beverly Berger, Joseph Betzwieser, Dripta Bhattacharjee, Garilynn Billingsley, Nina Bode, Edgard Bonilla, Vladimir Bossilkov , et al. (146 additional authors not shown)

    Abstract: Precision measurements of space and time, like those made by the detectors of the Laser Interferometer Gravitational-wave Observatory (LIGO), are often confronted with fundamental limitations imposed by quantum mechanics. The Heisenberg uncertainty principle dictates that the position and momentum of an object cannot both be precisely measured, giving rise to an apparent limitation called the Stan… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Report number: LIGO-P2400059

  24. arXiv:2404.13470  [pdf, other

    cs.DC cs.AI

    GWLZ: A Group-wise Learning-based Lossy Compression Framework for Scientific Data

    Authors: Wenqi Jia, Sian Jin, Jinzhen Wang, Wei Niu, Dingwen Tao, Miao Yin

    Abstract: The rapid expansion of computational capabilities and the ever-growing scale of modern HPC systems present formidable challenges in managing exascale scientific data. Faced with such vast datasets, traditional lossless compression techniques prove insufficient in reducing data size to a manageable level while preserving all information intact. In response, researchers have turned to error-bounded… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

  25. arXiv:2404.13003  [pdf, other

    physics.flu-dyn

    Deep reinforcement learning-based active flow control of an elliptical cylinder: transitioning from an elliptical cylinder to a circular cylinder and a flat plate

    Authors: Wang Jia, Hang Xu

    Abstract: We study the adaptability of deep reinforcement learning (DRL)-based active flow control (AFC) technology for bluff body flows with complex geometries. It is extended from a cylinder with Ar=1 to a flat elliptical cylinder with Ar=2, slender elliptical cylinders with Ar less than 1, and a flat plate with Ar=0. The robustness and adaptability of DRL-based control technology will be assessed under v… ▽ More

    Submitted 23 June, 2024; v1 submitted 19 April, 2024; originally announced April 2024.

  26. arXiv:2404.12553  [pdf, other

    stat.AP

    Assessing the Longitudinal Impact of Environmental Chemical Mixtures on Children's Neurodevelopment: A Bayesian Approach

    Authors: Wei Jia, Roman Jandarov

    Abstract: This manuscript presents a novel Bayesian varying coefficient quantile regression (BVCQR) model designed to assess the longitudinal effects of chemical exposure mixtures on children's neurodevelopment. Recognizing the complexity and high-dimensionality of environmental exposures, the proposed approach addresses critical gaps in existing research by offering a method that can manage the sparsity of… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  27. arXiv:2404.12123  [pdf, other

    physics.flu-dyn

    Robust and Adaptive Deep Reinforcement Learning for Enhancing Flow Control around a Square Cylinder with Varying Reynolds Numbers

    Authors: Wang Jia, Hang Xu

    Abstract: The present study applies a Deep Reinforcement Learning (DRL) algorithm to Active Flow Control (AFC) of a two-dimensional flow around a confined square cylinder. Specifically, the Soft Actor-Critic (SAC) algorithm is employed to modulate the flow of a pair of synthetic jets placed on the upper and lower surfaces of the confined squared cylinder in flow configurations characterized by Re of 100, 20… ▽ More

    Submitted 29 April, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

  28. arXiv:2404.10343  [pdf, other

    cs.CV eess.IV

    The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report

    Authors: Bin Ren, Yawei Li, Nancy Mehta, Radu Timofte, Hongyuan Yu, Cheng Wan, Yuxin Hong, Bingnan Han, Zhuoyuan Wu, Yajun Zou, Yuqing Liu, Jizhe Li, Keji He, Chao Fan, Heng Zhang, Xiaolin Zhang, Xuanwu Yin, Kunlong Zuo, Bohao Liao, Peizhe Xia, Long Peng, Zhibo Du, Xin Di, Wangkai Li, Yang Wang , et al. (109 additional authors not shown)

    Abstract: This paper provides a comprehensive review of the NTIRE 2024 challenge, focusing on efficient single-image super-resolution (ESR) solutions and their outcomes. The task of this challenge is to super-resolve an input image with a magnification factor of x4 based on pairs of low and corresponding high-resolution images. The primary objective is to develop networks that optimize various aspects such… ▽ More

    Submitted 25 June, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: The report paper of NTIRE2024 Efficient Super-resolution, accepted by CVPRW2024

  29. arXiv:2404.09790  [pdf, other

    cs.CV

    NTIRE 2024 Challenge on Image Super-Resolution ($\times$4): Methods and Results

    Authors: Zheng Chen, Zongwei Wu, Eduard Zamfir, Kai Zhang, Yulun Zhang, Radu Timofte, Xiaokang Yang, Hongyuan Yu, Cheng Wan, Yuxin Hong, Zhijuan Huang, Yajun Zou, Yuan Huang, Jiamin Lin, Bingnan Han, Xianyu Guan, Yongsheng Yu, Daoan Zhang, Xuanwu Yin, Kunlong Zuo, Jinhua Hao, Kai Zhao, Kun Yuan, Ming Sun, Chao Zhou , et al. (63 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2024 challenge on image super-resolution ($\times$4), highlighting the solutions proposed and the outcomes obtained. The challenge involves generating corresponding high-resolution (HR) images, magnified by a factor of four, from low-resolution (LR) inputs using prior information. The LR images originate from bicubic downsampling degradation. The aim of the challenge i… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: NTIRE 2024 webpage: https://cvlai.net/ntire/2024. Code: https://github.com/zhengchen1999/NTIRE2024_ImageSR_x4

  30. arXiv:2404.08965  [pdf, other

    cs.CV cs.MM

    Seeing Text in the Dark: Algorithm and Benchmark

    Authors: Chengpei Xu, Hao Fu, Long Ma, Wenjing Jia, Chengqi Zhang, Feng Xia, Xiaoyu Ai, Binghao Li, Wenjie Zhang

    Abstract: Localizing text in low-light environments is challenging due to visual degradations. Although a straightforward solution involves a two-stage pipeline with low-light image enhancement (LLE) as the initial step followed by detector, LLE is primarily designed for human vision instead of machine and can accumulate errors. In this work, we propose an efficient and effective single-stage approach for l… ▽ More

    Submitted 23 April, 2024; v1 submitted 13 April, 2024; originally announced April 2024.

  31. arXiv:2403.19125  [pdf, other

    cond-mat.mes-hall cond-mat.quant-gas cond-mat.supr-con

    Generic reduction theory for Fermi sea topology in metallic systems

    Authors: Wei Jia

    Abstract: Fermi sea in a metal can host exotic quantum topology, which determines its conductance quantization and is characterized by Euler characteristic $χ_F$. Unlike gapped band topology described by the global feature of wave function, this topology of gapless system is associated with the geometry of Fermi sea, and thus probing and identifying $χ_F$ are inherently difficult in higher-dimensional syste… ▽ More

    Submitted 27 April, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: 6 pages, 3 figures, 1 table

  32. arXiv:2403.11807  [pdf, other

    cs.AI cs.CL

    How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments

    Authors: Jen-tse Huang, Eric John Li, Man Ho Lam, Tian Liang, Wenxuan Wang, Youliang Yuan, Wenxiang Jiao, Xing Wang, Zhaopeng Tu, Michael R. Lyu

    Abstract: Decision-making, a complicated task requiring various types of abilities, presents an excellent framework for assessing Large Language Models (LLMs). Our research investigates LLMs' decision-making capabilities through the lens of a well-established field, Game Theory. We focus specifically on games that support the participation of more than two agents simultaneously. Subsequently, we introduce o… ▽ More

    Submitted 25 April, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: 16 pages of main text. 11 pages of appendices. 15 figures, 9 tables. Updated scoring scheme

  33. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  34. arXiv:2403.05227  [pdf, ps, other

    cond-mat.supr-con

    Superconductivity in kagome metal ThRu3Si2

    Authors: Yi Liu, Jing Li, Wu-Zhang Yang, Jia-Yi Lu, Bo-Ya Cao, Hua-Xun Li, Wan-Li Chai, Si-Qi Wu, Bai-Zhuo Li, Yun-Lei Sun, Wen-He Jiao, Wang Cao, Xiao-Feng Xu, Ren Zhi, Guang-Han Cao

    Abstract: We report the physical properties of ThRu$_3$Si$_2$ featured with distorted Ru kagome lattice. The combined experiments of resistivity, magnetization and specific heat reveal bulk superconductivity with $T_{\rm{c}}$ = 3.8 K. The specific heat jump and calculated electron-phonon coupling indicate a moderate coupled BCS superconductor. In comparison with LaRu$_3$Si$_2$, the calculated electronic str… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 7 pages, 5 figures

    Journal ref: Chinese Physics B (2024)

  35. arXiv:2403.04274  [pdf, other

    astro-ph.GA astro-ph.SR

    Relative alignment between gas structures and magnetic field in Orion A at different scales using different molecular gas tracers

    Authors: Wenyu Jiao, Ke Wang, Fengwei Xu, Chao Wang, Henrik Beuther

    Abstract: Context: Magnetic fields can play crucial roles in high-mass star formation. Nonetheless, the significance of magnetic fields at various scales and their relationship with gas structures is largely overlooked. Aims: Our goal is to examine the relationship between the magnetic field and molecular gas structures within the Orion A giant molecular cloud at different scales and density regimes. Method… ▽ More

    Submitted 19 June, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

    Comments: 12 pages, 8 figures, published in A&A

  36. arXiv:2403.03692  [pdf, ps, other

    math.CO

    Vertex-disjoint cycles of different lengths in tournaments

    Authors: Yandong Bai, Wenpei Jia

    Abstract: Bermond and Thomassen conjectured in 1981 that every digraph with minimum outdegree at least $2k-1$ contains $k$ vertex-disjoint cycles,here $k$ is a positive integer. Lichiardopol conjectured in 2014 that for every positive integer $k$ there exists an integer $g(k)$ such that every digraph with minimum outdegree at least $g(k)$ contains $k$ vertex-disjoint cycles of different lengths. Recently, C… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  37. arXiv:2403.03004  [pdf, other

    astro-ph.CO gr-qc hep-ph

    Ultralight vector dark matter search using data from the KAGRA O3GK run

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, H. Abe, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi , et al. (1778 additional authors not shown)

    Abstract: Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we prese… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 20 pages, 5 figures

    Report number: LIGO-P2300250

  38. arXiv:2403.01126  [pdf, other

    quant-ph cond-mat.mes-hall

    Single photon scattering from a chain of giant atoms coupled to a one-dimensional waveguide

    Authors: Y. P. Peng, W. Z. Jia

    Abstract: We investigate coherent single-photon transport in a waveguide quantum electrodynamics structure containing multiple giant atoms. The single-photon scattering amplitudes are solved using a real-space method. The results give rise to a clear picture of the multi-channel scattering process. In the case of identical and equally-spaced giant atoms in a separate configuration, we also use the transfer-… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

    Comments: 19 pages, 11 figures

    Journal ref: Phys. Rev. A 108, 043709 (2023)

  39. arXiv:2403.00316  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci cond-mat.supr-con

    Surface Chern-Simons theory for third-order topological insulators and superconductors

    Authors: Zhi-Hao Huang, Yi Tan, Wei Jia, Long Zhang, Xiong-Jun Liu

    Abstract: Three-dimensional 3rd-order topological insulators (TOTIs) and superconductors (TOTSCs), as the highestorder topological phases hosting zero corner modes in physical dimension, has sparked extensive research interest. However, such topological states have not been discovered in reality due to the lack of experimental schemes of realization. Here, we propose a novel surface Chern-Simons (CS) theory… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: 5+11 pages, 4+5 figures

  40. arXiv:2402.11515  [pdf, other

    cs.LG physics.flu-dyn

    Optimal Parallelization Strategies for Active Flow Control in Deep Reinforcement Learning-Based Computational Fluid Dynamics

    Authors: Wang Jia, Hang Xu

    Abstract: Deep Reinforcement Learning (DRL) has emerged as a promising approach for handling highly dynamic and nonlinear Active Flow Control (AFC) problems. However, the computational cost associated with training DRL models presents a significant performance bottleneck. To address this challenge and enable efficient scaling on high-performance computing architectures, this study focuses on optimizing DRL-… ▽ More

    Submitted 29 April, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

  41. arXiv:2402.11111  [pdf, other

    cs.CL

    Language Models as Science Tutors

    Authors: Alexis Chevalier, Jiayi Geng, Alexander Wettig, Howard Chen, Sebastian Mizera, Toni Annala, Max Jameson Aragon, Arturo Rodríguez Fanlo, Simon Frieder, Simon Machado, Akshara Prabhakar, Ellie Thieu, Jiachen T. Wang, Zirui Wang, Xindi Wu, Mengzhou Xia, Wenhan Jia, Jiatong Yu, Jun-Jie Zhu, Zhiyong Jason Ren, Sanjeev Arora, Danqi Chen

    Abstract: NLP has recently made exciting progress toward training language models (LMs) with strong scientific problem-solving skills. However, model development has not focused on real-life use-cases of LMs for science, including applications in education that require processing long scientific documents. To address this, we introduce TutorEval and TutorChat. TutorEval is a diverse question-answering bench… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: 8 pages without bibliography and appendix, 26 pages total

  42. arXiv:2402.08982  [pdf, other

    cs.LG cs.AI cs.NE

    MEL: Efficient Multi-Task Evolutionary Learning for High-Dimensional Feature Selection

    Authors: Xubin Wang, Haojiong Shangguan, Fengyi Huang, Shangrui Wu, Weijia Jia

    Abstract: Feature selection is a crucial step in data mining to enhance model performance by reducing data dimensionality. However, the increasing dimensionality of collected data exacerbates the challenge known as the "curse of dimensionality", where computation grows exponentially with the number of dimensions. To tackle this issue, evolutionary computational (EC) approaches have gained popularity due to… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

  43. arXiv:2402.07726  [pdf, other

    cs.CL

    Unsupervised Sign Language Translation and Generation

    Authors: Zhengsheng Guo, Zhiwei He, Wenxiang Jiao, Xing Wang, Rui Wang, Kehai Chen, Zhaopeng Tu, Yong Xu, Min Zhang

    Abstract: Motivated by the success of unsupervised neural machine translation (UNMT), we introduce an unsupervised sign language translation and generation network (USLNet), which learns from abundant single-modality (text and video) data without parallel sign language data. USLNet comprises two main components: single-modality reconstruction modules (text and video) that rebuild the input from its noisy ve… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

  44. arXiv:2401.13270  [pdf, other

    cs.CV cs.AI

    Audio-Infused Automatic Image Colorization by Exploiting Audio Scene Semantics

    Authors: Pengcheng Zhao, Yanxiang Chen, Yang Zhao, Wei Jia, Zhao Zhang, Ronggang Wang, Richang Hong

    Abstract: Automatic image colorization is inherently an ill-posed problem with uncertainty, which requires an accurate semantic understanding of scenes to estimate reasonable colors for grayscale images. Although recent interaction-based methods have achieved impressive performance, it is still a very difficult task to infer realistic and accurate colors for automatic colorization. To reduce the difficulty… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

  45. arXiv:2401.12873  [pdf, other

    cs.CL cs.AI

    Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model

    Authors: Zhiwei He, Xing Wang, Wenxiang Jiao, Zhuosheng Zhang, Rui Wang, Shuming Shi, Zhaopeng Tu

    Abstract: Insufficient modeling of human preferences within the reward model is a major obstacle for leveraging human feedback to improve translation quality. Fortunately, quality estimation (QE), which predicts the quality of a given translation without reference, has achieved impressive alignment with human evaluations in the last two years. In this work, we investigate the potential of employing the QE m… ▽ More

    Submitted 18 March, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Comments: NAACL 2024

  46. arXiv:2401.05695  [pdf, other

    cs.CL

    Integrating Physician Diagnostic Logic into Large Language Models: Preference Learning from Process Feedback

    Authors: Chengfeng Dou, Zhi Jin, Wenpin Jiao, Haiyan Zhao, Yongqiang Zhao, Zhenwei Tao

    Abstract: The use of large language models in medical dialogue generation has garnered significant attention, with a focus on improving response quality and fluency. While previous studies have made progress in optimizing model performance for single-round medical Q&A tasks, there is a need to enhance the model's capability for multi-round conversations to avoid logical inconsistencies. To address this, we… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

  47. arXiv:2401.04322  [pdf, other

    astro-ph.GA astro-ph.SR

    The ALMA-QUARKS survey: Detection of two extremely dense substructures in a massive prestellar core

    Authors: Xiaofeng Mai, Tie Liu, Xunchuan Liu, Lei Zhu, Guido Garay, Paul F. Goldsmith, Mika Juvela, Hongli Liu, Emma Mannfors, Emma Mannfors, Anandmayee Tej, Patricio Sanhueza, Shanghuo Li, Fengwei Xu, Enrique Vazquez Semadeni, Wenyu Jiao, Yaping Peng, T. Baug, Aiyuan Yang, Lokesh Dewangan, Leonardo Bronfman, Gilberto C. Gómez, Aina Palau, Chang Won Lee, Sheng-Li Qin , et al. (11 additional authors not shown)

    Abstract: Only a handful of massive starless core candidates have been discovered so far, but none of them have been fully confirmed. Within the MM1 clump in the filamentary infrared dark cloud G34.43+0.24 that was covered by the ALMA-ATOMS survey at Band 3 ($\sim2\arcsec$, 6000\,au) and the ALMA-QUARKS survey at Band 6 ($\sim 0.3\arcsec$, 900\,au), two prestellar core candidates MM1-C and E1 with masses of… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: 12 pages, 6 figures

  48. arXiv:2401.00761  [pdf, other

    cs.SE cs.AI cs.CL

    The Earth is Flat? Unveiling Factual Errors in Large Language Models

    Authors: Wenxuan Wang, Juluan Shi, Zhaopeng Tu, Youliang Yuan, Jen-tse Huang, Wenxiang Jiao, Michael R. Lyu

    Abstract: Large Language Models (LLMs) like ChatGPT are foundational in various applications due to their extensive knowledge from pre-training and fine-tuning. Despite this, they are prone to generating factual and commonsense errors, raising concerns in critical areas like healthcare, journalism, and education to mislead users. Current methods for evaluating LLMs' veracity are limited by test data leakage… ▽ More

    Submitted 1 January, 2024; originally announced January 2024.

  49. arXiv:2401.00757  [pdf, other

    cs.SE cs.AI cs.CL cs.LO

    A & B == B & A: Triggering Logical Reasoning Failures in Large Language Models

    Authors: Yuxuan Wan, Wenxuan Wang, Yiliu Yang, Youliang Yuan, Jen-tse Huang, Pinjia He, Wenxiang Jiao, Michael R. Lyu

    Abstract: Recent advancements in large language models (LLMs) have propelled Artificial Intelligence (AI) to new heights, enabling breakthroughs in various tasks such as writing assistance, code generation, and machine translation. A significant distinction of advanced LLMs, such as ChatGPT, is their demonstrated ability to "reason." However, evaluating the reasoning ability of LLMs remains a challenge as m… ▽ More

    Submitted 1 January, 2024; originally announced January 2024.

  50. arXiv:2312.15492  [pdf, other

    physics.chem-ph cond-mat.mtrl-sci physics.comp-ph

    DPA-2: Towards a universal large atomic model for molecular and material simulation

    Authors: Duo Zhang, Xinzijian Liu, Xiangyu Zhang, Chengqian Zhang, Chun Cai, Hangrui Bi, Yiming Du, Xuejian Qin, Jiameng Huang, Bowen Li, Yifan Shan, Jinzhe Zeng, Yuzhi Zhang, Siyuan Liu, Yifan Li, Junhan Chang, Xinyan Wang, Shuo Zhou, Jianchuan Liu, Xiaoshan Luo, Zhenyu Wang, Wanrun Jiang, Jing Wu, Yudi Yang, Jiyuan Yang , et al. (17 additional authors not shown)

    Abstract: The rapid development of artificial intelligence (AI) is driving significant changes in the field of atomic modeling, simulation, and design. AI-based potential energy models have been successfully used to perform large-scale and long-time simulations with the accuracy of ab initio electronic structure methods. However, the model generation process still hinders applications at scale. We envision… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.