Showing 101–150 of 1,185 results for author: He, S

  1. Data Cubes in Hand: A Design Space of Tangible Cubes for Visualizing 3D Spatio-Temporal Data in Mixed Reality

    Authors: Shuqi He, Haonan Yao, Luyan Jiang, Kaiwen Li, Nan Xiang, Yue Li, Hai-Ning Liang, Lingyun Yu

    Abstract: Tangible interfaces in mixed reality (MR) environments allow for intuitive data interactions. Tangible cubes, with their rich interaction affordances, high maneuverability, and stable structure, are particularly well-suited for exploring multi-dimensional data types. However, the design potential of these cubes is underexplored. This study introduces a design space for tangible cubes in MR, focusi…

    Submitted 11 March, 2024; originally announced March 2024.

  2. arXiv:2403.05875  [pdf, other]

    hep-th cond-mat.dis-nn

    Detecting quantum chaos via pseudo-entropy and negativity

    Authors: Song He, Pak Hang Chris Lau, Long Zhao

    Abstract: Quantum informatic quantities such as entanglement entropy are useful in detecting quantum phase transitions. Recently, a new entanglement measure called pseudo-entropy was proposed which is a generalization of the more well-known entanglement entropy. It has many nice properties and is useful in the study of post-selection measurements. In this paper, one of our goals is to explore the properties…

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: 31 pages, 12 figures

  3. arXiv:2403.05789  [pdf, other]

    cs.CL cs.AI

    ItD: Large Language Models Can Teach Themselves Induction through Deduction

    Authors: Wangtao Sun, Haotian Xu, Xuanqing Yu, Pei Chen, Shizhu He, Jun Zhao, Kang Liu

    Abstract: Although Large Language Models (LLMs) are showing impressive performance on a wide range of Natural Language Processing tasks, researchers have found that they still have limited ability to conduct induction. Recent works mainly adopt "post processes" paradigms to improve the performance of LLMs on induction (e.g., the hypothesis search & refinement methods), but their performance is still const…

    Submitted 8 March, 2024; originally announced March 2024.

  4. arXiv:2403.05130  [pdf, other]

    cs.AI

    From Chain to Tree: Refining Chain-like Rules into Tree-like Rules on Knowledge Graphs

    Authors: Wangtao Sun, Shizhu He, Jun Zhao, Kang Liu

    Abstract: With good explanatory power and controllability, rule-based methods play an important role in many tasks such as knowledge reasoning and decision support. However, existing studies primarily focused on learning chain-like rules, which limit their semantic expressions and accurate prediction abilities. As a result, chain-like rules usually fire on the incorrect grounding values, producing inaccurat…

    Submitted 8 March, 2024; originally announced March 2024.

  5. arXiv:2403.04651  [pdf, other]

    cs.PL

    Cedar: A New Language for Expressive, Fast, Safe, and Analyzable Authorization (Extended Version)

    Authors: Joseph W. Cutler, Craig Disselkoen, Aaron Eline, Shaobo He, Kyle Headley, Michael Hicks, Kesha Hietala, Eleftherios Ioannidis, John Kastner, Anwar Mamat, Darin McAdams, Matt McCutchen, Neha Rungta, Emina Torlak, Andrew Wells

    Abstract: Cedar is a new authorization policy language designed to be ergonomic, fast, safe, and analyzable. Rather than embed authorization logic in an application's code, developers can write that logic as Cedar policies and delegate access decisions to Cedar's evaluation engine. Cedar's simple and intuitive syntax supports common authorization use-cases with readable policies, naturally leveraging concep…

    Submitted 8 March, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

  6. arXiv:2403.00127  [pdf]

    cs.CL cs.CY cs.HC

    Prompting ChatGPT for Translation: A Comparative Analysis of Translation Brief and Persona Prompts

    Authors: Sui He

    Abstract: Prompt engineering has shown potential for improving translation quality in LLMs. However, the possibility of using translation concepts in prompt design remains largely underexplored. Against this backdrop, the current paper discusses the effectiveness of incorporating the conceptual tool of translation brief and the personas of translator and author into prompt design for translation tasks in Ch…

    Submitted 28 April, 2024; v1 submitted 29 February, 2024; originally announced March 2024.

  7. arXiv:2402.15986  [pdf, other]

    physics.atom-ph physics.chem-ph quant-ph

    Photoassociation of multiple cold molecules in a dipole trap

    Authors: Li Li, Yi-Jia Liu, Xiao-Long Zhou, Ze-Min Shen, Si-Jian He, Zhao-Di Liu, Jian Wang

    Abstract: The generation of cold molecules is a core topic in the field of cold atoms and molecules, which has advanced relevant research like ultracold chemistry, quantum computation, and quantum metrology. With high atomic phase space density, optical dipole trap has been widely performed to prepare and trap cold molecules, and can also be further developed for multiple cold molecule formation and dynamic…

    Submitted 24 February, 2024; originally announced February 2024.

    Comments: 6 pages, 5 figures

  8. arXiv:2402.15627  [pdf, other]

    cs.LG cs.DC

    MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs

    Authors: Ziheng Jiang, Haibin Lin, Yinmin Zhong, Qi Huang, Yangrui Chen, Zhi Zhang, Yanghua Peng, Xiang Li, Cong Xie, Shibiao Nong, Yulu Jia, Sun He, Hongmin Chen, Zhihao Bai, Qi Hou, Shipeng Yan, Ding Zhou, Yiyao Sheng, Zhuo Jiang, Haohan Xu, Haoran Wei, Zhang Zhang, Pengfei Nie, Leqi Zou, Sida Zhao, et al. (7 additional authors not shown)

    Abstract: We present the design, implementation and engineering experience in building and deploying MegaScale, a production system for training large language models (LLMs) at the scale of more than 10,000 GPUs. Training LLMs at this scale brings unprecedented challenges to training efficiency and stability. We take a full-stack approach that co-designs the algorithmic and system components across model bl…

    Submitted 23 February, 2024; originally announced February 2024.

  9. arXiv:2402.14225  [pdf, other]

    eess.AS cs.SD

    SICRN: Advancing Speech Enhancement through State Space Model and Inplace Convolution Techniques

    Authors: Changjiang Zhao, Shulin He, Xueliang Zhang

    Abstract: Speech enhancement aims to improve speech quality and intelligibility, especially in noisy environments where background noise degrades speech signals. Currently, deep learning methods achieve great success in speech enhancement, e.g. the representative convolutional recurrent neural network (CRN) and its variants. However, CRN typically employs consecutive downsampling and upsampling convolution…

    Submitted 21 February, 2024; originally announced February 2024.

  10. arXiv:2402.13430  [pdf, other]

    cs.LG cs.AI cs.SI

    LinkSAGE: Optimizing Job Matching Using Graph Neural Networks

    Authors: Ping Liu, Haichao Wei, Xiaochen Hou, Jianqiang Shen, Shihai He, Kay Qianqi Shen, Zhujun Chen, Fedor Borisyuk, Daniel Hewlett, Liang Wu, Srikant Veeraraghavan, Alex Tsun, Chengming Jiang, Wenjing Zhang

    Abstract: We present LinkSAGE, an innovative framework that integrates Graph Neural Networks (GNNs) into large-scale personalized job matching systems, designed to address the complex dynamics of LinkedIn's extensive professional network. Our approach capitalizes on a novel job marketplace graph, the largest and most intricate of its kind in industry, with billions of nodes and edges. This graph is not merel…

    Submitted 20 February, 2024; originally announced February 2024.

  11. arXiv:2402.12851  [pdf, other]

    cs.CL

    MoELoRA: Contrastive Learning Guided Mixture of Experts on Parameter-Efficient Fine-Tuning for Large Language Models

    Authors: Tongxu Luo, Jiahe Lei, Fangyu Lei, Weihao Liu, Shizhu He, Jun Zhao, Kang Liu

    Abstract: Fine-tuning is often necessary to enhance the adaptability of Large Language Models (LLM) to downstream tasks. Nonetheless, the process of updating billions of parameters demands significant computational resources and training time, which poses a substantial obstacle to the widespread application of large-scale models in various scenarios. To address this issue, Parameter-Efficient Fine-Tuning (P…

    Submitted 20 February, 2024; originally announced February 2024.

  12. arXiv:2402.12271  [pdf, other]

    cs.DC cs.LG

    Secure Federated Learning Across Heterogeneous Cloud and High-Performance Computing Resources -- A Case Study on Federated Fine-tuning of LLaMA 2

    Authors: Zilinghan Li, Shilan He, Pranshu Chaturvedi, Volodymyr Kindratenko, Eliu A Huerta, Kibaek Kim, Ravi Madduri

    Abstract: Federated learning enables multiple data owners to collaboratively train robust machine learning models without transferring large or sensitive local datasets by only sharing the parameters of the locally trained models. In this paper, we elaborate on the design of our Advanced Privacy-Preserving Federated Learning (APPFL) framework, which streamlines end-to-end secure and reliable federated learn…

    Submitted 19 February, 2024; originally announced February 2024.

  13. arXiv:2402.12219  [pdf, other]

    cs.CL cs.AI cs.LG

    Reformatted Alignment

    Authors: Run-Ze Fan, Xuefeng Li, Haoyang Zou, Junlong Li, Shwai He, Ethan Chern, Jiewen Hu, Pengfei Liu

    Abstract: The quality of finetuning data is crucial for aligning large language models (LLMs) with human values. Current methods to improve data quality are either labor-intensive or prone to factual errors caused by LLM hallucinations. This paper explores elevating the quality of existing instruction data to better align with human values, introducing a simple and effective approach named ReAlign, which re…

    Submitted 17 April, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: Homepage: https://gair-nlp.github.io/ReAlign/

  14. arXiv:2402.12099   

    cs.CV

    Human Video Translation via Query Warping

    Authors: Haiming Zhu, Yangyang Xu, Shengfeng He

    Abstract: In this paper, we present QueryWarp, a novel framework for temporally coherent human motion video translation. Existing diffusion-based video editing approaches rely solely on key and value tokens to ensure temporal consistency, which sacrifices the preservation of local and structural regions. In contrast, we aim to consider complementary query priors by constructing the temporal correlations…

    Submitted 21 May, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: This is not a complete paper and the methods and results have not been updated. We decided to withdraw and make further improvements

  15. arXiv:2402.11139  [pdf, other]

    cs.LG cs.AI

    LiGNN: Graph Neural Networks at LinkedIn

    Authors: Fedor Borisyuk, Shihai He, Yunbo Ouyang, Morteza Ramezani, Peng Du, Xiaochen Hou, Chengming Jiang, Nitin Pasumarthy, Priya Bannur, Birjodh Tiwana, Ping Liu, Siddharth Dangi, Daqi Sun, Zhoutao Pei, Xiao Shi, Sirou Zhu, Qianqi Shen, Kuang-Hsuan Lee, David Stein, Baolei Li, Haichao Wei, Amol Ghoting, Souvik Ghosh

    Abstract: In this paper, we present LiGNN, a deployed large-scale Graph Neural Networks (GNNs) Framework. We share our insight on the development and deployment of GNNs at large scale at LinkedIn. We present a set of algorithmic improvements to the quality of GNN representation learning including temporal graph architectures with long term losses, effective cold start solutions via graph densification, ID embedd…

    Submitted 16 February, 2024; originally announced February 2024.

  16. arXiv:2402.10940  [pdf, ps, other]

    cs.CL cs.AI cs.LG

    Neural machine translation of clinical procedure codes for medical diagnosis and uncertainty quantification

    Authors: Pei-Hung Chung, Shuhan He, Norawit Kijpaisalratana, Abdel-badih el Ariss, Byung-Jun Yoon

    Abstract: A Clinical Decision Support System (CDSS) is designed to enhance clinician decision-making by combining system-generated recommendations with medical expertise. Given the high costs, intensive labor, and time-sensitive nature of medical treatments, there is a pressing need for efficient decision support, especially in complex emergency scenarios. In these scenarios, where information can be limite…

    Submitted 7 February, 2024; originally announced February 2024.

  17. arXiv:2402.10464  [pdf, other]

    cs.LG cs.NI

    FedKit: Enabling Cross-Platform Federated Learning for Android and iOS

    Authors: Sichang He, Beilong Tang, Boyan Zhang, Jiaoqi Shao, Xiaomin Ouyang, Daniel Nata Nugraha, Bing Luo

    Abstract: We present FedKit, a federated learning (FL) system tailored for cross-platform FL research on Android and iOS devices. FedKit pipelines cross-platform FL development by enabling model conversion, hardware-accelerated training, and cross-platform model aggregation. Our FL workflow supports flexible machine learning operations (MLOps) in production, facilitating continuous model delivery and traini…

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: This work has been accepted for demonstration at the IEEE International Conference on Computer Communications (INFOCOM) 2024

  18. arXiv:2402.10151  [pdf, other]

    cs.CL

    ControlLM: Crafting Diverse Personalities for Language Models

    Authors: Yixuan Weng, Shizhu He, Kang Liu, Shengping Liu, Jun Zhao

    Abstract: As language models continue to scale in size and capability, they display an array of emerging behaviors, both beneficial and concerning. This heightens the need to control model behaviors. We hope to be able to control the personality traits of language models at the inference-time so as to have various character features, on top of which the requirements of different types of tasks can be met. P…

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: 17 pages

  19. arXiv:2402.10110  [pdf, other]

    cs.CL cs.AI cs.LG

    Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning

    Authors: Ming Li, Lichang Chen, Jiuhai Chen, Shwai He, Jiuxiang Gu, Tianyi Zhou

    Abstract: Instruction tuning is critical to large language models (LLMs) for achieving better instruction following and task adaptation capabilities but its success heavily relies on the training data quality. Many recent methods focus on improving the data quality but often overlook the compatibility of the data with the student model being finetuned. This paper introduces Selective Reflection-Tuning, a no…

    Submitted 7 June, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: ACL2024 (findings), Camera-ready

  20. arXiv:2402.07939  [pdf, other]

    cs.HC cs.AI cs.CL

    UFO: A UI-Focused Agent for Windows OS Interaction

    Authors: Chaoyun Zhang, Liqun Li, Shilin He, Xu Zhang, Bo Qiao, Si Qin, Minghua Ma, Yu Kang, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang

    Abstract: We introduce UFO, an innovative UI-Focused agent to fulfill user requests tailored to applications on Windows OS, harnessing the capabilities of GPT-Vision. UFO employs a dual-agent framework to meticulously observe and analyze the graphical user interface (GUI) and control information of Windows applications. This enables the agent to seamlessly navigate and operate within individual applications…

    Submitted 23 May, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

  21. arXiv:2402.05798  [pdf, other]

    cs.HC

    Visual Harmony: Text-Visual Interplay in Circular Infographics

    Authors: Shuqi He, Yuqing Chen, Yuxin Xia, Yichun Li, Hai-Ning Liang, Lingyun Yu

    Abstract: Infographics are visual representations designed for efficient and effective communication of data and knowledge. One crucial aspect of infographic design is the interplay between text and visual elements, particularly in circular visualizations where the textual descriptions can either be embedded within the graphics or placed adjacent to the visual representation. While several studies have exam…

    Submitted 8 February, 2024; originally announced February 2024.

  22. arXiv:2402.05438  [pdf, other]

    math.ST stat.ME

    Penalized spline estimation of principal components for sparse functional data: rates of convergence

    Authors: Shiyuan He, Jianhua Z. Huang, Kejun He

    Abstract: This paper gives a comprehensive treatment of the convergence rates of penalized spline estimators for simultaneously estimating several leading principal component functions, when the functional data is sparsely observed. The penalized spline estimators are defined as the solution of a penalized empirical risk minimization problem, where the loss function belongs to a general class of loss functi…

    Submitted 8 February, 2024; originally announced February 2024.

  23. arXiv:2402.01723  [pdf, other]

    cs.CL cs.AI

    An Empirical Study on Large Language Models in Accuracy and Robustness under Chinese Industrial Scenarios

    Authors: Zongjie Li, Wenying Qiu, Pingchuan Ma, Yichen Li, You Li, Sijia He, Baozheng Jiang, Shuai Wang, Weixi Gu

    Abstract: Recent years have witnessed the rapid development of large language models (LLMs) in various domains. To better serve the large number of Chinese users, many commercial vendors in China have adopted localization strategies, training and providing local LLMs specifically customized for Chinese users. Furthermore, looking ahead, one of the key future applications of LLMs will be practical deployment…

    Submitted 26 January, 2024; originally announced February 2024.

  24. arXiv:2402.00530  [pdf, other]

    cs.CL

    Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning

    Authors: Ming Li, Yong Zhang, Shwai He, Zhitao Li, Hongyu Zhao, Jianzong Wang, Ning Cheng, Tianyi Zhou

    Abstract: Instruction tuning is critical to improve LLMs but usually suffers from low-quality and redundant data. Data filtering for instruction tuning has proved important in improving both the efficiency and performance of the tuning process. But it also leads to extra cost and computation due to the involvement of LLMs in this process. To reduce the filtering cost, we study Superfiltering: Can we use a s…

    Submitted 7 June, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: ACL2024 main, Camera-ready

  25. arXiv:2402.00268  [pdf, other]

    hep-th cond-mat.stat-mech quant-ph

    Relation between timelike and spacelike entanglement entropy

    Authors: Wu-zhong Guo, Song He, Yu-Xuan Zhang

    Abstract: In this study, we establish a connection between timelike and spacelike entanglement entropy. Specifically, for a diverse range of states, the timelike entanglement entropy is uniquely determined by a linear combination of the spacelike entanglement entropy and its first-order temporal derivative. This framework reveals that the imaginary component of the timelike entanglement entropy primarily or…

    Submitted 31 January, 2024; originally announced February 2024.

    Comments: 5+8 pages, 1 figure

  26. arXiv:2401.15852  [pdf, ps, other]

    math.AG math.CV

    The Spectral base and quotients of bounded symmetric domains

    Authors: Siqi He, Jie Liu, Ngaiming Mok

    Abstract: In this article, we explore Higgs bundles on a projective manifold $X$, focusing on their spectral bases, a concept introduced by T.Chen and B.Ngô. The spectral base is a specific closed subscheme within the space of symmetric differentials. We observe that if the spectral base vanishes, then any reductive representation $ρ: π_1(X) \to \text{GL}_r(\mathbb{C})$ is both rigid and integral. Additiona…

    Submitted 28 January, 2024; originally announced January 2024.

    Comments: 21 pages

    MSC Class: 14J60; 53C35

  27. arXiv:2401.15123  [pdf, other]

    cs.LG cs.AI

    Large Language Model Guided Knowledge Distillation for Time Series Anomaly Detection

    Authors: Chen Liu, Shibo He, Qihang Zhou, Shizhong Li, Wenchao Meng

    Abstract: Self-supervised methods have gained prominence in time series anomaly detection due to the scarcity of available annotations. Nevertheless, they typically demand extensive training data to acquire a generalizable representation map, which conflicts with scenarios of a few available samples, thereby limiting their performance. To overcome the limitation, we propose AnomalyLLM, a knowledge…

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: 12 pages, 5 figures

  28. arXiv:2401.13714  [pdf, other]

    cs.CV cs.LG

    Value-Driven Mixed-Precision Quantization for Patch-Based Inference on Microcontrollers

    Authors: Wei Tao, Shenglin He, Kai Lu, Xiaoyang Qu, Guokuan Li, Jiguang Wan, Jianzong Wang, Jing Xiao

    Abstract: Deploying neural networks on microcontroller units (MCUs) presents substantial challenges due to their constrained computation and memory resources. Previous research has explored patch-based inference as a strategy to conserve memory without sacrificing model accuracy. However, this technique suffers from severe redundant computation overhead, leading to a substantial increase in execution lat…

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: Accepted by the 27th Design, Automation and Test in Europe Conference (DATE 2024)

  29. arXiv:2401.11235  [pdf, other]

    cs.LG cs.AI

    TreeMIL: A Multi-instance Learning Framework for Time Series Anomaly Detection with Inexact Supervision

    Authors: Chen Liu, Shibo He, Haoyu Liu, Shizhong Li

    Abstract: Time series anomaly detection (TSAD) plays a vital role in various domains such as healthcare, networks, and industry. Considering labels are crucial for detection but difficult to obtain, we turn to TSAD with inexact supervision: only series-level labels are provided during the training phase, while point-level anomalies are predicted during the testing phase. Previous works follow a traditional…

    Submitted 20 January, 2024; originally announced January 2024.

    Comments: This paper has been accepted by IEEE ICASSP 2024

  30. Irrelevant and marginal deformed BMS field theories

    Authors: Song He, Xin-Cheng Mao

    Abstract: In this study, we investigate various deformations within the framework of Bondi-van der Burg-Metzner-Sachs invariant field theory (BMSFT). Specifically, we explore the impact of Bondi-van der Burg-Metzner-Sachs (BMS) symmetry on the theory by introducing key deformations, namely, $T \overline{T}$, $JT_μ$, and $\sqrt{T \overline{T}}$ deformations. In the context of generic seed theories possessing…

    Submitted 27 March, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: 54 pages, 0 figure

    Journal ref: JHEP 04 (2024) 138

  31. arXiv:2401.05483  [pdf, other]

    hep-th hep-ph

    NLSM $\subset$ Tr$(φ^3)$

    Authors: Nima Arkani-Hamed, Qu Cao, Jin Dong, Carolina Figueiredo, Song He

    Abstract: Scattering amplitudes for the simplest theory of colored scalar particles - the Tr($Φ^3$) theory - have recently been the subject of active investigations. In this letter we describe an unanticipated wider implication of this work: the Tr($Φ^3$) theory secretly contains Non-linear Sigma Model (NLSM) amplitudes to all loop orders. The NLSM amplitudes are obtained from Tr$(Φ^3)$ amplitudes by a uniq…

    Submitted 15 April, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: 10 pages, 13 figures. Addition of new material including a derivation of the results from a simple Lagrangian, identifying the symmetry breaking pattern, as well as some further discussions

  32. arXiv:2401.04723  [pdf, other]

    stat.ME

    Spatio-temporal data fusion for the analysis of in situ and remote sensing data using the INLA-SPDE approach

    Authors: Shiyu He, Samuel W. K. Wong

    Abstract: We propose a Bayesian hierarchical model to address the challenge of spatial misalignment in spatio-temporal data obtained from in situ and satellite sources. The model is fit using the INLA-SPDE approach, which provides efficient computation. Our methodology combines the different data sources in a "fusion" model via the construction of projection matrices in both spatial and temporal domains. T…

    Submitted 9 January, 2024; originally announced January 2024.

    Comments: 23 pages, 7 figures

  33. arXiv:2401.02880  [pdf, other]

    cs.CR

    Lotto: Secure Participant Selection against Adversarial Servers in Federated Learning

    Authors: Zhifeng Jiang, Peng Ye, Shiqi He, Wei Wang, Ruichuan Chen, Bo Li

    Abstract: In Federated Learning (FL), common privacy-enhancing techniques, such as secure aggregation and distributed differential privacy, rely on the critical assumption of an honest majority among participants to withstand various attacks. In practice, however, servers are not always trusted, and an adversarial server can strategically select compromised clients to create a dishonest majority, thereby un…

    Submitted 6 March, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

    Comments: This article has been accepted to USENIX Security '24

  34. arXiv:2401.01718  [pdf]

    physics.plasm-ph physics.atom-ph physics.comp-ph physics.flu-dyn physics.optics

    RHDLPP: A multigroup radiation hydrodynamics code for laser-produced plasmas

    Authors: Qi Min, Ziyang Xu, Siqi He, Haidong Lu, Xingbang Liu, Ruizi Shen, Yanhong Wu, Qikun Pan, Chongxiao Zhao, Fei Chen, Maogen Su, Chenzhong Dong

    Abstract: We introduce the RHDLPP, a flux-limited multigroup radiation hydrodynamics numerical code designed for simulating laser-produced plasmas in diverse environments. The code bifurcates into two packages: RHDLPP-LTP for low-temperature plasmas generated by moderate-intensity nanosecond lasers, and RHDLPP-HTP for high-temperature, high-density plasmas formed by high-intensity laser pulses. The core rad…

    Submitted 3 January, 2024; originally announced January 2024.

  35. arXiv:2401.00667  [pdf, other]

    stat.ME stat.CO

    Channelling Multimodality Through a Unimodalizing Transport: Warp-U Sampler and Stochastic Bridge Sampling

    Authors: Fei Ding, David E. Jones, Shiyuan He, Xiao-Li Meng

    Abstract: Monte Carlo integration is fundamental in scientific and statistical computation, but requires reliable samples from the target distribution, which poses a substantial challenge in the case of multi-modal distributions. Existing methods often involve time-consuming tuning, and typically lack tailored estimators for efficient use of the samples. This paper adapts the Warp-U transformation [Wang et…

    Submitted 1 January, 2024; originally announced January 2024.

  36. arXiv:2401.00041  [pdf, other]

    hep-th hep-ph

    Scalar-Scaffolded Gluons and the Combinatorial Origins of Yang-Mills Theory

    Authors: Nima Arkani-Hamed, Qu Cao, Jin Dong, Carolina Figueiredo, Song He

    Abstract: We present a new formulation for Yang-Mills scattering amplitudes in any number of dimensions and at any loop order, based on the same combinatorial and binary-geometric ideas in kinematic space recently used to give an all-order description of Tr $φ^3$ theory. We propose that in a precise sense the amplitudes for a suitably "stringy" form of these two theories are identical, up to a simple shift…

    Submitted 29 December, 2023; originally announced January 2024.

    Comments: 92 pages, 37 figures

  37. arXiv:2312.17591  [pdf, other]

    cs.CL

    Towards Faithful Explanations for Text Classification with Robustness Improvement and Explanation Guided Training

    Authors: Dongfang Li, Baotian Hu, Qingcai Chen, Shan He

    Abstract: Feature attribution methods highlight the important input tokens as explanations to model predictions, which have been widely applied to deep neural networks towards trustworthy AI. However, recent works show that explanations provided by these methods face challenges of being faithful and robust. In this paper, we propose a method with Robustness improvement and Explanation Guided training toward…

    Submitted 29 December, 2023; originally announced December 2023.

  38. arXiv:2312.16282  [pdf, other]

    hep-th hep-ph

    Hidden zeros for particle/string amplitudes and the unity of colored scalars, pions and gluons

    Authors: Nima Arkani-Hamed, Qu Cao, Jin Dong, Carolina Figueiredo, Song He

    Abstract: Recent years have seen the emergence of a new understanding of scattering amplitudes in the simplest theory of colored scalar particles - the Tr$(φ^3)$ theory - based on combinatorial and geometric ideas in the kinematic space of scattering data. In this paper we report a surprise: far from the toy model it appears to be, the "stringy" Tr$(φ^3)$ amplitudes secretly contain the scattering amplitu…

    Submitted 1 May, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

    Comments: Added reference to early work of Gliozzi et al. giving a different derivation of zeros for string amplitudes from monodromy relations, corrected typos

  39. arXiv:2312.16218  [pdf, other]

    cs.CV

    Hyper-VolTran: Fast and Generalizable One-Shot Image to 3D Object Structure via HyperNetworks

    Authors: Christian Simon, Sen He, Juan-Manuel Perez-Rua, Mengmeng Xu, Amine Benhalloum, Tao Xiang

    Abstract: Solving image-to-3D from a single view is an ill-posed problem, and current neural reconstruction methods addressing it through diffusion models still rely on scene-specific optimization, constraining their generalization capability. To overcome the limitations of existing approaches regarding generalization and consistency, we introduce a novel neural rendering technique. Our approach employs the…

    Submitted 5 January, 2024; v1 submitted 24 December, 2023; originally announced December 2023.

  40. arXiv:2312.15633  [pdf, other]

    cs.CV eess.IV

    MuLA-GAN: Multi-Level Attention GAN for Enhanced Underwater Visibility

    Authors: Ahsan Baidar Bakht, Zikai Jia, Muhayy ud Din, Waseem Akram, Lyes Saad Soud, Lakmal Seneviratne, Defu Lin, Shaoming He, Irfan Hussain

    Abstract: The underwater environment presents unique challenges, including color distortions, reduced contrast, and blurriness, hindering accurate analysis. In this work, we introduce MuLA-GAN, a novel approach that leverages the synergistic power of Generative Adversarial Networks (GANs) and Multi-Level Attention mechanisms for comprehensive underwater image enhancement. The integration of Multi-Level Atte…

    Submitted 25 December, 2023; originally announced December 2023.

  41. arXiv:2312.15484  [pdf, other]

    hep-th

    On constructibility of AdS supergluon amplitudes

    Authors: Qu Cao, Song He, Yichao Tang

    Abstract: We prove that all tree-level $n$-point supergluon (scalar) amplitudes in AdS$_5$ can be recursively constructed, using factorization and flat-space limit. Our method is greatly facilitated by a natural R-symmetry basis for planar color-ordered amplitudes, which reduces the latter to "partial amplitudes" with simpler pole structures and factorization properties. Given the $n$-point scalar amplitude…

    Submitted 14 January, 2024; v1 submitted 24 December, 2023; originally announced December 2023.

    Comments: 5 pages, 4 figures, major revision from v2 including new ancillary file

  42. arXiv:2312.13875  [pdf, other]

    stat.ML cs.LG stat.ME

    Best Arm Identification in Batched Multi-armed Bandit Problems

    Authors: Shengyu Cao, Simai He, Ruoqing Jiang, Jin Xu, Hongsong Yuan

    Abstract: Recently, the multi-armed bandit problem has arisen in many real-life scenarios where arms must be sampled in batches, due to the limited time the agent can wait for feedback. Such applications include biological experimentation and online marketing. The problem is further complicated when the number of arms is large and the number of batches is small. We consider pure exploration in a batched multi-armed…

    Submitted 21 December, 2023; originally announced December 2023.

  43. arXiv:2312.11988  [pdf, other]

    cs.SE cs.AI cs.PL

    Xpert: Empowering Incident Management with Query Recommendations via Large Language Models

    Authors: Yuxuan Jiang, Chaoyun Zhang, Shilin He, Zhihao Yang, Minghua Ma, Si Qin, Yu Kang, Yingnong Dang, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang

    Abstract: Large-scale cloud systems play a pivotal role in modern IT infrastructure. However, incidents occurring within these systems can lead to service disruptions and adversely affect user experience. To swiftly resolve such incidents, on-call engineers depend on crafting domain-specific language (DSL) queries to analyze telemetry data. However, writing these queries can be challenging and time-consumin…

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: Accepted as a research paper at ICSE 2024

  44. arXiv:2312.11549  [pdf, other]

    cs.LG

    Label-Free Multivariate Time Series Anomaly Detection

    Authors: Qihang Zhou, Shibo He, Haoyu Liu, Jiming Chen, Wenchao Meng

    Abstract: Anomaly detection in multivariate time series (MTS) has been widely studied in the one-class classification (OCC) setting. The training samples in OCC are assumed to be normal, which is difficult to guarantee in practical situations. Such a case may degrade the performance of OCC-based anomaly detection methods, which fit the training distribution as the normal distribution. In this paper, we propose M…

    Submitted 6 February, 2024; v1 submitted 16 December, 2023; originally announced December 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2208.02108

  45. arXiv:2312.10979  [pdf, ps, other]

    cs.SD eess.AS

    3S-TSE: Efficient Three-Stage Target Speaker Extraction for Real-Time and Low-Resource Applications

    Authors: Shulin He, Jinjiang Liu, Hao Li, Yang Yang, Fei Chen, Xueliang Zhang

    Abstract: Target speaker extraction (TSE) aims to isolate a specific voice from multiple mixed speakers relying on a registered sample. Since voiceprint features usually vary greatly, current end-to-end neural networks require large model parameters, which are computationally intensive and impractical for real-time applications, especially on resource-constrained platforms. In this paper, we address the TSE tas…

    Submitted 4 January, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: Accepted to ICASSP 2024

  46. arXiv:2312.09716  [pdf, other]

    cs.CV

    Let All be Whitened: Multi-teacher Distillation for Efficient Visual Retrieval

    Authors: Zhe Ma, Jianfeng Dong, Shouling Ji, Zhenguang Liu, Xuhong Zhang, Zonghui Wang, Sifeng He, Feng Qian, Xiaobo Zhang, Lei Yang

    Abstract: Visual retrieval aims to search for the most relevant visual items, e.g., images and videos, from a candidate gallery with a given query item. Accuracy and efficiency are two competing objectives in retrieval tasks. Instead of crafting a new method pursuing further improvement on accuracy, in this paper we propose a multi-teacher distillation framework Whiten-MTD, which is able to transfer knowled…

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI 2024

  47. CAT: A Causally Graph Attention Network for Trimming Heterophilic Graph

    Authors: Silu He, Qinyao Luo, Xinsha Fu, Ling Zhao, Ronghua Du, Haifeng Li

    Abstract: Local Attention-guided Message Passing Mechanism (LAMP) adopted in Graph Attention Networks (GATs) is designed to adaptively learn the importance of neighboring nodes for better local aggregation on the graph, which can bring the representations of similar neighbors closer effectively, thus showing stronger discrimination ability. However, existing GATs suffer from a significant discrimination abi…

    Submitted 17 June, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: 25 pages, 18 figures, 5 tables

    Journal ref: Information Science 2024

  48. arXiv:2312.05062  [pdf, ps, other]

    eess.IV

    Deep Learning Enabled Semantic Communication Systems for Video Transmission

    Authors: Zhenguo Zhang, Qianqian Yang, Shibo He, Jiming Chen

    Abstract: Semantic communication has emerged as a promising approach for improving transmission efficiency in the next generation of wireless networks. Inspired by the success of semantic communication in different areas, we aim to provide a new semantic communication scheme from the semantic level. In this paper, we propose a novel DL-based semantic communication system for video transmission, which compact…

    Submitted 8 December, 2023; originally announced December 2023.

  49. arXiv:2312.04557  [pdf, other]

    cs.CV

    GenTron: Diffusion Transformers for Image and Video Generation

    Authors: Shoufa Chen, Mengmeng Xu, Jiawei Ren, Yuren Cong, Sen He, Yanping Xie, Animesh Sinha, Ping Luo, Tao Xiang, Juan-Manuel Perez-Rua

    Abstract: In this study, we explore Transformer-based diffusion models for image and video generation. Despite the dominance of Transformer architectures in various fields due to their flexibility and scalability, the visual generative domain primarily utilizes CNN-based U-Net architectures, particularly in diffusion-based models. We introduce GenTron, a family of Generative models employing Transformer-bas…

    Submitted 2 June, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: CVPR2024 Camera Ready. Website: https://www.shoufachen.com/gentron_website/

  50. arXiv:2312.02679  [pdf, other]

    hep-th

    Entanglement and Pseudo Entanglement Dynamics versus Fusion in CFT

    Authors: Song He, Yu-Xuan Zhang, Long Zhao, Zi-Xuan Zhao

    Abstract: The fusion rules and operator product expansion (OPE) serve as crucial tools in the study of operator algebras within conformal field theory (CFT). Building upon the vision of using entanglement to explore the connections between fusion coefficients and OPE coefficients, we employ the replica method and Schmidt decomposition method to investigate the time evolution of entanglement entropy (EE) and…

    Submitted 29 June, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: 29 pages, 4 figures, published version