Skip to main content

Showing 1–50 of 636 results for author: Kong, X

  1. arXiv:2407.09807  [pdf, other

    eess.AS

    A Streaming Multi-Channel End-to-End Speech Recognition System with Realistic Evaluations

    Authors: Xiangzhu Kong, Tianqi Ning, Hao Huang, Zhijian Ou

    Abstract: Recently multi-channel end-to-end (ME2E) ASR systems have emerged. While streaming single-channel end-to-end ASR has been extensively studied, streaming ME2E ASR is limited in exploration. Additionally, recent studies call attention to the gap between in-distribution (ID) and out-of-distribution (OOD) tests and doing realistic evaluations. This paper focuses on two research problems: realizing str… ▽ More

    Submitted 13 July, 2024; originally announced July 2024.

  2. arXiv:2407.09029  [pdf, other

    cs.MM cs.CV cs.SD eess.AS

    Enhancing Emotion Recognition in Incomplete Data: A Novel Cross-Modal Alignment, Reconstruction, and Refinement Framework

    Authors: Haoqin Sun, Shiwan Zhao, Shaokai Li, Xiangyu Kong, Xuechen Wang, Aobo Kong, Jiaming Zhou, Yong Chen, Wenjia Zeng, Yong Qin

    Abstract: Multimodal emotion recognition systems rely heavily on the full availability of modalities, suffering significant performance declines when modal data is incomplete. To tackle this issue, we present the Cross-Modal Alignment, Reconstruction, and Refinement (CM-ARR) framework, an innovative approach that sequentially engages in cross-modal alignment, reconstruction, and refinement phases to handle… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

  3. arXiv:2407.08946  [pdf, other

    cs.LG

    Your Diffusion Model is Secretly a Noise Classifier and Benefits from Contrastive Training

    Authors: Yunshu Wu, Yingtao Luo, Xianghao Kong, Evangelos E. Papalexakis, Greg Ver Steeg

    Abstract: Diffusion models learn to denoise data and the trained denoiser is then used to generate new samples from the data distribution. In this paper, we revisit the diffusion sampling process and identify a fundamental cause of sample quality degradation: the denoiser is poorly estimated in regions that are far Outside Of the training Distribution (OOD), and the sampling process inevitably evaluates in… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  4. arXiv:2407.07433  [pdf, other

    cs.CV cs.AI

    Controllable Navigation Instruction Generation with Chain of Thought Prompting

    Authors: Xianghao Kong, Jinyu Chen, Wenguan Wang, Hang Su, Xiaolin Hu, Yi Yang, Si Liu

    Abstract: Instruction generation is a vital and multidisciplinary research area with broad applications. Existing instruction generation models are limited to generating instructions in a single style from a particular dataset, and the style and content of generated instructions cannot be controlled. Moreover, most existing instruction generation methods also disregard the spatial modeling of the navigation… ▽ More

    Submitted 16 July, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  5. arXiv:2407.06813  [pdf, other

    cs.AI cs.MA cs.SI

    Richelieu: Self-Evolving LLM-Based Agents for AI Diplomacy

    Authors: Zhenyu Guan, Xiangyu Kong, Fangwei Zhong, Yizhou Wang

    Abstract: Diplomacy is one of the most sophisticated activities in human society. The complex interactions among multiple parties/ agents involve various abilities like social reasoning, negotiation arts, and long-term strategy planning. Previous AI agents surely have proved their capability of handling multi-step games and larger action spaces on tasks involving multiple agents. However, diplomacy involves… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  6. arXiv:2407.05954  [pdf, other

    cs.LG stat.ME

    Causality-driven Sequence Segmentation for Enhancing Multiphase Industrial Process Data Analysis and Soft Sensing

    Authors: Yimeng He, Le Yao, Xinmin Zhang, Xiangyin Kong, Zhihuan Song

    Abstract: The dynamic characteristics of multiphase industrial processes present significant challenges in the field of industrial big data modeling. Traditional soft sensing models frequently neglect the process dynamics and have difficulty in capturing transient phenomena like phase transitions. To address this issue, this article introduces a causality-driven sequence segmentation (CDSS) model. This mode… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  7. arXiv:2407.02382  [pdf, other

    cs.CV cs.LG cs.RO

    Light-SLAM: A Robust Deep-Learning Visual SLAM System Based on LightGlue under Challenging Lighting Conditions

    Authors: Zhiqi Zhao, Chang Wu, Xiaotong Kong, Zejie Lv, Xiaoqi Du, Qiyan Li

    Abstract: Simultaneous Localization and Mapping (SLAM) has become a critical technology for intelligent transportation systems and autonomous robots and is widely used in autonomous driving. However, traditional manual feature-based methods in challenging lighting environments make it difficult to ensure robustness and accuracy. Some deep learning-based methods show potential but still have significant draw… ▽ More

    Submitted 10 May, 2024; originally announced July 2024.

  8. arXiv:2407.02220  [pdf, other

    cs.RO cs.AI

    Embodied AI in Mobile Robots: Coverage Path Planning with Large Language Models

    Authors: Xiangrui Kong, Wenxiao Zhang, Jin Hong, Thomas Braunl

    Abstract: In recent years, Large Language Models (LLMs) have demonstrated remarkable capabilities in understanding and solving mathematical problems, leading to advancements in various fields. We propose an LLM-embodied path planning framework for mobile agents, focusing on solving high-level coverage path planning issues and low-level control. Our proposed multi-layer architecture uses prompted LLMs in the… ▽ More

    Submitted 3 July, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

    Comments: 7 pages, 2 figures, conference

  9. arXiv:2406.09480  [pdf, other

    quant-ph

    A photon-interfaced ten qubit quantum network node

    Authors: M. Canteri, Z. X. Koong, J. Bate, A. Winkler, V. Krutyanskiy, B. P. Lanyon

    Abstract: We entangle each individual matter-qubit in a register of ten to a separate travelling photon. The qubits are encoded in a string of cotrapped atomic ions. By switching the trap confinement, ions are brought one at a time into the waist of an optical cavity and emit a photon via a laser-driven cavity-mediated Raman transition. The result is a train of photonic-qubits, each near-maximally entangled… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  10. arXiv:2406.07006  [pdf, other

    cs.CV

    MIPI 2024 Challenge on Few-shot RAW Image Denoising: Methods and Results

    Authors: Xin Jin, Chunle Guo, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yuekun Dai, Peiqing Yang, Chen Change Loy, Ruoqi Li, Chang Liu, Ziyi Wang, Yao Du, Jingjing Yang, Long Bao, Heng Sun, Xiangyu Kong, Xiaoxia Xing, Jinlong Wu, Yuanyang Xue, Hyunhee Park, Sejun Song, Changho Kim, Jingfan Tan , et al. (17 additional authors not shown)

    Abstract: The increasing demand for computational photography and imaging on mobile platforms has led to the widespread development and integration of advanced image sensors with novel algorithms in camera systems. However, the scarcity of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photogra… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: CVPR 2024 Mobile Intelligent Photography and Imaging (MIPI) Workshop--Few-shot RAWImage Denoising Challenge Report. Website: https://mipi-challenge.org/MIPI2024/

  11. arXiv:2406.05651  [pdf, other

    cs.RO cs.CL cs.CV

    A Superalignment Framework in Autonomous Driving with Large Language Models

    Authors: Xiangrui Kong, Thomas Braunl, Marco Fahmi, Yue Wang

    Abstract: Over the last year, significant advancements have been made in the realms of large language models (LLMs) and multi-modal large language models (MLLMs), particularly in their application to autonomous driving. These models have showcased remarkable abilities in processing and interacting with complex information. In autonomous driving, LLMs and MLLMs are extensively used, requiring access to sensi… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: 6 pages, 5 figures, ieeeiv24

  12. arXiv:2406.05467  [pdf, other

    astro-ph.SR physics.plasm-ph physics.space-ph

    Prevalence of non-standard collapsing of strong Langmuir turbulence in solar corona plasmas

    Authors: Yaokun Li, Haomin Sun, Hao Ning, Sulan Ni, Xiangliang Kong, Jiansen He, Yao Chen

    Abstract: We present a fully-kinetic simulation of the full life cycle of strong Langmuir turbulence (SLT) excited by electron beams that are accelerated under the solar corona conditions. We find that (1) most packets ($\sim$80%) are affected by their neighbors during their collapse, as a result, their spatial scale variations present non-standard evolutionary features, i.e., deviating away from what was p… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  13. arXiv:2406.04638  [pdf, other

    cs.CL

    Large Language Model-guided Document Selection

    Authors: Xiang Kong, Tom Gunter, Ruoming Pang

    Abstract: Large Language Model (LLM) pre-training exhausts an ever growing compute budget, yet recent research has demonstrated that careful document selection enables comparable model quality with only a fraction of the FLOPs. Inspired by efforts suggesting that domain-specific training document selection is in fact an interpretable process [Gunasekar et al., 2023], as well as research showing that instruc… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 9 pages

  14. arXiv:2406.04248  [pdf, other

    astro-ph.SR astro-ph.IM

    An adaptive parameter estimator for poor-quality spectral data of white dwarfs

    Authors: Duo Xie, Jiangchuan Zhang, Yude Bu, Zhenping Yi, Meng Liu, Xiaoming Kong

    Abstract: White dwarfs represent the end stage for 97% of stars, making precise parameter measurement crucial for understanding stellar evolution. Traditional estimation methods involve fitting spectra or photometry, which require high-quality data. In recent years, machine learning has played a crucial role in processing spectral data due to its speed, automation, and accuracy. However, two common issues h… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  15. arXiv:2406.02962  [pdf, other

    cs.CL cs.AI cs.IR

    Docs2KG: Unified Knowledge Graph Construction from Heterogeneous Documents Assisted by Large Language Models

    Authors: Qiang Sun, Yuanyi Luo, Wenxiao Zhang, Sirui Li, Jichunyang Li, Kai Niu, Xiangrui Kong, Wei Liu

    Abstract: Even for a conservative estimate, 80% of enterprise data reside in unstructured files, stored in data lakes that accommodate heterogeneous formats. Classical search engines can no longer meet information seeking needs, especially when the task is to browse and explore for insight formulation. In other words, there are no obvious search keywords to use. Knowledge graphs, due to their natural visual… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  16. arXiv:2406.00109  [pdf, other

    astro-ph.SR astro-ph.HE physics.space-ph

    Energetic Electrons Accelerated and Trapped in a Magnetic Bottle above a Solar Flare Arcade

    Authors: Bin Chen, Xiangliang Kong, Sijie Yu, Chengcai Shen, Xiaocan Li, Fan Guo, Yixian Zhang, Lindsay Glesener, Säm Krucker

    Abstract: Where and how flares efficiently accelerate charged particles remains an unresolved question. Recent studies revealed that a "magnetic bottle" structure, which forms near the bottom of a large-scale reconnection current sheet above the flare arcade, is an excellent candidate for confining and accelerating charged particles. However, further understanding its role requires linking the various obser… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

    Comments: 20 pages, 13 figures (12 pages and 10 figures for main text). Accepted for publication in The Astrophysical Journal

  17. arXiv:2406.00059  [pdf, other

    cs.CL cs.DC cs.LG

    Conveyor: Efficient Tool-aware LLM Serving with Tool Partial Execution

    Authors: Yechen Xu, Xinhao Kong, Tingjun Chen, Danyang Zhuo

    Abstract: The complexity of large language model (LLM) serving workloads has substantially increased due to the integration with external tool invocations, such as ChatGPT plugins. In this paper, we identify a new opportunity for efficient LLM serving for requests that trigger tools: tool partial execution alongside LLM decoding. To this end, we design Conveyor, an efficient LLM serving system optimized for… ▽ More

    Submitted 4 June, 2024; v1 submitted 29 May, 2024; originally announced June 2024.

    Comments: 11 pages, 8 figures

  18. arXiv:2405.15052  [pdf, other

    cs.LG cs.AI

    Revisiting MoE and Dense Speed-Accuracy Comparisons for LLM Training

    Authors: Xianzhi Du, Tom Gunter, Xiang Kong, Mark Lee, Zirui Wang, Aonan Zhang, Nan Du, Ruoming Pang

    Abstract: Mixture-of-Experts (MoE) enjoys performance gain by increasing model capacity while keeping computation cost constant. When comparing MoE to dense models, prior work typically adopt the following setting: 1) use FLOPs or activated parameters as a measure of model complexity; 2) train all models to the same number of tokens. We argue that this setting favors MoE as FLOPs and activated parameters do… ▽ More

    Submitted 28 June, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: 8 pages

  19. arXiv:2405.10895  [pdf, other

    astro-ph.HE astro-ph.GA

    The unluckiest star: A spectroscopically confirmed repeated partial tidal disruption event AT 2022dbl

    Authors: Zheyu Lin, Ning Jiang, Tinggui Wang, Xu Kong, Dongyue Li, Han He, Yibo Wang, Jiazheng Zhu, Wentao Li, Ji-an Jiang, Avinash Singh, Rishabh Singh Teja, D. K. Sahu, Chichuan Jin, Keiichi Maeda, Shifeng Huang

    Abstract: The unluckiest star orbits a supermassive black hole elliptically. Every time it reaches the pericenter, it shallowly enters the tidal radius and gets partially tidal disrupted, producing a series of flares. Confirmation of a repeated partial tidal disruption event (pTDE) requires not only evidence to rule out other types of transients, but also proof that only one star is involved, as TDEs from m… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: 15 pages, 8 figures, submitted to ApJ Letters on 2024 Apr 27

  20. arXiv:2405.08438  [pdf, other

    cond-mat.str-el cond-mat.supr-con

    Magnetic fluctuation and dominant superconducting pairing symmetry near the tunable Van Hove singularity

    Authors: Xiaohan Kong, Boyang Wen, Kaiyi Guo, Ying Liang, Tianxing Ma

    Abstract: We have investigated the magnetism and pairing correlations of the triangular lattice based on the Hubbard model using the determinant quantum Monte Carlo method and the constrained path Monte Carlo. The results show that the presence of the next-nearest-neighbor hopping integral $t^{\prime}$ introduces an additional energy scale to the system, and through $t^{\prime}$, one can regulate the shape… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: 7 pages and 9 figures. Accepted for publication as a Regular Article in Physical Review B

  21. arXiv:2405.03116  [pdf, ps, other

    astro-ph.SR

    A Multi-Peak Solar Flare with a High Turnover Frequency of The Gyrosynchrotron Spectra from the Loop-Top Source

    Authors: Zhao Wu, Alexey Kuznetsov, Sergey Anfinogentov, Victor Melnikov, Robert Sych, Bing Wang, Ruisheng Zheng, Xiangliang Kong, Baolin Tan, Zongjun Ning, Yao Chen

    Abstract: The origin of multiple peaks in lightcurves of various wavelengths remains illusive during flares. Here we discuss the flare of SOL2023-05-09T03:54M6.5 with six flux peaks as recorded by a tandem of new microwave and Hard X-ray instruments. According to its microwave spectra, the flare represents a high-turnover frequency (>15 GHz) event. The rather-complete microwave and HXR spectral coverage pro… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

    Comments: 23 pages, 11 figures

  22. arXiv:2405.02583  [pdf, other

    cs.AI

    Explainable Interface for Human-Autonomy Teaming: A Survey

    Authors: Xiangqi Kong, Yang Xing, Antonios Tsourdos, Ziyue Wang, Weisi Guo, Adolfo Perrusquia, Andreas Wikander

    Abstract: Nowadays, large-scale foundation models are being increasingly integrated into numerous safety-critical applications, including human-autonomy teaming (HAT) within transportation, medical, and defence domains. Consequently, the inherent 'black-box' nature of these sophisticated deep neural networks heightens the significance of fostering mutual understanding and trust between humans and autonomous… ▽ More

    Submitted 4 May, 2024; originally announced May 2024.

    Comments: 45 pages, 9 figures

  23. arXiv:2404.19527  [pdf, other

    cs.CV

    Revealing the Two Sides of Data Augmentation: An Asymmetric Distillation-based Win-Win Solution for Open-Set Recognition

    Authors: Yunbing Jia, Xiaoyu Kong, Fan Tang, Yixing Gao, Weiming Dong, Yi Yang

    Abstract: In this paper, we reveal the two sides of data augmentation: enhancements in closed-set recognition correlate with a significant decrease in open-set recognition. Through empirical investigation, we find that multi-sample-based augmentations would contribute to reducing feature discrimination, thereby diminishing the open-set criteria. Although knowledge distillation could impair the feature via i… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

  24. arXiv:2404.15701  [pdf, other

    astro-ph.GA

    USmorph: An Updated Framework of Automatic Classification of Galaxy Morphologies and Its Application to Galaxies in the COSMOS Field

    Authors: Jie Song, GuanWen Fang, Shuo Ba, Zesen Lin, Yizhou Gu, Chichun Zhou, Tao Wang, Cai-Na Hao, Guilin Liu, Hongxin Zhang, Yao Yao, Xu Kong

    Abstract: Morphological classification conveys abundant information on the formation, evolution, and environment of galaxies. In this work, we refine the two-step galaxy morphological classification framework ({\tt\string USmorph}), which employs a combination of unsupervised machine learning (UML) and supervised machine learning (SML) techniques, along with a self-consistent and robust data preprocessing s… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: Accepted by ApJS, 16 pages, 12 figures

  25. arXiv:2404.11409  [pdf, ps, other

    cs.IT math.CO

    Batch Array Codes

    Authors: Xiangliang Kong, Chen Wang, Yiwei Zhang

    Abstract: Batch codes are a type of codes specifically designed for coded distributed storage systems and private information retrieval protocols. These codes have got much attention in recent years due to their ability to enable efficient and secure storage in distributed systems. In this paper, we study an array code version of the batch codes, which is called the \emph{batch array code} (BAC). Under th… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: 24 pages

    MSC Class: 68P30; 94B60 ACM Class: E.4

  26. arXiv:2404.09422  [pdf, other

    astro-ph.GA

    FEASTS Combined with Interferometry (I): Overall Properties of Diffuse HI and Implications for Gas Accretion in Nearby Galaxies

    Authors: Jing Wang, Xuchen Lin, Dong Yang, Lister Staveley-Smith, Fabian Walter, Q. Daniel Wang, Ran Wang, A. J. Battisti, Barbara Catinella, Hsiao-Wen Chen, Luca Cortese, D. B. Fisher, Luis C. Ho, Suoqing Ji, Peng Jiang, Guinevere Kauffmann, Xu Kong, Ziming Liu, Li Shao, Jie Wang, Lile Wang, Shun Wang

    Abstract: We present a statistical study of the properties of diffuse HI in ten nearby galaxies, comparing the HI detected by the single-dish telescope FAST (FEASTS program) and the interferometer VLA (THINGS program), respectively. The THINGS' observation missed HI with a median of 23% due to the short-spacing problem of interferometry and limited sensitivity. We extract the diffuse HI by subtracting the d… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: 45 pages, 23 figures. In press at ApJ. Data will be released at the FEASTS site upon publication

  27. arXiv:2403.18304  [pdf, ps, other

    math.PR

    Complete moment convergence of moving average processes for $m$-widely acceptable sequence under sub-linear expectations

    Authors: Mingzhou Xu, Xuhang Kong

    Abstract: In this article, the complete moment convergence for the partial sum of moving average processes $\{X_n=\sum_{i=-\infty}^{\infty}a_iY_{i+n},n\ge 1\}$ is estabished under some proper conditions, where $\{Y_i,-\infty<i<\infty\}$ is a sequence of $m$-widely acceptable ($m$-WA) random variables, which is stochastically dominated by a random variable $Y$ in sub-linear expectations space $(Ω,\HH,\ee)$ a… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: 16 pages,submitted to Journal of Inequalities and Applications

    MSC Class: 60F15; 60F05

  28. arXiv:2403.16405  [pdf, other

    cs.LG cs.CR cs.CV

    Ensemble Adversarial Defense via Integration of Multiple Dispersed Low Curvature Models

    Authors: Kaikang Zhao, Xi Chen, Wei Huang, Liuxin Ding, Xianglong Kong, Fan Zhang

    Abstract: The integration of an ensemble of deep learning models has been extensively explored to enhance defense against adversarial attacks. The diversity among sub-models increases the attack cost required to deceive the majority of the ensemble, thereby improving the adversarial robustness. While existing approaches mainly center on increasing diversity in feature representations or dispersion of first-… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: Accepted to The 2024 International Joint Conference on Neural Networks (IJCNN)

  29. Radio-to-Submillimetre Spectral Energy Distributions of NGC 1365

    Authors: Guangwen Chen, George J. Bendo, Gary A. Fuller, Hong-Xin Zhang, Xu Kong

    Abstract: We analyse the radio-to-submillimetre spectral energy distribution (SED) for the central pseudobulge of NGC~1365 using archival data from the Atacama Large Millimeter/submillimeter Array (ALMA) and the Very Large Array (VLA). This analysis shows that free-free emission dominates the continuum emission at 50--120~GHz and produces about 75 per cent of the 103~GHz continuum emission. However, the fra… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 17 pages, 10 figures, 11 tables, accepted for publication in MNRAS

  30. arXiv:2403.09611  [pdf, other

    cs.CV cs.CL cs.LG

    MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

    Authors: Brandon McKinzie, Zhe Gan, Jean-Philippe Fauconnier, Sam Dodge, Bowen Zhang, Philipp Dufter, Dhruti Shah, Xianzhi Du, Futang Peng, Floris Weers, Anton Belyi, Haotian Zhang, Karanjeet Singh, Doug Kang, Ankur Jain, Hongyu Hè, Max Schwarzer, Tom Gunter, Xiang Kong, Aonan Zhang, Jianyu Wang, Chong Wang, Nan Du, Tao Lei, Sam Wiseman , et al. (7 additional authors not shown)

    Abstract: In this work, we discuss building performant Multimodal Large Language Models (MLLMs). In particular, we study the importance of various architecture components and data choices. Through careful and comprehensive ablations of the image encoder, the vision language connector, and various pre-training data choices, we identified several crucial design lessons. For example, we demonstrate that for la… ▽ More

    Submitted 18 April, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

  31. arXiv:2403.05912  [pdf, other

    eess.IV cs.CV

    Mask-Enhanced Segment Anything Model for Tumor Lesion Semantic Segmentation

    Authors: Hairong Shi, Songhao Han, Shaofei Huang, Yue Liao, Guanbin Li, Xiangxing Kong, Hua Zhu, Xiaomu Wang, Si Liu

    Abstract: Tumor lesion segmentation on CT or MRI images plays a critical role in cancer diagnosis and treatment planning. Considering the inherent differences in tumor lesion segmentation data across various medical imaging modalities and equipment, integrating medical knowledge into the Segment Anything Model (SAM) presents promising capability due to its versatility and generalization potential. Recent st… ▽ More

    Submitted 11 July, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

  32. arXiv:2403.01686  [pdf, other

    astro-ph.HE astro-ph.GA

    AT2023lli: A Tidal Disruption Event with Prominent Optical Early Bump and Delayed Episodic X-ray Emission

    Authors: Shifeng Huang, Ning Jiang, Jiazheng Zhu, Yibo Wang, Tinggui Wang, Shan-Qin Wang, Wen-Pei Gan, En-Wei Liang, Yu-Jing Qin, Zheyu Lin, Lin-Na Xu, Min-Xuan Cai, Ji-An Jiang, Xu Kong, Jiaxun Li, Long Li, Jian-Guo Wang, Ze-Lin Xu, Yongquan Xue, Ye-Fei Yuan, Jingquan Cheng, Lulu Fan, Jie Gao, Lei Hu, Weida Hu , et al. (20 additional authors not shown)

    Abstract: High-cadence, multiwavelength observations have continuously revealed the diversity of tidal disruption events (TDEs), thus greatly advancing our knowledge and understanding of TDEs. In this work, we conducted an intensive optical-UV and X-ray follow-up campaign of TDE AT2023lli, and found a remarkable month-long bump in its UV/optical light curve nearly two months prior to maximum brightness. The… ▽ More

    Submitted 26 March, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

    Comments: 14 pages, 8 figures,accepted for publication by ApJL

  33. arXiv:2403.00485  [pdf, other

    cs.LG

    A Survey of Geometric Graph Neural Networks: Data Structures, Models and Applications

    Authors: Jiaqi Han, Jiacheng Cen, Liming Wu, Zongzhao Li, Xiangzhe Kong, Rui Jiao, Ziyang Yu, Tingyang Xu, Fandi Wu, Zihe Wang, Hongteng Xu, Zhewei Wei, Yang Liu, Yu Rong, Wenbing Huang

    Abstract: Geometric graph is a special kind of graph with geometric features, which is vital to model many scientific problems. Unlike generic graphs, geometric graphs often exhibit physical symmetries of translations, rotations, and reflections, making them ineffectively processed by current Graph Neural Networks (GNNs). To tackle this issue, researchers proposed a variety of Geometric Graph Neural Network… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

  34. arXiv:2402.13555  [pdf, other

    q-bio.BM

    Full-Atom Peptide Design with Geometric Latent Diffusion

    Authors: Xiangzhe Kong, Yinjun Jia, Wenbing Huang, Yang Liu

    Abstract: Peptide design plays a pivotal role in therapeutics, allowing brand new possibility to leverage target binding sites that are previously undruggable. Most existing methods are either inefficient or only concerned with the target-agnostic design of 1D sequences. In this paper, we propose a generative model for full-atom \textbf{Pep}tide design with \textbf{G}eometric \textbf{LA}tent \textbf{D}iffus… ▽ More

    Submitted 21 May, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: 25 pages

  35. arXiv:2402.12714  [pdf, other

    cs.LG physics.chem-ph

    Equivariant Pretrained Transformer for Unified Geometric Learning on Multi-Domain 3D Molecules

    Authors: Rui Jiao, Xiangzhe Kong, Ziyang Yu, Wenbing Huang, Yang Liu

    Abstract: Pretraining on a large number of unlabeled 3D molecules has showcased superiority in various scientific applications. However, prior efforts typically focus on pretraining models on a specific domain, either proteins or small molecules, missing the opportunity to leverage the cross-domain knowledge. To mitigate this gap, we introduce Equivariant Pretrained Transformer (EPT), a novel pretraining fr… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  36. arXiv:2402.11907  [pdf, other

    cs.CL

    Direct Large Language Model Alignment Through Self-Rewarding Contrastive Prompt Distillation

    Authors: Aiwei Liu, Haoping Bai, Zhiyun Lu, Xiang Kong, Simon Wang, Jiulong Shan, Meng Cao, Lijie Wen

    Abstract: Aligning large language models (LLMs) with human expectations without human-annotated preference data is an important problem. In this paper, we propose a method to evaluate the response preference by using the output probabilities of response pairs under contrastive prompt pairs, which could achieve better performance on LLaMA2-7B and LLaMA2-13B compared to RLAIF. Based on this, we propose an aut… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: 24 pages, 5 pages

    MSC Class: 68T50 ACM Class: I.2.7

  37. arXiv:2402.03908  [pdf, other

    cs.CV

    EscherNet: A Generative Model for Scalable View Synthesis

    Authors: Xin Kong, Shikun Liu, Xiaoyang Lyu, Marwan Taher, Xiaojuan Qi, Andrew J. Davison

    Abstract: We introduce EscherNet, a multi-view conditioned diffusion model for view synthesis. EscherNet learns implicit and generative 3D representations coupled with a specialised camera positional encoding, allowing precise and continuous relative control of the camera transformation between an arbitrary number of reference and target views. EscherNet offers exceptional generality, flexibility, and scala… ▽ More

    Submitted 19 March, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: CVPR2024 Project Page: https://kxhit.github.io/EscherNet

  38. arXiv:2402.01755  [pdf

    cond-mat.mtrl-sci

    Spontaneous nucleation and growth of GaN nanowires: Fundamental role of crystal polarity

    Authors: Sergio Fernández-Garrido, Xiang Kong, Tobias Gotschke, Raffaella Calarco, Lutz Geelhaar, Achim Trampert, Oliver Brandt

    Abstract: We experimentally investigate whether crystal polarity affects the growth of GaN nanowires in plasma-assisted molecular beam epitaxy and whether their formation has to be induced by defects. For this purpose, we prepare smooth and coherently strained AlN layers on 6H-SiC(0001) and SiC(000$\bar{1}$) substrates to ensure a well-defined polarity and an absence of structural and morphological defects.… ▽ More

    Submitted 30 January, 2024; originally announced February 2024.

    Journal ref: Nano Lett. 2012, 12, 12, 6119

  39. arXiv:2401.17364  [pdf, other

    astro-ph.GA astro-ph.CO astro-ph.IM

    HiFAST: an HI data calibration and imaging pipeline for FAST

    Authors: Yingjie Jing, Jie Wang, Chen Xu, Ziming Liu, Qingze Chen, Tiantian Liang, Jinlong Xu, Yixian Cao, Jing Wang, Huijie Hu, Chuan-Peng Zhang, Qi Guo, Liang Gao, Mei Ai, Hengqian Gan, Xuyang Gao, Jinlin Han, Ligang Hou, Zhipeng Hou, Peng Jiang, Xu Kong, Fujia Li, Zerui Liu, Li Shao, Hengxing Pan , et al. (8 additional authors not shown)

    Abstract: The Five-hundred-meter Aperture Spherical radio Telescope (FAST) has the largest aperture and a 19-beam L-band receiver, making it powerful for investigating the neutral hydrogen atomic gas (HI) in the universe. We present HiFAST (https://hifast.readthedocs.io), a dedicated, modular, and self-contained calibration and imaging pipeline for processing the HI data of FAST. The pipeline consists of fr… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: Accepted by SCPMA. 21 pages, 14 figures. The pipeline is accessible at https://hifast.readthedocs.io

  40. Polarity-induced selective area epitaxy of GaN nanowires

    Authors: Ziani de Souza Schiaber, Gabriele Calabrese, Xiang Kong, Achim Trampert, Bernd Jenichen, José Humberto Dias da Silva, Lutz Geelhaar, Oliver Brandt, Sergio Fernández-Garrido

    Abstract: We present a conceptually novel approach to achieve selective area epitaxy of GaN nanowires. The approach is based on the fact that these nanostructures do not form in plasma-assisted molecular beam epitaxy on structurally and chemically uniform cation-polar substrates. By in situ depositing and nitridating Si on a Ga-polar GaN film, we locally reverse the polarity to induce the selective area epi… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Journal ref: Nano Lett. 2017, 17, 63

  41. MobFuzz: Adaptive Multi-objective Optimization in Gray-box Fuzzing

    Authors: Gen Zhang, Pengfei Wang, Tai Yue, Xiangdong Kong, Shan Huang, Xu Zhou, Kai Lu

    Abstract: Coverage-guided gray-box fuzzing (CGF) is an efficient software testing technique. There are usually multiple objectives to optimize in CGF. However, existing CGF methods cannot successfully find the optimal values for multiple objectives simultaneously. In this paper, we propose a gray-box fuzzer for multi-objective optimization (MOO) called MobFuzz. We model the multi-objective optimization proc… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Journal ref: Network and Distributed Systems Security (NDSS) Symposium 2022

  42. arXiv:2401.15927  [pdf, other

    cs.CL

    E-EVAL: A Comprehensive Chinese K-12 Education Evaluation Benchmark for Large Language Models

    Authors: Jinchang Hou, Chang Ao, Haihong Wu, Xiangtao Kong, Zhigang Zheng, Daijia Tang, Chengming Li, Xiping Hu, Ruifeng Xu, Shiwen Ni, Min Yang

    Abstract: With the accelerating development of Large Language Models (LLMs), many LLMs are beginning to be used in the Chinese K-12 education domain. The integration of LLMs and education is getting closer and closer, however, there is currently no benchmark for evaluating LLMs that focuses on the Chinese K-12 education domain. Therefore, there is an urgent need for a comprehensive natural language processi… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  43. arXiv:2401.13627  [pdf, other

    cs.CV

    Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild

    Authors: Fanghua Yu, Jinjin Gu, Zheyuan Li, Jinfan Hu, Xiangtao Kong, Xintao Wang, Jingwen He, Yu Qiao, Chao Dong

    Abstract: We introduce SUPIR (Scaling-UP Image Restoration), a groundbreaking image restoration method that harnesses generative prior and the power of model scaling up. Leveraging multi-modal techniques and advanced generative prior, SUPIR marks a significant advance in intelligent and realistic image restoration. As a pivotal catalyst within SUPIR, model scaling dramatically enhances its capabilities and… ▽ More

    Submitted 3 April, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: This paper has been accepted by CVPR 2024

  44. arXiv:2401.10568  [pdf, other

    cs.AI

    CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents

    Authors: Siyuan Qi, Shuo Chen, Yexin Li, Xiangyu Kong, Junqi Wang, Bangcheng Yang, Pring Wong, Yifan Zhong, Xiaoyuan Zhang, Zhaowei Zhang, Nian Liu, Wei Wang, Yaodong Yang, Song-Chun Zhu

    Abstract: The generalization of decision-making agents encompasses two fundamental elements: learning from past experiences and reasoning in novel contexts. However, the predominant emphasis in most interactive environments is on learning, often at the expense of complexity in reasoning. In this paper, we introduce CivRealm, an environment inspired by the Civilization game. Civilization's profound alignment… ▽ More

    Submitted 12 March, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

  45. arXiv:2401.06905  [pdf, other

    physics.geo-ph physics.flu-dyn

    Physics-Informed Convolutional Decoder (PICD): A novel approach for direct inversion of heterogeneous subsurface flow

    Authors: Nanzhe Wang, Xiang-Zhao Kong, Dongxiao Zhang

    Abstract: In this study, we present the development and application of the physics-informed convolutional decoder (PICD) framework for inverse modeling of heterogenous groundwater flow. PICD stands out as a direct inversion method, eliminating the need for repeated forward model simulations. The framework leverages both data-driven and physics-driven approaches by integrating monitoring data and domain know… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

    Comments: 29 pages, 7 figures

  46. arXiv:2401.05778  [pdf, other

    cs.CL cs.AI

    Risk Taxonomy, Mitigation, and Assessment Benchmarks of Large Language Model Systems

    Authors: Tianyu Cui, Yanling Wang, Chuanpu Fu, Yong Xiao, Sijia Li, Xinhao Deng, Yunpeng Liu, Qinglin Zhang, Ziyi Qiu, Peiyang Li, Zhixing Tan, Junwu Xiong, Xinyu Kong, Zujie Wen, Ke Xu, Qi Li

    Abstract: Large language models (LLMs) have strong capabilities in solving diverse natural language processing tasks. However, the safety and security issues of LLM systems have become the major obstacle to their widespread application. Many studies have extensively investigated risks in LLM systems and developed the corresponding mitigation strategies. Leading-edge enterprises such as OpenAI, Google, Meta,… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

  47. arXiv:2401.04900  [pdf, other

    astro-ph.SR astro-ph.IM cs.LG stat.ML

    SPT: Spectral Transformer for Red Giant Stars Age and Mass Estimation

    Authors: Mengmeng Zhang, Fan Wu, Yude Bu, Shanshan Li, Zhenping Yi, Meng Liu, Xiaoming Kong

    Abstract: The age and mass of red giants are essential for understanding the structure and evolution of the Milky Way. Traditional isochrone methods for these estimations are inherently limited due to overlapping isochrones in the Hertzsprung-Russell diagram, while asteroseismology, though more precise, requires high-precision, long-term observations. In response to these challenges, we developed a novel fr… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

    Comments: Accepted by A&A

  48. arXiv:2401.03959  [pdf, ps, other

    astro-ph.SR

    Projected rotational velocities for LAMOST stars with effective temperature lower than 9000 K

    Authors: Fang Zuo, A-Li Luo, Bing Du, Yinbi Li, Hugh R. A. Jones, Yi-han Song, Xiao Kong, Yan-xin Guo

    Abstract: In Data Release 9 of LAMOST, we present measurements of v sin i for a total of 121,698 stars measured using the Medium Resolution Spectrograph (MRS) and 80,108 stars using the Low Resolution Spectrograph (LRS). These values were obtained through a chi^2 minimisation process, comparing LAMOST spectra with corresponding grids of synthetically broadened spectra. Due to the resolution and the spectral… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: 13 pages, 16 figures

  49. arXiv:2401.03379  [pdf, other

    cs.CV

    Towards Effective Multiple-in-One Image Restoration: A Sequential and Prompt Learning Strategy

    Authors: Xiangtao Kong, Chao Dong, Lei Zhang

    Abstract: While single task image restoration (IR) has achieved significant successes, it remains a challenging issue to train a single model which can tackle multiple IR tasks. In this work, we investigate in-depth the multiple-in-one (MiO) IR problem, which comprises seven popular IR tasks. We point out that MiO IR faces two pivotal challenges: the optimization of diverse objectives and the adaptation to… ▽ More

    Submitted 20 March, 2024; v1 submitted 6 January, 2024; originally announced January 2024.

  50. arXiv:2312.16632  [pdf, other

    cond-mat.supr-con cond-mat.str-el

    Optical conductivity of overdoped cuprates from ab-initio out-of-plane impurity potentials

    Authors: D. M. Broun, H. U. Özdemir, Vivek Mishra, N. R. Lee-Hone, Xiangru Kong, T. Berlijn, P. J. Hirschfeld

    Abstract: Dopant impurity potentials determined by ab-initio supercell DFT calculations are used to calculate the optical conductivity of overdoped LSCO and Tl-2201 in the superconducting and normal states. Vertex corrections are included, to account for the effect of forward scattering on two-particle properties. This approach was previously shown to provide good, semiquantitative agreement with measuremen… ▽ More

    Submitted 27 December, 2023; originally announced December 2023.

    Comments: 12 pages, 7 figures

    Journal ref: Physical Review B 109, 174519 (2024)