
Showing 1–50 of 691 results for author: He, W

  1. arXiv:2407.07614  [pdf, other]

    cs.CV

    MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis

    Authors: Wanggui He, Siming Fu, Mushui Liu, Xierui Wang, Wenyi Xiao, Fangxun Shu, Yi Wang, Lei Zhang, Zhelun Yu, Haoyuan Li, Ziwei Huang, LeiLei Gan, Hao Jiang

    Abstract: Auto-regressive models have made significant progress in the realm of language generation, yet they do not perform on par with diffusion models in the domain of image synthesis. In this work, we introduce MARS, a novel framework for T2I generation that incorporates a specially designed Semantic Vision-Language Integration Expert (SemVIE). This innovative component integrates pre-trained LLMs by in… ▽ More

    Submitted 11 July, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

    Comments: 14 pages, 9 figures

  2. arXiv:2407.07060  [pdf, other]

    quant-ph physics.optics

    Imaging-based Quantum Optomechanics

    Authors: Christian M. Pluchar, Wenhua He, Jack Manley, Nicolas Deshler, Saikat Guha, Dalziel J. Wilson

    Abstract: In active imaging protocols, information about a landscape is encoded into the spatial mode of a scattered photon. A common assumption is that the landscape is rigid; however, in principle it can be altered by radiation pressure, a concept that has found fruitful application in the field of quantum optomechanics. Here we explore active imaging of a mechanical resonator with an eye to generalizing… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 10 pages, 5 figures

  3. arXiv:2407.02301  [pdf, other]

    cs.CL

    CFinBench: A Comprehensive Chinese Financial Benchmark for Large Language Models

    Authors: Ying Nie, Binwei Yan, Tianyu Guo, Hao Liu, Haoyu Wang, Wei He, Binfan Zheng, Weihao Wang, Qiang Li, Weijian Sun, Yunhe Wang, Dacheng Tao

    Abstract: Large language models (LLMs) have achieved remarkable performance on various NLP tasks, yet their potential in more challenging and domain-specific tasks, such as finance, has not been fully explored. In this paper, we present CFinBench: a meticulously crafted and, to date, the most comprehensive evaluation benchmark for assessing the financial knowledge of LLMs in a Chinese context. In practice, to b… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  4. arXiv:2407.00079  [pdf, other]

    cs.DC cs.AI cs.AR

    Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving

    Authors: Ruoyu Qin, Zheming Li, Weiran He, Mingxing Zhang, Yongwei Wu, Weimin Zheng, Xinran Xu

    Abstract: Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI. It features a KVCache-centric disaggregated architecture that separates the prefill and decoding clusters. It also leverages the underutilized CPU, DRAM, and SSD resources of the GPU cluster to implement a disaggregated cache of KVCache. The core of Mooncake is its KVCache-centric scheduler, which balances ma… ▽ More

    Submitted 9 July, 2024; v1 submitted 23 June, 2024; originally announced July 2024.

    Comments: 23 pages, 13 figures

  5. arXiv:2406.18599  [pdf, other]

    physics.ins-det nucl-ex nucl-th

    Fudan Multi-purpose Active TArget Time Projection Chamber (fMeta-TPC) for Photonnuclear Reaction Experiments

    Authors: Huang-Kai Wu, Xi-Yang Wang, Yu-Miao Wang, You-Jing Wang, De-Qing Fang, Wan-Bing He, Wei-Hu Ma, Xi-Guang Cao, Chang-Bo Fu, Xian-Gai Deng, Yu-Gang Ma

    Abstract: Active Target Time Projection Chambers (AT-TPCs) are state-of-the-art tools in the field of low-energy nuclear physics, particularly suitable for experiments using low-intensity radioactive ion beams or gamma rays. The Fudan Multi-purpose Active Target Time Projection Chamber (fMeta-TPC) with 2048 channels has been developed to study $α$-clustering nuclei. In this work, the focus is on the s… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 10 pages, 12 figures

  6. arXiv:2406.18049  [pdf]

    cs.CL cs.AI

    Improving Entity Recognition Using Ensembles of Deep Learning and Fine-tuned Large Language Models: A Case Study on Adverse Event Extraction from Multiple Sources

    Authors: Yiming Li, Deepthi Viswaroopan, William He, Jianfu Li, Xu Zuo, Hua Xu, Cui Tao

    Abstract: Adverse event (AE) extraction following COVID-19 vaccines from text data is crucial for monitoring and analyzing the safety profiles of immunizations. Traditional deep learning models are adept at learning intricate feature representations and dependencies in sequential data, but often require extensive labeled data. In contrast, large language models (LLMs) excel in understanding contextual infor… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  7. arXiv:2406.17838  [pdf, other]

    cs.LG cs.AI cs.HC

    InFiConD: Interactive No-code Fine-tuning with Concept-based Knowledge Distillation

    Authors: Jinbin Huang, Wenbin He, Liang Gou, Liu Ren, Chris Bryan

    Abstract: The emergence of large-scale pre-trained models has heightened their application in various downstream tasks, yet deployment is a challenge in environments with limited computational resources. Knowledge distillation has emerged as a solution in such scenarios, whereby knowledge from large teacher models is transferred into smaller student models, but this is a non-trivial process that traditiona… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  8. arXiv:2406.16966  [pdf, other]

    cs.CV cs.LG

    Mitigating Noisy Supervision Using Synthetic Samples with Soft Labels

    Authors: Yangdi Lu, Wenbo He

    Abstract: Noisy labels are ubiquitous in real-world datasets, especially in the large-scale ones derived from crowdsourcing and web searching. It is challenging to train deep neural networks with noisy datasets since the networks are prone to overfitting the noisy labels during training, resulting in poor generalization performance. During an early learning phase, deep neural networks have been observed to… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: Noisy labels, Machine learning, Similarity Search

  9. arXiv:2406.15982  [pdf, other]

    cs.CV cs.AI cs.LG

    Learning with Noisy Ground Truth: From 2D Classification to 3D Reconstruction

    Authors: Yangdi Lu, Wenbo He

    Abstract: Deep neural networks have been highly successful in data-intensive computer vision applications, but this success relies heavily on massive, clean data. In real-world scenarios, clean data is sometimes difficult to obtain. For example, in image classification and segmentation tasks, precise annotations of millions of samples are generally very expensive and time-consuming. In 3D static scene re… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: Computer vision, Noisy Labels, 3D reconstruction, 3D Gaussian Splats, (Work still in progress)

  10. arXiv:2406.15973  [pdf, ps, other]

    physics.ins-det hep-ex

    Performance of the plastic scintillator modules for the top veto tracker of the Taishan Antineutrino Observatory

    Authors: Guang Luo, Xiaohao Yin, Fengpeng An, Zhimin Wang, Y. K. Hor, Peizhi Lu, Ruhui Li, Yichen Li, Wei He, Wei Wang, Xiang Xiao

    Abstract: For tracking and tagging the cosmic-ray muon (CR-muon), the Taishan Antineutrino Observatory (TAO) experiment is equipped with a top veto tracker (TVT) system composed of 160 modules, each consisting of plastic scintillator (PS) strip as target material, embedded wavelength shifting fiber (WLS-fiber) as photon collection and transmission medium, and silicon photomultipliers (SiPMs) at both ends as… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

  11. arXiv:2406.15175  [pdf, other]

    cs.CL cs.AI

    Enhancing Idiomatic Representation in Multiple Languages via an Adaptive Contrastive Triplet Loss

    Authors: Wei He, Marco Idiart, Carolina Scarton, Aline Villavicencio

    Abstract: Accurately modeling idiomatic or non-compositional language has been a longstanding challenge in Natural Language Processing (NLP). This is partly because these expressions do not derive their meanings solely from their constituent words, but also due to the scarcity of relevant data resources, and their impact on the performance of downstream tasks such as machine translation and simplification.… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  12. arXiv:2406.10652  [pdf, other]

    cs.CV

    MDeRainNet: An Efficient Neural Network for Rain Streak Removal from Macro-pixel Images

    Authors: Tao Yan, Weijiang He, Chenglong Wang, Xiangjie Zhu, Yinghui Wang, Rynson W. H. Lau

    Abstract: Since rainy weather always degrades image quality and poses significant challenges to most computer vision-based intelligent systems, image de-raining has been a hot research topic. Fortunately, in a rainy light field (LF) image, background obscured by rain streaks in one sub-view may be visible in the other sub-views, and implicit depth information and recorded 4D structural information may benef… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: 13 pages, 13 figures, 4 tables

  13. arXiv:2406.10026  [pdf]

    physics.optics

    Retiming dynamics of harmonically modelocked laser solitons in a self-driven optomechanical lattice

    Authors: Xiaocong Wang, Benhai Wang, Wenbin He, Xintong Zhang, Qi Huang, Zhiyuan Huang, Xin Jiang, Philip St. J. Russell, Meng Pang

    Abstract: Harmonic mode-locking, realized actively or passively, is an effective technique for increasing the repetition rate of lasers, with important applications in optical sampling, laser micro-machining and frequency metrology. It is critically important to understand how a harmonically mode-locked pulse train responds to external perturbations and noise, so as to make sure that it is stable and resist… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  14. arXiv:2406.09844  [pdf, other]

    cs.SD eess.AS

    Vec-Tok-VC+: Residual-enhanced Robust Zero-shot Voice Conversion with Progressive Constraints in a Dual-mode Training Strategy

    Authors: Linhan Ma, Xinfa Zhu, Yuanjun Lv, Zhichao Wang, Ziqian Wang, Wendi He, Hongbin Zhou, Lei Xie

    Abstract: Zero-shot voice conversion (VC) aims to transform source speech into arbitrary unseen target voice while keeping the linguistic content unchanged. Recent VC methods have made significant progress, but semantic losses in the decoupling process as well as training-inference mismatch still hinder conversion performance. In this paper, we propose Vec-Tok-VC+, a novel prompt-based zero-shot VC model im… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Accepted by INTERSPEECH2024

  15. arXiv:2406.08499  [pdf, ps, other]

    cs.CC cs.CR

    More Efficient $k$-wise Independent Permutations from Random Reversible Circuits via log-Sobolev Inequalities

    Authors: Lucas Gretta, William He, Angelos Pelecanos

    Abstract: We prove that the permutation computed by a reversible circuit with $\tilde{O}(nk\cdot \log(1/\varepsilon))$ random $3$-bit gates is $\varepsilon$-approximately $k$-wise independent. Our bound improves on currently known bounds in the regime when the approximation error $\varepsilon$ is not too small. We obtain our results by analyzing the log-Sobolev constants of appropriate Markov chains rather… ▽ More

    Submitted 8 May, 2024; originally announced June 2024.

    Comments: 19 pages

  16. arXiv:2406.08372  [pdf, other]

    cs.CV

    APSeg: Auto-Prompt Network for Cross-Domain Few-Shot Semantic Segmentation

    Authors: Weizhao He, Yang Zhang, Wei Zhuo, Linlin Shen, Jiaqi Yang, Songhe Deng, Liang Sun

    Abstract: Few-shot semantic segmentation (FSS) endeavors to segment unseen classes with only a few labeled samples. Current FSS methods are commonly built on the assumption that their training and application scenarios share similar domains, and their performance degrades significantly when applied to a distinct domain. To this end, we propose to leverage the cutting-edge foundation model, the Segment Anyt… ▽ More

    Submitted 12 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: 15 pages, 9 figures

  17. arXiv:2406.07209  [pdf, other]

    cs.CV

    MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance

    Authors: X. Wang, Siming Fu, Qihan Huang, Wanggui He, Hao Jiang

    Abstract: Recent advancements in text-to-image generation models have dramatically enhanced the generation of photorealistic images from textual prompts, leading to an increased interest in personalized text-to-image applications, particularly in multi-subject scenarios. However, these advances are hindered by two main challenges: firstly, the need to accurately maintain the details of each referenced subje… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  18. arXiv:2406.07064  [pdf, other]

    physics.med-ph physics.bio-ph physics.flu-dyn

    Modeling fibrous tissue in vascular fluid-structure interaction: a morphology-based pipeline and biomechanical significance

    Authors: Yujie Sun, Jiayi Huang, Qingshuang Lu, Xinhai Yue, Xuanming Huang, Wei He, Yun Shi, Ju Liu

    Abstract: We propose a suite of technologies for analyzing the interaction between anisotropic arterial walls and blood flow for subject-specific geometries. Utilizing an established lumen modeling strategy, we present a comprehensive pipeline for generating the thick-walled artery models. Through a specialized mesh generation procedure, we obtain the meshes for the arterial lumen and wall with mesh continu… ▽ More

    Submitted 20 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

  19. arXiv:2406.05271  [pdf, other]

    cs.CV

    USE: Universal Segment Embeddings for Open-Vocabulary Image Segmentation

    Authors: Xiaoqi Wang, Wenbin He, Xiwei Xuan, Clint Sebastian, Jorge Piazentin Ono, Xin Li, Sima Behpour, Thang Doan, Liang Gou, Han Wei Shen, Liu Ren

    Abstract: The open-vocabulary image segmentation task involves partitioning images into semantically meaningful segments and classifying them with flexible text-defined categories. The recent vision-based foundation models such as the Segment Anything Model (SAM) have shown superior performance in generating class-agnostic image segments. The main challenge in open-vocabulary image segmentation now lies in… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  20. arXiv:2406.04151  [pdf, other]

    cs.AI cs.CL

    AgentGym: Evolving Large Language Model-based Agents across Diverse Environments

    Authors: Zhiheng Xi, Yiwen Ding, Wenxiang Chen, Boyang Hong, Honglin Guo, Junzhe Wang, Dingwen Yang, Chenyang Liao, Xin Guo, Wei He, Songyang Gao, Lu Chen, Rui Zheng, Yicheng Zou, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Zuxuan Wu, Yu-Gang Jiang

    Abstract: Building generalist agents that can handle diverse tasks and evolve themselves across different environments is a long-term goal in the AI community. Large language models (LLMs) are considered a promising foundation to build such agents due to their generalized capabilities. Current approaches either have LLM-based agents imitate expert-provided trajectories step-by-step, requiring human supervis… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Project site: https://agentgym.github.io

  21. arXiv:2406.01984  [pdf, other]

    cond-mat.dis-nn

    Unified one-parameter scaling function for Anderson localization transitions in non-reciprocal non-Hermitian systems

    Authors: C. Wang, Wenxue He, X. R. Wang, Hechen Ren

    Abstract: By using dimensionless conductances as scaling variables, the conventional one-parameter scaling theory of localization fails for non-reciprocal non-Hermitian systems such as the Hatano-Nelson model. Here, we propose a one-parameter scaling function using the participation ratio as the scaling variable. Employing a highly accurate numerical procedure based on exact diagonalization, we demonstrate… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 6 pages, 2 figures

  22. arXiv:2406.01060  [pdf, other]

    quant-ph

    Mechanical dynamics around higher-order exceptional point in magno-optomechanics

    Authors: Wen-Di He, Xiao-Hong Fan, Ming-Yue Liu, Guo-Qiang Zhang, Hai-Chao Li, Wei Xiong

    Abstract: We theoretically study diverse exceptional points (EPs) in an experimentally feasible magno-optomechanics consisting of an optomechanical subsystem coupled to a magnomechanical subsystem via physically direct contact. By adiabatically eliminating both the cavity and the Kittel mode, dissipative and parity-time symmetric exceptional points can be observed. When only the cavity mode is eliminated, a… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 6 pages, 5 figures

  23. arXiv:2405.20046  [pdf, other]

    cs.AI

    Cross-Training with Multi-View Knowledge Fusion for Heterogenous Federated Learning

    Authors: Zhuang Qi, Lei Meng, Weihao He, Ruohan Zhang, Yu Wang, Xin Qi, Xiangxu Meng

    Abstract: Federated learning benefits from cross-training strategies, which enable models to train on data from distinct sources to improve the generalization capability. However, the data heterogeneity between sources may lead models to gradually forget previously acquired knowledge when undergoing cross-training to adapt to new tasks or data sources. We argue that integrating personalized and global know… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  24. arXiv:2405.19245  [pdf, ps, other]

    quant-ph math.OC

    Efficient Optimal Control of Open Quantum Systems

    Authors: Wenhao He, Tongyang Li, Xiantao Li, Zecheng Li, Chunhao Wang, Ke Wang

    Abstract: The optimal control problem for open quantum systems can be formulated as a time-dependent Lindbladian that is parameterized by a number of time-dependent control variables. Given an observable and an initial state, the goal is to tune the control variables so that the expected value of some observable with respect to the final state is maximized. In this paper, we present algorithms for solving t… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 52 pages. To appear in the proceedings of TQC 2024

  25. arXiv:2405.17915  [pdf, other]

    cs.CL

    Long Context is Not Long at All: A Prospector of Long-Dependency Data for Large Language Models

    Authors: Longze Chen, Ziqiang Liu, Wanwei He, Yunshui Li, Run Luo, Min Yang

    Abstract: Long-context modeling capabilities are important for large language models (LLMs) in various applications. However, directly training LLMs with long context windows is insufficient to enhance this capability since some training samples do not exhibit strong semantic dependencies across long contexts. In this study, we propose a data mining framework ProLong that can assign each training s… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 13 pages, 5 figures, ACL 2024

  26. arXiv:2405.17792  [pdf, other]

    hep-ex hep-ph

    JUNO Sensitivity to Invisible Decay Modes of Neutrons

    Authors: JUNO Collaboration, Angel Abusleme, Thomas Adam, Kai Adamowicz, Shakeel Ahmad, Rizwan Ahmed, Sebastiano Aiello, Fengpeng An, Qi An, Giuseppe Andronico, Nikolay Anfimov, Vito Antonelli, Tatiana Antoshkina, João Pedro Athayde Marcondes de André, Didier Auguste, Weidong Bai, Nikita Balashov, Wander Baldini, Andrea Barresi, Davide Basilico, Eric Baussan, Marco Bellato, Marco Beretta, Antonio Bergnoli, Daniel Bick , et al. (635 additional authors not shown)

    Abstract: We explore the decay of bound neutrons into invisible particles (e.g., $n\rightarrow 3 ν$ or $nn \rightarrow 2 ν$) in the JUNO liquid scintillator detector. The invisible decay includes two decay modes: $ n \rightarrow { inv} $ and $ nn \rightarrow { inv} $. The invisible decays of $s$-shell neutrons in $^{12}{\rm C}$ will leave a highly excited residual nucleus. Subsequently, some de-excitation mode… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 28 pages, 7 figures, 4 tables

  27. arXiv:2405.17790  [pdf, other]

    cs.CV

    Instruct-ReID++: Towards Universal Purpose Instruction-Guided Person Re-identification

    Authors: Weizhen He, Yiheng Deng, Yunfeng Yan, Feng Zhu, Yizhou Wang, Lei Bai, Qingsong Xie, Donglian Qi, Wanli Ouyang, Shixiang Tang

    Abstract: Human intelligence can retrieve any person according to both visual and language descriptions. However, the current computer vision community studies specific person re-identification (ReID) tasks in different scenarios separately, which limits the applications in the real world. This paper strives to resolve this problem by proposing a novel instruct-ReID task that requires the model to retrieve… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2306.07520

  28. arXiv:2405.17470  [pdf, other]

    cs.LG cs.AI cs.CL

    Athena: Efficient Block-Wise Post-Training Quantization for Large Language Models Using Second-Order Matrix Derivative Information

    Authors: Yanshu Wang, Wenyang He, Tong Yang

    Abstract: Large Language Models (LLMs) have significantly advanced natural language processing tasks such as machine translation, text generation, and sentiment analysis. However, their large size, often consisting of billions of parameters, poses challenges for storage, computation, and deployment, particularly in resource-constrained environments like mobile devices and edge computing platforms. Effective… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  29. arXiv:2405.17459  [pdf]

    cs.LG cs.AI cs.CL cs.CV

    Integrating Medical Imaging and Clinical Reports Using Multimodal Deep Learning for Advanced Disease Analysis

    Authors: Ziyan Yao, Fei Lin, Sheng Chai, Weijie He, Lu Dai, Xinghui Fei

    Abstract: In this paper, an innovative multi-modal deep learning model is proposed to deeply integrate heterogeneous information from medical images and clinical reports. First, for medical images, convolutional neural networks were used to extract high-dimensional features and capture key visual information such as focal details, texture and spatial distribution. Secondly, for clinical report text, a two-w… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  30. arXiv:2405.15232  [pdf, other]

    cs.CV cs.CL

    DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception

    Authors: Run Luo, Yunshui Li, Longze Chen, Wanwei He, Ting-En Lin, Ziqiang Liu, Lei Zhang, Zikai Song, Xiaobo Xia, Tongliang Liu, Min Yang, Binyuan Hui

    Abstract: The development of large language models (LLMs) has significantly advanced the emergence of large multimodal models (LMMs). While LMMs have achieved tremendous success by promoting the synergy between multimodal comprehension and creation, they often face challenges when confronted with out-of-distribution data. This is primarily due to their reliance on image encoders trained to encode images int… ▽ More

    Submitted 3 July, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: 25 pages. arXiv admin note: text overlap with arXiv:2401.10208 by other authors

  31. arXiv:2405.14636  [pdf, other]

    cs.DC cs.NI

    PerLLM: Personalized Inference Scheduling with Edge-Cloud Collaboration for Diverse LLM Services

    Authors: Zheming Yang, Yuanhao Yang, Chang Zhao, Qi Guo, Wenkai He, Wen Ji

    Abstract: With the rapid growth in the number of large language model (LLM) users, it is difficult for bandwidth-constrained cloud servers to simultaneously process massive LLM services in real-time. Recently, edge-cloud infrastructures have been used to improve the processing efficiency of large-scale LLM services. However, the diversity of task requirements and the dynamics of resources pose great challen… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  32. arXiv:2405.12229  [pdf, other]

    physics.chem-ph cond-mat.mtrl-sci cs.AI cs.CE physics.comp-ph

    Multi-task learning for molecular electronic structure approaching coupled-cluster accuracy

    Authors: Hao Tang, Brian Xiao, Wenhao He, Pero Subasic, Avetik R. Harutyunyan, Yao Wang, Fang Liu, Haowei Xu, Ju Li

    Abstract: Machine learning (ML) plays an important role in quantum chemistry, providing fast-to-evaluate predictive models for various properties of molecules. However, most existing ML models for molecular electronic properties use density functional theory (DFT) databases as ground truth in training, and their prediction accuracy cannot surpass that of DFT. In this work, we developed a unified ML method f… ▽ More

    Submitted 24 June, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

  33. arXiv:2405.09103  [pdf, ps, other]

    math.PR

    Mean Reflected Backward Stochastic Differential Equations Driven by G-Brownian Motion with Double Constraints

    Authors: Wei He, Hanwu Li

    Abstract: In this paper, we study the backward stochastic differential equations driven by G-Brownian motion with double mean reflections, which means that the constraints are made on the law of the solution. Making full use of the backward Skorokhod problem with two nonlinear reflecting boundaries and the fixed-point theory, the existence and uniqueness of solutions are established. We also consider the ca… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

  34. arXiv:2405.06858  [pdf, other]

    cond-mat.supr-con cond-mat.mes-hall

    Topological Superconductivity in Monolayer T$_{\textrm{d}}$-MoTe$_2$

    Authors: Xin-Zhi Li, Zhen-Bo Qi, Quansheng Wu, Wen-Yu He

    Abstract: Topological superconductivity has attracted significant attention due to its potential applications in quantum computation, but its experimental realization remains challenging. Recently, monolayer T$_{\textrm{d}}$-MoTe$_2$ was observed to exhibit gate tunable superconductivity, and its in-plane upper critical field exceeds the Pauli limit. Here, we show that an in-plane magnetic field beyond the… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: 9 pages, 4 figures, plus Supplementary Material. Comments are welcome

  35. arXiv:2405.05133  [pdf, other]

    cs.CV eess.IV

    Identifying every building's function in large-scale urban areas with multi-modality remote-sensing data

    Authors: Zhuohong Li, Wei He, Jiepan Li, Hongyan Zhang

    Abstract: Buildings, as fundamental man-made structures in urban environments, serve as crucial indicators for understanding various city function zones. Rapid urbanization has raised an urgent need for efficiently surveying building footprints and functions. In this study, we proposed a semi-supervised framework to identify every building's function in large-scale urban areas with multi-modality remote-sen… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: 5 pages, 7 figures, accepted by IGARSS 2024

  36. arXiv:2404.15016  [pdf, ps, other]

    math.DG math.SG

    Convergence of the hypersymplectic flow on $T^4$ with $T^3$-symmetry

    Authors: Joel Fine, Weiyong He, Chengjian Yao

    Abstract: A hypersymplectic structure on a 4-manifold is a triple $ω_1, ω_2, ω_3$ of 2-forms for which every non-trivial linear combination $a^1ω_1 + a^2 ω_2 + a^3 ω_3$ is a symplectic form. Donaldson has conjectured that when the underlying manifold is compact, any such structure is isotopic in its cohomology class to a hyperkähler triple. We prove this conjecture for a hypersymplectic structure on $T^4$ wh… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 25 pages

    MSC Class: 58J35; 53C26; 53D05

  37. arXiv:2404.14648  [pdf, other]

    cs.CC cs.CR math.PR

    Pseudorandom Permutations from Random Reversible Circuits

    Authors: William He, Ryan O'Donnell

    Abstract: We study pseudorandomness properties of permutations on $\{0,1\}^n$ computed by random circuits made from reversible $3$-bit gates (permutations on $\{0,1\}^3$). Our main result is that a random circuit of depth $n \cdot \tilde{O}(k^2)$, with each layer consisting of $\approx n/3$ random gates in a fixed nearest-neighbor architecture, yields almost $k$-wise independent permutations. The main techn… ▽ More

    Submitted 3 July, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: v2: added references and comparison to subsequent work, removed claim in previous Section 7.3 with error in proof

  38. arXiv:2404.14233  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback

    Authors: Wenyi Xiao, Ziwei Huang, Leilei Gan, Wanggui He, Haoyuan Li, Zhelun Yu, Hao Jiang, Fei Wu, Linchao Zhu

    Abstract: The rapidly developing Large Vision Language Models (LVLMs) have shown notable capabilities on a range of multi-modal tasks, but still face the hallucination phenomena where the generated texts do not align with the given contexts, significantly restricting the usages of LVLMs. Most previous work detects and mitigates hallucination at the coarse-grained level or requires expensive annotation (e.g.… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  39. Magnetically propagating Hund's exciton in van der Waals antiferromagnet NiPS3

    Authors: W. He, Y. Shen, K. Wohlfeld, J. Sears, J. Li, J. Pelliciari, M. Walicki, S. Johnston, E. Baldini, V. Bisogni, M. Mitrano, M. P. M. Dean

    Abstract: Magnetic van der Waals (vdW) materials have opened new frontiers for realizing novel many-body phenomena. Recently NiPS3 has received intense interest since it hosts an excitonic quasiparticle whose properties appear to be intimately linked to the magnetic state of the lattice. Despite extensive studies, the electronic character, mobility, and magnetic interactions of the exciton remain unresolved… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: 11 pages, accepted in Nature Communications

    Journal ref: Nature Communications 15, 3496 (2024)

  40. arXiv:2404.07496  [pdf, other]

    nucl-th nucl-ex

    Multifractal Dimension Spectrum Analysis for Nuclear Density Distribution

    Authors: Weihu Ma, Yu-Gang Ma, Wanbing He, Bo Zhou

    Abstract: We present an integral density method for calculating the multifractal dimension spectrum for the nucleon distribution in atomic nuclei. This method is then applied to analyze the non-uniformity of the density distribution in several typical types of nuclear matter distributions, including the Woods-Saxon distribution, the halo structure and the tetrahedral $α$ clustering. The subsequent discussio… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: 9 pages, 6 figures

  41. arXiv:2404.06852  [pdf, other]

    cs.SE

    Research Artifacts in Software Engineering Publications: Status and Trends

    Authors: Mugeng Liu, Xiaolong Huang, Wei He, Yibing Xie, Jie M. Zhang, Xiang Jing, Zhenpeng Chen, Yun Ma

    Abstract: The Software Engineering (SE) community has been embracing the open science policy and encouraging researchers to disclose artifacts in their publications. However, the status and trends of artifact practice and quality remain unclear, lacking insights on further improvement. In this paper, we present an empirical study to characterize the research artifacts in SE publications. Specifically, we ma… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: Accepted by Journal of Systems and Software (JSS 2024). Please include JSS in any citations

  42. arXiv:2404.05850  [pdf, other]

    cond-mat.str-el quant-ph

    Witnessing Quantum Entanglement Using Resonant Inelastic X-ray Scattering

    Authors: Tianhao Ren, Yao Shen, Sophia F. R. TenHuisen, Jennifer Sears, Wei He, Mary H. Upton, Diego Casa, Petra Becker, Matteo Mitrano, Mark P. M. Dean, Robert M. Konik

    Abstract: Although entanglement is both a central ingredient in our understanding of quantum many-body systems and an essential resource for quantum technologies, we only have a limited ability to quantify entanglement in real quantum materials. Thus far, entanglement metrology in quantum materials has been limited to measurements involving Hermitian operators, such as the detection of spin entanglement usi…

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: 10 pages, 8 figures

  43. arXiv:2404.04697  [pdf, ps, other]

    stat.ME

    Q-learning in Dynamic Treatment Regimes with Misclassified Binary Outcome

    Authors: Dan Liu, Wenqing He

    Abstract: The study of precision medicine involves dynamic treatment regimes (DTRs), which are sequences of treatment decision rules recommended by taking patient-level information as input. The primary goal of the DTR study is to identify an optimal DTR, a sequence of treatment decision rules that leads to the best expected clinical outcome. Statistical methods have been developed in recent years to estima…

    Submitted 6 April, 2024; originally announced April 2024.

  44. arXiv:2404.04696  [pdf, ps, other]

    stat.ME

    Dynamic Treatment Regimes with Replicated Observations Available for Error-prone Covariates: a Q-learning Approach

    Authors: Dan Liu, Wenqing He

    Abstract: Dynamic treatment regimes (DTRs) have received an increasing interest in recent years. DTRs are sequences of treatment decision rules tailored to patient-level information. The main goal of the DTR study is to identify an optimal DTR, a sequence of treatment decision rules that yields the best expected clinical outcome. Q-learning has been considered as one of the most popular regression-based met…

    Submitted 6 April, 2024; originally announced April 2024.

  45. arXiv:2404.01192  [pdf, other]

    eess.IV cs.CV

    iMD4GC: Incomplete Multimodal Data Integration to Advance Precise Treatment Response Prediction and Survival Analysis for Gastric Cancer

    Authors: Fengtao Zhou, Yingxue Xu, Yanfen Cui, Shenyan Zhang, Yun Zhu, Weiyang He, Jiguang Wang, Xin Wang, Ronald Chan, Louis Ho Shing Lau, Chu Han, Dafu Zhang, Zhenhui Li, Hao Chen

    Abstract: Gastric cancer (GC) is a prevalent malignancy worldwide, ranking as the fifth most common cancer with over 1 million new cases and 700 thousand deaths in 2020. Locally advanced gastric cancer (LAGC) accounts for approximately two-thirds of GC diagnoses, and neoadjuvant chemotherapy (NACT) has emerged as the standard treatment for LAGC. However, the effectiveness of NACT varies significantly among…

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: 27 pages, 9 figures, 3 tables (under review)

  46. arXiv:2404.00884  [pdf, other]

    cs.CL cs.AI

    Self-Demos: Eliciting Out-of-Demonstration Generalizability in Large Language Models

    Authors: Wei He, Shichun Liu, Jun Zhao, Yiwen Ding, Yi Lu, Zhiheng Xi, Tao Gui, Qi Zhang, Xuanjing Huang

    Abstract: Large language models (LLMs) have shown promising abilities of in-context learning (ICL), adapting swiftly to new tasks with only few-shot demonstrations. However, current few-shot methods heavily depend on high-quality, query-specific demos, which are often lacking. When faced with out-of-demonstration (OOD) queries, methods that rely on hand-crafted demos or external retrievers might fail. To br…

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: Accepted to NAACL 2024 Findings

  47. arXiv:2404.00845  [pdf]

    cond-mat.mtrl-sci

    Harnessing Interlayer Magnetic Coupling for Efficient, Field-Free Current-Induced Magnetization Switching in a Magnetic Insulator

    Authors: Leran Wang, Alejandro O. Leon, Wenqing He, Zhongyu Liang, Xiaohan Li, Xiaoxiao Fang, Wenyun Yang, Licong Peng, Jinbo Yang, Caihua Wan, Gerrit E. W. Bauer, Zhaochu Luo

    Abstract: Owing to the unique features of low Gilbert damping, long spin-diffusion lengths and zero Ohmic losses, magnetic insulators are promising candidate materials for next-generation spintronic applications. However, due to the localized magnetic moments and the complex metal-oxide interface between magnetic insulators and heavy metals, spin-functional Dzyaloshinskii-Moriya interactions or spin Hall an…

    Submitted 31 March, 2024; originally announced April 2024.

  48. arXiv:2403.18373  [pdf, other]

    cs.CV

    BAM: Box Abstraction Monitors for Real-time OoD Detection in Object Detection

    Authors: Changshun Wu, Weicheng He, Chih-Hong Cheng, Xiaowei Huang, Saddek Bensalem

    Abstract: Out-of-distribution (OoD) detection techniques for deep neural networks (DNNs) become crucial thanks to their filtering of abnormal inputs, especially when DNNs are used in safety-critical applications and interact with an open and dynamic environment. Nevertheless, integrating OoD detection into state-of-the-art (SOTA) object detection DNNs poses significant challenges, partly due to the complexi…

    Submitted 27 March, 2024; originally announced March 2024.

  49. arXiv:2403.14332  [pdf, ps, other]

    cs.DS cs.CR cs.LG

    A Differentially Private Clustering Algorithm for Well-Clustered Graphs

    Authors: Weiqiang He, Hendrik Fichtenberger, Pan Peng

    Abstract: We study differentially private (DP) algorithms for recovering clusters in well-clustered graphs, which are graphs whose vertex set can be partitioned into a small number of sets, each inducing a subgraph of high inner conductance and small outer conductance. Such graphs have widespread application as a benchmark in the theoretical analysis of spectral clustering. We provide an efficient ($ε$,$δ$)…

    Submitted 21 March, 2024; originally announced March 2024.

  50. arXiv:2403.13427  [pdf]

    cond-mat.mtrl-sci

    Observation of non-volatile anomalous Nernst effect in altermagnet with collinear Néel vector

    Authors: Lei Han, Xizhi Fu, Wenqing He, Yuxiang Zhu, Jiankun Dai, Wenfeng Yang, Wenxuan Zhu, Hua Bai, Chong Chen, Caihua Wan, Xiufeng Han, Cheng Song, Junwei Liu, Feng Pan

    Abstract: Anomalous Nernst effect (ANE), a widely investigated transverse thermoelectric effect that converts waste heat into electrical energy with remarkable flexibility and integration capability, has been extended to antiferromagnets with non-collinear spin texture recently. ANE in compensated magnet with collinear Néel vector will bring more opportunities to construct magnetic-field-immune and ultrafas…

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: 25 pages, 4 figures