
Showing 1–50 of 790 results for author: Wen, L

  1. arXiv:2407.10173  [pdf, other]

    cs.DC

    StatuScale: Status-aware and Elastic Scaling Strategy for Microservice Applications

    Authors: Linfeng Wen, Minxian Xu, Sukhpal Singh Gill, Muhammad Hafizhuddin Hilman, Satish Narayana Srirama, Kejiang Ye, Chengzhong Xu

    Abstract: Microservice architecture has transformed traditional monolithic applications into lightweight components. Scaling these lightweight microservices is more efficient than scaling servers. However, scaling microservices still faces the challenges resulting from unexpected spikes or bursts of requests, which are difficult to detect and can degrade performance instantaneously. To address this chall…

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: 26 pages

    Journal ref: ACM Transactions on Autonomous and Adaptive Systems, 2024

  2. arXiv:2407.05688  [pdf]

    cs.CV cs.AI

    Learning with Alignments: Tackling the Inter- and Intra-domain Shifts for Cross-multidomain Facial Expression Recognition

    Authors: Yuxiang Yang, Lu Wen, Xinyi Zeng, Yuanyuan Xu, Xi Wu, Jiliu Zhou, Yan Wang

    Abstract: Facial Expression Recognition (FER) holds significant importance in human-computer interactions. Existing cross-domain FER methods often transfer knowledge solely from a single labeled source domain to an unlabeled target domain, neglecting the comprehensive information across multiple sources. Nevertheless, cross-multidomain FER (CMFER) is very challenging for (i) the inherent inter-domain shifts…

    Submitted 8 July, 2024; originally announced July 2024.

  3. arXiv:2407.00285  [pdf, other]

    physics.atom-ph hep-ex nucl-ex

    Imaging of single barium atoms in a second matrix site in solid xenon for barium tagging in a $^{136}$Xe double beta decay experiment

    Authors: M. Yvaine, D. Fairbank, J. Soderstrom, C. Taylor, J. Stanley, T. Walton, C. Chambers, A. Iverson, W. Fairbank, S. Al Kharusi, A. Amy, E. Angelico, A. Anker, I. J. Arnquist, A. Atencio, J. Bane, V. Belov, E. P. Bernard, T. Bhatta, A. Bolotnikov, J. Breslin, P. A. Breur, J. P. Brodsky, E. Brown, T. Brunner, et al. (112 additional authors not shown)

    Abstract: Neutrinoless double beta decay is one of the most sensitive probes for new physics beyond the Standard Model of particle physics. One of the isotopes under investigation is $^{136}$Xe, which would double beta decay into $^{136}$Ba. Detecting the single $^{136}$Ba daughter provides a sort of ultimate tool in the discrimination against backgrounds. Previous work demonstrated the ability to perform s…

    Submitted 28 June, 2024; originally announced July 2024.

    Comments: 9 pages, 8 figures

  4. arXiv:2406.10484  [pdf, other]

    cs.CV

    Beyond Raw Videos: Understanding Edited Videos with Large Multimodal Model

    Authors: Lu Xu, Sijie Zhu, Chunyuan Li, Chia-Wen Kuo, Fan Chen, Xinyao Wang, Guang Chen, Dawei Du, Ye Yuan, Longyin Wen

    Abstract: The emerging video LMMs (Large Multimodal Models) have achieved significant improvements on generic video understanding in the form of VQA (Visual Question Answering), where the raw videos are captured by cameras. However, a large portion of videos in real-world applications are edited videos, e.g., users usually cut and add effects/modifications to the raw video before publishing it on s…

    Submitted 14 June, 2024; originally announced June 2024.

  5. arXiv:2406.08418  [pdf, other]

    cs.CV cs.AI

    OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

    Authors: Qingyun Li, Zhe Chen, Weiyun Wang, Wenhai Wang, Shenglong Ye, Zhenjiang Jin, Guanzhou Chen, Yinan He, Zhangwei Gao, Erfei Cui, Jiashuo Yu, Hao Tian, Jiasheng Zhou, Chao Xu, Bin Wang, Xingjian Wei, Wei Li, Wenjian Zhang, Bo Zhang, Pinlong Cai, Licheng Wen, Xiangchao Yan, Zhenxiang Li, Pei Chu, Yi Wang, et al. (15 additional authors not shown)

    Abstract: Image-text interleaved data, consisting of multiple images and texts arranged in a natural document format, aligns with the presentation paradigm of internet data and closely resembles human reading habits. Recent studies have shown that such data aids multimodal in-context learning and maintains the capabilities of large language models during multimodal fine-tuning. However, the limited scale an…

    Submitted 12 July, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

  6. arXiv:2406.07444  [pdf, other]

    cs.CL

    On the Robustness of Document-Level Relation Extraction Models to Entity Name Variations

    Authors: Shiao Meng, Xuming Hu, Aiwei Liu, Fukun Ma, Yawen Yang, Shuang Li, Lijie Wen

    Abstract: Driven by the demand for cross-sentence and large-scale relation extraction, document-level relation extraction (DocRE) has attracted increasing research interest. Despite the continuous improvement in performance, we find that existing DocRE models which initially perform well may make more mistakes when merely changing the entity names in the document, hindering the generalization to novel entit…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024 Findings

    MSC Class: 68T50 ACM Class: I.2.7

  7. arXiv:2406.01007  [pdf, other]

    hep-ex

    Measurement of Electron Antineutrino Oscillation Amplitude and Frequency via Neutron Capture on Hydrogen at Daya Bay

    Authors: Daya Bay collaboration, F. P. An, W. D. Bai, A. B. Balantekin, M. Bishai, S. Blyth, G. F. Cao, J. Cao, J. F. Chang, Y. Chang, H. S. Chen, H. Y. Chen, S. M. Chen, Y. Chen, Y. X. Chen, Z. Y. Chen, J. Cheng, J. Cheng, Y. -C. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, J. P. Cummings, O. Dalager, F. S. Deng, et al. (177 additional authors not shown)

    Abstract: This Letter reports the first measurement of the oscillation amplitude and frequency of reactor antineutrinos at Daya Bay via neutron capture on hydrogen using 1958 days of data. With over 3.6 million signal candidates, an optimized candidate selection, improved treatment of backgrounds and efficiencies, refined energy calibration, and an energy response model for the capture-on-hydrogen sensitive…

    Submitted 3 June, 2024; originally announced June 2024.

  8. arXiv:2406.00652  [pdf, other]

    astro-ph.CO

    A catalogue of merging clusters of galaxies: cluster partners, merging subclusters, and post-collision clusters

    Authors: Z. L. Wen, J. L. Han, Z. S. Yuan

    Abstract: Clusters of galaxies are merging during the formation of large-scale structures in the Universe. Based on optical survey data, we identify a large sample of pre-mergers of galaxy clusters and merging subclusters in rich clusters. We find 39,382 partners within a velocity difference of 1500 km/s and a projected separation of $5r_{500}$ around 33,126 main clusters, where $r_{500}$ is the radius of the m…

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: 38 pages, 13 figures, 3 tables, re-submitted to MNRAS after referee's comments being considered

  9. arXiv:2406.00415  [pdf, other]

    cs.AI

    Neural Combinatorial Optimization Algorithms for Solving Vehicle Routing Problems: A Comprehensive Survey with Perspectives

    Authors: Xuan Wu, Di Wang, Lijie Wen, Yubin Xiao, Chunguo Wu, Yuesong Wu, Chaoyu Yu, Douglas L. Maskell, You Zhou

    Abstract: Although several surveys on Neural Combinatorial Optimization (NCO) solvers specifically designed to solve Vehicle Routing Problems (VRPs) have been conducted, these existing surveys did not cover the state-of-the-art (SOTA) NCO solvers that emerged recently. More importantly, to provide a comprehensive taxonomy of NCO solvers with up-to-date coverage, based on our thorough review of relevant publicati…

    Submitted 1 June, 2024; originally announced June 2024.

  10. arXiv:2405.19419  [pdf, other]

    hep-ph hep-ex nucl-th

    Supernova Electron-Neutrino Interactions with Xenon in the nEXO Detector

    Authors: nEXO Collaboration, S. Hedges, S. Al Kharusi, E. Angelico, J. P. Brodsky, G. Richardson, S. Wilde, A. Amy, A. Anker, I. J. Arnquist, P. Arsenault, A. Atencio, I. Badhrees, J. Bane, V. Belov, E. P. Bernard, T. Bhatta, A. Bolotnikov, J. Breslin, P. A. Breur, E. Brown, T. Brunner, E. Caden, G. F. Cao, L. Q. Cao, et al. (121 additional authors not shown)

    Abstract: Electron-neutrino charged-current interactions with xenon nuclei were modeled in the nEXO neutrinoless double-beta decay detector (~5-tonne, 90% ${}^{136}$Xe, 10% ${}^{134}$Xe) to evaluate its sensitivity to supernova neutrinos. Predictions for event rates and detectable signatures were modeled using the MARLEY event generator. We find good agreement between MARLEY's predictions and existing theor…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 17 pages, 16 figures

    Report number: LLNL-JRNL-864783-DRAFT

  11. arXiv:2405.18720  [pdf, other]

    physics.ins-det hep-ex

    Machine-Learning based photon counting for PMT waveforms and its application to the improvement of the energy resolution in large liquid scintillator detectors

    Authors: Wei Jiang, Guihong Huang, Zhen Liu, Wuming Luo, Liangjian Wen, Jianyi Luo

    Abstract: Photomultiplier tubes (PMTs) are widely used in particle experiments for photon detection. PMT waveform analysis is crucial for high-precision measurement of the position and energy of incident particles in liquid scintillator (LS) detectors. A key factor contributing to the energy resolution in large liquid scintillator detectors with PMTs is the charge smearing of PMTs. This paper presents a mac…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 10 pages, 9 figures

  12. arXiv:2405.17792  [pdf, other]

    hep-ex hep-ph

    JUNO Sensitivity to Invisible Decay Modes of Neutrons

    Authors: JUNO Collaboration, Angel Abusleme, Thomas Adam, Kai Adamowicz, Shakeel Ahmad, Rizwan Ahmed, Sebastiano Aiello, Fengpeng An, Qi An, Giuseppe Andronico, Nikolay Anfimov, Vito Antonelli, Tatiana Antoshkina, João Pedro Athayde Marcondes de André, Didier Auguste, Weidong Bai, Nikita Balashov, Wander Baldini, Andrea Barresi, Davide Basilico, Eric Baussan, Marco Bellato, Marco Beretta, Antonio Bergnoli, Daniel Bick, et al. (635 additional authors not shown)

    Abstract: We explore the decay of bound neutrons into invisible particles (e.g., $n\rightarrow 3 ν$ or $nn \rightarrow 2 ν$) in the JUNO liquid scintillator detector. The invisible decay includes two decay modes: $n \rightarrow {\rm inv}$ and $nn \rightarrow {\rm inv}$. The invisible decays of $s$-shell neutrons in $^{12}{\rm C}$ will leave a highly excited residual nucleus. Subsequently, some de-excitation mode…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 28 pages, 7 figures, 4 tables

  13. arXiv:2405.15324  [pdf, other]

    cs.RO cs.AI cs.CV

    Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving

    Authors: Jianbiao Mei, Yukai Ma, Xuemeng Yang, Licheng Wen, Xinyu Cai, Xin Li, Daocheng Fu, Bo Zhang, Pinlong Cai, Min Dou, Botian Shi, Liang He, Yong Liu, Yu Qiao

    Abstract: Autonomous driving has advanced significantly due to sensors, machine learning, and artificial intelligence improvements. However, prevailing methods struggle with intricate scenarios and causal relationships, hindering adaptability and interpretability in varied environments. To address the above problems, we introduce LeapAD, a novel paradigm for autonomous driving inspired by the human cognitiv…

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 23 pages, 16 figures

  14. arXiv:2405.12635  [pdf, other]

    cs.DC

    TempoScale: A Cloud Workloads Prediction Approach Integrating Short-Term and Long-Term Information

    Authors: Linfeng Wen, Minxian Xu, Adel N. Toosi, Kejiang Ye

    Abstract: Cloud native solutions are widely applied in various fields, placing higher demands on the efficient management and utilization of resource platforms. To achieve the efficiency, load forecasting and elastic scaling have become crucial technologies for dynamically adjusting cloud resources to meet user demands and minimizing resource waste. However, existing prediction-based methods lack comprehens…

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 11 pages, 11 figures, 4 tables

    Journal ref: In proceedings of IEEE CLOUD 2024

  15. arXiv:2405.10051  [pdf, other]

    cs.CR cs.CL

    MarkLLM: An Open-Source Toolkit for LLM Watermarking

    Authors: Leyi Pan, Aiwei Liu, Zhiwei He, Zitian Gao, Xuandong Zhao, Yijian Lu, Binglin Zhou, Shuliang Liu, Xuming Hu, Lijie Wen, Irwin King

    Abstract: LLM watermarking, which embeds imperceptible yet algorithmically detectable signals in model outputs to identify LLM-generated text, has become crucial in mitigating the potential misuse of large language models. However, the abundance of LLM watermarking algorithms, their intricate mechanisms, and the complex evaluation procedures and perspectives pose challenges for researchers and the community…

    Submitted 24 May, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

    Comments: 16 pages, 5 figures, 6 tables

    MSC Class: 68T50 ACM Class: I.2.7

  16. arXiv:2405.06326  [pdf, other]

    physics.ins-det nucl-ex

    A practical approach of measuring $^{238}$U and $^{232}$Th in liquid scintillator to sub-ppq level using ICP-MS

    Authors: Yuanxia Li, Jie Zhao, Yayun Ding, Tao Hu, Jiaxuan Ye, Jian Fang, Liangjian Wen

    Abstract: Liquid scintillator (LS) is commonly utilized in experiments seeking rare events due to its high light yield, transparency, and radiopurity. The concentration of $^{238}$U and $^{232}$Th in LS consistently remains below 1 ppq (10$^{-15}$ g/g), and the current screening result is based on a minimum 20-ton detector. Inductively coupled plasma mass (ICP-MS) spectroscopy is well-regarded for its high…

    Submitted 10 May, 2024; originally announced May 2024.

  17. arXiv:2405.05949  [pdf, other]

    cs.CV

    CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts

    Authors: Jiachen Li, Xinyao Wang, Sijie Zhu, Chia-Wen Kuo, Lu Xu, Fan Chen, Jitesh Jain, Humphrey Shi, Longyin Wen

    Abstract: Recent advancements in Multimodal Large Language Models (LLMs) have focused primarily on scaling by increasing text-image pair data and enhancing LLMs to improve performance on multimodal tasks. However, these scaling approaches are computationally expensive and overlook the significance of improving model capabilities from the vision side. Inspired by the successful applications of Mixture-of-Exp…

    Submitted 9 May, 2024; originally announced May 2024.

  18. arXiv:2405.04149  [pdf, ps, other]

    cond-mat.quant-gas

    Ground-state properties of dipolar Bose-Einstein condensates with spin-orbit coupling and quantum fluctuations

    Authors: Xianghua Su, Wenting Dai, Tianyu Li, Jiyuan Wang, Linghua Wen

    Abstract: We study the ground-state properties of dipolar spin-1/2 Bose-Einstein condensates with quantum fluctuations and Rashba spin-orbit coupling (SOC). The combined effects of dipole-dipole interaction (DDI), SOC, and Lee-Huang-Yang (LHY) correction induced by quantum fluctuations on the ground-state structures and spin textures of the system are analyzed and discussed. For the nonrotating case and fix…

    Submitted 8 May, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

    Comments: 11 pages, 7 figures, accepted for publication in Chaos, Solitons and Fractals

    Journal ref: Chaos, Solitons and Fractals 184, 114979 (2024)

  19. arXiv:2404.16821  [pdf, other]

    cs.CV

    How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites

    Authors: Zhe Chen, Weiyun Wang, Hao Tian, Shenglong Ye, Zhangwei Gao, Erfei Cui, Wenwen Tong, Kongzhi Hu, Jiapeng Luo, Zheng Ma, Ji Ma, Jiaqi Wang, Xiaoyi Dong, Hang Yan, Hewei Guo, Conghui He, Botian Shi, Zhenjiang Jin, Chao Xu, Bin Wang, Xingjian Wei, Wei Li, Wenjian Zhang, Bo Zhang, Pinlong Cai, et al. (10 additional authors not shown)

    Abstract: In this report, we introduce InternVL 1.5, an open-source multimodal large language model (MLLM) to bridge the capability gap between open-source and proprietary commercial models in multimodal understanding. We introduce three simple improvements: (1) Strong Vision Encoder: we explored a continuous learning strategy for the large-scale vision foundation model -- InternViT-6B, boosting its visual…

    Submitted 29 April, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: Technical report

  20. arXiv:2404.14696  [pdf]

    cs.CV

    Adaptive Prompt Learning with Negative Textual Semantics and Uncertainty Modeling for Universal Multi-Source Domain Adaptation

    Authors: Yuxiang Yang, Lu Wen, Yuanyuan Xu, Jiliu Zhou, Yan Wang

    Abstract: Universal Multi-source Domain Adaptation (UniMDA) transfers knowledge from multiple labeled source domains to an unlabeled target domain under domain shifts (different data distribution) and class shifts (unknown target classes). Existing solutions focus on excavating image features to detect unknown samples, ignoring abundant information contained in textual semantics. In this paper, we propose a…

    Submitted 23 April, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: Accepted by ICME2024

  21. arXiv:2404.12753  [pdf, other]

    cs.CL cs.AI

    AutoCrawler: A Progressive Understanding Web Agent for Web Crawler Generation

    Authors: Wenhao Huang, Chenghao Peng, Zhixu Li, Jiaqing Liang, Yanghua Xiao, Liqian Wen, Zulong Chen

    Abstract: Web automation is a significant technique that accomplishes complicated web tasks by automating common web actions, enhancing operational efficiency, and reducing the need for manual intervention. Traditional methods, such as wrappers, suffer from limited adaptability and scalability when faced with a new website. On the other hand, generative agents empowered by large language models (LLMs) exhib…

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: 18 pages, 5 figures

  22. arXiv:2404.12683  [pdf, other]

    cs.RO

    A Containerized Microservice Architecture for a ROS 2 Autonomous Driving Software: An End-to-End Latency Evaluation

    Authors: Tobias Betz, Long Wen, Fengjunjie Pan, Gemb Kaljavesi, Alexander Zuepke, Andrea Bastoni, Marco Caccamo, Alois Knoll, Johannes Betz

    Abstract: The automotive industry is transitioning from traditional ECU-based systems to software-defined vehicles. A central role of this revolution is played by containers, lightweight virtualization technologies that enable the flexible consolidation of complex software applications on a common hardware platform. Despite their widespread adoption, the impact of containerization on fundamental real-time m…

    Submitted 19 April, 2024; originally announced April 2024.

  23. arXiv:2404.07429  [pdf, other]

    hep-ph hep-ex nucl-th

    Neutral-current background induced by atmospheric neutrinos at large liquid-scintillator detectors: III. Quantitative calculations for reactor neutrinos

    Authors: Jie Cheng, Min Li, Yu-Feng Li, Gao-Song Li, Hao-Qi Lu, Liang-Jian Wen

    Abstract: Atmospheric neutrinos contribute significantly to irreducible backgrounds through their neutral-current (NC) interactions with $^{12}$C nuclei in liquid-scintillator detectors, impacting diffuse supernova neutrino background, nucleon decay, and reactor neutrinos. This paper extends our prior work by systematically studying the NC backgrounds towards the MeV region of reactor neutrinos. We employ c…

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 26 pages, 10 figures

  24. arXiv:2404.02002  [pdf, ps, other]

    astro-ph.CO astro-ph.GA

    A catalog of 1.58 million clusters of galaxies identified from the DESI Legacy Imaging Surveys

    Authors: Z. L. Wen, J. L. Han

    Abstract: Based on the DESI Legacy Imaging Surveys released data and available spectroscopic redshifts, we identify 1.58 million clusters of galaxies by searching for the overdensity of stellar mass distribution of galaxies within redshift slices around pre-selected massive galaxies, among which 877,806 clusters are found for the first time. The identified clusters have an equivalent mass of M_{500}> 0.47*1…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 17 pages, 14 figures, 2 tables, re-submitted to ApJS after referee's comments being incorporated

  25. arXiv:2404.01687  [pdf, other]

    hep-ex

    Search for a sub-eV sterile neutrino using Daya Bay's full dataset

    Authors: F. P. An, W. D. Bai, A. B. Balantekin, M. Bishai, S. Blyth, G. F. Cao, J. Cao, J. F. Chang, Y. Chang, H. S. Chen, H. Y. Chen, S. M. Chen, Y. Chen, Y. X. Chen, Z. Y. Chen, J. Cheng, Y. C. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, J. P. Cummings, O. Dalager, F. S. Deng, X. Y. Ding, Y. Y. Ding, et al. (176 additional authors not shown)

    Abstract: This Letter presents results of a search for the mixing of a sub-eV sterile neutrino with three active neutrinos based on the full data sample of the Daya Bay Reactor Neutrino Experiment, collected during 3158 days of detector operation, which contains $5.55 \times 10^{6}$ reactor $\bar{\nu}_e$ candidates identified as inverse beta-decay interactions followed by neutron-capture on gadolinium. The analysis…

    Submitted 15 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 7 pages, 4 figures, 1 table

  26. arXiv:2403.20026  [pdf, other]

    cs.CV cs.CL

    FSMR: A Feature Swapping Multi-modal Reasoning Approach with Joint Textual and Visual Clues

    Authors: Shuang Li, Jiahua Wang, Lijie Wen

    Abstract: Multi-modal reasoning plays a vital role in bridging the gap between textual and visual information, enabling a deeper understanding of the context. This paper presents the Feature Swapping Multi-modal Reasoning (FSMR) model, designed to enhance multi-modal reasoning through feature swapping. FSMR leverages a pre-trained visual-language model as an encoder, accommodating both text and image inputs…

    Submitted 29 March, 2024; originally announced March 2024.

  27. arXiv:2403.19078  [pdf, other]

    cs.CV cs.AI

    MVEB: Self-Supervised Learning with Multi-View Entropy Bottleneck

    Authors: Liangjian Wen, Xiasi Wang, Jianzhuang Liu, Zenglin Xu

    Abstract: Self-supervised learning aims to learn representation that can be effectively generalized to downstream tasks. Many self-supervised approaches regard two views of an image as both the input and the self-supervised signals, assuming that either view contains the same task-relevant information and the shared information is (approximately) sufficient for predicting downstream tasks. Recent studies sh…

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: Accepted by TPAMI

  28. arXiv:2403.16048  [pdf, other]

    cs.CV

    Edit3K: Universal Representation Learning for Video Editing Components

    Authors: Xin Gu, Libo Zhang, Fan Chen, Longyin Wen, Yufei Wang, Tiejian Luo, Sijie Zhu

    Abstract: This paper focuses on understanding the predominant video creation pipeline, i.e., compositional video editing with six main types of editing components, including video effects, animation, transition, filter, sticker, and text. In contrast to existing visual representation learning of visual materials (i.e., images/videos), we aim to learn visual representations of editing actions/components that…

    Submitted 24 March, 2024; originally announced March 2024.

  29. arXiv:2403.12370  [pdf, other]

    cs.CV

    XPose: eXplainable Human Pose Estimation

    Authors: Luyu Qiu, Jianing Li, Lei Wen, Chi Su, Fei Hao, Chen Jason Zhang, Lei Chen

    Abstract: Current approaches in pose estimation primarily concentrate on enhancing model architectures, often overlooking the importance of comprehensively understanding the rationale behind model decisions. In this paper, we propose XPose, a novel framework that incorporates Explainable AI (XAI) principles into pose estimation. This integration aims to elucidate the individual contribution of each keypoint…

    Submitted 18 March, 2024; originally announced March 2024.

  30. arXiv:2403.12077  [pdf, other]

    cs.CL cs.AI cs.IR

    Evaluating Robustness of Generative Search Engine on Adversarial Factual Questions

    Authors: Xuming Hu, Xiaochuan Li, Junzhe Chen, Yinghui Li, Yangning Li, Xiaoguang Li, Yasheng Wang, Qun Liu, Lijie Wen, Philip S. Yu, Zhijiang Guo

    Abstract: Generative search engines have the potential to transform how people seek information online, but generated responses from existing large language models (LLMs)-backed generative search engines may not always be accurate. Nonetheless, retrieval-augmented generation exacerbates safety concerns, since adversaries may successfully evade the entire system by subtly manipulating the most vulnerable par…

    Submitted 25 February, 2024; originally announced March 2024.

    Comments: 21 pages, 7 figures, 4 tables

  31. Dcl-Net: Dual Contrastive Learning Network for Semi-Supervised Multi-Organ Segmentation

    Authors: Lu Wen, Zhenghao Feng, Yun Hou, Peng Wang, Xi Wu, Jiliu Zhou, Yan Wang

    Abstract: Semi-supervised learning (SSL) is a sound measure to relieve the strict demand of abundant annotated datasets, especially for challenging multi-organ segmentation (MoS). However, most existing SSL methods predict pixels in a single image independently, ignoring the relations among images and categories. In this paper, we propose a two-stage Dual Contrastive Learning Network for semi-supervised MoS, which uti…

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: Published at ICASSP 2024

  32. arXiv:2403.03004  [pdf, other]

    astro-ph.CO gr-qc hep-ph

    Ultralight vector dark matter search using data from the KAGRA O3GK run

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, H. Abe, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, et al. (1778 additional authors not shown)

    Abstract: Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we prese…

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 20 pages, 5 figures

    Report number: LIGO-P2300250

  33. arXiv:2403.02574  [pdf, other]

    cs.IR cs.AI cs.CL

    ChatCite: LLM Agent with Human Workflow Guidance for Comparative Literature Summary

    Authors: Yutong Li, Lu Chen, Aiwei Liu, Kai Yu, Lijie Wen

    Abstract: The literature review is an indispensable step in the research process. It provides the benefit of comprehending the research problem and understanding the current research situation while conducting a comparative analysis of prior works. However, literature summary is challenging and time consuming. The previous LLM-based studies on literature review mainly focused on the complete process, includ…

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: 18 pages, 5 figures

    MSC Class: 68T50 ACM Class: I.2.7

  34. arXiv:2403.00869  [pdf, other]

    cs.LG stat.ML

    Enhancing Multivariate Time Series Forecasting with Mutual Information-driven Cross-Variable and Temporal Modeling

    Authors: Shiyi Qi, Liangjian Wen, Yiduo Li, Yuanhang Yang, Zhe Li, Zhongwen Rao, Lujia Pan, Zenglin Xu

    Abstract: Recent advancements have underscored the impact of deep learning techniques on multivariate time series forecasting (MTSF). Generally, these techniques are bifurcated into two categories: Channel-independence and Channel-mixing approaches. Although Channel-independence methods typically yield better results, Channel-mixing could theoretically offer improvements by leveraging inter-variable correla…

    Submitted 29 February, 2024; originally announced March 2024.

  35. arXiv:2403.00510  [pdf, other]

    cs.CL cs.AI

    ROME: Memorization Insights from Text, Logits and Representation

    Authors: Bo Li, Qinghua Zhao, Lijie Wen

    Abstract: Previous works have evaluated memorization by comparing model outputs with training corpora, examining how factors such as data duplication, model size, and prompt length influence memorization. However, analyzing these extensive training corpora is highly time-consuming. To address this challenge, this paper proposes an innovative approach named ROME that bypasses direct processing of the trainin…

    Submitted 16 June, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

    Comments: Submitted to EMNLP, 2024

  36. arXiv:2402.18946  [pdf, other]

    cs.LG eess.SY

    Real-Time Adaptive Safety-Critical Control with Gaussian Processes in High-Order Uncertain Models

    Authors: Yu Zhang, Long Wen, Xiangtong Yao, Zhenshan Bing, Linghuan Kong, Wei He, Alois Knoll

    Abstract: This paper presents an adaptive online learning framework for systems with uncertain parameters to ensure safety-critical control in non-stationary environments. Our approach consists of two phases. The initial phase is centered on a novel sparse Gaussian process (GP) framework. We first integrate a forgetting factor to refine a variational sparse GP algorithm, thus enhancing its adaptability. Sub…

    Submitted 5 March, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

  37. arXiv:2402.16913  [pdf, other]

    cs.LG

    PDETime: Rethinking Long-Term Multivariate Time Series Forecasting from the perspective of partial differential equations

    Authors: Shiyi Qi, Zenglin Xu, Yiduo Li, Liangjian Wen, Qingsong Wen, Qifan Wang, Yuan Qi

    Abstract: Recent advancements in deep learning have led to the development of various models for long-term multivariate time-series forecasting (LMTF), many of which have shown promising results. Generally, the focus has been on historical-value-based models, which rely on past observations to predict future series. Notably, a new trend has emerged with time-index-based models, offering a more nuanced under…

    Submitted 25 February, 2024; originally announced February 2024.

  38. arXiv:2402.16499  [pdf, other]

    cs.CL

    LLMArena: Assessing Capabilities of Large Language Models in Dynamic Multi-Agent Environments

    Authors: Junzhe Chen, Xuming Hu, Shuodi Liu, Shiyu Huang, Wei-Wei Tu, Zhaofeng He, Lijie Wen

    Abstract: Recent advancements in large language models (LLMs) have revealed their potential for achieving autonomous agents possessing human-level intelligence. However, existing benchmarks for evaluating LLM Agents either use static datasets, potentially leading to data leakage, or focus only on single-agent scenarios, overlooking the complexities of multi-agent interactions. There is a lack of a benchmark…

    Submitted 26 February, 2024; originally announced February 2024.

  39. arXiv:2402.16449  [pdf, other]

    cs.RO cs.AI

    Online Efficient Safety-Critical Control for Mobile Robots in Unknown Dynamic Multi-Obstacle Environments

    Authors: Yu Zhang, Guangyao Tian, Long Wen, Xiangtong Yao, Liding Zhang, Zhenshan Bing, Wei He, Alois Knoll

    Abstract: This paper proposes a LiDAR-based goal-seeking and exploration framework, addressing the efficiency of online obstacle avoidance in unstructured environments populated with static and moving obstacles. This framework addresses two significant challenges associated with traditional dynamic control barrier functions (D-CBFs): their online construction and the diminished real-time performance caused…

    Submitted 26 February, 2024; originally announced February 2024.

  40. arXiv:2402.16299  [pdf, other]

    cs.IR cs.LG

    Against Filter Bubbles: Diversified Music Recommendation via Weighted Hypergraph Embedding Learning

    Authors: Chaoguang Luo, Liuying Wen, Yong Qin, Liangwei Yang, Zhineng Hu, Philip S. Yu

    Abstract: Recommender systems serve a dual purpose for users: sifting out inappropriate or mismatched information while accurately identifying items that align with their preferences. Numerous recommendation algorithms are designed to provide users with a personalized array of information tailored to their preferences. Nevertheless, excessive personalization can confine users within a "filter bubble". Conse…

    Submitted 25 February, 2024; originally announced February 2024.

  41. arXiv:2402.11907  [pdf, other]

    cs.CL

    Direct Large Language Model Alignment Through Self-Rewarding Contrastive Prompt Distillation

    Authors: Aiwei Liu, Haoping Bai, Zhiyun Lu, Xiang Kong, Simon Wang, Jiulong Shan, Meng Cao, Lijie Wen

    Abstract: Aligning large language models (LLMs) with human expectations without human-annotated preference data is an important problem. In this paper, we propose a method to evaluate the response preference by using the output probabilities of response pairs under contrastive prompt pairs, which could achieve better performance on LLaMA2-7B and LLaMA2-13B compared to RLAIF. Based on this, we propose an aut…

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: 24 pages, 5 pages

    MSC Class: 68T50 ACM Class: I.2.7

  42. arXiv:2402.08917  [pdf, other]

    cs.DC

    An Interference-aware Approach for Co-located Container Orchestration with Novel Metric

    Authors: Xiang Li, Linfeng Wen, Minxian Xu, Kejiang Ye

    Abstract: Container orchestration technologies are widely employed in cloud computing, facilitating the co-location of online and offline services on the same infrastructure. Online services demand rapid responsiveness and high availability, whereas offline services require extensive computational resources. However, this mixed deployment can lead to resource contention, adversely affecting the performance…

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 8 pages

    Journal ref: In the Proceedings of IEEE SmartData 2023

  43. arXiv:2402.08221  [pdf, other]

    cs.RO cs.CV

    MetaTra: Meta-Learning for Generalized Trajectory Prediction in Unseen Domain

    Authors: Xiaohe Li, Feilong Huang, Zide Fan, Fangli Mou, Yingyan Hou, Chen Qian, Lijie Wen

    Abstract: Trajectory prediction has garnered widespread attention in different fields, such as autonomous driving and robotic navigation. However, due to the significant variations in trajectory patterns across different scenarios, models trained in known environments often falter in unseen ones. To learn a generalized model that can directly handle unseen domains without requiring any model updating, we pr…

    Submitted 13 February, 2024; originally announced February 2024.

  44. arXiv:2402.05383  [pdf, other]

    nucl-ex hep-ex

    First measurement of the yield of $^8$He isotopes produced in liquid scintillator by cosmic-ray muons at Daya Bay

    Authors: Daya Bay Collaboration, F. P. An, W. D. Bai, A. B. Balantekin, M. Bishai, S. Blyth, G. F. Cao, J. Cao, J. F. Chang, Y. Chang, H. S. Chen, H. Y. Chen, S. M. Chen, Y. Chen, Y. X. Chen, Z. Y. Chen, J. Cheng, Y. C. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, J. P. Cummings, O. Dalager, F. S. Deng, X. Y. Ding, et al. (177 additional authors not shown)

    Abstract: Daya Bay presents the first measurement of cosmogenic $^8$He isotope production in liquid scintillator, using an innovative method for identifying cascade decays of $^8$He and its child isotope, $^8$Li. We also measure the production yield of $^9$Li isotopes using well-established methodology. The results, in units of 10$^{-8}μ^{-1}$g$^{-1}$cm$^{2}$, are 0.307$\pm$0.042, 0.341$\pm$0.040, and 0.546…

    Submitted 7 February, 2024; originally announced February 2024.

  45. arXiv:2402.04566  [pdf]

    eess.IV cs.CV

    Triplet-constraint Transformer with Multi-scale Refinement for Dose Prediction in Radiotherapy

    Authors: Lu Wen, Qihun Zhang, Zhenghao Feng, Yuanyuan Xu, Xiao Chen, Jiliu Zhou, Yan Wang

    Abstract: Radiotherapy is a primary treatment for cancers with the aim of applying sufficient radiation dose to the planning target volume (PTV) while minimizing dose hazards to the organs at risk (OARs). Convolutional neural networks (CNNs) have automated the radiotherapy plan-making by predicting the dose maps. However, current CNN-based methods ignore the remarkable dose difference in the dose map, i.e.,…

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: accepted by 2024 IEEE ISBI

  46. arXiv:2402.03830  [pdf, other]

    cs.CV

    OASim: an Open and Adaptive Simulator based on Neural Rendering for Autonomous Driving

    Authors: Guohang Yan, Jiahao Pi, Jianfei Guo, Zhaotong Luo, Min Dou, Nianchen Deng, Qiusheng Huang, Daocheng Fu, Licheng Wen, Pinlong Cai, Xing Gao, Xinyu Cai, Bo Zhang, Xuemeng Yang, Yeqi Bai, Hongbin Zhou, Botian Shi

    Abstract: With deep learning and computer vision technology development, autonomous driving provides new solutions to improve traffic safety and efficiency. The importance of building high-quality datasets is self-evident, especially with the rise of end-to-end autonomous driving algorithms in recent years. Data plays a core role in the algorithm closed-loop system. However, collecting real-world data is ex…

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: 10 pages, 9 figures

  47. arXiv:2402.01246  [pdf, other]

    cs.RO eess.SY

    LimSim++: A Closed-Loop Platform for Deploying Multimodal LLMs in Autonomous Driving

    Authors: Daocheng Fu, Wenjie Lei, Licheng Wen, Pinlong Cai, Song Mao, Min Dou, Botian Shi, Yu Qiao

    Abstract: The emergence of Multimodal Large Language Models ((M)LLMs) has ushered in new avenues in artificial intelligence, particularly for autonomous driving by offering enhanced understanding and reasoning capabilities. This paper introduces LimSim++, an extended version of LimSim designed for the application of (M)LLMs in autonomous driving. Acknowledging the limitations of existing simulation platform…

    Submitted 12 April, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: Accepted by 35th IEEE Intelligent Vehicles Symposium (IV 2024)

  48. Image2Points: A 3D Point-based Context Clusters GAN for High-Quality PET Image Reconstruction

    Authors: Jiaqi Cui, Yan Wang, Lu Wen, Pinxian Zeng, Xi Wu, Jiliu Zhou, Dinggang Shen

    Abstract: To obtain high-quality Positron emission tomography (PET) images while minimizing radiation exposure, numerous methods have been proposed to reconstruct standard-dose PET (SPET) images from the corresponding low-dose PET (LPET) images. However, these methods heavily rely on voxel-based representations, which fall short of adequately accounting for the precise structure and fine-grained context, le…

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: Accepted by ICASSP 2024

  49. arXiv:2401.02901  [pdf, other]

    hep-ph hep-ex

    Charged-current non-standard neutrino interactions at Daya Bay

    Authors: Daya Bay collaboration, F. P. An, W. D. Bai, A. B. Balantekin, M. Bishai, S. Blyth, G. F. Cao, J. Cao, J. F. Chang, Y. Chang, H. S. Chen, H. Y. Chen, S. M. Chen, Y. Chen, Y. X. Chen, Z. Y. Chen, J. Cheng, Y. C. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, J. P. Cummings, O. Dalager, F. S. Deng, X. Y. Ding, et al. (177 additional authors not shown)

    Abstract: The full data set of the Daya Bay reactor neutrino experiment is used to probe the effect of the charged current non-standard interactions (CC-NSI) on neutrino oscillation experiments. Two different approaches are applied and constraints on the corresponding CC-NSI parameters are obtained with the neutrino flux taken from the Huber-Mueller model with a $5\%$ uncertainty. For the quantum mechanics-…

    Submitted 19 March, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

    Comments: 25 pages, 16 figures, 6 tables; 36 pages, format changed, references added

  50. arXiv:2312.07913  [pdf, other]

    cs.CL

    A Survey of Text Watermarking in the Era of Large Language Models

    Authors: Aiwei Liu, Leyi Pan, Yijian Lu, Jingjing Li, Xuming Hu, Xi Zhang, Lijie Wen, Irwin King, Hui Xiong, Philip S. Yu

    Abstract: Text watermarking algorithms play a crucial role in the copyright protection of textual content, yet their capabilities and application scenarios have been limited historically. The recent developments in large language models (LLMs) have opened new opportunities for the advancement of text watermarking techniques. LLMs not only enhance the capabilities of text watermarking algorithms through thei…

    Submitted 23 January, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: 35 pages, 7 figures

    MSC Class: 68T50 ACM Class: I.2.7