Skip to main content

Showing 1–50 of 2,153 results for author: Cao, Y

  1. arXiv:2407.11550  [pdf, other

    cs.CL cs.AI

    Optimizing KV Cache Eviction in LLMs: Adaptive Allocation for Enhanced Budget Utilization

    Authors: Yuan Feng, Junlin Lv, Yukun Cao, Xike Xie, S. Kevin Zhou

    Abstract: Large Language Models have excelled in various fields but encounter efficiency limitations due to the extensive KV cache required for long sequences inference. Many efforts try to evict non-critical cache elements during runtime, thereby reducing cache size within a given memory budget while preserving generation quality. Our reexamination of their underlying principles discerns that prevailing st… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

  2. arXiv:2407.10299  [pdf, other

    cs.CV

    Follow the Rules: Reasoning for Video Anomaly Detection with Large Language Models

    Authors: Yuchen Yang, Kwonjoon Lee, Behzad Dariush, Yinzhi Cao, Shao-Yuan Lo

    Abstract: Video Anomaly Detection (VAD) is crucial for applications such as security surveillance and autonomous driving. However, existing VAD methods provide little rationale behind detection, hindering public trust in real-world deployments. In this paper, we approach VAD with a reasoning framework. Although Large Language Models (LLMs) have shown revolutionary reasoning ability, we find that their direc… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

  3. arXiv:2407.08529  [pdf, other

    cs.CR

    Enhancing Privacy of Spatiotemporal Federated Learning against Gradient Inversion Attacks

    Authors: Lele Zheng, Yang Cao, Renhe Jiang, Kenjiro Taura, Yulong Shen, Sheng Li, Masatoshi Yoshikawa

    Abstract: Spatiotemporal federated learning has recently raised intensive studies due to its ability to train valuable models with only shared gradients in various location-based services. On the other hand, recent studies have shown that shared gradients may be subject to gradient inversion attacks (GIA) on images or texts. However, so far there has not been any systematic study of the gradient inversion a… ▽ More

    Submitted 15 July, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

    Comments: Accepted by DASFAA 2024, 16 pages

  4. arXiv:2407.08514  [pdf, other

    cs.CV

    Rethinking the Threat and Accessibility of Adversarial Attacks against Face Recognition Systems

    Authors: Yuxin Cao, Yumeng Zhu, Derui Wang, Sheng Wen, Minhui Xue, Jin Lu, Hao Ge

    Abstract: Face recognition pipelines have been widely deployed in various mission-critical systems in trust, equitable and responsible AI applications. However, the emergence of adversarial attacks has threatened the security of the entire recognition pipeline. Despite the sheer number of attack methods proposed for crafting adversarial examples in both digital and physical forms, it is never an easy task t… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 19 pages, 12 figures

  5. arXiv:2407.07501  [pdf

    cond-mat.supr-con

    Electronic Correlation and Pseudogap-like Behavior of High-Temperature Superconductor La3Ni2O7

    Authors: Yidian Li, Xian Du, Yantao Cao, Cuiying Pei, Mingxin Zhang, Wenxuan Zhao, Kaiyi Zhai, Runzhe Xu, Zhongkai Liu, Zhiwei Li, Jinkui Zhao, Gang Li, Yanpeng Qi, Hanjie Guo, Yulin Chen, Lexian Yang

    Abstract: High-temperature superconductivity (HTSC) remains one of the most challenging and fascinating mysteries in condensed matter physics. Recently, superconductivity with transition temperature exceeding liquid-nitrogen temperature is discovered in La3Ni2O7 at high pressure, which provides a new platform to explore the unconventional HTSC. In this work, using high-resolution angle-resolved photoemissio… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  6. arXiv:2407.07249  [pdf, other

    cs.CV

    Few-Shot Image Generation by Conditional Relaxing Diffusion Inversion

    Authors: Yu Cao, Shaogang Gong

    Abstract: In the field of Few-Shot Image Generation (FSIG) using Deep Generative Models (DGMs), accurately estimating the distribution of target domain with minimal samples poses a significant challenge. This requires a method that can both capture the broad diversity and the true characteristics of the target domain distribution. We present Conditional Relaxing Diffusion Inversion (CRDI), an innovative `tr… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  7. arXiv:2407.06567  [pdf, other

    cs.CL

    FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision Making

    Authors: Yangyang Yu, Zhiyuan Yao, Haohang Li, Zhiyang Deng, Yupeng Cao, Zhi Chen, Jordan W. Suchow, Rong Liu, Zhenyu Cui, Denghui Zhang, Koduvayur Subbalakshmi, Guojun Xiong, Yueru He, Jimin Huang, Dong Li, Qianqian Xie

    Abstract: Large language models (LLMs) have demonstrated notable potential in conducting complex tasks and are increasingly utilized in various financial applications. However, high-quality sequential financial investment decision-making remains challenging. These tasks require multiple interactions with a volatile environment for every decision, demanding sufficient intelligence to maximize returns and man… ▽ More

    Submitted 10 July, 2024; v1 submitted 9 July, 2024; originally announced July 2024.

    Comments: LLM Applications, LLM Agents, Financial Technology, Quantitative Finance, Algorithmic Trading, Cognitive Science

  8. arXiv:2407.06505  [pdf

    cs.HC

    Not all explicit cues help communicate: Pedestrians' perceptions, fixations, and decisions toward automated vehicles with varied appearance

    Authors: Wei Lyu, Yaqin Cao, Yi Ding, Jingyu Li, Kai Tian, Hui Zhang

    Abstract: Given pedestrians' vulnerability in road traffic, it remains unclear how novel AV appearances will impact pedestrians crossing behaviour. To address this gap, this study pioneers an investigation into the influence of AVs' exterior design, correlated with their kinematics, on pedestrians' road-crossing perception and decision-making. A video-based eye-tracking experimental study was conducted with… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 37 pages, 13 figures, 4 tables

  9. arXiv:2407.06177  [pdf, other

    cs.CV cs.AI cs.CL cs.CY

    Vision-Language Models under Cultural and Inclusive Considerations

    Authors: Antonia Karamolegkou, Phillip Rust, Yong Cao, Ruixiang Cui, Anders Søgaard, Daniel Hershcovich

    Abstract: Large vision-language models (VLMs) can assist visually impaired people by describing images from their daily lives. Current evaluation datasets may not reflect diverse cultural user backgrounds or the situational context of this use case. To address this problem, we create a survey to determine caption preferences and propose a culture-centric evaluation benchmark by filtering VizWiz, an existing… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: HuCLLM @ ACL 2024

  10. arXiv:2407.05718  [pdf, other

    cs.CL

    A Factuality and Diversity Reconciled Decoding Method for Knowledge-Grounded Dialogue Generation

    Authors: Chenxu Yang, Zheng Lin, Chong Tian, Liang Pang, Lanrui Wang, Zhengyang Tong, Qirong Ho, Yanan Cao, Weiping Wang

    Abstract: Grounding external knowledge can enhance the factuality of responses in dialogue generation. However, excessive emphasis on it might result in the lack of engaging and diverse expressions. Through the introduction of randomness in sampling, current approaches can increase the diversity. Nevertheless, such sampling method could undermine the factuality in dialogue generation. In this study, to disc… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  11. arXiv:2407.05365  [pdf, other

    cs.AI

    ElecBench: a Power Dispatch Evaluation Benchmark for Large Language Models

    Authors: Xiyuan Zhou, Huan Zhao, Yuheng Cheng, Yuji Cao, Gaoqi Liang, Guolong Liu, Junhua Zhao

    Abstract: In response to the urgent demand for grid stability and the complex challenges posed by renewable energy integration and electricity market dynamics, the power sector increasingly seeks innovative technological solutions. In this context, large language models (LLMs) have become a key technology to improve efficiency and promote intelligent progress in the power sector with their excellent natural… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

  12. arXiv:2407.05239  [pdf, other

    cs.DS cs.NI

    Competitive Analysis of Online Path Selection: Impacts of Path Length, Topology, and System-Level Costs

    Authors: Ying Cao, Siyuan Yu, Xiaoqi Tan, Danny H. K. Tsang

    Abstract: Consider a communication network to which a sequence of self-interested users come and send requests for data transmission between nodes. This work studies the question of how to guide the path selection choices made by those online-arriving users and maximize the social welfare. Competitive analysis is the main technical tool. Specifically, the impacts of path length bounds and topology on the co… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

  13. arXiv:2407.04999  [pdf, other

    cs.LG

    Rethinking the Effectiveness of Graph Classification Datasets in Benchmarks for Assessing GNNs

    Authors: Zhengdao Li, Yong Cao, Kefan Shuai, Yiming Miao, Kai Hwang

    Abstract: Graph classification benchmarks, vital for assessing and developing graph neural networks (GNNs), have recently been scrutinized, as simple methods like MLPs have demonstrated comparable performance. This leads to an important question: Do these benchmarks effectively distinguish the advancements of GNNs over other methodologies? If so, how do we quantitatively measure this effectiveness? In respo… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

  14. arXiv:2407.04531  [pdf, other

    astro-ph.GA

    Neutral atomic and molecular gas dynamics in the nearby spiral galaxies NGC 1512, NGC 4535, and NGC 7496

    Authors: Sebastian Laudage, Cosima Eibensteiner, Frank Bigiel, Adam K. Leroy, Sharon Meidt, Eva Schinnerer, W. J. G. de Blok, Miguele Querejeta, Sophia Stuber, Dario Colombo, Erik Rosolowsky, D. J. Pisano, Dyas Utomo, Rebecca C. Levy, Ralf Klessen, Yixian Cao, Eric W. Koch, Sushma Kurapati, Patricia Sanchez-Blazquez, Justus Neumann, Lukas Neumann, Hsi-An Pan, Thomas G. Williams

    Abstract: Neutral atomic gas (HI) effectively traces galactic dynamics across mid to large galactocentric radii. However, its limitations in observing small-scale changes within the central few kiloparsecs, coupled with the often observed HI deficit in galactic centers, necessitates using molecular gas emission as a preferred tracer in these regions. Understanding the dynamics of both neutral atomic and mol… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: accepted for publication in A&A; 13 pages, 9 Figures (+2 appendix pages)

  15. arXiv:2407.03959  [pdf, other

    cond-mat.mtrl-sci

    Skyrmion Hall effect in altermagnets

    Authors: Zhejunyu Jin, Zhaozhuo Zeng, Yunshan Cao, Peng Yan

    Abstract: It is widely believed that the skyrmion Hall effect is absent in antiferromagnets because of the vanishing topological charge. However, the Aharonov-Casher theory indicates the possibility of topological effects for neutral particles. In this work, we predict the skyrmion Hall effect in emerging altermagnets with zero net magnetization and zero skyrmion charge. We first show that the neutral skyrm… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 6 pages and 5 figures

  16. arXiv:2407.03320  [pdf, other

    cs.CV cs.CL

    InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

    Authors: Pan Zhang, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Rui Qian, Lin Chen, Qipeng Guo, Haodong Duan, Bin Wang, Linke Ouyang, Songyang Zhang, Wenwei Zhang, Yining Li, Yang Gao, Peng Sun, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Hang Yan, Conghui He, Xingcheng Zhang, Kai Chen, Jifeng Dai, Yu Qiao , et al. (2 additional authors not shown)

    Abstract: We present InternLM-XComposer-2.5 (IXC-2.5), a versatile large-vision language model that supports long-contextual input and output. IXC-2.5 excels in various text-image comprehension and composition applications, achieving GPT-4V level capabilities with merely 7B LLM backend. Trained with 24K interleaved image-text contexts, it can seamlessly extend to 96K long contexts via RoPE extrapolation. Th… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Technical Report. https://github.com/InternLM/InternLM-XComposer

  17. arXiv:2407.02715  [pdf, other

    cond-mat.mtrl-sci cond-mat.str-el

    Revealing the Electronic Structure of NiPS$_3$ through Synchrotron-Based ARPES and Alkali Metal Dosing

    Authors: Yifeng Cao, Qishuo Tan, Yucheng Guo, Clóvis Guerim Vieira, Mário S. C. Mazzon, Jude Laverock, Nicholas Russo, Hongze Gao, Chris Jozwiak, Aaron Bostwick, Eli Rotenberg, Jinghua Guo, Ming Yi, Matheus J. S. Matos, Xi Ling, Kevin E. Smith

    Abstract: This study presents a comprehensive analysis of the band structure in NiPS$_3$, a van der Waals layered antiferromagnet, utilizing high-resolution synchrotron-based angle-resolved photoemission spectroscopy (ARPES) and corroborative density functional theory (DFT) calculations. By tuning the parameters of the light source, we obtained a very clear and wide energy range band structure of NiPS$_3$.… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 4 figures

  18. arXiv:2407.02542  [pdf, other

    cs.IR cs.AI cs.LG

    ECAT: A Entire space Continual and Adaptive Transfer Learning Framework for Cross-Domain Recommendation

    Authors: Chaoqun Hou, Yuanhang Zhou, Yi Cao, Tong Liu

    Abstract: In industrial recommendation systems, there are several mini-apps designed to meet the diverse interests and needs of users. The sample space of them is merely a small subset of the entire space, making it challenging to train an efficient model. In recent years, there have been many excellent studies related to cross-domain recommendation aimed at mitigating the problem of data sparsity. However,… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  19. arXiv:2407.02182  [pdf, other

    cs.CV cs.RO eess.IV

    Occlusion-Aware Seamless Segmentation

    Authors: Yihong Cao, Jiaming Zhang, Hao Shi, Kunyu Peng, Yuhongxuan Zhang, Hui Zhang, Rainer Stiefelhagen, Kailun Yang

    Abstract: Panoramic images can broaden the Field of View (FoV), occlusion-aware prediction can deepen the understanding of the scene, and domain adaptation can transfer across viewing domains. In this work, we introduce a novel task, Occlusion-Aware Seamless Segmentation (OASS), which simultaneously tackles all these three challenges. For benchmarking OASS, we establish a new human-annotated dataset for Ble… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Accepted to ECCV 2024. The fresh dataset and the source code will be made publicly available at https://github.com/yihong-97/OASS

  20. arXiv:2407.02159  [pdf, other

    cs.CV eess.IV

    SparseSSP: 3D Subcellular Structure Prediction from Sparse-View Transmitted Light Images

    Authors: Jintu Zheng, Yi Ding, Qizhe Liu, Yi Cao, Ying Hu, Zenan Wang

    Abstract: Traditional fluorescence staining is phototoxic to live cells, slow, and expensive; thus, the subcellular structure prediction (SSP) from transmitted light (TL) images is emerging as a label-free, faster, low-cost alternative. However, existing approaches utilize 3D networks for one-to-one voxel level dense prediction, which necessitates a frequent and time-consuming Z-axis imaging process. Moreov… ▽ More

    Submitted 3 July, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

    Comments: Accpeted to ECCV2024

  21. arXiv:2407.01953  [pdf, other

    cs.CE cs.AI cs.LG q-fin.CP

    CatMemo at the FinLLM Challenge Task: Fine-Tuning Large Language Models using Data Fusion in Financial Applications

    Authors: Yupeng Cao, Zhiyuan Yao, Zhi Chen, Zhiyang Deng

    Abstract: The integration of Large Language Models (LLMs) into financial analysis has garnered significant attention in the NLP community. This paper presents our solution to IJCAI-2024 FinLLM challenge, investigating the capabilities of LLMs within three critical areas of financial tasks: financial classification, financial text summarization, and single stock trading. We adopted Llama3-8B and Mistral-7B a… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  22. arXiv:2407.01881  [pdf, other

    cond-mat.str-el cond-mat.other

    Spectral evidence for NiPS3 as a Mott-Hubbard insulator

    Authors: Yifeng Cao, Nicholas Russo, Qishuo Tan, Xi Ling, Jinghua Guo, Yi-de Chuang, Kevin E. Smith

    Abstract: The layered van der Waals trichalcogenide NiPS3 has attracted widespread attention due to its unique optical, magnetic, and electronic properties. The complexity of NiPS3 itself, however, has also led to ongoing debates regarding its characteristics such as the existence of self-doped ligand holes. In this study, X-ray absorption spectroscopy and resonant inelastic X-ray scattering have been appli… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 6 figures

  23. arXiv:2407.01716  [pdf, other

    astro-ph.GA

    PHANGS-MeerKAT and MHONGOOSE HI observations of nearby spiral galaxies: physical drivers of the molecular gas fraction, $R_{\mathrm{mol}}$

    Authors: Cosima Eibensteiner, Jiayi Sun, Frank Bigiel, Adam K. Leroy, Eva Schinnerer, Erik Rosolowsky, Sushma Kurapati, D. J. Pisano, W. J. G de Blok, Ashley T. Barnes, Mallory Thorp, Dario Colombo, Eric W. Koch, I-Da Chiang, Eve C. Ostriker, Eric J. Murphy, Nikki Zabel, Sebstian Laudage, Filippo M. Maccagni, Julia Healy, Srikrishna Sekhar, Dyas Utomo, Jakob den Brok, Yixian Cao, Mélanie Chevance , et al. (14 additional authors not shown)

    Abstract: The molecular-to-atomic gas ratio is crucial to the evolution of the interstellar medium in galaxies. We investigate the balance between the atomic ($Σ_{\rm HI}$) and molecular gas ($Σ_{\rm H2}$) surface densities in eight nearby star-forming galaxies using new high-quality observations from MeerKAT and ALMA (for HI and CO, respectively). We define the molecular gas ratio as… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: accepted for publication in A&A; 20 pages, 12 Figures (+4 appendix pages)

  24. arXiv:2407.01523  [pdf, other

    cs.CV cs.CL

    MMLongBench-Doc: Benchmarking Long-context Document Understanding with Visualizations

    Authors: Yubo Ma, Yuhang Zang, Liangyu Chen, Meiqi Chen, Yizhu Jiao, Xinze Li, Xinyuan Lu, Ziyu Liu, Yan Ma, Xiaoyi Dong, Pan Zhang, Liangming Pan, Yu-Gang Jiang, Jiaqi Wang, Yixin Cao, Aixin Sun

    Abstract: Understanding documents with rich layouts and multi-modal components is a long-standing and practical task. Recent Large Vision-Language Models (LVLMs) have made remarkable strides in various tasks, particularly in single-page document understanding (DU). However, their abilities on long-context DU remain an open problem. This work presents MMLongBench-Doc, a long-context, multi-modal benchmark co… ▽ More

    Submitted 10 July, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

  25. arXiv:2407.01121  [pdf, other

    math.CO

    On well (edge) dominated and equimatchable strong product graphs

    Authors: Yixin Cao, Guiqiang Mou, Jianxin Wang

    Abstract: A graph is well-(edge-)dominated if every minimal (edge) dominating set is minimum. A graph is equimatchable if every maximal matching is maximum. We study these concepts on strong product graphs. We fully characterize well-edge-dominated and equimatchable strong product graphs of nontrivial graphs, and identify a large family of graphs whose strong products with any well-dominated graph are well-… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  26. arXiv:2407.01008  [pdf

    physics.optics

    Periodic domain inversion in single crystal barium titanate-on-insulator thin film

    Authors: Pragati Aashna, Hong-Lin Lin, Yu Cao, Yuhui Yin, Yuan Gao, Sakthi Sanjeev Mohanraj, Di Zhu, Aaron Danner

    Abstract: We report experimentally achieving first-ever electric field periodic poling of single crystal barium titanate (BTO, or BaTiO3) thin film on insulator. Owing to the outstanding optical nonlinearities of BTO, this result is a key step towards achieving quasi-phase-matching in BTO. We first grow the BTO thin film on a dysprosium scandate substrate using pulsed laser deposition with a thin layer of s… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  27. arXiv:2407.00497  [pdf, other

    cs.CL

    LLMs-as-Instructors: Learning from Errors Toward Automating Model Improvement

    Authors: Jiahao Ying, Mingbao Lin, Yixin Cao, Wei Tang, Bo Wang, Qianru Sun, Xuanjing Huang, Shuicheng Yan

    Abstract: This paper introduces the innovative "LLMs-as-Instructors" framework, which leverages the advanced Large Language Models (LLMs) to autonomously enhance the training of smaller target models. Inspired by the theory of "Learning from Errors", this framework employs an instructor LLM to meticulously analyze the specific errors within a target model, facilitating targeted and efficient training cycles… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  28. arXiv:2406.20006  [pdf, other

    cs.LG

    On the Trade-off between Flatness and Optimization in Distributed Learning

    Authors: Ying Cao, Zhaoxian Wu, Kun Yuan, Ali H. Sayed

    Abstract: This paper proposes a theoretical framework to evaluate and compare the performance of gradient-descent algorithms for distributed learning in relation to their behavior around local minima in nonconvex environments. Previous works have noticed that convergence toward flat local minima tend to enhance the generalization ability of learning algorithms. This work discovers two interesting results. F… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  29. arXiv:2406.19317  [pdf, other

    cs.LG cs.AI cs.CL

    Jump Starting Bandits with LLM-Generated Prior Knowledge

    Authors: Parand A. Alamdari, Yanshuai Cao, Kevin H. Wilson

    Abstract: We present substantial evidence demonstrating the benefits of integrating Large Language Models (LLMs) with a Contextual Multi-Armed Bandit framework. Contextual bandits have been widely used in recommendation systems to generate personalized suggestions based on user-specific contexts. We show that LLMs, pre-trained on extensive corpora rich in human knowledge and preferences, can simulate human… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  30. arXiv:2406.17072  [pdf, other

    astro-ph.GA

    GATOS: missing molecular gas in the outflow of NGC5728 revealed by JWST

    Authors: R. Davies, T. Shimizu, M. Pereira-Santaella, A. Alonso-Herrero, A. Audibert, E. Bellocchi, P. Boorman, S. Campbell, Y. Cao, F. Combes, D. Delaney, T. Diaz-Santos, F. Eisenhauer, D. Esparza Arredondo, H. Feuchtgruber, N. M. Forster Schreiber, L. Fuller, P. Gandhi, I. Garcia-Bernete, S. Garcia-Burillo, B. Garcia-Lorenzo, R. Genzel, S. Gillessen, O. Gonzalez Martin, H. Haidar , et al. (27 additional authors not shown)

    Abstract: The ionisation cones of NGC5728 have a deficit of molecular gas based on millimetre observations of CO(2-1) emission. Although photoionisation from the active nucleus may lead to suppression of this transition, warm molecular gas can still be present. We report the detection of eight mid-infrared rotational H$_2$ lines throughout the central kiloparsec, including the ionisation cones, using integr… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: A&A accepted; 16 pages

  31. arXiv:2406.17067  [pdf

    cond-mat.mes-hall cond-mat.dis-nn physics.app-ph

    Optical Control of Adaptive Nanoscale Domain Networks

    Authors: Marc Zajac, Tao Zhou, Tiannan Yang, Sujit Das, Yue Cao, Burak Guzelturk, Vladimir Stoica, Mathew Cherukara, John W. Freeland, Venkatraman Gopalan, Ramamoorthy Ramesh, Lane W. Martin, Long-Qing Chen, Martin Holt, Stephan Hruszkewycz, Haidan Wen

    Abstract: Adaptive networks can sense and adjust to dynamic environments to optimize their performance. Understanding their nanoscale responses to external stimuli is essential for applications in nanodevices and neuromorphic computing. However, it is challenging to image such responses on the nanoscale with crystallographic sensitivity. Here, the evolution of nanodomain networks in (PbTiO3)n/(SrTiO3)n supe… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  32. arXiv:2406.16671  [pdf, other

    cs.RO

    STAR: Swarm Technology for Aerial Robotics Research

    Authors: Jimmy Chiun, Yan Rui Tan, Yuhong Cao, John Tan, Guillaume Sartoretti

    Abstract: In recent years, the field of aerial robotics has witnessed significant progress, finding applications in diverse domains, including post-disaster search and rescue operations. Despite these strides, the prohibitive acquisition costs associated with deploying physical multi-UAV systems have posed challenges, impeding their widespread utilization in research endeavors. To overcome these challenges,… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  33. arXiv:2406.16353  [pdf

    cond-mat.soft

    Micropores can enhance intrinsic fracture energy of hydrogels

    Authors: Puyu Cao, Bin Chen, Yi Cao, Huajian Gao

    Abstract: It is widely known that hydrogels, a class of soft materials made of a polymer chain network, are prone to fatigue failure. To understand the underlying mechanism, here we simulate polymer scission and fatigue initiation in the vicinity of a crack tip in a two-dimensional chain network. For a network without pores, our findings reveal that polymer scission can take place across multiple layers of… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  34. arXiv:2406.16253  [pdf, other

    cs.CL

    LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing

    Authors: Jiangshu Du, Yibo Wang, Wenting Zhao, Zhongfen Deng, Shuaiqi Liu, Renze Lou, Henry Peng Zou, Pranav Narayanan Venkit, Nan Zhang, Mukund Srinath, Haoran Ranran Zhang, Vipul Gupta, Yinghui Li, Tao Li, Fei Wang, Qin Liu, Tianlin Liu, Pengzhi Gao, Congying Xia, Chen Xing, Jiayang Cheng, Zhaowei Wang, Ying Su, Raj Sanjay Shah, Ruohao Guo , et al. (15 additional authors not shown)

    Abstract: This work is motivated by two key trends. On one hand, large language models (LLMs) have shown remarkable versatility in various generative tasks such as writing, drawing, and question answering, significantly reducing the time required for many routine tasks. On the other hand, researchers, whose work is not only time-consuming but also highly expertise-demanding, face increasing challenges as th… ▽ More

    Submitted 25 June, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

  35. arXiv:2406.16058  [pdf, other

    eess.AS

    Text-Queried Target Sound Event Localization

    Authors: Jinzheng Zhao, Xinyuan Qian, Yong Xu, Haohe Liu, Yin Cao, Davide Berghi, Wenwu Wang

    Abstract: Sound event localization and detection (SELD) aims to determine the appearance of sound classes, together with their Direction of Arrival (DOA). However, current SELD systems can only predict the activities of specific classes, for example, 13 classes in DCASE challenges. In this paper, we propose text-queried target sound event localization (SEL), a new paradigm that allows the user to input the… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: Accepted by EUSIPCO 2024

  36. arXiv:2406.15439  [pdf

    physics.soc-ph stat.AP

    Heterogeneous peer effects of college roommates on academic performance

    Authors: Yi Cao, Tao Zhou, Jian Gao

    Abstract: Understanding how student peers influence learning outcomes is crucial for effective education management in complex social systems. The complexities of peer selection and evolving peer relationships, however, pose challenges for identifying peer effects using static observational data. Here we use both null-model and regression approaches to examine peer effects using longitudinal data from 5,272… ▽ More

    Submitted 29 May, 2024; originally announced June 2024.

    Comments: 56 pages, 4 figures, 2 tables, with Supplementary Information

    Journal ref: Nature Communications, 15(1), 4785 (2024)

  37. arXiv:2406.14912  [pdf, other

    cs.CV

    FC3DNet: A Fully Connected Encoder-Decoder for Efficient Demoir'eing

    Authors: Zhibo Du, Long Peng, Yang Wang, Yang Cao, Zheng-Jun Zha

    Abstract: Moiré patterns are commonly seen when taking photos of screens. Camera devices usually have limited hardware performance but take high-resolution photos. However, users are sensitive to the photo processing time, which presents a hardly considered challenge of efficiency for demoiréing methods. To balance the network speed and quality of results, we propose a \textbf{F}ully \textbf{C}onnected en\t… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: Accepted by ICIP2024

  38. arXiv:2406.14841  [pdf, other

    cs.CR cs.DB cs.LG

    TabularMark: Watermarking Tabular Datasets for Machine Learning

    Authors: Yihao Zheng, Haocheng Xia, Junyuan Pang, Jinfei Liu, Kui Ren, Lingyang Chu, Yang Cao, Li Xiong

    Abstract: Watermarking is broadly utilized to protect ownership of shared data while preserving data utility. However, existing watermarking methods for tabular datasets fall short on the desired properties (detectability, non-intrusiveness, and robustness) and only preserve data utility from the perspective of data statistics, ignoring the performance of downstream ML models trained on the datasets. Can we… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  39. arXiv:2406.13870  [pdf, other

    cs.CV

    Splatter a Video: Video Gaussian Representation for Versatile Processing

    Authors: Yang-Tian Sun, Yi-Hua Huang, Lin Ma, Xiaoyang Lyu, Yan-Pei Cao, Xiaojuan Qi

    Abstract: Video representation is a long-standing problem that is crucial for various down-stream tasks, such as tracking,depth prediction,segmentation,view synthesis,and editing. However, current methods either struggle to model complex motions due to the absence of 3D structure or rely on implicit 3D representations that are ill-suited for manipulation tasks. To address these challenges, we introduce a no… ▽ More

    Submitted 26 June, 2024; v1 submitted 19 June, 2024; originally announced June 2024.

  40. arXiv:2406.13167  [pdf, other

    cs.CL

    QRMeM: Unleash the Length Limitation through Question then Reflection Memory Mechanism

    Authors: Bo Wang, Heyan Huang, Yixin Cao, Jiahao Ying, Wei Tang, Chong Feng

    Abstract: While large language models (LLMs) have made notable advancements in natural language processing, they continue to struggle with processing extensive text. Memory mechanism offers a flexible solution for managing long contexts, utilizing techniques such as compression, summarization, and structuring to facilitate nuanced and efficient handling of large volumes of text. However, existing techniques… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  41. arXiv:2406.13093  [pdf, other

    cs.CV cs.AI cs.HC

    RITA: A Real-time Interactive Talking Avatars Framework

    Authors: Wuxinlin Cheng, Cheng Wan, Yupeng Cao, Sihan Chen

    Abstract: RITA presents a high-quality real-time interactive framework built upon generative models, designed with practical applications in mind. Our framework enables the transformation of user-uploaded photos into digital avatars that can engage in real-time dialogue interactions. By leveraging the latest advancements in generative modeling, we have developed a versatile platform that not only enhances t… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  42. arXiv:2406.12333  [pdf

    hep-ex physics.flu-dyn

    Permeability distribution of gas drainage of borehole with the different moisture content caused polar permeability effect

    Authors: Lei Zhang, Yao Zhang, Hongyu Pan, Yan Cao, Yuhang Chu, Shihua Yang

    Abstract: In order to study the penetration characteristics in areas with different water content and different stress distributions in the radial direction of the hole after hydraulicization measures, an improved LFTD1812 triaxial permeability meter was used to conduct a test to measure the polar permeability characteristics of coal with different water content combinations were measured by permeability in… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 12 pages,10 figures

  43. arXiv:2406.12268  [pdf, ps, other

    eess.SP

    Channel Twinning: An Enabler for Next-Generation Ubiquitous Wireless Connectivity

    Authors: Yashuai Cao, Jingbo Tan, Jintao Wang, Wei Ni, Ekram Hossain, Dusit Niyato

    Abstract: The emerging concept of channel twinning (CT) has great potential to become a key enabler of ubiquitous connectivity in next-generation (xG) wireless systems. By fusing multimodal sensor data, CT advocates a high-fidelity and low-overhead channel acquisition paradigm, which is promising to provide accurate channel prediction in cross-domain and high-mobility scenarios of ubiquitous xG networks. Ho… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: submitted to IEEE

  44. arXiv:2406.12025  [pdf, other

    astro-ph.GA

    A 260 pc resolution ALMA map of HCN(1-0) in the galaxy NGC 4321

    Authors: Lukas Neumann, Frank Bigiel, Ashley T. Barnes, Molly J. Gallagher, Adam Leroy, Antonio Usero, Erik Rosolowsky, Ivana Bešlić, Médéric Boquien, Yixian Cao, Mélanie Chevance, Dario Colombo, Daniel A. Dale, Cosima Eibensteiner, Kathryn Grasha, Jonathan D. Henshaw, María J. Jiménez-Donaire, Sharon Meidt, Shyam H. Menon, Eric J. Murphy, Hsi-An Pan, Miguel Querejeta, Toshiki Saito, Eva Schinnerer, Sophia K. Stuber , et al. (2 additional authors not shown)

    Abstract: The star formation rate (SFR) is tightly connected to the amount of dense gas in molecular clouds. However, it is not fully understood how the relationship between dense molecular gas and star formation varies within galaxies and in different morphological environments. In this work, we study dense gas and star formation in the nearby spiral galaxy NGC 4321 to test how the amount of dense gas and… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 18 pages, 9 figures, accepted for pub in A&A, Jun 13, 2024

  45. arXiv:2406.11739  [pdf, other

    cs.CV

    V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results

    Authors: Jiaqi Wang, Yuhang Zang, Pan Zhang, Tao Chu, Yuhang Cao, Zeyi Sun, Ziyu Liu, Xiaoyi Dong, Tong Wu, Dahua Lin, Zeming Chen, Zhi Wang, Lingchen Meng, Wenhao Yao, Jianwei Yang, Sihong Wu, Zhineng Chen, Zuxuan Wu, Yu-Gang Jiang, Peixi Wu, Bosong Chai, Xuan Nie, Longquan Yan, Zeyu Wang, Qifan Zhou , et al. (9 additional authors not shown)

    Abstract: Detecting objects in real-world scenes is a complex task due to various challenges, including the vast range of object categories, and potential encounters with previously unknown or unseen objects. The challenges necessitate the development of public benchmarks and challenges to advance the field of object detection. Inspired by the success of previous COCO and LVIS Challenges, we organize the V3… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  46. arXiv:2406.11507  [pdf, other

    cs.CV

    Prior Normality Prompt Transformer for Multi-class Industrial Image Anomaly Detection

    Authors: Haiming Yao, Yunkang Cao, Wei Luo, Weihang Zhang, Wenyong Yu, Weiming Shen

    Abstract: Image anomaly detection plays a pivotal role in industrial inspection. Traditional approaches often demand distinct models for specific categories, resulting in substantial deployment costs. This raises concerns about multi-class anomaly detection, where a unified model is developed for multiple classes. However, applying conventional methods, particularly reconstruction-based models, directly to… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Accepted by IEEE Transactions on Industrial Informatics

  47. arXiv:2406.10650  [pdf, other

    stat.ML cs.LG

    The Implicit Bias of Adam on Separable Data

    Authors: Chenyang Zhang, Difan Zou, Yuan Cao

    Abstract: Adam has become one of the most favored optimizers in deep learning problems. Despite its success in practice, numerous mysteries persist regarding its theoretical understanding. In this paper, we study the implicit bias of Adam in linear logistic regression. Specifically, we show that when the training data are linearly separable, Adam converges towards a linear classifier that achieves the maxim… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: 33 pages, 2 figures

  48. arXiv:2406.10583  [pdf, other

    hep-ex

    Demonstration of neutron identification in neutrino interactions in the MicroBooNE liquid argon time projection chamber

    Authors: MicroBooNE collaboration, P. Abratenko, O. Alterkait, D. Andrade Aldana, L. Arellano, J. Asaadi, A. Ashkenazi, S. Balasubramanian, B. Baller, A. Barnard, G. Barr, D. Barrow, J. Barrow, V. Basque, J. Bateman, O. Benevides Rodrigues, S. Berkman, A. Bhanderi, A. Bhat, M. Bhattacharya, M. Bishai, A. Blake, B. Bogart, T. Bolton, J. Y. Book , et al. (165 additional authors not shown)

    Abstract: A significant challenge in measurements of neutrino oscillations is reconstructing the incoming neutrino energies. While modern fully-active tracking calorimeters such as liquid argon time projection chambers in principle allow the measurement of all final state particles above some detection threshold, undetected neutrons remain a considerable source of missing energy with little to no data const… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

    Report number: FERMILAB-PUB-24-0301

  49. arXiv:2406.10123  [pdf, other

    hep-ex physics.ins-det

    Improving neutrino energy estimation of charged-current interaction events with recurrent neural networks in MicroBooNE

    Authors: MicroBooNE collaboration, P. Abratenko, O. Alterkait, D. Andrade Aldana, L. Arellano, J. Asaadi, A. Ashkenazi, S. Balasubramanian, B. Baller, A. Barnard, G. Barr, D. Barrow, J. Barrow, V. Basque, J. Bateman, O. Benevides Rodrigues, S. Berkman, A. Bhanderi, A. Bhat, M. Bhattacharya, M. Bishai, A. Blake, B. Bogart, T. Bolton, J. Y. Book , et al. (164 additional authors not shown)

    Abstract: We present a deep learning-based method for estimating the neutrino energy of charged-current neutrino-argon interactions. We employ a recurrent neural network (RNN) architecture for neutrino energy estimation in the MicroBooNE experiment, utilizing liquid argon time projection chamber (LArTPC) detector technology. Traditional energy estimation approaches in LArTPCs, which largely rely on reconstr… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Report number: FERMILAB-PUB-24-0287

  50. arXiv:2406.09447  [pdf, ps, other

    cs.IT eess.SP

    Self-Sustainable Active Reconfigurable Intelligent Surfaces for Anti-Jamming in Wireless Communications

    Authors: Yang Cao, Wenchi Cheng, Jingqing Wang, Wei Zhang

    Abstract: Wireless devices can be easily attacked by jammers during transmission, which is a potential security threat for wireless communications. Active reconfigurable intelligent surface (RIS) attracts considerable attention and is expected to be employed in anti-jamming systems for secure transmission to significantly enhance the anti-jamming performance. However, active RIS introduces external power lo… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: submitted to IEEE systems journal