Showing 1–50 of 1,690 results for author: Zhao, C

  1. arXiv:2407.11750  [pdf, other]

    cs.CV

    Cycle Contrastive Adversarial Learning for Unsupervised image Deraining

    Authors: Chen Zhao, Weiling Cai, ChengWei Hu, Zheng Yuan

    Abstract: To tackle the difficulties in fitting paired real-world data for single image deraining (SID), recent unsupervised methods have achieved notable success. However, these methods often struggle to generate high-quality, rain-free images due to a lack of attention to semantic representation and image content, resulting in ineffective separation of content from the rain layer. In this paper, we propos…

    Submitted 16 July, 2024; originally announced July 2024.

  2. arXiv:2407.10523  [pdf, other]

    quant-ph physics.chem-ph

    Variational Quantum Imaginary Time Evolution for Matrix Product State Ansatz with Tests on Transcorrelated Hamiltonians

    Authors: Hao-En Li, Xiang Li, Jia-Cheng Huang, Guang-Ze Zhang, Zhu-Ping Shen, Chen Zhao, Jun Li, Han-Shi Hu

    Abstract: The matrix product state (MPS) ansatz offers a promising approach for finding the ground state of molecular Hamiltonians and solving quantum chemistry problems. Building on this concept, the proposed technique of quantum circuit MPS (QCMPS) enables the simulation of chemical systems using a relatively small number of qubits. In this study, we enhance the optimization performance of the QCMPS ansat…

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 15 pages, 8 figures

  3. arXiv:2407.09922  [pdf]

    q-bio.NC

    Transcranial low-level laser stimulation in near infrared-II region for brain safety and protection

    Authors: Zhilin Li, Yongheng Zhao, Yiqing Hu, Yang Li, Keyao Zhang, Zhibing Gao, Lirou Tan, Hanli Liu, Xiaoli Li, Aihua Cao, Zaixu Cui, Chenguang Zhao

    Abstract: Background: The use of near-infrared lasers for transcranial photobiomodulation (tPBM) offers a non-invasive method for influencing brain activity and is beneficial for various neurological conditions. Objective: To investigate the safety and neuroprotective properties of tPBM using near-infrared (NIR)-II laser stimulation. Methods: We conducted thirteen experiments involving multidimensional and…

    Submitted 13 July, 2024; originally announced July 2024.

  4. arXiv:2407.08150  [pdf, other]

    cs.CV

    Hypergraph Multi-modal Large Language Model: Exploiting EEG and Eye-tracking Modalities to Evaluate Heterogeneous Responses for Video Understanding

    Authors: Minghui Wu, Chenxu Zhao, Anyang Su, Donglin Di, Tianyu Fu, Da An, Min He, Ya Gao, Meng Ma, Kun Yan, Ping Wang

    Abstract: Understanding of video creativity and content often varies among individuals, with differences in focal points and cognitive levels across different ages, experiences, and genders. There is currently a lack of research in this area, and most existing benchmarks suffer from several drawbacks: 1) a limited number of modalities and answers with restrictive length; 2) the content and scenarios within…

    Submitted 16 July, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

    Comments: Accepted by ACM MULTIMEDIA 2024

  5. arXiv:2407.08034  [pdf, other]

    cs.AI

    Spatial-Temporal Generative AI for Traffic Flow Estimation with Sparse Data of Connected Vehicles

    Authors: Jianzhe Xue, Yunting Xu, Dongcheng Yuan, Caoyi Zha, Hongyang Du, Haibo Zhou, Dusit Niyato

    Abstract: Traffic flow estimation (TFE) is crucial for intelligent transportation systems. Traditional TFE methods rely on extensive road sensor networks and typically incur significant costs. Sparse mobile crowdsensing enables a cost-effective alternative by utilizing sparsely distributed probe vehicle data (PVD) provided by connected vehicles. However, as pointed out by the central limit theorem, the spar…

    Submitted 10 July, 2024; originally announced July 2024.

  6. arXiv:2407.08023  [pdf, other]

    cs.CV

    Hybrid Structure-from-Motion and Camera Relocalization for Enhanced Egocentric Localization

    Authors: Jinjie Mai, Abdullah Hamdi, Silvio Giancola, Chen Zhao, Bernard Ghanem

    Abstract: We built our pipeline EgoLoc-v1, mainly inspired by EgoLoc. We propose a model ensemble strategy to improve the camera pose estimation part of the VQ3D task, which has been proven to be essential in previous work. The core idea is not only to do SfM for egocentric videos but also to do 2D-3D matching between existing 3D scans and 2D video frames. In this way, we have a hybrid SfM and camera reloca…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: 1st place winner of the 2024 Ego4D-Ego-Exo4D Challenge in VQ3D

  7. arXiv:2407.07061  [pdf, other]

    cs.CL

    Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence

    Authors: Weize Chen, Ziming You, Ran Li, Yitong Guan, Chen Qian, Chenyang Zhao, Cheng Yang, Ruobing Xie, Zhiyuan Liu, Maosong Sun

    Abstract: The rapid advancement of large language models (LLMs) has paved the way for the development of highly capable autonomous agents. However, existing multi-agent frameworks often struggle with integrating diverse capable third-party agents due to reliance on agents defined within their own ecosystems. They also face challenges in simulating distributed environments, as most frameworks are limited to…

    Submitted 10 July, 2024; v1 submitted 9 July, 2024; originally announced July 2024.

    Comments: work in progress

  8. arXiv:2407.06662  [pdf, other]

    eess.SP

    Experimental Demonstration of 16D Voronoi Constellation with Two-Level Coding over 50km Four-Core Fiber

    Authors: Can Zhao, Bin Chen, Jiaqi Cai, Zhiwei Liang, Yi Lei, Junjie Xiong, Lin Ma, Daohui Hu, Lin Sun, Gangxiang Shen

    Abstract: A 16-dimensional Voronoi constellation concatenated with multilevel coding is experimentally demonstrated over a 50km four-core fiber transmission system. The proposed scheme reduces the required launch power by 6dB and provides a 17dB larger operating range than 16QAM with BICM at the outer HD-FEC BER threshold.

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 4 pages, 4 figures, accepted by 2024 European Conference on Optical Communication (ECOC)

  9. arXiv:2407.03913  [pdf, other]

    cs.AI cs.HC

    MobileExperts: A Dynamic Tool-Enabled Agent Team in Mobile Devices

    Authors: Jiayi Zhang, Chuang Zhao, Yihan Zhao, Zhaoyang Yu, Ming He, Jianping Fan

    Abstract: The attainment of autonomous operations in mobile computing devices has consistently been a goal of human pursuit. With the development of Large Language Models (LLMs) and Visual Language Models (VLMs), this aspiration is progressively turning into reality. While contemporary research has explored automation of simple tasks on mobile devices via VLMs, there remains significant room for improvement…

    Submitted 4 July, 2024; originally announced July 2024.

  10. arXiv:2407.03247  [pdf, other]

    cs.DC

    Bridging Model Heterogeneity in Federated Learning via Uncertainty-based Asymmetrical Reciprocity Learning

    Authors: Jiaqi Wang, Chenxu Zhao, Lingjuan Lyu, Quanzeng You, Mengdi Huai, Fenglong Ma

    Abstract: This paper presents FedType, a simple yet pioneering framework designed to fill research gaps in heterogeneous model aggregation within federated learning (FL). FedType introduces small identical proxy models for clients, serving as agents for information exchange, ensuring model security, and achieving efficient communication simultaneously. To transfer knowledge between large private and small p…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: This paper has been accepted by ICML 2024

  11. arXiv:2407.03128  [pdf]

    cond-mat.mtrl-sci physics.optics

    Thorium doped strontium fluoride crystal: a unique candidate for solid nuclear optical clock material

    Authors: Qiaorui Gong, Shanming Li, Shulong Zhang, Siliang Tao, Guoliang Deng, Peixiong Zhang, Chengchun Zhao, Yin Hang, Shining Zhu, Longsheng Ma

    Abstract: We report a candidate with unique advantages in the cultivation of solid-state nuclear clock material, Th:SrF2 crystal. It not only has a segregation coefficient close to 1, which can achieve highly efficient and uniform doping of Th, but also ensures a high transmittance (~69% at 150 nm) while achieving extremely high doping concentration (232Th>6*10^20 cm^(-3)). In addition, SrF2 crystal will not…

    Submitted 3 July, 2024; originally announced July 2024.

  12. arXiv:2407.02553  [pdf, other]

    quant-ph cond-mat.dis-nn physics.atom-ph

    Large-scale quantum reservoir learning with an analog quantum computer

    Authors: Milan Kornjača, Hong-Ye Hu, Chen Zhao, Jonathan Wurtz, Phillip Weinberg, Majd Hamdan, Andrii Zhdanov, Sergio H. Cantu, Hengyun Zhou, Rodrigo Araiza Bravo, Kevin Bagnall, James I. Basham, Joseph Campo, Adam Choukri, Robert DeAngelo, Paige Frederick, David Haines, Julian Hammett, Ning Hsu, Ming-Guang Hu, Florian Huber, Paul Niklas Jepsen, Ningyuan Jia, Thomas Karolyshyn, Minho Kwon, et al. (28 additional authors not shown)

    Abstract: Quantum machine learning has gained considerable attention as quantum technology advances, presenting a promising approach for efficiently learning complex data patterns. Despite this promise, most contemporary quantum methods require significant resources for variational parameter optimization and face issues with vanishing gradients, leading to experiments that are either limited in scale or lac…

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 10 + 14 pages, 4 + 7 figures

  13. arXiv:2407.02190  [pdf, other]

    cs.RO

    I2EKF-LO: A Dual-Iteration Extended Kalman Filter Based LiDAR Odometry

    Authors: Wenlu Yu, Jie Xu, Chengwei Zhao, Lijun Zhao, Thien-Minh Nguyen, Shenghai Yuan, Mingming Bai, Lihua Xie

    Abstract: LiDAR odometry is a pivotal technology in the fields of autonomous driving and autonomous mobile robotics. However, most current works focus on nonlinear optimization methods, and many challenges remain in using the traditional Iterative Extended Kalman Filter (IEKF) framework to tackle the problem: IEKF only iterates over the observation equation, relying on a rough estimate of the…

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Accepted by IROS 2024

  14. arXiv:2407.01414  [pdf, other]

    cs.CV

    StyleShot: A Snapshot on Any Style

    Authors: Junyao Gao, Yanchen Liu, Yanan Sun, Yinhao Tang, Yanhong Zeng, Kai Chen, Cairong Zhao

    Abstract: In this paper, we show that a good style representation is crucial and sufficient for generalized style transfer without test-time tuning. We achieve this through constructing a style-aware encoder and a well-organized style dataset called StyleGallery. With dedicated design for style learning, this style-aware encoder is trained to extract expressive style representation with decoupling training…

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: project page: https://styleshot.github.io/

  15. arXiv:2407.01104  [pdf, other]

    cs.CV

    Semantic-guided Adversarial Diffusion Model for Self-supervised Shadow Removal

    Authors: Ziqi Zeng, Chen Zhao, Weiling Cai, Chenyu Dong

    Abstract: Existing unsupervised methods have addressed the challenges of inconsistent paired data and tedious acquisition of ground-truth labels in shadow removal tasks. However, GAN-based training often faces issues such as mode collapse and unstable optimization. Furthermore, due to the complex mapping between shadow and shadow-free domains, merely relying on adversarial learning is not enough to capture…

    Submitted 1 July, 2024; originally announced July 2024.

  16. arXiv:2407.00874  [pdf, other]

    hep-ph

    A plan for a super $η$ factory at Huizhou accelerator complex

    Authors: Xu-Rong Chen, Xiong-Hong He, Qiang Hu, De-Xu Lin, Yang Liu, Hao Qiu, Xu Sun, Ye Tian, Rong Wang, Hong-Lin Zhang, Ya-Peng Zhang, Cheng-Xin Zhao

    Abstract: As a Goldstone boson with zero quantum number and zero SM charge, the decays of long-lived $η$ ($η^{\prime}$) meson provide a unique window to search for new physics beyond the standard model and new sources of CP violation, to test the low-energy QCD theory, and to measure the fundamental parameters of light quarks. For such goals in the physics frontiers we discuss a plan of building a super $η$ fac…

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: 19 pages, 9 figures

  17. arXiv:2407.00506  [pdf, other]

    cs.AI cs.GT cs.LG

    ShapG: new feature importance method based on the Shapley value

    Authors: Chi Zhao, Jing Liu, Elena Parilina

    Abstract: With the wide application of Artificial Intelligence (AI), it has become particularly important to make decisions of AI systems explainable and transparent. In this paper, we propose a new Explainable Artificial Intelligence (XAI) method called ShapG (Explanations based on Shapley value for Graphs) for measuring feature importance. ShapG is a model-agnostic global explanation method. At the first sta…

    Submitted 29 June, 2024; originally announced July 2024.

    MSC Class: 68T01; 68T20

  18. arXiv:2406.19611  [pdf, other]

    q-bio.QM cs.AI

    Multimodal Data Integration for Precision Oncology: Challenges and Future Directions

    Authors: Huajun Zhou, Fengtao Zhou, Chenyu Zhao, Yingxue Xu, Luyang Luo, Hao Chen

    Abstract: The essence of precision oncology lies in its commitment to tailor targeted treatments and care measures to each patient based on the individual characteristics of the tumor. The inherent heterogeneity of tumors necessitates gathering information from diverse data sources to provide valuable insights from various perspectives, fostering a holistic comprehension of the tumor. Over the past decade,…

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 15 pages, 4 figures

  19. arXiv:2406.19417  [pdf, other]

    cs.CR cs.AI

    "Glue pizza and eat rocks" -- Exploiting Vulnerabilities in Retrieval-Augmented Generative Models

    Authors: Zhen Tan, Chengshuai Zhao, Raha Moraffah, Yifan Li, Song Wang, Jundong Li, Tianlong Chen, Huan Liu

    Abstract: Retrieval-Augmented Generative (RAG) models enhance Large Language Models (LLMs) by integrating external knowledge bases, improving their performance in applications like fact-checking and information searching. In this paper, we demonstrate a security threat where adversaries can exploit the openness of these knowledge bases by injecting deceptive content into the retrieval database, intentionall…

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Preprint

  20. arXiv:2406.18889  [pdf, ps, other]

    quant-ph

    Leapfrogging Sycamore: Harnessing 1432 GPUs for 7$\times$ Faster Quantum Random Circuit Sampling

    Authors: Xian-He Zhao, Han-Sen Zhong, Feng Pan, Zi-Han Chen, Rong Fu, Zhongling Su, Xiaotong Xie, Chaoxing Zhao, Pan Zhang, Wanli Ouyang, Chao-Yang Lu, Jian-Wei Pan, Ming-Cheng Chen

    Abstract: Random quantum circuit sampling serves as a benchmark to demonstrate quantum computational advantage. Recent progress in classical algorithms, especially those based on tensor network methods, has significantly reduced the classical simulation time and challenged the claim of the first-generation quantum advantage experiments. However, in terms of generating uncorrelated samples, time-to-solution,…

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: This work was completed in August 2023. A further 50x improvement has been achieved and will be posted on arXiv shortly

  21. arXiv:2406.18780  [pdf, other]

    physics.soc-ph cs.DS cs.SI

    Investigation on centrality measures and opinion dynamics in two-layer networks with replica nodes

    Authors: Chi Zhao, Elena Parilina

    Abstract: We examine two-layer networks and centrality measures defined on them. We propose two fast and accurate algorithms to approximate the game-theoretic centrality measures and examine the connection between centrality measures and characteristics of opinion dynamic processes on such networks. As an example, we consider Zachary's karate club social network and extend it by adding the second (internal)…

    Submitted 26 June, 2024; originally announced June 2024.

    MSC Class: 90B15; 90B18; 90C40; 05C90; 68R10

  22. arXiv:2406.18360  [pdf, other]

    cs.CV

    XLD: A Cross-Lane Dataset for Benchmarking Novel Driving View Synthesis

    Authors: Hao Li, Ming Yuan, Yan Zhang, Chenming Wu, Chen Zhao, Chunyu Song, Haocheng Feng, Errui Ding, Dingwen Zhang, Jingdong Wang

    Abstract: Thoroughly testing autonomy systems is crucial in the pursuit of safe autonomous driving vehicles. It necessitates creating safety-critical scenarios that go beyond what can be safely collected from real-world data, as many of these scenarios occur infrequently on public roads. However, the evaluation of most existing NVS methods relies on sporadic sampling of image frames from the training data,…

    Submitted 26 June, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

    Comments: project page: https://3d-aigc.github.io/XLD/

  23. arXiv:2406.18198  [pdf, other]

    cs.CV

    VDG: Vision-Only Dynamic Gaussian for Driving Simulation

    Authors: Hao Li, Jingfeng Li, Dingwen Zhang, Chenming Wu, Jieqi Shi, Chen Zhao, Haocheng Feng, Errui Ding, Jingdong Wang, Junwei Han

    Abstract: Dynamic Gaussian splatting has led to impressive scene reconstruction and image synthesis advances in novel views. Existing methods, however, heavily rely on pre-computed poses and Gaussian initialization by Structure from Motion (SfM) algorithms or expensive sensors. For the first time, this paper addresses this issue by integrating self-supervised VO into our pose-free dynamic Gaussian method (V…

    Submitted 26 June, 2024; originally announced June 2024.

  24. arXiv:2406.17992  [pdf, other]

    cs.CL cs.AI

    Catching Chameleons: Detecting Evolving Disinformation Generated using Large Language Models

    Authors: Bohan Jiang, Chengshuai Zhao, Zhen Tan, Huan Liu

    Abstract: Despite recent advancements in detecting disinformation generated by large language models (LLMs), current efforts overlook the ever-evolving nature of this disinformation. In this work, we investigate a challenging yet practical research problem of detecting evolving LLM-generated disinformation. Disinformation evolves constantly through the rapid development of LLMs and their variants. As a cons…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 10 pages, 5 figures

  25. arXiv:2406.17653  [pdf, other]

    quant-ph

    Algorithmic Fault Tolerance for Fast Quantum Computing

    Authors: Hengyun Zhou, Chen Zhao, Madelyn Cain, Dolev Bluvstein, Casey Duckering, Hong-Ye Hu, Sheng-Tao Wang, Aleksander Kubica, Mikhail D. Lukin

    Abstract: Fast, reliable logical operations are essential for the realization of useful quantum computers, as they are required to implement practical quantum algorithms at large scale. By redundantly encoding logical qubits into many physical qubits and using syndrome measurements to detect and subsequently correct errors, one can achieve very low logical error rates. However, for most practical quantum er…

    Submitted 25 June, 2024; originally announced June 2024.

  26. Performative Debias with Fair-exposure Optimization Driven by Strategic Agents in Recommender Systems

    Authors: Zhichen Xiang, Hongke Zhao, Chuang Zhao, Ming He, Jianping Fan

    Abstract: Data bias, e.g., popularity, impairs the dynamics of two-sided markets within recommender systems. This overshadows the less visible but potentially intriguing long-tail items that could capture user interest. Despite the abundance of research surrounding this issue, it still poses challenges and remains a hot topic in academic circles. Along this line, in this paper, we developed a re-ranking appr…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: SIGKDD 2024 accepted paper

  27. arXiv:2406.17147  [pdf, other]

    cs.LG cs.AI q-bio.QM

    Quantifying Heterogeneous Ecosystem Services With Multi-Label Soft Classification

    Authors: Zhihui Tian, John Upchurch, G. Austin Simon, José Dubeux, Alina Zare, Chang Zhao, Joel B. Harley

    Abstract: Understanding and quantifying ecosystem services are crucial for sustainable environmental management, conservation efforts, and policy-making. The advancement of remote sensing technology and machine learning techniques has greatly facilitated this process. Yet, ground truth labels, such as biodiversity, are very difficult and expensive to measure. In addition, more easily obtainable proxy labels…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  28. arXiv:2406.16494  [pdf, other]

    cs.IR cs.AI

    Cross-domain Transfer of Valence Preferences via a Meta-optimization Approach

    Authors: Chuang Zhao, Hongke Zhao, Ming He, Xiaomeng Li, Jianping Fan

    Abstract: Cross-domain recommendation offers a potential avenue for alleviating data sparsity and cold-start problems. Embedding and mapping, as a classic cross-domain research genre, aims to identify a common mapping function to perform representation transformation between two domains. Nevertheless, previous coarse-grained preference representations, non-personalized mapping functions, and excessive relia…

    Submitted 24 June, 2024; originally announced June 2024.

  29. arXiv:2406.14590  [pdf, other]

    physics.optics gr-qc

    Demonstration of optical spring in an un-detuned cavity containing an optical parametric amplifier

    Authors: Jian Liu, Juntao Pan, Carl Blair, Jue Zhang, Hengxin Sun, Li Ju, Chunnong Zhao

    Abstract: Here we demonstrate the capacity to manipulate the optical spring (OS) effect by employing an optical parametric amplifier (OPA) within an optical cavity. We observed more than a factor of 2 increase in the OS frequency shift with the OPA. We also showed for the first time that the OS can be tuned by solely adjusting the OPA phase and that an un-detuned cavity can exhibit an optical spring. The…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 6 pages, 9 figures

  30. arXiv:2406.11931  [pdf, other]

    cs.SE cs.AI cs.LG

    DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

    Authors: DeepSeek-AI, Qihao Zhu, Daya Guo, Zhihong Shao, Dejian Yang, Peiyi Wang, Runxin Xu, Y. Wu, Yukun Li, Huazuo Gao, Shirong Ma, Wangding Zeng, Xiao Bi, Zihui Gu, Hanwei Xu, Damai Dai, Kai Dong, Liyue Zhang, Yishi Piao, Zhibin Gou, Zhenda Xie, Zhewen Hao, Bingxuan Wang, Junxiao Song, Deli Chen, et al. (15 additional authors not shown)

    Abstract: We present DeepSeek-Coder-V2, an open-source Mixture-of-Experts (MoE) code language model that achieves performance comparable to GPT4-Turbo in code-specific tasks. Specifically, DeepSeek-Coder-V2 is further pre-trained from an intermediate checkpoint of DeepSeek-V2 with an additional 6 trillion tokens. Through this continued pre-training, DeepSeek-Coder-V2 substantially enhances the coding and mathe…

    Submitted 17 June, 2024; originally announced June 2024.

  31. arXiv:2406.11508  [pdf, other]

    eess.SY

    Leveraging Cooperative Connected Automated Vehicles for Mixed Traffic Safety

    Authors: Chenguang Zhao, Tamas G. Molnar, Huan Yu

    Abstract: The introduction of connected and automated vehicles (CAV) is believed to reduce congestion, enhance safety, and improve traffic efficiency. Numerous research studies have focused on controlling pure CAV platoons in fully connected automated traffic, as well as single or multiple CAVs in mixed traffic with human-driven vehicles (HVs). CAV cruising control designs have been proposed to stabilize th…

    Submitted 17 June, 2024; originally announced June 2024.

  32. arXiv:2406.10268  [pdf, other]

    cs.AI cs.CL cs.HC

    Autograding Mathematical Induction Proofs with Natural Language Processing

    Authors: Chenyan Zhao, Mariana Silva, Seth Poulsen

    Abstract: In mathematical proof education, there remains a need for interventions that help students learn to write mathematical proofs. Research has shown that timely feedback can be very helpful to students learning new skills. While for many years natural language processing models have struggled to perform well on tasks related to mathematical texts, recent developments in natural language processing ha…

    Submitted 11 June, 2024; originally announced June 2024.

  33. arXiv:2406.09495  [pdf, other]

    cs.LG cs.AI

    Fair Data Generation via Score-based Diffusion Model

    Authors: Yujie Lin, Dong Li, Chen Zhao, Minglai Shao

    Abstract: The fairness of AI decision-making has garnered increasing attention, leading to the proposal of numerous fairness algorithms. In this paper, we aim not to address this issue by directly introducing fair learning algorithms, but rather by generating entirely new, fair synthetic data from biased datasets for use in any downstream tasks. Additionally, the distribution of test data may differ from th…

    Submitted 13 June, 2024; originally announced June 2024.

  34. arXiv:2406.08724  [pdf]

    eess.IV cs.CV

    AGFA-Net: Attention-Guided and Feature-Aggregated Network for Coronary Artery Segmentation using Computed Tomography Angiography

    Authors: Xinyun Liu, Chen Zhao

    Abstract: Coronary artery disease (CAD) remains a prevalent cardiovascular condition, posing significant health risks worldwide. This pathology, characterized by plaque accumulation in coronary artery walls, leads to myocardial ischemia and various symptoms, including chest pain and shortness of breath. Accurate segmentation of coronary arteries from coronary computed tomography angiography (CCTA) images is…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 13 pages, 7 figures

  35. arXiv:2406.08044  [pdf, other]

    cond-mat.mes-hall

    Hofstadter spectrum in a semiconductor moiré lattice

    Authors: Chen Zhao, Ming Wu, Zhen Ma, Miao Liang, Ming Lu, Jin-Hua Gao, X. C. Xie

    Abstract: Recently, the Hofstadter spectrum of a twisted $\mathrm{WSe_2/MoSe_2}$ heterobilayer has been observed in experiment [C. R. Kometter, et al. Nat. Phys. 19, 1861 (2023)], but the origin of Hofstadter states remains unclear. Here, we present a comprehensive theoretical interpretation of the observed Hofstadter states by calculating its accurate Hofstadter spectrum. We point out that the valley Zeeman…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 7 pages, 4 figures

  36. arXiv:2406.07558  [pdf, other]

    cs.CY cs.AI cs.CV

    A Large Medical Model based on Visual Physiological Monitoring for Public Health

    Authors: Bin Huang, Changchen Zhao, Zimeng Liu, Shenda Hong, Baochang Zhang, Wenjin Wang, Hui Liu

    Abstract: The widespread outbreak of the COVID-19 pandemic has sounded a warning about the globalization challenges in public health. In this context, the establishment of large-scale public health datasets, of medical models, and of decision-making systems with a human-centric approach holds strategic significance. Recently, groundbreaking advancements have emerged in AI methods for physiological signal mo…

    Submitted 21 April, 2024; originally announced June 2024.

    Comments: 17 pages, 7 figures

  37. arXiv:2406.05285  [pdf, other]

    cs.CV

    VISTA3D: Versatile Imaging SegmenTation and Annotation model for 3D Computed Tomography

    Authors: Yufan He, Pengfei Guo, Yucheng Tang, Andriy Myronenko, Vishwesh Nath, Ziyue Xu, Dong Yang, Can Zhao, Benjamin Simon, Mason Belue, Stephanie Harmon, Baris Turkbey, Daguang Xu, Wenqi Li

    Abstract: Segmentation foundation models have attracted great interest; however, none of them is adequate for the use cases in 3D computed tomography (CT) images. Existing works finetune on medical images with 2D foundation models trained on natural images, but interactive segmentation, especially in 2D, is too time-consuming for 3D scans and less useful for large cohort analysis. Models that…

    Submitted 7 June, 2024; originally announced June 2024.

  38. arXiv:2406.03345  [pdf, other]

    cs.LG cs.AI

    Feature Contamination: Neural Networks Learn Uncorrelated Features and Fail to Generalize

    Authors: Tianren Zhang, Chujie Zhao, Guanyu Chen, Yizhou Jiang, Feng Chen

    Abstract: Learning representations that generalize under distribution shifts is critical for building robust machine learning models. However, despite significant efforts in recent years, algorithmic advances in this direction have been limited. In this work, we seek to understand the fundamental difficulty of out-of-distribution generalization with deep neural networks. We first empirically show that perha…

    Submitted 6 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

    Comments: ICML 2024

  39. arXiv:2406.03001  [pdf, other]

    cs.CV cs.AI

    EdgeSync: Faster Edge-model Updating via Adaptive Continuous Learning for Video Data Drift

    Authors: Peng Zhao, Runchu Dong, Guiqin Wang, Cong Zhao

    Abstract: Real-time video analytics systems typically place models with fewer weights on edge devices to reduce latency. The distribution of video content features may change over time for various reasons (i.e., light and weather change), leading to accuracy degradation of existing models. To solve this problem, recent work proposes a framework that uses a remote server to continually train and adapt the li…

    Submitted 5 June, 2024; originally announced June 2024.

  40. arXiv:2406.02058  [pdf, other]

    cs.CV cs.RO

    OpenGaussian: Towards Point-Level 3D Gaussian-based Open Vocabulary Understanding

    Authors: Yanmin Wu, Jiarui Meng, Haijie Li, Chenming Wu, Yahao Shi, Xinhua Cheng, Chen Zhao, Haocheng Feng, Errui Ding, Jingdong Wang, Jian Zhang

    Abstract: This paper introduces OpenGaussian, a method based on 3D Gaussian Splatting (3DGS) capable of 3D point-level open vocabulary understanding. Our primary motivation stems from observing that existing 3DGS-based open vocabulary methods mainly focus on 2D pixel-level parsing. These methods struggle with 3D point-level tasks due to weak feature expressiveness and inaccurate 2D-3D feature associations.…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: technical report, 15 pages

  41. arXiv:2406.01961  [pdf, other]

    cs.RO cs.CV

    Exploring Real World Map Change Generalization of Prior-Informed HD Map Prediction Models

    Authors: Samuel M. Bateman, Ning Xu, H. Charles Zhao, Yael Ben Shalom, Vince Gong, Greg Long, Will Maddern

    Abstract: Building and maintaining High-Definition (HD) maps represents a large barrier to autonomous vehicle deployment. This, along with advances in modern online map detection models, has sparked renewed interest in the online mapping problem. However, effectively predicting online maps at a high enough quality to enable safe, driverless deployments remains a significant challenge. Recent work on these m…

    Submitted 5 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted to CVPR 2024, Workshop on Autonomous Driving

  42. arXiv:2406.00906  [pdf, other]

    stat.ME

    A Bayesian Generalized Bridge Regression Approach to Covariance Estimation in the Presence of Covariates

    Authors: Christina Zhao, Ding Xiang, Galin L. Jones, Adam J. Rothman

    Abstract: A hierarchical Bayesian approach that permits simultaneous inference for the regression coefficient matrix and the error precision (inverse covariance) matrix in the multivariate linear model is proposed. Assuming a natural ordering of the elements of the response, the precision matrix is reparameterized so it can be estimated with univariate-response linear regression techniques. A novel generali…

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: 49 pages, 11 figures

  43. arXiv:2406.00839  [pdf, other]

    cs.CL cs.AI

    FOCUS: Forging Originality through Contrastive Use in Self-Plagiarism for Language Models

    Authors: Kaixin Lan, Tao Fang, Derek F. Wong, Yabo Xu, Lidia S. Chao, Cecilia G. Zhao

    Abstract: Pre-trained Language Models (PLMs) have shown impressive results in various Natural Language Generation (NLG) tasks, such as powering chatbots and generating stories. However, an ethical concern arises due to their potential to produce verbatim copies of paragraphs from their training data. This is problematic as PLMs are trained on corpora constructed by human authors. As such, there is a pressin…

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: 16 pages, 8 figures. The paper has been accepted by ACL 2024 (Findings), with Kaixin Lan and Tao Fang contributing equally, and Derek F. Wong serving as the corresponding author

  44. arXiv:2405.20071  [pdf]

    physics.med-ph cs.LG

    A Staged Approach using Machine Learning and Uncertainty Quantification to Predict the Risk of Hip Fracture

    Authors: Anjum Shaik, Kristoffer Larsen, Nancy E. Lane, Chen Zhao, Kuan-Jui Su, Joyce H. Keyak, Qing Tian, Qiuying Sha, Hui Shen, Hong-Wen Deng, Weihua Zhou

    Abstract: Despite advancements in medical care, hip fractures impose a significant burden on individuals and healthcare systems. This paper focuses on the prediction of hip fracture risk in older and middle-aged adults, where falls and compromised bone quality are predominant factors. We propose a novel staged model that combines advanced imaging and clinical data to improve predictive performance. By using…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 29 pages, 5 figures, 6 tables

  45. arXiv:2405.19990  [pdf, other]

    cs.CV

    DiffPhysBA: Diffusion-based Physical Backdoor Attack against Person Re-Identification in Real-World

    Authors: Wenli Sun, Xinyang Jiang, Dongsheng Li, Cairong Zhao

    Abstract: Person Re-Identification (ReID) systems pose a significant security risk from backdoor attacks, allowing adversaries to evade tracking or impersonate others. Beyond recognizing this issue, we investigate how backdoor attacks can be deployed in real-world scenarios, where a ReID model is typically trained on data collected in the digital domain and then deployed in a physical environment. This atta…

    Submitted 30 May, 2024; originally announced May 2024.

  46. arXiv:2405.19925  [pdf, other]

    eess.SP

    Integrated Sensing and Communications Framework for 6G Networks

    Authors: Hongliang Luo, Tengyu Zhang, Chuanbin Zhao, Yucong Wang, Bo Lin, Yuhua Jiang, Dongqi Luo, Feifei Gao

    Abstract: In this paper, we propose a novel integrated sensing and communications (ISAC) framework for the sixth generation (6G) mobile networks, in which we decompose the real physical world into static environment, dynamic targets, and various object materials. The ubiquitous static environment occupies the vast majority of the physical world, for which we design static environment reconstruction (SER) sc…

    Submitted 30 May, 2024; originally announced May 2024.

  47. arXiv:2405.19265  [pdf, other]

    cs.CL

    AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source Data

    Authors: Zifan Song, Yudong Wang, Wenwei Zhang, Kuikun Liu, Chengqi Lyu, Demin Song, Qipeng Guo, Hang Yan, Dahua Lin, Kai Chen, Cairong Zhao

    Abstract: Open-source Large Language Models (LLMs) and their specialized variants, particularly Code LLMs, have recently delivered impressive performance. However, previous Code LLMs are typically fine-tuned on single-source data with limited quality and diversity, which may insufficiently elicit the potential of pre-trained Code LLMs. In this paper, we present AlchemistCoder, a series of Code LLMs with enh…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: Preprint with 20 pages and 20 figures. Source code and models at https://github.com/InternLM/AlchemistCoder

  48. arXiv:2405.18513  [pdf]

    cond-mat.mtrl-sci cond-mat.mes-hall cond-mat.str-el

    Strong Chirality Suppression in 1-D correlated Weyl Semimetal (TaSe4)2I

    Authors: Utkarsh Khandelwal, Harshvardhan Jog, Shupeng Xu, Yicong Chen, Kejian Qu, Chengxi Zhao, Eugene Mele, Daniel P. Shoemaker, Ritesh Agarwal

    Abstract: The interaction of light with correlated Weyl semimetals (WSMs) provides a unique platform for exploring non-equilibrium phases and fundamental properties such as chirality. Here, we investigate the structural chirality of (TaSe4)2I, a correlated WSM, under weak optical pumping using Circular Photogalvanic Effect (CPGE) measurements and Raman spectroscopy. Surprisingly, we find that there is a los…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 21 pages, 4 figures

  49. arXiv:2405.17208  [pdf, other]

    astro-ph.CO

    Impact and mitigation of spectroscopic systematics on DESI DR1 clustering measurements

    Authors: A. Krolewski, J. Yu, A. J. Ross, S. Penmetsa, W. J. Percival, R. Zhou, J. Hou, J. Aguilar, S. Ahlen, D. Brooks, E. Chaussidon, T. Claybaugh, A. de la Macorra, Biprateep Dey, J. E. Forero-Romero, S. Gontcho A Gontcho, J. Guy, K. Honscheid, S. Juneau, D. Kirkby, T. Kisner, A. Kremin, A. Lambert, L. Le-Guillou, M. E. Levi, et al. (18 additional authors not shown)

    Abstract: The large scale structure catalogs within DESI Data Release 1 (DR1) use nearly 6 million galaxies and quasars as tracers of the large-scale structure of the universe to measure the expansion history with baryon acoustic oscillations and the growth of structure with redshift-space distortions. In order to take advantage of DESI's unprecedented statistical power, we must ensure that the galaxy clust…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 53 pages, 41 figures. Supporting paper for DESI DR1 cosmological measurements

  50. arXiv:2405.17134  [pdf, ps, other]

    cond-mat.mes-hall

    Moiré flat bands in alternating twisted $\mathrm{MoTe_2}$ multilayer

    Authors: Miao Liang, Shi-Ping Ding, Ming Wu, Chen Zhao, Jin-Hua Gao

    Abstract: The long-awaited fractional quantum anomalous Hall (FQAH) effect has recently been observed in twisted $\mathrm{MoTe_2}$ homobilayers, causing a great sensation. Here, we theoretically investigate the moiré band structures of a closely related system, the alternating twisted multilayer $\mathrm{MoTe_2}$ (ATML-$\mathrm{MoTe_2}$), where the adjacent layers have opposite twist angles. We illustra…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 11 pages, 5 figures