Skip to main content

Showing 1–50 of 375 results for author: Luo, D

  1. arXiv:2407.10458  [pdf, other

    cond-mat.mtrl-sci physics.comp-ph

    Predicting doping strategies for ternary nickel-cobalt-manganese cathode materials to enhance battery performance using graph neural networks

    Authors: Zirui Zhao, Dong Luo, Shuxing Wu, Kaitong Sun, Zhan Lin, Hai-Feng Li

    Abstract: The exceptional electrochemical performance of lithium-ion batteries has spurred considerable interest in advanced battery technologies, particularly those utilizing ternary nickel-cobalt-manganese (NCM) cathode materials, which are renowned for their robust electrochemical performance and structural stability. Building upon this research, investigators have explored doping additional elements int… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

  2. arXiv:2407.08554  [pdf, other

    cs.AI cs.HC

    Establishing Rigorous and Cost-effective Clinical Trials for Artificial Intelligence Models

    Authors: Wanling Gao, Yunyou Huang, Dandan Cui, Zhuoming Yu, Wenjing Liu, Xiaoshuang Liang, Jiahui Zhao, Jiyue Xie, Hao Li, Li Ma, Ning Ye, Yumiao Kang, Dingfeng Luo, Peng Pan, Wei Huang, Zhongmou Liu, Jizhong Hu, Gangyuan Zhao, Chongrong Jiang, Fan Huang, Tianyi Wei, Suqin Tang, Bingjie Xia, Zhifei Zhang, Jianfeng Zhan

    Abstract: A profound gap persists between artificial intelligence (AI) and clinical practice in medicine, primarily due to the lack of rigorous and cost-effective evaluation methodologies. State-of-the-art and state-of-the-practice AI model evaluations are limited to laboratory studies on medical datasets or direct clinical trials with no or solely patient-centered controls. Moreover, the crucial role of cl… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 23 pages

  3. arXiv:2407.07289  [pdf, other

    cs.CV

    Deformable Feature Alignment and Refinement for Moving Infrared Dim-small Target Detection

    Authors: Dengyan Luo, Yanping Xiang, Hu Wang, Luping Ji, Shuai Li, Mao Ye

    Abstract: The detection of moving infrared dim-small targets has been a challenging and prevalent research topic. The current state-of-the-art methods are mainly based on ConvLSTM to aggregate information from adjacent frames to facilitate the detection of the current frame. However, these methods implicitly utilize motion information only in the training stage and fail to explicitly explore motion compensa… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  4. arXiv:2407.03980  [pdf, other

    quant-ph

    Practical asynchronous measurement-device-independent quantum key distribution with advantage distillation

    Authors: Di Luo, Xin Liu, Kaibiao Qin, Zhenrong Zhang, Kejin Wei

    Abstract: The advantage distillation (AD) method has proven effective in improving the performance of quantum key distribution (QKD). In this paper, we introduce the AD method into a recently proposed asynchronous measurement-device-independent (AMDI) QKD protocol, taking finite-key effects into account. Simulation results show that the AD method significantly enhances AMDIQKD, e.g., extending the transmiss… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 13 pages, 5 figures

  5. arXiv:2407.03900  [pdf, other

    cs.CV

    Oracle Bone Inscriptions Multi-modal Dataset

    Authors: Bang Li, Donghao Luo, Yujie Liang, Jing Yang, Zengmao Ding, Xu Peng, Boyuan Jiang, Shengwei Han, Dan Sui, Peichao Qin, Pian Wu, Chaoyang Wang, Yun Qi, Taisong Jin, Chengjie Wang, Xiaoming Huang, Zhan Shu, Rongrong Ji, Yongge Liu, Yunsheng Wu

    Abstract: Oracle bone inscriptions(OBI) is the earliest developed writing system in China, bearing invaluable written exemplifications of early Shang history and paleography. However, the task of deciphering OBI, in the current climate of the scholarship, can prove extremely challenging. Out of the 4,500 oracle bone characters excavated, only a third have been successfully identified. Therefore, leveraging… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  6. arXiv:2407.02328  [pdf, other

    cs.CL

    Efficient Sparse Attention needs Adaptive Token Release

    Authors: Chaoran Zhang, Lixin Zou, Dan Luo, Min Tang, Xiangyang Luo, Zihao Li, Chenliang Li

    Abstract: In recent years, Large Language Models (LLMs) have demonstrated remarkable capabilities across a wide array of text-centric tasks. However, their `large' scale introduces significant computational and storage challenges, particularly in managing the key-value states of the transformer, which limits their wider applicability. Therefore, we propose to adaptively release resources from caches and reb… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Accepted at ACL 2024(Findings)

  7. arXiv:2407.00614  [pdf, other

    cs.RO cs.CV eess.IV

    Learning Granularity-Aware Affordances from Human-Object Interaction for Tool-Based Functional Grasping in Dexterous Robotics

    Authors: Fan Yang, Wenrui Chen, Kailun Yang, Haoran Lin, DongSheng Luo, Conghui Tang, Zhiyong Li, Yaonan Wang

    Abstract: To enable robots to use tools, the initial step is teaching robots to employ dexterous gestures for touching specific areas precisely where tasks are performed. Affordance features of objects serve as a bridge in the functional interaction between agents and objects. However, leveraging these affordance cues to help robots achieve functional tool grasping remains unresolved. To address this, we pr… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: The source code and the established dataset will be made publicly available at https://github.com/yangfan293/GAAF-DEX

  8. arXiv:2406.18284  [pdf, other

    cs.CV

    RealTalk: Real-time and Realistic Audio-driven Face Generation with 3D Facial Prior-guided Identity Alignment Network

    Authors: Xiaozhong Ji, Chuming Lin, Zhonggan Ding, Ying Tai, Jian Yang, Junwei Zhu, Xiaobin Hu, Jiangning Zhang, Donghao Luo, Chengjie Wang

    Abstract: Person-generic audio-driven face generation is a challenging task in computer vision. Previous methods have achieved remarkable progress in audio-visual synchronization, but there is still a significant gap between current results and practical applications. The challenges are two-fold: 1) Preserving unique individual traits for achieving high-precision lip synchronization. 2) Generating high-qual… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  9. arXiv:2406.17645  [pdf, other

    cond-mat.str-el cond-mat.dis-nn physics.comp-ph

    Simulating moiré quantum matter with neural network

    Authors: Di Luo, David D. Dai, Liang Fu

    Abstract: Moiré materials provide an ideal platform for exploring quantum phases of matter. However, solving the many-electron problem in moiré systems is challenging due to strong correlation effects. We introduce a powerful variational representation of quantum states, many-body neural Bloch wavefunction, to solve many-electron problems in moiré materials accurately and efficiently. Applying our method to… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  10. arXiv:2406.13495  [pdf, other

    cs.CV

    DF40: Toward Next-Generation Deepfake Detection

    Authors: Zhiyuan Yan, Taiping Yao, Shen Chen, Yandan Zhao, Xinghe Fu, Junwei Zhu, Donghao Luo, Li Yuan, Chengjie Wang, Shouhong Ding, Yunsheng Wu

    Abstract: We propose a new comprehensive benchmark to revolutionize the current deepfake detection field to the next generation. Predominantly, existing works identify top-notch detection algorithms and models by adhering to the common practice: training detectors on one specific dataset (e.g., FF++) and testing them on other prevalent deepfake datasets. This protocol is often regarded as a "golden compass"… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  11. arXiv:2406.11781  [pdf, other

    cs.IR

    DiffMM: Multi-Modal Diffusion Model for Recommendation

    Authors: Yangqin Jiang, Lianghao Xia, Wei Wei, Da Luo, Kangyi Lin, Chao Huang

    Abstract: The rise of online multi-modal sharing platforms like TikTok and YouTube has enabled personalized recommender systems to incorporate multiple modalities (such as visual, textual, and acoustic) into user representations. However, addressing the challenge of data sparsity in these systems remains a key issue. To address this limitation, recent research has introduced self-supervised learning techniq… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  12. arXiv:2406.11643  [pdf, other

    cs.CV

    AnyMaker: Zero-shot General Object Customization via Decoupled Dual-Level ID Injection

    Authors: Lingjie Kong, Kai Wu, Xiaobin Hu, Wenhui Han, Jinlong Peng, Chengming Xu, Donghao Luo, Jiangning Zhang, Chengjie Wang, Yanwei Fu

    Abstract: Text-to-image based object customization, aiming to generate images with the same identity (ID) as objects of interest in accordance with text prompts and reference images, has made significant progress. However, recent customizing research is dominated by specialized tasks, such as human customization or virtual try-on, leaving a gap in general object customization. To this end, we introduce AnyM… ▽ More

    Submitted 5 July, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  13. Multi-source Unsupervised Domain Adaptation on Graphs with Transferability Modeling

    Authors: Tianxiang Zhao, Dongsheng Luo, Xiang Zhang, Suhang Wang

    Abstract: In this paper, we tackle a new problem of \textit{multi-source unsupervised domain adaptation (MSUDA) for graphs}, where models trained on annotated source domains need to be transferred to the unsupervised target graph for node classification. Due to the discrepancy in distribution across domains, the key challenge is how to select good source instances and how to adapt the model. Diverse graph s… ▽ More

    Submitted 22 June, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

    Journal ref: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '24), August 25--29, 2024, Barcelona, Spain

  14. arXiv:2406.07362  [pdf, other

    cs.HC

    AI.vs.Clinician: Unveiling Intricate Interactions Between AI and Clinicians through an Open-Access Database

    Authors: Wanling Gao, Yuan Liu, Zhuoming Yu, Dandan Cui, Wenjing Liu, Xiaoshuang Liang, Jiahui Zhao, Jiyue Xie, Hao Li, Li Ma, Ning Ye, Yumiao Kang, Dingfeng Luo, Peng Pan, Wei Huang, Zhongmou Liu, Jizhong Hu, Fan Huang, Gangyuan Zhao, Chongrong Jiang, Tianyi Wei, Zhifei Zhang, Yunyou Huang, Jianfeng Zhan

    Abstract: Artificial Intelligence (AI) plays a crucial role in medical field and has the potential to revolutionize healthcare practices. However, the success of AI models and their impacts hinge on the synergy between AI and medical specialists, with clinicians assuming a dominant role. Unfortunately, the intricate dynamics and interactions between AI and clinicians remain undiscovered and thus hinder AI f… ▽ More

    Submitted 15 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: 12 pages

  15. arXiv:2406.07167  [pdf, ps, other

    math.PR

    On the pathwise uniqueness of stochastic 2D Euler equations with Kraichnan noise and $L^p$-data

    Authors: Shuaijie Jiao, Dejun Luo

    Abstract: In the recent work [arXiv:2308.03216], Coghi and Maurelli proved pathwise uniqueness of solutions to the vorticity form of stochastic 2D Euler equation, with Kraichnan transport noise and initial data in $L^1\cap L^p$ for $p>3/2$. The aim of this note is to remove the constraint on $p$, showing that pathwise uniqueness holds for all $L^1\cap L^p$ initial data with arbitrary $p>1$.

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 10 pages

  16. arXiv:2406.04000  [pdf, other

    physics.optics cs.ET

    Stochastic logic in biased coupled photonic probabilistic bits

    Authors: Michael Horodynski, Charles Roques-Carmes, Yannick Salamin, Seou Choi, Jamison Sloan, Di Luo, Marin Soljačić

    Abstract: Optical computing often employs tailor-made hardware to implement specific algorithms, trading generality for improved performance in key aspects like speed and power efficiency. An important computing approach that is still missing its corresponding optical hardware is probabilistic computing, used e.g. for solving difficult combinatorial optimization problems. In this study, we propose an experi… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  17. arXiv:2406.00132  [pdf, other

    cs.LG quant-ph

    QuanTA: Efficient High-Rank Fine-Tuning of LLMs with Quantum-Informed Tensor Adaptation

    Authors: Zhuo Chen, Rumen Dangovski, Charlotte Loh, Owen Dugan, Di Luo, Marin Soljačić

    Abstract: We propose Quantum-informed Tensor Adaptation (QuanTA), a novel, easy-to-implement, fine-tuning method with no inference overhead for large-scale pre-trained language models. By leveraging quantum-inspired methods derived from quantum circuit structures, QuanTA enables efficient high-rank fine-tuning, surpassing the limitations of Low-Rank Adaptation (LoRA)--low-rank approximation may fail for com… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

  18. arXiv:2405.20081  [pdf, other

    cs.CV cs.AI

    NoiseBoost: Alleviating Hallucination with Noise Perturbation for Multimodal Large Language Models

    Authors: Kai Wu, Boyuan Jiang, Zhengkai Jiang, Qingdong He, Donghao Luo, Shengzhi Wang, Qingwen Liu, Chengjie Wang

    Abstract: Multimodal large language models (MLLMs) contribute a powerful mechanism to understanding visual information building on large language models. However, MLLMs are notorious for suffering from hallucinations, especially when generating lengthy, detailed descriptions for images. Our analysis reveals that hallucinations stem from the inherent summarization mechanism of large language models, leading… ▽ More

    Submitted 31 May, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

    Comments: 14 pages, 5 figures with supplementary material

  19. arXiv:2405.19925  [pdf, other

    eess.SP

    Integrated Sensing and Communications Framework for 6G Networks

    Authors: Hongliang Luo, Tengyu Zhang, Chuanbin Zhao, Yucong Wang, Bo Lin, Yuhua Jiang, Dongqi Luo, Feifei Gao

    Abstract: In this paper, we propose a novel integrated sensing and communications (ISAC) framework for the sixth generation (6G) mobile networks, in which we decompose the real physical world into static environment, dynamic targets, and various object materials. The ubiquitous static environment occupies the vast majority of the physical world, for which we design static environment reconstruction (SER) sc… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  20. arXiv:2405.16558  [pdf, other

    quant-ph

    Experimental Refrence-Frame-Independent Quantum Key Distribution over 250 km of Optical Fiber

    Authors: Xin Liu, Di Luo, Zhicheng Luo, Shizhuo Li, Zhenrong Zhang, Kejin Wei

    Abstract: The reference-frame-independent quantum key distribution (RFI-QKD) protocol enables QKD systems to function effectively despite slowly varying reference frames, offering a distinct advantage in practical scenarios, particularly in mobile platforms. In this study, we successfully distribute secure key bits over a 250 km optical fiber distance by developing an RFI-QKD system with a repetition rate o… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: 9 pages,4 figures

  21. arXiv:2405.15287  [pdf, other

    cs.CV

    StyleMaster: Towards Flexible Stylized Image Generation with Diffusion Models

    Authors: Chengming Xu, Kai Hu, Donghao Luo, Jiangning Zhang, Wei Li, Yanhao Ge, Chengjie Wang

    Abstract: Stylized Text-to-Image Generation (STIG) aims to generate images based on text prompts and style reference images. We in this paper propose a novel framework dubbed as StyleMaster for this task by leveraging pretrained Stable Diffusion (SD), which tries to solve the previous problems such as insufficient style and inconsistent semantics. The enhancement lies in two novel module, namely multi-sourc… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  22. arXiv:2405.14646  [pdf, other

    cs.CL

    Unveiling the Achilles' Heel of NLG Evaluators: A Unified Adversarial Framework Driven by Large Language Models

    Authors: Yiming Chen, Chen Zhang, Danqing Luo, Luis Fernando D'Haro, Robby T. Tan, Haizhou Li

    Abstract: The automatic evaluation of natural language generation (NLG) systems presents a long-lasting challenge. Recent studies have highlighted various neural metrics that align well with human evaluations. Yet, the robustness of these evaluators against adversarial perturbations remains largely under-explored due to the unique challenges in obtaining adversarial data for different NLG evaluation tasks.… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: ACL24 Finding

  23. arXiv:2405.13810  [pdf, other

    cs.LG cs.AI

    Leveraging 2D Information for Long-term Time Series Forecasting with Vanilla Transformers

    Authors: Xin Cheng, Xiuying Chen, Shuqi Li, Di Luo, Xun Wang, Dongyan Zhao, Rui Yan

    Abstract: Time series prediction is crucial for understanding and forecasting complex dynamics in various domains, ranging from finance and economics to climate and healthcare. Based on Transformer architecture, one approach involves encoding multiple variables from the same timestamp into a single temporal token to model global dependencies. In contrast, another approach embeds the time points of individua… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  24. arXiv:2405.09308  [pdf, other

    cs.LG cs.AI

    TimeX++: Learning Time-Series Explanations with Information Bottleneck

    Authors: Zichuan Liu, Tianchun Wang, Jimeng Shi, Xu Zheng, Zhuomin Chen, Lei Song, Wenqian Dong, Jayantha Obeysekera, Farhad Shirani, Dongsheng Luo

    Abstract: Explaining deep learning models operating on time series data is crucial in various applications of interest which require interpretable and transparent insights from time series signals. In this work, we investigate this problem from an information theoretic perspective and show that most existing measures of explainability may suffer from trivial solutions and distributional shift issues. To add… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: Accepted by International Conference on Machine Learning (ICML 2024)

  25. arXiv:2405.01045  [pdf, ps, other

    math.PR

    Well-posedness of stochastic mSQG equations with Kraichnan noise and $L^p$ data

    Authors: Shuaijie Jiao, Dejun Luo

    Abstract: We consider stochastic mSQG (modified Surface Quasi-Geostrophic) equations with multiplicative transport noise of Kraichnan type, and $L^p$-initial conditions. Inspired by the recent work of Coghi and Maurelli [arXiv:2308.03216], we show weak existence and pathwise uniqueness of solutions to the equations for suitable choices of parameters in the nonlinearity, the noise and the integrability of in… ▽ More

    Submitted 30 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: 33 pages. We have updated the relation of $β_N$ and $β_L$ in Lemma 2.2, following Proposition 2.7 in arXiv:2308.03216v2. Moreover, we have simplified the statements of Theorem 1.4, covering slightly wider range of parameters

  26. arXiv:2404.18884  [pdf, ps, other

    econ.TH

    Reputation in Repeated Global Games of Regime Change with Exit

    Authors: Daniel Luo

    Abstract: I study a repeated binary-action supermodular game with endogenous exit where many short-lived agents attempt to coordinate a revolt against a regime. The regime undertakes costly actions to increase the short-run players' coordination frictions, though acts only after if the revolt is unsuccessful, inducing a lack-of-commitment problem. In the complete-information repeated game, a folk theorem ho… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  27. arXiv:2404.16826  [pdf, other

    math.OC

    Successive Convexification for Trajectory Optimization with Continuous-Time Constraint Satisfaction

    Authors: Purnanand Elango, Dayou Luo, Abhinav G. Kamath, Samet Uzun, Taewan Kim, Behçet Açıkmeşe

    Abstract: We present successive convexification, a real-time-capable solution method for nonconvex trajectory optimization, with continuous-time constraint satisfaction and guaranteed convergence, that only requires first-order information. The proposed framework combines several key methods to solve a large class of nonlinear optimal control problems: (i) exterior penalty-based reformulation of the path co… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

  28. arXiv:2404.16740  [pdf, other

    hep-ph

    Calculable neutrino Dirac mass matrix and one-loop $\bar θ$ in the minimal left-right symmetric model

    Authors: Gang Li, Ding-Yi Luo, Xiang Zhao

    Abstract: We revisit the contribution to the strong CP parameter $\bar θ$ from leptonic CP violation at one-loop level in the minimal left-right symmetric model in the case of parity as the left-right symmetry. The Hermitian neutrino Dirac mass matrix $M_D$ can be calculated using the light and heavy neutrino masses and mixings. We propose a parameterization of the right-handed neutrino mixing matrix $V_R$… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 6 pages, 1 figure

  29. arXiv:2404.16227  [pdf, other

    quant-ph

    Optimal entanglement generation in optomechanical systems via Krotov control of covariance matrix dynamics

    Authors: Peng-Ju Chen, Da-Wei Luo, Ting Yu

    Abstract: We investigated the optimal control of a continuous variable system, focusing on entanglement generation in an optomechanical system without utilizing Fock basis cutoffs. Using the Krotov algorithm to optimize the dynamics of the covariance matrix, we illustrated how to design a control objective function to manipulate the dynamics of the system to generate a desirable target state. We showed that… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: 10 pages, 5 figures

  30. arXiv:2404.15354  [pdf, other

    eess.SP cs.AI cs.LG math.NA

    Elevating Spectral GNNs through Enhanced Band-pass Filter Approximation

    Authors: Guoming Li, Jian Yang, Shangsong Liang, Dongsheng Luo

    Abstract: Spectral Graph Neural Networks (GNNs) have attracted great attention due to their capacity to capture patterns in the frequency domains with essential graph filters. Polynomial-based ones (namely poly-GNNs), which approximately construct graph filters with conventional or rational polynomials, are routinely adopted in practice for their substantial performances on graph learning tasks. However, pr… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: Preprint

  31. Jaynes-Cummings atoms coupled to a structured environment: Leakage elimination operators and the Petz recovery maps

    Authors: Da-Wei Luo, Ting Yu

    Abstract: We consider the Jaynes-Cummings (JC) model embedded in a structured environment, where the atom inside an optical cavity will be affected by a hierarchical environment consisting of the cavity and its environment. We propose several effective strategies to control and suppress the decoherence effects to protect the quantum coherence of the JC atom. We study the non-perturbative control of the syst… ▽ More

    Submitted 3 June, 2024; v1 submitted 21 April, 2024; originally announced April 2024.

    Comments: 8 pages, 3 figures

    Journal ref: Journal of the Optical Society of America B Vol. 41, Issue 8, pp. C112-C119 (2024)

  32. arXiv:2404.12659  [pdf, ps, other

    cs.CL

    SOS-1K: A Fine-grained Suicide Risk Classification Dataset for Chinese Social Media Analysis

    Authors: Hongzhi Qi, Hanfei Liu, Jianqiang Li, Qing Zhao, Wei Zhai, Dan Luo, Tian Yu He, Shuo Liu, Bing Xiang Yang, Guanghui Fu

    Abstract: In the social media, users frequently express personal emotions, a subset of which may indicate potential suicidal tendencies. The implicit and varied forms of expression in internet language complicate accurate and rapid identification of suicidal intent on social media, thus creating challenges for timely intervention efforts. The development of deep learning models for suicide risk detection is… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  33. arXiv:2404.12322  [pdf, other

    cs.CV cs.AI

    Generalizable Face Landmarking Guided by Conditional Face Warping

    Authors: Jiayi Liang, Haotian Liu, Hongteng Xu, Dixin Luo

    Abstract: As a significant step for human face modeling, editing, and generation, face landmarking aims at extracting facial keypoints from images. A generalizable face landmarker is required in practice because real-world facial images, e.g., the avatars in animations and games, are often stylized in various ways. However, achieving generalizable face landmarking is challenging due to the diversity of faci… ▽ More

    Submitted 21 April, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: Accepted in CVPR 2024

  34. arXiv:2404.11449  [pdf, other

    cs.CL cs.LG

    AI-Enhanced Cognitive Behavioral Therapy: Deep Learning and Large Language Models for Extracting Cognitive Pathways from Social Media Texts

    Authors: Meng Jiang, Yi Jing Yu, Qing Zhao, Jianqiang Li, Changwei Song, Hongzhi Qi, Wei Zhai, Dan Luo, Xiaoqin Wang, Guanghui Fu, Bing Xiang Yang

    Abstract: Cognitive Behavioral Therapy (CBT) is an effective technique for addressing the irrational thoughts stemming from mental illnesses, but it necessitates precise identification of cognitive pathways to be successfully implemented in patient care. In current society, individuals frequently express negative emotions on social media on specific topics, often exhibiting cognitive distortions, including… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  35. arXiv:2404.10771  [pdf, other

    cs.LG math.NA physics.comp-ph

    TENG: Time-Evolving Natural Gradient for Solving PDEs With Deep Neural Nets Toward Machine Precision

    Authors: Zhuo Chen, Jacob McCarran, Esteban Vizcaino, Marin Soljačić, Di Luo

    Abstract: Partial differential equations (PDEs) are instrumental for modeling dynamical systems in science and engineering. The advent of neural networks has initiated a significant shift in tackling these complexities though challenges in accuracy persist, especially for initial value problems. In this paper, we introduce the $\textit{Time-Evolving Natural Gradient (TENG)}$, generalizing time-dependent var… ▽ More

    Submitted 3 June, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Report number: MIT-CTP/5706

  36. arXiv:2404.08288  [pdf, other

    econ.TH

    Istanbul Flower Auction: The Need for Speed

    Authors: Isa Hafalir, Donglai Luo, Cong Tao

    Abstract: We examine the unique format of the Istanbul Flower Auction and compare it to traditional Dutch and English auctions, emphasizing the need to auction large volumes rapidly. In a model with time costs, we study how this auction format, which cleverly combines Dutch and English auction mechanisms, manages time costs by dynamically adapting to initial bidding behaviors. Our numerical analysis conside… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: 35 pages, 8 figures, working paper

  37. arXiv:2404.04559  [pdf, ps, other

    cs.LG eess.SP math.NA

    Spectral GNN via Two-dimensional (2-D) Graph Convolution

    Authors: Guoming Li, Jian Yang, Shangsong Liang, Dongsheng Luo

    Abstract: Spectral Graph Neural Networks (GNNs) have achieved tremendous success in graph learning. As an essential part of spectral GNNs, spectral graph convolution extracts crucial frequency information in graph data, leading to superior performance of spectral GNNs in downstream tasks. However, in this paper, we show that existing spectral GNNs remain critical drawbacks in performing the spectral graph c… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: Preprint

  38. arXiv:2404.02465  [pdf, other

    q-bio.QM

    DiffFit: Visually-Guided Differentiable Fitting of Molecule Structures to a Cryo-EM Map

    Authors: Deng Luo, Zainab Alsuwaykit, Dawar Khan, Ondřej Strnad, Tobias Isenberg, Ivan Viola

    Abstract: We introduce DiffFit, a differentiable algorithm for fitting protein atomistic structures into experimental reconstructed Cryo-Electron Microscopy (cryo-EM) volume map. This process is essential in structural biology to semi-automatically reconstruct large meso-scale models of complex protein assemblies and complete cellular structures that are based on measured cryo-EM data. Current approaches re… ▽ More

    Submitted 3 July, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: 16 pages, 7 figures, 3 tables, submitted to IEEE VIS 2024

  39. arXiv:2404.02268  [pdf, other

    physics.acc-ph physics.ins-det

    Multi-Objective Bayesian Active Learning for MeV-ultrafast electron diffraction

    Authors: Fuhao Ji, Auralee Edelen, Ryan Roussel, Xiaozhe Shen, Sara Miskovich, Stephen Weathersby, Duan Luo, Mianzhen Mo, Patrick Kramer, Christopher Mayes, Mohamed A. K. Othman, Emilio Nanni, Xijie Wang, Alexander Reid, Michael Minitti, Robert Joel England

    Abstract: Ultrafast electron diffraction using MeV energy beams(MeV-UED) has enabled unprecedented scientific opportunities in the study of ultrafast structural dynamics in a variety of gas, liquid and solid state systems. Broad scientific applications usually pose different requirements for electron probe properties. Due to the complex, nonlinear and correlated nature of accelerator systems, electron beam… ▽ More

    Submitted 3 May, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Journal ref: Nat Commun 15, 4726 (2024)

  40. arXiv:2403.17712  [pdf, other

    cs.CV

    Invisible Gas Detection: An RGB-Thermal Cross Attention Network and A New Benchmark

    Authors: Jue Wang, Yuxiang Lin, Qi Zhao, Dong Luo, Shuaibao Chen, Wei Chen, Xiaojiang Peng

    Abstract: The widespread use of various chemical gases in industrial processes necessitates effective measures to prevent their leakage during transportation and storage, given their high toxicity. Thermal infrared-based computer vision detection techniques provide a straightforward approach to identify gas leakage areas. However, the development of high-quality algorithms has been challenging due to the lo… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  41. arXiv:2403.12468  [pdf, other

    cs.CL

    CrossTune: Black-Box Few-Shot Classification with Label Enhancement

    Authors: Danqing Luo, Chen Zhang, Yan Zhang, Haizhou Li

    Abstract: Training or finetuning large-scale language models (LLMs) requires substantial computation resources, motivating recent efforts to explore parameter-efficient adaptation to downstream tasks. One approach is to treat these models as black boxes and use forward passes (Inference APIs) to interact with them. Current research focuses on adapting these black-box models to downstream tasks using gradien… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Accepted by LREC-Coling 2024

  42. arXiv:2403.07061  [pdf, other

    quant-ph cond-mat.quant-gas hep-ph nucl-th

    Simulating Meson Scattering on Spin Quantum Simulators

    Authors: Elizabeth R. Bennewitz, Brayden Ware, Alexander Schuckert, Alessio Lerose, Federica M. Surace, Ron Belyansky, William Morong, De Luo, Arinjoy De, Kate S. Collins, Or Katz, Christopher Monroe, Zohreh Davoudi, Alexey V. Gorshkov

    Abstract: Studying high-energy collisions of composite particles, such as hadrons and nuclei, is an outstanding goal for quantum simulators. However, preparation of hadronic wave packets has posed a significant challenge, due to the complexity of hadrons and the precise structure of wave packets. This has limited demonstrations of hadron scattering on quantum simulators to date. Observations of confinement… ▽ More

    Submitted 13 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: 18 pages, 4 main figures, 2 supplementary figures

  43. arXiv:2403.06168  [pdf, other

    cs.CV cs.AI

    DiffuMatting: Synthesizing Arbitrary Objects with Matting-level Annotation

    Authors: Xiaobin Hu, Xu Peng, Donghao Luo, Xiaozhong Ji, Jinlong Peng, Zhengkai Jiang, Jiangning Zhang, Taisong Jin, Chengjie Wang, Rongrong Ji

    Abstract: Due to the difficulty and labor-consuming nature of getting highly accurate or matting annotations, there only exists a limited amount of highly accurate labels available to the public. To tackle this challenge, we propose a DiffuMatting which inherits the strong Everything generation ability of diffusion and endows the power of "matting anything". Our DiffuMatting can 1). act as an anything matti… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

  44. arXiv:2403.06013  [pdf, other

    cs.LG cs.CV

    Are Classification Robustness and Explanation Robustness Really Strongly Correlated? An Analysis Through Input Loss Landscape

    Authors: Tiejin Chen, Wenwang Huang, Linsey Pang, Dongsheng Luo, Hua Wei

    Abstract: This paper delves into the critical area of deep learning robustness, challenging the conventional belief that classification robustness and explanation robustness in image classification systems are inherently correlated. Through a novel evaluation approach leveraging clustering for efficient assessment of explanation robustness, we demonstrate that enhancing explanation robustness does not neces… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

  45. arXiv:2403.04785  [pdf, other

    cs.CL cs.AI

    Large Language Multimodal Models for 5-Year Chronic Disease Cohort Prediction Using EHR Data

    Authors: Jun-En Ding, Phan Nguyen Minh Thao, Wen-Chih Peng, Jian-Zhe Wang, Chun-Cheng Chug, Min-Chen Hsieh, Yun-Chien Tseng, Ling Chen, Dongsheng Luo, Chi-Te Wang, Pei-fu Chen, Feng Liu, Fang-Ming Hung

    Abstract: Chronic diseases such as diabetes are the leading causes of morbidity and mortality worldwide. Numerous research studies have been attempted with various deep learning models in diagnosis. However, most previous studies had certain limitations, including using publicly available datasets (e.g. MIMIC), and imbalanced data. In this study, we collected five-year electronic health records (EHRs) from… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

  46. arXiv:2403.04731  [pdf, other

    physics.optics quant-ph

    Photonic probabilistic machine learning using quantum vacuum noise

    Authors: Seou Choi, Yannick Salamin, Charles Roques-Carmes, Rumen Dangovski, Di Luo, Zhuo Chen, Michael Horodynski, Jamison Sloan, Shiekh Zia Uddin, Marin Soljacic

    Abstract: Probabilistic machine learning utilizes controllable sources of randomness to encode uncertainty and enable statistical modeling. Harnessing the pure randomness of quantum vacuum noise, which stems from fluctuating electromagnetic fields, has shown promise for high speed and energy-efficient stochastic photonic elements. Nevertheless, photonic computing hardware which can control these stochastic… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  47. arXiv:2403.03199  [pdf, other

    quant-ph cond-mat.dis-nn physics.comp-ph

    Operator Learning Renormalization Group

    Authors: Xiu-Zhe Luo, Di Luo, Roger G. Melko

    Abstract: In this paper, we present a general framework for quantum many-body simulations called the operator learning renormalization group (OLRG). Inspired by machine learning perspectives, OLRG is a generalization of Wilson's numerical renormalization group and White's density matrix renormalization group, which recursively builds a simulatable system to approximate a target system of the same number of… ▽ More

    Submitted 28 May, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: 18 pages, 14 figures

    Report number: MIT-CTP/5676

  48. arXiv:2403.00733  [pdf, ps, other

    math.OC

    Remarks on "Successive Convexification: A Superlinearly Convergent Algorithm for Non-convex Optimal Control Problems"

    Authors: Dayou Luo, Purnanand Elango, Behcet Acikmese

    Abstract: The purpose of this note is to highlight and address inaccuracies in the convergence guarantees of SCvx, a nonconvex trajectory optimization algorithm proposed by Mao et al. (arXiv:1804.06539), and make connections to relevant prior work. Specifically, we identify errors in the convergence proof within Mao et al. (arXiv:1804.06539) and reestablish the proof of convergence by employing a new method… ▽ More

    Submitted 13 March, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

  49. arXiv:2402.10434  [pdf, other

    cs.LG

    Parametric Augmentation for Time Series Contrastive Learning

    Authors: Xu Zheng, Tianchun Wang, Wei Cheng, Aitian Ma, Haifeng Chen, Mo Sha, Dongsheng Luo

    Abstract: Modern techniques like contrastive learning have been effectively used in many areas, including computer vision, natural language processing, and graph-structured data. Creating positive examples that assist the model in learning robust and discriminative representations is a crucial stage in contrastive learning approaches. Usually, preset human intuition directs the selection of relevant data au… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: Accepted by International Conference on Learning Representations (ICLR 2024)

  50. arXiv:2402.07484  [pdf, other

    math.PR

    An elementary approach to mixing and dissipation enhancement by transport noise

    Authors: Dejun Luo, Bin Tang, Guohuan Zhao

    Abstract: We investigate the mixing properties of solutions to the stochastic transport equation $d u= \circ d W \cdot\nabla u$, where the driving noise $W(t,x)$ is white in time, colored and divergence-free in space. Furthermore, we prove the dissipation enhancement in the presence of a small viscous term. Applying our results, we also derive the mixing properties for a regularized stochastic 2D Euler equa… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: 34 pages