Skip to main content

Showing 1–50 of 334 results for author: Dai, D

  1. arXiv:2407.09204  [pdf, other

    cond-mat.mes-hall cond-mat.str-el

    Electron bubbles in highly excited states of the lowest Landau level

    Authors: David D. Dai, Liang Fu

    Abstract: We study the entire energy spectrum of an electron droplet in the lowest Landau level. By exact diagonalization calculations, we find highly excited states in the middle of the spectrum that display unexpected density distribution and pair correlation. We show that these exceptional excited states contain tightly bound electron bubbles with local filling $ν= 1$ that form various ordered structures… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: Main: 6 pages, 4 figures. SM: 11 pages, 6 figures

  2. arXiv:2407.08515  [pdf, other

    cs.CV cs.AI

    15M Multimodal Facial Image-Text Dataset

    Authors: Dawei Dai, YuTang Li, YingGe Liu, Mingming Jia, Zhang YuanHui, Guoyin Wang

    Abstract: Currently, image-text-driven multi-modal deep learning models have demonstrated their outstanding potential in many fields. In practice, tasks centered around facial images have broad application prospects. This paper presents \textbf{FaceCaption-15M}, a large-scale, diverse, and high-quality dataset of facial images accompanied by their natural language descriptions (facial image-to-text). This d… ▽ More

    Submitted 11 July, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

    Comments: 15 pages, 8 figures

  3. arXiv:2407.01906  [pdf, other

    cs.CL cs.AI cs.LG

    Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models

    Authors: Zihan Wang, Deli Chen, Damai Dai, Runxin Xu, Zhuoshu Li, Y. Wu

    Abstract: Parameter-efficient fine-tuning (PEFT) is crucial for customizing Large Language Models (LLMs) with constrained resources. Although there have been various PEFT methods for dense-architecture LLMs, PEFT for sparse-architecture LLMs is still underexplored. In this work, we study the PEFT method for LLMs with the Mixture-of-Experts (MoE) architecture and the contents of this work are mainly threefol… ▽ More

    Submitted 4 July, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

  4. arXiv:2407.00070  [pdf

    physics.app-ph physics.optics

    Nonvolatile Silicon Photonic MEMS Switch Based on Centrally-Clamped Stepped Bistable Mechanical Beams

    Authors: Qian Ma, Yinpeng Hu, Ye Lu, Yunzhi Liu, Huan Li, Daoxin Dai

    Abstract: High-performance photonic switches are essential for large-scale optical routing for AI large models and Internet of things. Realizing nonvolatility can further reduce power consumption and expand application scenarios. We propose a nonvolatile 2*2 silicon photonic micro-electromechanical system (MEMS) switch compatible with standard silicon photonic foundry processes. The switch employs electrost… ▽ More

    Submitted 2 July, 2024; v1 submitted 19 June, 2024; originally announced July 2024.

  5. arXiv:2406.17645  [pdf, other

    cond-mat.str-el cond-mat.dis-nn physics.comp-ph

    Simulating moiré quantum matter with neural network

    Authors: Di Luo, David D. Dai, Liang Fu

    Abstract: Moiré materials provide an ideal platform for exploring quantum phases of matter. However, solving the many-electron problem in moiré systems is challenging due to strong correlation effects. We introduce a powerful variational representation of quantum states, many-body neural Bloch wavefunction, to solve many-electron problems in moiré materials accurately and efficiently. Applying our method to… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  6. arXiv:2406.11931  [pdf, other

    cs.SE cs.AI cs.LG

    DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

    Authors: DeepSeek-AI, Qihao Zhu, Daya Guo, Zhihong Shao, Dejian Yang, Peiyi Wang, Runxin Xu, Y. Wu, Yukun Li, Huazuo Gao, Shirong Ma, Wangding Zeng, Xiao Bi, Zihui Gu, Hanwei Xu, Damai Dai, Kai Dong, Liyue Zhang, Yishi Piao, Zhibin Gou, Zhenda Xie, Zhewen Hao, Bingxuan Wang, Junxiao Song, Deli Chen , et al. (15 additional authors not shown)

    Abstract: We present DeepSeek-Coder-V2, an open-source Mixture-of-Experts (MoE) code language model that achieves performance comparable to GPT4-Turbo in code-specific tasks. Specifically, DeepSeek-Coder-V2 is further pre-trained from an intermediate checkpoint of DeepSeek-V2 with additional 6 trillion tokens. Through this continued pre-training, DeepSeek-Coder-V2 substantially enhances the coding and mathe… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  7. arXiv:2405.17799  [pdf, other

    cs.LG cs.CL

    Exploring Activation Patterns of Parameters in Language Models

    Authors: Yudong Wang, Damai Dai, Zhifang Sui

    Abstract: Most work treats large language models as black boxes without in-depth understanding of their internal working mechanism. In order to explain the internal representations of LLMs, we propose a gradient-based metric to assess the activation level of model parameters. Based on this metric, we obtain three preliminary findings. (1) When the inputs are in the same domain, parameters in the shallow lay… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  8. arXiv:2405.16481  [pdf, ps, other

    gr-qc astro-ph.CO

    Studies on particle creation during the universe expansion with a laser system

    Authors: De-Chang Dai, Changbo Fu

    Abstract: While two highly intensive laser beams collide, they create a region where the refractive index varies so quickly that photons are created. The variance of the refractive index is analog to the universe scale factor variance. Therefore, this laser system can be an analog to the expansion of the universe. We find that several hundreds of photons can be created under feasible conditions. This system… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: 12 page. 3 figures

    Journal ref: Modern Physics Letters A (2024) 2450070 (10 pages)

  9. arXiv:2405.08633  [pdf, other

    cond-mat.supr-con cond-mat.mtrl-sci cond-mat.str-el

    On the superconducting gap structure of the miassite Rh17S15: Nodal or nodeless?

    Authors: J. Y. Nie, C. C. Zhao, C. Q. Xu, B. Li, C. P. Tu, X. Zhang, D. Z. Dai, H. R. Wang, S. Xu, Wenhe Jiao, B. M. Wang, Zhu'an Xu, Xiaofeng Xu, S. Y. Li

    Abstract: Recent penetration depth measurement claimed the observation of unconventional superconductivity in the miassite Rh$_{17}$S$_{15}$ single crystals, evidenced by the linear-in-temperature penetration depth at low temperatures, thereby arguing for the presence of the lines of node in its superconducting gap structure. Here we measure the thermal conductivity of Rh$_{17}$S$_{15}$ single crystals down… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: 7 pages, 6 figures

  10. arXiv:2405.06274  [pdf

    physics.optics physics.app-ph

    Hybrid thin-film lithium niobate micro-ring acousto-optic modulator for microwave-to-optical conversion

    Authors: Lei Wan, Jiying Huang, Meixun Wen, Huan Li, Wenfeng Zhou, Zhiqiang Yang, Yuping Chen, Huilong Liu, Siqing Zeng, Dong Liu, Shuixian Yang, Daoxin Dai, Zhaohui Li

    Abstract: Highly efficient acousto-optic modulation plays a vital role in the microwave-to-optical conversion. Herein, we demonstrate a hybrid thin-film lithium niobate (TFLN) racetrack micro-ring acousto-optic modulator (AOM) implemented with low-loss chalcogenide (ChG) waveguide. By engineering the electrode configuration of the interdigital transducer, the double-arm micro-ring acousto-optic modulation i… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

  11. arXiv:2405.04434  [pdf, other

    cs.CL cs.AI

    DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

    Authors: DeepSeek-AI, Aixin Liu, Bei Feng, Bin Wang, Bingxuan Wang, Bo Liu, Chenggang Zhao, Chengqi Dengr, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Hanwei Xu, Hao Yang, Haowei Zhang, Honghui Ding , et al. (132 additional authors not shown)

    Abstract: We present DeepSeek-V2, a strong Mixture-of-Experts (MoE) language model characterized by economical training and efficient inference. It comprises 236B total parameters, of which 21B are activated for each token, and supports a context length of 128K tokens. DeepSeek-V2 adopts innovative architectures including Multi-head Latent Attention (MLA) and DeepSeekMoE. MLA guarantees efficient inference… ▽ More

    Submitted 19 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

  12. A Reinforcement Learning Based Backfilling Strategy for HPC Batch Jobs

    Authors: Elliot Kolker-Hicks, Di Zhang, Dong Dai

    Abstract: High Performance Computing (HPC) systems are used across a wide range of disciplines for both large and complex computations. HPC systems often receive many thousands of computational tasks at a time, colloquially referred to as jobs. These jobs must then be scheduled as optimally as possible so they can be completed within a reasonable timeframe. HPC scheduling systems often employ a technique ca… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: This paper was originally published in the Workshops of the International Conference on High Performance Computing, Networking, Storage, and Analysis (PMBS 2023). This version has been updated to address several issues identified after publication

  13. arXiv:2403.19346  [pdf, other

    cs.CL

    Large Language Models Are Unconscious of Unreasonability in Math Problems

    Authors: Jingyuan Ma, Damai Dai, Lei Sha, Zhifang Sui

    Abstract: Large language models (LLMs) demonstrate substantial capabilities in solving math problems. However, they tend to produce hallucinations when given questions containing unreasonable errors. In this paper, we study the behavior of LLMs when faced with unreasonable math problems and further explore their potential to address these problems. We construct the Unreasonable Math Problem (UMP) benchmark… ▽ More

    Submitted 16 April, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: 11 pages, 3 figures

  14. arXiv:2403.17253  [pdf, ps, other

    quant-ph cond-mat.mes-hall physics.atom-ph physics.optics

    Convert laser light into single photons via interference

    Authors: Yanfeng Li, Manman Wang, Guoqi Huang, Li Liu, Wenyan Wang, Weijie Ji, Hanqing Liu, Xiangbin Su, Shulun Li, Deyan Dai, Xiangjun Shang, Haiqiao Ni, Zhichuan Niu, Chengyong Hu

    Abstract: Laser light possesses perfect coherence, but cannot be attenuated to single photons via linear optics. An elegant route to convert laser light into single photons is based on photon blockade in a cavity with a single atom in the strong coupling regime. However, the single-photon purity achieved by this method remains relatively low. Here we propose an interference-based approach where laser light… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Comments are welcome

  15. arXiv:2403.16475  [pdf, ps, other

    math.PR math-ph math.CA

    Asymptotics of the confluent hypergeometric process with a varying external potential in the super-exponential region

    Authors: Dan Dai, Luming Yao, Yu Zhai

    Abstract: In this paper, we investigate a determinantal point process on the interval $(-s,s)$, associated with the confluent hypergeometric kernel. Let $\mathcal{K}^{(α,β)}_s$ denote the trace class integral operator acting on $L^2(-s, s)$ with the confluent hypergeometric kernel. Our focus is on deriving the asymptotics of the Fredholm determinant $\det(I-γ\mathcal{K}^{(α,β)}_s)$ as $s \to +\infty$, while… ▽ More

    Submitted 5 May, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

    MSC Class: 33C10; 34M50; 82B26; 45C05

  16. arXiv:2403.05010  [pdf, other

    cs.SD cs.AI eess.AS

    RFWave: Multi-band Rectified Flow for Audio Waveform Reconstruction

    Authors: Peng Liu, Dongyang Dai, Zhiyong Wu

    Abstract: Recent advancements in generative modeling have significantly enhanced the reconstruction of audio waveforms from various representations. While diffusion models are adept at this task, they are hindered by latency issues due to their operation at the individual sample point level and the need for numerous sampling steps. In this study, we introduce RFWave, a cutting-edge multi-band Rectified Flow… ▽ More

    Submitted 2 June, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

  17. arXiv:2403.02894  [pdf

    eess.SP

    DIFNet: SAR RFI suppression based on domain invariant features

    Authors: Fuping Fang, Wenhao Lv, Dahai Dai

    Abstract: Synthetic aperture radar is a high-resolution two-dimensional imaging radar, however, during the imaging process, SAR is susceptible to intentional and unintentional interference, with radio frequency interference (RFI) being the most common type, leading to a severe degradation in image quality. Although inpainting networks have achieved excellent results, their generalization is unclear, and whe… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: five pages

  18. arXiv:2403.02665  [pdf, other

    cs.DS cs.DC cs.PF

    DGAP: Efficient Dynamic Graph Analysis on Persistent Memory

    Authors: Abdullah Al Raqibul Islam, Dong Dai

    Abstract: Dynamic graphs, featuring continuously updated vertices and edges, have grown in importance for numerous real-world applications. To accommodate this, graph frameworks, particularly their internal data structures, must support both persistent graph updates and rapid graph analysis simultaneously, leading to complex designs to orchestrate `fast but volatile' and `persistent but slow' storage device… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  19. arXiv:2402.16141  [pdf, other

    cs.CL

    PeriodicLoRA: Breaking the Low-Rank Bottleneck in LoRA Optimization

    Authors: Xiangdi Meng, Damai Dai, Weiyao Luo, Zhe Yang, Shaoxiang Wu, Xiaochen Wang, Peiyi Wang, Qingxiu Dong, Liang Chen, Zhifang Sui

    Abstract: Supervised fine-tuning is the most common method to adapt large language models (LLMs) to downstream tasks, but full fine-tuning LLMs requires massive computational resources. Recently, parameter-efficient fine-tuning (PEFT) methods have been widely studied due to its cost-effectiveness. LoRA is one of the most widely used methods, which assumes that the optimization process is essentially low-dim… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

  20. arXiv:2402.16032  [pdf

    physics.optics physics.app-ph

    Four-Channel WDM Graphene Optical Receiver

    Authors: Laiwen Yu, Yurui Li, Hengtai Xiang, Yuanrong Li, Hengzhen Cao, Zhongyang Ji, Liu Liu, Xi Xiao, Jianbo Yin, Jingshu Guo, Daoxin Dai

    Abstract: Silicon photonics with the advantages of low power consumption, low cost, and high yield is a crucial technology for facilitating high-capacity optical communications and interconnects. The graphene photodetectors (GPDs) featuring broadband operation, high speed, and low integration cost can be good additions to the conventional SiGe photodetectors, supporting silicon-integrated on-chip photodetec… ▽ More

    Submitted 2 March, 2024; v1 submitted 25 February, 2024; originally announced February 2024.

  21. arXiv:2402.05247  [pdf, other

    physics.flu-dyn

    A Geometric VOF Method for Interface Flow Simulations

    Authors: Dezhi Dai, Haomin Yuan, Albert Y. Tong, Adrian Tentner

    Abstract: A novel numerical technique designed for interface flow simulations using the Volume of Fluid (VOF) method on arbitrary unstructured meshes has been introduced. The method is called SimPLIC, which seamlessly integrates Piecewise Linear Interface Calculation (PLIC) and Simpson's rule. The main focus of the proposed method is to compute the volume of the primary phase that moves across a mesh face w… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  22. arXiv:2401.17544  [pdf, other

    cs.LG cs.CV

    Trainable Fixed-Point Quantization for Deep Learning Acceleration on FPGAs

    Authors: Dingyi Dai, Yichi Zhang, Jiahao Zhang, Zhanqiu Hu, Yaohui Cai, Qi Sun, Zhiru Zhang

    Abstract: Quantization is a crucial technique for deploying deep learning models on resource-constrained devices, such as embedded FPGAs. Prior efforts mostly focus on quantizing matrix multiplications, leaving other layers like BatchNorm or shortcuts in floating-point form, even though fixed-point arithmetic is more efficient on FPGAs. A common practice is to fine-tune a pre-trained model to fixed-point fo… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

  23. arXiv:2401.08045  [pdf, other

    cs.CV

    Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities

    Authors: Xu Yan, Haiming Zhang, Yingjie Cai, Jingming Guo, Weichao Qiu, Bin Gao, Kaiqiang Zhou, Yue Zhao, Huan Jin, Jiantao Gao, Zhen Li, Lihui Jiang, Wei Zhang, Hongbo Zhang, Dengxin Dai, Bingbing Liu

    Abstract: The rise of large foundation models, trained on extensive datasets, is revolutionizing the field of AI. Models such as SAM, DALL-E2, and GPT-4 showcase their adaptability by extracting intricate patterns and performing effectively across diverse tasks, thereby serving as potent building blocks for a wide range of AI applications. Autonomous driving, a vibrant front in AI applications, remains chal… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: Github Repo: https://github.com/zhanghm1995/Forge_VFM4AD

  24. arXiv:2401.06066  [pdf, other

    cs.CL

    DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

    Authors: Damai Dai, Chengqi Deng, Chenggang Zhao, R. X. Xu, Huazuo Gao, Deli Chen, Jiashi Li, Wangding Zeng, Xingkai Yu, Y. Wu, Zhenda Xie, Y. K. Li, Panpan Huang, Fuli Luo, Chong Ruan, Zhifang Sui, Wenfeng Liang

    Abstract: In the era of large language models, Mixture-of-Experts (MoE) is a promising architecture for managing computational costs when scaling up model parameters. However, conventional MoE architectures like GShard, which activate the top-$K$ out of $N$ experts, face challenges in ensuring expert specialization, i.e. each expert acquires non-overlapping and focused knowledge. In response, we propose the… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

  25. arXiv:2401.03735  [pdf, other

    cs.CL

    Language Models Know the Value of Numbers

    Authors: Fangwei Zhu, Damai Dai, Zhifang Sui

    Abstract: Large language models (LLMs) have exhibited impressive competence in various tasks, but their internal mechanisms on mathematical problems are still under-explored. In this paper, we study a fundamental question: whether language models know the value of numbers, a basic element in math. To study the question, we construct a synthetic dataset comprising addition problems and utilize linear probes… ▽ More

    Submitted 9 June, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

  26. arXiv:2401.02954  [pdf, other

    cs.CL cs.AI cs.LG

    DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

    Authors: DeepSeek-AI, :, Xiao Bi, Deli Chen, Guanting Chen, Shanhuang Chen, Damai Dai, Chengqi Deng, Honghui Ding, Kai Dong, Qiushi Du, Zhe Fu, Huazuo Gao, Kaige Gao, Wenjun Gao, Ruiqi Ge, Kang Guan, Daya Guo, Jianzhong Guo, Guangbo Hao, Zhewen Hao, Ying He, Wenjie Hu, Panpan Huang, Erhang Li , et al. (63 additional authors not shown)

    Abstract: The rapid development of open-source large language models (LLMs) has been truly remarkable. However, the scaling law described in previous literature presents varying conclusions, which casts a dark cloud over scaling LLMs. We delve into the study of scaling laws and present our distinctive findings that facilitate scaling of large scale models in two commonly used open-source configurations, 7B… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

  27. arXiv:2401.00371  [pdf

    cs.CV

    Multi-Granularity Representation Learning for Sketch-based Dynamic Face Image Retrieval

    Authors: Liang Wang, Dawei Dai, Shiyu Fu, Guoyin Wang

    Abstract: In specific scenarios, face sketch can be used to identify a person. However, drawing a face sketch often requires exceptional skill and is time-consuming, limiting its widespread applications in actual scenarios. The new framework of sketch less face image retrieval (SLFIR)[1] attempts to overcome the barriers by providing a means for humans and machines to interact during the drawing process. Co… ▽ More

    Submitted 30 December, 2023; originally announced January 2024.

    Comments: 5 pages,5 figures

  28. arXiv:2312.08935  [pdf, other

    cs.AI cs.CL cs.LG

    Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations

    Authors: Peiyi Wang, Lei Li, Zhihong Shao, R. X. Xu, Damai Dai, Yifei Li, Deli Chen, Y. Wu, Zhifang Sui

    Abstract: In this paper, we present an innovative process-oriented math process reward model called \textbf{Math-Shepherd}, which assigns a reward score to each step of math problem solutions. The training of Math-Shepherd is achieved using automatically constructed process-wise supervision data, breaking the bottleneck of heavy reliance on manual annotation in existing work. We explore the effectiveness of… ▽ More

    Submitted 19 February, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: Add Step-by-Step reinforcement learning results

  29. arXiv:2312.03251  [pdf

    cond-mat.mes-hall cond-mat.str-el

    Electrically controlled interlayer trion fluid in electron-hole bilayers

    Authors: Ruishi Qi, Qize Li, Zuocheng Zhang, Sudi Chen, Jingxu Xie, Yunbo Ou, Zhiyuan Cui, David D. Dai, Andrew Y. Joe, Takashi Taniguchi, Kenji Watanabe, Sefaattin Tongay, Alex Zettl, Liang Fu, Feng Wang

    Abstract: The combination of repulsive and attractive Coulomb interactions in a quantum electron(e)-hole(h) fluid can give rise to novel correlated phases of multiparticle charge complexes such as excitons, trions and biexcitons. Here we report the first experimental realization of an electrically controlled interlayer trion fluid in two-dimensional van der Waals heterostructures. We demonstrate that in the… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

  30. arXiv:2311.15605  [pdf, other

    cs.CV

    2D Feature Distillation for Weakly- and Semi-Supervised 3D Semantic Segmentation

    Authors: Ozan Unal, Dengxin Dai, Lukas Hoyer, Yigit Baran Can, Luc Van Gool

    Abstract: As 3D perception problems grow in popularity and the need for large-scale labeled datasets for LiDAR semantic segmentation increase, new methods arise that aim to reduce the necessity for dense annotations by employing weakly-supervised training. However these methods continue to show weak boundary estimation and high false negative rates for small objects and distant sparse regions. We argue that… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: Accepted at WACV 2024

  31. arXiv:2311.10572  [pdf, other

    cs.CV cs.LG

    SSB: Simple but Strong Baseline for Boosting Performance of Open-Set Semi-Supervised Learning

    Authors: Yue Fan, Anna Kukleva, Dengxin Dai, Bernt Schiele

    Abstract: Semi-supervised learning (SSL) methods effectively leverage unlabeled data to improve model generalization. However, SSL models often underperform in open-set scenarios, where unlabeled data contain outliers from novel categories that do not appear in the labeled set. In this paper, we study the challenging and realistic open-set SSL setting, where the goal is to both correctly classify inliers an… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: Paper accepted in ICCV 2023

  32. arXiv:2311.05494  [pdf, other

    cs.CV cs.RO

    Object-centric Cross-modal Feature Distillation for Event-based Object Detection

    Authors: Lei Li, Alexander Liniger, Mario Millhaeusler, Vagia Tsiminaki, Yuanyou Li, Dengxin Dai

    Abstract: Event cameras are gaining popularity due to their unique properties, such as their low latency and high dynamic range. One task where these benefits can be crucial is real-time object detection. However, RGB detectors still outperform event-based detectors due to the sparsity of the event data and missing visual details. In this paper, we develop a novel knowledge distillation approach to shrink t… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: 12 pages, 8 figures

  33. arXiv:2311.04501  [pdf, other

    cs.CV

    PRED: Pre-training via Semantic Rendering on LiDAR Point Clouds

    Authors: Hao Yang, Haiyang Wang, Di Dai, Liwei Wang

    Abstract: Pre-training is crucial in 3D-related fields such as autonomous driving where point cloud annotation is costly and challenging. Many recent studies on point cloud pre-training, however, have overlooked the issue of incompleteness, where only a fraction of the points are captured by LiDAR, leading to ambiguity during the training phase. On the other hand, images offer more comprehensive information… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

  34. arXiv:2311.02143  [pdf, other

    cond-mat.str-el cond-mat.dis-nn cs.LG physics.comp-ph quant-ph

    Pairing-based graph neural network for simulating quantum materials

    Authors: Di Luo, David D. Dai, Liang Fu

    Abstract: We develop a pairing-based graph neural network for simulating quantum many-body systems. Our architecture augments a BCS-type geminal wavefunction with a generalized pair amplitude parameterized by a graph neural network. Variational Monte Carlo with our neural network simultaneously provides an accurate, flexible, and scalable method for simulating many-electron systems. We apply this method to… ▽ More

    Submitted 21 November, 2023; v1 submitted 3 November, 2023; originally announced November 2023.

    Report number: MIT-CTP/5634

  35. arXiv:2310.13766  [pdf, other

    cs.CV

    U-BEV: Height-aware Bird's-Eye-View Segmentation and Neural Map-based Relocalization

    Authors: Andrea Boscolo Camiletto, Alfredo Bochicchio, Alexander Liniger, Dengxin Dai, Abel Gawel

    Abstract: Efficient relocalization is essential for intelligent vehicles when GPS reception is insufficient or sensor-based localization fails. Recent advances in Bird's-Eye-View (BEV) segmentation allow for accurate estimation of local scene appearance and in turn, can benefit the relocalization of the vehicle. However, one downside of BEV methods is the heavy computation required to leverage the geometric… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  36. arXiv:2310.08838  [pdf, other

    quant-ph physics.optics

    Higher-dimensional symmetric informationally complete measurement via programmable photonic integrated optics

    Authors: Lan-Tian Feng, Xiao-Min Hu, Ming Zhang, Yu-Jie Cheng, Chao Zhang, Yu Guo, Yu-Yang Ding, Zhibo Hou, Fang-Wen Sun, Guang-Can Guo, Dao-Xin Dai, Armin Tavakoli, Xi-Feng Ren, Bi-Heng Liu

    Abstract: Symmetric informationally complete measurements are both important building blocks in many quantum information protocols and the seminal example of a generalised, non-orthogonal, quantum measurement. In higher-dimensional systems, these measurements become both increasingly interesting and increasingly complex to implement. Here, we demonstrate an integrated quantum photonic platform to realize su… ▽ More

    Submitted 16 October, 2023; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: 21 pages,13 figures

  37. arXiv:2310.08462  [pdf, other

    cond-mat.supr-con

    Multigap nodeless superconductivity in the topological semimetal PdTe

    Authors: Chengcheng Zhao, Xiangqi Liu, Jinjin Wang, Chunqiang Xu, Baomin Wang, Wei Xia, Zhenhai Yu, Xiaobo Jin, Xu Zhang, Jing Wang, Dongzhe Dai, Chengpeng Tu, Jiaying Nie, Hanru Wang, Yihan Jiao, Daniel Duong, Silu Huang, Rongying Jin, Zhu'an Xu, Yanfeng Guo, Xiaofeng Xu, Shiyan Li

    Abstract: Recently PdTe was identified as a spin-orbit coupled topological Dirac semimetal and was claimed to exhibit both bulk-nodal and surface-nodeless superconducting gaps. Here we report the ultralow-temperature thermal conductivity measurements on PdTe single crystals with $T_c$ = 4.5 K to investigate its superconducting gap structure. It is found that the residual linear term $κ_0/T$ is negligible in… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

  38. arXiv:2310.08309  [pdf, other

    cs.CL

    Not All Demonstration Examples are Equally Beneficial: Reweighting Demonstration Examples for In-Context Learning

    Authors: Zhe Yang, Damai Dai, Peiyi Wang, Zhifang Sui

    Abstract: Large Language Models (LLMs) have recently gained the In-Context Learning (ICL) ability with the models scaling up, allowing them to quickly adapt to downstream tasks with only a few demonstration examples prepended in the input sequence. Nonetheless, the current practice of ICL treats all demonstration examples equally, which still warrants improvement, as the quality of examples is usually uneve… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: Findings of EMNLP 2023

  39. arXiv:2310.06229  [pdf, other

    math.NA

    Energy Stable and Structure-Preserving Schemes for the Stochastic Galerkin Shallow Water Equations

    Authors: Dihan Dai, Yekaterina Epshteyn, Akil Narayan

    Abstract: The shallow water flow model is widely used to describe water flows in rivers, lakes, and coastal areas. Accounting for uncertainty in the corresponding transport-dominated nonlinear PDE models presents theoretical and numerical challenges that motivate the central advances of this paper. Starting with a spatially one-dimensional hyperbolicity-preserving, positivity-preserving stochastic Galerkin… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    MSC Class: 35L65; 35Q35; 35R60; 65M60; 65M70; 65M08

  40. arXiv:2309.14795  [pdf, other

    physics.optics physics.app-ph

    Assessing the alignment accuracy of state-of-the-art deterministic fabrication methods for single quantum dot devices

    Authors: Abdulmalik A. Madigawa, Jan N. Donges, Benedek Gaál, Shulun Li, Martin Arentoft Jacobsen, Hanqing Liu, Deyan Dai, Xiangbin Su, Xiangjun Shang, Haiqiao Ni, Johannes Schall, Sven Rodt, Zhichuan Niu, Niels Gregersen, Stephan Reitzenstein, Battulga Munkhbat

    Abstract: The realization of efficient quantum light sources relies on the integration of self-assembled quantum dots (QDs) into photonic nanostructures with high spatial positioning accuracy. In this work, we present a comprehensive investigation of the QD position accuracy, obtained using two marker-based QD positioning techniques, photoluminescence (PL) and cathodoluminescence (CL) imaging, as well as us… ▽ More

    Submitted 29 January, 2024; v1 submitted 26 September, 2023; originally announced September 2023.

  41. Shedding new light on the absence of fermionic superradiance and maximal infalling rate of fermions into a black hole

    Authors: De-Chang Dai, Dejan Stojkovic

    Abstract: Using the complete classification of the bases in the rotating black hole background we separate superradiance from the Hawking effect. We first find that there is spontaneous particle creation for fermions by the potential outside the black hole horizon for the frequencies inside the superradiant regime, i.e. $ω<kΩ_H$. However, these particles do not enhance the total flux from the black hole. Fo… ▽ More

    Submitted 23 September, 2023; originally announced September 2023.

    Comments: 6 pages, 3 figures. arXiv admin note: text overlap with arXiv:2306.17423

    Journal ref: Phys. Rev. D 108 (2023) 084024

  42. arXiv:2309.13276  [pdf, other

    cs.CV

    Discwise Active Learning for LiDAR Semantic Segmentation

    Authors: Ozan Unal, Dengxin Dai, Ali Tamer Unal, Luc Van Gool

    Abstract: While LiDAR data acquisition is easy, labeling for semantic segmentation remains highly time consuming and must therefore be done selectively. Active learning (AL) provides a solution that can iteratively and intelligently label a dataset while retaining high performance and a low budget. In this work we explore AL for LiDAR semantic segmentation. As a human expert is a component of the pipeline,… ▽ More

    Submitted 23 September, 2023; originally announced September 2023.

    Comments: Accepted at IEEE RA-L

  43. arXiv:2308.10129  [pdf, other

    cond-mat.supr-con cond-mat.mtrl-sci cond-mat.str-el

    Pressure-induced double-dome superconductivity in kagome metal CsTi3Bi5

    Authors: J. Y. Nie, X. F. Yang, X. Zhang, X. Q. Liu, W. Xia, D. Z. Dai, C. C. Zhao, C. P. Tu, X. M. Kong, X. B. Jin, Y. F. Guo, S. Y. Li

    Abstract: We present high-pressure resistance measurements up to 40 GPa on recently discovered titanium-based kagome metal CsTi$_3$Bi$_5$. At ambient pressure, CsTi$_3$Bi$_5$ shows no evidence of superconductivity in resistivity and specific heat. By applying pressure, superconductivity emerges and the superconducting transition temperature ${\it T}_{\rm c}$ reaches its first maximum of 1.2 K at $\sim$5 GPa… ▽ More

    Submitted 19 August, 2023; originally announced August 2023.

    Comments: 7 pages, 5 figures

  44. arXiv:2308.00891  [pdf, other

    cs.DC

    PROV-IO+: A Cross-Platform Provenance Framework for Scientific Data on HPC Systems

    Authors: Runzhou Han, Mai Zheng, Suren Byna, Houjun Tang, Bin Dong, Dong Dai, Yong Chen, Dongkyun Kim, Joseph Hassoun, David Thorsley, Matthew Wolf

    Abstract: Data provenance, or data lineage, describes the life cycle of data. In scientific workflows on HPC systems, scientists often seek diverse provenance (e.g., origins of data products, usage patterns of datasets). Unfortunately, existing provenance solutions cannot address the challenges due to their incompatible provenance models and/or system implementations. In this paper, we analyze four represen… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

  45. arXiv:2308.00825  [pdf, other

    cond-mat.str-el cond-mat.mes-hall

    Strong-coupling phases of trions and excitons in electron-hole bilayers at commensurate densities

    Authors: David D. Dai, Liang Fu

    Abstract: We introduce density imbalanced electron-hole bilayers at a commensurate 2 : 1 density ratio as a platform for realizing novel phases involving electrons, excitons and trions. Three length scales are identified which characterize the interplay between kinetic energy, intralayer repulsion, and interlayer attraction. By a combination of theoretical analysis and numerical calculation, we find a varie… ▽ More

    Submitted 26 May, 2024; v1 submitted 1 August, 2023; originally announced August 2023.

    Comments: Published in PRL. Revised main text and supplement. Main: 6 pages, 4 figures. SM: 8 pages, 6 figures

    Journal ref: Phys. Rev. Lett. 132, 196202 (2024)

  46. arXiv:2307.12761  [pdf, other

    cs.CV

    LiDAR Meta Depth Completion

    Authors: Wolfgang Boettcher, Lukas Hoyer, Ozan Unal, Ke Li, Dengxin Dai

    Abstract: Depth estimation is one of the essential tasks to be addressed when creating mobile autonomous systems. While monocular depth estimation methods have improved in recent times, depth completion provides more accurate and reliable depth maps by additionally using sparse depth information from other sensors such as LiDAR. However, current methods are specifically trained for a single LiDAR sensor. As… ▽ More

    Submitted 16 August, 2023; v1 submitted 24 July, 2023; originally announced July 2023.

    Comments: Accepted at IROS 2023, v2 has updated author list and fixed a figure caption

  47. arXiv:2307.07847  [pdf, other

    cs.NI cs.CV cs.LG cs.MM

    Enabling Real-time Neural Recovery for Cloud Gaming on Mobile Devices

    Authors: Zhaoyuan He, Yifan Yang, Shuozhe Li, Diyuan Dai, Lili Qiu, Yuqing Yang

    Abstract: Cloud gaming is a multi-billion dollar industry. A client in cloud gaming sends its movement to the game server on the Internet, which renders and transmits the resulting video back. In order to provide a good gaming experience, a latency below 80 ms is required. This means that video rendering, encoding, transmission, decoding, and display have to finish within that time frame, which is especiall… ▽ More

    Submitted 22 October, 2023; v1 submitted 15 July, 2023; originally announced July 2023.

  48. arXiv:2307.02989  [pdf, other

    cond-mat.supr-con cond-mat.mtrl-sci cond-mat.str-el

    Pressure-induced superconductivity in the van der Waals semiconductor violet phosphorus

    Authors: Y. Y. Wu, L. Mu, X. Zhang, D. Z. Dai, L. Xin, X. M. Kong, S. Y. Huang, K. Meng, X. F. Yang, C. P. Tu, J. M. Ni, H. G. Yan, S. Y. Li

    Abstract: The van der Waals (vdW) semiconductor black phosphorus has been widely studied, especially after the discovery of phosphorene. On the contrary, its sister compound violet phosphorus, also a vdW semiconductor, has been rarely studied. Here we report the pressure-induced superconductivity in violet phosphorus up to $\sim$40 GPa. The superconductivity emerges at 2.75 GPa, which is well below the stru… ▽ More

    Submitted 6 July, 2023; originally announced July 2023.

    Comments: 6 pages, 4 figures

  49. arXiv:2306.17770  [pdf, other

    cs.CV

    MTR++: Multi-Agent Motion Prediction with Symmetric Scene Modeling and Guided Intention Querying

    Authors: Shaoshuai Shi, Li Jiang, Dengxin Dai, Bernt Schiele

    Abstract: Motion prediction is crucial for autonomous driving systems to understand complex driving scenarios and make informed decisions. However, this task is challenging due to the diverse behaviors of traffic participants and complex environmental contexts. In this paper, we propose Motion TRansformer (MTR) frameworks to address these challenges. The initial MTR framework utilizes a transformer encoder-… ▽ More

    Submitted 9 March, 2024; v1 submitted 30 June, 2023; originally announced June 2023.

    Comments: Accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI 2024). The winning approaches for the Waymo Motion Prediction Challenge in 2022 and 2023

  50. Separating the superradiant emission from the Hawking radiation from a rotating black hole

    Authors: De-Chang Dai, Dejan Stojkovic

    Abstract: Emission of particles created in the background of a rotating black hole can be greatly amplified taking away rotational energy of a black hole. This amplification affects both particles created near the horizon (due to the Hawing effect), and particles created near the potential barrier far from the horizon. Only the latter effect is called the superradiance in the strict sense. We explicitly cal… ▽ More

    Submitted 15 July, 2023; v1 submitted 30 June, 2023; originally announced June 2023.

    Comments: 5 pages 4 figures

    Journal ref: Physics Letters B, Volume 843, 10 August 2023, 138056