Skip to main content

Showing 1–50 of 159 results for author: Nie, X

  1. arXiv:2407.05389  [pdf, other

    cs.CV cs.AI

    Image-Conditional Diffusion Transformer for Underwater Image Enhancement

    Authors: Xingyang Nie, Su Pan, Xiaoyu Zhai, Shifei Tao, Fengzhong Qu, Biao Wang, Huilin Ge, Guojie Xiao

    Abstract: Underwater image enhancement (UIE) has attracted much attention owing to its importance for underwater operation and marine engineering. Motivated by the recent advance in generative models, we propose a novel UIE method based on image-conditional diffusion transformer (ICDT). Our method takes the degraded underwater image as the conditional input and converts it into latent space where ICDT is ap… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

  2. arXiv:2407.04739  [pdf, other

    eess.SP

    Classification of Power Quality Disturbances Using Resnet with Channel Attention Mechanism

    Authors: Su Pan, Xingyang Nie, Xiaoyu Zhai, Biao Wang, Huilin Ge, Cheng He, Zhenping Ding

    Abstract: The detection and classification of power quality disturbances (PQDs) carries significant importance for power systems. In response to this imperative, numerous intelligent diagnostic methods have been developed. However, existing identification methods usually concentrate on single-type signals or on complex signals with two types, rendering them susceptible to noisy labels and environmental effe… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  3. arXiv:2407.00173  [pdf, other

    math.OC

    Approximate Solutions for Multi-Trip Route Planning in Time-Sensitive Situations

    Authors: Bahar Cavdar, Joseph Geunes, Xiaofeng Nie, Yue Wang

    Abstract: We consider emergent situations that require transporting individuals from their locations to a facility using a single capacitated vehicle, where transportation duration has a negative impact on the individuals. A dispatcher determines routes to maximize total satisfaction. We call this problem the Ambulance Bus Routing Problem. We develop efficient approximate policies for the dispatcher to allo… ▽ More

    Submitted 28 June, 2024; originally announced July 2024.

  4. arXiv:2406.11739  [pdf, other

    cs.CV

    V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results

    Authors: Jiaqi Wang, Yuhang Zang, Pan Zhang, Tao Chu, Yuhang Cao, Zeyi Sun, Ziyu Liu, Xiaoyi Dong, Tong Wu, Dahua Lin, Zeming Chen, Zhi Wang, Lingchen Meng, Wenhao Yao, Jianwei Yang, Sihong Wu, Zhineng Chen, Zuxuan Wu, Yu-Gang Jiang, Peixi Wu, Bosong Chai, Xuan Nie, Longquan Yan, Zeyu Wang, Qifan Zhou , et al. (9 additional authors not shown)

    Abstract: Detecting objects in real-world scenes is a complex task due to various challenges, including the vast range of object categories, and potential encounters with previously unknown or unseen objects. The challenges necessitate the development of public benchmarks and challenges to advance the field of object detection. Inspired by the success of previous COCO and LVIS Challenges, we organize the V3… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  5. arXiv:2406.09961  [pdf, other

    cs.SE cs.CL cs.CV

    ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation

    Authors: Chufan Shi, Cheng Yang, Yaxin Liu, Bo Shui, Junjie Wang, Mohan Jing, Linran Xu, Xinyu Zhu, Siheng Li, Yuxiang Zhang, Gongye Liu, Xiaomei Nie, Deng Cai, Yujiu Yang

    Abstract: We introduce a new benchmark, ChartMimic, aimed at assessing the visually-grounded code generation capabilities of large multimodal models (LMMs). ChartMimic utilizes information-intensive visual charts and textual instructions as inputs, requiring LMMs to generate the corresponding code for chart rendering. ChartMimic includes 1,000 human-curated (figure, instruction, code) triplets, which repres… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Data and code are available at https://github.com/ChartMimic/ChartMimic

  6. arXiv:2406.09201  [pdf, other

    cs.CV

    Enhanced Object Detection: A Study on Vast Vocabulary Object Detection Track for V3Det Challenge 2024

    Authors: Peixi Wu, Bosong Chai, Xuan Nie, Longquan Yan, Zeyu Wang, Qifan Zhou, Boning Wang, Yansong Peng, Hebei Li

    Abstract: In this technical report, we present our findings from the research conducted on the Vast Vocabulary Visual Detection (V3Det) dataset for Supervised Vast Vocabulary Visual Detection task. How to deal with complex categories and detection boxes has become a difficulty in this track. The original supervised detector is not suitable for this task. We have designed a series of improvements, including… ▽ More

    Submitted 21 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Journal ref: Second Place in CVPR 2024 Vast Vocabulary Visual Detection Challenge

  7. arXiv:2406.04504  [pdf, other

    math.NA math-ph

    Mixed Finite Element Method for Multi-layer Elastic Contact Systems

    Authors: Zhizhuo Zhang, Mikaël Barboteu, Xiaobing Nie, Serge Dumont, Mahmoud Abdel-Aty, Jinde Cao

    Abstract: With the development of multi-layer elastic systems in the field of engineering mechanics, the corresponding variational inequality theory and algorithm design have received more attention and research. In this study, a class of equivalent saddle point problems with interlayer Tresca friction conditions and the mixed finite element method are proposed and analyzed. Then, the convergence of the num… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  8. arXiv:2406.04499  [pdf, other

    math.NA math-ph

    A layer decomposition method for multi-layer elastic contact systems with interlayer Tresca friction

    Authors: Zhizhuo Zhang, Xiaobing Nie, Mikaël Barboteu, Jinde Cao

    Abstract: With the increasing demand for the accuracy of numerical simulation of pavement mechanics, the variational inequality model and its induced finite element method which can simulate the interlayer contact state becomes a potential solution. In this paper, a layer decomposition algorithm for solving variational inequality models of multi-layer elastic contact systems with interlayer Tresca friction… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  9. arXiv:2406.01951  [pdf, other

    quant-ph

    Experimental Validation of Enhanced Information Capacity by Quantum Switch in Accordance with Thermodynamic Laws

    Authors: Cheng Xi, Xiangjing Liu, Hongfeng Liu, Keyi Huang, Xinyue Long, Daniel Ebler, Xinfang Nie, Oscar Dahlsten, Dawei Lu

    Abstract: We experimentally probe the interplay of the quantum switch with the laws of thermodynamics. The quantum switch places two channels in a superposition of orders and may be applied to thermalizing channels. Quantum-switching thermal channels has been shown to give apparent violations of the second law. Central to these apparent violations is how quantum switching channels can increase the capacity… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 6 pages, 4 figures. Comments are welcome!

  10. arXiv:2405.17188  [pdf, other

    cs.CV

    The SkatingVerse Workshop & Challenge: Methods and Results

    Authors: Jian Zhao, Lei Jin, Jianshu Li, Zheng Zhu, Yinglei Teng, Jiaojiao Zhao, Sadaf Gulshad, Zheng Wang, Bo Zhao, Xiangbo Shu, Yunchao Wei, Xuecheng Nie, Xiaojie Jin, Xiaodan Liang, Shin'ichi Satoh, Yandong Guo, Cewu Lu, Junliang Xing, Jane Shen Shengmei

    Abstract: The SkatingVerse Workshop & Challenge aims to encourage research in developing novel and accurate methods for human action understanding. The SkatingVerse dataset used for the SkatingVerse Challenge has been publicly released. There are two subsets in the dataset, i.e., the training subset and testing subset. The training subsets consists of 19,993 RGB video sequences, and the testing subsets cons… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  11. arXiv:2405.04434  [pdf, other

    cs.CL cs.AI

    DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

    Authors: DeepSeek-AI, Aixin Liu, Bei Feng, Bin Wang, Bingxuan Wang, Bo Liu, Chenggang Zhao, Chengqi Dengr, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Hanwei Xu, Hao Yang, Haowei Zhang, Honghui Ding , et al. (132 additional authors not shown)

    Abstract: We present DeepSeek-V2, a strong Mixture-of-Experts (MoE) language model characterized by economical training and efficient inference. It comprises 236B total parameters, of which 21B are activated for each token, and supports a context length of 128K tokens. DeepSeek-V2 adopts innovative architectures including Multi-head Latent Attention (MLA) and DeepSeekMoE. MLA guarantees efficient inference… ▽ More

    Submitted 19 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

  12. arXiv:2405.00263  [pdf, other

    cs.CL cs.AI cs.LG

    Clover: Regressive Lightweight Speculative Decoding with Sequential Knowledge

    Authors: Bin Xiao, Chunan Shi, Xiaonan Nie, Fan Yang, Xiangwei Deng, Lei Su, Weipeng Chen, Bin Cui

    Abstract: Large language models (LLMs) suffer from low efficiency as the mismatch between the requirement of auto-regressive decoding and the design of most contemporary GPUs. Specifically, billions to trillions of parameters must be loaded to the GPU cache through its limited memory bandwidth for computation, but only a small batch of tokens is actually computed. Consequently, the GPU spends most of its ti… ▽ More

    Submitted 30 April, 2024; originally announced May 2024.

  13. arXiv:2403.14910  [pdf, other

    cs.CV

    Defying Imbalanced Forgetting in Class Incremental Learning

    Authors: Shixiong Xu, Gaofeng Meng, Xing Nie, Bolin Ni, Bin Fan, Shiming Xiang

    Abstract: We observe a high level of imbalance in the accuracy of different classes in the same old task for the first time. This intriguing phenomenon, discovered in replay-based Class Incremental Learning (CIL), highlights the imbalanced forgetting of learned classes, as their accuracy is similar before the occurrence of catastrophic forgetting. This discovery remains previously unidentified due to the re… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: AAAI2024

  14. arXiv:2403.12768  [pdf, other

    cs.HC

    ContextVis: Envision Contextual Learning and Interaction with Generative Models

    Authors: Bo Shui, Chufan Shi, Yujiu Yang, Xiaomei Nie

    Abstract: ContextVis introduces a workflow by integrating generative models to create contextual learning materials. It aims to boost knowledge acquisition through the creation of resources with contextual cues. A case study on vocabulary learning demonstrates the effectiveness of generative models in developing educational resources that enrich language understanding and aid memory retention. The system co… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Accepted by HCII 2024

  15. arXiv:2403.09244  [pdf, other

    physics.acc-ph physics.ins-det

    High precision proton beam monitor system concept design on CSNS based on SiC

    Authors: Ye He, Xingchen Li, Zijun Xu, Ming Qi, Congcong Wang, Chenwei Wang, Hai Lu, Xiaojun Nie, Ruirui Fan, Hantao Jing, Weiming Song, Keqi Wang, Kai Liu, Peilian Liu, Hui Li, Zaiyi Li, Chenxi Fu, Xiyuan Zhang, Xiaoshen Kang, Zhan Li, Weiguo Lu, Suyu Xiao, Xin Shi

    Abstract: A high precision beam monitor system based on silicon carbide PIN sensor is designed for China Spallation Neutron Source 1.6 GeV proton beam to monitor the proton beam fluence.The concept design of the beam monitor system is finished together with front-end electronics with silicon carbide PIN sensors, readout system and mechanical system.Several tests are performed to study the performance of eac… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  16. arXiv:2403.06243  [pdf, other

    cs.CV

    BlazeBVD: Make Scale-Time Equalization Great Again for Blind Video Deflickering

    Authors: Xinmin Qiu, Congying Han, Zicheng Zhang, Bonan Li, Tiande Guo, Pingyu Wang, Xuecheng Nie

    Abstract: Developing blind video deflickering (BVD) algorithms to enhance video temporal consistency, is gaining importance amid the flourish of image processing and video generation. However, the intricate nature of video data complicates the training of deep learning methods, leading to high resource consumption and instability, notably under severe lighting flicker. This underscores the critical need for… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

  17. arXiv:2402.02045  [pdf, other

    cs.CV

    MLIP: Enhancing Medical Visual Representation with Divergence Encoder and Knowledge-guided Contrastive Learning

    Authors: Zhe Li, Laurence T. Yang, Bocheng Ren, Xin Nie, Zhangyang Gao, Cheng Tan, Stan Z. Li

    Abstract: The scarcity of annotated data has sparked significant interest in unsupervised pre-training methods that leverage medical reports as auxiliary signals for medical visual representation learning. However, existing research overlooks the multi-granularity nature of medical visual representation and lacks suitable contrastive learning techniques to improve the models' generalizability across differe… ▽ More

    Submitted 3 February, 2024; originally announced February 2024.

  18. Effects of Magnetic Helicity on 3D Equilibria and Self-Organized States in KTX Reversed Field Pinch

    Authors: Ke Liu, Guodong Yu, Yuhua Huang, Wenzhe Mao, Yidong Xie, Xianyi Nie, Hong Li, Tao Lan, Jinlin Xie, Weixing Ding, Wandong Liu, Ge Zhuang, Caoxiang Zhu

    Abstract: The RFP is a toroidal magnetic configuration in which plasmas can spontaneously transform into different self-organized states. Among various states, the QSH state has a dominant component for the magnetic field and significantly improves confinement. Many theoretical and experimental efforts have investigated the transitions among different states. This paper employs the MRxMHD model to study the… ▽ More

    Submitted 6 April, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

  19. arXiv:2401.14041  [pdf, other

    physics.plasm-ph

    Quasi-single-stage optimization for permanent magnet stellarators

    Authors: Guodong Yu, Ke Liu, Tianyi Qian, Yidong Xie, Xianyi Nie, Caoxiang Zhu

    Abstract: Advanced stellarators are typically optimized in two stages. The plasma equilibrium is optimized first, followed by the design of coils/permanent magnets. However, the coils/permanent magnets in the second stage may become too complex to achieve the desired equilibrium. To address this problem, a quasi-single-stage optimization method has been proposed. In this paper, we introduce this method for… ▽ More

    Submitted 30 April, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: 21 pages, 17 figures

  20. arXiv:2401.02954  [pdf, other

    cs.CL cs.AI cs.LG

    DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

    Authors: DeepSeek-AI, :, Xiao Bi, Deli Chen, Guanting Chen, Shanhuang Chen, Damai Dai, Chengqi Deng, Honghui Ding, Kai Dong, Qiushi Du, Zhe Fu, Huazuo Gao, Kaige Gao, Wenjun Gao, Ruiqi Ge, Kang Guan, Daya Guo, Jianzhong Guo, Guangbo Hao, Zhewen Hao, Ying He, Wenjie Hu, Panpan Huang, Erhang Li , et al. (63 additional authors not shown)

    Abstract: The rapid development of open-source large language models (LLMs) has been truly remarkable. However, the scaling law described in previous literature presents varying conclusions, which casts a dark cloud over scaling LLMs. We delve into the study of scaling laws and present our distinctive findings that facilitate scaling of large scale models in two commonly used open-source configurations, 7B… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

  21. arXiv:2312.06462  [pdf, other

    cs.CV cs.AI cs.SD eess.AS

    Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-Visual Segmentation

    Authors: Qi Yang, Xing Nie, Tong Li, Pengfei Gao, Ying Guo, Cheng Zhen, Pengfei Yan, Shiming Xiang

    Abstract: Recently, an audio-visual segmentation (AVS) task has been introduced, aiming to group pixels with sounding objects within a given video. This task necessitates a first-ever audio-driven pixel-level understanding of the scene, posing significant challenges. In this paper, we propose an innovative audio-visual transformer framework, termed COMBO, an acronym for COoperation of Multi-order Bilateral… ▽ More

    Submitted 7 April, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: CVPR 2024 Highlight. 13 pages, 10 figures

  22. arXiv:2312.01663  [pdf, other

    cs.CV cs.AI

    Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training

    Authors: Runze He, Shaofei Huang, Xuecheng Nie, Tianrui Hui, Luoqi Liu, Jiao Dai, Jizhong Han, Guanbin Li, Si Liu

    Abstract: In this paper, we target the adaptive source driven 3D scene editing task by proposing a CustomNeRF model that unifies a text description or a reference image as the editing prompt. However, obtaining desired editing results conformed with the editing prompt is nontrivial since there exist two significant challenges, including accurate editing of only foreground regions and multi-view consistency… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: 14 pages, 13 figures, project website: https://customnerf.github.io/

  23. arXiv:2310.18698  [pdf, other

    cs.CV cs.LG

    Triplet Attention Transformer for Spatiotemporal Predictive Learning

    Authors: Xuesong Nie, Xi Chen, Haoyuan Jin, Zhihang Zhu, Yunfeng Yan, Donglian Qi

    Abstract: Spatiotemporal predictive learning offers a self-supervised learning paradigm that enables models to learn both spatial and temporal patterns by predicting future sequences based on historical sequences. Mainstream methods are dominated by recurrent units, yet they are limited by their lack of parallelization and often underperform in real-world scenarios. To improve prediction quality while maint… ▽ More

    Submitted 28 October, 2023; originally announced October 2023.

    Comments: Accepted to WACV 2024

  24. arXiv:2310.07637  [pdf, other

    cs.AI cs.NI

    OpsEval: A Comprehensive IT Operations Benchmark Suite for Large Language Models

    Authors: Yuhe Liu, Changhua Pei, Longlong Xu, Bohan Chen, Mingze Sun, Zhirui Zhang, Yongqian Sun, Shenglin Zhang, Kun Wang, Haiming Zhang, Jianhui Li, Gaogang Xie, Xidao Wen, Xiaohui Nie, Minghua Ma, Dan Pei

    Abstract: Information Technology (IT) Operations (Ops), particularly Artificial Intelligence for IT Operations (AIOps), is the guarantee for maintaining the orderly and stable operation of existing information systems. According to Gartner's prediction, the use of AI technology for automated IT operations has become a new trend. Large language models (LLMs) that have exhibited remarkable capabilities in NLP… ▽ More

    Submitted 16 February, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

  25. arXiv:2310.03300  [pdf, ps, other

    cond-mat.quant-gas physics.atom-ph

    Efficient Creation of Ultracold Ground State $^{6}\textrm{Li}^{40}\textrm{K}$ Polar Molecules

    Authors: Canming He, Xiaoyu Nie, Victor Avalos, Sofia Botsi, Sunil Kumar, Anbang Yang, Kai Dieckmann

    Abstract: We report the creation of ultracold ground state $^{6}\textrm{Li}^{40}\textrm{K}$ polar molecules with high efficiency. Starting from weakly-bound molecules state, stimulated Raman adiabatic passage (STIRAP) is adopted to coherently transfer the molecules to their singlet ro-vibrational ground state $|\textrm{X}^{1}Σ^{+},v=0,J=0>$. By employing a singlet STIRAP pathway and low-phase-noise narrow-l… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

    Comments: 6 pages, 6 figures

  26. A spin-rotation mechanism of Einstein-de Haas effect based on a ferromagnetic disk

    Authors: Xin Nie, Jun Li, Trinanjan Datta, Dao-Xin Yao

    Abstract: Spin-rotation coupling (SRC) is a fundamental phenomenon that connects electronic spins with the rotational motion of a medium. We elucidate the Einstein-de Haas (EdH) effect and its inverse with SRC as the microscopic mechanism using the dynamic spin-lattice equations derived by elasticity theory and Lagrangian formalism. By applying the coupling equations to an iron disk in a magnetic field, we… ▽ More

    Submitted 8 April, 2024; v1 submitted 19 July, 2023; originally announced July 2023.

    Comments: 13 pages,6 figures, published to Frontiers of physics

    Journal ref: Front. Phys. 19(5), 53201 (2024)

  27. Non-equilibrium phases of Fermi gas inside a cavity with imbalanced pumping

    Authors: Xiaotian Nie, Wei Zheng

    Abstract: In this work, we investigate the non-equilibrium dynamics of one-dimensional spinless fermions loaded in a cavity with imbalanced pumping lasers. Our study is motivated by previous work on a similar setup using bosons, and we explore the unique properties of fermionic systems in this context. By considering the imbalance in the pumping, we find that the system exhibits multiple superradiant steady… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.

    Comments: 7 pages, 12 figures

    Report number: Phys. Rev. A 108, 043312

  28. arXiv:2307.02031  [pdf, other

    cs.LG cs.DB cs.DC

    Improving Automatic Parallel Training via Balanced Memory Workload Optimization

    Authors: Yujie Wang, Youhe Jiang, Xupeng Miao, Fangcheng Fu, Shenhan Zhu, Xiaonan Nie, Yaofeng Tu, Bin Cui

    Abstract: Transformer models have emerged as the leading approach for achieving state-of-the-art performance across various application domains, serving as the foundation for advanced large-scale deep learning (DL) models. However, efficiently training these models across multiple GPUs remains a complex challenge due to the abundance of parallelism options. Existing DL systems either require manual efforts… ▽ More

    Submitted 24 February, 2024; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2211.13878

  29. arXiv:2306.11570  [pdf, other

    math.GT math.DG math.MG

    Boundary metric of Epstein-Penner convex hull and discrete conformality

    Authors: Xin Nie

    Abstract: The Epstein-Penner convex hull construction associates to every decorated punctured hyperbolic surface a polyhedral convex body in the Minkowski space. It works in the de Sitter and anti-de Sitter spaces as well. In these three spaces, the quotient of the spacelike boundary part of the convex body has an induced Euclidean, spherical and hyperbolic metric, respectively, with conical singularities.… ▽ More

    Submitted 3 July, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: 27 pages, 18 figures, comments welcome (v2: minor updates)

    MSC Class: 52C26

  30. arXiv:2306.08460  [pdf, ps, other

    cs.LG cs.AI

    Improving Generalization in Meta-Learning via Meta-Gradient Augmentation

    Authors: Ren Wang, Haoliang Sun, Qi Wei, Xiushan Nie, Yuling Ma, Yilong Yin

    Abstract: Meta-learning methods typically follow a two-loop framework, where each loop potentially suffers from notorious overfitting, hindering rapid adaptation and generalization to new tasks. Existing schemes solve it by enhancing the mutual-exclusivity or diversity of training samples, but these data manipulation strategies are data-dependent and insufficiently flexible. This work alleviates overfitting… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

  31. arXiv:2305.17431  [pdf, other

    cs.CV cs.AI

    Towards Consistent Video Editing with Text-to-Image Diffusion Models

    Authors: Zicheng Zhang, Bonan Li, Xuecheng Nie, Congying Han, Tiande Guo, Luoqi Liu

    Abstract: Existing works have advanced Text-to-Image (TTI) diffusion models for video editing in a one-shot learning manner. Despite their low requirements of data and computation, these methods might produce results of unsatisfied consistency with text prompt as well as temporal sequence, limiting their applications in the real world. In this paper, we propose to address the above issues with a novel EI… ▽ More

    Submitted 27 May, 2023; originally announced May 2023.

  32. arXiv:2305.09940   

    cs.DC

    OSDP: Optimal Sharded Data Parallel for Distributed Deep Learning

    Authors: Youhe Jiang, Fangcheng Fu, Xupeng Miao, Xiaonan Nie, Bin Cui

    Abstract: Large-scale deep learning models contribute to significant performance improvements on varieties of downstream tasks. Current data and model parallelism approaches utilize model replication and partition techniques to support the distributed training of ultra-large models. However, directly deploying these systems often leads to sub-optimal training efficiency due to the complex model architecture… ▽ More

    Submitted 17 May, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: An older version is in existence, and the article has been updated there. The URL for the updated version is arXiv:2209.13258

  33. arXiv:2305.04517  [pdf, other

    cs.CV

    DiffBFR: Bootstrapping Diffusion Model Towards Blind Face Restoration

    Authors: Xinmin Qiu, Congying Han, Zicheng Zhang, Bonan Li, Tiande Guo, Xuecheng Nie

    Abstract: Blind face restoration (BFR) is important while challenging. Prior works prefer to exploit GAN-based frameworks to tackle this task due to the balance of quality and efficiency. However, these methods suffer from poor stability and adaptability to long-tail distribution, failing to simultaneously retain source identity and restore detail. We propose DiffBFR to introduce Diffusion Probabilistic Mod… ▽ More

    Submitted 8 August, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

  34. arXiv:2305.00703  [pdf, ps, other

    math.CA

    Rearrangement inequalities of the one-dimensional maximal functions associated with general measures

    Authors: Xudong Nie, Di Wu, Panwang Wang

    Abstract: We prove a rearrangement inequality for the uncentered Hardy-Littlewood maximal function $M_μ$ associate to general measure $μ$ on $\mathbb{R}$. This inequality is analogous to the Stein's result $cf^{**}(t)\leq(Mf)^{*}(t)\leq C f^{**}(t)$, where $f^*$ is the symmetric decreasing rearrangement function of $f$ and $f^{**}(t)=\int_0^tf^*(x)dx$. Moreover, we compute the best constant of $M_μ$ on… ▽ More

    Submitted 1 May, 2023; originally announced May 2023.

  35. arXiv:2304.03946  [pdf, other

    cs.DC cs.LG

    FlexMoE: Scaling Large-scale Sparse Pre-trained Model Training via Dynamic Device Placement

    Authors: Xiaonan Nie, Xupeng Miao, Zilong Wang, Zichao Yang, Jilong Xue, Lingxiao Ma, Gang Cao, Bin Cui

    Abstract: With the increasing data volume, there is a trend of using large-scale pre-trained models to store the knowledge into an enormous number of model parameters. The training of these models is composed of lots of dense algebras, requiring a huge amount of hardware resources. Recently, sparsely-gated Mixture-of-Experts (MoEs) are becoming more popular and have demonstrated impressive pretraining scala… ▽ More

    Submitted 8 April, 2023; originally announced April 2023.

    Comments: Accepted by SIGMOD 2023

    Journal ref: Proc. ACM Manag. Data, Vol. 1, No. 1, Article 110. Publication date: May 2023

  36. arXiv:2303.06353  [pdf, ps, other

    cs.IT

    Secure and Multi-Step Computation Offloading and Resource Allocation in Ultra-Dense Multi-Task NOMA-Enabled IoT Networks

    Authors: Tianqing Zhou, Yanyan Fu, Dong Qin, Xuefang Nie, Nan Jiang, Chunguo Li

    Abstract: Ultra-dense networks are widely regarded as a promising solution to explosively growing applications of Internet-of-Things (IoT) mobile devices (IMDs). However, complicated and severe interferences need to be tackled properly in such networks. To this end, both orthogonal multiple access (OMA) and non-orthogonal multiple access (NOMA) are utilized at first. Then, in order to attain a goal of green… ▽ More

    Submitted 11 March, 2023; originally announced March 2023.

  37. arXiv:2303.03231  [pdf, other

    cs.CV

    StyO: Stylize Your Face in Only One-Shot

    Authors: Bonan Li, Zicheng Zhang, Xuecheng Nie, Congying Han, Yinhan Hu, Tiande Guo

    Abstract: This paper focuses on face stylization with a single artistic target. Existing works for this task often fail to retain the source content while achieving geometry variation. Here, we present a novel StyO model, ie. Stylize the face in only One-shot, to solve the above problem. In particular, StyO exploits a disentanglement and recombination strategy. It first disentangles the content and style of… ▽ More

    Submitted 6 March, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

  38. arXiv:2303.02868  [pdf, other

    cs.LG cs.DC

    Angel-PTM: A Scalable and Economical Large-scale Pre-training System in Tencent

    Authors: Xiaonan Nie, Yi Liu, Fangcheng Fu, Jinbao Xue, Dian Jiao, Xupeng Miao, Yangyu Tao, Bin Cui

    Abstract: Recent years have witnessed the unprecedented achievements of large-scale pre-trained models, especially the Transformer models. Many products and services in Tencent Inc., such as WeChat, QQ, and Tencent Advertisement, have been opted in to gain the power of pre-trained models. In this work, we present Angel-PTM, a productive deep learning system designed for pre-training and fine-tuning Transfor… ▽ More

    Submitted 5 March, 2023; originally announced March 2023.

  39. arXiv:2301.09585  [pdf, other

    math.DG math.GT

    On circle patterns and spherical conical metrics

    Authors: Xin Nie

    Abstract: The Koebe-Andreev-Thurston circle packing theorem, as well as its generalization to circle patterns due to Bobenko and Springborn, holds for Euclidean and hyperbolic metrics possibly with conical singularities, but fails for spherical metrics because of the non-uniqueness coming from Möbius transformations. In this paper, we show that a unique existence result for circle pattern with spherical con… ▽ More

    Submitted 23 January, 2023; originally announced January 2023.

    Comments: 9 pages, 6 figures

    MSC Class: 52C26; 57M50

  40. arXiv:2212.02222  [pdf, other

    cs.GT cs.AI cs.LG

    Real-time Bidding Strategy in Display Advertising: An Empirical Analysis

    Authors: Mengjuan Liu, Zhengning Hu, Zhi Lai, Daiwei Zheng, Xuyun Nie

    Abstract: Bidding strategies that help advertisers determine bidding prices are receiving increasing attention as more and more ad impressions are sold through real-time bidding systems. This paper first describes the problem and challenges of optimizing bidding strategies for individual advertisers in real-time bidding display advertising. Then, several representative bidding strategies are introduced, esp… ▽ More

    Submitted 30 November, 2022; originally announced December 2022.

  41. Practical quantum simulation of small-scale non-Hermitian dynamics

    Authors: Hongfeng Liu, Xiaodong Yang, Kai Tang, Liangyu Che, Xinfang Nie, Tao Xin, Jun Li, Dawei Lu

    Abstract: Non-Hermitian quantum systems have recently attracted considerable attention due to their exotic properties. Though many experimental realizations of non-Hermitian systems have been reported, the non-Hermiticity usually resorts to the hard-to-control environments and cannot last for too long times. An alternative approach is to use quantum simulation with the closed system, whereas how to simulate… ▽ More

    Submitted 7 June, 2023; v1 submitted 27 November, 2022; originally announced November 2022.

    Comments: 9 pages, 5 figures

    Journal ref: Physical Review A 107, 062608 (2023)

  42. arXiv:2211.13878  [pdf, other

    cs.LG cs.DB cs.DC

    Galvatron: Efficient Transformer Training over Multiple GPUs Using Automatic Parallelism

    Authors: Xupeng Miao, Yujie Wang, Youhe Jiang, Chunan Shi, Xiaonan Nie, Hailin Zhang, Bin Cui

    Abstract: Transformer models have achieved state-of-the-art performance on various domains of applications and gradually becomes the foundations of the advanced large deep learning (DL) models. However, how to train these models over multiple GPUs efficiently is still challenging due to a large number of parallelism choices. Existing DL systems either rely on manual efforts to make distributed training plan… ▽ More

    Submitted 24 November, 2022; originally announced November 2022.

    Journal ref: VLDB 2023

  43. arXiv:2211.09013  [pdf, other

    cs.CV cs.IT

    Masked Reconstruction Contrastive Learning with Information Bottleneck Principle

    Authors: Ziwen Liu, Bonan Li, Congying Han, Tiande Guo, Xuecheng Nie

    Abstract: Contrastive learning (CL) has shown great power in self-supervised learning due to its ability to capture insight correlations among large-scale data. Current CL models are biased to learn only the ability to discriminate positive and negative pairs due to the discriminative task setting. However, this bias would lead to ignoring its sufficiency for other downstream tasks, which we call the discri… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

  44. Control-enhanced quantum metrology under Markovian noise

    Authors: Yue Zhai, Xiaodong Yang, Kai Tang, Xinyue Long, Xinfang Nie, Tao Xin, Dawei Lu, Jun Li

    Abstract: Quantum metrology is supposed to significantly improve the precision of parameter estimation by utilizing suitable quantum resources. However, the predicted precision can be severely distorted by realistic noises. Here, we propose a control-enhanced quantum metrology scheme to defend against these noises for improving the metrology performance. Our scheme can automatically alter the parameter enco… ▽ More

    Submitted 6 February, 2023; v1 submitted 3 November, 2022; originally announced November 2022.

    Comments: 9 pages, 5 figures

    Journal ref: Physical Review A 107, 022602 (2023)

  45. Multidimensional Coherent Spectroscopy of Molecular Polaritons: Langevin Approach

    Authors: Zhedong Zhang, Xiaoyu Nie, Dangyuan Lei, Shaul Mukame

    Abstract: We present a microscopic theory for nonlinear optical spectroscopy of N molecules in an optical cavity. A quantum Langevin analytical expression is derived for the time- and frequency-resolved signals accounting for arbitrary numbers of vibrational excitations. We identify clear signatures of the polariton-polaron interaction from multidimensional projections of the signal, e.g., pathways and time… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: 6 pages, 2 figures

  46. arXiv:2210.12145  [pdf, other

    quant-ph

    Experimental realization of a topologically protected Hadamard gate via braiding Fibonacci anyons

    Authors: Yu-ang Fan, Yingcheng Li, Yuting Hu, Yishan Li, Xinyue Long, Hongfeng Liu, Xiaodong Yang, Xinfang Nie, Jun Li, Tao Xin, Dawei Lu, Yidun Wan

    Abstract: Topological quantum computation (TQC) is one of the most striking architectures that can realize fault-tolerant quantum computers. In TQC, the logical space and the quantum gates are topologically protected, i.e., robust against local disturbances. The topological protection, however, requires rather complicated lattice models and hard-to-manipulate dynamics; even the simplest system that can real… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: 8 pages, 4 figures

  47. arXiv:2210.01886  [pdf, other

    cs.CV cs.AI

    Multi-view Human Body Mesh Translator

    Authors: Xiangjian Jiang, Xuecheng Nie, Zitian Wang, Luoqi Liu, Si Liu

    Abstract: Existing methods for human mesh recovery mainly focus on single-view frameworks, but they often fail to produce accurate results due to the ill-posed setup. Considering the maturity of the multi-view motion capture system, in this paper, we propose to solve the prior ill-posed problem by leveraging multiple images from different views, thus significantly enhancing the quality of recovered meshes.… ▽ More

    Submitted 4 October, 2022; originally announced October 2022.

    Comments: 9 pages

  48. arXiv:2209.15012  [pdf, other

    eess.IV physics.optics

    Ghost translation

    Authors: Wenhan Ren, Xiaoyu Nie, Tao Peng, Marlan O. Scully

    Abstract: Artificial intelligence has recently been widely used in computational imaging. The deep neural network (DNN) improves the signal-to-noise ratio of the retrieved images, whose quality is otherwise corrupted due to the low sampling ratio or noisy environments. This work proposes a new computational imaging scheme based on the sequence transduction mechanism with the transformer network. The simulat… ▽ More

    Submitted 29 September, 2022; originally announced September 2022.

    Comments: 10 pages, 8 figures

  49. arXiv:2209.13258  [pdf, other

    cs.DC

    OSDP: Optimal Sharded Data Parallel for Distributed Deep Learning

    Authors: Youhe Jiang, Fangcheng Fu, Xupeng Miao, Xiaonan Nie, Bin Cui

    Abstract: Large-scale deep learning models contribute to significant performance improvements on varieties of downstream tasks. Current data and model parallelism approaches utilize model replication and partition techniques to support the distributed training of ultra-large models. However, directly deploying these systems often leads to sub-optimal training efficiency due to the complex model architecture… ▽ More

    Submitted 18 May, 2023; v1 submitted 27 September, 2022; originally announced September 2022.

    Comments: Accepted by IJCAI 2023

  50. arXiv:2209.08501  [pdf, other

    quant-ph

    Measuring Quantum Entanglement from Local Information by Machine Learning

    Authors: Yulei Huang, Liangyu Che, Chao Wei, Feng Xu, Xinfang Nie, Jun Li, Dawei Lu, Tao Xin

    Abstract: Entanglement is a key property in the development of quantum technologies and in the study of quantum many-body simulations. However, entanglement measurement typically requires quantum full-state tomography (FST). Here we present a neural network-assisted protocol for measuring entanglement in equilibrium and non-equilibrium states of local Hamiltonians. Instead of FST, it can learn comprehensive… ▽ More

    Submitted 18 September, 2022; originally announced September 2022.

    Comments: 5 pages, 4 figures. All comments are welcome