Skip to main content

Showing 1–50 of 76 results for author: Lei, B

  1. arXiv:2405.14906  [pdf, other

    cs.SE cs.AI

    AutoCoder: Enhancing Code Large Language Model with \textsc{AIEV-Instruct}

    Authors: Bin Lei, Yuchen Li, Qiuwu Chen

    Abstract: We introduce AutoCoder, the first Large Language Model to surpass GPT-4 Turbo (April 2024) and GPT-4o in pass@1 on the Human Eval benchmark test ($\mathbf{90.9\%}$ vs. $\mathbf{90.2\%}$). In addition, AutoCoder offers a more versatile code interpreter compared to GPT-4 Turbo and GPT-4o. It's code interpreter can install external packages instead of limiting to built-in packages. AutoCoder's traini… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  2. arXiv:2404.04735  [pdf, other

    cs.AI cs.CL cs.MA

    MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problems

    Authors: Bin Lei

    Abstract: Recent advancements in large language models, such as GPT-4, have demonstrated remarkable capabilities in processing standard queries. Despite these advancements, their performance substantially declines in \textbf{advanced mathematical problems requiring complex, multi-step logical reasoning}. To enhance their inferential capabilities, current research has delved into \textit{prompting engineerin… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

  3. arXiv:2403.20047  [pdf, other

    cs.LG cs.CV

    Embracing Unknown Step by Step: Towards Reliable Sparse Training in Real World

    Authors: Bowen Lei, Dongkuan Xu, Ruqi Zhang, Bani Mallick

    Abstract: Sparse training has emerged as a promising method for resource-efficient deep neural networks (DNNs) in real-world applications. However, the reliability of sparse models remains a crucial concern, particularly in detecting unknown out-of-distribution (OOD) data. This study addresses the knowledge gap by investigating the reliability of sparse training from an OOD perspective and reveals that spar… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

  4. arXiv:2403.11396  [pdf, other

    cs.RO

    Beyond Uncertainty: Risk-Aware Active View Acquisition for Safe Robot Navigation and 3D Scene Understanding with FisherRF

    Authors: Guangyi Liu, Wen Jiang, Boshu Lei, Vivek Pandey, Kostas Daniilidis, Nader Motee

    Abstract: This work proposes a novel approach to bolster both the robot's risk assessment and safety measures while deepening its understanding of 3D scenes, which is achieved by leveraging Radiance Field (RF) models and 3D Gaussian Splatting. To further enhance these capabilities, we incorporate additional sampled views from the environment with the RF model. One of our key contributions is the introductio… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  5. arXiv:2402.17292  [pdf, other

    cs.CV

    DivAvatar: Diverse 3D Avatar Generation with a Single Prompt

    Authors: Weijing Tao, Biwen Lei, Kunhao Liu, Shijian Lu, Miaomiao Cui, Xuansong Xie, Chunyan Miao

    Abstract: Text-to-Avatar generation has recently made significant strides due to advancements in diffusion models. However, most existing work remains constrained by limited diversity, producing avatars with subtle differences in appearance for a given text prompt. We design DivAvatar, a novel framework that generates diverse avatars, empowering 3D creatives with a multitude of distinct and richly varied 3D… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  6. arXiv:2402.16866  [pdf, other

    cs.IT cs.AI

    Computation Rate Maximization for Wireless Powered Edge Computing With Multi-User Cooperation

    Authors: Yang Li, Xing Zhang, Bo Lei, Qianying Zhao, Min Wei, Zheyan Qu, Wenbo Wang

    Abstract: The combination of mobile edge computing (MEC) and radio frequency-based wireless power transfer (WPT) presents a promising technique for providing sustainable energy supply and computing services at the network edge. This study considers a wireless-powered mobile edge computing system that includes a hybrid access point (HAP) equipped with a computing unit and multiple Internet of Things (IoT) de… ▽ More

    Submitted 22 January, 2024; originally announced February 2024.

    Comments: Accepted to IEEE Open Journal of the Communications Society

  7. arXiv:2402.02734  [pdf, other

    eess.IV cs.CV cs.NE stat.AP stat.ML

    InVA: Integrative Variational Autoencoder for Harmonization of Multi-modal Neuroimaging Data

    Authors: Bowen Lei, Rajarshi Guhaniyogi, Krishnendu Chandra, Aaron Scheffler, Bani Mallick

    Abstract: There is a significant interest in exploring non-linear associations among multiple images derived from diverse imaging modalities. While there is a growing literature on image-on-image regression to delineate predictive inference of an image based on multiple images, existing approaches have limitations in efficiently borrowing information between multiple imaging modalities in the prediction of… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  8. arXiv:2401.01173  [pdf, other

    cs.CV

    En3D: An Enhanced Generative Model for Sculpting 3D Humans from 2D Synthetic Data

    Authors: Yifang Men, Biwen Lei, Yuan Yao, Miaomiao Cui, Zhouhui Lian, Xuansong Xie

    Abstract: We present En3D, an enhanced generative scheme for sculpting high-quality 3D human avatars. Unlike previous works that rely on scarce 3D datasets or limited 2D collections with imbalanced viewing angles and imprecise pose priors, our approach aims to develop a zero-shot 3D generative scheme capable of producing visually realistic, geometrically accurate and content-wise diverse 3D humans without r… ▽ More

    Submitted 2 January, 2024; originally announced January 2024.

    Comments: Project Page: https://menyifang.github.io/projects/En3D/index.html

  9. arXiv:2401.00869  [pdf, other

    cs.CV

    FlashVideo: A Framework for Swift Inference in Text-to-Video Generation

    Authors: Bin Lei, le Chen, Caiwen Ding

    Abstract: In the evolving field of machine learning, video generation has witnessed significant advancements with autoregressive-based transformer models and diffusion models, known for synthesizing dynamic and realistic scenes. However, these models often face challenges with prolonged inference times, even for generating short video clips such as GIFs. This paper introduces FlashVideo, a novel framework t… ▽ More

    Submitted 29 December, 2023; originally announced January 2024.

  10. arXiv:2312.16837  [pdf, other

    cs.CV

    DiffusionGAN3D: Boosting Text-guided 3D Generation and Domain Adaptation by Combining 3D GANs and Diffusion Priors

    Authors: Biwen Lei, Kai Yu, Mengyang Feng, Miaomiao Cui, Xuansong Xie

    Abstract: Text-guided domain adaptation and generation of 3D-aware portraits find many applications in various fields. However, due to the lack of training data and the challenges in handling the high variety of geometry and appearance, the existing methods for these tasks suffer from issues like inflexibility, instability, and low fidelity. In this paper, we propose a novel framework DiffusionGAN3D, which… ▽ More

    Submitted 12 April, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

    Comments: Accepted by CVPR2024

  11. arXiv:2312.09022  [pdf, other

    eess.IV cs.CV q-bio.NC

    BDHT: Generative AI Enables Causality Analysis for Mild Cognitive Impairment

    Authors: Qiankun Zuo, Ling Chen, Yanyan Shen, Michael Kwok-Po Ng, Baiying Lei, Shuqiang Wang

    Abstract: Effective connectivity estimation plays a crucial role in understanding the interactions and information flow between different brain regions. However, the functional time series used for estimating effective connectivity is derived from certain software, which may lead to large computing errors because of different parameter settings and degrade the ability to model complex causal relationships b… ▽ More

    Submitted 28 May, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: 13pages, 14 figures

  12. arXiv:2312.05107  [pdf, other

    cs.CV

    DreaMoving: A Human Video Generation Framework based on Diffusion Models

    Authors: Mengyang Feng, Jinlin Liu, Kai Yu, Yuan Yao, Zheng Hui, Xiefan Guo, Xianhui Lin, Haolan Xue, Chen Shi, Xiaowen Li, Aojie Li, Xiaoyang Kang, Biwen Lei, Miaomiao Cui, Peiran Ren, Xuansong Xie

    Abstract: In this paper, we present DreaMoving, a diffusion-based controllable video generation framework to produce high-quality customized human videos. Specifically, given target identity and posture sequences, DreaMoving can generate a video of the target identity moving or dancing anywhere driven by the posture sequences. To this end, we propose a Video ControlNet for motion-controlling and a Content G… ▽ More

    Submitted 11 December, 2023; v1 submitted 8 December, 2023; originally announced December 2023.

    Comments: 5 pages, 5 figures, Tech. Report

  13. arXiv:2312.03780  [pdf

    cs.LG

    Predicting the Transportation Activities of Construction Waste Hauling Trucks: An Input-Output Hidden Markov Approach

    Authors: Hongtai Yang, Boyi Lei, Ke Han, Luna Liu

    Abstract: Construction waste hauling trucks (CWHTs), as one of the most commonly seen heavy-duty vehicles in major cities around the globe, are usually subject to a series of regulations and spatial-temporal access restrictions because they not only produce significant NOx and PM emissions but also causes on-road fugitive dust. The timely and accurate prediction of CWHTs' destinations and dwell times play a… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: 21 pages, 8 figures

  14. arXiv:2312.01751  [pdf, other

    cs.DC

    Joint Task Partitioning and Parallel Scheduling in Device-Assisted Mobile Edge Networks

    Authors: Yang Li, Xinlei Ge, Bo Lei, Xing Zhang, Wenbo Wang

    Abstract: With the development of the Internet of Things (IoT), certain IoT devices have the capability to not only accomplish their own tasks but also simultaneously assist other resource-constrained devices. Therefore, this paper considers a device-assisted mobile edge computing system that leverages auxiliary IoT devices to alleviate the computational burden on the edge computing server and enhance the o… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: Accepted to IEEE Internet of Things Journal

  15. arXiv:2312.01022  [pdf, other

    cs.LG

    Advanced Large Language Model (LLM)-Driven Verilog Development: Enhancing Power, Performance, and Area Optimization in Code Synthesis

    Authors: Kiran Thorat, Jiahui Zhao, Yaotian Liu, Hongwu Peng, Xi Xie, Bin Lei, Jeff Zhang, Caiwen Ding

    Abstract: The increasing use of Advanced Language Models (ALMs) in diverse sectors, particularly due to their impressive capability to generate top-tier content following linguistic instructions, forms the core of this investigation. This study probes into ALMs' deployment in electronic hardware design, with a specific emphasis on the synthesis and enhancement of Verilog programming. We introduce an innovat… ▽ More

    Submitted 9 January, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

  16. arXiv:2311.17874  [pdf, other

    cs.CV

    FisherRF: Active View Selection and Uncertainty Quantification for Radiance Fields using Fisher Information

    Authors: Wen Jiang, Boshu Lei, Kostas Daniilidis

    Abstract: This study addresses the challenging problem of active view selection and uncertainty quantification within the domain of Radiance Fields. Neural Radiance Fields (NeRF) have greatly advanced image rendering and reconstruction, but the limited availability of 2D images poses uncertainties stemming from occlusions, depth ambiguities, and imaging errors. Efficiently selecting informative views become… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: Project page: https://jiangwenpl.github.io/FisherRF/

  17. arXiv:2311.16540  [pdf, other

    cs.LG cs.DC cs.NI

    Communication Efficiency Optimization of Federated Learning for Computing and Network Convergence of 6G Networks

    Authors: Yizhuo Cai, Bo Lei, Qianying Zhao, Jing Peng, Min Wei, Yushun Zhang, Xing Zhang

    Abstract: Federated learning effectively addresses issues such as data privacy by collaborating across participating devices to train global models. However, factors such as network topology and device computing power can affect its training or communication process in complex network environments. A new network architecture and paradigm with computing-measurable, perceptible, distributable, dispatchable, a… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: 13 pages, 11 figures, accepted by Frontiers of Information Technology & Electronic Engineering

  18. arXiv:2311.06505  [pdf, other

    cs.LG

    CompCodeVet: A Compiler-guided Validation and Enhancement Approach for Code Dataset

    Authors: Le Chen, Arijit Bhattacharjee, Nesreen K. Ahmed, Niranjan Hasabnis, Gal Oren, Bin Lei, Ali Jannesari

    Abstract: Large language models (LLMs) have become increasingly prominent in academia and industry due to their remarkable performance in diverse applications. As these models evolve with increasing parameters, they excel in tasks like sentiment analysis and machine translation. However, even models with billions of parameters face challenges in tasks demanding multi-step reasoning. Code generation and comp… ▽ More

    Submitted 11 November, 2023; originally announced November 2023.

  19. arXiv:2309.11032  [pdf, other

    cs.RO

    Multi-Risk-RRT: An Efficient Motion Planning Algorithm for Robotic Autonomous Luggage Trolley Collection at Airports

    Authors: Zhirui Sun, Boshu Lei, Peijia Xie, Fugang Liu, Junjie Gao, Ying Zhang, Jiankun Wang

    Abstract: Robots have become increasingly prevalent in dynamic and crowded environments such as airports and shopping malls. In these scenarios, the critical challenges for robot navigation are reliability and timely arrival at predetermined destinations. While existing risk-based motion planning algorithms effectively reduce collision risks with static and dynamic obstacles, there is still a need for signi… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

  20. arXiv:2308.08649  [pdf, other

    cs.NE cs.AI

    Towards Zero Memory Footprint Spiking Neural Network Training

    Authors: Bin Lei, Sheng Lin, Pei-Hung Lin, Chunhua Liao, Caiwen Ding

    Abstract: Biologically-inspired Spiking Neural Networks (SNNs), processing information using discrete-time events known as spikes rather than continuous values, have garnered significant attention due to their hardware-friendly and energy-efficient characteristics. However, the training of SNNs necessitates a considerably large memory footprint, given the additional storage requirements for spikes or events… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

  21. arXiv:2308.08614  [pdf, other

    cs.LG cs.AI cs.CL

    Boosting Logical Reasoning in Large Language Models through a New Framework: The Graph of Thought

    Authors: Bin Lei, pei-Hung Lin, Chunhua Liao, Caiwen Ding

    Abstract: Recent advancements in large-scale models, such as GPT-4, have showcased remarkable capabilities in addressing standard queries. However, when facing complex problems that require multi-step logical reasoning, their accuracy dramatically decreases. Current research has explored the realm of \textit{prompting engineering} to bolster the inferential capacities of these models. Our paper unveils a pi… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

  22. arXiv:2307.12463  [pdf, other

    cs.CV cs.LG

    Rethinking Data Distillation: Do Not Overlook Calibration

    Authors: Dongyao Zhu, Bowen Lei, Jie Zhang, Yanbo Fang, Ruqi Zhang, Yiqun Xie, Dongkuan Xu

    Abstract: Neural networks trained on distilled data often produce over-confident output and require correction by calibration methods. Existing calibration methods such as temperature scaling and mixup work well for networks trained on original large-scale data. However, we find that these methods fail to calibrate networks trained on data distilled from large source datasets. In this paper, we show that di… ▽ More

    Submitted 14 September, 2023; v1 submitted 23 July, 2023; originally announced July 2023.

    Comments: ICCV 2023

  23. arXiv:2307.09858  [pdf, other

    cs.AI

    Towards Reliable Rare Category Analysis on Graphs via Individual Calibration

    Authors: Longfeng Wu, Bowen Lei, Dongkuan Xu, Dawei Zhou

    Abstract: Rare categories abound in a number of real-world networks and play a pivotal role in a variety of high-stakes applications, including financial fraud detection, network intrusion detection, and rare disease diagnosis. Rare category analysis (RCA) refers to the task of detecting, characterizing, and comprehending the behaviors of minority classes in a highly-imbalanced data distribution. While the… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.

  24. arXiv:2307.07686  [pdf, other

    cs.SE cs.AI cs.LG

    Creating a Dataset for High-Performance Computing Code Translation using LLMs: A Bridge Between OpenMP Fortran and C++

    Authors: Bin Lei, Caiwen Ding, Le Chen, Pei-Hung Lin, Chunhua Liao

    Abstract: In this study, we present a novel dataset for training machine learning models translating between OpenMP Fortran and C++ code. To ensure reliability and applicability, the dataset is created from a range of representative open-source OpenMP benchmarks. It is also refined using a meticulous code similarity test. The effectiveness of our dataset is assessed using both quantitative (CodeBLEU) and qu… ▽ More

    Submitted 18 September, 2023; v1 submitted 14 July, 2023; originally announced July 2023.

    Comments: This paper was accepted by the HPEC 2023 conference and received the Outstanding Student Paper Award

  25. arXiv:2305.18323  [pdf, other

    cs.CL cs.AI

    ReWOO: Decoupling Reasoning from Observations for Efficient Augmented Language Models

    Authors: Binfeng Xu, Zhiyuan Peng, Bowen Lei, Subhabrata Mukherjee, Yuchen Liu, Dongkuan Xu

    Abstract: Augmented Language Models (ALMs) blend the reasoning capabilities of Large Language Models (LLMs) with tools that allow for knowledge retrieval and action execution. Existing ALM systems trigger LLM thought processes while pulling observations from these tools in an interleaved fashion. Specifically, an LLM reasons to call an external tool, gets halted to fetch the tool's response, and then decide… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

  26. arXiv:2305.14404  [pdf, other

    q-bio.NC cs.AI cs.LG eess.IV

    Brain Structure-Function Fusing Representation Learning using Adversarial Decomposed-VAE for Analyzing MCI

    Authors: Qiankun Zuo, Baiying Lei, Ning Zhong, Yi Pan, Shuqiang Wang

    Abstract: Integrating the brain structural and functional connectivity features is of great significance in both exploring brain science and analyzing cognitive impairment clinically. However, it remains a challenge to effectively fuse structural and functional features in exploring the brain network. In this paper, a novel brain structure-function fusing-representation learning (BSFL) model is proposed to… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

  27. arXiv:2305.12646  [pdf, other

    eess.IV cs.CV

    SG-GAN: Fine Stereoscopic-Aware Generation for 3D Brain Point Cloud Up-sampling from a Single Image

    Authors: Bowen Hu, Baiying Lei, Shuqiang Wang

    Abstract: In minimally-invasive brain surgeries with indirect and narrow operating environments, 3D brain reconstruction is crucial. However, as requirements of accuracy for some new minimally-invasive surgeries (such as brain-computer interface surgery) are higher and higher, the outputs of conventional 3D reconstruction, such as point cloud (PC), are facing the challenges that sample points are too sparse… ▽ More

    Submitted 21 May, 2023; originally announced May 2023.

  28. arXiv:2304.12214  [pdf, other

    cs.NE

    Neurogenesis Dynamics-inspired Spiking Neural Network Training Acceleration

    Authors: Shaoyi Huang, Haowen Fang, Kaleel Mahmood, Bowen Lei, Nuo Xu, Bin Lei, Yue Sun, Dongkuan Xu, Wujie Wen, Caiwen Ding

    Abstract: Biologically inspired Spiking Neural Networks (SNNs) have attracted significant attention for their ability to provide extremely energy-efficient machine intelligence through event-driven operation and sparse activities. As artificial intelligence (AI) becomes ever more democratized, there is an increasing need to execute SNN models on edge devices. Existing works adopt weight pruning to reduce SN… ▽ More

    Submitted 24 April, 2023; originally announced April 2023.

  29. arXiv:2304.08697  [pdf

    cs.NI cs.PF eess.SP

    Performance Analysis and Comparison of Non-ideal Wireless PBFT and RAFT Consensus Networks in 6G Communications

    Authors: Haoxiang Luo, Xiangyue Yang, Hongfang Yu, Gang Sun, Bo Lei, Mohsen Guizani

    Abstract: Due to advantages in security and privacy, blockchain is considered a key enabling technology to support 6G communications. Practical Byzantine Fault Tolerance (PBFT) and RAFT are seen as the most applicable consensus mechanisms (CMs) in blockchain-enabled wireless networks. However, previous studies on PBFT and RAFT rarely consider the channel performance of the physical layer, such as path loss… ▽ More

    Submitted 2 August, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2303.15759

  30. arXiv:2303.06410  [pdf, other

    cs.AI cs.LG eess.IV

    Brain Diffuser: An End-to-End Brain Image to Brain Network Pipeline

    Authors: Xuhang Chen, Baiying Lei, Chi-Man Pun, Shuqiang Wang

    Abstract: Brain network analysis is essential for diagnosing and intervention for Alzheimer's disease (AD). However, previous research relied primarily on specific time-consuming and subjective toolkits. Only few tools can obtain the structural brain networks from brain diffusion tensor images (DTI). In this paper, we propose a diffusion based end-to-end brain network generative model Brain Diffuser that di… ▽ More

    Submitted 11 March, 2023; originally announced March 2023.

  31. arXiv:2302.14434  [pdf, other

    cs.CV

    A Hierarchical Representation Network for Accurate and Detailed Face Reconstruction from In-The-Wild Images

    Authors: Biwen Lei, Jianqiang Ren, Mengyang Feng, Miaomiao Cui, Xuansong Xie

    Abstract: Limited by the nature of the low-dimensional representational capacity of 3DMM, most of the 3DMM-based face reconstruction (FR) methods fail to recover high-frequency facial details, such as wrinkles, dimples, etc. Some attempt to solve the problem by introducing detail maps or non-linear operations, however, the results are still not vivid. To this end, we in this paper present a novel hierarchic… ▽ More

    Submitted 28 March, 2023; v1 submitted 28 February, 2023; originally announced February 2023.

    Comments: Accepted by CVPR2023

  32. arXiv:2302.13929  [pdf, other

    cs.LG stat.ML

    Efficient Informed Proposals for Discrete Distributions via Newton's Series Approximation

    Authors: Yue Xiang, Dongyao Zhu, Bowen Lei, Dongkuan Xu, Ruqi Zhang

    Abstract: Gradients have been exploited in proposal distributions to accelerate the convergence of Markov chain Monte Carlo algorithms on discrete distributions. However, these methods require a natural differentiable extension of the target discrete distribution, which often does not exist or does not provide effective gradient guidance. In this paper, we develop a gradient-like proposal for any discrete d… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

    Comments: Published at AISTATS 2023

  33. arXiv:2302.09369  [pdf, other

    cs.LG cs.AI cs.CV

    Calibrating the Rigged Lottery: Making All Tickets Reliable

    Authors: Bowen Lei, Ruqi Zhang, Dongkuan Xu, Bani Mallick

    Abstract: Although sparse training has been successfully used in various resource-limited deep learning tasks to save memory, accelerate training, and reduce inference time, the reliability of the produced sparse models remains unexplored. Previous research has shown that deep neural networks tend to be over-confident, and we find that sparse training exacerbates this problem. Therefore, calibrating the spa… ▽ More

    Submitted 28 February, 2023; v1 submitted 18 February, 2023; originally announced February 2023.

  34. arXiv:2301.03573  [pdf, other

    cs.LG cs.AI cs.CV

    Balance is Essence: Accelerating Sparse Training via Adaptive Gradient Correction

    Authors: Bowen Lei, Dongkuan Xu, Ruqi Zhang, Shuren He, Bani K. Mallick

    Abstract: Despite impressive performance, deep neural networks require significant memory and computation costs, prohibiting their application in resource-constrained scenarios. Sparse training is one of the most common techniques to reduce these costs, however, the sparsity constraints add difficulty to the optimization, resulting in an increase in training time and instability. In this work, we aim to ove… ▽ More

    Submitted 5 December, 2023; v1 submitted 9 January, 2023; originally announced January 2023.

  35. arXiv:2212.13105  [pdf, other

    cs.CV

    SuperGF: Unifying Local and Global Features for Visual Localization

    Authors: Wenzheng Song, Ran Yan, Boshu Lei, Takayuki Okatani

    Abstract: Advanced visual localization techniques encompass image retrieval challenges and 6 Degree-of-Freedom (DoF) camera pose estimation, such as hierarchical localization. Thus, they must extract global and local features from input images. Previous methods have achieved this through resource-intensive or accuracy-reducing means, such as combinatorial pipelines or multi-task distillation. In this study,… ▽ More

    Submitted 23 December, 2022; originally announced December 2022.

  36. arXiv:2212.06152  [pdf, other

    cs.LG cs.AI

    Accelerating Dataset Distillation via Model Augmentation

    Authors: Lei Zhang, Jie Zhang, Bowen Lei, Subhabrata Mukherjee, Xiang Pan, Bo Zhao, Caiwen Ding, Yao Li, Dongkuan Xu

    Abstract: Dataset Distillation (DD), a newly emerging field, aims at generating much smaller but efficient synthetic training datasets from large ones. Existing DD methods based on gradient matching achieve leading performance; however, they are extremely computationally intensive as they require continuously optimizing a dataset among thousands of randomly initialized models. In this paper, we assume that… ▽ More

    Submitted 15 April, 2023; v1 submitted 12 December, 2022; originally announced December 2022.

  37. arXiv:2211.16667  [pdf, other

    cs.LG cs.AI cs.CV

    Dynamic Sparse Training via Balancing the Exploration-Exploitation Trade-off

    Authors: Shaoyi Huang, Bowen Lei, Dongkuan Xu, Hongwu Peng, Yue Sun, Mimi Xie, Caiwen Ding

    Abstract: Over-parameterization of deep neural networks (DNNs) has shown high prediction accuracy for many applications. Although effective, the large number of parameters hinders its popularity on resource-limited devices and has an outsize environmental impact. Sparse training (using a fixed number of nonzero weights in each iteration) could significantly mitigate the training costs by reducing the model… ▽ More

    Submitted 24 April, 2023; v1 submitted 29 November, 2022; originally announced November 2022.

  38. arXiv:2211.03033  [pdf, other

    cs.LG

    Efficient Traffic State Forecasting using Spatio-Temporal Network Dependencies: A Sparse Graph Neural Network Approach

    Authors: Bin Lei, Shaoyi Huang, Caiwen Ding, Monika Filipovska

    Abstract: Traffic state prediction in a transportation network is paramount for effective traffic operations and management, as well as informed user and system-level decision-making. However, long-term traffic prediction (beyond 30 minutes into the future) remains challenging in current research. In this work, we integrate the spatio-temporal dependencies in the transportation network from network modeling… ▽ More

    Submitted 6 November, 2022; originally announced November 2022.

  39. arXiv:2211.02923  [pdf, other

    cs.HC

    Cross-Subject Emotion Recognition with Sparsely-Labeled Peripheral Physiological Data Using SHAP-Explained Tree Ensembles

    Authors: Feng Zhou, Tao Chen, Baiying Lei

    Abstract: There are still many challenges of emotion recognition using physiological data despite the substantial progress made recently. In this paper, we attempted to address two major challenges. First, in order to deal with the sparsely-labeled physiological data, we first decomposed the raw physiological data using signal spectrum analysis, based on which we extracted both complexity and energy feature… ▽ More

    Submitted 5 November, 2022; originally announced November 2022.

  40. arXiv:2210.06080  [pdf, other

    cs.NI

    Computing Power Network: A Survey

    Authors: Yukun Sun, Bo Lei, Junlin Liu, Haonan Huang, Xing Zhang, Jing Peng, Wenbo Wang

    Abstract: With the rapid development of cloud computing, edge computing, and smart devices, computing power resources indicate a trend of ubiquitous deployment. The traditional network architecture cannot efficiently leverage these distributed computing power resources due to computing power island effect. To overcome these problems and improve network efficiency, a new network computing paradigm is propose… ▽ More

    Submitted 15 November, 2022; v1 submitted 12 October, 2022; originally announced October 2022.

  41. arXiv:2204.10972  [pdf, other

    cs.CV

    GRM: Gradient Rectification Module for Visual Place Retrieval

    Authors: Boshu Lei, Wenjie Ding, Limeng Qiao, Xi Qiu

    Abstract: Visual place retrieval aims to search images in the database that depict similar places as the query image. However, global descriptors encoded by the network usually fall into a low dimensional principal space, which is harmful to the retrieval performance. We first analyze the cause of this phenomenon, pointing out that it is due to degraded distribution of the gradients of descriptors. Then, we… ▽ More

    Submitted 27 February, 2023; v1 submitted 22 April, 2022; originally announced April 2022.

    Comments: Accepted to the 2023 International Conference on Robotics and Automation (ICRA 2023)

  42. arXiv:2203.04670  [pdf, other

    cs.CV

    Structure-Aware Flow Generation for Human Body Reshaping

    Authors: Jianqiang Ren, Yuan Yao, Biwen Lei, Miaomiao Cui, Xuansong Xie

    Abstract: Body reshaping is an important procedure in portrait photo retouching. Due to the complicated structure and multifarious appearance of human bodies, existing methods either fall back on the 3D domain via body morphable model or resort to keypoint-based image deformation, leading to inefficiency and unsatisfied visual quality. In this paper, we address these limitations by formulating an end-to-end… ▽ More

    Submitted 10 March, 2022; v1 submitted 9 March, 2022; originally announced March 2022.

  43. arXiv:2202.08994  [pdf, other

    eess.IV cs.CV

    REFUGE2 Challenge: A Treasure Trove for Multi-Dimension Analysis and Evaluation in Glaucoma Screening

    Authors: Huihui Fang, Fei Li, Junde Wu, Huazhu Fu, Xu Sun, Jaemin Son, Shuang Yu, Menglu Zhang, Chenglang Yuan, Cheng Bian, Baiying Lei, Benjian Zhao, Xinxing Xu, Shaohua Li, Francisco Fumero, José Sigut, Haidar Almubarak, Yakoub Bazi, Yuanhao Guo, Yating Zhou, Ujjwal Baid, Shubham Innani, Tianjiao Guo, Jie Yang, José Ignacio Orlando , et al. (3 additional authors not shown)

    Abstract: With the rapid development of artificial intelligence (AI) in medical image processing, deep learning in color fundus photography (CFP) analysis is also evolving. Although there are some open-source, labeled datasets of CFPs in the ophthalmology community, large-scale datasets for screening only have labels of disease categories, and datasets with annotations of fundus structures are usually small… ▽ More

    Submitted 29 December, 2022; v1 submitted 17 February, 2022; originally announced February 2022.

    Comments: 29 pages, 21 figures

  44. ADAM Challenge: Detecting Age-related Macular Degeneration from Fundus Images

    Authors: Huihui Fang, Fei Li, Huazhu Fu, Xu Sun, Xingxing Cao, Fengbin Lin, Jaemin Son, Sunho Kim, Gwenole Quellec, Sarah Matta, Sharath M Shankaranarayana, Yi-Ting Chen, Chuen-heng Wang, Nisarg A. Shah, Chia-Yen Lee, Chih-Chung Hsu, Hai Xie, Baiying Lei, Ujjwal Baid, Shubham Innani, Kang Dang, Wenxiu Shi, Ravi Kamble, Nitin Singhal, Ching-Wei Wang , et al. (6 additional authors not shown)

    Abstract: Age-related macular degeneration (AMD) is the leading cause of visual impairment among elderly in the world. Early detection of AMD is of great importance, as the vision loss caused by this disease is irreversible and permanent. Color fundus photography is the most cost-effective imaging modality to screen for retinal disorders. Cutting edge deep learning based algorithms have been recently develo… ▽ More

    Submitted 6 May, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

    Comments: 31 pages, 17 figures

  45. arXiv:2111.12886  [pdf, other

    eess.IV cs.CV

    Morphological feature visualization of Alzheimer's disease via Multidirectional Perception GAN

    Authors: Wen Yu, Baiying Lei, Yanyan Shen, Shuqiang Wang, Yong Liu, Zhiguang Feng, Yong Hu, Michael K. Ng

    Abstract: The diagnosis of early stages of Alzheimer's disease (AD) is essential for timely treatment to slow further deterioration. Visualizing the morphological features for the early stages of AD is of great clinical value. In this work, a novel Multidirectional Perception Generative Adversarial Network (MP-GAN) is proposed to visualize the morphological features indicating the severity of AD for patient… ▽ More

    Submitted 24 November, 2021; originally announced November 2021.

  46. arXiv:2110.09302  [pdf, other

    cs.LG cs.AI cs.CV eess.IV

    A Prior Guided Adversarial Representation Learning and Hypergraph Perceptual Network for Predicting Abnormal Connections of Alzheimer's Disease

    Authors: Qiankun Zuo, Baiying Lei, Shuqiang Wang, Yong Liu, Bingchuan Wang, Yanyan Shen

    Abstract: Alzheimer's disease is characterized by alterations of the brain's structural and functional connectivity during its progressive degenerative processes. Existing auxiliary diagnostic methods have accomplished the classification task, but few of them can accurately evaluate the changing characteristics of brain connectivity. In this work, a prior guided adversarial representation learning and hyper… ▽ More

    Submitted 11 October, 2021; originally announced October 2021.

  47. arXiv:2110.05712  [pdf, other

    cs.LG cs.AI

    DecGAN: Decoupling Generative Adversarial Network detecting abnormal neural circuits for Alzheimer's disease

    Authors: Junren Pan, Baiying Lei, Shuqiang Wang, Bingchuan Wang, Yong Liu, Yanyan Shen

    Abstract: One of the main reasons for Alzheimer's disease (AD) is the disorder of some neural circuits. Existing methods for AD prediction have achieved great success, however, detecting abnormal neural circuits from the perspective of brain networks is still a big challenge. In this work, a novel decoupling generative adversarial network (DecGAN) is proposed to detect abnormal neural circuits for AD. Concr… ▽ More

    Submitted 11 October, 2021; originally announced October 2021.

  48. arXiv:2107.11010  [pdf, other

    eess.IV cs.CV

    3D Brain Reconstruction by Hierarchical Shape-Perception Network from a Single Incomplete Image

    Authors: Bowen Hu, Baiying Lei, Shuqiang Wang, Yong Liu, Bingchuan Wang, Min Gan, Yanyan Shen

    Abstract: 3D shape reconstruction is essential in the navigation of minimally-invasive and auto robot-guided surgeries whose operating environments are indirect and narrow, and there have been some works that focused on reconstructing the 3D shape of the surgical organ through limited 2D information available. However, the lack and incompleteness of such information caused by intraoperative emergencies (suc… ▽ More

    Submitted 11 October, 2021; v1 submitted 22 July, 2021; originally announced July 2021.

  49. arXiv:2107.09953  [pdf, other

    cs.CV

    Characterization Multimodal Connectivity of Brain Network by Hypergraph GAN for Alzheimer's Disease Analysis

    Authors: Junren Pan, Baiying Lei, Yanyan Shen, Yong Liu, Zhiguang Feng, Shuqiang Wang

    Abstract: Using multimodal neuroimaging data to characterize brain network is currently an advanced technique for Alzheimer's disease(AD) Analysis. Over recent years the neuroimaging community has made tremendous progress in the study of resting-state functional magnetic resonance imaging (rs-fMRI) derived from blood-oxygen-level-dependent (BOLD) signals and Diffusion Tensor Imaging (DTI) derived from white… ▽ More

    Submitted 21 July, 2021; originally announced July 2021.

  50. arXiv:2107.09928  [pdf, other

    cs.CV

    Multimodal Representations Learning and Adversarial Hypergraph Fusion for Early Alzheimer's Disease Prediction

    Authors: Qiankun Zuo, Baiying Lei, Yanyan Shen, Yong Liu, Zhiguang Feng, Shuqiang Wang

    Abstract: Multimodal neuroimage can provide complementary information about the dementia, but small size of complete multimodal data limits the ability in representation learning. Moreover, the data distribution inconsistency from different modalities may lead to ineffective fusion, which fails to sufficiently explore the intra-modal and inter-modal interactions and compromises the disease diagnosis perform… ▽ More

    Submitted 21 July, 2021; originally announced July 2021.

    Comments: 13 pages, 3 figures