Showing 1–47 of 47 results for author: Lu, C X

  1. arXiv:2405.14014  [pdf, other]

    cs.CV cs.AI cs.LG cs.RO

    RadarOcc: Robust 3D Occupancy Prediction with 4D Imaging Radar

    Authors: Fangqiang Ding, Xiangyu Wen, Lawrence Zhu, Yiming Li, Chris Xiaoxuan Lu

    Abstract: 3D occupancy-based perception pipeline has significantly advanced autonomous driving by capturing detailed scene descriptions and demonstrating strong generalizability across various object categories and shapes. Current methods predominantly rely on LiDAR or camera inputs for 3D occupancy prediction. These methods are susceptible to adverse weather conditions, limiting the all-weather deployment…

    Submitted 13 June, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

    Comments: 16 pages, 3 figures

  2. arXiv:2403.14526  [pdf, other]

    cs.RO cs.AI cs.CV

    Click to Grasp: Zero-Shot Precise Manipulation via Visual Diffusion Descriptors

    Authors: Nikolaos Tsagkas, Jack Rome, Subramanian Ramamoorthy, Oisin Mac Aodha, Chris Xiaoxuan Lu

    Abstract: Precise manipulation that is generalizable across scenes and objects remains a persistent challenge in robotics. Current approaches for this task heavily depend on having a significant number of training instances to handle objects with pronounced visual and/or geometric part ambiguities. Our work explores the grounding of fine-grained part descriptors for precise manipulation in a zero-shot setti…

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 8 pages, 4 figures

  3. arXiv:2403.09871  [pdf, other]

    cs.CV cs.AI cs.HC cs.LG

    ThermoHands: A Benchmark for 3D Hand Pose Estimation from Egocentric Thermal Images

    Authors: Fangqiang Ding, Lawrence Zhu, Xiangyu Wen, Gaowen Liu, Chris Xiaoxuan Lu

    Abstract: In this work, we present ThermoHands, a new benchmark for thermal image-based egocentric 3D hand pose estimation, aimed at overcoming challenges like varying lighting conditions and obstructions (e.g., handwear). The benchmark includes a multi-view and multi-spectral dataset collected from 28 subjects performing hand-object and hand-virtual interactions under diverse scenarios, accurately annotate…

    Submitted 13 June, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: 15 pages, 6 figures, 4 tables

  4. arXiv:2403.04908  [pdf, other]

    cs.CV

    Self-Adapting Large Visual-Language Models to Edge Devices across Visual Modalities

    Authors: Kaiwen Cai, Zhekai Duan, Gaowen Liu, Charles Fleming, Chris Xiaoxuan Lu

    Abstract: Recent advancements in Vision-Language (VL) models have sparked interest in their deployment on edge devices, yet challenges in handling diverse visual modalities, manual annotation, and computational constraints remain. We introduce EdgeVL, a novel framework that bridges this gap by seamlessly integrating dual-modality knowledge distillation and quantization-aware contrastive learning. This appro…

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: Under review

  5. arXiv:2311.13182  [pdf, other]

    cs.CV

    Differentiable Radio Frequency Ray Tracing for Millimeter-Wave Sensing

    Authors: Xingyu Chen, Xinyu Zhang, Qiyue Xia, Xinmin Fang, Chris Xiaoxuan Lu, Zhengxiong Li

    Abstract: Millimeter wave (mmWave) sensing is an emerging technology with applications in 3D object characterization and environment mapping. However, realizing precise 3D reconstruction from sparse mmWave signals remains challenging. Existing methods rely on data-driven learning, constrained by dataset availability and difficulty in generalization. We propose DiffSBR, a differentiable framework for mmWave-…

    Submitted 22 November, 2023; originally announced November 2023.

  6. arXiv:2311.10601  [pdf, other]

    cs.CV eess.SP

    Multimodal Indoor Localization Using Crowdsourced Radio Maps

    Authors: Zhaoguang Yi, Xiangyu Wen, Qiyue Xia, Peize Li, Francisco Zampella, Firas Alsehly, Chris Xiaoxuan Lu

    Abstract: Indoor Positioning Systems (IPS) traditionally rely on odometry and building infrastructures like WiFi, often supplemented by building floor plans for increased accuracy. However, the limitation of floor plans in terms of availability and timeliness of updates challenges their wide applicability. In contrast, the proliferation of smartphones and WiFi-enabled robots has made crowdsourced radio maps…

    Submitted 12 March, 2024; v1 submitted 17 November, 2023; originally announced November 2023.

    Comments: 7 pages, 4 figures; ICRA'24 https://youtu.be/NTTKwJBFN5w

  7. arXiv:2309.17336  [pdf, other]

    cs.CV cs.RO

    Robust 3D Object Detection from LiDAR-Radar Point Clouds via Cross-Modal Feature Augmentation

    Authors: Jianning Deng, Gabriel Chan, Hantao Zhong, Chris Xiaoxuan Lu

    Abstract: This paper presents a novel framework for robust 3D object detection from point clouds via cross-modal hallucination. Our proposed approach is agnostic to either hallucination direction between LiDAR and 4D radar. We introduce multiple alignments on both spatial and feature levels to achieve simultaneous backbone refinement and hallucination generation. Specifically, spatial alignment is proposed…

    Submitted 12 March, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

    Comments: Accepted to ICRA 2024. 8 pages, 4 figures. Equal contribution for Gabriel Chan and Hantao Zhong, listed randomly

  8. arXiv:2309.09737  [pdf, other]

    cs.CV cs.AI cs.LG cs.RO

    RaTrack: Moving Object Detection and Tracking with 4D Radar Point Cloud

    Authors: Zhijun Pan, Fangqiang Ding, Hantao Zhong, Chris Xiaoxuan Lu

    Abstract: Mobile autonomy relies on the precise perception of dynamic environments. Robustly tracking moving objects in 3D world thus plays a pivotal role for applications like trajectory prediction, obstacle avoidance, and path planning. While most current methods utilize LiDARs or cameras for Multiple Object Tracking (MOT), the capabilities of 4D imaging radars remain largely unexplored. Recognizing the c…

    Submitted 11 March, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: Accepted to ICRA 2024. 8 pages, 4 figures. Co-first authorship for Zhijun Pan, Fangqiang Ding and Hantao Zhong, listed randomly. See demo video at: https://www.youtube.com/watch?v=_uSpbxOlLGw

  9. arXiv:2308.14039  [pdf, other]

    cs.CV

    Deep Learning for Visual Localization and Mapping: A Survey

    Authors: Changhao Chen, Bing Wang, Chris Xiaoxuan Lu, Niki Trigoni, Andrew Markham

    Abstract: Deep learning based localization and mapping approaches have recently emerged as a new research direction and receive significant attentions from both industry and academia. Instead of creating hand-designed algorithms based on physical models or geometric theories, deep learning solutions provide an alternative to solve the problem in a data-driven way. Benefiting from the ever-increasing volumes…

    Submitted 27 August, 2023; originally announced August 2023.

    Comments: Accepted by IEEE Transactions on Neural Networks and Learning Systems. This is an updated version of arXiv:2006.12567

  10. arXiv:2307.07336  [pdf, other]

    cs.CV

    Risk Controlled Image Retrieval

    Authors: Kaiwen Cai, Chris Xiaoxuan Lu, Xingyu Zhao, Xiaowei Huang

    Abstract: Most image retrieval research focuses on improving predictive performance, ignoring scenarios where the reliability of the prediction is also crucial. Uncertainty quantification technique can be applied to mitigate this issue by assessing uncertainty for retrieval sets, but it can provide only a heuristic estimate of uncertainty rather than a guarantee. To address these limitations, we present Ris…

    Submitted 16 August, 2023; v1 submitted 14 July, 2023; originally announced July 2023.

  11. arXiv:2307.03623  [pdf, other]

    cs.CV cs.RO

    Robust Human Detection under Visual Degradation via Thermal and mmWave Radar Fusion

    Authors: Kaiwen Cai, Qiyue Xia, Peize Li, John Stankovic, Chris Xiaoxuan Lu

    Abstract: The majority of human detection methods rely on the sensor using visible lights (e.g., RGB cameras) but such sensors are limited in scenarios with degraded vision conditions. In this paper, we present a multimodal human detection system that combines portable thermal cameras and single-chip mmWave radars. To mitigate the noisy detection features caused by the low contrast of thermal cameras and th…

    Submitted 7 July, 2023; originally announced July 2023.

    Comments: To appear at the 2023 International Conference on Embedded Wireless Systems and Networks

  12. arXiv:2306.17010  [pdf, other]

    cs.CV cs.AI cs.LG

    milliFlow: Scene Flow Estimation on mmWave Radar Point Cloud for Human Motion Sensing

    Authors: Fangqiang Ding, Zhen Luo, Peijun Zhao, Chris Xiaoxuan Lu

    Abstract: Human motion sensing plays a crucial role in smart systems for decision-making, user interaction, and personalized services. Extensive research that has been conducted is predominantly based on cameras, whose intrusive nature limits their use in smart home applications. To address this, mmWave radars have gained popularity due to their privacy-friendly features. In this work, we propose milliFlow,…

    Submitted 12 July, 2024; v1 submitted 29 June, 2023; originally announced June 2023.

    Comments: 28 pages, 8 figures, 8 tables. This paper has been accepted by ECCV 2024. See the code and dataset at https://github.com/Toytiny/milliFlow

  13. arXiv:2305.12427  [pdf, other]

    cs.CV

    VL-Fields: Towards Language-Grounded Neural Implicit Spatial Representations

    Authors: Nikolaos Tsagkas, Oisin Mac Aodha, Chris Xiaoxuan Lu

    Abstract: We present Visual-Language Fields (VL-Fields), a neural implicit spatial representation that enables open-vocabulary semantic queries. Our model encodes and fuses the geometry of a scene with vision-language trained latent features by distilling information from a language-driven segmentation model. VL-Fields is trained without requiring any prior knowledge of the scene object classes, which makes…

    Submitted 25 May, 2023; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: Project page: https://tsagkas.github.io/vl-fields/

  14. arXiv:2305.10345  [pdf, other]

    eess.SP cs.AI cs.CV cs.MM

    MM-Fi: Multi-Modal Non-Intrusive 4D Human Dataset for Versatile Wireless Sensing

    Authors: Jianfei Yang, He Huang, Yunjiao Zhou, Xinyan Chen, Yuecong Xu, Shenghai Yuan, Han Zou, Chris Xiaoxuan Lu, Lihua Xie

    Abstract: 4D human perception plays an essential role in a myriad of applications, such as home automation and metaverse avatar simulation. However, existing solutions which mainly rely on cameras and wearable devices are either privacy intrusive or inconvenient to use. To address these issues, wireless sensing has emerged as a promising alternative, leveraging LiDAR, mmWave radar, and WiFi signals for devi…

    Submitted 24 September, 2023; v1 submitted 12 May, 2023; originally announced May 2023.

    Comments: The paper has been accepted by NeurIPS 2023 Datasets and Benchmarks Track. Project page: https://ntu-aiot-lab.github.io/mm-fi

  15. arXiv:2303.00462  [pdf, other]

    cs.CV cs.AI cs.LG cs.RO

    Hidden Gems: 4D Radar Scene Flow Learning Using Cross-Modal Supervision

    Authors: Fangqiang Ding, Andras Palffy, Dariu M. Gavrila, Chris Xiaoxuan Lu

    Abstract: This work proposes a novel approach to 4D radar-based scene flow estimation via cross-modal learning. Our approach is motivated by the co-located sensing redundancy in modern autonomous vehicles. Such redundancy implicitly provides various forms of supervision cues to the radar scene flow estimation. Specifically, we introduce a multi-task model architecture for the identified cross-modal learning…

    Submitted 17 March, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: 10 pages, 7 figures. Accepted by CVPR 2023. See our code at https://github.com/Toytiny/CMFlow. Supplementary materials can be found at https://drive.google.com/file/d/1Iewcqnjzecge2ePBM8k2tg-85LX5xs3N/view

  16. arXiv:2209.14602  [pdf, other]

    cs.RO cs.CV

    Uncertainty Estimation for 3D Dense Prediction via Cross-Point Embeddings

    Authors: Kaiwen Cai, Chris Xiaoxuan Lu, Xiaowei Huang

    Abstract: Dense prediction tasks are common for 3D point clouds, but the uncertainties inherent in massive points and their embeddings have long been ignored. In this work, we present CUE, a novel uncertainty estimation method for dense prediction tasks in 3D point clouds. Inspired by metric learning, the key idea of CUE is to explore cross-point embeddings upon a conventional 3D dense prediction pipeline.…

    Submitted 24 February, 2023; v1 submitted 29 September, 2022; originally announced September 2022.

    Comments: Accepted by IEEE Robotics and Automation Letters

  17. arXiv:2208.14326  [pdf, other]

    cs.CV cs.AI

    GaitFi: Robust Device-Free Human Identification via WiFi and Vision Multimodal Learning

    Authors: Lang Deng, Jianfei Yang, Shenghai Yuan, Han Zou, Chris Xiaoxuan Lu, Lihua Xie

    Abstract: As an important biomarker for human identification, human gait can be collected at a distance by passive sensors without subject cooperation, which plays an essential role in crime prevention, security detection and other human identification applications. At present, most research works are based on cameras and computer vision techniques to perform gait recognition. However, vision-based methods…

    Submitted 30 August, 2022; originally announced August 2022.

    Comments: 12 pages, 8 figures, accepted by IEEE Internet of Things Journal

  18. arXiv:2207.07896  [pdf, other]

    cs.CV

    Cross Vision-RF Gait Re-identification with Low-cost RGB-D Cameras and mmWave Radars

    Authors: Dongjiang Cao, Ruofeng Liu, Hao Li, Shuai Wang, Wenchao Jiang, Chris Xiaoxuan Lu

    Abstract: Human identification is a key requirement for many applications in everyday life, such as personalized services, automatic surveillance, continuous authentication, and contact tracing during pandemics, etc. This work studies the problem of cross-modal human re-identification (ReID), in response to the regular human movements across camera-allowed regions (e.g., streets) and camera-restricted regio…

    Submitted 16 July, 2022; originally announced July 2022.

    Comments: 24 pages, 20 figures, accepted to IMWUT

  19. arXiv:2207.07859  [pdf, other]

    cs.LG cs.AI eess.SP

    SenseFi: A Library and Benchmark on Deep-Learning-Empowered WiFi Human Sensing

    Authors: Jianfei Yang, Xinyan Chen, Dazhuo Wang, Han Zou, Chris Xiaoxuan Lu, Sumei Sun, Lihua Xie

    Abstract: WiFi sensing has been evolving rapidly in recent years. Empowered by propagation models and deep learning methods, many challenging applications are realized such as WiFi-based human activity recognition and gesture recognition. However, in contrast to deep learning for visual recognition and natural language processing, no sufficiently comprehensive public benchmark exists. In this paper, we revi…

    Submitted 17 February, 2023; v1 submitted 16 July, 2022; originally announced July 2022.

    Comments: A benchmark and model zoo for WiFi CSI Human sensing based on deep learning methods. Accepted by Patterns, Cell Press

  20. arXiv:2206.01589  [pdf, other]

    cs.RO

    OdomBeyondVision: An Indoor Multi-modal Multi-platform Odometry Dataset Beyond the Visible Spectrum

    Authors: Peize Li, Kaiwen Cai, Muhamad Risqi U. Saputra, Zhuangzhuang Dai, Chris Xiaoxuan Lu, Andrew Markham, Niki Trigoni

    Abstract: This paper presents a multimodal indoor odometry dataset, OdomBeyondVision, featuring multiple sensors across the different spectrum and collected with different mobile platforms. Not only does OdomBeyondVision contain the traditional navigation sensors, sensors such as IMUs, mechanical LiDAR, RGBD camera, it also includes several emerging sensors such as the single-chip mmWave radar, LWIR thermal…

    Submitted 14 September, 2022; v1 submitted 3 June, 2022; originally announced June 2022.

  21. arXiv:2203.01851  [pdf, other]

    cs.CV cs.RO

    STUN: Self-Teaching Uncertainty Estimation for Place Recognition

    Authors: Kaiwen Cai, Chris Xiaoxuan Lu, Xiaowei Huang

    Abstract: Place recognition is key to Simultaneous Localization and Mapping (SLAM) and spatial perception. However, a place recognition in the wild often suffers from erroneous predictions due to image variations, e.g., changing viewpoints and street appearance. Integrating uncertainty estimation into the life cycle of place recognition is a promising method to mitigate the impact of variations on place rec…

    Submitted 13 September, 2022; v1 submitted 3 March, 2022; originally announced March 2022.

    Comments: To appear at the 35th IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS2022)

  22. Self-Supervised Scene Flow Estimation with 4-D Automotive Radar

    Authors: Fangqiang Ding, Zhijun Pan, Yimin Deng, Jianning Deng, Chris Xiaoxuan Lu

    Abstract: Scene flow allows autonomous vehicles to reason about the arbitrary motion of multiple independent objects which is the key to long-term mobile autonomy. While estimating the scene flow from LiDAR has progressed recently, it remains largely unknown how to estimate the scene flow from a 4-D radar - an increasingly popular automotive sensor for its robustness against adverse weather and lighting con…

    Submitted 2 July, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

    Comments: Copyright (c) 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

    Journal ref: IEEE Robotics and Automation Letters (RA-L), 2022

  23. arXiv:2112.14887  [pdf, other]

    cs.RO

    DC-Loc: Accurate Automotive Radar Based Metric Localization with Explicit Doppler Compensation

    Authors: Pengen Gao, Shengkai Zhang, Wei Wang, Chris Xiaoxuan Lu

    Abstract: Automotive mmWave radar has been widely used in the automotive industry due to its small size, low cost, and complementary advantages to optical sensors (e.g., cameras, LiDAR, etc.) in adverse weathers, e.g., fog, raining, and snowing. On the other side, its large wavelength also poses fundamental challenges to perceive the environment. Recent advances have made breakthroughs on its inherent drawb…

    Submitted 21 February, 2022; v1 submitted 29 December, 2021; originally announced December 2021.

    Comments: 7 pages, accepted by IEEE Conference on Robotics and Automation (ICRA) 2022

  24. arXiv:2112.13937  [pdf, other]

    cs.AI cs.RO

    Multiagent Model-based Credit Assignment for Continuous Control

    Authors: Dongge Han, Chris Xiaoxuan Lu, Tomasz Michalak, Michael Wooldridge

    Abstract: Deep reinforcement learning (RL) has recently shown great promise in robotic continuous control tasks. Nevertheless, prior research in this vein center around the centralized learning setting that largely relies on the communication availability among all the components of a robot. However, agents in the real world often operate in a decentralised fashion without communication due to latency requi…

    Submitted 27 December, 2021; originally announced December 2021.

    Comments: To Appear in AAMAS2022 (Oral)

  25. arXiv:2112.05665  [pdf]

    cs.RO eess.SY

    Deep Odometry Systems on Edge with EKF-LoRa Backend for Real-Time Positioning in Adverse Environment

    Authors: Zhuangzhuang Dai, Muhamad Risqi U. Saputra, Chris Xiaoxuan Lu, Andrew Markham, Niki Trigoni

    Abstract: Ubiquitous positioning for pedestrian in adverse environment has served a long standing challenge. Despite dramatic progress made by Deep Learning, multi-sensor deep odometry systems yet pose a high computational cost and suffer from cumulative drifting errors over time. Thanks to the increasing computational power of edge devices, we propose a novel ubiquitous positioning solution by integrating…

    Submitted 10 December, 2021; originally announced December 2021.

  26. arXiv:2112.02469  [pdf, other]

    cs.CV cs.NE

    RADA: Robust Adversarial Data Augmentation for Camera Localization in Challenging Weather

    Authors: Jialu Wang, Muhamad Risqi U. Saputra, Chris Xiaoxuan Lu, Niki Trigoni, Andrew Markham

    Abstract: Camera localization is a fundamental and crucial problem for many robotic applications. In recent years, using deep-learning for camera-based localization has become a popular research direction. However, they lack robustness to large domain shifts, which can be caused by seasonal or illumination changes between training and testing data sets. Data augmentation is an attractive approach to tackle…

    Submitted 4 December, 2021; originally announced December 2021.

  27. arXiv:2111.03976  [pdf, other]

    cs.LG eess.SP

    CubeLearn: End-to-end Learning for Human Motion Recognition from Raw mmWave Radar Signals

    Authors: Peijun Zhao, Chris Xiaoxuan Lu, Bing Wang, Niki Trigoni, Andrew Markham

    Abstract: mmWave FMCW radar has attracted huge amount of research interest for human-centered applications in recent years, such as human gesture/activity recognition. Most existing pipelines are built upon conventional Discrete Fourier Transform (DFT) pre-processing and deep neural network classifier hybrid methods, with a majority of previous works focusing on designing the downstream classifier to improv…

    Submitted 6 November, 2021; originally announced November 2021.

  28. arXiv:2109.08652  [pdf, other]

    cs.RO

    AutoPlace: Robust Place Recognition with Single-chip Automotive Radar

    Authors: Kaiwen Cai, Bing Wang, Chris Xiaoxuan Lu

    Abstract: This paper presents a novel place recognition approach to autonomous vehicles by using low-cost, single-chip automotive radar. Aimed at improving recognition robustness and fully exploiting the rich information provided by this emerging automotive radar, our approach follows a principled pipeline that comprises (1) dynamic points removal from instant Doppler measurement, (2) spatial-temporal featu…

    Submitted 17 February, 2022; v1 submitted 17 September, 2021; originally announced September 2021.

    Comments: Accepted by IEEE Conference on Robotics and Automation (ICRA), 8 pages

  29. arXiv:2104.07196  [pdf, other]

    cs.CV cs.RO

    Graph-based Thermal-Inertial SLAM with Probabilistic Neural Networks

    Authors: Muhamad Risqi U. Saputra, Chris Xiaoxuan Lu, Pedro P. B. de Gusmao, Bing Wang, Andrew Markham, Niki Trigoni

    Abstract: Simultaneous Localization and Mapping (SLAM) system typically employ vision-based sensors to observe the surrounding environment. However, the performance of such systems highly depends on the ambient illumination conditions. In scenarios with adverse visibility or in the presence of airborne particulates (e.g. smoke, dust, etc.), alternative modalities such as those based on thermal imaging and i…

    Submitted 29 October, 2021; v1 submitted 14 April, 2021; originally announced April 2021.

    Comments: Accepted to IEEE Transactions on Robotics

  30. arXiv:2103.01055  [pdf, other]

    cs.CV

    P2-Net: Joint Description and Detection of Local Features for Pixel and Point Matching

    Authors: Bing Wang, Changhao Chen, Zhaopeng Cui, Jie Qin, Chris Xiaoxuan Lu, Zhengdi Yu, Peijun Zhao, Zhen Dong, Fan Zhu, Niki Trigoni, Andrew Markham

    Abstract: Accurately describing and detecting 2D and 3D keypoints is crucial to establishing correspondences across images and point clouds. Despite a plethora of learning-based 2D or 3D local feature descriptors and detectors having been proposed, the derivation of a shared descriptor and joint keypoint detector that directly matches pixels and points remains under-explored by the community. This work take…

    Submitted 29 July, 2021; v1 submitted 1 March, 2021; originally announced March 2021.

    Comments: ICCV 2021

  31. arXiv:2101.07061  [pdf, other]

    cs.LG

    Deep Inertial Odometry with Accurate IMU Preintegration

    Authors: Rooholla Khorrambakht, Chris Xiaoxuan Lu, Hamed Damirchi, Zhenghua Chen, Zhengguo Li

    Abstract: Inertial Measurement Units (IMUs) are interceptive modalities that provide ego-motion measurements independent of the environmental factors. They are widely adopted in various autonomous systems. Motivated by the limitations in processing the noisy measurements from these sensors using their mathematical models, researchers have recently proposed various deep learning architectures to estimate ine…

    Submitted 18 January, 2021; originally announced January 2021.

  32. arXiv:2011.06730  [pdf, other]

    cs.RO

    3-D Motion Capture of an Unmodified Drone with Single-chip Millimeter Wave Radar

    Authors: Peijun Zhao, Chris Xiaoxuan Lu, Bing Wang, Niki Trigoni, Andrew Markham

    Abstract: Accurate motion capture of aerial robots in 3-D is a key enabler for autonomous operation in indoor environments such as warehouses or factories, as well as driving forward research in these areas. The most commonly used solutions at present are optical motion capture (e.g. VICON) and Ultrawideband (UWB), but these are costly and cumbersome to deploy, due to their requirement of multiple cameras/s…

    Submitted 12 November, 2020; originally announced November 2020.

    Comments: Submitted to The 2021 International Conference on Robotics and Automation (ICRA 2021)

  33. arXiv:2010.13750  [pdf, other]

    cs.CV cs.LG cs.RO

    Demo Abstract: Indoor Positioning System in Visually-Degraded Environments with Millimetre-Wave Radar and Inertial Sensors

    Authors: Zhuangzhuang Dai, Muhamad Risqi U. Saputra, Chris Xiaoxuan Lu, Niki Trigoni, Andrew Markham

    Abstract: Positional estimation is of great importance in the public safety sector. Emergency responders such as fire fighters, medical rescue teams, and the police will all benefit from a resilient positioning system to deliver safe and effective emergency services. Unfortunately, satellite navigation (e.g., GPS) offers limited coverage in indoor environments. It is also not possible to rely on infrastruct…

    Submitted 26 October, 2020; originally announced October 2020.

    Comments: Appear as demo abstract at the ACM Conference on Embedded Networked Sensor Systems (SenSys 2020)

  34. arXiv:2006.12567  [pdf, other]

    cs.CV cs.LG cs.RO eess.IV

    A Survey on Deep Learning for Localization and Mapping: Towards the Age of Spatial Machine Intelligence

    Authors: Changhao Chen, Bing Wang, Chris Xiaoxuan Lu, Niki Trigoni, Andrew Markham

    Abstract: Deep learning based localization and mapping has recently attracted significant attention. Instead of creating hand-designed algorithms through exploitation of physical models or geometric theories, deep learning based solutions provide an alternative to solve the problem in a data-driven way. Benefiting from ever-increasing volumes of data and computational power, these methods are fast evolving…

    Submitted 29 June, 2020; v1 submitted 22 June, 2020; originally announced June 2020.

    Comments: 26 pages, 10 figures. Project website: https://github.com/changhao-chen/deep-learning-localization-mapping

  35. arXiv:2006.02266  [pdf, other]

    cs.RO

    milliEgo: Single-chip mmWave Radar Aided Egomotion Estimation via Deep Sensor Fusion

    Authors: Chris Xiaoxuan Lu, Muhamad Risqi U. Saputra, Peijun Zhao, Yasin Almalioglu, Pedro P. B. de Gusmao, Changhao Chen, Ke Sun, Niki Trigoni, Andrew Markham

    Abstract: Robust and accurate trajectory estimation of mobile agents such as people and robots is a key requirement for providing spatial awareness for emerging capabilities such as augmented reality or autonomous interaction. Although currently dominated by optical techniques e.g., visual-inertial odometry, these suffer from challenges with scene illumination or featureless surfaces. As an alternative, we…

    Submitted 19 October, 2020; v1 submitted 3 June, 2020; originally announced June 2020.

    Comments: Appear at the ACM Conference on Embedded Networked Sensor Systems (SenSys 2020)

  36. Nowhere to Hide: Cross-modal Identity Leakage between Biometrics and Devices

    Authors: Chris Xiaoxuan Lu, Yang Li, Yuanbo Xiangli, Zhengxiong Li

    Abstract: Along with the benefits of Internet of Things (IoT) come potential privacy risks, since billions of the connected devices are granted permission to track information about their users and communicate it to other parties over the Internet. Of particular interest to the adversary is the user identity which constantly plays an important role in launching attacks. While the exposure of a certain type…

    Submitted 21 January, 2020; originally announced January 2020.

    Comments: 12 pages

  37. arXiv:2001.04061  [pdf, other]

    cs.RO cs.LG

    Deep Learning based Pedestrian Inertial Navigation: Methods, Dataset and On-Device Inference

    Authors: Changhao Chen, Peijun Zhao, Chris Xiaoxuan Lu, Wei Wang, Andrew Markham, Niki Trigoni

    Abstract: Modern inertial measurements units (IMUs) are small, cheap, energy efficient, and widely employed in smart devices and mobile robots. Exploiting inertial data for accurate and reliable pedestrian navigation supports is a key component for emerging Internet-of-Things applications and services. Recently, there has been a growing interest in applying deep neural networks (DNNs) to motion sensing and…

    Submitted 12 January, 2020; originally announced January 2020.

    Comments: Accepted to IEEE Internet of Things Journal

  38. arXiv:1912.13077  [pdf, other]

    cs.CV cs.LG cs.RO

    Learning Selective Sensor Fusion for States Estimation

    Authors: Changhao Chen, Stefano Rosa, Chris Xiaoxuan Lu, Bing Wang, Niki Trigoni, Andrew Markham

    Abstract: Autonomous vehicles and mobile robotic systems are typically equipped with multiple sensors to provide redundancy. By integrating the observations from different sensors, these mobile agents are able to perceive the environment and estimate system states, e.g. locations and orientations. Although deep learning approaches for multimodal odometry estimation and localization have gained traction, the…

    Submitted 18 May, 2022; v1 submitted 30 December, 2019; originally announced December 2019.

    Comments: Accepted by IEEE Transactions on Neural Networks and Learning Systems (TNNLS). arXiv admin note: text overlap with arXiv:1903.01534

  39. arXiv:1912.04836  [pdf, other]

    cs.HC cs.CR cs.LG

    Snoopy: Sniffing Your Smartwatch Passwords via Deep Sequence Learning

    Authors: Chris Xiaoxuan Lu, Bowen Du, Hongkai Wen, Sen Wang, Andrew Markham, Ivan Martinovic, Yiran Shen, Niki Trigoni

    Abstract: Demand for smartwatches has taken off in recent years with new models which can run independently from smartphones and provide more useful features, becoming first-class mobile platforms. One can access online banking or even make payments on a smartwatch without a paired phone. This makes smartwatches more attractive and vulnerable to malicious attacks, which to date have been largely overlooked.…

    Submitted 11 December, 2019; v1 submitted 10 December, 2019; originally announced December 2019.

    Comments: 27 pages. Originally published at ACM UbiComp 2018. This version corrects some errors in the original version and adds a pointer to the released code & dataset

  40. arXiv:1909.07231  [pdf, other]

    cs.CV cs.LG cs.RO

    DeepTIO: A Deep Thermal-Inertial Odometry with Visual Hallucination

    Authors: Muhamad Risqi U. Saputra, Pedro P. B. de Gusmao, Chris Xiaoxuan Lu, Yasin Almalioglu, Stefano Rosa, Changhao Chen, Johan Wahlström, Wei Wang, Andrew Markham, Niki Trigoni

    Abstract: Visual odometry shows excellent performance in a wide range of environments. However, in visually-denied scenarios (e.g. heavy smoke or darkness), pose estimates degrade or even fail. Thermal cameras are commonly used for perception and inspection when the environment has low visibility. However, their use in odometry estimation is hampered by the lack of robust visual features. In part, this is a…

    Submitted 19 January, 2020; v1 submitted 16 September, 2019; originally announced September 2019.

    Comments: Accepted to IEEE Robotics and Automation Letters (RAL)

  41. Milli-RIO: Ego-Motion Estimation with Low-Cost Millimetre-Wave Radar

    Authors: Yasin Almalioglu, Mehmet Turan, Chris Xiaoxuan Lu, Niki Trigoni, Andrew Markham

    Abstract: Robust indoor ego-motion estimation has attracted significant interest in the last decades due to the fast-growing demand for location-based services in indoor environments. Among various solutions, frequency-modulated continuous-wave (FMCW) radar sensors in millimeter-wave (MMWave) spectrum are gaining more prominence due to their intrinsic advantages such as penetration capability and high accur…

    Submitted 6 March, 2020; v1 submitted 12 September, 2019; originally announced September 2019.

    Comments: Submitted to IEEE Sensors, 9 pages

  42. arXiv:1909.03557  [pdf, other]

    cs.CV

    AtLoc: Attention Guided Camera Localization

    Authors: Bing Wang, Changhao Chen, Chris Xiaoxuan Lu, Peijun Zhao, Niki Trigoni, Andrew Markham

    Abstract: Deep learning has achieved impressive results in camera localization, but current single-image techniques typically suffer from a lack of robustness, leading to large outliers. To some extent, this has been tackled by sequential (multi-images) or geometry constraint approaches, which can learn to reject dynamic objects and illumination conditions to achieve better performance. In this work, we sho…

    Submitted 28 October, 2019; v1 submitted 8 September, 2019; originally announced September 2019.

  43. arXiv:1908.09002  [pdf, other]

    cs.CV cs.LG cs.NI stat.ML

    Autonomous Learning for Face Recognition in the Wild via Ambient Wireless Cues

    Authors: Chris Xiaoxuan Lu, Xuan Kan, Bowen Du, Changhao Chen, Hongkai Wen, Andrew Markham, Niki Trigoni, John Stankovic

    Abstract: Facial recognition is a key enabling component for emerging Internet of Things (IoT) services such as smart homes or responsive offices. Through the use of deep neural networks, facial recognition has achieved excellent performance. However, this is only possible when trained with hundreds of images of each user in different viewing and lighting conditions. Clearly, this level of effort in enrolme…

    Submitted 14 August, 2019; originally announced August 2019.

    Comments: 11 pages, accepted in the Web Conference (WWW'2019)

  44. arXiv:1908.03918  [pdf, other]

    cs.LG cs.CV cs.RO stat.ML

    DynaNet: Neural Kalman Dynamical Model for Motion Estimation and Prediction

    Authors: Changhao Chen, Chris Xiaoxuan Lu, Bing Wang, Niki Trigoni, Andrew Markham

    Abstract: Dynamical models estimate and predict the temporal evolution of physical systems. State Space Models (SSMs) in particular represent the system dynamics with many desirable properties, such as being able to model uncertainty in both the model and measurements, and optimal (in the Bayesian sense) recursive formulations, e.g. the Kalman Filter. However, they require significant domain knowledge to der…

    Submitted 11 September, 2021; v1 submitted 11 August, 2019; originally announced August 2019.

    Comments: Accepted by IEEE Transactions on Neural Networks and Learning Systems (TNNLS)

  45. arXiv:1903.01534  [pdf, other]

    cs.CV cs.AI cs.LG cs.RO

    Selective Sensor Fusion for Neural Visual-Inertial Odometry

    Authors: Changhao Chen, Stefano Rosa, Yishu Miao, Chris Xiaoxuan Lu, Wei Wu, Andrew Markham, Niki Trigoni

    Abstract: Deep learning approaches for Visual-Inertial Odometry (VIO) have proven successful, but they rarely focus on incorporating robust fusion strategies for dealing with imperfect input sensory data. We propose a novel end-to-end selective sensor fusion framework for monocular VIO, which fuses monocular images and inertial measurements in order to estimate the trajectory whilst improving robustness to…

    Submitted 4 March, 2019; originally announced March 2019.

    Comments: Accepted by CVPR 2019

  46. arXiv:1810.02076  [pdf, other]

    cs.LG cs.CV cs.RO stat.ML

    Transferring Physical Motion Between Domains for Neural Inertial Tracking

    Authors: Changhao Chen, Yishu Miao, Chris Xiaoxuan Lu, Phil Blunsom, Andrew Markham, Niki Trigoni

    Abstract: Inertial information processing plays a pivotal role in ego-motion awareness for mobile agents, as inertial measurements are entirely egocentric and not environment dependent. However, they are affected greatly by changes in sensor placement/orientation or motion dynamics, and it is infeasible to collect labelled data from every domain. To overcome the challenges of domain adaptation on long senso…

    Submitted 4 October, 2018; originally announced October 2018.

    Comments: NIPS 2018 workshop on Modeling the Physical World: Perception, Learning, and Control. A complete version will be released soon

  47. arXiv:1809.07491  [pdf, other]

    cs.RO cs.CV cs.LG

    OxIOD: The Dataset for Deep Inertial Odometry

    Authors: Changhao Chen, Peijun Zhao, Chris Xiaoxuan Lu, Wei Wang, Andrew Markham, Niki Trigoni

    Abstract: Advances in micro-electro-mechanical (MEMS) techniques enable inertial measurement units (IMUs) to be small, cheap, energy efficient, and widely used in smartphones, robots, and drones. Exploiting inertial data for accurate and reliable navigation and localization has attracted significant research and industrial interest, as IMU measurements are completely ego-centric and generally environment a…

    Submitted 20 September, 2018; originally announced September 2018.