Showing 1–24 of 24 results for author: Lang, X

  1. arXiv:2406.02147  [pdf, other]

    cs.CV

    UA-Track: Uncertainty-Aware End-to-End 3D Multi-Object Tracking

    Authors: Lijun Zhou, Tao Tang, Pengkun Hao, Zihang He, Kalok Ho, Shuo Gu, Wenbo Hou, Zhihui Hao, Haiyang Sun, Kun Zhan, Peng Jia, Xianpeng Lang, Xiaodan Liang

    Abstract: 3D multiple object tracking (MOT) plays a crucial role in autonomous driving perception. Recent end-to-end query-based trackers simultaneously detect and track objects, which have shown promising potential for the 3D MOT task. However, existing methods overlook the uncertainty issue, which refers to the lack of precise confidence about the state and location of tracked objects. Uncertainty arises…

    Submitted 4 June, 2024; originally announced June 2024.

  2. arXiv:2406.01587  [pdf, other]

    cs.RO

    PlanAgent: A Multi-modal Large Language Agent for Closed-loop Vehicle Motion Planning

    Authors: Yupeng Zheng, Zebin Xing, Qichao Zhang, Bu Jin, Pengfei Li, Yuhang Zheng, Zhongpu Xia, Kun Zhan, Xianpeng Lang, Yaran Chen, Dongbin Zhao

    Abstract: Vehicle motion planning is an essential component of autonomous driving technology. Current rule-based vehicle motion planning methods perform satisfactorily in common scenarios but struggle to generalize to long-tailed situations. Meanwhile, learning-based methods have yet to achieve superior performance over rule-based approaches in large-scale closed-loop scenarios. To address these issues, we…

    Submitted 4 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: This work has been submitted to the IEEE for possible publication

  3. arXiv:2406.01349  [pdf, other]

    cs.CV

    Unleashing Generalization of End-to-End Autonomous Driving with Controllable Long Video Generation

    Authors: Enhui Ma, Lijun Zhou, Tao Tang, Zhan Zhang, Dong Han, Junpeng Jiang, Kun Zhan, Peng Jia, Xianpeng Lang, Haiyang Sun, Di Lin, Kaicheng Yu

    Abstract: Using generative models to synthesize new data has become a de-facto standard in autonomous driving to address the data scarcity issue. Though existing approaches are able to boost perception models, we discover that these approaches fail to improve the performance of planning of end-to-end autonomous driving models as the generated videos are usually less than 8 frames and the spatial and tempora…

    Submitted 6 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: Project Page: https://westlake-autolab.github.io/delphi.github.io/, 8 figures

  4. arXiv:2405.13651  [pdf]

    cs.AI cs.RO

    ConcertoRL: An Innovative Time-Interleaved Reinforcement Learning Approach for Enhanced Control in Direct-Drive Tandem-Wing Vehicles

    Authors: Minghao Zhang, Bifeng Song, Changhao Chen, Xinyu Lang

    Abstract: In control problems for insect-scale direct-drive experimental platforms under tandem wing influence, the primary challenge facing existing reinforcement learning models is their limited safety in the exploration process and the stability of the continuous training process. We introduce the ConcertoRL algorithm to enhance control precision and stabilize the online training process, which consists…

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 48 pages, 35 figures

    MSC Class: 68T40 ACM Class: I.2.9

  5. arXiv:2405.10874  [pdf, other]

    cs.RO

    Square-Root Inverse Filter-based GNSS-Visual-Inertial Navigation

    Authors: Jun Hu, Xiaoming Lang, Feng Zhang, Yinian Mao, Guoquan Huang

    Abstract: While Global Navigation Satellite System (GNSS) is often used to provide global positioning if available, its intermittency and/or inaccuracy calls for fusion with other sensors. In this paper, we develop a novel GNSS-Visual-Inertial Navigation System (GVINS) that fuses visual, inertial, and raw GNSS measurements within the square-root inverse sliding window filtering (SRI-SWF) framework in a tigh…

    Submitted 17 May, 2024; originally announced May 2024.

  6. arXiv:2405.02207  [pdf]

    physics.chem-ph

    Water Structure and Electric Fields at the Interface of Oil Droplets

    Authors: Lixue Shi, R. Allen LaCour, Xiaoqi Lang, Joseph P. Heindel, Teresa Head-Gordon, Wei Min

    Abstract: Mesoscale water-hydrophobic interfaces are of fundamental importance in multiple disciplines, but their molecular properties have remained elusive for decades due to experimental complications and alternate theoretical explanations. Surface-specific spectroscopies, such as vibrational sum-frequency techniques, suffer from either sample preparation issues or the need for complex spectral correction…

    Submitted 3 May, 2024; originally announced May 2024.

  7. arXiv:2404.06926  [pdf, other]

    cs.RO

    Gaussian-LIC: Photo-realistic LiDAR-Inertial-Camera SLAM with 3D Gaussian Splatting

    Authors: Xiaolei Lang, Laijian Li, Hang Zhang, Feng Xiong, Mu Xu, Yong Liu, Xingxing Zuo, Jiajun Lv

    Abstract: We present a real-time LiDAR-Inertial-Camera SLAM system with 3D Gaussian Splatting as the mapping backend. Leveraging robust pose estimates from our LiDAR-Inertial-Camera odometry, Coco-LIC, an incremental photo-realistic mapping system is proposed in this paper. We initialize 3D Gaussians from colorized LiDAR points and optimize them using differentiable rendering powered by 3D Gaussian Splattin…

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: Submitted to IROS 2024

  8. arXiv:2402.12289  [pdf, other]

    cs.CV

    DriveVLM: The Convergence of Autonomous Driving and Large Vision-Language Models

    Authors: Xiaoyu Tian, Junru Gu, Bailin Li, Yicheng Liu, Yang Wang, Zhiyong Zhao, Kun Zhan, Peng Jia, Xianpeng Lang, Hang Zhao

    Abstract: A primary hurdle of autonomous driving in urban environments is understanding complex and long-tail scenarios, such as challenging road conditions and delicate human behaviors. We introduce DriveVLM, an autonomous driving system leveraging Vision-Language Models (VLMs) for enhanced scene understanding and planning capabilities. DriveVLM integrates a unique combination of reasoning modules for scen…

    Submitted 25 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: Project Page: https://tsinghua-mars-lab.github.io/DriveVLM/

  9. arXiv:2401.01339  [pdf, other]

    cs.CV cs.GR

    Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting

    Authors: Yunzhi Yan, Haotong Lin, Chenxu Zhou, Weijie Wang, Haiyang Sun, Kun Zhan, Xianpeng Lang, Xiaowei Zhou, Sida Peng

    Abstract: This paper aims to tackle the problem of modeling dynamic urban streets for autonomous driving scenes. Recent methods extend NeRF by incorporating tracked vehicle poses to animate vehicles, enabling photo-realistic view synthesis of dynamic urban street scenes. However, significant limitations are their slow training and rendering speed. We introduce Street Gaussians, a new explicit scene represen…

    Submitted 16 July, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

    Comments: Project page: https://zju3dv.github.io/street_gaussians/

  10. arXiv:2401.01065  [pdf, other]

    cs.CV cs.AI

    BEV-TSR: Text-Scene Retrieval in BEV Space for Autonomous Driving

    Authors: Tao Tang, Dafeng Wei, Zhengyu Jia, Tian Gao, Changwei Cai, Chengkai Hou, Peng Jia, Kun Zhan, Haiyang Sun, Jingchen Fan, Yixing Zhao, Fu Liu, Xiaodan Liang, Xianpeng Lang, Yang Wang

    Abstract: The rapid development of the autonomous driving industry has led to a significant accumulation of autonomous driving data. Consequently, there comes a growing demand for retrieving data to provide specialized optimization. However, directly applying previous image retrieval methods faces several challenges, such as the lack of global feature representation and inadequate text retrieval ability for…

    Submitted 18 June, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

  11. Coco-LIC: Continuous-Time Tightly-Coupled LiDAR-Inertial-Camera Odometry using Non-Uniform B-spline

    Authors: Xiaolei Lang, Chao Chen, Kai Tang, Yukai Ma, Jiajun Lv, Yong Liu, Xingxing Zuo

    Abstract: In this paper, we propose an efficient continuous-time LiDAR-Inertial-Camera Odometry, utilizing non-uniform B-splines to tightly couple measurements from the LiDAR, IMU, and camera. In contrast to uniform B-spline-based continuous-time methods, our non-uniform B-spline approach offers significant advantages in terms of achieving real-time efficiency and high accuracy. This is accomplished by dyna…

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: Accepted by RAL 2023

  12. arXiv:2307.04070  [pdf, ps, other]

    econ.TH

    A Belief-Based Characterization of Reduced-Form Auctions

    Authors: Xu Lang

    Abstract: We study games of chance (e.g., poker, dice, horse races) in the form of agents' first-order posterior beliefs about game outcomes. We ask for any profile of agents' posterior beliefs, is there a game that can generate these beliefs? We completely characterize all feasible joint posterior beliefs from these games. The characterization enables us to find a new variant of Border's inequalities (Bo…

    Submitted 8 July, 2023; originally announced July 2023.

    Comments: Games of Chance, Posterior Beliefs, Reduced Form Auctions, Aumann's Agreement Theorem, Bayesian Persuasion

  13. arXiv:2302.14350  [pdf, other]

    cs.CV

    Knowledge Augmented Relation Inference for Group Activity Recognition

    Authors: Xianglong Lang, Zhuming Wang, Zun Li, Meng Tian, Ge Shi, Lifang Wu, Liang Wang

    Abstract: Most existing group activity recognition methods construct spatial-temporal relations merely based on visual representation. Some methods introduce extra knowledge, such as action labels, to build semantic relations and use them to refine the visual presentation. However, the knowledge they explore stays at the semantic level, which is insufficient for pursuing notable accuracy. In this paper,…

    Submitted 1 March, 2023; v1 submitted 28 February, 2023; originally announced February 2023.

  14. arXiv:2302.07456  [pdf, other]

    cs.RO

    Continuous-Time Fixed-Lag Smoothing for LiDAR-Inertial-Camera SLAM

    Authors: Jiajun Lv, Xiaolei Lang, Jinhong Xu, Mengmeng Wang, Yong Liu, Xingxing Zuo

    Abstract: Localization and mapping with heterogeneous multi-sensor fusion have been prevalent in recent years. To adequately fuse multi-modal sensor measurements received at different time instants and different frequencies, we estimate the continuous-time trajectory by fixed-lag smoothing within a factor-graph optimization framework. With the continuous-time formulation, we can query poses at any time inst…

    Submitted 14 February, 2023; originally announced February 2023.

  15. arXiv:2211.06830  [pdf, ps, other]

    econ.TH

    Two-Person Bargaining when the Disagreement Point is Private Information

    Authors: Eric van Damme, Xu Lang

    Abstract: We consider two-person bargaining problems in which (only) the disagreement outcome is private (and possibly correlated) information and it is common knowledge that disagreement is inefficient. We show that if the Pareto frontier is linear, the outcome of an ex post efficient mechanism cannot depend on the disagreement payoffs. If the frontier is non-linear, the result continues to hold when the d…

    Submitted 9 January, 2024; v1 submitted 13 November, 2022; originally announced November 2022.

    Comments: bargaining problem, incomplete information, axiomatic method, efficiency, disagreement, correlation

  16. arXiv:2208.12008  [pdf, other]

    cs.RO

    Ctrl-VIO: Continuous-Time Visual-Inertial Odometry for Rolling Shutter Cameras

    Authors: Xiaolei Lang, Jiajun Lv, Jianxin Huang, Yukai Ma, Yong Liu, Xingxing Zuo

    Abstract: In this paper, we propose a probabilistic continuous-time visual-inertial odometry (VIO) for rolling shutter cameras. The continuous-time trajectory formulation naturally facilitates the fusion of asynchronized high-frequency IMU data and motion-distorted rolling shutter images. To prevent intractable computation load, the proposed VIO is sliding-window and keyframe-based. We propose to probabilis…

    Submitted 25 August, 2022; originally announced August 2022.

    Journal ref: 2022 RAL

  17. arXiv:2207.09253  [pdf, ps, other]

    econ.TH

    Symmetric reduced form voting

    Authors: Xu Lang, Debasis Mishra

    Abstract: We study a model of voting with two alternatives in a symmetric environment. We characterize the interim allocation probabilities that can be implemented by a symmetric voting rule. We show that every such interim allocation probability can be implemented as a convex combination of two families of deterministic voting rules: qualified majority and qualified anti-majority. We also provide analogo…

    Submitted 3 April, 2023; v1 submitted 19 July, 2022; originally announced July 2022.

  18. arXiv:2205.13442  [pdf, ps, other]

    math.NT

    Rational points on $x^{3} + x^{2} y^{2} + y^{3} = k$

    Authors: Xiaoan Lang, Jeremy Rouse

    Abstract: We study the problem of determining, given an integer $k$, the rational solutions to $C_{k} : x^{3}z + x^{2} y^{2} + y^{3}z = kz^{4}$. For $k \ne 0$, the curve $C_{k}$ has genus $3$ and there are maps from $C_{k}$ to three elliptic curves $E_{1,k}$, $E_{2,k}$, $E_{3,k}$. We explicitly determine the rational points on $C_{k}$ under the assumption that one of these elliptic curves has rank zero. We…

    Submitted 23 March, 2023; v1 submitted 26 May, 2022; originally announced May 2022.

    Comments: 18 pages

    MSC Class: Primary 11G05; Secondary 11G30; 14H45

  19. arXiv:2202.06245  [pdf, ps, other]

    econ.TH cs.GT

    Reduced-Form Allocations with Complementarity: A 2-Person Case

    Authors: Xu Lang

    Abstract: We investigate the implementation of reduced-form allocation probabilities in a two-person bargaining problem without side payments, where the agents have to select one alternative from a finite set of social alternatives. We provide a necessary and sufficient condition for the implementability. We find that the implementability condition in bargaining has some new feature compared to Border's the…

    Submitted 22 February, 2022; v1 submitted 13 February, 2022; originally announced February 2022.

    Comments: 23 pages

  20. arXiv:1501.00570  [pdf, ps, other]

    cond-mat.mes-hall

    Qubit detection with a T-shaped double quantum dot detector

    Authors: JunYan Luo, HuJun Jiao, Jing Hu, Xiao-Ling He, XiaoLi Lang, Shi-Kuan Wang

    Abstract: We propose to continuously monitor a charge qubit by utilizing a T-shaped double quantum dot detector, in which the qubit and double dot are arranged in such a unique way that the detector turns out to be particularly susceptible to the charge states of the qubit. Special attention is paid to the regime where acquisition of qubit information and backaction upon the measured system exhibit nontrivi…

    Submitted 15 January, 2015; v1 submitted 3 January, 2015; originally announced January 2015.

    Comments: 6 figures, typos corrected

  21. Inelastic electron tunneling spectroscopy of nanoporous gold films

    Authors: H. W. Liu, R. Nishitani, T. Fujita, W. Li, L. Zhang, X. Y. Lang, P. Richard, K. S. Nakayama, X. Chen, M. W. Chen, Q. K. Xue

    Abstract: We investigated the localized electronic properties of nanoporous gold films by using an ultra-high vacuum scanning tunneling microscope at low temperature (4.2 K). Second derivative scanning tunneling spectroscopy shows the plasmon peaks of the nanoporous gold films, which are excited by inelastic tunneling electrons. We propose that the nanorod model is appropriate for nanoporous gold studies at…

    Submitted 9 June, 2014; originally announced June 2014.

    Comments: 6 pages, 3 figures. This is the authors' version. The published, high resolution version of this paper, Copyright (2014) by the American Physical Society, can be found at http://journals.aps.org/prb/

    Journal ref: Physical Review B 89, 035426 (2014)

  22. arXiv:1405.1662  [pdf]

    cond-mat.mtrl-sci

    Directly grown monolayer MoS2 on Au foils as efficient hydrogen evolution catalysts

    Authors: Jianping Shi, Donglin Ma, Gao-Feng Han, Yu Zhang, Qingqing Ji, Teng Gao, Jingyu Sun, Cong Li, Xing-You Lang, Yanfeng Zhang, Zhongfan Liu

    Abstract: Synthesis of monolayer MoS2 is essential for fulfilling the potential of MoS2 in catalysis, optoelectronics and valleytronics, etc. Herein, we report for the first time the scalable growth of high quality, domain size tunable (edge length from ~ 200 nm to 50 μm), strictly monolayer MoS2 on commercially available Au foils, via a low pressure chemical vapor deposition method. The nanosized triangula…

    Submitted 7 May, 2014; originally announced May 2014.

    Comments: 28 pages, 5 figures

  23. Conditional spin counting statistics as a probe of Coulomb interaction and spin-resolved bunching

    Authors: JunYan Luo, Jing Hu, XiaoLi Lang, Yu Shen, Xiao-Ling He, HuJun Jiao

    Abstract: Full counting statistics is a powerful tool to characterize the noise and correlations in transport through mesoscopic systems. In this work, we propose the theory of conditional spin counting statistics, i.e., the statistical fluctuations of spin-up (down) current given the observation of the spin-down (up) current. In the context of transport through a single quantum dot, it is demonstrated that…

    Submitted 16 February, 2014; v1 submitted 22 February, 2013; originally announced February 2013.

    Comments: 9 pages, 7 figures

    Journal ref: Phys. Lett. A 378, 892-898 (2014)

  24. arXiv:1203.2233  [pdf, ps, other]

    cond-mat.mes-hall

    Non-Markovian dynamics and noise characteristics in continuous measurement of a solid-state charge qubit

    Authors: JunYan Luo, HuJun Jiao, Xiao-Li Lang, BiTao Xiong, Xiao-Ling He

    Abstract: We investigate the non-Markovian characteristics in continuous measurement of a charge qubit by a quantum point contact. The backflow of information from the reservoir to the system in the non-Markovian domain gives rise to strikingly different qubit relaxation and dephasing in comparison with the Markovian case. The intriguing non-Markovian dynamics is found to have a direct impact on the output…

    Submitted 17 August, 2013; v1 submitted 10 March, 2012; originally announced March 2012.

    Comments: 10 pages, 7 figures