Showing 1–50 of 98 results for author: Weng, X

  1. arXiv:2407.00959  [pdf, other]

    cs.AI cs.RO

    Tokenize the World into Object-level Knowledge to Address Long-tail Events in Autonomous Driving

    Authors: Ran Tian, Boyi Li, Xinshuo Weng, Yuxiao Chen, Edward Schmerling, Yue Wang, Boris Ivanovic, Marco Pavone

    Abstract: The autonomous driving industry is increasingly adopting end-to-end learning from sensory inputs to minimize human biases in system design. Traditional end-to-end driving models, however, suffer from long-tail events due to rare or unseen inputs within their training distributions. To address this, we propose TOKEN, a novel Multi-Modal Large Language Model (MM-LLM) that tokenizes the world into ob…

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2406.15349  [pdf, other]

    cs.CV cs.AI cs.LG cs.RO

    NAVSIM: Data-Driven Non-Reactive Autonomous Vehicle Simulation and Benchmarking

    Authors: Daniel Dauner, Marcel Hallgarten, Tianyu Li, Xinshuo Weng, Zhiyu Huang, Zetong Yang, Hongyang Li, Igor Gilitschenski, Boris Ivanovic, Marco Pavone, Andreas Geiger, Kashyap Chitta

    Abstract: Benchmarking vision-based driving policies is challenging. On one hand, open-loop evaluation with real data is easy, but these results do not reflect closed-loop performance. On the other, closed-loop evaluation is possible in simulation, but is hard to scale due to its significant computational demands. Further, the simulators available today exhibit a large domain gap to real data. This has resu…

    Submitted 21 June, 2024; originally announced June 2024.

  3. arXiv:2406.07077  [pdf, other]

    eess.SY

    Meta-Backscatter: A New ISAC Paradigm for Battery-Free Internet of Things

    Authors: Xu Liu, Hongliang Zhang, Kaigui Bian, Xi Weng, Lingyang Song

    Abstract: The meta-material sensor has been regarded as a next-generation sensing technology for the battery-free Internet of Things (IoT) due to its battery-free characteristic and improved sensing performance. The meta-material sensors function as backscatter tags that change their reflection coefficients with the conditions of sensing targets such as temperature and gas concentration, allowing transceive…

    Submitted 11 June, 2024; originally announced June 2024.

  4. arXiv:2406.05414  [pdf, other]

    nucl-th

    Angular Momentum-Resolved Inelastic Electron Scattering for Nuclear Giant Resonances

    Authors: Zhi-Wei Lu, Liang Guo, Mamutjan Ababekri, Jia-lin Zhang, Xiu-Feng Weng, Yuanbin Wu, Yi-Fei Niu, Jian-Xing Li

    Abstract: Giant resonances (GRs) provide crucial insights into nuclear physics and astrophysics. Exciting GRs using particles like electrons is effective, yet the angular momentum (AM) transfer of electrons, including both intrinsic spin and orbital degrees of freedom in inelastic scattering, has never been studied. Here, we investigate AM transfer in GRs excited by plane-wave and vortex electrons, developi…

    Submitted 8 June, 2024; originally announced June 2024.

  5. arXiv:2405.20714  [pdf]

    cond-mat.mtrl-sci cond-mat.str-el

    Large low-field magnetocaloric response in a ferromagnetic gadolinium orthophosphate

    Authors: Ziyu W. Yang, Jie Zhang, Maocai Pi, Xubin Ye, Chenxu Kang, Xiaoliang Weng, Wei Tang, Hongzhi Cui, Yu-Jia Zeng, Youwen Long

    Abstract: Bulk magnetic and thermodynamic measurements, along with mean-field calculations, were conducted on the ferromagnetic K3Gd5(PO4)6 powders. No magnetic ordering was observed until 2 K, while the application of an external field B > 1 T resulted in the splitting of the Gd3+ ground state multiplet and induced a non-cooperative Schottky effect. The average nearest-neighbor exchange strength |J1/kB| is…

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: 7 pages, 5 figures

  6. arXiv:2405.19039  [pdf, other]

    hep-ph hep-ex hep-lat

    Heavy baryons in the relativized quark model with chromodynamics

    Authors: Xin-Zhen Weng, Wei-Zhen Deng, Shi-Lin Zhu

    Abstract: Following the work of Capstick and Isgur [Phys. Rev. D 34, 2809 (1986), https://doi.org/10.1103/PhysRevD.34.2809], we systematically study the mass spectrum of the heavy baryons in the relativized quark potential model with chromodynamics. Besides the original Godfrey-Isgur (GI) model, we also adopt a modified GI model which replaces the linear confinement by a screened one. The two models…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 20 pages, 10 figures

  7. arXiv:2405.11788  [pdf, other]

    cs.LG

    TinyLLaVA Factory: A Modularized Codebase for Small-scale Large Multimodal Models

    Authors: Junlong Jia, Ying Hu, Xi Weng, Yiming Shi, Miao Li, Xingjian Zhang, Baichuan Zhou, Ziyu Liu, Jie Luo, Lei Huang, Ji Wu

    Abstract: We present TinyLLaVA Factory, an open-source modular codebase for small-scale large multimodal models (LMMs) with a focus on simplicity of code implementations, extensibility of new features, and reproducibility of training results. Following the design philosophy of the factory pattern in software engineering, TinyLLaVA Factory modularizes the entire system into interchangeable components, with e…

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: Our codebase is made public at https://github.com/TinyLLaVA/TinyLLaVA_Factory with documentation available at https://tinyllava-factory.readthedocs.io/en/latest/

  8. arXiv:2405.03685  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    Language-Image Models with 3D Understanding

    Authors: Jang Hyun Cho, Boris Ivanovic, Yulong Cao, Edward Schmerling, Yue Wang, Xinshuo Weng, Boyi Li, Yurong You, Philipp Krähenbühl, Yan Wang, Marco Pavone

    Abstract: Multi-modal large language models (MLLMs) have shown incredible capabilities in a variety of 2D vision and language tasks. We extend MLLMs' perceptual capabilities to ground and reason about images in 3-dimensional space. To that end, we first develop a large-scale pre-training dataset for 2D and 3D called LV3D by combining multiple existing 2D and 3D recognition datasets under a common task formu…

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: Project page: https://janghyuncho.github.io/Cube-LLM

  9. arXiv:2403.20213  [pdf, other]

    cs.CV

    H2RSVLM: Towards Helpful and Honest Remote Sensing Large Vision Language Model

    Authors: Chao Pang, Jiang Wu, Jiayu Li, Yi Liu, Jiaxing Sun, Weijia Li, Xingxing Weng, Shuai Wang, Litong Feng, Gui-Song Xia, Conghui He

    Abstract: Generic large Vision-Language Models (VLMs) are developing rapidly but still perform poorly in the Remote Sensing (RS) domain, owing to the unique and specialized nature of RS imagery and the comparatively limited spatial perception of current VLMs. Existing Remote Sensing specific Vision Language Models (RSVLMs) still have considerable potential for improvement, primarily owing to the lack…

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: Equal contribution: Chao Pang, Jiang Wu; Corresponding author: Gui-Song Xia, Conghui He

  10. arXiv:2402.14289  [pdf, other]

    cs.LG cs.CL

    TinyLLaVA: A Framework of Small-scale Large Multimodal Models

    Authors: Baichuan Zhou, Ying Hu, Xi Weng, Junlong Jia, Jie Luo, Xien Liu, Ji Wu, Lei Huang

    Abstract: We present the TinyLLaVA framework that provides a unified perspective in designing and analyzing the small-scale Large Multimodal Models (LMMs). We empirically study the effects of different vision encoders, connection modules, language models, training data and training recipes. Our extensive experiments showed that better quality of data combined with better training recipes, smaller LMMs can c…

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: Our model weights and codes will be made public at https://github.com/DLCV-BUAA/TinyLLaVABench

  11. arXiv:2401.10752  [pdf, other]

    cs.CV

    HiCD: Change Detection in Quality-Varied Images via Hierarchical Correlation Distillation

    Authors: Chao Pang, Xingxing Weng, Jiang Wu, Qiang Wang, Gui-Song Xia

    Abstract: Advanced change detection techniques primarily target image pairs of equal and high quality. However, variations in imaging conditions and platforms frequently lead to image pairs with distinct qualities: one image being high-quality, while the other being low-quality. These disparities in image quality present significant challenges for understanding image pairs semantically and extracting change…

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: accepted by TGRS

  12. arXiv:2401.10685  [pdf, other]

    cs.LG cs.AI eess.SP

    Towards End-to-End GPS Localization with Neural Pseudorange Correction

    Authors: Xu Weng, KV Ling, Haochen Liu, Kun Cao

    Abstract: Pseudorange errors are the root cause of localization inaccuracy in GPS. Previous data-driven methods regress and eliminate pseudorange errors using handcrafted intermediate labels. Unlike them, we propose an end-to-end GPS localization framework, E2E-PrNet, to train a neural network for pseudorange correction (PrNet) directly using the final task loss calculated with the ground truth of GPS recei…

    Submitted 19 January, 2024; originally announced January 2024.

  13. arXiv:2401.09107  [pdf, other]

    hep-ph

    Properties of $Q^{5}q$ dibaryons

    Authors: Xin-Zhen Weng

    Abstract: We investigate heavy flavor dibaryons with five heavy quarks $Q$ ($Q=\{c,b\}$) and one light quark $q$ ($q=\{u,d,s\}$), namely the $Q^{5}q$ dibaryons. In the framework of an extended chromomagnetic model, we systematically study the mass spectrum of these dibaryons. We find no stable state below the corresponding baryon-baryon thresholds. In addition to the analysis of the masses, we also study th…

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: 25 pages, 5 figures. arXiv admin note: text overlap with arXiv:2207.05505

  14. Visual Tomography: Physically Faithful Volumetric Models of Partially Translucent Objects

    Authors: David Nakath, Xiangyu Weng, Mengkun She, Kevin Köser

    Abstract: When created faithfully from real-world data, digital 3D representations of objects can be useful for human or computer-assisted analysis. Such models can also serve for generating training data for machine learning approaches in settings where data is difficult to obtain or where too little training data exists, e.g. by providing novel views or images in varying conditions. While the vast amount of…

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: Accepted for publication at 3DV '24

  15. arXiv:2311.04079  [pdf, other]

    cs.CV

    Augmenting Lane Perception and Topology Understanding with Standard Definition Navigation Maps

    Authors: Katie Z Luo, Xinshuo Weng, Yan Wang, Shuang Wu, Jie Li, Kilian Q Weinberger, Yue Wang, Marco Pavone

    Abstract: Autonomous driving has traditionally relied heavily on costly and labor-intensive High Definition (HD) maps, hindering scalability. In contrast, Standard Definition (SD) maps are more affordable and have worldwide coverage, offering a scalable alternative. In this work, we systematically explore the effect of SD maps for real-time lane-topology understanding. We propose a novel framework to integr…

    Submitted 7 November, 2023; originally announced November 2023.

  16. arXiv:2311.02077  [pdf, other]

    cs.CV

    EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision

    Authors: Jiawei Yang, Boris Ivanovic, Or Litany, Xinshuo Weng, Seung Wook Kim, Boyi Li, Tong Che, Danfei Xu, Sanja Fidler, Marco Pavone, Yue Wang

    Abstract: We present EmerNeRF, a simple yet powerful approach for learning spatial-temporal representations of dynamic driving scenes. Grounded in neural fields, EmerNeRF simultaneously captures scene geometry, appearance, motion, and semantics via self-bootstrapping. EmerNeRF hinges upon two core components: First, it stratifies scenes into static and dynamic fields. This decomposition emerges purely from…

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: See the project page for code, data, and request pre-trained models: https://emernerf.github.io

  17. arXiv:2310.16306  [pdf, other]

    hep-ph

    Generation of $γ$ photons with extremely large orbital angular momenta

    Authors: Ren-Tong Guo, Mamutjan Ababekri, Qian Zhao, Yousef I. Salamin, Liang-Liang Ji, Zhi-Gang Bu, Zhong-Feng Xu, Xiu-Feng Weng, Jian-Xing Li

    Abstract: Vortex $γ$ photons, which carry large intrinsic orbital angular momenta (OAM), have significant applications in nuclear, atomic, hadron, particle and astro-physics, but their production remains unclear. In this work, we investigate the generation of such photons from nonlinear Compton scattering of circularly polarized monochromatic lasers on vortex electrons. We develop a quantum radiation theory…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: 7 pages, 4 figures

  18. arXiv:2309.12204  [pdf, other]

    cs.LG eess.SP

    PrNet: A Neural Network for Correcting Pseudoranges to Improve Positioning with Android Raw GNSS Measurements

    Authors: Xu Weng, Keck Voon Ling, Haochen Liu

    Abstract: We present a neural network for mitigating biased errors in pseudoranges to improve localization performance with data collected from mobile phones. A satellite-wise Multilayer Perceptron (MLP) is designed to regress the pseudorange bias correction from six satellite, receiver, context-related features derived from Android raw Global Navigation Satellite System (GNSS) measurements. To train the ML…

    Submitted 22 December, 2023; v1 submitted 16 September, 2023; originally announced September 2023.

  19. arXiv:2309.08936  [pdf, other]

    eess.SP

    Localization with Noisy Android Raw GNSS Measurements

    Authors: Xu Weng, Keck Voon Ling

    Abstract: Android raw Global Navigation Satellite System (GNSS) measurements are expected to bring smartphones power to take on demanding localization tasks that are traditionally performed by specialized GNSS receivers. The hardware constraints, however, make Android raw GNSS measurements much noisier than geodetic-quality ones. This study elucidates the principles of localization using Android raw GNSS me…

    Submitted 28 September, 2023; v1 submitted 16 September, 2023; originally announced September 2023.

  20. arXiv:2307.07947  [pdf, other]

    cs.CV

    Language Conditioned Traffic Generation

    Authors: Shuhan Tan, Boris Ivanovic, Xinshuo Weng, Marco Pavone, Philipp Kraehenbuehl

    Abstract: Simulation forms the backbone of modern self-driving development. Simulators help develop, test, and improve driving systems without putting humans, vehicles, or their environment at risk. However, simulators face a major challenge: They rely on realistic, scalable, yet interesting content. While recent advances in rendering and scene reconstruction make great strides in creating static scene asse…

    Submitted 16 July, 2023; originally announced July 2023.

    Comments: Technical Report. Website available at https://ariostgx.github.io/lctgen

  21. arXiv:2306.00704  [pdf, other]

    cs.CV

    DAM-Net: Global Flood Detection from SAR Imagery Using Differential Attention Metric-Based Vision Transformers

    Authors: Tamer Saleh, Xingxing Weng, Shimaa Holail, Chen Hao, Gui-Song Xia

    Abstract: The detection of flooded areas using high-resolution synthetic aperture radar (SAR) imagery is a critical task with applications in crisis and disaster management, as well as environmental resource planning. However, the complex nature of SAR images presents a challenge that often leads to an overestimation of the flood extent. To address this issue, we propose a novel differential attention metri…

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: 16 pages, 11 figures

  22. arXiv:2305.16789  [pdf, other]

    cs.LG cs.CV eess.SP

    Modulate Your Spectrum in Self-Supervised Learning

    Authors: Xi Weng, Yunhao Ni, Tengwei Song, Jie Luo, Rao Muhammad Anwer, Salman Khan, Fahad Shahbaz Khan, Lei Huang

    Abstract: Whitening loss offers a theoretical guarantee against feature collapse in self-supervised learning (SSL) with joint embedding architectures. Typically, it involves a hard whitening approach, transforming the embedding and applying loss to the whitened output. In this work, we introduce Spectral Transformation (ST), a framework to modulate the spectrum of embedding and to seek for functions beyond…

    Submitted 21 January, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted at ICLR 2024. The code is available at https://github.com/winci-ai/intl

  23. PolarDB-IMCI: A Cloud-Native HTAP Database System at Alibaba

    Authors: Jianying Wang, Tongliang Li, Haoze Song, Xinjun Yang, Wenchao Zhou, Feifei Li, Baoyue Yan, Qianqian Wu, Yukun Liang, Chengjun Ying, Yujie Wang, Baokai Chen, Chang Cai, Yubin Ruan, Xiaoyi Weng, Shibin Chen, Liang Yin, Chengzhong Yang, Xin Cai, Hongyan Xing, Nanlong Yu, Xiaofei Chen, Dapeng Huang, Jianling Sun

    Abstract: Cloud-native databases have become the de-facto choice for mission-critical applications on the cloud due to the need for high availability, resource elasticity, and cost efficiency. Meanwhile, driven by the increasing connectivity between data generation and analysis, users prefer a single database to efficiently process both OLTP and OLAP workloads, which enhances data freshness and reduces the…

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: 14 pages, 16 figures, to be published in ACM SIGMOD 2023

  24. arXiv:2305.01870  [pdf, other]

    cs.RO

    Task-Aware Risk Estimation of Perception Failures for Autonomous Vehicles

    Authors: Pasquale Antonante, Sushant Veer, Karen Leung, Xinshuo Weng, Luca Carlone, Marco Pavone

    Abstract: Safety and performance are key enablers for autonomous driving: on the one hand we want our autonomous vehicles (AVs) to be safe, while at the same time their performance (e.g., comfort or progression) is key to adoption. To effectively walk the tight-rope between safety and performance, AVs need to be risk-averse, but not entirely risk-avoidant. To facilitate safe-yet-performant driving, in this…

    Submitted 2 May, 2023; originally announced May 2023.

  25. arXiv:2304.12172  [pdf, other]

    physics.optics

    Parameterized Learning and Distillation with Vortex-encoded Spectral Correlations

    Authors: Altai Perry, Xiaojing Weng, Erfan Nozari, Luat Vuong

    Abstract: Spectral computational methods leverage modal or nonlocal representations of data, and a physically realized approach to spectral computation pertains to encoded diffraction. Encoded diffraction offers a hybrid approach that pairs analog wave propagation with digital back-end electronics, however the intermediate sensor patterns are correlations rather than linear signal weights, which limits the…

    Submitted 6 October, 2023; v1 submitted 24 April, 2023; originally announced April 2023.

    Comments: Code is available at: https://github.com/altaiperry/Reconstruction23_Perry

  26. arXiv:2303.11701  [pdf, other]

    eess.IV cs.CV cs.LG

    A High-Frequency Focused Network for Lightweight Single Image Super-Resolution

    Authors: Xiaotian Weng, Yi Chen, Zhichao Zheng, Yanhui Gu, Junsheng Zhou, Yudong Zhang

    Abstract: Lightweight neural networks for single-image super-resolution (SISR) tasks have made substantial breakthroughs in recent years. Compared to low-frequency information, high-frequency detail is much more difficult to reconstruct. Most SISR models allocate equal computational resources for low-frequency and high-frequency information, which leads to redundant processing of simple low-frequency inform…

    Submitted 21 March, 2023; originally announced March 2023.

  27. arXiv:2301.11902  [pdf, other]

    cs.RO eess.SY

    Tree-structured Policy Planning with Learned Behavior Models

    Authors: Yuxiao Chen, Peter Karkus, Boris Ivanovic, Xinshuo Weng, Marco Pavone

    Abstract: Autonomous vehicles (AVs) need to reason about the multimodal behavior of neighboring agents while planning their own motion. Many existing trajectory planners seek a single trajectory that performs well under all plausible futures simultaneously, ignoring bi-directional interactions and thus leading to overly conservative plans. Policy planning, whereby the ego agent plans a policy that re…

    Submitted 26 February, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

  28. arXiv:2211.12338  [pdf, other]

    cond-mat.stat-mech eess.SP physics.data-an

    Singular Value Decomposition and Entropy Dimension of Fractals

    Authors: Xiaojing Weng, Altai Perry, Michael Maroun, Luat T. Vuong

    Abstract: We analyze the singular value decomposition (SVD) and SVD entropy of Cantor fractals produced by the Kronecker product. Our primary results show that SVD entropy is a measure of image "complexity dimension" that is invariant under the number of Kronecker-product self-iterations (i.e., fractal order). SVD entropy is therefore similar to the fractal Hausdorff complexity dimension but suitable for c…

    Submitted 15 November, 2022; originally announced November 2022.

  29. arXiv:2211.10829  [pdf]

    cond-mat.mtrl-sci cond-mat.mes-hall physics.app-ph

    Depositing boron on Cu(111): Borophene or boride?

    Authors: Xiao-Ji Weng, Jie Bai, Jingyu Hou, Yi Zhu, Li Wang, Penghui Li, Anmin Nie, Bo Xu, Xiang-Feng Zhou, Yongjun Tian

    Abstract: Large-area single-crystal surface structures were successfully prepared on Cu(111) substrate with boron deposition, which is critical for prospective applications. However, the proposed borophene structures do not match the scanning tunneling microscopy (STM) results very well, while the proposed copper boride is at odds with the traditional knowledge that ordered copper-rich borides normally do n…

    Submitted 19 November, 2022; originally announced November 2022.

    Comments: 15 pages, 4 figures

  30. arXiv:2211.07362  [pdf, other]

    econ.TH

    Optimal Pricing Schemes in the Presence of Social Learning and Costly Reporting

    Authors: Kaiwei Zhang, Xi Weng, Xienan Cheng

    Abstract: A monopoly platform sells either a risky product (with unknown utility) or a safe product (with known utility) to agents who sequentially arrive and learn the utility of the risky product by the reporting of previous agents. It is costly for agents to report utility; hence the platform has to design both the prices and the reporting bonus to motivate the agents to explore and generate new informat…

    Submitted 9 December, 2023; v1 submitted 14 November, 2022; originally announced November 2022.

  31. arXiv:2210.03586  [pdf, other]

    cs.CV cs.LG

    An Investigation into Whitening Loss for Self-supervised Learning

    Authors: Xi Weng, Lei Huang, Lei Zhao, Rao Muhammad Anwer, Salman Khan, Fahad Shahbaz Khan

    Abstract: A desirable objective in self-supervised learning (SSL) is to avoid feature collapse. Whitening loss guarantees collapse avoidance by minimizing the distance between embeddings of positive pairs under the conditioning that the embeddings from different views are whitened. In this paper, we propose a framework with an informative indicator to analyze whitening loss, which provides a clue to demysti…

    Submitted 7 October, 2022; originally announced October 2022.

    Comments: Accepted at NeurIPS 2022. The Code is available at: https://github.com/winci-ai/CW-RGP

  32. arXiv:2209.15241  [pdf, other]

    cond-mat.supr-con physics.comp-ph

    Helium-bearing superconductor at high pressure

    Authors: Jingyu Hou, Xiao Dong, Artem R. Oganov, Xiao-Ji Weng, Chun-Mei Hao, Guochun Yang, Hui-Tian Wang, Xiang-Feng Zhou, Yongjun Tian

    Abstract: Helium (He) is the most inert noble gas at ambient conditions. It adopts a hexagonal close packed structure (P63/mmc) and remains in the insulating phase up to 32 TPa. In contrast, lithium (Li) is one of the most reactive metals at zero pressure, while its cubic high-pressure phase (Fd-3m) is a weak metallic electride above 475 GPa. Strikingly, a stable compound of Li5He2 (R-3m) was formed by mixi…

    Submitted 30 September, 2022; originally announced September 2022.

    Comments: 5 pages, 3 figures

  33. arXiv:2208.01929  [pdf]

    physics.optics

    Physical essence of propagable fractional-strength optical vortices in free space

    Authors: Xiaoyu Weng, Yu Miao, Yang Li, Xiangmei Dong, Xiumin Gao, Songlin Zhuang

    Abstract: Fractional-order vector vortex beams are recently demonstrated to be new carriers of fractional-strength optical vortices. However, why can those new vortex beams formed by the combination of both unstable states propagate stably in free space? Here, we solve this scientific problem by revealing the physical essence of propagable fractional-strength optical vortices in free space. Three new underst…

    Submitted 3 August, 2022; originally announced August 2022.

  34. arXiv:2208.00094  [pdf, other]

    cs.LG cs.AI cs.CR cs.CV

    Robust Trajectory Prediction against Adversarial Attacks

    Authors: Yulong Cao, Danfei Xu, Xinshuo Weng, Zhuoqing Mao, Anima Anandkumar, Chaowei Xiao, Marco Pavone

    Abstract: Trajectory prediction using deep neural networks (DNNs) is an essential component of autonomous driving (AD) systems. However, these methods are vulnerable to adversarial attacks, leading to serious consequences such as collisions. In this work, we identify two key ingredients to defend trajectory prediction models against adversarial attacks including (1) designing effective adversarial training…

    Submitted 29 July, 2022; originally announced August 2022.

  35. arXiv:2207.11243  [pdf, other]

    cs.CV cs.GR

    Multiface: A Dataset for Neural Face Rendering

    Authors: Cheng-hsin Wuu, Ningyuan Zheng, Scott Ardisson, Rohan Bali, Danielle Belko, Eric Brockmeyer, Lucas Evans, Timothy Godisart, Hyowon Ha, Xuhua Huang, Alexander Hypes, Taylor Koska, Steven Krenn, Stephen Lombardi, Xiaomin Luo, Kevyn McPhail, Laura Millerschoen, Michal Perdoch, Mark Pitts, Alexander Richard, Jason Saragih, Junko Saragih, Takaaki Shiratori, Tomas Simon, Matt Stewart, et al. (6 additional authors not shown)

    Abstract: Photorealistic avatars of human faces have come a long way in recent years, yet research in this area is limited by a lack of publicly available, high-quality datasets covering both dense multi-view camera captures and rich facial expressions of the captured subjects. In this work, we present Multiface, a new multi-view, high-resolution human face dataset collected from 13 identities at Reali…

    Submitted 26 June, 2023; v1 submitted 22 July, 2022; originally announced July 2022.

  36. arXiv:2207.05505  [pdf, other]

    hep-ph hep-ex hep-lat nucl-ex nucl-th

    Systematics of fully heavy dibaryons

    Authors: Xin-Zhen Weng, Shi-Lin Zhu

    Abstract: We systematically study the mass spectra of the fully heavy dibaryons in an extended chromomagnetic model, which includes both the colorelectric and chromomagnetic interactions. We find no stable state below the corresponding baryon-baryon thresholds. Besides the masses, we also estimate the relative width ratios of the two-body decay channels. We hope our study will be of help for future experime…

    Submitted 6 February, 2024; v1 submitted 12 July, 2022; originally announced July 2022.

    Comments: 19 pages, 1 figure

    Journal ref: Eur. Phys. J. C 84, 126 (2024)

  37. arXiv:2203.14360  [pdf, other]

    cs.CV

    Observation-Centric SORT: Rethinking SORT for Robust Multi-Object Tracking

    Authors: Jinkun Cao, Jiangmiao Pang, Xinshuo Weng, Rawal Khirodkar, Kris Kitani

    Abstract: Kalman filter (KF) based methods for multi-object tracking (MOT) make an assumption that objects move linearly. While this assumption is acceptable for very short periods of occlusion, linear estimates of motion for prolonged time can be highly inaccurate. Moreover, when there is no measurement available to update Kalman filter parameters, the standard convention is to trust the priori state estim…

    Submitted 15 March, 2023; v1 submitted 27 March, 2022; originally announced March 2022.

    Comments: Accepted by CVPR 2023. 8 pages + 10 pages of appendix. Renamed OOS as Observation-centric Re-Update (ORU)

  38. Deep Multi-Branch Aggregation Network for Real-Time Semantic Segmentation in Street Scenes

    Authors: Xi Weng, Yan Yan, Genshun Dong, Chang Shu, Biao Wang, Hanzi Wang, Ji Zhang

    Abstract: Real-time semantic segmentation, which aims to achieve high segmentation accuracy at real-time inference speed, has received substantial attention over the past few years. However, many state-of-the-art real-time semantic segmentation methods tend to sacrifice some spatial details or contextual information for fast inference, thus leading to degradation in segmentation quality. In this paper, we p…

    Submitted 8 March, 2022; originally announced March 2022.

  39. Stage-Aware Feature Alignment Network for Real-Time Semantic Segmentation of Street Scenes

    Authors: Xi Weng, Yan Yan, Si Chen, Jing-Hao Xue, Hanzi Wang

    Abstract: Over the past few years, deep convolutional neural network-based methods have made great progress in semantic segmentation of street scenes. Some recent methods align feature maps to alleviate the semantic gap between them and achieve high segmentation accuracy. However, they usually adopt the feature alignment modules with the same network configuration in the decoder and thus ignore the differen…

    Submitted 8 March, 2022; originally announced March 2022.

  40. arXiv:2201.00742  [pdf]

    physics.optics

    Property unification of inherent amplitude, phase and polarization within a light beam

    Authors: Xiaoyu Weng, Yu Miao, Guanxue Wang, Yihui Wang, Qiufang Zhan, Xiangmei Dong, Junle Qu, Xiumin Gao, Songlin Zhuang

    Abstract: Is it possible to modulate the inherent properties of a single light beam, namely amplitude, phase and polarization, simultaneously, by merely its phase? Here, we solve this scientific problem by unifying all these three properties of a single light beam using phase vectorization and phase version of Malus's law. Full-property spatial light modulator is therefore developed based on the unification…

    Submitted 3 January, 2022; originally announced January 2022.

  41. arXiv:2111.15334  [pdf, other]

    cond-mat.mtrl-sci physics.comp-ph

    Unusual phase transition of layer-stacked borophene under pressure

    Authors: Xiao-Ji Weng, QuanSheng Wu, Xi Shao, Oleg V. Yazyev, Xin-Ling He, Xiao Dong, Hui-Tian Wang, Xiang-Feng Zhou, Yongjun Tian

    Abstract: The 8-Pmmn borophene, a boron analogue of graphene, hosts tilted and anisotropic massless Dirac fermion quasiparticles owing to the presence of the distorted graphene-like sublattice. First-principles calculations show that the stacked 8-Pmmn borophene is transformed into the fused three-dimensional borophene under pressure, being accompanied by the partially bond-breaking and bond-reforming. Stri…

    Submitted 26 April, 2022; v1 submitted 30 November, 2021; originally announced November 2021.

    Comments: 6 pages, 4 figures

  42. arXiv:2111.02446  [pdf, ps, other]

    physics.optics

    Optical demultiplexing of fractal-structured beams in turbulent atmospheric environments

    Authors: Xiaojing Weng, Luat T. Vuong

    Abstract: When information is spatially repeated in self-similar fractal beam patterns, only a portion of the diffracted beam is needed to reconstruct the kernel data. What is unique to a fractal-encoding scheme is that the image demultiplexing process can be, to a first approximation, easily performed optically. In prior work, we experimentally and numerically study fractal-encoded optical beams and their…

    Submitted 22 January, 2024; v1 submitted 3 November, 2021; originally announced November 2021.

  43. arXiv:2110.09481  [pdf, other]

    cs.CV cs.MA cs.RO

    MTP: Multi-Hypothesis Tracking and Prediction for Reduced Error Propagation

    Authors: Xinshuo Weng, Boris Ivanovic, Marco Pavone

    Abstract: Recently, there has been tremendous progress in developing each individual module of the standard perception-planning robot autonomy pipeline, including detection, tracking, prediction of other agents' trajectories, and ego-agent trajectory planning. Nevertheless, there has been less attention given to the principled integration of these components, particularly in terms of the characterization an…

    Submitted 18 October, 2021; originally announced October 2021.

    Comments: Project page: https://www.xinshuoweng.com/projects/MTP

  44. Triply heavy tetraquark states

    Authors: Xin-Zhen Weng, Wei-Zhen Deng, Shi-Lin Zhu

    Abstract: In the framework of an extended chromomagnetic model, we systematically study the mass spectrum of the $S$-wave $qQ\bar{Q}\bar{Q}$ tetraquarks. Their mass spectra are mainly determined by the color interaction. For the $qc\bar{c}\bar{c}$, $qb\bar{c}\bar{c}$ and $qb\bar{b}\bar{b}$ tetraquarks, the color interaction favors the color-sextet $\ket{(qQ)^{6_{c}}(\bar{Q}\bar{Q})^{\bar{6}_{c}}}$ configura…

    Submitted 24 February, 2022; v1 submitted 11 September, 2021; originally announced September 2021.

    Comments: 25 pages, 6 figures. revised version accepted by PRD

    Journal ref: Phys. Rev. D 105, 034026 (2022)

  45. Doubly heavy tetraquarks in an extended chromomagnetic model

    Authors: Xin-Zhen Weng, Wei-Zhen Deng, Shi-Lin Zhu

    Abstract: Using an extended chromomagnetic model, we perform a systematic study of the masses of the doubly heavy tetraquarks. We find that the ground states of the doubly heavy tetraquarks are dominated by color-triplet $\ket{(qq)^{\bar{3}_{c}}(\bar{Q}\bar{Q})^{3_{c}}}$ configuration, which is opposite to that of the fully heavy tetraquarks. The combined results suggest that the color-triplet configuration…

    Submitted 5 October, 2021; v1 submitted 16 August, 2021; originally announced August 2021.

    Comments: 30 pages, 6 figures; to be published in CPC

    Journal ref: Chin. Phys. C 46, 013102 (2022)

  46. arXiv:2107.11470  [pdf, other]

    cs.CV

    Multi-Echo LiDAR for 3D Object Detection

    Authors: Yunze Man, Xinshuo Weng, Prasanna Kumar Sivakuma, Matthew O'Toole, Kris Kitani

    Abstract: LiDAR sensors can be used to obtain a wide range of measurement signals other than a simple 3D point cloud, and those signals can be leveraged to improve perception tasks like 3D object detection. A single laser pulse can be partially reflected by multiple objects along its path, resulting in multiple measurements called echoes. Multi-echo measurement can provide information about object contours…

    Submitted 23 July, 2021; originally announced July 2021.

  47. arXiv:2107.04013  [pdf, other]

    cs.CV cs.AI cs.LG cs.RO

    Multi-Modality Task Cascade for 3D Object Detection

    Authors: Jinhyung Park, Xinshuo Weng, Yunze Man, Kris Kitani

    Abstract: Point clouds and RGB images are naturally complementary modalities for 3D visual understanding - the former provides sparse but accurate locations of points on objects, while the latter contains dense color and texture information. Despite this potential for close sensor fusion, many methods train two models in isolation and use simple feature concatenation to represent 3D sensor data. This separa…

    Submitted 8 July, 2021; originally announced July 2021.

  48. arXiv:2105.11251  [pdf]

    physics.optics

    Light beam carrying natural non-integer orbital angular momentum in free space

    Authors: Xiaoyu Weng, Yu Miao, Guanxue Wang, Qiufang Zhan, Xiangmei Dong, Junle Qu, Xiumin Gao, Songlin Zhuang

    Abstract: Light beam with optical vortices can propagate in free space only with integer orbital angular momentum. Here, we invert this scientific consensus theoretically and experimentally by proposing light beams carrying natural non-integer orbital angular momentum. These peculiar light beams are actually special solutions of wave function, which possess optical vortices with the topological charge l+0.5…

    Submitted 24 May, 2021; originally announced May 2021.

  49. arXiv:2104.08568  [pdf, other]

    cs.CV

    Wide-Baseline Multi-Camera Calibration using Person Re-Identification

    Authors: Yan Xu, Yu-Jhe Li, Xinshuo Weng, Kris Kitani

    Abstract: We address the problem of estimating the 3D pose of a network of cameras for large-environment wide-baseline scenarios, e.g., cameras for construction sites, sports stadiums, and public spaces. This task is challenging since detecting and matching the same 3D keypoint observed from two very different camera views is difficult, making standard structure-from-motion (SfM) pipelines inapplicable. In…

    Submitted 17 April, 2021; originally announced April 2021.

  50. arXiv:2103.14023  [pdf, other]

    cs.AI cs.CV cs.LG cs.MA cs.RO

    AgentFormer: Agent-Aware Transformers for Socio-Temporal Multi-Agent Forecasting

    Authors: Ye Yuan, Xinshuo Weng, Yanglan Ou, Kris Kitani

    Abstract: Predicting accurate future trajectories of multiple agents is essential for autonomous systems, but is challenging due to the complex agent interaction and the uncertainty in each agent's future behavior. Forecasting multi-agent trajectories requires modeling two key dimensions: (1) time dimension, where we model the influence of past agent states over future states; (2) social dimension, where we…

    Submitted 7 October, 2021; v1 submitted 25 March, 2021; originally announced March 2021.

    Comments: ICCV 2021. Code: https://github.com/Khrylx/AgentFormer. Project page: https://www.ye-yuan.com/agentformer