
Showing 1–50 of 392 results for author: Dai, B

  1. arXiv:2407.10448  [pdf, other]

    cs.LG stat.ML

    Spectral Representation for Causal Estimation with Hidden Confounders

    Authors: Tongzheng Ren, Haotian Sun, Antoine Moulin, Arthur Gretton, Bo Dai

    Abstract: We address the problem of causal effect estimation where hidden confounders are present, with a focus on two settings: instrumental variable regression with additional observed confounders, and proxy causal learning. Our approach uses a singular value decomposition of a conditional expectation operator, followed by a saddle-point optimization problem, which, in the context of IV regression, can be…

    Submitted 15 July, 2024; originally announced July 2024.

  2. arXiv:2407.09522  [pdf, other]

    cs.DB cs.AI cs.LG stat.ML

    UQE: A Query Engine for Unstructured Databases

    Authors: Hanjun Dai, Bethany Yixin Wang, Xingchen Wan, Bo Dai, Sherry Yang, Azade Nova, Pengcheng Yin, Phitchaya Mangpo Phothilimthana, Charles Sutton, Dale Schuurmans

    Abstract: Analytics on structured data is a mature field with many successful methods. However, most real world data exists in unstructured form, such as images and conversations. We investigate the potential of Large Language Models (LLMs) to enable unstructured data analytics. In particular, we propose a new Universal Query Engine (UQE) that directly interrogates and draws insights from unstructured data…

    Submitted 23 June, 2024; originally announced July 2024.

  3. arXiv:2407.04553  [pdf, other]

    astro-ph.HE hep-ph

    The Nature of the High-energy Gamma-Ray Radiation Associated with the High-redshift Blazar B3 1343+451

    Authors: Fan Wu, Wen Hu, Benzhong Dai

    Abstract: High-redshift blazars are the most powerful extragalactic astrophysical sources ever detected in the high-energy gamma-ray band. In this study, we present a temporal and spectral analysis of the high-redshift blazar B3 1343+451 based on 14 years of Fermi-LAT observations, spanning from 2008 August 4 to 2022 June 6 (MJD 54686-59733). We extract a seven-day binned $γ$-ray light curve in the energy r…

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 13 pages, 5 figures. Accepted for publication in ApJ

  4. arXiv:2406.17601  [pdf, other]

    cs.CV

    Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text

    Authors: Xinyang Li, Zhangyu Lai, Linning Xu, Yansong Qu, Liujuan Cao, Shengchuan Zhang, Bo Dai, Rongrong Ji

    Abstract: Recent advancements in 3D generation have leveraged synthetic datasets with ground truth 3D assets and predefined cameras. However, the potential of adopting real-world datasets, which can produce significantly more realistic 3D scenes, remains largely unexplored. In this work, we delve into the key challenge of the complex and scene-specific camera trajectories found in real-world captures. We in…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Code: https://github.com/imlixinyang/director3d

  5. arXiv:2406.16121  [pdf, other]

    cs.LG cs.AI

    Diffusion Spectral Representation for Reinforcement Learning

    Authors: Dmitry Shribak, Chen-Xiao Gao, Yitong Li, Chenjun Xiao, Bo Dai

    Abstract: Diffusion-based models have achieved notable empirical successes in reinforcement learning (RL) due to their expressiveness in modeling complex distributions. Despite existing methods being promising, the key challenge of extending existing methods for broader real-world applications lies in the computational cost at inference time, i.e., sampling from a diffusion model is considerably slow as it…

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: Under review

  6. arXiv:2406.08698  [pdf, other]

    astro-ph.HE hep-ph

    Constraints on Ultra Heavy Dark Matter Properties from Dwarf Spheroidal Galaxies with LHAASO Observations

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen, et al. (255 additional authors not shown)

    Abstract: In this work we try to search for signals generated by ultra-heavy dark matter at the Large High Altitude Air Shower Observatory (LHAASO) data. We look for possible gamma-ray by dark matter annihilation or decay from 16 dwarf spheroidal galaxies in the field of view of LHAASO. Dwarf spheroidal galaxies are among the most promising targets for indirect detection of dark matter which have low fluxes…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 17 pages, 12 figures, accepted by PRL

  7. arXiv:2406.02888  [pdf, other]

    cs.CL cs.AI cs.LG

    HYDRA: Model Factorization Framework for Black-Box LLM Personalization

    Authors: Yuchen Zhuang, Haotian Sun, Yue Yu, Rushi Qiang, Qifan Wang, Chao Zhang, Bo Dai

    Abstract: Personalization has emerged as a critical research area in modern intelligent systems, focusing on mining users' behavioral history and adapting to their preferences for delivering tailored experiences. Despite the remarkable few-shot capabilities exhibited by black-box large language models (LLMs), the inherent opacity of their model parameters presents significant challenges in aligning the gene…

    Submitted 10 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: 24 pages, 6 figures, work in progress

  8. arXiv:2406.02461  [pdf, other]

    cs.CV

    RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting

    Authors: Qi Wang, Ruijie Lu, Xudong Xu, Jingbo Wang, Michael Yu Wang, Bo Dai, Gang Zeng, Dan Xu

    Abstract: The advancement of diffusion models has pushed the boundary of text-to-3D object generation. While it is straightforward to composite objects into a scene with reasonable geometry, it is nontrivial to texture such a scene perfectly due to style inconsistency and occlusions between objects. To tackle these problems, we propose a coarse-to-fine 3D scene texturing framework, referred to as RoomTex, t…

    Submitted 4 June, 2024; originally announced June 2024.

  9. arXiv:2405.21043  [pdf, other]

    cs.LG cs.AI

    Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation

    Authors: Fengdi Che, Chenjun Xiao, Jincheng Mei, Bo Dai, Ramki Gummadi, Oscar A Ramirez, Christopher K Harris, A. Rupam Mahmood, Dale Schuurmans

    Abstract: We prove that the combination of a target network and over-parameterized linear function approximation establishes a weaker convergence condition for bootstrapped value estimation in certain cases, even with off-policy data. Our condition is naturally satisfied for expected updates over the entire state-action space or learning with a batch of complete trajectories from episodic Markov decision pr…

    Submitted 31 May, 2024; originally announced May 2024.

    Journal ref: Proceedings of the 41st International Conference on Machine Learning, 2024

  10. arXiv:2405.19320  [pdf, other]

    cs.LG cs.AI stat.ML

    Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF

    Authors: Shicong Cen, Jincheng Mei, Katayoon Goshvadi, Hanjun Dai, Tong Yang, Sherry Yang, Dale Schuurmans, Yuejie Chi, Bo Dai

    Abstract: Reinforcement learning from human feedback (RLHF) has demonstrated great promise in aligning large language models (LLMs) with human preference. Depending on the availability of preference data, both online and offline RLHF are active areas of investigation. A key bottleneck is understanding how to incorporate uncertainty estimation in the reward function learned from the preference data for RLHF,…

    Submitted 5 July, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

  11. The influence of the Sun and Moon on the observation of very high energy gamma-ray sources using EAS arrays

    Authors: Tao Wen, Songzhan Chen, BenZhong Dai

    Abstract: With the great advance of ground-based extensive air shower arrays, such as LHAASO and HAWC, many very high energy (VHE) gamma-ray sources have been discovered and are being monitored regardless of the day and the night. Hence, the Sun and Moon would have some impact on the observation of gamma-ray sources, which has not been taken into account in previous analysis. In this paper, the influence of the…

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 11 pages, 7 figures, accepted to RAA

  12. arXiv:2405.12245  [pdf, ps, other]

    cs.IT

    Low Complexity Successive Cancellation Decoding of Polar Codes based on Pruning Strategy in Deletion Error Channels

    Authors: He Sun, Rongke Liu, Bin Dai

    Abstract: A novel SC decoding method of polar codes is proposed in $d$-deletion channels, where a new pruning strategy is designed to reduce decoding complexity. Considering the difference of the scenario weight distributions, pruning thresholds for each node are designed separately according to a uniform constraint on the pruning error probability, which further reduces the number of scenarios that need to…

    Submitted 18 May, 2024; originally announced May 2024.

  13. arXiv:2405.11826  [pdf, other]

    astro-ph.IM hep-ex physics.ins-det

    Data quality control system and long-term performance monitor of the LHAASO-KM2A

    Authors: Zhen Cao, F. Aharonian, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, W. Bian, A. V. Bukevich, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, H. X. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. Chen, et al. (263 additional authors not shown)

    Abstract: The KM2A is the largest sub-array of the Large High Altitude Air Shower Observatory (LHAASO). It consists of 5216 electromagnetic particle detectors (EDs) and 1188 muon detectors (MDs). The data recorded by the EDs and MDs are used to reconstruct primary information of cosmic ray and gamma-ray showers. This information is used for physical analysis in gamma-ray astronomy and cosmic ray physics. To…

    Submitted 13 June, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: 15 pages, 9 figures

  14. arXiv:2405.10823  [pdf, other]

    math.AP

    The well-posedness and blow up phenomenon for a Tsunamis model with time-fractional derivative

    Authors: Bingbing Dai, Wei Luo, Zhaoyang Yin, Pei Zheng

    Abstract: This paper is concerned with the well-posedness of time-fractional shallow-water equations, which have received little attention. In the realm of fractional calculus, numerous types of fractional derivatives have been explored in the literature. Among these, one of the most notable and well-structured ones is the conformable fractional derivative. In this paper, we delve into the local well-posed…

    Submitted 17 May, 2024; originally announced May 2024.

  15. arXiv:2405.09874  [pdf, other]

    cs.CV

    Dual3D: Efficient and Consistent Text-to-3D Generation with Dual-mode Multi-view Latent Diffusion

    Authors: Xinyang Li, Zhangyu Lai, Linning Xu, Jianfei Guo, Liujuan Cao, Shengchuan Zhang, Bo Dai, Rongrong Ji

    Abstract: We present Dual3D, a novel text-to-3D generation framework that generates high-quality 3D assets from texts in only $1$ minute. The key component is a dual-mode multi-view latent diffusion model. Given the noisy multi-view latents, the 2D mode can efficiently denoise them with a single latent denoising network, while the 3D mode can generate a tri-plane neural surface for consistent rendering-based…

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: Project Page: https://dual3d.github.io

  16. arXiv:2405.07691  [pdf, other]

    astro-ph.HE

    Discovery of Very-high-energy Gamma-ray Emissions from the Low Luminosity AGN NGC 4278 by LHAASO

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen, et al. (255 additional authors not shown)

    Abstract: The first source catalog of Large High Altitude Air Shower Observatory reported the detection of a very-high-energy gamma ray source, 1LHAASO J1219+2915. In this paper a further detailed study of the spectral and temporal behavior of this point-like source has been carried out. The best-fit position of the TeV source ($\rm{RA}=185.05^{\circ}\pm0.04^{\circ}$, $\rm{Dec}=29.25^{\circ}\pm0.03^{\circ}$) i…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 11 pages, 5 figures

  17. arXiv:2405.06878  [pdf, ps, other]

    math.AP math.DS

    A nonlocal diffusion single population model in advective environment

    Authors: Yaobin Tang, Binxiang Dai

    Abstract: This paper is devoted to a nonlocal reaction-diffusion-advection model that describes the spatial dynamics of freshwater organisms in a river with a directional motion. Our goal is to investigate how the advection rate affects the dynamic behaviors of species. We first establish the well-posedness of global solutions, where the regularized problem containing a viscosity term and the re-established…

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: 39 pages; 8 figures

    MSC Class: 35K57; 35R35; 35B40; 92D25

  18. Discovering the Mass-Scaled Damping Timescale from Microquasars to Blazars

    Authors: Haoyang Zhang, Shenbang Yang, Benzhong Dai

    Abstract: Studying the variability of the accretion disks of black holes and jets is important to identify their internal physical processes. In this letter, we obtain the characteristic damping timescale of 34 blazars and seven microquasars from the Fermi-Large Area Telescope and the XMM-Newton X-ray telescope, respectively. We found that the mass-scaled characteristic timescales, ranging from the microqua…

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: 11 pages, 2 figures, 3 tables, Accepted for publication in ApJ Letters

  19. arXiv:2405.04686  [pdf]

    physics.app-ph physics.optics

    Ultrafast dynamics of wavelength-sensitive magnons in unconventional compensated semiconducting antiferromagnet

    Authors: Hanshen Huang, Tao Qu, Yang Cheng, Lixuan Tai, Christopher Eckberg, Quanjun Pan, Abdullah Alrasheed, Su Kong Chong, Bingqian Dai, Yaochen Li, Qingyuan Shu, Chao-Yao Yang, Jie-Xiang Yu, Gen Yin, Kang L. Wang

    Abstract: Antiferromagnet is a promising candidate for the next generation spintronic devices, benefiting from its ultrafast dynamics and spontaneous zero stray field. However, the understanding of their ultrafast spin behaviors is lacking due to the challenges of controlling/detecting the quenched net magnetization. Unconventional compensated semiconducting antiferromagnets present strong time-reversal sym…

    Submitted 7 May, 2024; originally announced May 2024.

  20. arXiv:2405.04390  [pdf, other]

    cs.CV

    DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving

    Authors: Chen Min, Dawei Zhao, Liang Xiao, Jian Zhao, Xinli Xu, Zheng Zhu, Lei Jin, Jianshu Li, Yulan Guo, Junliang Xing, Liping Jing, Yiming Nie, Bin Dai

    Abstract: Vision-centric autonomous driving has recently raised wide attention due to its lower cost. Pre-training is essential for extracting a universal representation. However, current vision-centric pre-training typically relies on either 2D or 3D pre-text tasks, overlooking the temporal characteristics of autonomous driving as a 4D scene understanding task. In this paper, we address this challenge by i…

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Accepted by CVPR 2024

  21. arXiv:2404.19759  [pdf, other]

    cs.CV

    MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model

    Authors: Wenxun Dai, Ling-Hao Chen, Jingbo Wang, Jinpeng Liu, Bo Dai, Yansong Tang

    Abstract: This work introduces MotionLCM, extending controllable motion generation to a real-time level. Existing methods for spatial control in text-conditioned motion generation suffer from significant runtime inefficiency. To address this issue, we first propose the motion latent consistency model (MotionLCM) for motion generation, building upon the latent diffusion model (MLD). By employing one-step (or…

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: MotionLCM project version 1.0

  22. arXiv:2404.19722  [pdf, other]

    cs.CV

    PACER+: On-Demand Pedestrian Animation Controller in Driving Scenarios

    Authors: Jingbo Wang, Zhengyi Luo, Ye Yuan, Yixuan Li, Bo Dai

    Abstract: We address the challenge of content diversity and controllability in pedestrian simulation for driving scenarios. Recent pedestrian animation frameworks have a significant limitation wherein they primarily focus on either following trajectory [46] or the content of the reference video [57], consequently overlooking the potential diversity of human motion within such scenarios. This limitation rest…

    Submitted 30 April, 2024; originally announced April 2024.

  23. arXiv:2404.16748  [pdf, other]

    cs.CV

    TELA: Text to Layer-wise 3D Clothed Human Generation

    Authors: Junting Dong, Qi Fang, Zehuan Huang, Xudong Xu, Jingbo Wang, Sida Peng, Bo Dai

    Abstract: This paper addresses the task of 3D clothed human generation from textual descriptions. Previous works usually encode the human body and clothes as a holistic model and generate the whole model in a single-stage optimization, which makes them struggle for clothing editing and meanwhile lose fine-grained control over the whole generation process. To solve this, we propose a layer-wise clothed huma…

    Submitted 25 April, 2024; originally announced April 2024.

  24. arXiv:2404.16666  [pdf, other]

    cs.CV

    PhyRecon: Physically Plausible Neural Scene Reconstruction

    Authors: Junfeng Ni, Yixin Chen, Bohan Jing, Nan Jiang, Bin Wang, Bo Dai, Puhao Li, Yixin Zhu, Song-Chun Zhu, Siyuan Huang

    Abstract: Neural implicit representations have gained popularity in multi-view 3D reconstruction. However, most previous work struggles to yield physically plausible results, limiting their utility in domains requiring rigorous physical accuracy, such as embodied AI and robotics. This lack of plausibility stems from the absence of physics modeling in existing methods and their inability to recover intricate…

    Submitted 2 June, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: project page: https://phyrecon.github.io/. arXiv admin note: text overlap with arXiv:2303.08605 by other authors

  25. arXiv:2404.13676  [pdf, other]

    math.NA

    Lowest-degree robust finite element schemes for inhomogeneous bi-Laplace problems

    Authors: Bin Dai, Huilan Zeng, Chensong Zhang, Shuo Zhang

    Abstract: In this paper, we study the numerical method for the bi-Laplace problems with inhomogeneous coefficients; particularly, we propose finite element schemes on rectangular grids respectively for an inhomogeneous fourth-order elliptic singular perturbation problem and for the Helmholtz transmission eigenvalue problem. The new methods use the reduced rectangle Morley (RRM for short) element space with…

    Submitted 21 April, 2024; originally announced April 2024.

  26. arXiv:2404.08089  [pdf, other]

    cs.LG math.OC

    Efficient Duple Perturbation Robustness in Low-rank MDPs

    Authors: Yang Hu, Haitong Ma, Bo Dai, Na Li

    Abstract: The pursuit of robustness has recently been a popular topic in reinforcement learning (RL) research, yet the existing methods generally suffer from efficiency issues that obstruct their real-world implementation. In this paper, we introduce duple perturbation robustness, i.e. perturbation on both the feature and factor vectors for low-rank Markov decision processes (MDPs), via a novel characteriza…

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: 25 pages, 8 figures, in submission to ICML'24

  27. arXiv:2404.05337  [pdf, other]

    cs.CL cs.AI

    Towards Objectively Benchmarking Social Intelligence for Language Agents at Action Level

    Authors: Chenxu Wang, Bin Dai, Huaping Liu, Baoyuan Wang

    Abstract: Prominent large language models have exhibited human-level performance in many domains, even enabling the derived agents to simulate human and social interactions. While practical works have substantiated the practicability of grounding language agents in sandbox simulation or embodied simulators, current social intelligence benchmarks either stay at the language level or use subjective metrics. I…

    Submitted 8 April, 2024; originally announced April 2024.

  28. arXiv:2404.05051  [pdf, other]

    cs.LG cs.RO

    Skill Transfer and Discovery for Sim-to-Real Learning: A Representation-Based Viewpoint

    Authors: Haitong Ma, Zhaolin Ren, Bo Dai, Na Li

    Abstract: We study sim-to-real skill transfer and discovery in the context of robotics control using representation learning. We draw inspiration from spectral decomposition of Markov decision processes. The spectral decomposition brings about representation that can linearly represent the state-action value function induced by any policies, thus can be regarded as skills. The skill representations are tran…

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: 9 pages, 6 figures. Project page: https://congharvard.github.io/steady-sim-to-real/

  29. arXiv:2404.04801  [pdf, ps, other]

    astro-ph.IM astro-ph.HE

    LHAASO-KM2A detector simulation using Geant4

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen, et al. (254 additional authors not shown)

    Abstract: KM2A is one of the main sub-arrays of LHAASO, working on gamma ray astronomy and cosmic ray physics at energies above 10 TeV. Detector simulation is the important foundation for estimating detector performance and data analysis. It is a big challenge to simulate the KM2A detector in the framework of Geant4 due to the need to track numerous photons from a large number of detector units (>6000) with…

    Submitted 7 April, 2024; originally announced April 2024.

  30. arXiv:2404.03590  [pdf, other]

    cs.CV cs.AI

    SemGrasp: Semantic Grasp Generation via Language Aligned Discretization

    Authors: Kailin Li, Jingbo Wang, Lixin Yang, Cewu Lu, Bo Dai

    Abstract: Generating natural human grasps necessitates consideration of not just object geometry but also semantic information. Solely depending on object shape for grasp generation confines the applications of prior methods in downstream tasks. This paper presents a novel semantic-based grasp generation method, termed SemGrasp, which generates a static human grasp pose by incorporating semantic information…

    Submitted 4 April, 2024; originally announced April 2024.

  31. arXiv:2404.03194  [pdf, other]

    cs.DB

    Reservoir Sampling over Joins

    Authors: Binyang Dai, Xiao Hu, Ke Yi

    Abstract: Sampling over joins is a fundamental task in large-scale data analytics. Instead of computing the full join results, which could be massive, a uniform sample of the join results would suffice for many purposes, such as answering analytical queries or training machine learning models. In this paper, we study the problem of how to maintain a random sample over joins while the tuples are streaming in…

    Submitted 9 April, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

  32. arXiv:2404.02101  [pdf, other]

    cs.CV

    CameraCtrl: Enabling Camera Control for Text-to-Video Generation

    Authors: Hao He, Yinghao Xu, Yuwei Guo, Gordon Wetzstein, Bo Dai, Hongsheng Li, Ceyuan Yang

    Abstract: Controllability plays a crucial role in video generation since it allows users to create desired content. However, existing models largely overlooked the precise control of camera pose that serves as a cinematic language to express deeper narrative nuances. To alleviate this issue, we introduce CameraCtrl, enabling accurate camera pose control for text-to-video (T2V) models. After precisely paramet…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Project page: https://hehao13.github.io/projects-CameraCtrl/ Code: https://github.com/hehao13/CameraCtrl

  33. arXiv:2403.18206  [pdf, other]

    cs.RO

    Sailing Through Point Clouds: Safe Navigation Using Point Cloud Based Control Barrier Functions

    Authors: Bolun Dai, Rooholla Khorrambakht, Prashanth Krishnamurthy, Farshad Khorrami

    Abstract: The capability to navigate safely in an unstructured environment is crucial when deploying robotic systems in real-world scenarios. Recently, control barrier function (CBF) based approaches have been highly effective in synthesizing safety-critical controllers. In this work, we propose a novel CBF-based local planner comprised of two components: Vessel and Mariner. The Vessel is a novel scaling fa…

    Submitted 16 July, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

  34. arXiv:2403.17898  [pdf, other]

    cs.CV

    Octree-GS: Towards Consistent Real-time Rendering with LOD-Structured 3D Gaussians

    Authors: Kerui Ren, Lihan Jiang, Tao Lu, Mulin Yu, Linning Xu, Zhangkai Ni, Bo Dai

    Abstract: The recent 3D Gaussian splatting (3D-GS) has shown remarkable rendering fidelity and efficiency compared to NeRF-based neural scene representations. While demonstrating the potential for real-time rendering, 3D-GS encounters rendering bottlenecks in large scenes with complex details due to an excessive number of Gaussian primitives located within the viewing frustum. This limitation is particularl…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Project page: https://city-super.github.io/octree-gs/

  35. arXiv:2403.16964  [pdf, other]

    cs.CV

    GSDF: 3DGS Meets SDF for Improved Rendering and Reconstruction

    Authors: Mulin Yu, Tao Lu, Linning Xu, Lihan Jiang, Yuanbo Xiangli, Bo Dai

    Abstract: Presenting a 3D scene from multiview images remains a core and long-standing challenge in computer vision and computer graphics. Two main requirements lie in rendering and reconstruction. Notably, SOTA rendering quality is usually achieved with neural volumetric rendering techniques, which rely on aggregated point/primitive-wise color and neglect the underlying scene geometry. Learning of neural i…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Project page: https://city-super.github.io/GSDF

  36. arXiv:2403.16897  [pdf, other]

    cs.CV

    Make-It-Vivid: Dressing Your Animatable Biped Cartoon Characters from Text

    Authors: Junshu Tang, Yanhong Zeng, Ke Fan, Xuheng Wang, Bo Dai, Kai Chen, Lizhuang Ma

    Abstract: Creating and animating 3D biped cartoon characters is crucial and valuable in various applications. Compared with geometry, the diverse texture design plays an important role in making 3D biped cartoon characters vivid and charming. Therefore, we focus on automatic texture design for cartoon characters based on input instructions. This is challenging for domain-specific requirements and a lack of…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Project page: https://make-it-vivid.github.io/

  37. arXiv:2403.12019  [pdf, other]

    cs.CV

    LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation

    Authors: Yushi Lan, Fangzhou Hong, Shuai Yang, Shangchen Zhou, Xuyi Meng, Bo Dai, Xingang Pan, Chen Change Loy

    Abstract: The field of neural rendering has witnessed significant progress with advancements in generative models and differentiable rendering techniques. Though 2D diffusion has achieved success, a unified 3D diffusion pipeline remains unsettled. This paper introduces a novel framework called LN3Diff to address this gap and enable fast, high-quality, and generic conditional 3D generation. Our approach harn…

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: project webpage: https://nirvanalan.github.io/projects/ln3diff/

  38. arXiv:2403.11990  [pdf, other]

    cs.CV

    GetMesh: A Controllable Model for High-quality Mesh Generation and Manipulation

    Authors: Zhaoyang Lyu, Ben Fei, Jinyi Wang, Xudong Xu, Ya Zhang, Weidong Yang, Bo Dai

    Abstract: Mesh is a fundamental representation of 3D assets in various industrial applications, and is widely supported by professional software. However, due to its irregular structure, mesh creation and manipulation is often time-consuming and labor-intensive. In this paper, we propose a highly controllable generative model, GetMesh, for mesh generation and manipulation across different categories. By ta…

    Submitted 18 March, 2024; originally announced March 2024.

  39. Measurements of All-Particle Energy Spectrum and Mean Logarithmic Mass of Cosmic Rays from 0.3 to 30 PeV with LHAASO-KM2A

    Authors: The LHAASO Collaboration, Zhen Cao, F. Aharonian, Q. An, A. Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, et al. (256 additional authors not shown)

    Abstract: We present the measurements of all-particle energy spectrum and mean logarithmic mass of cosmic rays in the energy range of 0.3-30 PeV using data collected from LHAASO-KM2A between September 2021 and December 2022, which is based on a nearly composition-independent energy reconstruction method, achieving unprecedented accuracy. Our analysis reveals the position of the knee at…

    Submitted 26 March, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

    Comments: 8 pages, 3 figures

    Journal ref: Physical Review Letters 132, 131002 (2024)

  40. arXiv:2403.09630  [pdf, other]

    cs.CV

    Generalized Predictive Model for Autonomous Driving

    Authors: Jiazhi Yang, Shenyuan Gao, Yihang Qiu, Li Chen, Tianyu Li, Bo Dai, Kashyap Chitta, Penghao Wu, Jia Zeng, Ping Luo, Jun Zhang, Andreas Geiger, Yu Qiao, Hongyang Li

    Abstract: In this paper, we introduce the first large-scale video prediction model in the autonomous driving discipline. To eliminate the restriction of high-cost data collection and empower the generalization ability of our model, we acquire massive data from the web and pair it with diverse and high-quality text descriptions. The resultant dataset accumulates over 2000 hours of driving videos, spanning ar…

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024

  41. arXiv:2403.03490  [pdf, other]

    astro-ph.CO physics.data-an

    A comparative study of cosmological constraints from weak lensing using Convolutional Neural Networks

    Authors: Divij Sharma, Biwei Dai, Uros Seljak

    Abstract: Weak Lensing (WL) surveys are reaching unprecedented depths, enabling the investigation of very small angular scales. At these scales, nonlinear gravitational effects lead to higher-order correlations making the matter distribution highly non-Gaussian. Extracting this information using traditional statistics has proven difficult, and Machine Learning based summary statistics have emerged as a powe…

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 16 pages, 2 figures. Comments Welcome

  42. arXiv:2402.17235  [pdf, other]

    cs.LG

    Stochastic Gradient Succeeds for Bandits

    Authors: Jincheng Mei, Zixin Zhong, Bo Dai, Alekh Agarwal, Csaba Szepesvari, Dale Schuurmans

    Abstract: We show that the \emph{stochastic gradient} bandit algorithm converges to a \emph{globally optimal} policy at an $O(1/t)$ rate, even with a \emph{constant} step size. Remarkably, global convergence of the stochastic gradient bandit algorithm has not been previously established, even though it is an old algorithm known to be applicable to bandits. The new result is achieved by establishing two nove…

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 39 pages; Correction for a previous version published at ICML 2023 conference

  43. arXiv:2402.08219  [pdf, other]

    cs.CL cs.AI cs.LG

    BBox-Adapter: Lightweight Adapting for Black-Box Large Language Models

    Authors: Haotian Sun, Yuchen Zhuang, Wei Wei, Chao Zhang, Bo Dai

    Abstract: Adapting state-of-the-art Large Language Models (LLMs) like GPT-4 and Gemini for specific tasks is challenging. Due to the opacity in their parameters, embeddings, and even output probabilities, existing fine-tuning adaptation methods are inapplicable. Consequently, adapting these black-box LLMs is only possible through their API services, raising concerns about transparency, privacy, and cost. To…

    Submitted 28 May, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: 25 pages, 10 figures

  44. arXiv:2402.02698  [pdf, other]

    cs.LG cs.AI math.OC

    Beyond Expectations: Learning with Stochastic Dominance Made Practical

    Authors: Shicong Cen, Jincheng Mei, Hanjun Dai, Dale Schuurmans, Yuejie Chi, Bo Dai

    Abstract: Stochastic dominance models risk-averse preferences for decision making with uncertain outcomes, which naturally captures the intrinsic structure of the underlying uncertainty, in contrast to simply resorting to the expectations. Despite being theoretically appealing, the application of stochastic dominance in machine learning has been scarce, due to the following challenges: $\textbf{i)}$, the original…

    Submitted 4 February, 2024; originally announced February 2024.

  45. arXiv:2401.15891  [pdf, other]

    astro-ph.CO

    A field-level emulator for modeling baryonic effects across hydrodynamic simulations

    Authors: Divij Sharma, Biwei Dai, Francisco Villaescusa-Navarro, Uros Seljak

    Abstract: We develop a new and simple method to model baryonic effects at the field level relevant for weak lensing analyses. We analyze thousands of state-of-the-art hydrodynamic simulations from the CAMELS project, each with different cosmology and strength of feedback, and we find that the cross-correlation coefficient between full hydrodynamic and N-body simulations is very close to 1 down to…

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: 12 pages, 9 figures. Comments welcome

  46. arXiv:2401.11775  [pdf, other]

    cs.CV

    Collaborative Position Reasoning Network for Referring Image Segmentation

    Authors: Jianjian Cao, Beiya Dai, Yulin Li, Xiameng Qin, Jingdong Wang

    Abstract: Given an image and a natural language expression as input, the goal of referring image segmentation is to segment the foreground masks of the entities referred by the expression. Existing methods mainly focus on interactive learning between vision and language to enhance the multi-modal representations for global context reasoning. However, predicting directly in pixel-level space can lead to coll…

    Submitted 22 January, 2024; originally announced January 2024.

  47. arXiv:2401.10155  [pdf, other]

    cs.LG

    A novel hybrid time-varying graph neural network for traffic flow forecasting

    Authors: Ben-Ao Dai, Bao-Lin Ye, Lingxi Li

    Abstract: Real-time and precise traffic flow prediction is vital for the efficiency of intelligent transportation systems. Traditional methods often employ graph neural networks (GNNs) with predefined graphs to describe spatial correlations among traffic nodes in urban road networks. However, these pre-defined graphs are limited by existing knowledge and graph generation methodologies, offering an incomplet…

    Submitted 17 June, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

    Comments: 16 pages, 7 figures

  48. arXiv:2401.06182  [pdf, other]

    q-bio.QM cs.CV cs.LG eess.IV

    Prediction of Cellular Identities from Trajectory and Cell Fate Information

    Authors: Baiyang Dai, Jiamin Yang, Hari Shroff, Patrick La Riviere

    Abstract: Determining cell identities in imaging sequences is an important yet challenging task. The conventional method for cell identification is via cell tracking, which is complex and can be time-consuming. In this study, we propose an innovative approach to cell identification during early $\textit{C. elegans}$ embryogenesis using machine learning. Cell identification during $\textit{C. elegans}$ embry…

    Submitted 2 March, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

  49. arXiv:2401.05353  [pdf, other]

    cs.CV cs.LG

    ImbaGCD: Imbalanced Generalized Category Discovery

    Authors: Ziyun Li, Ben Dai, Furkan Simsek, Christoph Meinel, Haojin Yang

    Abstract: Generalized class discovery (GCD) aims to infer known and unknown categories in an unlabeled dataset leveraging prior knowledge of a labeled set comprising known classes. Existing research implicitly/explicitly assumes that the frequency of occurrence for each category, whether known or unknown, is approximately the same in the unlabeled data. However, in nature, we are more likely to encounter kn…

    Submitted 4 December, 2023; originally announced January 2024.

    Comments: CVPR 2023 Computer Vision in the Wild Workshop Spotlight paper

  50. Robust Control of An Aerial Manipulator Based on A Variable Inertia Parameters Model

    Authors: Guangyu Zhang, Yuqing He, Bo Dai, Feng Gu, Jianda Han, Guangjun Liu

    Abstract: An aerial manipulator, which is composed of a UAV (Unmanned Aerial Vehicle) and a multi-link manipulator and can perform aerial manipulation, has shown great potential for applications. However, dynamic coupling between the UAV and the manipulator makes it difficult to control the aerial manipulator with high performance. In this paper, system modeling and control problem of the aerial manipulator ar…

    Submitted 8 January, 2024; originally announced January 2024.

    Journal ref: IEEE Trans. Ind. Electron. 67(2020)9515-9525