Skip to main content

Showing 1–27 of 27 results for author: Weng, B

  1. arXiv:2405.20013  [pdf, other

    cs.RO

    Repeatable and Reliable Efforts of Accelerated Risk Assessment

    Authors: Linda Capito, Guillermo A. Castillo, Bowen Weng

    Abstract: Risk assessment of a robot in controlled environments, such as laboratories and proving grounds, is a common means to assess, certify, validate, verify, and characterize the robots' safety performance before, during, and even after their commercialization in the real-world. A standard testing program that acquires the risk estimate is expected to be (i) repeatable, such that it obtains similar ris… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  2. arXiv:2404.09022  [pdf, other

    cs.LG cs.AI cs.CL

    Navigating the Landscape of Large Language Models: A Comprehensive Review and Analysis of Paradigms and Fine-Tuning Strategies

    Authors: Benjue Weng

    Abstract: With the surge of ChatGPT,the use of large models has significantly increased,rapidly rising to prominence across the industry and sweeping across the internet. This article is a comprehensive review of fine-tuning methods for large models. This paper investigates the latest technological advancements and the application of advanced methods in aspects such as task-adaptive fine-tuning,domain-adapt… ▽ More

    Submitted 13 April, 2024; originally announced April 2024.

  3. arXiv:2309.15740  [pdf, other

    cs.RO

    Data-Driven Latent Space Representation for Robust Bipedal Locomotion Learning

    Authors: Guillermo A. Castillo, Bowen Weng, Wei Zhang, Ayonga Hereid

    Abstract: This paper presents a novel framework for learning robust bipedal walking by combining a data-driven state representation with a Reinforcement Learning (RL) based locomotion policy. The framework utilizes an autoencoder to learn a low-dimensional latent space that captures the complex dynamics of bipedal locomotion from existing locomotion data. This reduced dimensional state representation is the… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: Supplemental video: https://youtu.be/SUIkrigsrao

  4. arXiv:2309.15442  [pdf, other

    cs.RO

    Template Model Inspired Task Space Learning for Robust Bipedal Locomotion

    Authors: Guillermo A. Castillo, Bowen Weng, Shunpeng Yang, Wei Zhang, Ayonga Hereid

    Abstract: This work presents a hierarchical framework for bipedal locomotion that combines a Reinforcement Learning (RL)-based high-level (HL) planner policy for the online generation of task space commands with a model-based low-level (LL) controller to track the desired task space trajectories. Different from traditional end-to-end learning approaches, our HL policy takes insights from the angular momentu… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: Accepted at 2023 International Conference on Intelligent Robots and Systems (IROS). Supplemental Video: https://youtu.be/YTjMgGka4Ig

  5. arXiv:2308.14636  [pdf, other

    cs.RO

    Towards Standardized Disturbance Rejection Testing of Legged Robot Locomotion with Linear Impactor: A Preliminary Study, Observations, and Implications

    Authors: Bowen Weng, Guillermo A. Castillo, Yun-Seok Kang, Ayonga Hereid

    Abstract: Dynamic locomotion in legged robots is close to industrial collaboration, but a lack of standardized testing obstructs commercialization. The issues are not merely political, theoretical, or algorithmic but also physical, indicating limited studies and comprehension regarding standard testing infrastructure and equipment. For decades, the approaches we have been testing legged robots were rarely s… ▽ More

    Submitted 29 January, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: A modified version of this preprint has been accepted at IEEE International Conference on Robotics and Automation (ICRA) 2024

  6. arXiv:2306.14657  [pdf, other

    cs.RO eess.SY

    A Diversity Analysis of Safety Metrics Comparing Vehicle Performance in the Lead-Vehicle Interaction Regime

    Authors: Harnarayan Singh, Bowen Weng, Sughosh J. Rao, Devin Elsasser

    Abstract: Vehicle performance metrics analyze data sets consisting of subject vehicle's interactions with other road users in a nominal driving environment and provide certain performance measures as outputs. To the best of the authors' knowledge, the vehicle safety performance metrics research dates back to at least 1967. To date, there still does not exist a community-wide accepted metric or a set of metr… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: A modified manuscript of this preprint has been accepted to be published as a regular paper at IEEE Transactions on Intelligent Transportation Systems

  7. On the Adversarial Scenario-based Safety Testing of Robots: the Comparability and Optimal Aggressiveness

    Authors: Bowen Weng, Guillermo A. Castillo, Wei Zhang, Ayonga Hereid

    Abstract: This paper studies the class of scenario-based safety testing algorithms in the black-box safety testing configuration. For algorithms sharing the same state-action set coverage with different sampling distributions, it is commonly believed that prioritizing the exploration of high-risk state-actions leads to a better sampling efficiency. Our proposal disputes the above intuition by introducing an… ▽ More

    Submitted 3 April, 2023; v1 submitted 20 September, 2022; originally announced September 2022.

    Journal ref: IEEE Transactions on Robotics, 2023

  8. On Safety Testing, Validation, and Characterization with Scenario-Sampling: A Case Study of Legged Robots

    Authors: Bowen Weng, Guillermo A. Castillo, Wei Zhang, Ayonga Hereid

    Abstract: The dynamic response of the legged robot locomotion is non-Lipschitz and can be stochastic due to environmental uncertainties. To test, validate, and characterize the safety performance of legged robots, existing solutions on observed and inferred risk can be incomplete and sampling inefficient. Some formal verification methods suffer from the model precision and other surrogate assumptions. In th… ▽ More

    Submitted 16 April, 2022; originally announced April 2022.

    Journal ref: 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  9. arXiv:2202.08935  [pdf, other

    cs.RO eess.SY

    A Formal Safety Characterization of Advanced Driver Assist Systems in the Car-Following Regime with Scenario-Sampling

    Authors: Bowen Weng, Minghao Zhu, Keith Redmill

    Abstract: The capability to follow a lead-vehicle and avoid rear-end collisions is one of the most important functionalities for human drivers and various Advanced Driver Assist Systems (ADAS). Existing safety performance justification of the car-following systems either relies on simple concrete scenarios with biased surrogate metrics or requires a significantly long driving distance for risk observation a… ▽ More

    Submitted 23 May, 2022; v1 submitted 17 February, 2022; originally announced February 2022.

  10. arXiv:2111.08823  [pdf, other

    cs.LG cs.AI physics.comp-ph

    Meta-Auto-Decoder for Solving Parametric Partial Differential Equations

    Authors: Xiang Huang, Zhanhong Ye, Hongsheng Liu, Beiji Shi, Zidong Wang, Kang Yang, Yang Li, Bingya Weng, Min Wang, Haotian Chu, Fan Yu, Bei Hua, Lei Chen, Bin Dong

    Abstract: Many important problems in science and engineering require solving the so-called parametric partial differential equations (PDEs), i.e., PDEs with different physical parameters, boundary conditions, shapes of computation domains, etc. Recently, building learning-based numerical solvers for parametric PDEs has become an emerging new field. One category of methods such as the Deep Galerkin Method (D… ▽ More

    Submitted 18 November, 2022; v1 submitted 14 November, 2021; originally announced November 2021.

  11. A Finite-Sampling, Operational Domain Specific, and Provably Unbiased Connected and Automated Vehicle Safety Metric

    Authors: Bowen Weng, Linda Capito, Umit Ozguner, Keith Redmill

    Abstract: A connected and automated vehicle safety metric determines the performance of a subject vehicle (SV) by analyzing the data involving the interactions among the SV and other dynamic road users and environmental features. When the data set contains only a finite set of samples collected from the naturalistic mixed-traffic driving environment, a metric is expected to generalize the safety assessment… ▽ More

    Submitted 2 February, 2022; v1 submitted 15 November, 2021; originally announced November 2021.

  12. arXiv:2111.01394  [pdf, other

    cs.LG cs.AI physics.comp-ph

    Solving Partial Differential Equations with Point Source Based on Physics-Informed Neural Networks

    Authors: Xiang Huang, Hongsheng Liu, Beiji Shi, Zidong Wang, Kang Yang, Yang Li, Bingya Weng, Min Wang, Haotian Chu, Jing Zhou, Fan Yu, Bei Hua, Lei Chen, Bin Dong

    Abstract: In recent years, deep learning technology has been used to solve partial differential equations (PDEs), among which the physics-informed neural networks (PINNs) emerges to be a promising method for solving both forward and inverse PDE problems. PDEs with a point source that is expressed as a Dirac delta function in the governing equations are mathematical models of many physical processes. However… ▽ More

    Submitted 2 November, 2021; originally announced November 2021.

  13. A Formal Characterization of Black-Box System Safety Performance with Scenario Sampling

    Authors: Bowen Weng, Linda Capito, Umit Ozguner, Keith Redmill

    Abstract: A typical scenario-based evaluation framework seeks to characterize a black-box system's safety performance (e.g., failure rate) through repeatedly sampling initialization configurations (scenario sampling) and executing a certain test policy for scenario propagation (scenario testing) with the black-box system involved as the test subject. In this letter, we first present a novel safety evaluatio… ▽ More

    Submitted 5 October, 2021; originally announced October 2021.

    Comments: A shorter version of this manuscript has been accepted to be published at IEEE Robotics and Automation Letters (RA-L)

    Journal ref: IEEE Robotics and Automation Letters, vol. 7, no. 1, pp. 199-206, Jan. 2022

  14. Towards Guaranteed Safety Assurance of Automated Driving Systems with Scenario Sampling: An Invariant Set Perspective (Extended Version)

    Authors: Bowen Weng, Linda Capito, Umit Ozguner, Keith Redmill

    Abstract: How many scenarios are sufficient to validate the safe Operational Design Domain (ODD) of an Automated Driving System (ADS) equipped vehicle? Is a more significant number of sampled scenarios guaranteeing a more accurate safety assessment of the ADS? Despite the various empirical success of ADS safety evaluation with scenario sampling in practice, some of the fundamental properties are largely unk… ▽ More

    Submitted 29 September, 2021; v1 submitted 19 April, 2021; originally announced April 2021.

    Comments: A shorter version of this manuscript has been accepted by the IEEE Transactions on Intelligent Vehicles

  15. arXiv:2103.15309  [pdf, other

    cs.RO

    Robust Feedback Motion Policy Design Using Reinforcement Learning on a 3D Digit Bipedal Robot

    Authors: Guillermo A. Castillo, Bowen Weng, Wei Zhang, Ayonga Hereid

    Abstract: In this paper, a hierarchical and robust framework for learning bipedal locomotion is presented and successfully implemented on the 3D biped robot Digit built by Agility Robotics. We propose a cascade-structure controller that combines the learning process with intuitive feedback regulations. This design allows the framework to realize robust and stable walking with a reduced-dimension state and a… ▽ More

    Submitted 28 March, 2021; originally announced March 2021.

    Comments: "Supplemental video: https://www.youtube.com/watch?v=j8KbW-a9dbw"

  16. arXiv:2101.08783  [pdf

    cs.CV

    A Person Re-identification Data Augmentation Method with Adversarial Defense Effect

    Authors: Yunpeng Gong, Zhiyong Zeng, Liwen Chen, Yifan Luo, Bin Weng, Feng Ye

    Abstract: The security of the Person Re-identification(ReID) model plays a decisive role in the application of ReID. However, deep neural networks have been shown to be vulnerable, and adding undetectable adversarial perturbations to clean images can trick deep neural networks that perform well in clean images. We propose a ReID multi-modal data augmentation method with adversarial defense effect: 1) Graysc… ▽ More

    Submitted 7 April, 2021; v1 submitted 21 January, 2021; originally announced January 2021.

    Comments: arXiv admin note: text overlap with arXiv:2101.08533

  17. arXiv:2010.01197  [pdf, other

    q-fin.ST cs.LG stat.ML

    Stock2Vec: A Hybrid Deep Learning Framework for Stock Market Prediction with Representation Learning and Temporal Convolutional Network

    Authors: Xing Wang, Yijun Wang, Bin Weng, Aleksandr Vinel

    Abstract: We have proposed to develop a global hybrid deep learning framework to predict the daily prices in the stock market. With representation learning, we derived an embedding called Stock2Vec, which gives us insight for the relationship among different stocks, while the temporal convolutional layers are used for automatically capturing effective temporal patterns both within and across series. Evaluat… ▽ More

    Submitted 29 September, 2020; originally announced October 2020.

  18. arXiv:2009.12222  [pdf, other

    cs.RO

    A Modeled Approach for Online Adversarial Test of Operational Vehicle Safety (extended version)

    Authors: Linda Capito, Bowen Weng, Umit Ozguner, Keith Redmill

    Abstract: The scenario-based testing of operational vehicle safety presents a set of principal other vehicle (POV) trajectories that seek to force the subject vehicle (SV) into a certain safety-critical situation. Current scenarios are mostly (i) statistics-driven: inspired by human driver crash data, (ii) deterministic: POV trajectories are pre-determined and are independent of SV responses, and (iii) over… ▽ More

    Submitted 20 May, 2021; v1 submitted 25 September, 2020; originally announced September 2020.

    Comments: This document is the extended version of our paper accepted to the 2021 IEEE American Control Conference

  19. arXiv:2008.00376  [pdf, other

    cs.RO

    Velocity Regulation of 3D Bipedal Walking Robots with Uncertain Dynamics Through Adaptive Neural Network Controller

    Authors: Guillermo A. Castillo, Bowen Weng, Terrence C. Stewart, Wei Zhang, Ayonga Hereid

    Abstract: This paper presents a neural-network based adaptive feedback control structure to regulate the velocity of 3D bipedal robots under dynamics uncertainties. Existing Hybrid Zero Dynamics (HZD)-based controllers regulate velocity through the implementation of heuristic regulators that do not consider model and environmental uncertainties, which may significantly affect the tracking performance of the… ▽ More

    Submitted 1 August, 2020; originally announced August 2020.

    Comments: "Accepted at 2020 International Conference on Intelligent Robots and Systems (IROS 2020). Supplemental Video: https://youtu.be/DAHk9-GFS0k"

  20. arXiv:2007.15418  [pdf, other

    cs.LG math.OC stat.ML

    Momentum Q-learning with Finite-Sample Convergence Guarantee

    Authors: Bowen Weng, Huaqing Xiong, Lin Zhao, Yingbin Liang, Wei Zhang

    Abstract: Existing studies indicate that momentum ideas in conventional optimization can be used to improve the performance of Q-learning algorithms. However, the finite-sample analysis for momentum-based Q-learning algorithms is only available for the tabular case without function approximations. This paper analyzes a class of momentum-based Q-learning algorithms with finite-sample guarantee. Specifically,… ▽ More

    Submitted 30 July, 2020; originally announced July 2020.

  21. Analysis of Q-learning with Adaptation and Momentum Restart for Gradient Descent

    Authors: Bowen Weng, Huaqing Xiong, Yingbin Liang, Wei Zhang

    Abstract: Existing convergence analyses of Q-learning mostly focus on the vanilla stochastic gradient descent (SGD) type of updates. Despite the Adaptive Moment Estimation (Adam) has been commonly used for practical Q-learning algorithms, there has not been any convergence guarantee provided for Q-learning with such type of updates. In this paper, we first characterize the convergence rate for Q-AMSGrad, wh… ▽ More

    Submitted 14 July, 2020; originally announced July 2020.

    Comments: This paper extends the work presented at the 2020 International Joint Conferences on Artificial Intelligence with supplementary materials

    Journal ref: Proceedings of the Twenty-Ninth International Joint Conference IJCAI20 (2020) 3051-3057

  22. arXiv:2005.09999  [pdf, other

    cs.RO

    Model Predictive Instantaneous Safety Metric for Evaluation of Automated Driving Systems

    Authors: Bowen Weng, Sughosh J. Rao, Eeshan Deosthale, Scott Schnelle, Frank Barickman

    Abstract: Vehicles with Automated Driving Systems (ADS) operate in a high-dimensional continuous system with multi-agent interactions. This continuous system features various types of traffic agents (non-homogeneous) governed by continuous-motion ordinary differential equations (differential-drive). Each agent makes decisions independently that may lead to conflicts with the subject vehicle (SV), as well as… ▽ More

    Submitted 20 May, 2020; originally announced May 2020.

    Comments: Accepted at IEEE Intelligent Vehicles Symposium (IV), 2020

  23. arXiv:1910.10887  [pdf, other

    cs.RO cs.MA eess.SY

    Reciprocal Collision Avoidance for General Nonlinear Agents using Reinforcement Learning

    Authors: Hao Li, Bowen Weng, Abhishek Gupta, Jia Pan, Wei Zhang

    Abstract: Finding feasible and collision-free paths for multiple nonlinear agents is challenging in the decentralized scenarios due to limited available information of other agents and complex dynamics constraints. In this paper, we propose a fast multi-agent collision avoidance algorithm for general nonlinear agents with continuous action space, where each agent observes only positions and velocities of ne… ▽ More

    Submitted 2 March, 2020; v1 submitted 23 October, 2019; originally announced October 2019.

  24. arXiv:1910.09670  [pdf, other

    math.OC cs.LG stat.ML

    History-Gradient Aided Batch Size Adaptation for Variance Reduced Algorithms

    Authors: Kaiyi Ji, Zhe Wang, Bowen Weng, Yi Zhou, Wei Zhang, Yingbin Liang

    Abstract: Variance-reduced algorithms, although achieve great theoretical performance, can run slowly in practice due to the periodic gradient estimation with a large batch of data. Batch-size adaptation thus arises as a promising approach to accelerate such algorithms. However, existing schemes either apply prescribed batch-size adaption rule or exploit the information along optimization path via additiona… ▽ More

    Submitted 26 July, 2020; v1 submitted 21 October, 2019; originally announced October 2019.

    Comments: 46 pages, 23 figures; Published in ICML 2020

  25. arXiv:1910.01748  [pdf, other

    cs.RO cs.LG cs.NE

    Hybrid Zero Dynamics Inspired Feedback Control Policy Design for 3D Bipedal Locomotion using Reinforcement Learning

    Authors: Guillermo A. Castillo, Bowen Weng, Wei Zhang, Ayonga Hereid

    Abstract: This paper presents a novel model-free reinforcement learning (RL) framework to design feedback control policies for 3D bipedal walking. Existing RL algorithms are often trained in an end-to-end manner or rely on prior knowledge of some reference joint trajectories. Different from these studies, we propose a novel policy structure that appropriately incorporates physical insights gained from the h… ▽ More

    Submitted 3 October, 2019; originally announced October 2019.

    Comments: Supplemental video: https://youtu.be/GOT6bnxqwuU

  26. arXiv:1905.02841   

    cs.LG math.OC stat.ML

    Accelerated Target Updates for Q-learning

    Authors: Bowen Weng, Huaqing Xiong, Wei Zhang

    Abstract: This paper studies accelerations in Q-learning algorithms. We propose an accelerated target update scheme by incorporating the historical iterates of Q functions. The idea is conceptually inspired by the momentum-based accelerated methods in the optimization theory. Conditions under which the proposed accelerated algorithms converge are established. The algorithms are validated using commonly adop… ▽ More

    Submitted 11 May, 2019; v1 submitted 7 May, 2019; originally announced May 2019.

    Comments: We need further adjustment of some parts of the papaer

  27. arXiv:1810.01977  [pdf, other

    cs.RO

    Reinforcement Learning Meets Hybrid Zero Dynamics: A Case Study for RABBIT

    Authors: Guillermo A. Castillo, Bowen Weng, Ayonga Hereid, Wei Zhang

    Abstract: The design of feedback controllers for bipedal robots is challenging due to the hybrid nature of its dynamics and the complexity imposed by high-dimensional bipedal models. In this paper, we present a novel approach for the design of feedback controllers using Reinforcement Learning (RL) and Hybrid Zero Dynamics (HZD). Existing RL approaches for bipedal walking are inefficient as they do not consi… ▽ More

    Submitted 3 October, 2018; originally announced October 2018.

    Comments: Supplemental video: https://www.youtube.com/watch?v=dhHMfnl7YlU