Showing 1–50 of 137 results for author: Kuang, Y

  1. arXiv:2407.04689  [pdf, other]

    cs.RO cs.CV

    RAM: Retrieval-Based Affordance Transfer for Generalizable Zero-Shot Robotic Manipulation

    Authors: Yuxuan Kuang, Junjie Ye, Haoran Geng, Jiageng Mao, Congyue Deng, Leonidas Guibas, He Wang, Yue Wang

    Abstract: This work proposes a retrieve-and-transfer framework for zero-shot robotic manipulation, dubbed RAM, featuring generalizability across various objects, environments, and embodiments. Unlike existing approaches that learn manipulation from expensive in-domain demonstrations, RAM capitalizes on a retrieval-based affordance transfer paradigm to acquire versatile manipulation capabilities from abundan…

    Submitted 5 July, 2024; originally announced July 2024.

  2. arXiv:2406.09740  [pdf, other]

    cs.LG

    Deep Symbolic Optimization for Combinatorial Optimization: Accelerating Node Selection by Discovering Potential Heuristics

    Authors: Hongyu Liu, Haoyang Liu, Yufei Kuang, Jie Wang, Bin Li

    Abstract: Combinatorial optimization (CO) is one of the most fundamental mathematical models in real-world applications. Traditional CO solvers, such as Branch-and-Bound (B&B) solvers, heavily rely on expert-designed heuristics, which are reliable but require substantial manual tuning. Recent studies have leveraged deep learning (DL) models as an alternative to capture rich feature patterns for improved per…

    Submitted 10 July, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

  3. arXiv:2406.03420  [pdf, other]

    math.DS

    Dynamic properties of a class of van der Pol-Duffing oscillators

    Authors: Yelei Kuang, Xuemei Li

    Abstract: In this paper, we study the existence of bifurcation of a van der Pol-Duffing oscillator with quintic terms and its quasi-periodic solutions by means of qualitative and bifurcation theories. Firstly, we analyze the autonomous system and find that it has two kinds of local bifurcations and a global bifurcation: pitchfork bifurcation, Hopf bifurcation, homoclinic bifurcation. It is worth noting that…

    Submitted 5 June, 2024; originally announced June 2024.

  4. arXiv:2405.11982  [pdf, other]

    cs.LG cs.AI

    Robust Deep Reinforcement Learning with Adaptive Adversarial Perturbations in Action Space

    Authors: Qianmei Liu, Yufei Kuang, Jie Wang

    Abstract: Deep reinforcement learning (DRL) algorithms can suffer from modeling errors between the simulation and the real world. Many studies use adversarial learning to generate perturbation during training process to model the discrepancy and improve the robustness of DRL. However, most of these approaches use a fixed parameter to control the intensity of the adversarial perturbation, which can lead to a…

    Submitted 20 May, 2024; originally announced May 2024.

  5. arXiv:2404.12638  [pdf, other]

    cs.AI

    Learning to Cut via Hierarchical Sequence/Set Model for Efficient Mixed-Integer Programming

    Authors: Jie Wang, Zhihai Wang, Xijun Li, Yufei Kuang, Zhihao Shi, Fangzhou Zhu, Mingxuan Yuan, Jia Zeng, Yongdong Zhang, Feng Wu

    Abstract: Cutting planes (cuts) play an important role in solving mixed-integer linear programs (MILPs), which formulate many important real-world applications. Cut selection heavily depends on (P1) which cuts to prefer and (P2) how many cuts to select. Although modern MILP solvers tackle (P1)-(P2) by human-designed heuristics, machine learning carries the potential to learn more effective heuristics. Howev…

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2302.00244

  6. arXiv:2403.16860  [pdf, other]

    cs.CR

    CipherFormer: Efficient Transformer Private Inference with Low Round Complexity

    Authors: Weize Wang, Yi Kuang

    Abstract: There is a growing trend to outsource the inference task of large transformer models to cloud servers. However, this poses a severe threat to users' private data as they are exposed to cloud servers after uploading. Although several works attempted to provide private inference for transformer models, their hundreds of communication rounds limit the application scenarios. Motivated by the desire to…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Accepted by CSCWD 2024 (27th International Conference on Computer Supported Cooperative Work in Design)

  7. arXiv:2403.12993  [pdf]

    cs.LG

    Simple Full-Spectrum Correlated k-Distribution Model based on Multilayer Perceptron

    Authors: Xin Wang, Yucheng Kuang, Chaojun Wang, Hongyuan Di, Boshu He

    Abstract: While neural networks have been successfully applied to the full-spectrum k-distribution (FSCK) method at a large range of thermodynamics with k-values predicted by a trained multilayer perceptron (MLP) model, the required a-values still need to be calculated on-the-fly, which theoretically degrades the FSCK method and may lead to errors. On the other hand, too complicated structure of the current…

    Submitted 5 March, 2024; originally announced March 2024.

  8. arXiv:2403.03566  [pdf, ps, other]

    nucl-ex nucl-th

    Neutron radius determination of 133Cs and its impact on the interpretation of CEvNS-CsI measurement

    Authors: Y. Huang, S. Y. Xia, Y. F. Li, X. L. Tu, J. T. Zhang, C. J. Shao, K. Yue, P. Ma, Y. F. Niu, Z. P. Li, Y. Kuang, X. Q. Liu, J. F. Han, P. Egelhof, Yu. A. Litvinov, M. Wang, Y. H. Zhang, X. H. Zhou, Z. Y. Sun

    Abstract: Proton-$^{133}$Cs elastic scattering at low momentum transfer is performed using an in-ring reaction technique at the Cooler Storage Ring at the Heavy Ion Research Facility in Lanzhou. Recoil protons from the elastic collisions between the internal H$_2$-gas target and the circulating $^{133}$Cs ions at 199.4 MeV/u are detected by a silicon-strip detector. The matter radius of $^{133}$Cs is deduce…

    Submitted 8 April, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

  9. arXiv:2402.10670  [pdf, other]

    cs.CL cs.RO

    OpenFMNav: Towards Open-Set Zero-Shot Object Navigation via Vision-Language Foundation Models

    Authors: Yuxuan Kuang, Hai Lin, Meng Jiang

    Abstract: Object navigation (ObjectNav) requires an agent to navigate through unseen environments to find queried objects. Many previous methods attempted to solve this task by relying on supervised or reinforcement learning, where they are trained on limited household datasets with close-set objects. However, two key challenges are unsolved: understanding free-form natural language instructions that demand…

    Submitted 24 March, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: NAACL 2024 Findings

  10. arXiv:2402.07132  [pdf, other]

    cs.SE

    BAFLineDP: Code Bilinear Attention Fusion Framework for Line-Level Defect Prediction

    Authors: Shaojian Qiu, Huihao Huang, Jianxiang Luo, Yingjie Kuang, Haoyu Luo

    Abstract: Software defect prediction aims to identify defect-prone code, aiding developers in optimizing testing resource allocation. Most defect prediction approaches primarily focus on coarse-grained, file-level defect prediction, which fails to provide developers with the precision required to locate defective code. Recently, some researchers have proposed fine-grained, line-level defect prediction metho…

    Submitted 11 February, 2024; originally announced February 2024.

    Comments: Accepted by IEEE SANER 2024

  11. arXiv:2402.05946  [pdf, other]

    cs.LG cs.AI

    Unveiling Latent Causal Rules: A Temporal Point Process Approach for Abnormal Event Explanation

    Authors: Yiling Kuang, Chao Yang, Yang Yang, Shuang Li

    Abstract: In high-stakes systems such as healthcare, it is critical to understand the causal reasons behind unusual events, such as sudden changes in patient's health. Unveiling the causal reasons helps with quick diagnoses and precise treatment planning. In this paper, we propose an automated method for uncovering "if-then" logic rules to explain observational events. We introduce temporal point processes…

    Submitted 19 March, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

    Comments: Accepted by AISTATS 2024

  12. arXiv:2401.05960  [pdf, other]

    cs.AI

    Machine Learning Insides OptVerse AI Solver: Design Principles and Applications

    Authors: Xijun Li, Fangzhou Zhu, Hui-Ling Zhen, Weilin Luo, Meng Lu, Yimin Huang, Zhenan Fan, Zirui Zhou, Yufei Kuang, Zhihai Wang, Zijie Geng, Yang Li, Haoyang Liu, Zhiwu An, Muming Yang, Jianshu Li, Jie Wang, Junchi Yan, Defeng Sun, Tao Zhong, Yong Zhang, Jia Zeng, Mingxuan Yuan, Jianye Hao, Jun Yao, et al. (1 additional author not shown)

    Abstract: In an era of digital ubiquity, efficient resource management and decision-making are paramount across numerous industries. To this end, we present a comprehensive study on the integration of machine learning (ML) techniques into Huawei Cloud's OptVerse AI Solver, which aims to mitigate the scarcity of real-world mathematical programming instances, and to surpass the capabilities of traditional opt…

    Submitted 17 January, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

  13. arXiv:2401.05506  [pdf, ps, other]

    math.KT math.GR math.NT math.RA math.RT

    On the Coherency of Completed Group Algebra

    Authors: David Burns, Yu Kuang, Dingli Liang

    Abstract: We investigate coherency properties of certain completed integral group rings, precisely for compact $p$-adic Lie groups.

    Submitted 16 January, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: 16 pages. Submitted

    MSC Class: 16D10; 16E05; 20E18(primary); 16S34(secondary)

  14. arXiv:2312.17173  [pdf, other]

    stat.ML cs.LG

    Non-Vacuous Generalization Bounds for Large Language Models

    Authors: Sanae Lotfi, Marc Finzi, Yilun Kuang, Tim G. J. Rudner, Micah Goldblum, Andrew Gordon Wilson

    Abstract: Modern language models can contain billions of parameters, raising the question of whether they can generalize beyond the training data or simply regurgitate their training corpora. We provide the first non-vacuous generalization bounds for pretrained large language models (LLMs), indicating that language models are capable of discovering regularities that generalize to unseen data. In particular,…

    Submitted 12 February, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

  15. arXiv:2312.14409  [pdf, other]

    astro-ph.CO gr-qc

    Probing scalar induced gravitational waves with PTA and LISA: The Importance of third order correction

    Authors: Zhe Chang, Yu-Ting Kuang, Di Wu, Jing-Zhi Zhou

    Abstract: We revisit the calculation of third order SIGWs and extend it from a monochromatic primordial power spectrum to a more general log-normal one. We investigate the impact of third order SIGWs on SNR of LISA and PTA observations, and find that third order SIGWs significantly contribute to the total energy density spectrum of GWs in high-frequency region. For a primordial powe…

    Submitted 26 February, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

  16. arXiv:2312.02791  [pdf, ps, other]

    q-bio.NC

    Unsupervised learning on spontaneous retinal activity leads to efficient neural representation geometry

    Authors: Andrew Ligeralde, Yilun Kuang, Thomas Edward Yerxa, Miah N. Pitcher, Marla Feller, SueYeon Chung

    Abstract: Prior to the onset of vision, neurons in the developing mammalian retina spontaneously fire in correlated activity patterns known as retinal waves. Experimental evidence suggests that retinal waves strongly influence the emergence of sensory representations before visual experience. We aim to model this early stage of functional development by using movies of neurally active developing retinas as…

    Submitted 5 December, 2023; originally announced December 2023.

  17. arXiv:2311.11676  [pdf, other]

    nucl-th

    Extracting neutron skin from elastic proton-nucleus scattering with deep neural network

    Authors: G. H. Yang, Y. Kuang, Z. X. Yang, Z. P. Li

    Abstract: Based on the relativistic impulse approximation of proton-nucleus elastic scattering theory, the nucleon density distribution and neutron skin thickness of $^{48}$Ca are estimated via the deep learning method. The neural-network-generated densities are mainly compressed to be lower inside the nucleus compared with the results from the relativistic PC-PK1 density functional, resulting in a signific…

    Submitted 20 November, 2023; originally announced November 2023.

  18. arXiv:2311.05102  [pdf, other]

    astro-ph.CO gr-qc

    New constraints on primordial non-Gaussianity from missing two-loop contributions of scalar induced gravitational waves

    Authors: Zhe Chang, Yu-Ting Kuang, Di Wu, Jing-Zhi Zhou, Qing-Hua Zhu

    Abstract: We analyze the energy density spectrum of SIGWs using the NANOGrav 15-year data set, thereby constraining the primordial non-Gaussian parameter $f_{\mathrm{NL}}$. For the first time, we calculate the seventeen missing two-loop diagrams proportional to $f_{\mathrm{NL}}A_\zeta^3$ that correspond to the two-point correlation function $\langle h^{\lambda,(3)}_{\mathbf{k}} h^{\lambda',(2)}_{\mathbf{k}'} \rangle$…

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 8 pages, 4 figures

  19. arXiv:2310.15888  [pdf, other]

    cs.LG

    State Sequences Prediction via Fourier Transform for Representation Learning

    Authors: Mingxuan Ye, Yufei Kuang, Jie Wang, Rui Yang, Wengang Zhou, Houqiang Li, Feng Wu

    Abstract: While deep reinforcement learning (RL) has been demonstrated effective in solving complex control tasks, sample efficiency remains a key challenge due to the large amounts of data required for remarkable performance. Existing research explores the application of representation learning for data-efficient RL, e.g., learning predictive representations by predicting long-term future states. However,…

    Submitted 24 October, 2023; originally announced October 2023.

  20. arXiv:2310.15651  [pdf, other]

    physics.comp-ph

    Towards chemical accuracy using a multi-mesh adaptive finite element method in all-electron density functional theory

    Authors: Yang Kuang, Yedan Shen, Guanghui Hu

    Abstract: Chemical accuracy serves as an important metric for assessing the effectiveness of the numerical method in Kohn--Sham density functional theory. It is found that to achieve chemical accuracy, not only the Kohn--Sham wavefunctions but also the Hartree potential, should be approximated accurately. Under the adaptive finite element framework, this can be implemented by constructing the \emph{a poster…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: 19 pages, 17 figures

  21. arXiv:2310.14161  [pdf, other]

    cs.LG

    Promoting Generalization for Exact Solvers via Adversarial Instance Augmentation

    Authors: Haoyang Liu, Yufei Kuang, Jie Wang, Xijun Li, Yongdong Zhang, Feng Wu

    Abstract: Machine learning has been successfully applied to improve the efficiency of Mixed-Integer Linear Programming (MILP) solvers. However, the learning-based solvers often suffer from severe performance degradation on unseen MILP instances -- especially on large-scale instances from a perturbed environment -- due to the limited diversity of training distributions. To tackle this problem, we propose a n…

    Submitted 21 October, 2023; originally announced October 2023.

  22. arXiv:2310.11845  [pdf, other]

    cs.LG

    Accelerate Presolve in Large-Scale Linear Programming via Reinforcement Learning

    Authors: Yufei Kuang, Xijun Li, Jie Wang, Fangzhou Zhu, Meng Lu, Zhihai Wang, Jia Zeng, Houqiang Li, Yongdong Zhang, Feng Wu

    Abstract: Large-scale LP problems from industry usually contain much redundancy that severely hurts the efficiency and reliability of solving LPs, making presolve (i.e., the problem simplification module) one of the most critical components in modern LP solvers. However, how to design high-quality presolve routines -- that is, the program determining (P1) which presolvers to select, (P2) in what order to ex…

    Submitted 18 October, 2023; originally announced October 2023.

  23. arXiv:2310.05717  [pdf, other]

    cs.RO cs.AI cs.CV

    STOPNet: Multiview-based 6-DoF Suction Detection for Transparent Objects on Production Lines

    Authors: Yuxuan Kuang, Qin Han, Danshi Li, Qiyu Dai, Lian Ding, Dong Sun, Hanlin Zhao, He Wang

    Abstract: In this work, we present STOPNet, a framework for 6-DoF object suction detection on production lines, with a focus on but not limited to transparent objects, which is an important and challenging problem in robotic systems and modern industry. Current methods requiring depth input fail on transparent objects due to depth cameras' deficiency in sensing their geometry, while we proposed a novel fram…

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: Under Review. ICRA 2024 submission

  24. arXiv:2309.06676  [pdf, other]

    astro-ph.CO gr-qc

    Scalar Induced Gravitational Waves from Finslerian Inflation and Pulsar Timing Arrays Observations

    Authors: Zhe Chang, Yu-Ting Kuang, Di Wu, Jing-Zhi Zhou

    Abstract: The recent data from NANOGrav provide strong evidence of the existence of the SGWB. We investigate SIGWs from Finslerian inflation as a potential source of stochastic gravitational wave background. Small-scale ($\lesssim$1 Mpc) statistically anisotropic primordial scalar perturbations can be generated in Finslerian inflation. The second order SIGWs from Finslerian inflation are als…

    Submitted 12 September, 2023; originally announced September 2023.

  25. Robotic Table Tennis: A Case Study into a High Speed Learning System

    Authors: David B. D'Ambrosio, Jonathan Abelian, Saminda Abeyruwan, Michael Ahn, Alex Bewley, Justin Boyd, Krzysztof Choromanski, Omar Cortes, Erwin Coumans, Tianli Ding, Wenbo Gao, Laura Graesser, Atil Iscen, Navdeep Jaitly, Deepali Jain, Juhana Kangaspunta, Satoshi Kataoka, Gus Kouretas, Yuheng Kuang, Nevena Lazic, Corey Lynch, Reza Mahjourian, Sherry Q. Moore, Thinh Nguyen, Ken Oslund, et al. (10 additional authors not shown)

    Abstract: We present a deep-dive into a real-world robotic learning system that, in previous work, was shown to be capable of hundreds of table tennis rallies with a human and has the ability to precisely return the ball to desired targets. This system puts together a highly optimized perception subsystem, a high-speed low-latency robot controller, a simulation paradigm that can prevent damage in the real w…

    Submitted 6 September, 2023; originally announced September 2023.

    Comments: Published and presented at Robotics: Science and Systems (RSS2023)

  26. arXiv:2307.15818  [pdf, other]

    cs.RO cs.CL cs.CV cs.LG

    RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control

    Authors: Anthony Brohan, Noah Brown, Justice Carbajal, Yevgen Chebotar, Xi Chen, Krzysztof Choromanski, Tianli Ding, Danny Driess, Avinava Dubey, Chelsea Finn, Pete Florence, Chuyuan Fu, Montse Gonzalez Arenas, Keerthana Gopalakrishnan, Kehang Han, Karol Hausman, Alexander Herzog, Jasmine Hsu, Brian Ichter, Alex Irpan, Nikhil Joshi, Ryan Julian, Dmitry Kalashnikov, Yuheng Kuang, Isabel Leal, et al. (29 additional authors not shown)

    Abstract: We study how vision-language models trained on Internet-scale data can be incorporated directly into end-to-end robotic control to boost generalization and enable emergent semantic reasoning. Our goal is to enable a single end-to-end trained model to both learn to map robot observations to actions and enjoy the benefits of large-scale pretraining on language and vision-language data from the web.…

    Submitted 28 July, 2023; originally announced July 2023.

    Comments: Website: https://robotics-transformer.github.io/

  27. arXiv:2307.02067  [pdf, other]

    astro-ph.CO gr-qc

    Primordial black holes from second order density perturbations as probes of the small-scale primordial power spectrum

    Authors: Yu-Ting Kuang, Jing-Zhi Zhou, Zhe Chang, Xukun Zhang, Qing-Hua Zhu

    Abstract: We investigate the second order energy density perturbation $\delta^{(2)}$ induced by small-scale Gaussian and local-type non-Gaussian primordial curvature perturbations. The relative abundance of primordial black hole is calculated in terms of the probability density function of total energy density perturbation $\delta_r=\delta^{(1)}+\frac{1}{2}\delta^{(2)}$. The effects of second order density perturbation greatly…

    Submitted 7 July, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: 5 pages, 4 figures

  28. arXiv:2305.14654  [pdf, other]

    cs.RO cs.AI

    Barkour: Benchmarking Animal-level Agility with Quadruped Robots

    Authors: Ken Caluwaerts, Atil Iscen, J. Chase Kew, Wenhao Yu, Tingnan Zhang, Daniel Freeman, Kuang-Huei Lee, Lisa Lee, Stefano Saliceti, Vincent Zhuang, Nathan Batchelor, Steven Bohez, Federico Casarini, Jose Enrique Chen, Omar Cortes, Erwin Coumans, Adil Dostmohamed, Gabriel Dulac-Arnold, Alejandro Escontrela, Erik Frey, Roland Hafner, Deepali Jain, Bauyrjan Jyenis, Yuheng Kuang, Edward Lee, et al. (19 additional authors not shown)

    Abstract: Animals have evolved various agile locomotion strategies, such as sprinting, leaping, and jumping. There is a growing interest in developing legged robots that move like their biological counterparts and show various agile skills to navigate complex environments quickly. Despite the interest, the field lacks systematic benchmarks to measure the performance of control policies and hardware in agili…

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: 17 pages, 19 figures

  29. arXiv:2303.03307  [pdf, other]

    cs.CV q-bio.NC

    Learning Efficient Coding of Natural Images with Maximum Manifold Capacity Representations

    Authors: Thomas Yerxa, Yilun Kuang, Eero Simoncelli, SueYeon Chung

    Abstract: The efficient coding hypothesis proposes that the response properties of sensory systems are adapted to the statistics of their inputs such that they capture maximal information about the environment, subject to biological constraints. While elegant, information theoretic properties are notoriously difficult to measure in practical settings or to employ as objective functions in optimization. This…

    Submitted 3 December, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

    Comments: Accepted at NeurIPS 2023

  30. arXiv:2303.00183  [pdf, other]

    astro-ph.HE hep-ph

    Pulsars as candidates of LHAASO sources J2226+6057, J1908+0621 and J1825-1326: The leptonic origin

    Authors: Zhe Chang, Yu-Ting Kuang, Xukun Zhang, Jing-Zhi Zhou

    Abstract: Recently, from 12 $\gamma$-ray Galactic sources, the LHAASO has detected ultrahigh-energy photons up to 1.4 PeV. The $\gamma$-ray spectra of the sources J2226+6057, J1908+0621, J1825-1326 and the suggested origin pulsars near the sources have been published. In our previous work, we studied the hadronic $\gamma$-ray spectra of the sources J2226+6057, J1908+0621, J1825-1326 in terms of the Hertzian dipole model of…

    Submitted 26 April, 2023; v1 submitted 28 February, 2023; originally announced March 2023.

  31. arXiv:2302.00244  [pdf, other]

    cs.LG math.OC

    Learning Cut Selection for Mixed-Integer Linear Programming via Hierarchical Sequence Model

    Authors: Zhihai Wang, Xijun Li, Jie Wang, Yufei Kuang, Mingxuan Yuan, Jia Zeng, Yongdong Zhang, Feng Wu

    Abstract: Cutting planes (cuts) are important for solving mixed-integer linear programs (MILPs), which formulate a wide range of important real-world applications. Cut selection -- which aims to select a proper subset of the candidate cuts to improve the efficiency of solving MILPs -- heavily depends on (P1) which cuts should be preferred, and (P2) how many cuts should be selected. Although many modern MILP…

    Submitted 31 January, 2023; originally announced February 2023.

    Comments: Accepted to ICLR2023

  32. arXiv:2212.06817  [pdf, other]

    cs.RO cs.AI cs.CL cs.CV cs.LG

    RT-1: Robotics Transformer for Real-World Control at Scale

    Authors: Anthony Brohan, Noah Brown, Justice Carbajal, Yevgen Chebotar, Joseph Dabis, Chelsea Finn, Keerthana Gopalakrishnan, Karol Hausman, Alex Herzog, Jasmine Hsu, Julian Ibarz, Brian Ichter, Alex Irpan, Tomas Jackson, Sally Jesmonth, Nikhil J Joshi, Ryan Julian, Dmitry Kalashnikov, Yuheng Kuang, Isabel Leal, Kuang-Huei Lee, Sergey Levine, Yao Lu, Utsav Malla, Deeksha Manjunath, et al. (26 additional authors not shown)

    Abstract: By transferring knowledge from large, diverse, task-agnostic datasets, modern machine learning models can solve specific downstream tasks either zero-shot or with small task-specific datasets to a high level of performance. While this capability has been demonstrated in other fields such as computer vision, natural language processing or speech recognition, it remains to be shown in robotics, wher…

    Submitted 11 August, 2023; v1 submitted 13 December, 2022; originally announced December 2022.

    Comments: See website at robotics-transformer1.github.io

  33. arXiv:2211.11948  [pdf, other]

    astro-ph.CO gr-qc

    Primordial gravitational waves and curvature perturbations induced energy density perturbation

    Authors: Zhe Chang, Yu-Ting Kuang, Xukun Zhang, Jing-Zhi Zhou

    Abstract: We study the second order scalar and density perturbations generated by the Gaussian curvature perturbations and primordial gravitational waves in the radiation-dominated era. After presenting all the possible second-order source terms, we obtain the explicit expressions of the kernel functions and the power spectra of the second order scalar perturbations. It shows that the primordial gravitation…

    Submitted 28 January, 2024; v1 submitted 21 November, 2022; originally announced November 2022.

  34. arXiv:2210.10347  [pdf, other]

    math.NT

    On Galois-Gauss sums and the square root of the inverse different

    Authors: Y. Kuang

    Abstract: We discuss a possible generalisation of a conjecture of Bley, Burns and Hahn concerning the relation between the second Adams-operator twisted Galois-Gauss sums of weakly ramified Artin characters and the square root of the inverse different of finite, odd degree, Galois extensions of number fields, to the setting of all finite Galois extensions of number fields for which a square root of the inve…

    Submitted 19 October, 2022; originally announced October 2022.

  35. arXiv:2209.12404  [pdf, other]

    astro-ph.CO gr-qc

    Primordial black holes and third order scalar induced gravitational waves

    Authors: Zhe Chang, Yu-Ting Kuang, Xukun Zhang, Jing-Zhi Zhou

    Abstract: The process of PBH formation would be inevitably accompanied by SIGWs. This strong correlation between PBH and SIGW signals could be a promising approach to detecting PBHs in the upcoming GW experiments, such as LISA. We investigate the third order SIGWs during an RD era in the case of a monochromatic primordial power spectrum…

    Submitted 22 March, 2023; v1 submitted 26 September, 2022; originally announced September 2022.

  36. arXiv:2209.07563  [pdf, other]

    q-bio.PE math.DS physics.soc-ph

    The emergence of a virus variant: dynamics of a competition model with cross-immunity time-delay validated by wastewater surveillance data for COVID-19

    Authors: Bruce Pell, Samantha Brozak, Tin Phan, Fuqing Wu, Yang Kuang

    Abstract: We consider the dynamics of a virus spreading through a population that produces a mutant strain with the ability to infect individuals that were infected with the established strain. Temporary cross-immunity is included using a time delay, but is found to be a harmless delay. We provide some sufficient conditions that guarantee local and global asymptotic stability of the disease-free equilibrium…

    Submitted 15 February, 2023; v1 submitted 15 September, 2022; originally announced September 2022.

  37. arXiv:2204.11602  [pdf, other]

    cs.IR cs.LG

    Broad Recommender System: An Efficient Nonlinear Collaborative Filtering Approach

    Authors: Ling Huang, Can-Rong Guan, Zhen-Wei Huang, Yuefang Gao, Yingjie Kuang, Chang-Dong Wang, C. L. Philip Chen

    Abstract: Recently, Deep Neural Networks (DNNs) have been widely introduced into Collaborative Filtering (CF) to produce more accurate recommendation results due to their capability of capturing the complex nonlinear relationships between items and users. However, the DNNs-based models usually suffer from high computational complexity, i.e., consuming very long training time and storing huge amount of traina…

    Submitted 24 February, 2024; v1 submitted 19 April, 2022; originally announced April 2022.

  38. arXiv:2204.01691  [pdf, other]

    cs.RO cs.CL cs.LG

    Do As I Can, Not As I Say: Grounding Language in Robotic Affordances

    Authors: Michael Ahn, Anthony Brohan, Noah Brown, Yevgen Chebotar, Omar Cortes, Byron David, Chelsea Finn, Chuyuan Fu, Keerthana Gopalakrishnan, Karol Hausman, Alex Herzog, Daniel Ho, Jasmine Hsu, Julian Ibarz, Brian Ichter, Alex Irpan, Eric Jang, Rosario Jauregui Ruano, Kyle Jeffrey, Sally Jesmonth, Nikhil J Joshi, Ryan Julian, Dmitry Kalashnikov, Yuheng Kuang, Kuang-Huei Lee, et al. (20 additional authors not shown)

    Abstract: Large language models can encode a wealth of semantic knowledge about the world. Such knowledge could be extremely useful to robots aiming to act upon high-level, temporally extended instructions expressed in natural language. However, a significant weakness of language models is that they lack real-world experience, which makes it difficult to leverage them for decision making within a given embo…

    Submitted 16 August, 2022; v1 submitted 4 April, 2022; originally announced April 2022.

    Comments: See website at https://say-can.github.io/ V1. Initial Upload. V2. Added PaLM results. Added study about new capabilities (drawer manipulation, chain of thought prompting, multilingual instructions). Added an ablation study of language model size. Added an open-source version of \algname on a simulated tabletop environment. Improved readability

  39. arXiv:2203.14131  [pdf, other]

    math.NT

    On the Galois-Gauss sums of weakly ramified characters

    Authors: Y. Kuang

    Abstract: Bley, Burns and Hahn used relative algebraic $K$-theory methods to formulate a precise conjectural link between the (second Adams-operator twisted) Galois-Gauss sums of weakly ramified Artin characters and the square root of the inverse different of finite, odd degree, Galois extensions of number fields. We provide concrete new evidence for this conjecture in the setting of extensions of odd prime…

    Submitted 9 March, 2023; v1 submitted 26 March, 2022; originally announced March 2022.

    Comments: Corrected an error in the proof of Theorem 3.1

  40. arXiv:2112.10513  [pdf, other]

    cs.LG stat.ML

    Learning Robust Policy against Disturbance in Transition Dynamics via State-Conservative Policy Optimization

    Authors: Yufei Kuang, Miao Lu, Jie Wang, Qi Zhou, Bin Li, Houqiang Li

    Abstract: Deep reinforcement learning algorithms can perform poorly in real-world tasks due to the discrepancy between source and target environments. This discrepancy is commonly viewed as the disturbance in transition dynamics. Many existing algorithms learn robust policies by modeling the disturbance and applying it to source environments during training, which usually requires prior knowledge about the…

    Submitted 20 December, 2021; originally announced December 2021.

    Comments: Accepted to AAAI 2022

  41. arXiv:2111.02493  [pdf]

    eess.SP cs.AI cs.CV physics.ins-det

    Roadmap on Signal Processing for Next Generation Measurement Systems

    Authors: D. K. Iakovidis, M. Ooi, Y. C. Kuang, S. Demidenko, A. Shestakov, V. Sinitsin, M. Henry, A. Sciacchitano, A. Discetti, S. Donati, M. Norgia, A. Menychtas, I. Maglogiannis, S. C. Wriessnegger, L. A. Barradas Chacon, G. Dimas, D. Filos, A. H. Aletras, J. Töger, F. Dong, S. Ren, A. Uhl, J. Paziewski, J. Geng, F. Fioranelli, et al. (9 additional authors not shown)

    Abstract: Signal processing is a fundamental component of almost any sensor-enabled system, with a wide range of applications across different scientific disciplines. Time series data, images, and video sequences comprise representative forms of signals that can be enhanced and analysed for information extraction and quantification. The recent advances in artificial intelligence and machine learning are shi…

    Submitted 28 January, 2022; v1 submitted 3 November, 2021; originally announced November 2021.

    Comments: 48 pages, https://iopscience.iop.org/article/10.1088/1361-6501/ac2dbd

    Journal ref: Measurement Science and Technology 33(1) (2022) 1-48

  42. arXiv:2109.12762  [pdf, other]

    physics.comp-ph hep-lat

    Regularization of Complex Langevin Method

    Authors: Zhenning Cai, Yang Kuang, Hong Kiat Tan

    Abstract: The complex Langevin method, a numerical method used to compute the ensemble average with a complex partition function, often suffers from runaway instability. We study the regularization of the complex Langevin method via augmenting the action with a stabilization term. Since the regularization introduces biases to the numerical result, two approaches, named 2R and 3R methods, are introduced to r…

    Submitted 26 September, 2021; originally announced September 2021.

  43. arXiv:2109.04527  [pdf, other]

    cs.CV

    CrowdDriven: A New Challenging Dataset for Outdoor Visual Localization

    Authors: Ara Jafarzadeh, Manuel Lopez Antequera, Pau Gargallo, Yubin Kuang, Carl Toft, Fredrik Kahl, Torsten Sattler

    Abstract: Visual localization is the problem of estimating the position and orientation from which a given image (or a sequence of images) is taken in a known scene. It is an important part of a wide range of computer vision and robotics applications, from self-driving cars to augmented/virtual reality systems. Visual localization techniques should work reliably and robustly under a wide range of conditions…

    Submitted 9 September, 2021; originally announced September 2021.

  44. arXiv:2106.12428  [pdf, other]

    math.NA

    An entropic method for discrete systems with Gibbs entropy

    Authors: Zhenning Cai, Jingwei Hu, Yang Kuang, Bo Lin

    Abstract: We consider general systems of ordinary differential equations with monotonic Gibbs entropy, and introduce an entropic scheme that simply imposes an entropy fix after every time step of any existing time integrator. It is proved that in the general case, our entropy fix has only infinitesimal influence on the numerical order of the original scheme, and in many circumstances, it can be shown that t…

    Submitted 23 June, 2021; originally announced June 2021.

    MSC Class: 65L05

  45. arXiv:2012.08141  [pdf, other]

    cs.PL cs.AI cs.GR

    AsyncTaichi: On-the-fly Inter-kernel Optimizations for Imperative and Spatially Sparse Programming

    Authors: Yuanming Hu, Mingkuan Xu, Ye Kuang, Frédo Durand

    Abstract: Leveraging spatial sparsity has become a popular approach to accelerate 3D computer graphics applications. Spatially sparse data structures and efficient sparse kernels (such as parallel stencil operations on active voxels), are key to achieve high performance. Existing work focuses on improving performance within a single sparse computational kernel. We show that a system that looks beyond a sing…

    Submitted 22 June, 2021; v1 submitted 15 December, 2020; originally announced December 2020.

    Comments: 18 pages, 20 figures, submitted to ACM SIGGRAPH Asia

    ACM Class: D.3.2; I.3.6; I.2.5

  46. Second-order accurate BGK schemes for the special relativistic hydrodynamics with the Synge equation of state

    Authors: Yaping Chen, Yangyu Kuang, Huazhong Tang

    Abstract: This paper extends the second-order accurate BGK finite volume schemes for the ultra-relativistic flow simulations [5] to the 1D and 2D special relativistic hydrodynamics with the Synge equation of state. It is shown that such 2D schemes are very time-consuming due to the moment integrals (triple integrals) so that they are no longer practical. In view of this, the simplified BGK (sBGK) schemes ar…

    Submitted 2 November, 2020; originally announced November 2020.

    Comments: 47 pages, 17 figures, 2 tables

  47. arXiv:2007.14228  [pdf, other]

    physics.comp-ph math.OC

    An orthogonalization-free parallelizable framework for all-electron calculations in density functional theory

    Authors: Bin Gao, Guanghui Hu, Yang Kuang, Xin Liu

    Abstract: All-electron calculations play an important role in density functional theory, in which improving computational efficiency is one of the most needed and challenging tasks. In the model formulations, both nonlinear eigenvalue problem and total energy minimization problem pursue orthogonal solutions. Most existing algorithms for solving these two models invoke orthogonalization process either explic…

    Submitted 28 July, 2020; originally announced July 2020.

    Comments: 20 pages, 7 figures, 4 tables

  48. arXiv:2007.10198  [pdf, other]

    math.NA

    On the validity of complex Langevin method for path integral computations

    Authors: Zhenning Cai, Xiaoyu Dong, Yang Kuang

    Abstract: The complex Langevin (CL) method is a classical numerical strategy to alleviate the numerical sign problem in the computation of lattice field theories. Mathematically, it is a simple numerical tool to compute a wide class of high-dimensional and oscillatory integrals. However, it is often observed that the CL method converges but the limiting result is incorrect. The literature has several unclea…

    Submitted 5 November, 2020; v1 submitted 20 July, 2020; originally announced July 2020.

    Comments: 28 pages, 9 figures

  49. arXiv:2005.11343  [pdf, other]

    q-bio.PE q-bio.QM

    Mathematical analysis and potential therapeutic implications of a novel HIV-1 model of basal and activated transcription in T-cells and macrophages

    Authors: Tin Phan, Catherine DeMarino, Fatah Kashanchi, Yang Kuang, Daniel M. Anderson, Maria Emelianenko

    Abstract: HIV-1 affects tens of millions of people worldwide. Current treatments often involve a cocktail of antiretroviral drugs, which are effective in reducing the virus and extending life spans. However, there is currently no FDA-approved HIV-1 transcription inhibitor. Furthermore, there have only been a few attempts to model the transcription process in HIV-1. In this work, we extend a novel three-stat…

    Submitted 22 May, 2020; originally announced May 2020.

    MSC Class: 37C75; 92C42; 92C50

  50. arXiv:2004.03251  [pdf, ps, other]

    q-bio.PE q-bio.QM

    To mask or not to mask: Modeling the potential for face mask use by the general public to curtail the COVID-19 pandemic

    Authors: Steffen E. Eikenberry, Marina Mancuso, Enahoro Iboi, Tin Phan, Keenan Eikenberry, Yang Kuang, Eric Kostelich, Abba B. Gumel

    Abstract: Face mask use by the general public for limiting the spread of the COVID-19 pandemic is controversial, though increasingly recommended, and the potential of this intervention is not well understood. We develop a compartmental model for assessing the community-wide impact of mask use by the general, asymptomatic public, a portion of which may be asymptomatically infectious. Model simulations, using…

    Submitted 7 April, 2020; originally announced April 2020.

    Comments: 20 pages, 9 figures

    Journal ref: Infectious Disease Modelling. 5 (2020) 248-255