Skip to main content

Showing 1–50 of 228 results for author: Tan, L

  1. arXiv:2407.08874  [pdf

    cs.DB

    Implications of mappings between ICD clinical diagnosis codes and Human Phenotype Ontology terms

    Authors: Amelia LM Tan, Rafael S Gonçalves, William Yuan, Gabriel A Brat, The Consortium for Clinical Characterization of COVID-19 by EHR, Robert Gentleman, Isaac S Kohane

    Abstract: Objective: Integrating EHR data with other resources is essential in rare disease research due to low disease prevalence. Such integration is dependent on the alignment of ontologies used for data annotation. The International Classification of Diseases (ICD) is used to annotate clinical diagnoses; the Human Phenotype Ontology (HPO) to annotate phenotypes. Although these ontologies overlap in biom… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  2. arXiv:2407.08068  [pdf, other

    cs.FL

    More on Maximally Permissive Similarity Control of Discrete Event Systems

    Authors: Yu Wang, Zhaohui Zhu, Rob van Glabbeek, Jinjin Zhang, Lixing Tan

    Abstract: Takai proposed a method for constructing a maximally permissive supervisor for the similarity control problem (IEEE Transactions on Automatic Control, 66(7):3197-3204, 2021). This paper points out flaws in his results by providing a counterexample. Inspired by Takai's construction, the notion of a (saturated) (G, R)-automaton is introduced and metatheorems concerning (maximally permissive) supervi… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: 8 pages

  3. arXiv:2407.01402  [pdf, ps, other

    cs.CC cs.DS cs.LG

    Superconstant Inapproximability of Decision Tree Learning

    Authors: Caleb Koch, Carmen Strassle, Li-Yang Tan

    Abstract: We consider the task of properly PAC learning decision trees with queries. Recent work of Koch, Strassle, and Tan showed that the strictest version of this task, where the hypothesis tree $T$ is required to be optimally small, is NP-hard. Their work leaves open the question of whether the task remains intractable if $T$ is only required to be close to optimal, say within a factor of 2, rather than… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 29 pages, 5 figures, COLT 2024

  4. arXiv:2405.16340  [pdf, ps, other

    cs.CC

    A Strong Direct Sum Theorem for Distributional Query Complexity

    Authors: Guy Blanc, Caleb Koch, Carmen Strassle, Li-Yang Tan

    Abstract: Consider the expected query complexity of computing the $k$-fold direct product $f^{\otimes k}$ of a function $f$ to error $\varepsilon$ with respect to a distribution $μ^k$. One strategy is to sequentially compute each of the $k$ copies to error $\varepsilon/k$ with respect to $μ$ and apply the union bound. We prove a strong direct sum theorem showing that this naive strategy is essentially optim… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 34 pages, 4 figures, CCC 2024

  5. arXiv:2405.16027  [pdf, other

    cs.LG

    Feature Protection For Out-of-distribution Generalization

    Authors: Lu Tan, Huei Zhou, Yinxiang Huang, Zeming Zheng, Yujiu Yang

    Abstract: With the availability of large pre-trained models, a modern workflow for building real-world machine learning solutions is to fine-tune such models on a downstream task with a relatively small domain-specific dataset. In such applications, one major challenge is that the small fine-tuning dataset does not have sufficient coverage of the distribution encountered when the model is deployed. It is th… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2309.06256

  6. arXiv:2405.12213  [pdf, other

    cs.RO cs.LG

    Octo: An Open-Source Generalist Robot Policy

    Authors: Octo Model Team, Dibya Ghosh, Homer Walke, Karl Pertsch, Kevin Black, Oier Mees, Sudeep Dasari, Joey Hejna, Tobias Kreiman, Charles Xu, Jianlan Luo, You Liang Tan, Lawrence Yunliang Chen, Pannag Sanketi, Quan Vuong, Ted Xiao, Dorsa Sadigh, Chelsea Finn, Sergey Levine

    Abstract: Large policies pretrained on diverse robot datasets have the potential to transform robotic learning: instead of training new policies from scratch, such generalist robot policies may be finetuned with only a little in-domain data, yet generalize broadly. However, to be widely applicable across a range of robotic learning scenarios, environments, and tasks, such policies need to handle diverse sen… ▽ More

    Submitted 26 May, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: Project website: https://octo-models.github.io

  7. arXiv:2405.10492  [pdf

    cs.CL cs.LG

    Automatic News Generation and Fact-Checking System Based on Language Processing

    Authors: Xirui Peng, Qiming Xu, Zheng Feng, Haopeng Zhao, Lianghao Tan, Yan Zhou, Zecheng Zhang, Chenwei Gong, Yingqiao Zheng

    Abstract: This paper explores an automatic news generation and fact-checking system based on language processing, aimed at enhancing the efficiency and quality of news production while ensuring the authenticity and reliability of the news content. With the rapid development of Natural Language Processing (NLP) and deep learning technologies, automatic news generation systems are capable of extracting key in… ▽ More

    Submitted 20 May, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

    ACM Class: I.5; H.4

  8. arXiv:2404.15242  [pdf, other

    cs.LG math.NA

    A Hybrid Kernel-Free Boundary Integral Method with Operator Learning for Solving Parametric Partial Differential Equations In Complex Domains

    Authors: Shuo Ling, Liwei Tan, Wenjun Ying

    Abstract: The Kernel-Free Boundary Integral (KFBI) method presents an iterative solution to boundary integral equations arising from elliptic partial differential equations (PDEs). This method effectively addresses elliptic PDEs on irregular domains, including the modified Helmholtz, Stokes, and elasticity equations. The rapid evolution of neural networks and deep learning has invigorated the exploration of… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 30 pages,6 figures

  9. arXiv:2404.11889  [pdf, other

    eess.IV cs.CV

    Multi-view X-ray Image Synthesis with Multiple Domain Disentanglement from CT Scans

    Authors: Lixing Tan, Shuang Song, Kangneng Zhou, Chengbo Duan, Lanying Wang, Huayang Ren, Linlin Liu, Wei Zhang, Ruoxiu Xiao

    Abstract: X-ray images play a vital role in the intraoperative processes due to their high resolution and fast imaging speed and greatly promote the subsequent segmentation, registration and reconstruction. However, over-dosed X-rays superimpose potential risks to human health to some extent. Data-driven algorithms from volume scans to X-ray images are restricted by the scarcity of paired X-ray and volume d… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 13 pages, 10 figures

  10. arXiv:2402.17259  [pdf, other

    cs.SD eess.AS

    EDTC: enhance depth of text comprehension in automated audio captioning

    Authors: Liwen Tan, Yin Cao, Yi Zhou

    Abstract: Modality discrepancies have perpetually posed significant challenges within the realm of Automated Audio Captioning (AAC) and across all multi-modal domains. Facilitating models in comprehending text information plays a pivotal role in establishing a seamless connection between the two modalities of text and audio. While recent research has focused on closing the gap between these two modalities t… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  11. arXiv:2401.16013  [pdf, other

    cs.RO cs.AI

    SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning

    Authors: Jianlan Luo, Zheyuan Hu, Charles Xu, You Liang Tan, Jacob Berg, Archit Sharma, Stefan Schaal, Chelsea Finn, Abhishek Gupta, Sergey Levine

    Abstract: In recent years, significant progress has been made in the field of robotic reinforcement learning (RL), enabling methods that handle complex image observations, train in the real world, and incorporate auxiliary data, such as demonstrations and prior experience. However, despite these advances, robotic RL remains hard to use. It is acknowledged among practitioners that the particular implementati… ▽ More

    Submitted 12 February, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: ICRA 2024

  12. Mapping the Design Space of Teachable Social Media Feed Experiences

    Authors: K. J. Kevin Feng, Xander Koo, Lawrence Tan, Amy Bruckman, David W. McDonald, Amy X. Zhang

    Abstract: Social media feeds are deeply personal spaces that reflect individual values and preferences. However, top-down, platform-wide content algorithms can reduce users' sense of agency and fail to account for nuanced experiences and values. Drawing on the paradigm of interactive machine teaching (IMT), an interaction framework for non-expert algorithmic adaptation, we map out a design space for teachab… ▽ More

    Submitted 29 January, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: CHI 2024

  13. arXiv:2401.11946  [pdf, other

    cs.CR

    A Dynamic YOLO-Based Sequence-Matching Model for Efficient Coverless Image Steganography

    Authors: Jiajun Liu, Lina Tan, Zhili Zhou, Yi Li, Peng Chen

    Abstract: Many existing coverless steganography methods establish a mapping relationship between cover images and hidden data. There exists an issue that the number of images stored in the database grows exponentially as the steganographic capacity rises. The need for a high steganographic capacity makes it challenging to build an image database. To improve the image library utilization and anti-attack capa… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

  14. arXiv:2401.08553  [pdf, other

    cs.RO

    FMB: a Functional Manipulation Benchmark for Generalizable Robotic Learning

    Authors: Jianlan Luo, Charles Xu, Fangchen Liu, Liam Tan, Zipeng Lin, Jeffrey Wu, Pieter Abbeel, Sergey Levine

    Abstract: In this paper, we propose a real-world benchmark for studying robotic learning in the context of functional manipulation: a robot needs to accomplish complex long-horizon behaviors by composing individual manipulation skills in functionally relevant ways. The core design principles of our Functional Manipulation Benchmark (FMB) emphasize a harmonious balance between complexity and accessibility. T… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

  15. arXiv:2401.02173  [pdf, ps, other

    cs.CV cs.AI

    Prompt Decoupling for Text-to-Image Person Re-identification

    Authors: Weihao Li, Lei Tan, Pingyang Dai, Yan Zhang

    Abstract: Text-to-image person re-identification (TIReID) aims to retrieve the target person from an image gallery via a textual description query. Recently, pre-trained vision-language models like CLIP have attracted significant attention and have been widely utilized for this task due to their robust capacity for semantic concept learning and rich multi-modal knowledge. However, recent CLIP-based TIReID m… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

  16. arXiv:2401.00375  [pdf

    cs.RO

    Shape-programmable Adaptive Multi-material Microrobots for Biomedical Applications

    Authors: Liyuan Tan, Yang Yang, Li Fang, David J. Cappelleri

    Abstract: Flagellated microorganisms can swim at low Reynolds numbers and adapt to changes in their environment. Specifically, the flagella can switch their shapes or modes through gene expression. In the past decade, efforts have been made to fabricate and investigate rigid types of microrobots without any adaptation to the environments. More recently, obtaining adaptive microrobots mimicking real microorg… ▽ More

    Submitted 30 December, 2023; originally announced January 2024.

  17. arXiv:2312.15821  [pdf, other

    cs.SD cs.LG eess.AS

    Audiobox: Unified Audio Generation with Natural Language Prompts

    Authors: Apoorv Vyas, Bowen Shi, Matthew Le, Andros Tjandra, Yi-Chiao Wu, Baishan Guo, Jiemin Zhang, Xinyue Zhang, Robert Adkins, William Ngan, Jeff Wang, Ivan Cruz, Bapi Akula, Akinniyi Akinyemi, Brian Ellis, Rashel Moritz, Yael Yungster, Alice Rakotoarison, Liang Tan, Chris Summers, Carleigh Wood, Joshua Lane, Mary Williamson, Wei-Ning Hsu

    Abstract: Audio is an essential part of our life, but creating it often requires expertise and is time-consuming. Research communities have made great progress over the past year advancing the performance of large scale audio generative models for a single modality (speech, sound, or music) through adopting more powerful generative models and scaling data. However, these models lack controllability in sever… ▽ More

    Submitted 25 December, 2023; originally announced December 2023.

  18. arXiv:2312.13279  [pdf, other

    cs.RO

    Stretch with Stretch: Physical Therapy Exercise Games Led by a Mobile Manipulator

    Authors: Matthew Lamsey, You Liang Tan, Meredith D. Wells, Madeline Beatty, Zexuan Liu, Arjun Majumdar, Kendra Washington, Jerry Feldman, Naveen Kuppuswamy, Elizabeth Nguyen, Arielle Wallenstein, Madeleine E. Hackney, Charles C. Kemp

    Abstract: Physical therapy (PT) is a key component of many rehabilitation regimens, such as treatments for Parkinson's disease (PD). However, there are shortages of physical therapists and adherence to self-guided PT is low. Robots have the potential to support physical therapists and increase adherence to self-guided PT, but prior robotic systems have been large and immobile, which can be a barrier to use… ▽ More

    Submitted 21 December, 2023; v1 submitted 20 December, 2023; originally announced December 2023.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  19. arXiv:2312.04948  [pdf, other

    cs.CV astro-ph.GA cs.LG

    Scientific Preparation for CSST: Classification of Galaxy and Nebula/Star Cluster Based on Deep Learning

    Authors: Yuquan Zhang, Zhong Cao, Feng Wang, Lam, Man I, Hui Deng, Ying Mei, Lei Tan

    Abstract: The Chinese Space Station Telescope (abbreviated as CSST) is a future advanced space telescope. Real-time identification of galaxy and nebula/star cluster (abbreviated as NSC) images is of great value during CSST survey. While recent research on celestial object recognition has progressed, the rapid and efficient identification of high-resolution local celestial images remains challenging. In this… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

  20. arXiv:2311.15060  [pdf, ps, other

    eess.SP cs.IT

    Key Issues in Wireless Transmission for NTN-Assisted Internet of Things

    Authors: Chenhao Qi, Jing Wang, Leyi Lyu, Lei Tan, Jinming Zhang, Geoffrey Ye Li

    Abstract: Non-terrestrial networks (NTNs) have become appealing resolutions for seamless coverage in the next-generation wireless transmission, where a large number of Internet of Things (IoT) devices diversely distributed can be efficiently served. The explosively growing number of IoT devices brings a new challenge for massive connection. The long-distance wireless signal propagation in NTNs leads to seve… ▽ More

    Submitted 25 November, 2023; originally announced November 2023.

    Comments: 7 pages, 6 figures

  21. arXiv:2311.13809  [pdf, other

    cs.RO

    Responsive Hydrogel-based Modular Microrobots for Multi-functional Micromanipulation

    Authors: Liyuan Tan, David J. Cappelleri

    Abstract: Microrobots show great potential in biomedical applications such as drug delivery and cell manipulations. However, current microrobots are mostly fabricated as a single entity and type and the tasks they can perform are limited. In this paper, modular microrobots, with an overall size of 120 $μ$m $\times$ 200 $μ$m, are proposed with responsive mating components, made from stimuli-responsive hydrog… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

    Comments: 15 pages, 7 figures

  22. arXiv:2311.13721  [pdf, other

    cs.SE cs.AI

    Nova: Generative Language Models for Assembly Code with Hierarchical Attention and Contrastive Learning

    Authors: Nan Jiang, Chengxiao Wang, Kevin Liu, Xiangzhe Xu, Lin Tan, Xiangyu Zhang

    Abstract: Binary code analysis is the foundation of crucial tasks in the security domain; thus building effective binary analysis techniques is more important than ever. Large language models (LLMs) although have brought impressive improvement to source code tasks, do not directly generalize to assembly code due to the unique challenges of assembly: (1) the low information density of assembly and (2) the di… ▽ More

    Submitted 24 June, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

  23. arXiv:2311.08880  [pdf, other

    cs.RO eess.SY

    Motion Control of Two Mobile Robots under Allowable Collisions

    Authors: Li Tan, Wei Ren, Xi-Ming Sun, Junlin Xiong

    Abstract: This letter investigates the motion control problem of two mobile robots under allowable collisions. Here, the allowable collisions mean that the collisions do not damage the mobile robots. The occurrence of the collisions is discussed and the effects of the collisions on the mobile robots are analyzed to develop a hybrid model of each mobile robot under allowable collisions. Based on the effects… ▽ More

    Submitted 26 April, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: 8 pages, 5 figures

  24. arXiv:2310.08864  [pdf, other

    cs.RO

    Open X-Embodiment: Robotic Learning Datasets and RT-X Models

    Authors: Open X-Embodiment Collaboration, Abby O'Neill, Abdul Rehman, Abhinav Gupta, Abhiram Maddukuri, Abhishek Gupta, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Agrim Gupta, Ajay Mandlekar, Ajinkya Jain, Albert Tung, Alex Bewley, Alex Herzog, Alex Irpan, Alexander Khazatsky, Anant Rai, Anchit Gupta, Andrew Wang, Andrey Kolobov, Anikait Singh, Animesh Garg, Aniruddha Kembhavi, Annie Xie , et al. (267 additional authors not shown)

    Abstract: Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning method… ▽ More

    Submitted 1 June, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Project website: https://robotics-transformer-x.github.io

  25. arXiv:2310.05348  [pdf, other

    cs.LG cs.AI

    Continuous Invariance Learning

    Authors: Yong Lin, Fan Zhou, Lu Tan, Lintao Ma, Jiameng Liu, Yansu He, Yuan Yuan, Yu Liu, James Zhang, Yujiu Yang, Hao Wang

    Abstract: Invariance learning methods aim to learn invariant features in the hope that they generalize under distributional shifts. Although many tasks are naturally characterized by continuous domains, current invariance learning techniques generally assume categorically indexed domains. For example, auto-scaling in cloud computing often needs a CPU utilization prediction model that generalizes across diff… ▽ More

    Submitted 22 April, 2024; v1 submitted 8 October, 2023; originally announced October 2023.

  26. arXiv:2310.01551  [pdf, other

    cs.LG cs.AI cs.DS

    Harnessing the Power of Choices in Decision Tree Learning

    Authors: Guy Blanc, Jane Lange, Chirag Pabbaraju, Colin Sullivan, Li-Yang Tan, Mo Tiwari

    Abstract: We propose a simple generalization of standard and empirically successful decision tree learning algorithms such as ID3, C4.5, and CART. These algorithms, which have been central to machine learning for decades, are greedy in nature: they grow a decision tree by iteratively splitting on the best attribute. Our algorithm, Top-$k$, considers the $k$ best attributes as possible splits instead of just… ▽ More

    Submitted 25 October, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023

    ACM Class: I.2.0; I.2.m

  27. arXiv:2309.17230  [pdf, other

    cs.LG

    Spurious Feature Diversification Improves Out-of-distribution Generalization

    Authors: Yong Lin, Lu Tan, Yifan Hao, Honam Wong, Hanze Dong, Weizhong Zhang, Yujiu Yang, Tong Zhang

    Abstract: Generalization to out-of-distribution (OOD) data is a critical challenge in machine learning. Ensemble-based methods, like weight space ensembles that interpolate model parameters, have been shown to achieve superior OOD performance. However, the underlying mechanism for their effectiveness remains unclear. In this study, we closely examine WiSE-FT, a popular weight space ensemble method that inte… ▽ More

    Submitted 14 July, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

    Comments: ICLR 2024

  28. arXiv:2309.15941  [pdf, other

    cs.CV

    AutoEncoding Tree for City Generation and Applications

    Authors: Wenyu Han, Congcong Wen, Lazarus Chok, Yan Liang Tan, Sheung Lung Chan, Hang Zhao, Chen Feng

    Abstract: City modeling and generation have attracted an increased interest in various applications, including gaming, urban planning, and autonomous driving. Unlike previous works focused on the generation of single objects or indoor scenes, the huge volumes of spatial data in cities pose a challenge to the generative models. Furthermore, few publicly available 3D real-world city datasets also hinder the d… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

  29. arXiv:2309.12312  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    ForceSight: Text-Guided Mobile Manipulation with Visual-Force Goals

    Authors: Jeremy A. Collins, Cody Houff, You Liang Tan, Charles C. Kemp

    Abstract: We present ForceSight, a system for text-guided mobile manipulation that predicts visual-force goals using a deep neural network. Given a single RGBD image combined with a text prompt, ForceSight determines a target end-effector pose in the camera frame (kinematic goal) and the associated forces (force goal). Together, these two components form a visual-force goal. Prior work has demonstrated that… ▽ More

    Submitted 23 September, 2023; v1 submitted 21 September, 2023; originally announced September 2023.

  30. Dense Voxel 3D Reconstruction Using a Monocular Event Camera

    Authors: Haodong Chen, Vera Chung, Li Tan, Xiaoming Chen

    Abstract: Event cameras are sensors inspired by biological systems that specialize in capturing changes in brightness. These emerging cameras offer many advantages over conventional frame-based cameras, including high dynamic range, high frame rates, and extremely low power consumption. Due to these advantages, event cameras have increasingly been adapted in various fields, such as frame interpolation, sema… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

  31. Deep Image Harmonization in Dual Color Spaces

    Authors: Linfeng Tan, Jiangtong Li, Li Niu, Liqing Zhang

    Abstract: Image harmonization is an essential step in image composition that adjusts the appearance of composite foreground to address the inconsistency between foreground and background. Existing methods primarily operate in correlated $RGB$ color space, leading to entangled features and limited representation ability. In contrast, decorrelated color space (e.g., $Lab$) has decorrelated channels that provi… ▽ More

    Submitted 5 August, 2023; originally announced August 2023.

    Comments: Accepted by ACMMM 2023

  32. arXiv:2308.00356  [pdf, other

    cs.CV

    Deep Image Harmonization with Globally Guided Feature Transformation and Relation Distillation

    Authors: Li Niu, Linfeng Tan, Xinhao Tao, Junyan Cao, Fengjun Guo, Teng Long, Liqing Zhang

    Abstract: Given a composite image, image harmonization aims to adjust the foreground illumination to be consistent with background. Previous methods have explored transforming foreground features to achieve competitive performance. In this work, we show that using global information to guide foreground feature transformation could achieve significant improvement. Besides, we propose to transfer the foregrou… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: Accepted by ICCV 2023

  33. arXiv:2307.09248  [pdf, other

    cs.LG eess.SP

    Application of BERT in Wind Power Forecasting-Teletraan's Solution in Baidu KDD Cup 2022

    Authors: Longxing Tan, Hongying Yue

    Abstract: Nowadays, wind energy has drawn increasing attention as its important role in carbon neutrality and sustainable development. When wind power is integrated into the power grid, precise forecasting is necessary for the sustainability and security of the system. However, the unpredictable nature and long sequence prediction make it especially challenging. In this technical report, we introduce the BE… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

  34. arXiv:2307.08927  [pdf, other

    cs.RO cs.AI

    Multi-Stage Cable Routing through Hierarchical Imitation Learning

    Authors: Jianlan Luo, Charles Xu, Xinyang Geng, Gilbert Feng, Kuan Fang, Liam Tan, Stefan Schaal, Sergey Levine

    Abstract: We study the problem of learning to perform multi-stage robotic manipulation tasks, with applications to cable routing, where the robot must route a cable through a series of clips. This setting presents challenges representative of complex multi-stage robotic manipulation scenarios: handling deformable objects, closing the loop on visual perception, and handling extended behaviors consisting of m… ▽ More

    Submitted 13 January, 2024; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: T-RO 2024

  35. arXiv:2307.04093  [pdf, ps, other

    cs.CC cs.DS cs.LG

    Properly Learning Decision Trees with Queries Is NP-Hard

    Authors: Caleb Koch, Carmen Strassle, Li-Yang Tan

    Abstract: We prove that it is NP-hard to properly PAC learn decision trees with queries, resolving a longstanding open problem in learning theory (Bshouty 1993; Guijarro-Lavin-Raghavan 1999; Mehta-Raghavan 2002; Feldman 2016). While there has been a long line of work, dating back to (Pitt-Valiant 1988), establishing the hardness of properly learning decision trees from random examples, the more challenging… ▽ More

    Submitted 9 July, 2023; originally announced July 2023.

    Comments: 41 pages, 10 figures, FOCS 2023

  36. arXiv:2307.04039  [pdf, ps, other

    cs.CC cs.DS

    A Strong Composition Theorem for Junta Complexity and the Boosting of Property Testers

    Authors: Guy Blanc, Caleb Koch, Carmen Strassle, Li-Yang Tan

    Abstract: We prove a strong composition theorem for junta complexity and show how such theorems can be used to generically boost the performance of property testers. The $\varepsilon$-approximate junta complexity of a function $f$ is the smallest integer $r$ such that $f$ is $\varepsilon$-close to a function that depends only on $r$ variables. A strong composition theorem states that if $f$ has large… ▽ More

    Submitted 8 July, 2023; originally announced July 2023.

    Comments: 44 pages, 1 figure, FOCS 2023

  37. arXiv:2306.03324  [pdf, other

    cs.SE

    Impact of Large Language Models on Generating Software Specifications

    Authors: Danning Xie, Byungwoo Yoo, Nan Jiang, Mijung Kim, Lin Tan, Xiangyu Zhang, Judy S. Lee

    Abstract: Software specifications are essential for ensuring the reliability of software systems. Existing specification extraction approaches, however, suffer from limited generalizability and require manual efforts. The recent emergence of Large Language Models (LLMs), which have been successfully applied to numerous software engineering tasks, offers a promising avenue for automating this process. In thi… ▽ More

    Submitted 2 October, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

  38. arXiv:2306.02546  [pdf, other

    cs.SE

    Leveraging Generative Models to Recover Variable Names from Stripped Binary

    Authors: Xiangzhe Xu, Zhuo Zhang, Zian Su, Ziyang Huang, Shiwei Feng, Yapeng Ye, Nan Jiang, Danning Xie, Siyuan Cheng, Lin Tan, Xiangyu Zhang

    Abstract: Decompilation aims to recover the source code form of a binary executable. It has many security applications such as malware analysis, vulnerability detection and code hardening. A prominent challenge in decompilation is to recover variable names. We propose a novel technique that leverages the strengths of generative models while suppressing potential hallucinations and overcoming the input token… ▽ More

    Submitted 30 April, 2024; v1 submitted 4 June, 2023; originally announced June 2023.

  39. arXiv:2305.18607  [pdf, other

    cs.SE cs.AI cs.CR

    How Effective Are Neural Networks for Fixing Security Vulnerabilities

    Authors: Yi Wu, Nan Jiang, Hung Viet Pham, Thibaud Lutellier, Jordan Davis, Lin Tan, Petr Babkin, Sameena Shah

    Abstract: Security vulnerability repair is a difficult task that is in dire need of automation. Two groups of techniques have shown promise: (1) large code language models (LLMs) that have been pre-trained on source code for tasks such as code completion, and (2) automated program repair (APR) techniques that use deep learning (DL) models to automatically fix software bugs. This paper is the first to stud… ▽ More

    Submitted 1 April, 2024; v1 submitted 29 May, 2023; originally announced May 2023.

    Comments: This paper was accepted in the proceedings of the 32nd ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA 2023), and was presented at the conference, that was held in Seattle, USA, 17-21 July 2023

  40. arXiv:2305.13499  [pdf, other

    cs.CL

    Learning Easily Updated General Purpose Text Representations with Adaptable Task-Specific Prefixes

    Authors: Kuan-Hao Huang, Liang Tan, Rui Hou, Sinong Wang, Amjad Almahairi, Ruty Rinott

    Abstract: Many real-world applications require making multiple predictions from the same text. Fine-tuning a large pre-trained language model for each downstream task causes computational burdens in the inference time due to several times of forward passes. To amortize the computational cost, freezing the language model and building lightweight models for downstream tasks based on fixed text representations… ▽ More

    Submitted 14 October, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Paper accepted by EMNLP 2023 Findings

  41. arXiv:2305.05529  [pdf, other

    stat.CO cs.LG math.PR math.ST stat.ML

    Accelerate Langevin Sampling with Birth-Death process and Exploration Component

    Authors: Lezhi Tan, Jianfeng Lu

    Abstract: Sampling a probability distribution with known likelihood is a fundamental task in computational science and engineering. Aiming at multimodality, we propose a new sampling method that takes advantage of both birth-death process and exploration component. The main idea of this method is \textit{look before you leap}. We keep two sets of samplers, one at warmer temperature and one at original tempe… ▽ More

    Submitted 6 May, 2023; originally announced May 2023.

    Comments: 23 pages, 10 figures

  42. arXiv:2304.01482  [pdf, other

    cs.CV

    Defending Against Patch-based Backdoor Attacks on Self-Supervised Learning

    Authors: Ajinkya Tejankar, Maziar Sanjabi, Qifan Wang, Sinong Wang, Hamed Firooz, Hamed Pirsiavash, Liang Tan

    Abstract: Recently, self-supervised learning (SSL) was shown to be vulnerable to patch-based data poisoning backdoor attacks. It was shown that an adversary can poison a small part of the unlabeled data so that when a victim trains an SSL model on it, the final model will have a backdoor that the adversary can exploit. This work aims to defend self-supervised learning against such attacks. We use a three-st… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: Accepted to CVPR 2023

  43. arXiv:2303.16208  [pdf, ps, other

    stat.ML cs.CC cs.DS cs.LG

    Lifting uniform learners via distributional decomposition

    Authors: Guy Blanc, Jane Lange, Ali Malik, Li-Yang Tan

    Abstract: We show how any PAC learning algorithm that works under the uniform distribution can be transformed, in a blackbox fashion, into one that works under an arbitrary and unknown distribution $\mathcal{D}$. The efficiency of our transformation scales with the inherent complexity of $\mathcal{D}$, running in $\mathrm{poly}(n, (md)^d)$ time for distributions over $\{\pm 1\}^n$ whose pmfs are computed by… ▽ More

    Submitted 29 March, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

    Comments: To appear in STOC 2023

  44. arXiv:2303.13592  [pdf, other

    cs.CL cs.AI

    Prompting Multilingual Large Language Models to Generate Code-Mixed Texts: The Case of South East Asian Languages

    Authors: Zheng-Xin Yong, Ruochen Zhang, Jessica Zosa Forde, Skyler Wang, Arjun Subramonian, Holy Lovenia, Samuel Cahyawijaya, Genta Indra Winata, Lintang Sutawika, Jan Christian Blaise Cruz, Yin Lin Tan, Long Phan, Rowena Garcia, Thamar Solorio, Alham Fikri Aji

    Abstract: While code-mixing is a common linguistic practice in many parts of the world, collecting high-quality and low-cost code-mixed data remains a challenge for natural language processing (NLP) research. The recent proliferation of Large Language Models (LLMs) compels one to ask: how capable are these systems in generating code-mixed data? In this paper, we explore prompting multilingual LLMs in a zero… ▽ More

    Submitted 12 September, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: Updating Authors

  45. arXiv:2303.10976  [pdf, other

    cs.CV

    Attention Disturbance and Dual-Path Constraint Network for Occluded Person Re-identification

    Authors: Jiaer Xia, Lei Tan, Pingyang Dai, Mingbo Zhao, Yongjian Wu, Liujuan Cao

    Abstract: Occluded person re-identification (Re-ID) aims to address the potential occlusion problem when matching occluded or holistic pedestrians from different camera views. Many methods use the background as artificial occlusion and rely on attention networks to exclude noisy interference. However, the significant discrepancy between simple background occlusion and realistic occlusion can negatively impa… ▽ More

    Submitted 22 February, 2024; v1 submitted 20 March, 2023; originally announced March 2023.

    Comments: AAAI2024

  46. arXiv:2302.10175  [pdf, other

    q-fin.PM cs.LG q-fin.TR stat.ML

    Spatio-Temporal Momentum: Jointly Learning Time-Series and Cross-Sectional Strategies

    Authors: Wee Ling Tan, Stephen Roberts, Stefan Zohren

    Abstract: We introduce Spatio-Temporal Momentum strategies, a class of models that unify both time-series and cross-sectional momentum strategies by trading assets based on their cross-sectional momentum features over time. While both time-series and cross-sectional momentum strategies are designed to systematically capture momentum risk premia, these strategies are regarded as distinct implementations and… ▽ More

    Submitted 20 February, 2023; originally announced February 2023.

    Journal ref: The Journal of Financial Data Science, Summer 2023

  47. arXiv:2302.05020  [pdf, other

    cs.SE

    Impact of Code Language Models on Automated Program Repair

    Authors: Nan Jiang, Kevin Liu, Thibaud Lutellier, Lin Tan

    Abstract: Automated program repair (APR) aims to help developers improve software reliability by generating patches for buggy programs. Although many code language models (CLM) are developed and effective in many software tasks such as code completion, there has been little comprehensive, in-depth work to evaluate CLMs' fixing capabilities and to fine-tune CLMs for the APR task. Firstly, this work is the… ▽ More

    Submitted 16 April, 2023; v1 submitted 9 February, 2023; originally announced February 2023.

    Comments: This paper is accepted by 2023 IEEE/ACM 45th International Conference on Software Engineering (ICSE)

  48. arXiv:2302.01857  [pdf, other

    cs.SE cs.AI

    KNOD: Domain Knowledge Distilled Tree Decoder for Automated Program Repair

    Authors: Nan Jiang, Thibaud Lutellier, Yiling Lou, Lin Tan, Dan Goldwasser, Xiangyu Zhang

    Abstract: Automated Program Repair (APR) improves software reliability by generating patches for a buggy program automatically. Recent APR techniques leverage deep learning (DL) to build models to learn to generate patches from existing patches and code corpora. While promising, DL-based APR techniques suffer from the abundant syntactically or semantically incorrect patches in the patch space. These patches… ▽ More

    Submitted 16 April, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: This paper is accepted by 2023 IEEE/ACM 45th International Conference on Software Engineering (ICSE)

  49. arXiv:2302.01512  [pdf, other

    cs.CV

    Spectral Aware Softmax for Visible-Infrared Person Re-Identification

    Authors: Lei Tan, Pingyang Dai, Qixiang Ye, Mingliang Xu, Yongjian Wu, Rongrong Ji

    Abstract: Visible-infrared person re-identification (VI-ReID) aims to match specific pedestrian images from different modalities. Although suffering an extra modality discrepancy, existing methods still follow the softmax loss training paradigm, which is widely used in single-modality classification tasks. The softmax loss lacks an explicit penalty for the apparent modality gap, which adversely limits the p… ▽ More

    Submitted 2 February, 2023; originally announced February 2023.

  50. arXiv:2302.00884  [pdf, other

    cs.CV

    Exploring Invariant Representation for Visible-Infrared Person Re-Identification

    Authors: Lei Tan, Yukang Zhang, Shengmei Shen, Yan Wang, Pingyang Dai, Xianming Lin, Yongjian Wu, Rongrong Ji

    Abstract: Cross-spectral person re-identification, which aims to associate identities to pedestrians across different spectra, faces a main challenge of the modality discrepancy. In this paper, we address the problem from both image-level and feature-level in an end-to-end hybrid learning framework named robust feature mining network (RFM). In particular, we observe that the reflective intensity of the same… ▽ More

    Submitted 2 February, 2023; originally announced February 2023.