Skip to main content

Showing 1–50 of 115 results for author: Liang, T

  1. arXiv:2407.11588  [pdf, other

    cs.CV

    Progressive Pretext Task Learning for Human Trajectory Prediction

    Authors: Xiaotong Lin, Tianming Liang, Jianhuang Lai, Jian-Fang Hu

    Abstract: Human trajectory prediction is a practical task of predicting the future positions of pedestrians on the road, which typically covers all temporal ranges from short-term to long-term within a trajectory. However, existing works attempt to address the entire trajectory prediction with a singular, uniform training paradigm, neglecting the distinction between short-term and long-term dynamics in huma… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: Accepted to ECCV 2024

  2. arXiv:2407.09121  [pdf, other

    cs.CL cs.AI

    Refuse Whenever You Feel Unsafe: Improving Safety in LLMs via Decoupled Refusal Training

    Authors: Youliang Yuan, Wenxiang Jiao, Wenxuan Wang, Jen-tse Huang, Jiahao Xu, Tian Liang, Pinjia He, Zhaopeng Tu

    Abstract: This study addresses a critical gap in safety tuning practices for Large Language Models (LLMs) by identifying and tackling a refusal position bias within safety tuning data, which compromises the models' ability to appropriately refuse generating unsafe content. We introduce a novel approach, Decoupled Refusal Training (DeRTa), designed to empower LLMs to refuse compliance to harmful prompts at a… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

  3. arXiv:2407.06579  [pdf, other

    cs.CL

    NoisyAG-News: A Benchmark for Addressing Instance-Dependent Noise in Text Classification

    Authors: Hongfei Huang, Tingting Liang, Xixi Sun, Zikang Jin, Yuyu Yin

    Abstract: Existing research on learning with noisy labels predominantly focuses on synthetic label noise. Although synthetic noise possesses well-defined structural properties, it often fails to accurately replicate real-world noise patterns. In recent years, there has been a concerted effort to construct generalizable and controllable instance-dependent noise datasets for image classification, significantl… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 20 pages , 13 figure

  4. arXiv:2406.15904  [pdf, other

    cs.LG stat.ME stat.ML

    Learning When the Concept Shifts: Confounding, Invariance, and Dimension Reduction

    Authors: Kulunu Dharmakeerthi, YoonHaeng Hur, Tengyuan Liang

    Abstract: Practitioners often deploy a learned prediction model in a new environment where the joint distribution of covariate and response has shifted. In observational data, the distribution shift is often driven by unobserved confounding factors lurking in the environment, with the underlying mechanism unknown. Confounding can obfuscate the definition of the best prediction model (concept shift) and shif… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

  5. arXiv:2406.10797  [pdf, other

    cs.CV

    STAR: Scale-wise Text-to-image generation via Auto-Regressive representations

    Authors: Xiaoxiao Ma, Mohan Zhou, Tao Liang, Yalong Bai, Tiejun Zhao, Huaian Chen, Yi Jin

    Abstract: We present STAR, a text-to-image model that employs scale-wise auto-regressive paradigm. Unlike VAR, which is limited to class-conditioned synthesis within a fixed set of predetermined categories, our STAR enables text-driven open-set generation through three key designs: To boost diversity and generalizability with unseen combinations of objects and concepts, we introduce a pre-trained text encod… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: 12 pages, 6 figures

  6. arXiv:2405.19088  [pdf, other

    cs.CL cs.CV

    Cracking the Code of Juxtaposition: Can AI Models Understand the Humorous Contradictions

    Authors: Zhe Hu, Tuo Liang, Jing Li, Yiren Lu, Yunlai Zhou, Yiran Qiao, Jing Ma, Yu Yin

    Abstract: Recent advancements in large multimodal language models have demonstrated remarkable proficiency across a wide range of tasks. Yet, these models still struggle with understanding the nuances of human humor through juxtaposition, particularly when it involves nonlinear narratives that underpin many jokes and humor cues. This paper investigates this challenge by focusing on comics with contradictory… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  7. arXiv:2404.11824  [pdf, other

    cs.CV

    TextCenGen: Attention-Guided Text-Centric Background Adaptation for Text-to-Image Generation

    Authors: Tianyi Liang, Jiangqi Liu, Sicheng Song, Shiqi Jiang, Yifei Huang, Changbo Wang, Chenhui Li

    Abstract: Recent advancements in Text-to-image (T2I) generation have witnessed a shift from adapting text to fixed backgrounds to creating images around text. Traditional approaches are often limited to generate layouts within static images for effective text placement. Our proposed approach, TextCenGen, introduces a dynamic adaptation of the blank region for text-friendly image generation, emphasizing text… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: 7 pages, 7 figures

  8. arXiv:2403.19460  [pdf, other

    cs.RO cs.AI

    RiEMann: Near Real-Time SE(3)-Equivariant Robot Manipulation without Point Cloud Segmentation

    Authors: Chongkai Gao, Zhengrong Xue, Shuying Deng, Tianhai Liang, Siqi Yang, Lin Shao, Huazhe Xu

    Abstract: We present RiEMann, an end-to-end near Real-time SE(3)-Equivariant Robot Manipulation imitation learning framework from scene point cloud input. Compared to previous methods that rely on descriptor field matching, RiEMann directly predicts the target poses of objects for manipulation without any object segmentation. RiEMann learns a manipulation task from scratch with 5 to 10 demonstrations, gener… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  9. arXiv:2403.15027  [pdf, other

    cs.LG cs.AI

    Grey-informed neural network for time-series forecasting

    Authors: Wanli Xie, Ruibin Zhao, Zhenguo Xu, Tingting Liang

    Abstract: Neural network models have shown outstanding performance and successful resolutions to complex problems in various fields. However, the majority of these models are viewed as black-box, requiring a significant amount of data for development. Consequently, in situations with limited data, constructing appropriate models becomes challenging due to the lack of transparency and scarcity of data. To ta… ▽ More

    Submitted 3 April, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

  10. arXiv:2403.14430  [pdf, other

    cs.CV

    Ranking Distillation for Open-Ended Video Question Answering with Insufficient Labels

    Authors: Tianming Liang, Chaolei Tan, Beihao Xia, Wei-Shi Zheng, Jian-Fang Hu

    Abstract: This paper focuses on open-ended video question answering, which aims to find the correct answers from a large answer set in response to a video-related question. This is essentially a multi-label classification task, since a question may have multiple answers. However, due to annotation costs, the labels in existing benchmarks are always extremely insufficient, typically one answer per question.… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024

  11. arXiv:2403.11807  [pdf, other

    cs.AI cs.CL

    How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments

    Authors: Jen-tse Huang, Eric John Li, Man Ho Lam, Tian Liang, Wenxuan Wang, Youliang Yuan, Wenxiang Jiao, Xing Wang, Zhaopeng Tu, Michael R. Lyu

    Abstract: Decision-making, a complicated task requiring various types of abilities, presents an excellent framework for assessing Large Language Models (LLMs). Our research investigates LLMs' decision-making capabilities through the lens of a well-established field, Game Theory. We focus specifically on games that support the participation of more than two agents simultaneously. Subsequently, we introduce o… ▽ More

    Submitted 25 April, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: 16 pages of main text. 11 pages of appendices. 15 figures, 9 tables. Updated scoring scheme

  12. arXiv:2403.00179  [pdf, other

    cs.HC

    Counterspeakers' Perspectives: Unveiling Barriers and AI Needs in the Fight against Online Hate

    Authors: Jimin Mun, Cathy Buerger, Jenny T. Liang, Joshua Garland, Maarten Sap

    Abstract: Counterspeech, i.e., direct responses against hate speech, has become an important tool to address the increasing amount of hate online while avoiding censorship. Although AI has been proposed to help scale up counterspeech efforts, this raises questions of how exactly AI could assist in this process, since counterspeech is a deeply empathetic and agentic process for those involved. In this work,… ▽ More

    Submitted 29 February, 2024; originally announced March 2024.

    Comments: To appear in CHI 2024. 22 pages, 3 figures, 7 tables

  13. arXiv:2402.15290  [pdf, other

    cs.LG cs.AI

    Appendix for Linear Dynamics-embedded Neural Network for Long-Sequence Modeling

    Authors: Tongyi Liang, Han-Xiong Li

    Abstract: This appendix provides all necessary materials for the paper 'Linear Dynamics-embedded Neural Network for Long-Sequence Modeling', including model details, experimental configurations, and PyTorch implementation.

    Submitted 2 June, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

  14. arXiv:2402.15284  [pdf, other

    cs.LG cs.AI eess.SY

    Spatiotemporal Observer Design for Predictive Learning of High-Dimensional Data

    Authors: Tongyi Liang, Han-Xiong Li

    Abstract: Although deep learning-based methods have shown great success in spatiotemporal predictive learning, the framework of those models is designed mainly by intuition. How to make spatiotemporal forecasting with theoretical guarantees is still a challenging issue. In this work, we tackle this problem by applying domain knowledge from the dynamical system to the framework design of deep learning models… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: Under review by IEEE Transactions on Pattern Analysis and Machine Intelligence

  15. arXiv:2402.14809  [pdf, other

    cs.CL cs.AI cs.LG

    CriticBench: Benchmarking LLMs for Critique-Correct Reasoning

    Authors: Zicheng Lin, Zhibin Gou, Tian Liang, Ruilin Luo, Haowei Liu, Yujiu Yang

    Abstract: The ability of Large Language Models (LLMs) to critique and refine their reasoning is crucial for their application in evaluation, feedback provision, and self-improvement. This paper introduces CriticBench, a comprehensive benchmark designed to assess LLMs' abilities to critique and rectify their reasoning across a variety of tasks. CriticBench encompasses five reasoning domains: mathematical, co… ▽ More

    Submitted 1 June, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: ACL 2024 Findings

  16. arXiv:2312.06946  [pdf, other

    cs.CV

    WaterHE-NeRF: Water-ray Tracing Neural Radiance Fields for Underwater Scene Reconstruction

    Authors: Jingchun Zhou, Tianyu Liang, Dehuan Zhang, Zongxin He

    Abstract: Neural Radiance Field (NeRF) technology demonstrates immense potential in novel viewpoint synthesis tasks, due to its physics-based volumetric rendering process, which is particularly promising in underwater scenes. Addressing the limitations of existing underwater NeRF methods in handling light attenuation caused by the water medium and the lack of real Ground Truth (GT) supervision, this study p… ▽ More

    Submitted 18 January, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

  17. arXiv:2311.02126  [pdf, other

    cs.CV

    PILL: Plug Into LLM with Adapter Expert and Attention Gate

    Authors: Fangyuan Zhang, Tingting Liang, Zhengyuan Wu, Yuyu Yin

    Abstract: Due to the remarkable capabilities of powerful Large Language Models (LLMs) in effectively following instructions, there has been a growing number of assistants in the community to assist humans. Recently, significant progress has been made in the development of Vision Language Models (VLMs), expanding the capabilities of LLMs and enabling them to execute more diverse instructions. However, it is… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

  18. arXiv:2310.20499  [pdf, other

    cs.CL

    Leveraging Word Guessing Games to Assess the Intelligence of Large Language Models

    Authors: Tian Liang, Zhiwei He, Jen-tse Huang, Wenxuan Wang, Wenxiang Jiao, Rui Wang, Yujiu Yang, Zhaopeng Tu, Shuming Shi, Xing Wang

    Abstract: The automatic evaluation of LLM-based agent intelligence is critical in developing advanced LLM-based agents. Although considerable effort has been devoted to developing human-annotated evaluation datasets, such as AlpacaEval, existing techniques are costly, time-consuming, and lack adaptability. In this paper, inspired by the popular language game ``Who is Spy'', we propose to use the word guessi… ▽ More

    Submitted 5 November, 2023; v1 submitted 31 October, 2023; originally announced October 2023.

    Comments: Work in progress

  19. Sentence Bag Graph Formulation for Biomedical Distant Supervision Relation Extraction

    Authors: Hao Zhang, Yang Liu, Xiaoyan Liu, Tianming Liang, Gaurav Sharma, Liang Xue, Maozu Guo

    Abstract: We introduce a novel graph-based framework for alleviating key challenges in distantly-supervised relation extraction and demonstrate its effectiveness in the challenging and important domain of biomedical data. Specifically, we propose a graph view of sentence bags referring to an entity pair, which enables message-passing based aggregation of information related to the entity pair over the sente… ▽ More

    Submitted 29 October, 2023; originally announced October 2023.

    Comments: in IEEE Transactions on Knowledge and Data Engineering, 2024

  20. arXiv:2310.15419  [pdf, other

    cs.CE cs.DC cs.DS

    Fast multiplication of random dense matrices with fixed sparse matrices

    Authors: Tianyu Liang, Riley Murray, Aydın Buluç, James Demmel

    Abstract: This work focuses on accelerating the multiplication of a dense random matrix with a (fixed) sparse matrix, which is frequently used in sketching algorithms. We develop a novel scheme that takes advantage of blocking and recomputation (on-the-fly random number generation) to accelerate this operation. The techniques we propose decrease memory movement, thereby increasing the algorithm's parallel s… ▽ More

    Submitted 12 May, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

  21. arXiv:2310.01727  [pdf, other

    cs.SE cs.AI

    Can GPT-4 Replicate Empirical Software Engineering Research?

    Authors: Jenny T. Liang, Carmen Badea, Christian Bird, Robert DeLine, Denae Ford, Nicole Forsgren, Thomas Zimmermann

    Abstract: Empirical software engineering research on production systems has brought forth a better understanding of the software engineering process for practitioners and researchers alike. However, only a small subset of production systems is studied, limiting the impact of this research. While software engineering practitioners could benefit from replicating research on their own data, this poses its own… ▽ More

    Submitted 19 June, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

  22. arXiv:2310.00242  [pdf, ps, other

    cs.RO cs.CV

    Walking = Traversable? : Traversability Prediction via Multiple Human Object Tracking under Occlusion

    Authors: Jonathan Tay Yu Liang, Kanji Tanaka

    Abstract: The emerging ``Floor plan from human trails (PfH)" technique has great potential for improving indoor robot navigation by predicting the traversability of occluded floors. This study presents an innovative approach that replaces first-person-view sensors with a third-person-view monocular camera mounted on the observer robot. This approach can gather measurements from multiple humans, expanding it… ▽ More

    Submitted 29 September, 2023; originally announced October 2023.

    Comments: 6 figures, technical report

  23. arXiv:2309.12397  [pdf, other

    cs.RO cs.CV

    POLAR3D: Augmenting NASA's POLAR Dataset for Data-Driven Lunar Perception and Rover Simulation

    Authors: Bo-Hsun Chen, Peter Negrut, Thomas Liang, Nevindu Batagoda, Harry Zhang, Dan Negrut

    Abstract: We report on an effort that led to POLAR3D, a set of digital assets that enhance the POLAR dataset of stereo images generated by NASA to mimic lunar lighting conditions. Our contributions are twofold. First, we have annotated each photo in the POLAR dataset, providing approximately 23 000 labels for rocks and their shadows. Second, we digitized several lunar terrain scenarios available in the POLA… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: 7 pages, 4 figures; this work has been submitted to the 2024 IEEE Conference on Robotics and Automation (ICRA) under review

  24. arXiv:2309.03563  [pdf, other

    cs.CL

    All Labels Together: Low-shot Intent Detection with an Efficient Label Semantic Encoding Paradigm

    Authors: Jiangshu Du, Congying Xia, Wenpeng Yin, Tingting Liang, Philip S. Yu

    Abstract: In intent detection tasks, leveraging meaningful semantic information from intent labels can be particularly beneficial for few-shot scenarios. However, existing few-shot intent detection methods either ignore the intent labels, (e.g. treating intents as indices) or do not fully utilize this information (e.g. only using part of the intent labels). In this work, we present an end-to-end One-to-All… ▽ More

    Submitted 7 September, 2023; v1 submitted 7 September, 2023; originally announced September 2023.

    Comments: Accepted by IJCNLP-AACL 2023

  25. arXiv:2308.15232  [pdf, other

    cs.LG cs.CL cs.IR

    Classification-Aware Neural Topic Model Combined With Interpretable Analysis -- For Conflict Classification

    Authors: Tianyu Liang, Yida Mu, Soonho Kim, Darline Larissa Kengne Kuate, Julie Lang, Rob Vos, Xingyi Song

    Abstract: A large number of conflict events are affecting the world all the time. In order to analyse such conflict events effectively, this paper presents a Classification-Aware Neural Topic Model (CANTM-IA) for Conflict Information Classification and Topic Discovery. The model provides a reliable interpretation of classification results and discovered topics by introducing interpretability analysis. At th… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

    Comments: Accepted by RANLP 2023

  26. arXiv:2308.09987  [pdf, other

    cs.RO cs.AI cs.CV

    ClothesNet: An Information-Rich 3D Garment Model Repository with Simulated Clothes Environment

    Authors: Bingyang Zhou, Haoyu Zhou, Tianhai Liang, Qiaojun Yu, Siheng Zhao, Yuwei Zeng, Jun Lv, Siyuan Luo, Qiancai Wang, Xinyuan Yu, Haonan Chen, Cewu Lu, Lin Shao

    Abstract: We present ClothesNet: a large-scale dataset of 3D clothes objects with information-rich annotations. Our dataset consists of around 4400 models covering 11 categories annotated with clothes features, boundary lines, and keypoints. ClothesNet can be used to facilitate a variety of computer vision and robot interaction tasks. Using our dataset, we establish benchmark tasks for clothes perception, i… ▽ More

    Submitted 19 August, 2023; originally announced August 2023.

    Comments: IEEE/CVF International Conference on Computer Vision (ICCV) 2023

  27. arXiv:2307.14105  [pdf, ps, other

    cs.RO

    Active Robot Vision for Distant Object Change Detection: A Lightweight Training Simulator Inspired by Multi-Armed Bandits

    Authors: Kouki Terashima, Kanji Tanaka, Ryogo Yamamoto, Jonathan Tay Yu Liang

    Abstract: In ground-view object change detection, the recently emerging mapless navigation has great potential to navigate a robot to objects distantly detected (e.g., books, cups, clothes) and acquire high-resolution object images, to identify their change states (no-change/appear/disappear). However, naively performing full journeys for every distant object requires huge sense/plan/action costs, proportio… ▽ More

    Submitted 24 October, 2023; v1 submitted 26 July, 2023; originally announced July 2023.

    Comments: 7 pages, 7 figures, technical report

  28. arXiv:2307.10168  [pdf, other

    cs.CL cs.HC

    LLMs as Workers in Human-Computational Algorithms? Replicating Crowdsourcing Pipelines with LLMs

    Authors: Tongshuang Wu, Haiyi Zhu, Maya Albayrak, Alexis Axon, Amanda Bertsch, Wenxing Deng, Ziqi Ding, Bill Guo, Sireesh Gururaja, Tzu-Sheng Kuo, Jenny T. Liang, Ryan Liu, Ihita Mandal, Jeremiah Milbauer, Xiaolin Ni, Namrata Padmanabhan, Subhashini Ramkumar, Alexis Sudjianto, Jordan Taylor, Ying-Jui Tseng, Patricia Vaidos, Zhijin Wu, Wei Wu, Chenyang Yang

    Abstract: LLMs have shown promise in replicating human-like behavior in crowdsourcing tasks that were previously thought to be exclusive to human abilities. However, current efforts focus mainly on simple atomic tasks. We explore whether LLMs can replicate more complex crowdsourcing pipelines. We find that modern LLMs can simulate some of crowdworkers' abilities in these "human computation algorithms," but… ▽ More

    Submitted 19 July, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

  29. Bridging the Gap: Multi-Level Cross-Modality Joint Alignment for Visible-Infrared Person Re-Identification

    Authors: Tengfei Liang, Yi Jin, Wu Liu, Tao Wang, Songhe Feng, Yidong Li

    Abstract: Visible-Infrared person Re-IDentification (VI-ReID) is a challenging cross-modality image retrieval task that aims to match pedestrians' images across visible and infrared cameras. To solve the modality gap, existing mainstream methods adopt a learning paradigm converting the image retrieval task into an image classification task with cross-entropy loss and auxiliary metric learning losses. These… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: 10 pages, 4 figures, 5 tables

    Journal ref: 2024 IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)

  30. arXiv:2306.11351  [pdf, other

    cs.AR

    A Versatility-Performance Balanced Hardware Architecture for Scene Text Detection

    Authors: Yao Xin, Guoming Tang, Donglong Chen, Rumin Zhang, Teng Liang, Ray C. C. Cheung, Cetin Kaya Koc

    Abstract: Detecting and extracting textual information from natural scene images needs Scene Text Detection (STD) algorithms. Fully Convolutional Neural Networks (FCNs) are usually utilized as the backbone model to extract features in these instance segmentation based STD algorithms. FCNs naturally come with high computational complexity. Furthermore, to keep up with the growing variety of models, flexible… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

  31. arXiv:2306.03835  [pdf, other

    eess.IV cs.CV cs.LG

    Atrial Septal Defect Detection in Children Based on Ultrasound Video Using Multiple Instances Learning

    Authors: Yiman Liu, Qiming Huang, Xiaoxiang Han, Tongtong Liang, Zhifang Zhang, Lijun Chen, Jinfeng Wang, Angelos Stefanidis, Jionglong Su, Jiangang Chen, Qingli Li, Yuqi Zhang

    Abstract: Purpose: Congenital heart defect (CHD) is the most common birth defect. Thoracic echocardiography (TTE) can provide sufficient cardiac structure information, evaluate hemodynamics and cardiac function, and is an effective method for atrial septal defect (ASD) examination. This paper aims to study a deep learning method based on cardiac ultrasound video to assist in ASD diagnosis. Materials and met… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

  32. arXiv:2306.01943  [pdf, other

    cs.CL cs.CY cs.HC

    NLPositionality: Characterizing Design Biases of Datasets and Models

    Authors: Sebastin Santy, Jenny T. Liang, Ronan Le Bras, Katharina Reinecke, Maarten Sap

    Abstract: Design biases in NLP systems, such as performance differences for different populations, often stem from their creator's positionality, i.e., views and lived experiences shaped by identity and background. Despite the prevalence and risks of design biases, they are hard to quantify because researcher, system, and dataset positionality is often unobserved. We introduce NLPositionality, a framework f… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: ACL 2023

  33. arXiv:2305.19118  [pdf, other

    cs.CL

    Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate

    Authors: Tian Liang, Zhiwei He, Wenxiang Jiao, Xing Wang, Yan Wang, Rui Wang, Yujiu Yang, Zhaopeng Tu, Shuming Shi

    Abstract: Modern large language models (LLMs) like ChatGPT have shown remarkable performance on general language tasks but still struggle on complex reasoning tasks, which drives the research on cognitive behaviors of LLMs to explore human-like problem-solving strategies. Along this direction, one representative strategy is self-reflection, which asks an LLM to refine the solution with the feedback generate… ▽ More

    Submitted 19 June, 2024; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: Work in progress

  34. arXiv:2305.04118  [pdf, other

    cs.CL

    Exploring Human-Like Translation Strategy with Large Language Models

    Authors: Zhiwei He, Tian Liang, Wenxiang Jiao, Zhuosheng Zhang, Yujiu Yang, Rui Wang, Zhaopeng Tu, Shuming Shi, Xing Wang

    Abstract: Large language models (LLMs) have demonstrated impressive capabilities in general scenarios, exhibiting a level of aptitude that approaches, in some aspects even surpasses, human-level intelligence. Among their numerous skills, the translation abilities of LLMs have received considerable attention. Compared to typical machine translation that focuses solely on source-to-target mapping, LLM-based t… ▽ More

    Submitted 29 November, 2023; v1 submitted 6 May, 2023; originally announced May 2023.

    Comments: To be published in TACL (pre-MIT Press publication version)

  35. arXiv:2304.02426  [pdf, other

    cs.CL

    ParroT: Translating during Chat using Large Language Models tuned with Human Translation and Feedback

    Authors: Wenxiang Jiao, Jen-tse Huang, Wenxuan Wang, Zhiwei He, Tian Liang, Xing Wang, Shuming Shi, Zhaopeng Tu

    Abstract: Large language models (LLMs) like ChatGPT have exhibited remarkable abilities on a wide range of natural language processing~(NLP) tasks, including various machine translation abilities accomplished during chat. However, these models are only accessible through restricted APIs, which creates barriers to new research and advancements in the field. Therefore, we propose ParroT, a framework to enhanc… ▽ More

    Submitted 2 November, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

    Comments: 12 pages; EMNLP 2023 (Findings)

  36. arXiv:2303.17125  [pdf, other

    cs.SE cs.AI cs.HC

    A Large-Scale Survey on the Usability of AI Programming Assistants: Successes and Challenges

    Authors: Jenny T. Liang, Chenyang Yang, Brad A. Myers

    Abstract: The software engineering community recently has witnessed widespread deployment of AI programming assistants, such as GitHub Copilot. However, in practice, developers do not accept AI programming assistants' initial suggestions at a high frequency. This leaves a number of open questions related to the usability of these tools. To understand developers' practices while using these tools and the imp… ▽ More

    Submitted 17 September, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

    Comments: Accepted to ICSE'24

  37. arXiv:2303.06042  [pdf, other

    cs.CV

    MVImgNet: A Large-scale Dataset of Multi-view Images

    Authors: Xianggang Yu, Mutian Xu, Yidan Zhang, Haolin Liu, Chongjie Ye, Yushuang Wu, Zizheng Yan, Chenming Zhu, Zhangyang Xiong, Tianyou Liang, Guanying Chen, Shuguang Cui, Xiaoguang Han

    Abstract: Being data-driven is one of the most iconic properties of deep learning algorithms. The birth of ImageNet drives a remarkable trend of "learning from large-scale data" in computer vision. Pretraining on ImageNet to obtain rich universal representations has been manifested to benefit various 2D visual tasks, and becomes a standard in 2D vision. However, due to the laborious collection of real-world… ▽ More

    Submitted 10 March, 2023; originally announced March 2023.

    Comments: To be appear in CVPR2023. Project page: https://gaplab.cuhk.edu.cn/projects/MVImgNet/

  38. arXiv:2303.05689  [pdf, other

    cs.CV cs.AI

    Inducing Neural Collapse to a Fixed Hierarchy-Aware Frame for Reducing Mistake Severity

    Authors: Tong Liang, Jim Davis

    Abstract: There is a recently discovered and intriguing phenomenon called Neural Collapse: at the terminal phase of training a deep neural network for classification, the within-class penultimate feature means and the associated classifier vectors of all flat classes collapse to the vertices of a simplex Equiangular Tight Frame (ETF). Recent work has tried to exploit this phenomenon by fixing the related cl… ▽ More

    Submitted 9 August, 2023; v1 submitted 9 March, 2023; originally announced March 2023.

    Comments: ICCV 2023

  39. EDMAE: An Efficient Decoupled Masked Autoencoder for Standard View Identification in Pediatric Echocardiography

    Authors: Yiman Liu, Xiaoxiang Han, Tongtong Liang, Bin Dong, Jiajun Yuan, Menghan Hu, Qiaohong Liu, Jiangang Chen, Qingli Li, Yuqi Zhang

    Abstract: This paper introduces the Efficient Decoupled Masked Autoencoder (EDMAE), a novel self-supervised method for recognizing standard views in pediatric echocardiography. EDMAE introduces a new proxy task based on the encoder-decoder structure. The EDMAE encoder is composed of a teacher and a student encoder. The teacher encoder extracts the potential representation of the masked image blocks, while t… ▽ More

    Submitted 3 August, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

    Comments: 15 pages, 5 figures, 8 tables, Published in Biomedical Signal Processing and Control

    Journal ref: Biomedical Signal Processing and Control 86 (2023) 105280

  40. arXiv:2302.11474  [pdf, other

    math.NA cs.MS math.OC

    Randomized Numerical Linear Algebra : A Perspective on the Field With an Eye to Software

    Authors: Riley Murray, James Demmel, Michael W. Mahoney, N. Benjamin Erichson, Maksim Melnichenko, Osman Asif Malik, Laura Grigori, Piotr Luszczek, Michał Dereziński, Miles E. Lopes, Tianyu Liang, Hengrui Luo, Jack Dongarra

    Abstract: Randomized numerical linear algebra - RandNLA, for short - concerns the use of randomization as a resource to develop improved algorithms for large-scale linear algebra computations. The origins of contemporary RandNLA lay in theoretical computer science, where it blossomed from a simple idea: randomization provides an avenue for computing approximate solutions to linear algebra problems more ef… ▽ More

    Submitted 12 April, 2023; v1 submitted 22 February, 2023; originally announced February 2023.

    Comments: v1: this is the first arXiv release of LAPACK Working Note 299. v2: complete rewrite of the subsection on trace estimation, among other changes. See frontmatter page ii (pdf page 5) for revision history

  41. arXiv:2302.03223  [pdf, other

    cs.RO

    A Tightly Coupled Bi-Level Coordination Framework for CAVs at Road Intersections

    Authors: Donglin Li, Tingting Zhang, Jiping Luo, Tianhao Liang, Bin Cao, Xuanli Wu, Qinyu Zhang

    Abstract: Since the traffic administration at road intersections determines the capacity bottleneck of modern transportation systems, intelligent cooperative coordination for connected autonomous vehicles (CAVs) has shown to be an effective solution. In this paper, we try to formulate a Bi-Level CAV intersection coordination framework, where coordinators from High and Low levels are tightly coupled. In the… ▽ More

    Submitted 13 November, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

  42. arXiv:2301.09789  [pdf, other

    cs.SE

    A Qualitative Study on the Implementation Design Decisions of Developers

    Authors: Jenny T. Liang, Maryam Arab, Minhyuk Ko, Amy J. Ko, Thomas D. LaToza

    Abstract: Decision-making is a key software engineering skill. Developers constantly make choices throughout the software development process, from requirements to implementation. While prior work has studied developer decision-making, the choices made while choosing what solution to write in code remain understudied. In this mixed-methods study, we examine the phenomenon where developers select one specifi… ▽ More

    Submitted 23 January, 2023; originally announced January 2023.

  43. FADO: Floorplan-Aware Directive Optimization for High-Level Synthesis Designs on Multi-Die FPGAs

    Authors: Linfeng Du, Tingyuan Liang, Sharad Sinha, Zhiyao Xie, Wei Zhang

    Abstract: Multi-die FPGAs are widely adopted to deploy large hardware accelerators. Two factors impede the performance optimization of HLS designs implemented on multi-die FPGAs. On the one hand, the long net delay due to nets crossing die-boundaries results in an NP-hard problem to properly floorplan and pipeline an application. On the other hand, traditional automated searching flow for HLS directive opti… ▽ More

    Submitted 5 February, 2023; v1 submitted 22 December, 2022; originally announced December 2022.

    Comments: Accepted as a conference paper at FPGA '23. Open source at: https://github.com/RipperJ/FADO

  44. arXiv:2212.02457  [pdf, other

    stat.ML cs.LG math.OC math.ST

    Blessings and Curses of Covariate Shifts: Adversarial Learning Dynamics, Directional Convergence, and Equilibria

    Authors: Tengyuan Liang

    Abstract: Covariate distribution shifts and adversarial perturbations present robustness challenges to the conventional statistical learning framework: mild shifts in the test covariate distribution can significantly affect the performance of the statistical model learned based on the training distribution. The model performance typically deteriorates when extrapolation happens: namely, covariates shift to… ▽ More

    Submitted 19 May, 2024; v1 submitted 5 December, 2022; originally announced December 2022.

    Comments: 27 pages, 2 figures

    Journal ref: Journal of Machine Learning Research 25 (2024) 1-27

  45. arXiv:2211.10896  [pdf, other

    cs.LG cs.AI

    Spectral Adversarial Training for Robust Graph Neural Network

    Authors: Jintang Li, Jiaying Peng, Liang Chen, Zibin Zheng, Tingting Liang, Qing Ling

    Abstract: Recent studies demonstrate that Graph Neural Networks (GNNs) are vulnerable to slight but adversarially designed perturbations, known as adversarial examples. To address this issue, robust training methods against adversarial examples have received considerable attention in the literature. \emph{Adversarial Training (AT)} is a successful approach to learning a robust model using adversarially pert… ▽ More

    Submitted 20 November, 2022; originally announced November 2022.

    Comments: Accepted by TKDE. Code availiable at https://github.com/EdisonLeeeee/SAT

  46. arXiv:2211.04194  [pdf, other

    cs.IR cs.AI

    Submission-Aware Reviewer Profiling for Reviewer Recommender System

    Authors: Omer Anjum, Alok Kamatar, Toby Liang, Jinjun Xiong, Wen-mei Hwu

    Abstract: Assigning qualified, unbiased and interested reviewers to paper submissions is vital for maintaining the integrity and quality of the academic publishing system and providing valuable reviews to authors. However, matching thousands of submissions with thousands of potential reviewers within a limited time is a daunting challenge for a conference program committee. Prior efforts based on topic mode… ▽ More

    Submitted 8 November, 2022; originally announced November 2022.

  47. arXiv:2211.01091  [pdf, ps, other

    eess.AS cs.AI cs.SD

    I4U System Description for NIST SRE'20 CTS Challenge

    Authors: Kong Aik Lee, Tomi Kinnunen, Daniele Colibro, Claudio Vair, Andreas Nautsch, Hanwu Sun, Liang He, Tianyu Liang, Qiongqiong Wang, Mickael Rouvier, Pierre-Michel Bousquet, Rohan Kumar Das, Ignacio Viñals Bailo, Meng Liu, Héctor Deldago, Xuechen Liu, Md Sahidullah, Sandro Cumani, Boning Zhang, Koji Okabe, Hitoshi Yamamoto, Ruijie Tao, Haizhou Li, Alfonso Ortega Giménez, Longbiao Wang , et al. (1 additional authors not shown)

    Abstract: This manuscript describes the I4U submission to the 2020 NIST Speaker Recognition Evaluation (SRE'20) Conversational Telephone Speech (CTS) Challenge. The I4U's submission was resulted from active collaboration among researchers across eight research teams - I$^2$R (Singapore), UEF (Finland), VALPT (Italy, Spain), NEC (Japan), THUEE (China), LIA (France), NUS (Singapore), INRIA (France) and TJU (C… ▽ More

    Submitted 2 November, 2022; originally announced November 2022.

    Comments: SRE 2021, NIST Speaker Recognition Evaluation Workshop, CTS Speaker Recognition Challenge, 14-12 December 2021

  48. arXiv:2210.08682  [pdf, other

    cs.AR

    AMF-Placer 2.0: Open Source Timing-driven Analytical Mixed-size Placer for Large-scale Heterogeneous FPGA

    Authors: Tingyuan Liang, Gengjie Chen, Jieru Zhao, Sharad Sinha, Wei Zhang

    Abstract: On modern field-programmable gate arrays (FPGAs), certain critical path portions of the designs might be prearranged into many multi-cell macros during synthesis. These movable macros with constraints of shape and resources lead to challenging mixed-size placement for FPGA designs which cannot be addressed by previous analytical placers. Moreover, general timing-driven placement algorithms are fac… ▽ More

    Submitted 3 May, 2023; v1 submitted 16 October, 2022; originally announced October 2022.

  49. arXiv:2210.06111  [pdf, ps, other

    cs.SD cs.AI eess.AS eess.SP

    THUEE system description for NIST 2020 SRE CTS challenge

    Authors: Yu Zheng, Jinghan Peng, Miao Zhao, Yufeng Ma, Min Liu, Xinyue Ma, Tianyu Liang, Tianlong Kong, Liang He, Minqiang Xu

    Abstract: This paper presents the system description of the THUEE team for the NIST 2020 Speaker Recognition Evaluation (SRE) conversational telephone speech (CTS) challenge. The subsystems including ResNet74, ResNet152, and RepVGG-B2 are developed as speaker embedding extractors in this evaluation. We used combined AM-Softmax and AAM-Softmax based loss functions, namely CM-Softmax. We adopted a two-staged… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: 3 pages, 1 table; System desciption of NIST 2020 SRE CTS challenge

  50. arXiv:2209.02222  [pdf, other

    cs.SE

    Understanding Skills for OSS Communities on GitHub

    Authors: Jenny T. Liang, Thomas Zimmermann, Denae Ford

    Abstract: The development of open source software (OSS) is a broad field which requires diverse skill sets. For example, maintainers help lead the project and promote its longevity, technical writers assist with documentation, bug reporters identify defects in software, and developers program the software. However, it is unknown which skills are used in OSS development as well as OSS contributors' general a… ▽ More

    Submitted 6 September, 2022; originally announced September 2022.