Skip to main content

Showing 1–50 of 106 results for author: Lai, H

  1. arXiv:2406.12793  [pdf, other

    cs.CL

    ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools

    Authors: Team GLM, :, Aohan Zeng, Bin Xu, Bowen Wang, Chenhui Zhang, Da Yin, Diego Rojas, Guanyu Feng, Hanlin Zhao, Hanyu Lai, Hao Yu, Hongning Wang, Jiadai Sun, Jiajie Zhang, Jiale Cheng, Jiayi Gui, Jie Tang, Jing Zhang, Juanzi Li, Lei Zhao, Lindong Wu, Lucen Zhong, Mingdao Liu, Minlie Huang , et al. (32 additional authors not shown)

    Abstract: We introduce ChatGLM, an evolving family of large language models that we have been developing over time. This report primarily focuses on the GLM-4 language series, which includes GLM-4, GLM-4-Air, and GLM-4-9B. They represent our most capable models that are trained with all the insights and lessons gained from the preceding three generations of ChatGLM. To date, the GLM-4 models are pre-trained… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  2. arXiv:2406.07288  [pdf, other

    cs.CL

    Fine-tuning with HED-IT: The impact of human post-editing for dialogical language models

    Authors: Daniela Occhipinti, Michele Marchi, Irene Mondella, Huiyuan Lai, Felice Dell'Orletta, Malvina Nissim, Marco Guerini

    Abstract: Automatic methods for generating and gathering linguistic data have proven effective for fine-tuning Language Models (LMs) in languages less resourced than English. Still, while there has been emphasis on data quantity, less attention has been given to its quality. In this work, we investigate the impact of human intervention on machine-generated data when fine-tuning dialogical models. In particu… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  3. arXiv:2406.02301  [pdf, other

    cs.CL

    mCoT: Multilingual Instruction Tuning for Reasoning Consistency in Language Models

    Authors: Huiyuan Lai, Malvina Nissim

    Abstract: Large language models (LLMs) with Chain-of-thought (CoT) have recently emerged as a powerful technique for eliciting reasoning to improve various downstream tasks. As most research mainly focuses on English, with few explorations in a multilingual context, the question of how reliable this reasoning capability is in different languages is still open. To address it directly, we study multilingual r… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024 main

  4. arXiv:2405.19516  [pdf, other

    eess.SP cs.CV cs.LG cs.RO

    Enabling Visual Recognition at Radio Frequency

    Authors: Haowen Lai, Gaoxiang Luo, Yifei Liu, Mingmin Zhao

    Abstract: This paper introduces PanoRadar, a novel RF imaging system that brings RF resolution close to that of LiDAR, while providing resilience against conditions challenging for optical signals. Our LiDAR-comparable 3D imaging results enable, for the first time, a variety of visual recognition tasks at radio frequency, including surface normal estimation, semantic segmentation, and object detection. Pano… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  5. arXiv:2405.05579  [pdf

    cs.HC eess.SY

    Intelligent EC Rearview Mirror: Enhancing Driver Safety with Dynamic Glare Mitigation via Cloud Edge Collaboration

    Authors: Junyi Yang, Zefei Xu, Huayi Lai, Hongjian Chen, Sifan Kong, Yutong Wu, Huan Yang

    Abstract: Sudden glare from trailing vehicles significantly increases driving safety risks. Existing anti-glare technologies such as electronic, manually-adjusted, and electrochromic rearview mirrors, are expensive and lack effective adaptability in different lighting conditions. To address these issues, our research introduces an intelligent rearview mirror system utilizing novel all-liquid electrochromic… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  6. arXiv:2404.18252  [pdf, other

    cs.CV

    Fisher Information Improved Training-Free Conditional Diffusion Model

    Authors: Kaiyu Song, Hanjiang Lai

    Abstract: Recently, the diffusion model with the training-free methods has succeeded in conditional image generation tasks. However, there is an efficiency problem because it requires calculating the gradient with high computational cost, and previous methods make strong assumptions to solve it, sacrificing generalization. In this work, we propose the Fisher information guided diffusion model (FIGD). Concre… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

  7. arXiv:2404.15009  [pdf, other

    cs.CV eess.IV

    The Brain Tumor Segmentation in Pediatrics (BraTS-PEDs) Challenge: Focus on Pediatrics (CBTN-CONNECT-DIPGR-ASNR-MICCAI BraTS-PEDs)

    Authors: Anahita Fathi Kazerooni, Nastaran Khalili, Deep Gandhi, Xinyang Liu, Zhifan Jiang, Syed Muhammed Anwar, Jake Albrecht, Maruf Adewole, Udunna Anazodo, Hannah Anderson, Sina Bagheri, Ujjwal Baid, Timothy Bergquist, Austin J. Borja, Evan Calabrese, Verena Chung, Gian-Marco Conte, Farouk Dako, James Eddy, Ivan Ezhov, Ariana Familiar, Keyvan Farahani, Anurag Gottipati, Debanjan Haldar, Shuvanjan Haldar , et al. (51 additional authors not shown)

    Abstract: Pediatric tumors of the central nervous system are the most common cause of cancer-related death in children. The five-year survival rate for high-grade gliomas in children is less than 20%. Due to their rarity, the diagnosis of these entities is often delayed, their treatment is mainly based on historic treatment concepts, and clinical trials require multi-institutional collaborations. Here we pr… ▽ More

    Submitted 29 April, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2305.17033

  8. arXiv:2404.14135  [pdf, other

    cs.CV

    Text in the Dark: Extremely Low-Light Text Image Enhancement

    Authors: Che-Tsung Lin, Chun Chet Ng, Zhi Qin Tan, Wan Jun Nah, Xinyu Wang, Jie Long Kew, Pohao Hsu, Shang Hong Lai, Chee Seng Chan, Christopher Zach

    Abstract: Extremely low-light text images are common in natural scenes, making scene text detection and recognition challenging. One solution is to enhance these images using low-light image enhancement methods before text extraction. However, previous methods often do not try to particularly address the significance of low-level features, which are crucial for optimal performance on downstream scene text t… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: The first two authors contributed equally to this work

  9. arXiv:2404.03648  [pdf, other

    cs.CL

    AutoWebGLM: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent

    Authors: Hanyu Lai, Xiao Liu, Iat Long Iong, Shuntian Yao, Yuxuan Chen, Pengbo Shen, Hao Yu, Hanchen Zhang, Xiaohan Zhang, Yuxiao Dong, Jie Tang

    Abstract: Large language models (LLMs) have fueled many intelligent agent tasks, such as web navigation -- but most existing agents perform far from satisfying in real-world webpages due to three factors: (1) the versatility of actions on webpages, (2) HTML text exceeding model processing capacity, and (3) the complexity of decision-making due to the open-domain nature of web. In light of the challenge, we… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  10. arXiv:2402.17417  [pdf, other

    cs.CV

    CARZero: Cross-Attention Alignment for Radiology Zero-Shot Classification

    Authors: Haoran Lai, Qingsong Yao, Zihang Jiang, Rongsheng Wang, Zhiyang He, Xiaodong Tao, S. Kevin Zhou

    Abstract: The advancement of Zero-Shot Learning in the medical domain has been driven forward by using pre-trained models on large-scale image-text pairs, focusing on image-text alignment. However, existing methods primarily rely on cosine similarity for alignment, which may not fully capture the complex relationship between medical images and reports. To address this gap, we introduce a novel approach call… ▽ More

    Submitted 24 March, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

  11. arXiv:2401.10334  [pdf, other

    q-bio.QM cs.AI cs.CL cs.LG

    DrugAssist: A Large Language Model for Molecule Optimization

    Authors: Geyan Ye, Xibao Cai, Houtim Lai, Xing Wang, Junhong Huang, Longyue Wang, Wei Liu, Xiangxiang Zeng

    Abstract: Recently, the impressive performance of large language models (LLMs) on a wide range of tasks has attracted an increasing number of attempts to apply LLMs in drug discovery. However, molecule optimization, a critical task in the drug discovery pipeline, is currently an area that has seen little involvement from LLMs. Most of existing approaches focus solely on capturing the underlying patterns in… ▽ More

    Submitted 28 December, 2023; originally announced January 2024.

    Comments: Geyan Ye and Xibao Cai are equal contributors; Longyue Wang is corresponding author

  12. arXiv:2312.17606  [pdf, other

    cs.RO cs.AI cs.LG

    Adaptive Control Strategy for Quadruped Robots in Actuator Degradation Scenarios

    Authors: Xinyuan Wu, Wentao Dong, Hang Lai, Yong Yu, Ying Wen

    Abstract: Quadruped robots have strong adaptability to extreme environments but may also experience faults. Once these faults occur, robots must be repaired before returning to the task, reducing their practical feasibility. One prevalent concern among these faults is actuator degradation, stemming from factors like device aging or unexpected operational events. Traditionally, addressing this problem has re… ▽ More

    Submitted 29 December, 2023; originally announced December 2023.

    Comments: 13 pages, 14 figures, in proceeding of DAI'23

  13. arXiv:2312.13316  [pdf, other

    cs.CV

    ECAMP: Entity-centered Context-aware Medical Vision Language Pre-training

    Authors: Rongsheng Wang, Qingsong Yao, Haoran Lai, Zhiyang He, Xiaodong Tao, Zihang Jiang, S. Kevin Zhou

    Abstract: Despite significant advancements in medical vision-language pre-training, existing methods have largely overlooked the inherent entity-specific context within radiology reports and the complex cross-modality contextual relationships between text and images. To close this gap, we propose a novel Entity-centered Context-aware Medical Vision-language Pre-training (ECAMP) framework, which is designed… ▽ More

    Submitted 19 March, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

  14. arXiv:2312.05274  [pdf, other

    cs.LG cs.CV

    Target to Source: Guidance-Based Diffusion Model for Test-Time Adaptation

    Authors: Kaiyu Song, Hanjiang Lai

    Abstract: Most recent works of test-time adaptation (TTA) aim to alleviate domain shift problems by re-training source classifiers in each domain. On the other hand, the emergence of the diffusion model provides another solution to TTA, which directly maps the test data from the target domain to the source domain based on a diffusion model pre-trained in the source domain. The source classifier does not nee… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  15. arXiv:2312.04802  [pdf, other

    cs.CV

    MimicDiffusion: Purifying Adversarial Perturbation via Mimicking Clean Diffusion Model

    Authors: Kaiyu Song, Hanjiang Lai

    Abstract: Deep neural networks (DNNs) are vulnerable to adversarial perturbation, where an imperceptible perturbation is added to the image that can fool the DNNs. Diffusion-based adversarial purification focuses on using the diffusion model to generate a clean image against such adversarial attacks. Unfortunately, the generative process of the diffusion model is also inevitably affected by adversarial pert… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  16. arXiv:2312.00951  [pdf, other

    cs.RO eess.SY

    AV4EV: Open-Source Modular Autonomous Electric Vehicle Platform for Making Mobility Research Accessible

    Authors: Zhijie Qiao, Mingyan Zhou, Zhijun Zhuang, Tejas Agarwal, Felix Jahncke, Po-Jen Wang, Jason Friedman, Hongyi Lai, Divyanshu Sahu, Tomáš Nagy, Martin Endler, Jason Schlessman, Rahul Mangharam

    Abstract: When academic researchers develop and validate autonomous driving algorithms, there is a challenge in balancing high-performance capabilities with the cost and complexity of the vehicle platform. Much of today's research on autonomous vehicles (AV) is limited to experimentation on expensive commercial vehicles that require large skilled teams to retrofit the vehicles and test them in dedicated fac… ▽ More

    Submitted 12 April, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: 6 pages, 5 figures

  17. arXiv:2311.17334  [pdf, other

    cs.CV

    Long-tailed multi-label classification with noisy label of thoracic diseases from chest X-ray

    Authors: Haoran Lai, Qingsong Yao, Zhiyang He, Xiaodong Tao, S Kevin Zhou

    Abstract: Chest X-rays (CXR) often reveal rare diseases, demanding precise diagnosis. However, current computer-aided diagnosis (CAD) methods focus on common diseases, leading to inadequate detection of rare conditions due to the absence of comprehensive datasets. To overcome this, we present a novel benchmark for long-tailed multi-label classification in CXRs, encapsulating both common and rare thoracic di… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

  18. arXiv:2311.07622  [pdf, other

    cs.CV

    Pretrain like Your Inference: Masked Tuning Improves Zero-Shot Composed Image Retrieval

    Authors: Junyang Chen, Hanjiang Lai

    Abstract: Zero-shot composed image retrieval (ZS-CIR), which aims to retrieve a target image based on textual modifications to a reference image without triplet labeling, has gained more and more attention. Current ZS-CIR research mainly relies on two unlabeled pre-trained models: the vision-language model, e.g., CLIP, and the Pic2Word/textual inversion model. However, the pre-trained models and CIR tasks h… ▽ More

    Submitted 14 November, 2023; v1 submitted 12 November, 2023; originally announced November 2023.

  19. arXiv:2310.20327  [pdf, other

    cs.AI

    Improving Entropy-Based Test-Time Adaptation from a Clustering View

    Authors: Guoliang Lin, Hanjiang Lai, Yan Pan, Jian Yin

    Abstract: Domain shift is a common problem in the realistic world, where training data and test data follow different data distributions. To deal with this problem, fully test-time adaptation (TTA) leverages the unlabeled data encountered during test time to adapt the model. In particular, entropy-based TTA (EBTTA) methods, which minimize the prediction's entropy on test samples, have shown great success. I… ▽ More

    Submitted 25 April, 2024; v1 submitted 31 October, 2023; originally announced October 2023.

  20. arXiv:2310.19019  [pdf, other

    cs.CL cs.AI

    TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language Modeling Likewise

    Authors: Nan He, Hanyu Lai, Chenyang Zhao, Zirui Cheng, Junting Pan, Ruoyu Qin, Ruofan Lu, Rui Lu, Yunchen Zhang, Gangming Zhao, Zhaohui Hou, Zhiyuan Huang, Shaoqing Lu, Ding Liang, Mingjie Zhan

    Abstract: Large Language Models (LLMs) exhibit impressive reasoning and data augmentation capabilities in various NLP tasks. However, what about small models? In this work, we propose TeacherLM-7.1B, capable of annotating relevant fundamentals, chain of thought, and common mistakes for most NLP samples, which makes annotation more than just an answer, thus allowing other models to learn "why" instead of jus… ▽ More

    Submitted 31 October, 2023; v1 submitted 29 October, 2023; originally announced October 2023.

    Comments: 5 figures, 15 pages

  21. arXiv:2309.16509  [pdf, other

    cs.DC cs.PL

    SIMD Everywhere Optimization from ARM NEON to RISC-V Vector Extensions

    Authors: Ju-Hung Li, Jhih-Kuan Lin, Yung-Cheng Su, Chi-Wei Chu, Lai-Tak Kuok, Hung-Ming Lai, Chao-Lin Lee, Jenq-Kuen Lee

    Abstract: Many libraries, such as OpenCV, FFmpeg, XNNPACK, and Eigen, utilize Arm or x86 SIMD Intrinsics to optimize programs for performance. With the emergence of RISC-V Vector Extensions (RVV), there is a need to migrate these performance legacy codes for RVV. Currently, the migration of NEON code to RVV code requires manual rewriting, which is a time-consuming and error-prone process. In this work, we u… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

  22. arXiv:2308.08131  [pdf, other

    cs.CV cs.AI

    Ranking-aware Uncertainty for Text-guided Image Retrieval

    Authors: Junyang Chen, Hanjiang Lai

    Abstract: Text-guided image retrieval is to incorporate conditional text to better capture users' intent. Traditionally, the existing methods focus on minimizing the embedding distances between the source inputs and the targeted image, using the provided triplets $\langle$source image, source text, target image$\rangle$. However, such triplet optimization may limit the learned retrieval model to capture mor… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

  23. arXiv:2308.03688  [pdf, other

    cs.AI cs.CL cs.LG

    AgentBench: Evaluating LLMs as Agents

    Authors: Xiao Liu, Hao Yu, Hanchen Zhang, Yifan Xu, Xuanyu Lei, Hanyu Lai, Yu Gu, Hangliang Ding, Kaiwen Men, Kejuan Yang, Shudan Zhang, Xiang Deng, Aohan Zeng, Zhengxiao Du, Chenhui Zhang, Sheng Shen, Tianjun Zhang, Yu Su, Huan Sun, Minlie Huang, Yuxiao Dong, Jie Tang

    Abstract: Large Language Models (LLMs) are becoming increasingly smart and autonomous, targeting real-world pragmatic missions beyond traditional NLP tasks. As a result, there has been an urgent need to evaluate LLMs as agents on challenging tasks in interactive environments. We present AgentBench, a multi-dimensional evolving benchmark that currently consists of 8 distinct environments to assess LLM-as-Age… ▽ More

    Submitted 25 October, 2023; v1 submitted 7 August, 2023; originally announced August 2023.

    Comments: 55 pages

  24. arXiv:2306.07906  [pdf, other

    cs.CL cs.AI

    WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences

    Authors: Xiao Liu, Hanyu Lai, Hao Yu, Yifan Xu, Aohan Zeng, Zhengxiao Du, Peng Zhang, Yuxiao Dong, Jie Tang

    Abstract: We present WebGLM, a web-enhanced question-answering system based on the General Language Model (GLM). Its goal is to augment a pre-trained large language model (LLM) with web search and retrieval capabilities while being efficient for real-world deployments. To achieve this, we develop WebGLM with strategies for the LLM-augmented retriever, bootstrapped generator, and human preference-aware score… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

    Comments: Accepted to KDD 2023

  25. arXiv:2306.00437  [pdf, other

    cs.CL

    Responsibility Perspective Transfer for Italian Femicide News

    Authors: Gosse Minnema, Huiyuan Lai, Benedetta Muscato, Malvina Nissim

    Abstract: Different ways of linguistically expressing the same real-world event can lead to different perceptions of what happened. Previous work has shown that different descriptions of gender-based violence (GBV) influence the reader's perception of who is to blame for the violence, possibly reinforcing stereotypes which see the victim as partly responsible, too. As a contribution to raise awareness on pe… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: Accepted for publication in Findings of ACL 2023

  26. arXiv:2306.00124  [pdf, other

    cs.CL

    Pre-Trained Language-Meaning Models for Multilingual Parsing and Generation

    Authors: Chunliu Wang, Huiyuan Lai, Malvina Nissim, Johan Bos

    Abstract: Pre-trained language models (PLMs) have achieved great success in NLP and have recently been used for tasks in computational semantics. However, these tasks do not fully benefit from PLMs since meaning representations are not explicitly included in the pre-training stage. We introduce multilingual pre-trained language-meaning models based on Discourse Representation Structures (DRSs), including me… ▽ More

    Submitted 31 May, 2023; originally announced June 2023.

    Comments: Accepted by ACL2023 findings

  27. arXiv:2306.00121  [pdf, other

    cs.CL

    Multilingual Multi-Figurative Language Detection

    Authors: Huiyuan Lai, Antonio Toral, Malvina Nissim

    Abstract: Figures of speech help people express abstract concepts and evoke stronger emotions than literal expressions, thereby making texts more creative and engaging. Due to its pervasive and fundamental character, figurative language understanding has been addressed in Natural Language Processing, but it's highly understudied in a multilingual setting and when considering more than one figure of speech a… ▽ More

    Submitted 31 May, 2023; originally announced June 2023.

    Comments: Accepted to ACL 2023 (Findings)

  28. arXiv:2305.18464  [pdf, other

    cs.LG cs.RO

    Privileged Knowledge Distillation for Sim-to-Real Policy Generalization

    Authors: Haoran He, Chenjia Bai, Hang Lai, Lingxiao Wang, Weinan Zhang

    Abstract: Reinforcement Learning (RL) has recently achieved remarkable success in robotic control. However, most RL methods operate in simulated environments where privileged knowledge (e.g., dynamics, surroundings, terrains) is readily available. Conversely, in real-world scenarios, robot agents usually rely solely on local states (e.g., proprioceptive feedback of robot joints) to select actions, leading t… ▽ More

    Submitted 29 May, 2023; originally announced May 2023.

    Comments: 22 pages

  29. arXiv:2305.17033  [pdf, other

    eess.IV cs.CV cs.LG q-bio.QM

    The Brain Tumor Segmentation (BraTS) Challenge 2023: Focus on Pediatrics (CBTN-CONNECT-DIPGR-ASNR-MICCAI BraTS-PEDs)

    Authors: Anahita Fathi Kazerooni, Nastaran Khalili, Xinyang Liu, Debanjan Haldar, Zhifan Jiang, Syed Muhammed Anwar, Jake Albrecht, Maruf Adewole, Udunna Anazodo, Hannah Anderson, Sina Bagheri, Ujjwal Baid, Timothy Bergquist, Austin J. Borja, Evan Calabrese, Verena Chung, Gian-Marco Conte, Farouk Dako, James Eddy, Ivan Ezhov, Ariana Familiar, Keyvan Farahani, Shuvanjan Haldar, Juan Eugenio Iglesias, Anastasia Janas , et al. (48 additional authors not shown)

    Abstract: Pediatric tumors of the central nervous system are the most common cause of cancer-related death in children. The five-year survival rate for high-grade gliomas in children is less than 20\%. Due to their rarity, the diagnosis of these entities is often delayed, their treatment is mainly based on historic treatment concepts, and clinical trials require multi-institutional collaborations. The MICCA… ▽ More

    Submitted 23 May, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

  30. arXiv:2305.04145  [pdf

    cs.GT

    A Novel Reward Shaping Function for Single-Player Mahjong

    Authors: Kai Jun Chen, Lok Him Lai, Zi Iun Lai

    Abstract: Mahjong is a complex game with an intractably large state space with extremely sparse rewards, which poses challenges to develop an agent to play Mahjong. To overcome this, the ShangTing function was adopted as a reward shaping function. This was combined with a forward-search algorithm to create an agent capable of completing a winning hand in Single-player Mahjong (an average of 35 actions over… ▽ More

    Submitted 6 May, 2023; originally announced May 2023.

  31. arXiv:2305.03356  [pdf, other

    cs.CL cs.AI cs.LG

    From Parse-Execute to Parse-Execute-Refine: Improving Semantic Parser for Complex Question Answering over Knowledge Base

    Authors: Wangzhen Guo, Linyin Luo, Hanjiang Lai, Jian Yin

    Abstract: Parsing questions into executable logical forms has showed impressive results for knowledge-base question answering (KBQA). However, complex KBQA is a more challenging task that requires to perform complex multi-step reasoning. Recently, a new semantic parser called KoPL has been proposed to explicitly model the reasoning processes, which achieved the state-of-the-art on complex KBQA. In this pape… ▽ More

    Submitted 5 May, 2023; originally announced May 2023.

  32. arXiv:2305.01633  [pdf, other

    cs.CL

    Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP

    Authors: Anya Belz, Craig Thomson, Ehud Reiter, Gavin Abercrombie, Jose M. Alonso-Moral, Mohammad Arvan, Anouck Braggaar, Mark Cieliebak, Elizabeth Clark, Kees van Deemter, Tanvi Dinkar, Ondřej Dušek, Steffen Eger, Qixiang Fang, Mingqi Gao, Albert Gatt, Dimitra Gkatzia, Javier González-Corbelle, Dirk Hovy, Manuela Hürlimann, Takumi Ito, John D. Kelleher, Filip Klubicka, Emiel Krahmer, Huiyuan Lai , et al. (17 additional authors not shown)

    Abstract: We report our efforts in identifying a set of previous human evaluations in NLP that would be suitable for a coordinated study examining what makes human evaluations in NLP more/less reproducible. We present our results and findings, which include that just 13\% of papers had (i) sufficiently low barriers to reproduction, and (ii) enough obtainable information, to be considered for reproduction, a… ▽ More

    Submitted 7 August, 2023; v1 submitted 2 May, 2023; originally announced May 2023.

    Comments: 5 pages plus appendix, 4 tables, 1 figure. To appear at "Workshop on Insights from Negative Results in NLP" (co-located with EACL2023). Updated author list and acknowledgements

    MSC Class: 68 ACM Class: I.2.7

  33. arXiv:2304.13462  [pdf, other

    cs.CL

    Multidimensional Evaluation for Text Style Transfer Using ChatGPT

    Authors: Huiyuan Lai, Antonio Toral, Malvina Nissim

    Abstract: We investigate the potential of ChatGPT as a multidimensional evaluator for the task of \emph{Text Style Transfer}, alongside, and in comparison to, existing automatic metrics as well as human judgements. We focus on a zero-shot setting, i.e. prompting ChatGPT with specific task instructions, and test its performance on three commonly-used dimensions of text style transfer evaluation: style streng… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

  34. arXiv:2212.09078  [pdf, other

    cs.RO

    Multi-embodiment Legged Robot Control as a Sequence Modeling Problem

    Authors: Chen Yu, Weinan Zhang, Hang Lai, Zheng Tian, Laurent Kneip, Jun Wang

    Abstract: Robots are traditionally bounded by a fixed embodiment during their operational lifetime, which limits their ability to adapt to their surroundings. Co-optimizing control and morphology of a robot, however, is often inefficient due to the complex interplay between the controller and morphology. In this paper, we propose a learning-based control method that can inherently take morphology into consi… ▽ More

    Submitted 18 December, 2022; originally announced December 2022.

  35. arXiv:2212.07740  [pdf, other

    cs.RO cs.LG

    Sim-to-Real Transfer for Quadrupedal Locomotion via Terrain Transformer

    Authors: Hang Lai, Weinan Zhang, Xialin He, Chen Yu, Zheng Tian, Yong Yu, Jun Wang

    Abstract: Deep reinforcement learning has recently emerged as an appealing alternative for legged locomotion over multiple terrains by training a policy in physical simulation and then transferring it to the real world (i.e., sim-to-real transfer). Despite considerable progress, the capacity and scalability of traditional neural networks are still limited, which may hinder their applications in more complex… ▽ More

    Submitted 21 March, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: Accepted by ICRA2023

  36. arXiv:2210.07138  [pdf, other

    cs.AI cs.CL

    Counterfactual Multihop QA: A Cause-Effect Approach for Reducing Disconnected Reasoning

    Authors: Wangzhen Guo, Qinkang Gong, Hanjiang Lai

    Abstract: Multi-hop QA requires reasoning over multiple supporting facts to answer the question. However, the existing QA models always rely on shortcuts, e.g., providing the true answer by only one fact, rather than multi-hop reasoning, which is referred as $\textit{disconnected reasoning}$ problem. To alleviate this issue, we propose a novel counterfactual multihop QA, a causal-effect approach that enable… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: 10 pages, 2 figures

  37. arXiv:2210.02414  [pdf, other

    cs.CL cs.AI cs.LG

    GLM-130B: An Open Bilingual Pre-trained Model

    Authors: Aohan Zeng, Xiao Liu, Zhengxiao Du, Zihan Wang, Hanyu Lai, Ming Ding, Zhuoyi Yang, Yifan Xu, Wendi Zheng, Xiao Xia, Weng Lam Tam, Zixuan Ma, Yufei Xue, Jidong Zhai, Wenguang Chen, Peng Zhang, Yuxiao Dong, Jie Tang

    Abstract: We introduce GLM-130B, a bilingual (English and Chinese) pre-trained language model with 130 billion parameters. It is an attempt to open-source a 100B-scale model at least as good as GPT-3 (davinci) and unveil how models of such a scale can be successfully pre-trained. Over the course of this effort, we face numerous unexpected technical and engineering challenges, particularly on loss spikes and… ▽ More

    Submitted 25 October, 2023; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: Accepted to ICLR 2023

  38. arXiv:2209.13816  [pdf, other

    cs.LG cs.AI

    Revisiting Few-Shot Learning from a Causal Perspective

    Authors: Guoliang Lin, Yongheng Xu, Hanjiang Lai, Jian Yin

    Abstract: Few-shot learning with $N$-way $K$-shot scheme is an open challenge in machine learning. Many metric-based approaches have been proposed to tackle this problem, e.g., the Matching Networks and CLIP-Adapter. Despite that these approaches have shown significant progress, the mechanism of why these methods succeed has not been well explored. In this paper, we try to interpret these metric-based few-s… ▽ More

    Submitted 6 May, 2024; v1 submitted 27 September, 2022; originally announced September 2022.

  39. arXiv:2209.10984  [pdf, other

    eess.IV cs.CV

    DLUNet: Semi-supervised Learning based Dual-Light UNet for Multi-organ Segmentation

    Authors: Haoran Lai, Tao Wang, Shuoling Zhou

    Abstract: The manual ground truth of abdominal multi-organ is labor-intensive. In order to make full use of CT data, we developed a semi-supervised learning based dual-light UNet. In the training phase, it consists of two light UNets, which make full use of label and unlabeled data simultaneously by using consistent-based learning. Moreover, separable convolution and residual concatenation was introduced li… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.

    Comments: 13 page, 3 figures

  40. Lightweight Spatial-Channel Adaptive Coordination of Multilevel Refinement Enhancement Network for Image Reconstruction

    Authors: Yuxi Cai, Huicheng Lai, Zhenghong Jia

    Abstract: Benefiting from the vigorous development of deep learning, many CNN-based image super-resolution methods have emerged and achieved better results than traditional algorithms. However, it is difficult for most algorithms to adaptively adjust the spatial region and channel features at the same time, let alone the information exchange between them. In addition, the exchange of information between att… ▽ More

    Submitted 17 September, 2022; originally announced September 2022.

  41. arXiv:2209.01835  [pdf, other

    cs.CL

    Multi-Figurative Language Generation

    Authors: Huiyuan Lai, Malvina Nissim

    Abstract: Figurative language generation is the task of reformulating a given text in the desired figure of speech while still being faithful to the original context. We take the first step towards multi-figurative language modelling by providing a benchmark for the automatic generation of five common figurative forms in English. We train mFLAG employing a scheme for multi-figurative language pre-training o… ▽ More

    Submitted 5 September, 2022; originally announced September 2022.

    Comments: Accepted to COLING 2022

  42. arXiv:2207.06965  [pdf, other

    cs.RO cs.CV

    AutoMerge: A Framework for Map Assembling and Smoothing in City-scale Environments

    Authors: Peng Yin, Haowen Lai, Shiqi Zhao, Ruohai Ge, Ji Zhang, Howie Choset, Sebastian Scherer

    Abstract: We present AutoMerge, a LiDAR data processing framework for assembling a large number of map segments into a complete map. Traditional large-scale map merging methods are fragile to incorrect data associations, and are primarily limited to working only offline. AutoMerge utilizes multi-perspective fusion and adaptive loop closure detection for accurate data associations, and it uses incremental me… ▽ More

    Submitted 26 June, 2023; v1 submitted 14 July, 2022; originally announced July 2022.

    Comments: 19 pages, 20 figures, IEEE Transactions on Robotics (T-RO) 2023

  43. arXiv:2206.09328  [pdf, other

    cs.LG

    A Survey on Model-based Reinforcement Learning

    Authors: Fan-Ming Luo, Tian Xu, Hang Lai, Xiong-Hui Chen, Weinan Zhang, Yang Yu

    Abstract: Reinforcement learning (RL) solves sequential decision-making problems via a trial-and-error process interacting with the environment. While RL achieves outstanding success in playing complex video games that allow huge trial-and-error, making errors is always undesired in the real world. To improve the sample efficiency and thus reduce the errors, model-based reinforcement learning (MBRL) is beli… ▽ More

    Submitted 19 June, 2022; originally announced June 2022.

  44. arXiv:2205.13738  [pdf

    cs.CV eess.IV

    Image Reconstruction of Multi Branch Feature Multiplexing Fusion Network with Mixed Multi-layer Attention

    Authors: Yuxi Cai, Huicheng Lai

    Abstract: Image super-resolution reconstruction achieves better results than traditional methods with the help of the powerful nonlinear representation ability of convolution neural network. However, some existing algorithms also have some problems, such as insufficient utilization of phased features, ignoring the importance of early phased feature fusion to improve network performance, and the inability of… ▽ More

    Submitted 26 May, 2022; originally announced May 2022.

  45. arXiv:2204.07705  [pdf, other

    cs.CL cs.AI

    Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks

    Authors: Yizhong Wang, Swaroop Mishra, Pegah Alipoormolabashi, Yeganeh Kordi, Amirreza Mirzaei, Anjana Arunkumar, Arjun Ashok, Arut Selvan Dhanasekaran, Atharva Naik, David Stap, Eshaan Pathak, Giannis Karamanolakis, Haizhi Gary Lai, Ishan Purohit, Ishani Mondal, Jacob Anderson, Kirby Kuznia, Krima Doshi, Maitreya Patel, Kuntal Kumar Pal, Mehrad Moradshahi, Mihir Parmar, Mirali Purohit, Neeraj Varshney, Phani Rohitha Kaza , et al. (15 additional authors not shown)

    Abstract: How well can NLP models generalize to a variety of unseen tasks when provided with task instructions? To address this question, we first introduce Super-NaturalInstructions, a benchmark of 1,616 diverse NLP tasks and their expert-written instructions. Our collection covers 76 distinct task types, including but not limited to classification, extraction, infilling, sequence tagging, text rewriting,… ▽ More

    Submitted 24 October, 2022; v1 submitted 15 April, 2022; originally announced April 2022.

    Comments: Accepted to EMNLP 2022, 25 pages

  46. arXiv:2204.07549  [pdf, other

    cs.CL

    Human Judgement as a Compass to Navigate Automatic Metrics for Formality Transfer

    Authors: Huiyuan Lai, Jiali Mao, Antonio Toral, Malvina Nissim

    Abstract: Although text style transfer has witnessed rapid development in recent years, there is as yet no established standard for evaluation, which is performed using several automatic metrics, lacking the possibility of always resorting to human judgement. We focus on the task of formality transfer, and on the three aspects that are usually evaluated: style strength, content preservation, and fluency. To… ▽ More

    Submitted 15 April, 2022; originally announced April 2022.

    Comments: Accepted to HumEval 2022

  47. arXiv:2203.12151  [pdf, other

    eess.IV cs.CV cs.LG

    Semi-Supervised Hybrid Spine Network for Segmentation of Spine MR Images

    Authors: Meiyan Huang, Shuoling Zhou, Xiumei Chen, Haoran Lai, Qianjin Feng

    Abstract: Automatic segmentation of vertebral bodies (VBs) and intervertebral discs (IVDs) in 3D magnetic resonance (MR) images is vital in diagnosing and treating spinal diseases. However, segmenting the VBs and IVDs simultaneously is not trivial. Moreover, problems exist, including blurry segmentation caused by anisotropy resolution, high computational cost, inter-class similarity and intra-class variabil… ▽ More

    Submitted 22 March, 2022; originally announced March 2022.

  48. arXiv:2203.08552  [pdf, other

    cs.CL

    Multilingual Pre-training with Language and Task Adaptation for Multilingual Text Style Transfer

    Authors: Huiyuan Lai, Antonio Toral, Malvina Nissim

    Abstract: We exploit the pre-trained seq2seq model mBART for multilingual text style transfer. Using machine translated data as well as gold aligned English sentences yields state-of-the-art results in the three target languages we consider. Besides, in view of the general scarcity of parallel data, we propose a modular approach for multilingual formality transfer, which consists of two training strategies… ▽ More

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: Accepted to ACL 2022

  49. arXiv:2201.09637  [pdf, other

    cs.LG cs.AI q-bio.QM

    DrugOOD: Out-of-Distribution (OOD) Dataset Curator and Benchmark for AI-aided Drug Discovery -- A Focus on Affinity Prediction Problems with Noise Annotations

    Authors: Yuanfeng Ji, Lu Zhang, Jiaxiang Wu, Bingzhe Wu, Long-Kai Huang, Tingyang Xu, Yu Rong, Lanqing Li, Jie Ren, Ding Xue, Houtim Lai, Shaoyong Xu, Jing Feng, Wei Liu, Ping Luo, Shuigeng Zhou, Junzhou Huang, Peilin Zhao, Yatao Bian

    Abstract: AI-aided drug discovery (AIDD) is gaining increasing popularity due to its promise of making the search for new pharmaceuticals quicker, cheaper and more efficient. In spite of its extensive use in many fields, such as ADMET prediction, virtual screening, protein folding and generative chemistry, little has been explored in terms of the out-of-distribution (OOD) learning problem with \emph{noise},… ▽ More

    Submitted 24 January, 2022; originally announced January 2022.

    Comments: 54 pages, 11 figures

  50. arXiv:2201.05541  [pdf, other

    cs.CV

    ViT2Hash: Unsupervised Information-Preserving Hashing

    Authors: Qinkang Gong, Liangdao Wang, Hanjiang Lai, Yan Pan, Jian Yin

    Abstract: Unsupervised image hashing, which maps images into binary codes without supervision, is a compressor with a high compression rate. Hence, how to preserving meaningful information of the original data is a critical problem. Inspired by the large-scale vision pre-training model, known as ViT, which has shown significant progress for learning visual representations, in this paper, we propose a simple… ▽ More

    Submitted 14 January, 2022; originally announced January 2022.