Skip to main content

Showing 1–5 of 5 results for author: Linghu, X

  1. arXiv:2402.02447  [pdf, other

    cs.LG cs.CL

    Breaking MLPerf Training: A Case Study on Optimizing BERT

    Authors: Yongdeok Kim, Jaehyung Ahn, Myeongwoo Kim, Changin Choi, Heejae Kim, Narankhuu Tuvshinjargal, Seungwon Lee, Yanzi Zhang, Yuan Pei, Xiongzhan Linghu, Jingkun Ma, Lin Chen, Yuehua Dai, Sungjoo Yoo

    Abstract: Speeding up the large-scale distributed training is challenging in that it requires improving various components of training including load balancing, communication, optimizers, etc. We present novel approaches for fast large-scale training of BERT model which individually ameliorates each component thereby leading to a new level of BERT training performance. Load balancing is imperative in distri… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: Total 15 pages (Appendix 3 pages)

  2. arXiv:2311.12871  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    An Embodied Generalist Agent in 3D World

    Authors: Jiangyong Huang, Silong Yong, Xiaojian Ma, Xiongkun Linghu, Puhao Li, Yan Wang, Qing Li, Song-Chun Zhu, Baoxiong Jia, Siyuan Huang

    Abstract: Leveraging massive knowledge from large language models (LLMs), recent machine learning models show notable successes in general-purpose task solving in diverse domains such as computer vision and robotics. However, several significant challenges remain: (i) most of these models rely on 2D images yet exhibit a limited capacity for 3D input; (ii) these models rarely explore the tasks inherently def… ▽ More

    Submitted 9 May, 2024; v1 submitted 17 November, 2023; originally announced November 2023.

    Comments: ICML 2024. The first four authors contribute equally. Project page: https://embodied-generalist.github.io

  3. arXiv:2207.13137  [pdf, other

    cs.CV

    Bayesian Evidential Learning for Few-Shot Classification

    Authors: Xiongkun Linghu, Yan Bai, Yihang Lou, Shengsen Wu, Jinze Li, Jianzhong He, Tao Bai

    Abstract: Few-Shot Classification(FSC) aims to generalize from base classes to novel classes given very limited labeled samples, which is an important step on the path toward human-like machine learning. State-of-the-art solutions involve learning to find a good metric and representation space to compute the distance between samples. Despite the promising accuracy performance, how to model uncertainty for m… ▽ More

    Submitted 18 July, 2022; originally announced July 2022.

    Comments: arXiv admin note: text overlap with arXiv:2107.10161 by other authors

  4. arXiv:2207.01036  [pdf, other

    cs.CV

    Memory-Based Label-Text Tuning for Few-Shot Class-Incremental Learning

    Authors: Jinze Li, Yan Bai, Yihang Lou, Xiongkun Linghu, Jianzhong He, Shaoyun Xu, Tao Bai

    Abstract: Few-shot class-incremental learning(FSCIL) focuses on designing learning algorithms that can continually learn a sequence of new tasks from a few samples without forgetting old ones. The difficulties are that training on a sequence of limited data from new tasks leads to severe overfitting issues and causes the well-known catastrophic forgetting problem. Existing researches mainly utilize the imag… ▽ More

    Submitted 3 July, 2022; originally announced July 2022.

  5. arXiv:2206.08289  [pdf, other

    cs.AI cs.LG

    Switchable Representation Learning Framework with Self-compatibility

    Authors: Shengsen Wu, Yan Bai, Yihang Lou, Xiongkun Linghu, Jianzhong He, Ling-Yu Duan

    Abstract: Real-world visual search systems involve deployments on multiple platforms with different computing and storage resources. Deploying a unified model that suits the minimal-constrain platforms leads to limited accuracy. It is expected to deploy models with different capacities adapting to the resource constraints, which requires features extracted by these models to be aligned in the metric space.… ▽ More

    Submitted 23 March, 2023; v1 submitted 16 June, 2022; originally announced June 2022.

    Comments: Accepted by CVPR 2023