Showing 1–50 of 64 results for author: Hao, T

  1. arXiv:2407.10704  [pdf, other]

    cs.CV

    Quantized Prompt for Efficient Generalization of Vision-Language Models

    Authors: Tianxiang Hao, Xiaohan Ding, Juexiao Feng, Yuhong Yang, Hui Chen, Guiguang Ding

    Abstract: In the past few years, large-scale pre-trained vision-language models like CLIP have achieved tremendous success in various fields. Naturally, how to transfer the rich knowledge in such huge pre-trained models to downstream tasks and datasets becomes a hot topic. During downstream adaptation, the most challenging problems are overfitting and catastrophic forgetting, which can cause the model to ov…

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 14 pages, 7 figures. Accepted by ECCV 2024

  2. arXiv:2405.10941  [pdf, other]

    quant-ph cs.AR cs.ET

    Variational Quantum Algorithm Landscape Reconstruction by Low-Rank Tensor Completion

    Authors: Tianyi Hao, Zichang He, Ruslan Shaydulin, Marco Pistoia, Swamit Tannu

    Abstract: Variational quantum algorithms (VQAs) are a broad class of algorithms with many applications in science and industry. Applying a VQA to a problem involves optimizing a parameterized quantum circuit by maximizing or minimizing a cost function. A particular challenge associated with VQAs is understanding the properties of associated cost functions. Having the landscapes of VQA cost functions can gre…

    Submitted 17 May, 2024; originally announced May 2024.

  3. arXiv:2404.17179  [pdf, other]

    cs.HC cs.ET

    Meta-Object: Interactive and Multisensory Virtual Object Learned from the Real World for the Post-Metaverse

    Authors: Dooyoung Kim, Taewook Ha, Jinseok Hong, Seonji Kim, Selin Choi, Heejeong Ko, Woontack Woo

    Abstract: With the proliferation of wearable Augmented Reality/Virtual Reality (AR/VR) devices, ubiquitous virtual experiences seamlessly integrate into daily life through metaverse platforms. To support immersive metaverse experiences akin to reality, we propose a next-generation virtual object, a meta-object, a property-embedded virtual object that contains interactive and multisensory characteristics lea…

    Submitted 28 April, 2024; v1 submitted 26 April, 2024; originally announced April 2024.

    Comments: 12 pages, 4 figures, under review in the IEEE CG&A magazine

  4. arXiv:2403.09192  [pdf, other]

    cs.CV

    PYRA: Parallel Yielding Re-Activation for Training-Inference Efficient Task Adaptation

    Authors: Yizhe Xiong, Hui Chen, Tianxiang Hao, Zijia Lin, Jungong Han, Yuesong Zhang, Guoxin Wang, Yongjun Bao, Guiguang Ding

    Abstract: Recently, the scale of transformers has grown rapidly, which introduces considerable challenges in terms of training overhead and inference efficiency in the scope of task adaptation. Existing works, namely Parameter-Efficient Fine-Tuning (PEFT) and model compression, have separately investigated the challenges. However, PEFT cannot guarantee the inference efficiency of the original backbone, espe…

    Submitted 16 July, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: 14 pages, 4 figures, Accepted by ECCV 2024

  5. arXiv:2403.03310  [pdf, other]

    quant-ph cs.LG

    Graph Learning for Parameter Prediction of Quantum Approximate Optimization Algorithm

    Authors: Zhiding Liang, Gang Liu, Zheyuan Liu, Jinglei Cheng, Tianyi Hao, Kecheng Liu, Hang Ren, Zhixin Song, Ji Liu, Fanny Ye, Yiyu Shi

    Abstract: In recent years, quantum computing has emerged as a transformative force in the field of combinatorial optimization, offering novel approaches to tackling complex problems that have long challenged classical computational methods. Among these, the Quantum Approximate Optimization Algorithm (QAOA) stands out for its potential to efficiently solve the Max-Cut problem, a quintessential example of com…

    Submitted 5 March, 2024; originally announced March 2024.

  6. arXiv:2312.10813  [pdf, other]

    cs.CV cs.CL cs.LG

    Re-parameterized Low-rank Prompt: Generalize a Vision-Language Model within 0.5K Parameters

    Authors: Tianxiang Hao, Mengyao Lyu, Hui Chen, Sicheng Zhao, Jungong Han, Guiguang Ding

    Abstract: With the development of large pre-trained vision-language models, how to effectively transfer the knowledge of such foundational models to downstream tasks becomes a hot topic, especially in a data-deficient scenario. Recently, prompt tuning has become a popular solution. When adapting the vision-language models, researchers freeze the parameters in the backbone and only design and tune the prompt…

    Submitted 11 January, 2024; v1 submitted 17 December, 2023; originally announced December 2023.

  7. arXiv:2311.14762  [pdf, other]

    cs.CV cs.AI

    The 2nd Workshop on Maritime Computer Vision (MaCVi) 2024

    Authors: Benjamin Kiefer, Lojze Žust, Matej Kristan, Janez Perš, Matija Teršek, Arnold Wiliem, Martin Messmer, Cheng-Yen Yang, Hsiang-Wei Huang, Zhongyu Jiang, Heng-Cheng Kuo, Jie Mei, Jenq-Neng Hwang, Daniel Stadler, Lars Sommer, Kaer Huang, Aiguo Zheng, Weitu Chong, Kanokphan Lertniphonphan, Jun Xie, Feng Chen, Jian Li, Zhepeng Wang, Luca Zedda, Andrea Loddo , et al. (24 additional authors not shown)

    Abstract: The 2nd Workshop on Maritime Computer Vision (MaCVi) 2024 addresses maritime computer vision for Unmanned Aerial Vehicles (UAV) and Unmanned Surface Vehicles (USV). Three challenge categories are considered: (i) UAV-based Maritime Object Tracking with Re-identification, (ii) USV-based Maritime Obstacle Segmentation and Detection, (iii) USV-based Maritime Boat Tracking. The USV-based Maritime Obst…

    Submitted 23 November, 2023; originally announced November 2023.

    Comments: Part of 2nd Workshop on Maritime Computer Vision (MaCVi) 2024 IEEE Xplore submission as part of WACV 2024

  8. arXiv:2311.01743  [pdf, other]

    cs.IT cs.AI cs.LG cs.NI

    Energy Efficiency Optimization for Subterranean LoRaWAN Using A Reinforcement Learning Approach: A Direct-to-Satellite Scenario

    Authors: Kaiqiang Lin, Muhammad Asad Ullah, Hirley Alves, Konstantin Mikhaylov, Tong Hao

    Abstract: The integration of subterranean LoRaWAN and non-terrestrial networks (NTN) delivers substantial economic and societal benefits in remote agriculture and disaster rescue operations. The LoRa modulation leverages quasi-orthogonal spreading factors (SFs) to optimize data rates, airtime, coverage and energy consumption. However, it is still challenging to effectively assign SFs to end devices for mini…

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: 5 pages, 6 figures, paper accepted for publication in IEEE Wireless Communications Letters

  9. arXiv:2310.15106  [pdf, other]

    cs.IT eess.SP

    Theoretical Analysis of the Radio Map Estimation Problem

    Authors: Daniel Romero, Tien Ngoc Ha, Raju Shrestha, Massimo Franceschetti

    Abstract: Radio maps provide radio frequency metrics, such as the received signal strength, at every location of a geographic area. These maps, which are estimated using a set of measurements collected at multiple positions, find a wide range of applications in wireless communications, including the prediction of coverage holes, network planning, resource allocation, and path planning for mobile robots. Alt…

    Submitted 23 March, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

  10. arXiv:2310.11043  [pdf, other]

    eess.SP cs.AI

    Spoofing Attack Detection in the Physical Layer with Robustness to User Movement

    Authors: Daniel Romero, Tien Ngoc Ha, Peter Gerstoft

    Abstract: In a spoofing attack, an attacker impersonates a legitimate user to access or modify data belonging to the latter. Typical approaches for spoofing detection in the physical layer declare an attack when a change is observed in certain channel features, such as the received signal strength (RSS) measured by spatially distributed receivers. However, since channels change over time, for example due to…

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: WCNC. arXiv admin note: text overlap with arXiv:2211.04269

  11. arXiv:2310.11036  [pdf, other]

    eess.SP cs.AI physics.app-ph

    Radio Map Estimation: Empirical Validation and Analysis

    Authors: Raju Shrestha, Tien Ngoc Ha, Pham Q. Viet, Daniel Romero

    Abstract: Radio maps quantify magnitudes such as the received signal strength at every location of a geographical region. Although the estimation of radio maps has attracted widespread interest, the vast majority of works rely on simulated data and, therefore, cannot establish the effectiveness and relative performance of existing algorithms in practice. To fill this gap, this paper presents the first compr…

    Submitted 22 January, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: 13 pages, Journal version, submitted to the IEEE Transactions on Wireless Communications

  12. arXiv:2309.12668  [pdf, other]

    cs.RO

    UWA360CAM: A 360$^{\circ}$ 24/7 Real-Time Streaming Camera System for Underwater Applications

    Authors: Quan-Dung Pham, Yipeng Zhu, Tan-Sang Ha, K. H. Long Nguyen, Binh-Son Hua, Sai-Kit Yeung

    Abstract: Omnidirectional camera is a cost-effective and information-rich sensor highly suitable for many marine applications and the ocean scientific community, encompassing several domains such as augmented reality, mapping, motion estimation, visual surveillance, and simultaneous localization and mapping. However, designing and constructing such a high-quality 360$^{\circ}$ real-time streaming camera sys…

    Submitted 30 September, 2023; v1 submitted 22 September, 2023; originally announced September 2023.

  13. arXiv:2309.12025  [pdf, other]

    cs.DS cs.CC cs.LG math.CO

    Robust Approximation Algorithms for Non-monotone $k$-Submodular Maximization under a Knapsack Constraint

    Authors: Dung T. K. Ha, Canh V. Pham, Tan D. Tran, Huan X. Hoang

    Abstract: The problem of non-monotone $k$-submodular maximization under a knapsack constraint ($\kSMK$) over the ground set size $n$ has been raised in many applications in machine learning, such as data summarization, information propagation, etc. However, existing algorithms for the problem are facing questioning of how to overcome the non-monotone case and how to fast return a good solution in case of th…

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: 12 pages

    Report number: KSE-ID38

  14. arXiv:2308.03213  [pdf, other]

    quant-ph cs.AR cs.ET

    Enabling High Performance Debugging for Variational Quantum Algorithms using Compressed Sensing

    Authors: Kun Liu, Tianyi Hao, Swamit Tannu

    Abstract: Variational quantum algorithms (VQAs) can potentially solve practical problems using contemporary Noisy Intermediate Scale Quantum (NISQ) computers. VQAs find near-optimal solutions in the presence of qubit errors by classically optimizing a loss function computed by parameterized quantum circuits. However, developing and testing VQAs is challenging due to the limited availability of quantum hardw…

    Submitted 6 August, 2023; originally announced August 2023.

    Comments: 13 pages, 13 figures. KL and TH contributed equally to this work

    Journal ref: In Proceedings of the 50th Annual International Symposium on Computer Architecture (ISCA '23), June 17-21, 2023, Orlando, FL, USA. Association for Computing Machinery, New York, NY, USA, Article 9, 1-13

  15. arXiv:2307.01676  [pdf, other]

    cs.AI

    RaidEnv: Exploring New Challenges in Automated Content Balancing for Boss Raid Games

    Authors: Hyeon-Chang Jeon, In-Chang Baek, Cheong-mok Bae, Taehwa Park, Wonsang You, Taegwan Ha, Hoyun Jung, Jinha Noh, Seungwon Oh, Kyung-Joong Kim

    Abstract: The balance of game content significantly impacts the gaming experience. Unbalanced game content diminishes engagement or increases frustration because of repetitive failure. Although game designers intend to adjust the difficulty of game content, this is a repetitive, labor-intensive, and challenging process, especially for commercial-level games with extensive content. To address this issue, the…

    Submitted 4 July, 2023; originally announced July 2023.

    Comments: 14 pages, 6 figures, 6 tables, 2 algorithms

  16. arXiv:2306.17181  [pdf, other]

    cs.CL cs.LG

    Unsupervised Text Embedding Space Generation Using Generative Adversarial Networks for Text Synthesis

    Authors: Jun-Min Lee, Tae-Bin Ha

    Abstract: Generative Adversarial Networks (GAN) is a model for data synthesis, which creates plausible data through the competition of generator and discriminator. Although GAN application to image synthesis is extensively studied, it has inherent limitations to natural language generation. Because natural language is composed of discrete tokens, a generator has difficulty updating its gradient through back…

    Submitted 17 October, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: NEJLT accepted

  17. arXiv:2306.04593  [pdf, other]

    cs.CV cs.IR

    MarineVRS: Marine Video Retrieval System with Explainability via Semantic Understanding

    Authors: Tan-Sang Ha, Hai Nguyen-Truong, Tuan-Anh Vu, Sai-Kit Yeung

    Abstract: Building a video retrieval system that is robust and reliable, especially for the marine environment, is a challenging task due to several factors such as dealing with massive amounts of dense and repetitive data, occlusion, blurriness, low lighting conditions, and abstract queries. To address these challenges, we present MarineVRS, a novel and flexible video retrieval system designed explicitly f…

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: Accepted to OCEANS 2023 Limerick. Website: https://marinevrs.hkustvgd.com/

  18. arXiv:2305.10292  [pdf, other]

    cs.DS cs.AI

    Linear Query Approximation Algorithms for Non-monotone Submodular Maximization under Knapsack Constraint

    Authors: Canh V. Pham, Tan D. Tran, Dung T. K. Ha, My T. Thai

    Abstract: This work, for the first time, introduces two constant factor approximation algorithms with linear query complexity for non-monotone submodular maximization over a ground set of size $n$ subject to a knapsack constraint, $\mathsf{DLA}$ and $\mathsf{RLA}$. $\mathsf{DLA}$ is a deterministic algorithm that provides an approximation factor of $6+ε$ while $\mathsf{RLA}$ is a randomized algorithm with a…

    Submitted 10 July, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

  19. arXiv:2305.09296  [pdf, other]

    cs.IT cs.NI

    On CSI-Free Multi-Antenna Schemes for Massive Wireless-Powered Underground Sensor Networks

    Authors: Kaiqiang Lin, Onel Luis Alcaraz López, Hirley Alves, Tong Hao

    Abstract: Radio-frequency wireless energy transfer (WET) is a promising technology to realize wireless-powered underground sensor networks (WPUSNs) and enable sustainable underground monitoring. However, due to the severe attenuation in harsh underground soil and the tight energy budget of the underground sensors, traditional WPUSNs relying on the channel state information (CSI) are highly inefficient, espe…

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: 13 pages, 10 figures, paper accepted for publication in IEEE Internet of Things Journal

  20. arXiv:2305.00603  [pdf, other]

    cs.CV cs.AI cs.LG

    Consolidator: Mergeable Adapter with Grouped Connections for Visual Adaptation

    Authors: Tianxiang Hao, Hui Chen, Yuchen Guo, Guiguang Ding

    Abstract: Recently, transformers have shown strong ability as visual feature extractors, surpassing traditional convolution-based models in various scenarios. However, the success of vision transformers largely owes to their capacity to accommodate numerous parameters. As a result, new challenges for adapting large models to downstream tasks arise. On the one hand, classic fine-tuning tunes all parameters i…

    Submitted 30 April, 2023; originally announced May 2023.

    Comments: ICLR 2023

  21. arXiv:2301.13513  [pdf, other]

    cs.CR

    Privacy Preserving Ultra-Short-term Wind Power Prediction Based on Secure Multi Party Computation

    Authors: Hang Fan, Xiaoyu Fan, Tianyi Hao, Wei Wei, Kun Chen, Guosai Wang, Xiaofeng Jia, Yidong Li, Wei Xu

    Abstract: Mining the spatial and temporal correlation of wind farm output data is beneficial for enhancing the precision of ultra-short-term wind power prediction. However, if the wind farms are owned by separate entities, they may be reluctant to share their data directly due to privacy concerns as well as business management regulation policies. Although cryptographic approaches have been designed to prot…

    Submitted 31 January, 2023; originally announced January 2023.

  22. Primal-Dual Cops and Robber

    Authors: Minh Tuan Ha, Paul Jungeblut, Torsten Ueckerdt, Paweł Żyliński

    Abstract: Cops and Robber is a family of two-player games played on graphs in which one player controls a number of cops and the other player controls a robber. In alternating turns, each player moves (all) their figures. The cops try to capture the robber while the latter tries to flee indefinitely. In this paper we consider a variant of the game played on a planar graph where the robber moves between adja…

    Submitted 10 January, 2024; v1 submitted 13 January, 2023; originally announced January 2023.

    Comments: Equal to the published version

    Journal ref: Computing in Geometry and Topology, 3(2), 4:1-4:12 (2024)

  23. Exploiting In-Constraint Energy in Constrained Variational Quantum Optimization

    Authors: Tianyi Hao, Ruslan Shaydulin, Marco Pistoia, Jeffrey Larson

    Abstract: A central challenge of applying near-term quantum optimization algorithms to industrially relevant problems is the need to incorporate complex constraints. In general, such constraints cannot be easily encoded in the circuit, and the quantum circuit measurement outcomes are not guaranteed to respect the constraints. Therefore, the optimization must trade off the in-constraint probability and the q…

    Submitted 13 November, 2022; originally announced November 2022.

    Journal ref: In Proceedings of the Third International Workshop on Quantum Computing Software (in conjunction with SC22), 2022

  24. arXiv:2210.15136  [pdf, ps, other]

    cs.CV

    3D Shape Knowledge Graph for Cross-domain 3D Shape Retrieval

    Authors: Rihao Chang, Yongtao Ma, Tong Hao, Weizhi Nie

    Abstract: The surge in 3D modeling has led to a pronounced research emphasis on the field of 3D shape retrieval. Numerous contemporary approaches have been put forth to tackle this intricate challenge. Nevertheless, effectively addressing the intricacies of cross-modal 3D shape retrieval remains a formidable undertaking, owing to inherent modality-based disparities. This study presents an innovative notion,…

    Submitted 21 December, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

  25. arXiv:2209.11518  [pdf, other]

    cs.CV cs.IR cs.MM

    Marine Video Kit: A New Marine Video Dataset for Content-based Analysis and Retrieval

    Authors: Quang-Trung Truong, Tuan-Anh Vu, Tan-Sang Ha, Lokoc Jakub, Yue Him Wong Tim, Ajay Joneja, Sai-Kit Yeung

    Abstract: Effective analysis of unusual domain specific video collections represents an important practical problem, where state-of-the-art general purpose models still face limitations. Hence, it is desirable to design benchmark datasets that challenge novel powerful models for specific domains with additional constraints. It is important to remember that domain specific data may be noisier (e.g., endoscop…

    Submitted 6 December, 2022; v1 submitted 23 September, 2022; originally announced September 2022.

    Comments: Camera Ready for MMM 2023, Bergen, Norway

  26. Information Extraction from Scanned Invoice Images using Text Analysis and Layout Features

    Authors: Hien Thi Ha, Aleš Horák

    Abstract: While storing invoice content as metadata to avoid paper document processing may be the future trend, almost all of daily issued invoices are still printed on paper or generated in digital formats such as PDFs. In this paper, we introduce the OCRMiner system for information extraction from scanned document images which is based on text analysis techniques in combination with layout features to ext…

    Submitted 8 August, 2022; originally announced August 2022.

    Comments: This is an author preprint of the article published by Elsevier in Signal Processing: Image Communication at https://doi.org/10.1016/j.image.2021.116601

    Journal ref: Signal Processing: Image Communication 102 (2022)

  27. arXiv:2205.05918  [pdf, other]

    cs.CV

    Fall detection using multimodal data

    Authors: Thao V. Ha, Hoang Nguyen, Son T. Huynh, Trung T. Nguyen, Binh T. Nguyen

    Abstract: In recent years, the occurrence of falls has increased and has had detrimental effects on older adults. Therefore, various machine learning approaches and datasets have been introduced to construct an efficient fall detection algorithm for the social community. This paper studies the fall detection problem based on a large public dataset, namely the UP-Fall Detection Dataset. This dataset was coll…

    Submitted 12 May, 2022; originally announced May 2022.

    Comments: 12 pages, 5 figures, 6 tables

  28. A quantum-inspired tensor network method for constrained combinatorial optimization problems

    Authors: Tianyi Hao, Xuxin Huang, Chunjing Jia, Cheng Peng

    Abstract: Combinatorial optimization is of general interest for both theoretical study and real-world applications. Fast-developing quantum algorithms provide a different perspective on solving combinatorial optimization problems. In this paper, we propose a quantum-inspired tensor-network-based algorithm for general locally constrained combinatorial optimization problems. Our algorithm constructs a Hamilto…

    Submitted 5 September, 2022; v1 submitted 29 March, 2022; originally announced March 2022.

    Journal ref: Frontiers in Physics, Volume 10, Article 906590 (2022)

  29. arXiv:2111.11707  [pdf, other]

    cs.CL

    Deps-SAN: Neural Machine Translation with Dependency-Scaled Self-Attention Network

    Authors: Ru Peng, Nankai Lin, Yi Fang, Shengyi Jiang, Tianyong Hao, Boyu Chen, Junbo Zhao

    Abstract: Syntax knowledge contributes its powerful strength in Neural machine translation (NMT) tasks. Early NMT works supposed that syntax details can be automatically learned from numerous texts via attention networks. However, succeeding researches pointed out that limited by the uncontrolled nature of attention computation, the NMT model requires an external syntax to capture the deep syntactic awarene…

    Submitted 4 October, 2022; v1 submitted 23 November, 2021; originally announced November 2021.

  30. Contrastive Proposal Extension with LSTM Network for Weakly Supervised Object Detection

    Authors: Pei Lv, Suqi Hu, Tianran Hao

    Abstract: Weakly supervised object detection (WSOD) has attracted more and more attention since it only uses image-level labels and can save huge annotation costs. Most of the WSOD methods use Multiple Instance Learning (MIL) as their basic framework, which regard it as an instance classification problem. However, these methods based on MIL tends to converge only on the most discriminate regions of differen…

    Submitted 19 October, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

    Comments: 15 pages,12 figures, accepted to IEEE Transactions on Image Processing

  31. arXiv:2109.08863   

    cs.DS cs.GT

    Streaming algorithms for Budgeted $k$-Submodular Maximization problem

    Authors: Canh V. Pham, Quang C. Vu, Dung K. T. Ha, Tai T. Nguyen

    Abstract: Stimulated by practical applications arising from viral marketing. This paper investigates a novel Budgeted $k$-Submodular Maximization problem defined as follows: Given a finite set $V$, a budget $B$ and a $k$-submodular function $f: (k+1)^V \mapsto \mathbb{R}_+$, the problem asks to find a solution $\s=(S_1, S_2, \ldots, S_k)$, each element $e \in V$ has a cost $c_i(e)$ to be put into $i$-th set…

    Submitted 22 October, 2021; v1 submitted 18 September, 2021; originally announced September 2021.

    Comments: There are some results of the article that need to be corrected

  32. arXiv:2109.04004  [pdf, ps, other]

    cs.AI

    OpenClinicalAI: enabling AI to diagnose diseases in real-world clinical settings

    Authors: Yunyou Huang, Nana Wang, Suqin Tang, Li Ma, Tianshu Hao, Zihan Jiang, Fan Zhang, Guoxin Kang, Xiuxia Miao, Xianglong Guan, Ruchang Zhang, Zhifei Zhang, Jianfeng Zhan

    Abstract: This paper quantitatively reveals the state-of-the-art and state-of-the-practice AI systems only achieve acceptable performance on the stringent conditions that all categories of subjects are known, which we call closed clinical settings, but fail to work in real-world clinical settings. Compared to the diagnosis task in the closed setting, real-world clinical settings pose severe challenges, and…

    Submitted 8 September, 2021; originally announced September 2021.

  33. arXiv:2107.14444  [pdf, other]

    cs.CV cs.LG

    Manipulating Identical Filter Redundancy for Efficient Pruning on Deep and Complicated CNN

    Authors: Xiaohan Ding, Tianxiang Hao, Jungong Han, Yuchen Guo, Guiguang Ding

    Abstract: The existence of redundancy in Convolutional Neural Networks (CNNs) enables us to remove some filters/channels with acceptable performance drops. However, the training objective of CNNs usually tends to minimize an accuracy-related loss function without any attention paid to the redundancy, making the redundancy distribute randomly on all the filters, such that removing any of them may trigger inf…

    Submitted 30 July, 2021; originally announced July 2021.

    Comments: Extension of the CVPR-2019 paper (https://openaccess.thecvf.com/content_CVPR_2019/papers/Ding_Centripetal_SGD_for_Pruning_Very_Deep_Convolutional_Networks_With_Complicated_CVPR_2019_paper.pdf). arXiv admin note: substantial text overlap with arXiv:1904.03837

  34. arXiv:2104.04650  [pdf, other]

    cs.CV cs.AI

    Towards Automated and Marker-less Parkinson Disease Assessment: Predicting UPDRS Scores using Sit-stand videos

    Authors: Deval Mehta, Umar Asif, Tian Hao, Erhan Bilal, Stefan Von Cavallar, Stefan Harrer, Jeffrey Rogers

    Abstract: This paper presents a novel deep learning enabled, video based analysis framework for assessing the Unified Parkinsons Disease Rating Scale (UPDRS) that can be used in the clinic or at home. We report results from comparing the performance of the framework to that of trained clinicians on a population of 32 Parkinsons disease (PD) patients. In-person clinical assessments by trained neurologists ar…

    Submitted 9 April, 2021; originally announced April 2021.

    Comments: Accepted by CVPR Workshops 2021

  35. arXiv:2103.13080  [pdf, other]

    cs.CV cs.LG cs.NE

    Shift-and-Balance Attention

    Authors: Chunjie Luo, Jianfeng Zhan, Tianshu Hao, Lei Wang, Wanling Gao

    Abstract: Attention is an effective mechanism to improve the deep model capability. Squeeze-and-Excite (SE) introduces a light-weight attention branch to enhance the network's representational power. The attention branch is gated using the Sigmoid function and multiplied by the feature map's trunk branch. It is too sensitive to coordinate and balance the trunk and attention branches' contributions. To contr…

    Submitted 24 March, 2021; originally announced March 2021.

  36. arXiv:2012.08743  [pdf, ps, other]

    cs.CL cs.LG

    Improving Multilingual Neural Machine Translation For Low-Resource Languages: French,English - Vietnamese

    Authors: Thi-Vinh Ngo, Phuong-Thai Nguyen, Thanh-Le Ha, Khac-Quy Dinh, Le-Minh Nguyen

    Abstract: Prior works have demonstrated that a low-resource language pair can benefit from multilingual machine translation (MT) systems, which rely on many language pairs' joint training. This paper proposes two simple strategies to address the rare word issue in multilingual MT systems for two low-resource language pairs: French-Vietnamese and English-Vietnamese. The first strategy is about dynamical lear…

    Submitted 10 July, 2021; v1 submitted 15 December, 2020; originally announced December 2020.

    Comments: The 3rd Workshop on Technologies for MT of Low Resource Languages (LoResMT 2020)

  37. arXiv:2007.03260  [pdf, other]

    cs.LG cs.CV eess.IV stat.ML

    ResRep: Lossless CNN Pruning via Decoupling Remembering and Forgetting

    Authors: Xiaohan Ding, Tianxiang Hao, Jianchao Tan, Ji Liu, Jungong Han, Yuchen Guo, Guiguang Ding

    Abstract: We propose ResRep, a novel method for lossless channel pruning (a.k.a. filter pruning), which slims down a CNN by reducing the width (number of output channels) of convolutional layers. Inspired by the neurobiology research about the independence of remembering and forgetting, we propose to re-parameterize a CNN into the remembering parts and forgetting parts, where the former learn to maintain th…

    Submitted 14 August, 2021; v1 submitted 7 July, 2020; originally announced July 2020.

    Comments: ICCV 2021

  38. arXiv:2006.15234  [pdf, other]

    cs.DC physics.comp-ph quant-ph

    Efficient 2D Tensor Network Simulation of Quantum Systems

    Authors: Yuchen Pang, Tianyi Hao, Annika Dugad, Yiqing Zhou, Edgar Solomonik

    Abstract: Simulation of quantum systems is challenging due to the exponential size of the state space. Tensor networks provide a systematically improvable approximation for quantum states. 2D tensor networks such as Projected Entangled Pair States (PEPS) are well-suited for key classes of physical systems and quantum circuits. However, direct contraction of PEPS networks has exponential cost, while approxim…

    Submitted 3 September, 2020; v1 submitted 26 June, 2020; originally announced June 2020.

    Comments: to be published in SC 2020

  39. arXiv:2005.09940  [pdf, other]

    eess.AS cs.CL cs.SD

    Relative Positional Encoding for Speech Recognition and Direct Translation

    Authors: Ngoc-Quan Pham, Thanh-Le Ha, Tuan-Nam Nguyen, Thai-Son Nguyen, Elizabeth Salesky, Sebastian Stueker, Jan Niehues, Alexander Waibel

    Abstract: Transformer models are powerful sequence-to-sequence architectures that are capable of directly mapping speech inputs to transcriptions or translations. However, the mechanism for modeling positions in this model was tailored for text modeling, and thus is less ideal for acoustic inputs. In this work, we adapt the relative position encoding scheme to the Speech Transformer, where the key addition…

    Submitted 20 May, 2020; originally announced May 2020.

    Comments: Submitted to Interspeech 2020

  40. arXiv:2004.14690  [pdf, other]

    cs.AI cs.LG

    AIBench Training: Balanced Industry-Standard AI Training Benchmarking

    Authors: Fei Tang, Wanling Gao, Jianfeng Zhan, Chuanxin Lan, Xu Wen, Lei Wang, Chunjie Luo, Jiahui Dai, Zheng Cao, Xingwang Xiong, Zihan Jiang, Tianshu Hao, Fanda Fan, Fan Zhang, Yunyou Huang, Jianan Chen, Mengjia Du, Rui Ren, Chen Zheng, Daoyi Zheng, Haoning Tang, Kunlin Zhan, Biao Wang, Defei Kong, Minghe Yu , et al. (8 additional authors not shown)

    Abstract: Earlier-stage evaluations of a new AI architecture/system need affordable benchmarks. Only using a few AI component benchmarks like MLPerf alone in the other stages may lead to misleading conclusions. Moreover, the learning dynamics are not well understood, and the benchmarks' shelf-life is short. This paper proposes a balanced benchmarking methodology. We use real-world benchmarks to cover the fac…

    Submitted 10 March, 2021; v1 submitted 30 April, 2020; originally announced April 2020.

    Comments: ISPASS 2021

  41. arXiv:2003.09891  [pdf, other]

    eess.AS cs.CL cs.SD

    Low Latency ASR for Simultaneous Speech Translation

    Authors: Thai Son Nguyen, Jan Niehues, Eunah Cho, Thanh-Le Ha, Kevin Kilgour, Markus Muller, Matthias Sperber, Sebastian Stueker, Alex Waibel

    Abstract: User studies have shown that reducing the latency of our simultaneous lecture translation system should be the most important goal. We therefore have worked on several techniques for reducing the latency for both components, the automatic speech recognition and the speech translation module. Since the commonly used commitment latency is not appropriate in our case of continuous stream decoding, we…

    Submitted 22 March, 2020; originally announced March 2020.

  42. arXiv:2002.07162  [pdf, other]

    cs.PF cs.CV

    AIBench: An Agile Domain-specific Benchmarking Methodology and an AI Benchmark Suite

    Authors: Wanling Gao, Fei Tang, Jianfeng Zhan, Chuanxin Lan, Chunjie Luo, Lei Wang, Jiahui Dai, Zheng Cao, Xiongwang Xiong, Zihan Jiang, Tianshu Hao, Fanda Fan, Xu Wen, Fan Zhang, Yunyou Huang, Jianan Chen, Mengjia Du, Rui Ren, Chen Zheng, Daoyi Zheng, Haoning Tang, Kunlin Zhan, Biao Wang, Defei Kong, Minghe Yu, et al. (9 additional authors not shown)

    Abstract: Domain-specific software and hardware co-design is encouraging as it is much easier to achieve efficiency for fewer tasks. Agile domain-specific benchmarking speeds up the process as it provides not only relevant design inputs but also relevant metrics, and tools. Unfortunately, modern workloads like Big data, AI, and Internet services dwarf the traditional one in terms of code size, deployment sc…

    Submitted 17 February, 2020; originally announced February 2020.

    Comments: 25 pages, 7 figures. arXiv admin note: substantial text overlap with arXiv:1908.08998

  43. arXiv:2002.03493  [pdf, other]

    cs.DC cs.PF

    AI-oriented Medical Workload Allocation for Hierarchical Cloud/Edge/Device Computing

    Authors: Tianshu Hao, Jianfeng Zhan, Kai Hwang, Wanling Gao, Xu Wen

    Abstract: In a hierarchically-structured cloud/edge/device computing environment, workload allocation can greatly affect the overall system performance. This paper deals with AI-oriented medical workload generated in emergency rooms (ER) or intensive care units (ICU) in metropolitan areas. The goal is to optimize AI-workload allocation to cloud clusters, edge servers, and end devices so that minimum respons…

    Submitted 9 February, 2020; originally announced February 2020.

  44. arXiv:1910.03467  [pdf, ps, other]

    cs.CL cs.LG stat.ML

    Overcoming the Rare Word Problem for Low-Resource Language Pairs in Neural Machine Translation

    Authors: Thi-Vinh Ngo, Thanh-Le Ha, Phuong-Thai Nguyen, Le-Minh Nguyen

    Abstract: Among the six challenges of neural machine translation (NMT) identified by Koehn and Knowles (2017), the rare-word problem is considered the most severe one, especially in translation of low-resource languages. In this paper, we propose three solutions to address the rare words in neural machine translation systems. First, we enhance source context to predict the target words by connecting directly the s…

    Submitted 17 October, 2019; v1 submitted 6 October, 2019; originally announced October 2019.

    Journal ref: Proceedings of the 6th Workshop on Asian Translation, WAT 2019

  45. How Transformer Revitalizes Character-based Neural Machine Translation: An Investigation on Japanese-Vietnamese Translation Systems

    Authors: Thi-Vinh Ngo, Thanh-Le Ha, Phuong-Thai Nguyen, Le-Minh Nguyen

    Abstract: While translating between East Asian languages, many works have discovered clear advantages of using characters as the translation unit. Unfortunately, traditional recurrent neural machine translation systems hinder the practical usage of those character-based systems due to their architectural limitations. They are unfavorable in handling extremely long sequences as well as highly restricted in p…

    Submitted 17 October, 2019; v1 submitted 5 October, 2019; originally announced October 2019.

    Journal ref: 16th International Workshop on Spoken Language Translation 2019

  46. arXiv:1908.01924  [pdf, ps, other]

    cs.PF cs.DC

    Edge AIBench: Towards Comprehensive End-to-end Edge Computing Benchmarking

    Authors: Tianshu Hao, Yunyou Huang, Xu Wen, Wanling Gao, Fan Zhang, Chen Zheng, Lei Wang, Hainan Ye, Kai Hwang, Zujie Ren, Jianfeng Zhan

    Abstract: In edge computing scenarios, the distribution of data and collaboration of workloads on different layers are serious concerns for performance, privacy, and security issues. So for edge computing benchmarking, we must take an end-to-end view, considering all three layers: client-side devices, edge computing layer, and cloud servers. Unfortunately, the previous work ignores this most important point…

    Submitted 5 August, 2019; originally announced August 2019.

  47. arXiv:1908.00298  [pdf, other]

    eess.SP cs.LG cs.NE

    LoadCNN: A Low Training Cost Deep Learning Model for Day-Ahead Individual Residential Load Forecasting

    Authors: Yunyou Huang, Nana Wang, Wanling Gao, Xiaoxu Guo, Cheng Huang, Tianshu Hao, Jianfeng Zhan

    Abstract: Accurate day-ahead individual residential load forecasting is of great importance to various applications of smart grid on day-ahead market. Deep learning, as a powerful machine learning technology, has shown great advantages and promising application in load forecasting tasks. However, deep learning is a computationally-hungry method, and requires high costs (e.g., time, energy and CO2 emission)…

    Submitted 19 December, 2019; v1 submitted 1 August, 2019; originally announced August 2019.

  48. arXiv:1906.08584  [pdf, other]

    cs.CL

    Improving Zero-shot Translation with Language-Independent Constraints

    Authors: Ngoc-Quan Pham, Jan Niehues, Thanh-Le Ha, Alex Waibel

    Abstract: An important concern in training multilingual neural machine translation (NMT) is to translate between language pairs unseen during training, i.e., zero-shot translation. Improving this ability kills two birds with one stone by providing an alternative to pivot translation which also allows us to better understand how the model captures information between languages. In this work, we carried out a…

    Submitted 20 June, 2019; originally announced June 2019.

    Comments: 10 pages version accepted in WMT 2019

  49. arXiv:1905.02940  [pdf, other]

    cs.AI

    A new direction to promote the implementation of artificial intelligence in natural clinical settings

    Authors: Yunyou Huang, Zhifei Zhang, Nana Wang, Nengquan Li, Mengjia Du, Tianshu Hao, Jianfeng Zhan

    Abstract: Artificial intelligence (AI) researchers claim that they have made great 'achievements' in clinical realms. However, clinicians point out the so-called 'achievements' have no ability to implement into natural clinical settings. The root cause for this huge gap is that many essential features of natural clinical tasks are overlooked by AI system developers without medical background. In this paper,…

    Submitted 8 May, 2019; originally announced May 2019.

  50. Synapse: Synthetic Application Profiler and Emulator

    Authors: Andre Merzky, Ming Tai Ha, Matteo Turilli, Shantenu Jha

    Abstract: Motivated by the need to emulate workload execution characteristics on high-performance and distributed heterogeneous resources, we introduce Synapse. Synapse is used as a proxy application (or "representative application") for real workloads, with the advantage that it can be tuned in different ways and dimensions, and also at levels of granularity that are not possible with real applications. Sy…

    Submitted 2 August, 2018; originally announced August 2018.

    Comments: Large portions of this work originally appeared as arXiv:1506.00272, which was subsequently published as a workshop paper. This is an extended version published in the "Journal of Computational Science"

    Report number: 01

    Journal ref: Journal of Computational Science, 27C (2018) pp. 329-344