Showing 1–50 of 58 results for author: Chu, J

  1. arXiv:2407.02744  [pdf, other]

    eess.IV cs.CV

    Highly Accelerated MRI via Implicit Neural Representation Guided Posterior Sampling of Diffusion Models

    Authors: Jiayue Chu, Chenhe Du, Xiyue Lin, Yuyao Zhang, Hongjiang Wei

    Abstract: Reconstructing high-fidelity magnetic resonance (MR) images from under-sampled k-space is a commonly used strategy to reduce scan time. The posterior sampling of diffusion models based on the real measurement data holds significant promise of improved reconstruction accuracy. However, traditional posterior sampling methods often lack effective data consistency guidance, leading to inaccurate and u…

    Submitted 2 July, 2024; originally announced July 2024.

  2. arXiv:2406.01604  [pdf, other]

    cs.IR cs.AI cs.CV cs.MM

    An Empirical Study of Excitation and Aggregation Design Adaptions in CLIP4Clip for Video-Text Retrieval

    Authors: Xiaolun Jing, Genke Yang, Jian Chu

    Abstract: CLIP4Clip model transferred from the CLIP has been the de facto standard to solve the video clip retrieval task from frame-level input, triggering the surge of CLIP4Clip-based models in the video-text retrieval domain. In this work, we rethink the inherent limitation of widely-used mean pooling operation in the frame features aggregation and investigate the adaptions of excitation and aggregation…

    Submitted 8 June, 2024; v1 submitted 25 May, 2024; originally announced June 2024.

    Comments: 20 pages

  3. arXiv:2404.19360  [pdf, other]

    cs.CV cs.CL cs.IR

    Large Language Model Informed Patent Image Retrieval

    Authors: Hao-Cheng Lo, Jung-Mei Chu, Jieh Hsiang, Chun-Chieh Cho

    Abstract: In patent prosecution, image-based retrieval systems for identifying similarities between current patent images and prior art are pivotal to ensure the novelty and non-obviousness of patent applications. Despite their growing popularity in recent years, existing attempts, while effective at recognizing images within the same patent, fail to deliver practical value due to their limited generalizabi…

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: 8 pages. Under review

  4. arXiv:2404.05576  [pdf, other]

    cs.LG

    Dynamic Backtracking in GFlowNets: Enhancing Decision Steps with Reward-Dependent Adjustment Mechanisms

    Authors: Shuai Guo, Jielei Chu, Lei Zhu, Zhaoyu Li, Tianrui Li

    Abstract: Generative Flow Networks (GFlowNets or GFNs) are probabilistic models predicated on Markov flows, and they employ specific amortization algorithms to learn stochastic policies that generate compositional substances including biomolecules, chemical materials, etc. With a strong ability to generate high-performance biochemical molecules, GFNs accelerate the discovery of scientific substances, effect…

    Submitted 13 May, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

  5. arXiv:2403.17445  [pdf, other]

    cs.LG cs.AI cs.CL

    Incorporating Exponential Smoothing into MLP: A Simple but Effective Sequence Model

    Authors: Jiqun Chu, Zuoquan Lin

    Abstract: Modeling long-range dependencies in sequential data is a crucial step in sequence learning. A recently developed model, the Structured State Space (S4), demonstrated significant effectiveness in modeling long-range sequences. However, it is unclear whether the success of S4 can be attributed to its intricate parameterization and HiPPO initialization or simply due to State Space Models (SSMs). To f…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 12 pages, 5 tables, 3 figures

  6. arXiv:2403.13347  [pdf, other]

    cs.CV

    vid-TLDR: Training Free Token merging for Light-weight Video Transformer

    Authors: Joonmyung Choi, Sanghyeok Lee, Jaewon Chu, Minhyuk Choi, Hyunwoo J. Kim

    Abstract: Video Transformers have become the prevalent solution for various video downstream tasks with superior expressive power and flexibility. However, these video transformers suffer from heavy computational costs induced by the massive number of tokens across the entire video frames, which has been the major barrier to training the model. Further, the patches irrelevant to the main contents, e.g., bac…

    Submitted 30 March, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

    Comments: Conference on Computer Vision and Pattern Recognition (CVPR), 2024

  7. arXiv:2402.15119  [pdf]

    cs.CY cs.SI

    A multidisciplinary framework for deconstructing bots' pluripotency in dualistic antagonism

    Authors: Wentao Xu, Kazutoshi Sasahara, Jianxun Chu, Bin Wang, Wenlu Fan, Zhiwen Hu

    Abstract: Anthropomorphic social bots are engineered to emulate human verbal communication and generate toxic or inflammatory content across social networking services (SNSs). Bot-disseminated misinformation could subtly yet profoundly reshape societal processes by complexly interweaving factors like repeated disinformation exposure, amplified political polarization, compromised indicators of democratic hea…

    Submitted 11 May, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    ACM Class: J.4

  8. arXiv:2402.11231  [pdf]

    cs.CR q-fin.GN

    Enhancing Security in Blockchain Networks: Anomalies, Frauds, and Advanced Detection Techniques

    Authors: Joerg Osterrieder, Stephen Chan, Jeffrey Chu, Yuanyuan Zhang, Branka Hadji Misheva, Codruta Mare

    Abstract: Blockchain technology, a foundational distributed ledger system, enables secure and transparent multi-party transactions. Despite its advantages, blockchain networks are susceptible to anomalies and frauds, posing significant risks to their integrity and security. This paper offers a detailed examination of blockchain's key definitions and properties, alongside a thorough analysis of the various a…

    Submitted 17 February, 2024; originally announced February 2024.

  9. arXiv:2402.09846  [pdf]

    physics.ao-ph cs.LG eess.SP

    A Deep Learning Approach to Radar-based QPE

    Authors: Ting-Shuo Yo, Shih-Hao Su, Jung-Lien Chu, Chiao-Wei Chang, Hung-Chi Kuo

    Abstract: In this study, we propose a volume-to-point framework for quantitative precipitation estimation (QPE) based on the Quantitative Precipitation Estimation and Segregation Using Multiple Sensor (QPESUMS) Mosaic Radar data set. With a data volume consisting of the time series of gridded radar reflectivities over the Taiwan area, we used machine learning algorithms to establish a statistical model for…

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: 22 pages, 11 figures. Published in Earth and Space Science

    Journal ref: Earth Space Sci. 2021, 8, e2020EA001340

  10. arXiv:2402.06938  [pdf, other]

    cs.DC cs.AI cs.LG

    Efficient Resource Scheduling for Distributed Infrastructures Using Negotiation Capabilities

    Authors: Junjie Chu, Prashant Singh, Salman Toor

    Abstract: In the past few decades, the rapid development of information and internet technologies has spawned massive amounts of data and information. The information explosion drives many enterprises or individuals to seek to rent cloud computing infrastructure to put their applications in the cloud. However, the agreements reached between cloud computing providers and clients are often not efficient. Many…

    Submitted 13 February, 2024; v1 submitted 10 February, 2024; originally announced February 2024.

    Comments: Accepted in IEEE CLOUD 2023. 13 pages, 5 figures

  11. arXiv:2402.05668  [pdf, other]

    cs.CR cs.AI cs.CL cs.LG

    Comprehensive Assessment of Jailbreak Attacks Against LLMs

    Authors: Junjie Chu, Yugeng Liu, Ziqing Yang, Xinyue Shen, Michael Backes, Yang Zhang

    Abstract: Misuse of the Large Language Models (LLMs) has raised widespread concern. To address this issue, safeguards have been taken to ensure that LLMs align with social ethics. However, recent findings have revealed an unsettling vulnerability bypassing the safeguards of LLMs, known as jailbreak attacks. By applying techniques, such as employing role-playing scenarios, adversarial examples, or subtle sub…

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: 18 pages, 12 figures

  12. arXiv:2402.02987  [pdf, other]

    cs.CR cs.AI cs.CL cs.LG

    Conversation Reconstruction Attack Against GPT Models

    Authors: Junjie Chu, Zeyang Sha, Michael Backes, Yang Zhang

    Abstract: In recent times, significant advancements have been made in the field of large language models (LLMs), represented by GPT series models. To optimize task execution, users often engage in multi-round conversations with GPT models hosted in cloud environments. These multi-round conversations, potentially replete with private information, require transmission and storage within the cloud. However, th…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 17 pages, 11 figures

  13. arXiv:2402.00421  [pdf, other]

    cs.CL cs.HC cs.IR cs.LG

    From PARIS to LE-PARIS: Toward Patent Response Automation with Recommender Systems and Collaborative Large Language Models

    Authors: Jung-Mei Chu, Hao-Cheng Lo, Jieh Hsiang, Chun-Chieh Cho

    Abstract: In patent prosecution, timely and effective responses to Office Actions (OAs) are crucial for securing patents. However, past automation and artificial intelligence research have largely overlooked this aspect. To bridge this gap, our study introduces the Patent Office Action Response Intelligence System (PARIS) and its advanced version, the Large Language Model (LLM) Enhanced PARIS (LE-PARIS). Th…

    Submitted 4 March, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: 28 pages, 5 figures, typos corrected, references added, under review

  14. arXiv:2311.16207  [pdf, other]

    q-bio.QM cs.IR cs.LG

    The Graph Convolutional Network with Multi-representation Alignment for Drug Synergy Prediction

    Authors: Xinxing Yang, Genke Yang, Jian Chu

    Abstract: Drug combination refers to the use of two or more drugs to treat a specific disease at the same time. It is currently the mainstream way to treat complex diseases. Compared with single drugs, drug combinations have better efficacy and can better inhibit toxicity and drug resistance. The computational model based on deep learning concatenates the representation of multiple drugs and the correspondi…

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: 14 pages

  15. arXiv:2310.20258  [pdf, other]

    cs.LG

    Advancing Bayesian Optimization via Learning Correlated Latent Space

    Authors: Seunghun Lee, Jaewon Chu, Sihyeon Kim, Juyeon Ko, Hyunwoo J. Kim

    Abstract: Bayesian optimization is a powerful method for optimizing black-box functions with limited function evaluations. Recent works have shown that optimization in a latent space through deep generative models such as variational autoencoders leads to effective and efficient Bayesian optimization for structured or discrete data. However, as the optimization does not take place in the input space, it lea…

    Submitted 19 November, 2023; v1 submitted 31 October, 2023; originally announced October 2023.

  16. arXiv:2310.15484  [pdf, other]

    cs.CL cs.AI

    NuTrea: Neural Tree Search for Context-guided Multi-hop KGQA

    Authors: Hyeong Kyu Choi, Seunghun Lee, Jaewon Chu, Hyunwoo J. Kim

    Abstract: Multi-hop Knowledge Graph Question Answering (KGQA) is a task that involves retrieving nodes from a knowledge graph (KG) to answer natural language questions. Recent GNN-based approaches formulate this task as a KG path searching problem, where messages are sequentially propagated from the seed node towards the answer nodes. However, these messages are past-oriented, and they do not consider the f…

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: Neural Information Processing Systems (NeurIPS) 2023

  17. arXiv:2310.08984  [pdf, other]

    cs.CV

    UniParser: Multi-Human Parsing with Unified Correlation Representation Learning

    Authors: Jiaming Chu, Lei Jin, Junliang Xing, Jian Zhao

    Abstract: Multi-human parsing is an image segmentation task necessitating both instance-level and fine-grained category-level information. However, prior research has typically processed these two types of information through separate branches and distinct output formats, leading to inefficient and redundant frameworks. This paper introduces UniParser, which integrates instance-level and category-level repr…

    Submitted 19 May, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

  18. arXiv:2309.10508  [pdf, ps, other]

    cs.NI

    Enhanced C-V2X Mode 4 to Optimize Age of Information and Reliability for IoV

    Authors: Jiahou Chu, Qiong Wu, Qiang Fan, Zhengquan Li

    Abstract: Internet of vehicles (IoV) has emerged as a key technology to realize real-time vehicular application. For IoV, vehicles adopt cellular vehicle-to-everything (C-V2X) standard to support direct communication among them. C-V2X mode 4 controls resource allocation without the assistance of cellular network, hence it is widely used for IoV. However, C-V2X mode 4 has two drawbacks. First is that vehicle…

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: This paper has been accepted by ICCT 2023. The source code can be found at https://github.com/qiongwu86/ns3 sumo cv2x mode4.git

  19. arXiv:2309.04737  [pdf, other]

    cs.LG

    Learning Spiking Neural Network from Easy to Hard task

    Authors: Lingling Tang, Jiangtao Hu, Hua Yu, Surui Liu, Jielei Chu

    Abstract: Starting with small and simple concepts, and gradually introducing complex and difficult concepts is the natural process of human learning. Spiking Neural Networks (SNNs) aim to mimic the way humans process information, but current SNN models treat all samples equally, which does not align with the principles of human learning and overlooks the biological plausibility of SNNs. To address this, we…

    Submitted 25 September, 2023; v1 submitted 9 September, 2023; originally announced September 2023.

  20. arXiv:2308.09363  [pdf, other]

    cs.CV

    Open-vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models

    Authors: Dohwan Ko, Ji Soo Lee, Miso Choi, Jaewon Chu, Jihwan Park, Hyunwoo J. Kim

    Abstract: Video Question Answering (VideoQA) is a challenging task that entails complex multi-modal reasoning. In contrast to multiple-choice VideoQA which aims to predict the answer given several options, the goal of open-ended VideoQA is to answer questions without restricting candidate answers. However, the majority of previous VideoQA models formulate open-ended VideoQA as a classification task to class…

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: Accepted paper at ICCV 2023

  21. arXiv:2307.08989  [pdf, other]

    cs.LG cs.IR q-bio.QM

    GraphCL-DTA: a graph contrastive learning with molecular semantics for drug-target binding affinity prediction

    Authors: Xinxing Yang, Genke Yang, Jian Chu

    Abstract: Drug-target binding affinity prediction plays an important role in the early stages of drug discovery, which can infer the strength of interactions between new drugs and new targets. However, the performance of previous computational models is limited by the following drawbacks. The learning of drug representation relies only on supervised data, without taking into account the information containe…

    Submitted 18 July, 2023; originally announced July 2023.

    Comments: 13 pages, 4 figures, 5 tables

  22. Im2win: Memory Efficient Convolution On SIMD Architectures

    Authors: Shuai Lu, Jun Chu, Xu T. Liu

    Abstract: Convolution is the most expensive operation among neural network operations, thus its performance is critical to the overall performance of neural networks. Commonly used convolution approaches, including general matrix multiplication (GEMM)-based convolution and direct convolution, rely on im2col for data transformation or do not use data transformation at all, respectively. However, the im2col d…

    Submitted 25 June, 2023; originally announced June 2023.

    Comments: Published at "2022 IEEE High Performance Extreme Computing Conference (HPEC)"

    ACM Class: I.2.10

    Journal ref: 2022 IEEE High Performance Extreme Computing Conference (HPEC), Waltham, MA, USA, 2022, pp. 1-7

  23. arXiv:2306.14316  [pdf, other]

    cs.NE cs.AI cs.LG

    Im2win: An Efficient Convolution Paradigm on GPU

    Authors: Shuai Lu, Jun Chu, Luanzheng Guo, Xu T. Liu

    Abstract: Convolution is the most time-consuming operation in deep neural network operations, so its performance is critical to the overall performance of the neural network. The commonly used methods for convolution on GPU include the general matrix multiplication (GEMM)-based convolution and the direct convolution. GEMM-based convolution relies on the im2col algorithm, which results in a large memory foot…

    Submitted 25 June, 2023; originally announced June 2023.

    Comments: Accepted at "29th International European conference on parallel and distributed computing (Euro-Par'2023)"

    ACM Class: I.2.10

  24. arXiv:2305.07290  [pdf, other]

    cs.CV

    The 3rd Anti-UAV Workshop & Challenge: Methods and Results

    Authors: Jian Zhao, Jianan Li, Lei Jin, Jiaming Chu, Zhihao Zhang, Jun Wang, Jiangqiang Xia, Kai Wang, Yang Liu, Sadaf Gulshad, Jiaojiao Zhao, Tianyang Xu, Xuefeng Zhu, Shihan Liu, Zheng Zhu, Guibo Zhu, Zechao Li, Zheng Wang, Baigui Sun, Yandong Guo, Shin ichi Satoh, Junliang Xing, Jane Shen Shengmei

    Abstract: The 3rd Anti-UAV Workshop & Challenge aims to encourage research in developing novel and accurate methods for multi-scale object tracking. The Anti-UAV dataset used for the Anti-UAV Challenge has been publicly released. There are two main differences between this year's competition and the previous two. First, we have expanded the existing dataset, and for the first time, released a training set s…

    Submitted 15 July, 2023; v1 submitted 12 May, 2023; originally announced May 2023.

    Comments: Technical report for 3rd Anti-UAV Workshop and Challenge. arXiv admin note: text overlap with arXiv:2108.09909

  25. Single-stage Multi-human Parsing via Point Sets and Center-based Offsets

    Authors: Jiaming Chu, Lei Jin, Junliang Xing, Jian Zhao

    Abstract: This work studies the multi-human parsing problem. Existing methods, either following top-down or bottom-up two-stage paradigms, usually involve expensive computational costs. We instead present a high-performance Single-stage Multi-human Parsing (SMP) deep architecture that decouples the multi-human parsing problem into two fine-grained sub-problems, i.e., locating the human body and parts. SMP l…

    Submitted 22 April, 2023; originally announced April 2023.

  26. arXiv:2302.10301  [pdf, other]

    cs.CV cs.AI

    Artificial Intelligence System for Detection and Screening of Cardiac Abnormalities using Electrocardiogram Images

    Authors: Deyun Zhang, Shijia Geng, Yang Zhou, Weilun Xu, Guodong Wei, Kai Wang, Jie Yu, Qiang Zhu, Yongkui Li, Yonghong Zhao, Xingyue Chen, Rui Zhang, Zhaoji Fu, Rongbo Zhou, Yanqi E, Sumei Fan, Qinghao Zhao, Chuandong Cheng, Nan Peng, Liang Zhang, Linlin Zheng, Jianjun Chu, Hongbin Xu, Chen Tan, Jian Liu, et al. (6 additional authors not shown)

    Abstract: The artificial intelligence (AI) system has achieved expert-level performance in electrocardiogram (ECG) signal analysis. However, in underdeveloped countries or regions where the healthcare information system is imperfect, only paper ECGs can be provided. Analysis of real-world ECG images (photos or scans of paper ECGs) remains challenging due to complex environments or interference. In this stud…

    Submitted 10 February, 2023; originally announced February 2023.

    Comments: 47 pages, 29 figures

  27. arXiv:2301.06448  [pdf, other]

    cs.CE

    The Balanced Matrix Factorization for Computational Drug Repositioning

    Authors: Xinxing Yang, Genke Yang, Jian Chu

    Abstract: Computational drug repositioning aims to discover new uses of drugs that have been marketed. However, the existing models suffer from the following limitations. Firstly, in the real world, only a minority of diseases have definite treatment drugs. This leads to an imbalance in the proportion of validated drug-disease associations (positive samples) and unvalidated drug-disease associations (negati…

    Submitted 16 January, 2023; originally announced January 2023.

  28. SoccerNet 2022 Challenges Results

    Authors: Silvio Giancola, Anthony Cioppa, Adrien Deliège, Floriane Magera, Vladimir Somers, Le Kang, Xin Zhou, Olivier Barnich, Christophe De Vleeschouwer, Alexandre Alahi, Bernard Ghanem, Marc Van Droogenbroeck, Abdulrahman Darwish, Adrien Maglo, Albert Clapés, Andreas Luyts, Andrei Boiarov, Artur Xarles, Astrid Orcesi, Avijit Shah, Baoyu Fan, Bharath Comandur, Chen Chen, Chen Zhang, Chen Zhao, et al. (69 additional authors not shown)

    Abstract: The SoccerNet 2022 challenges were the second annual video understanding challenges organized by the SoccerNet team. In 2022, the challenges were composed of 6 vision-based tasks: (1) action spotting, focusing on retrieving action timestamps in long untrimmed videos, (2) replay grounding, focusing on retrieving the live moment of an action shown in a replay, (3) pitch localization, focusing on det…

    Submitted 5 October, 2022; originally announced October 2022.

    Comments: Accepted at ACM MMSports 2022

  29. arXiv:2209.15574  [pdf, other]

    cs.CG

    An improved algorithm for Generalized Čech complex construction

    Authors: Jie Chu, Mikael Vejdemo-Johansson, Ping Ji

    Abstract: In this paper, we present an algorithm that computes the generalized Čech complex for a finite set of disks where each may have a different radius in 2D space. An extension of this algorithm is also proposed for a set of balls in 3D space with different radii. To compute a $k$-simplex, we leverage the computation performed in the round of $(k-1)$-simplices such that we can reduce the number of…

    Submitted 30 September, 2022; originally announced September 2022.

    MSC Class: 68U05; 57-08 ACM Class: F.2.2; I.3.5

  30. arXiv:2208.13006  [pdf, other]

    math.OC cs.LG

    Neural Observer with Lyapunov Stability Guarantee for Uncertain Nonlinear Systems

    Authors: Song Chen, Shengze Cai, Tehuan Chen, Chao Xu, Jian Chu

    Abstract: In this paper, we propose a novel nonlinear observer based on neural networks, called neural observer, for observation tasks of linear time-invariant (LTI) systems and uncertain nonlinear systems. In particular, the neural observer designed for uncertain systems is inspired by the active disturbance rejection control, which can measure the uncertainty in real-time. The stability analysis (e.g., ex…

    Submitted 16 January, 2023; v1 submitted 27 August, 2022; originally announced August 2022.

    Comments: 15 pages, submitted to IEEE journal for possible publication

  31. arXiv:2206.04688  [pdf, other]

    cs.LG

    A New Frontier of AI: On-Device AI Training and Personalization

    Authors: Ji Joong Moon, Hyun Suk Lee, Jiho Chu, Donghak Park, Seungbaek Hong, Hyungjun Seo, Donghyeon Jeong, Sungsik Kong, MyungJoo Ham

    Abstract: Modern consumer electronic devices have started executing deep learning-based intelligence services on devices, not cloud servers, to keep personal data on devices and to reduce network and cloud costs. We find such a trend as the opportunity to personalize intelligence services by updating neural networks with user data without exposing the data out of devices: on-device training. However, the li…

    Submitted 4 January, 2024; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 12 pages, 16 figures, Accepted in ICSE 2024

  32. arXiv:2206.00262  [pdf, other]

    cs.LG cs.IR q-bio.BM

    Self-supervised Learning for Label Sparsity in Computational Drug Repositioning

    Authors: Xinxing Yang, Genke Yang, Jian Chu

    Abstract: The computational drug repositioning aims to discover new uses for marketed drugs, which can accelerate the drug development process and play an important role in the existing drug discovery system. However, the number of validated drug-disease associations is scarce compared to the number of drugs and diseases in the real world. Too few labeled samples will make the classification model unable to…

    Submitted 1 June, 2022; originally announced June 2022.

    Comments: 14 pages

  33. arXiv:2204.03649  [pdf, other]

    cs.CV

    Unsupervised Prompt Learning for Vision-Language Models

    Authors: Tony Huang, Jack Chu, Fangyun Wei

    Abstract: Contrastive vision-language models like CLIP have shown great progress in transfer learning. In the inference stage, the proper text description, also known as prompt, needs to be carefully designed to correctly classify the given images. In order to avoid laborious prompt engineering, recent works such as CoOp, CLIP-Adapter and Tip-Adapter propose to adapt vision-language models for downstream im…

    Submitted 22 August, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

  34. arXiv:2204.02688  [pdf, other]

    cs.CV

    SEAL: A Large-scale Video Dataset of Multi-grained Spatio-temporally Action Localization

    Authors: Shimin Chen, Wei Li, Chen Chen, Jianyang Gu, Jiaming Chu, Xunqiang Tao, Yandong Guo

    Abstract: In spite of many dataset efforts for human action recognition, current computer vision algorithms are still limited to coarse-grained spatial and temporal annotations among human daily life. In this paper, we introduce a novel large-scale video dataset dubbed SEAL for multi-grained Spatio-tEmporal Action Localization. SEAL consists of two kinds of annotations, SEAL Tubes and SEAL Clips. We observe…

    Submitted 6 April, 2022; originally announced April 2022.

    Comments: 17 pages, 6 figures

  35. arXiv:2203.16014  [pdf, other]

    cs.RO

    ESNI: Domestic Robots Design for Elderly and Disabled People

    Authors: Junchi Chu, Xueyun Tang

    Abstract: Our paper focuses on the research of the possibility for speech recognition intelligent agents to assist the elderly and disabled people's lives, to improve their life quality by utilizing cutting-edge technologies. After researching the attitude of elderly and disabled people toward the household agent, we propose a design framework: ESNI (Exploration, Segmentation, Navigation, Instruction) that a…

    Submitted 29 March, 2022; originally announced March 2022.

  36. arXiv:2111.14696  [pdf, other]

    cs.LG cs.AI q-bio.QM

    The Computational Drug Repositioning without Negative Sampling

    Authors: Xinxing Yang, Genke Yang, Jian Chu

    Abstract: Computational drug repositioning technology is an effective tool to accelerate drug development. Although this technique has been widely used and successful in recent decades, many existing models still suffer from multiple drawbacks such as the massive number of unvalidated drug-disease associations and the inner product. The limitations of these works are mainly due to the following two reasons:…

    Submitted 31 May, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

    Comments: 12 pages, 10 figures

  37. arXiv:2110.05603  [pdf, other]

    cs.CL cs.RO

    Generalizing to New Domains by Mapping Natural Language to Lifted LTL

    Authors: Eric Hsiung, Hiloni Mehta, Junchi Chu, Xinyu Liu, Roma Patel, Stefanie Tellex, George Konidaris

    Abstract: Recent work on using natural language to specify commands to robots has grounded that language to LTL. However, mapping natural language task specifications to LTL task specifications using language models requires probability distributions over finite vocabulary. Existing state-of-the-art methods have extended this finite vocabulary to include unseen terms from the input sequence to improve output…

    Submitted 9 March, 2022; v1 submitted 11 October, 2021; originally announced October 2021.

    Comments: 7 pages (6 + 1 references page), 3 figures, 2 tables. Accepted to ICRA 2022. To appear in Proceedings of the 2022 International Conference on Robotics and Automation, May 2022

  38. arXiv:2109.07690  [pdf, other]

    cs.LG

    The Neural Metric Factorization for Computational Drug Repositioning

    Authors: Xinxing Yang, Genke Yang, Jian Chu

    Abstract: Computational drug repositioning aims to discover new therapeutic diseases for marketed drugs and has the advantages of low cost, short development cycle, and high controllability compared to traditional drug development. The matrix factorization model has become the cornerstone technique for computational drug repositioning due to its ease of implementation and excellent scalability. However, the…

    Submitted 28 November, 2021; v1 submitted 15 September, 2021; originally announced September 2021.

    Comments: 16 pages

  39. arXiv:2106.07874  [pdf]

    cs.SD eess.AS

    Towards the Objective Speech Assessment of Smoking Status based on Voice Features: A Review of the Literature

    Authors: Zhizhong Ma, Chris Bullen, Joanna Ting Wai Chu, Ruili Wang, Yingchun Wang, Satwinder Singh

    Abstract: In smoking cessation clinical research and practice, objective validation of self-reported smoking status is crucial for ensuring the reliability of the primary outcome, that is, smoking abstinence. Speech signals convey important information about a speaker, such as age, gender, body size, emotional state, and health state. We investigated (1) if smoking could measurably alter voice features, (2)…

    Submitted 15 June, 2021; originally announced June 2021.

  40. GapPredict: A Language Model for Resolving Gaps in Draft Genome Assemblies

    Authors: Eric Chen, Justin Chu, Jessica Zhang, Rene L. Warren, Inanc Birol

    Abstract: Short-read DNA sequencing instruments can yield over 1e+12 bases per run, typically composed of reads 150 bases long. Despite this high throughput, de novo assembly algorithms have difficulty reconstructing contiguous genome sequences using short reads due to both repetitive and difficult-to-sequence regions in these genomes. Some of the short read assembly challenges are mitigated by scaffolding…

    Submitted 24 May, 2021; v1 submitted 21 May, 2021; originally announced May 2021.

    Comments: 9 pages, 7 figures. IEEE/ACM Trans Comput Biol Bioinform (2021)

  41. Affinity Space Adaptation for Semantic Segmentation Across Domains

    Authors: Wei Zhou, Yukang Wang, Jiajia Chu, Jiehua Yang, Xiang Bai, Yongchao Xu

    Abstract: Semantic segmentation with dense pixel-wise annotation has achieved excellent performance thanks to deep learning. However, the generalization of semantic segmentation in the wild remains challenging. In this paper, we address the problem of unsupervised domain adaptation (UDA) in semantic segmentation. Motivated by the fact that source and target domain have invariant semantic structures, we prop…

    Submitted 26 September, 2020; originally announced September 2020.

    Comments: Accepted by IEEE TIP

  42. arXiv:2008.03868  [pdf, ps, other]

    cs.IT eess.SP

    Robust Design for NOMA-based Multi-Beam LEO Satellite Internet of Things

    Authors: Jianhang Chu, Xiaoming Chen, Caijun Zhong, Zhaoyang Zhang

    Abstract: In this paper, we investigate the issue of massive access in a beyond fifth-generation (B5G) multi-beam low earth orbit (LEO) satellite internet of things (IoT) network in the presence of channel phase uncertainty due to channel state information (CSI) conveyance from the devices to the satellite via the gateway. Rather than time division multiple access (TDMA) or frequency division multiple acces…

    Submitted 9 August, 2020; originally announced August 2020.

  43. arXiv:2008.03468  [pdf, other]

    cs.RO

    TGK-Planner: An Efficient Topology Guided Kinodynamic Planner for Autonomous Quadrotors

    Authors: Hongkai Ye, Xin Zhou, Zhepei Wang, Chao Xu, Jian Chu, Fei Gao

    Abstract: In this paper, we propose a lightweight yet effective Topology Guided Kinodynamic planner (TGK-Planner) for quadrotor aggressive flights with limited onboard computing resources. The proposed system follows the traditional hierarchical planning workflow, with novel designs to improve the robustness and efficiency in both the pathfinding and trajectory optimization sub-modules. Firstly, we propose…

    Submitted 8 November, 2020; v1 submitted 8 August, 2020; originally announced August 2020.

  44. arXiv:2003.06321  [pdf, other]

    cs.LG stat.ML

    Micro-supervised Disturbance Learning: A Perspective of Representation Probability Distribution

    Authors: Jielei Chu, Jing Liu, Hongjun Wang, Meng Hua, Zhiguo Gong, Tianrui Li

    Abstract: Existing representation learning methods based on Euclidean distance are shown to be unstable under a broad set of conditions. Furthermore, the scarcity and high cost of labels prompt us to explore more expressive representation learning methods that depend on as few labels as possible. To address these issues, the small-perturbation ideology is firstly introduced on the representa…

    Submitted 6 October, 2021; v1 submitted 13 March, 2020; originally announced March 2020.

    Comments: 14 pages

  45. arXiv:2003.06113  [pdf, ps, other]

    cs.LG eess.SP stat.ML

    Ultra Efficient Transfer Learning with Meta Update for Cross Subject EEG Classification

    Authors: Tiehang Duan, Mihir Chauhan, Mohammad Abuzar Shaikh, Jun Chu, Sargur Srihari

    Abstract: The pattern of Electroencephalogram (EEG) signals differs significantly across subjects, which poses challenges for EEG classifiers in terms of 1) effectively adapting a learned classifier to a new subject and 2) retaining knowledge of known subjects after the adaptation. We propose an efficient transfer learning method, named Meta UPdate Strategy (MUPS-EEG), for continuous EEG classification…

    Submitted 1 March, 2021; v1 submitted 13 March, 2020; originally announced March 2020.

  46. arXiv:2002.10629  [pdf, other]

    cs.RO math.OC

    Alternating Minimization Based Trajectory Generation for Quadrotor Aggressive Flight

    Authors: Zhepei Wang, Xin Zhou, Chao Xu, Jian Chu, Fei Gao

    Abstract: Although much research has been conducted into trajectory planning for quadrotors, planning spatially and temporally optimal trajectories in real time is still challenging. In this paper, we propose a framework for generating large-scale piecewise polynomial trajectories for aggressive autonomous flights, with highlights on its superior computational efficiency and simultaneous spatial-temporal optim…

    Submitted 24 February, 2020; originally announced February 2020.

    Comments: The paper is submitted to RA-L/IROS 2020

  47. arXiv:1906.05173  [pdf, other]

    cs.LG stat.ML

    Multi-local Collaborative AutoEncoder

    Authors: Jielei Chu, Hongjun Wang, Jing Liu, Zhiguo Gong, Tianrui Li

    Abstract: The excellent representation learning performance of autoencoders has attracted considerable interest in various applications. However, the structure and multi-local collaborative relationships of unlabeled data are ignored in their encoding procedure, which limits the capability of feature extraction. This paper presents a Multi-local Collaborative AutoEncoder (MC-AE), which consists of novel m…

    Submitted 8 October, 2021; v1 submitted 12 June, 2019; originally announced June 2019.

  48. arXiv:1905.08736  [pdf]

    physics.ao-ph cs.LG stat.AP

    Identification of synoptic weather types over Taiwan area with multiple classifiers

    Authors: Shih-Hao Su, Jung-Lien Chu, Ting-Shuo Yo, Lee-Yaw Lin

    Abstract: In this study, a novel machine learning approach was used to classify three types of synoptic weather events in the Taiwan area from 2001 to 2010. We used reanalysis data with three machine learning algorithms to recognize weather systems and evaluated their performance. Overall, the classifiers successfully identified 52-83% of weather events (hit rate), which is higher than the performance of tradit…

    Submitted 21 May, 2019; originally announced May 2019.

    Comments: journal article, open access

    Journal ref: Atmos Sci Lett.2018;e861

  49. Hybrid Feature Learning for Handwriting Verification

    Authors: Mohammad Abuzar Shaikh, Mihir Chauhan, Jun Chu, Sargur Srihari

    Abstract: We propose an effective Hybrid Deep Learning (HDL) architecture for the task of determining the probability that a questioned handwritten word has been written by a known writer. HDL is an amalgamation of Auto-Learned Features (ALF) and Human-Engineered Features (HEF). To extract auto-learned features we use two methods: First, Two Channel Convolutional Neural Network (TC-CNN); Second, Two Channel…

    Submitted 18 November, 2018; originally announced December 2018.

    Comments: Accepted and presented in International Conference on Frontiers in Handwriting Recognition (ICFHR) 2018

  50. arXiv:1812.01967  [pdf, other]

    cs.LG stat.ML

    Unsupervised Feature Learning Architecture with Multi-clustering Integration RBM

    Authors: Jielei Chu, Hongjun Wang, Jing Liu, Zhiguo Gong, Tianrui Li

    Abstract: In this paper, we present a novel unsupervised feature learning architecture, which consists of a multi-clustering integration module and a variant of RBM termed multi-clustering integration RBM (MIRBM). In the multi-clustering integration module, we apply three unsupervised K-means, affinity propagation and spectral clustering algorithms to obtain three different clustering partitions (CPs) witho…

    Submitted 2 April, 2020; v1 submitted 5 December, 2018; originally announced December 2018.