Skip to main content

Showing 1–32 of 32 results for author: Yi, Q

  1. arXiv:2406.03250  [pdf, other

    cs.CV cs.AI

    Prompt-based Visual Alignment for Zero-shot Policy Transfer

    Authors: Haihan Gao, Rui Zhang, Qi Yi, Hantao Yao, Haochen Li, Jiaming Guo, Shaohui Peng, Yunkai Gao, QiCheng Wang, Xing Hu, Yuanbo Wen, Zihao Zhang, Zidong Du, Ling Li, Qi Guo, Yunji Chen

    Abstract: Overfitting in RL has become one of the main obstacles to applications in reinforcement learning(RL). Existing methods do not provide explicit semantic constrain for the feature extractor, hindering the agent from learning a unified cross-domain representation and resulting in performance degradation on unseen domains. Besides, abundant data from multiple domains are needed. To address these issue… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: This paper has been accepted by ICML2024

  2. arXiv:2405.09923  [pdf, other

    cs.CV eess.IV

    NTIRE 2024 Restore Any Image Model (RAIM) in the Wild Challenge

    Authors: Jie Liang, Radu Timofte, Qiaosi Yi, Shuaizheng Liu, Lingchen Sun, Rongyuan Wu, Xindong Zhang, Hui Zeng, Lei Zhang

    Abstract: In this paper, we review the NTIRE 2024 challenge on Restore Any Image Model (RAIM) in the Wild. The RAIM challenge constructed a benchmark for image restoration in the wild, including real-world images with/without reference ground truth in various scenarios from real applications. The participants were required to restore the real-captured images from complex and unknown degradation, where gener… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  3. arXiv:2402.06306  [pdf, other

    cs.IT eess.SP

    Multi-Modal Concurrent Transmission

    Authors: Majid Nasiri Khormuji, Alberto Giuseppe Perotti, Qin Yi, Branislav Popovic

    Abstract: This paper introduces a novel physical-layer method labelled as Multi-Modal Concurrent Transmission (MMCT) for efficient transmission of multiple data streams with different reliability-latency performance requirements. The MMCT arranges data from multiple streams within a same physical-layer transport block wherein stream-specific modulation and coding scheme (MCS) selection is combined with join… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

    Comments: 6 pages, 4 figures, 1 table

    Journal ref: 2024 IEEE Wireless Communications and Networking Conference

  4. arXiv:2312.06162  [pdf, other

    cs.CV

    Textual Prompt Guided Image Restoration

    Authors: Qiuhai Yan, Aiwen Jiang, Kang Chen, Long Peng, Qiaosi Yi, Chunjie Zhang

    Abstract: Image restoration has always been a cutting-edge topic in the academic and industrial fields of computer vision. Since degradation signals are often random and diverse, "all-in-one" models that can do blind image restoration have been concerned in recent years. Early works require training specialized headers and tails to handle each degradation of concern, which are manually cumbersome. Recent wo… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: 12 pages, 10figures

  5. arXiv:2311.04474  [pdf, other

    cs.AI

    Emergent Communication for Rules Reasoning

    Authors: Yuxuan Guo, Yifan Hao, Rui Zhang, Enshuai Zhou, Zidong Du, Xishan Zhang, Xinkai Song, Yuanbo Wen, Yongwei Zhao, Xuehai Zhou, Jiaming Guo, Qi Yi, Shaohui Peng, Di Huang, Ruizhi Chen, Qi Guo, Yunji Chen

    Abstract: Research on emergent communication between deep-learning-based agents has received extensive attention due to its inspiration for linguistics and artificial intelligence. However, previous attempts have hovered around emerging communication under perception-oriented environmental settings, that forces agents to describe low-level perceptual features intra image or symbol contexts. In this work, in… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

  6. arXiv:2311.03695  [pdf, other

    cs.LG cs.AI

    Context Shift Reduction for Offline Meta-Reinforcement Learning

    Authors: Yunkai Gao, Rui Zhang, Jiaming Guo, Fan Wu, Qi Yi, Shaohui Peng, Siming Lan, Ruizhi Chen, Zidong Du, Xing Hu, Qi Guo, Ling Li, Yunji Chen

    Abstract: Offline meta-reinforcement learning (OMRL) utilizes pre-collected offline datasets to enhance the agent's generalization ability on unseen tasks. However, the context shift problem arises due to the distribution discrepancy between the contexts used for training (from the behavior policy) and testing (from the exploration policy). The context shift problem leads to incorrect task inference and fur… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

  7. arXiv:2311.02104  [pdf, other

    cs.LG cs.AI

    Efficient Symbolic Policy Learning with Differentiable Symbolic Expression

    Authors: Jiaming Guo, Rui Zhang, Shaohui Peng, Qi Yi, Xing Hu, Ruizhi Chen, Zidong Du, Xishan Zhang, Ling Li, Qi Guo, Yunji Chen

    Abstract: Deep reinforcement learning (DRL) has led to a wide range of advances in sequential decision-making tasks. However, the complexity of neural network policies makes it difficult to understand and deploy with limited computational resources. Currently, employing compact symbolic expressions as symbolic policies is a promising strategy to obtain simple and interpretable policies. Previous symbolic po… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: Accepted by NeurIPS2023

  8. arXiv:2311.01771  [pdf, other

    cs.LG stat.ML

    Efficient Generalized Low-Rank Tensor Contextual Bandits

    Authors: Qianxin Yi, Yiyang Yang, Shaojie Tang, Jiapeng Liu, Yao Wang

    Abstract: In this paper, we aim to build a novel bandits algorithm that is capable of fully harnessing the power of multi-dimensional data and the inherent non-linearity of reward functions to provide high-usable and accountable decision-making services. To this end, we introduce a generalized low-rank tensor contextual bandits model in which an action is formed from three feature vectors, and thus can be r… ▽ More

    Submitted 17 January, 2024; v1 submitted 3 November, 2023; originally announced November 2023.

  9. arXiv:2311.01075  [pdf, other

    cs.LG

    Contrastive Modules with Temporal Attention for Multi-Task Reinforcement Learning

    Authors: Siming Lan, Rui Zhang, Qi Yi, Jiaming Guo, Shaohui Peng, Yunkai Gao, Fan Wu, Ruizhi Chen, Zidong Du, Xing Hu, Xishan Zhang, Ling Li, Yunji Chen

    Abstract: In the field of multi-task reinforcement learning, the modular principle, which involves specializing functionalities into different modules and combining them appropriately, has been widely adopted as a promising approach to prevent the negative transfer problem that performance degradation due to conflicts between tasks. However, most of the existing multi-task RL methods only combine shared mod… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: This paper has been accepted at NeurIPS 2023 as a poster

  10. arXiv:2309.01352  [pdf, other

    cs.CL cs.AI

    Self-driven Grounding: Large Language Model Agents with Automatical Language-aligned Skill Learning

    Authors: Shaohui Peng, Xing Hu, Qi Yi, Rui Zhang, Jiaming Guo, Di Huang, Zikang Tian, Ruizhi Chen, Zidong Du, Qi Guo, Yunji Chen, Ling Li

    Abstract: Large language models (LLMs) show their powerful automatic reasoning and planning capability with a wealth of semantic knowledge about the human world. However, the grounding problem still hinders the applications of LLMs in the real-world environment. Existing studies try to fine-tune the LLM or utilize pre-defined behavior APIs to bridge the LLMs and the environment, which not only costs huge hu… ▽ More

    Submitted 4 September, 2023; originally announced September 2023.

  11. arXiv:2307.06608  [pdf, other

    cs.LG cs.AI cs.CR

    Introducing Foundation Models as Surrogate Models: Advancing Towards More Practical Adversarial Attacks

    Authors: Jiaming Zhang, Jitao Sang, Qi Yi, Changsheng Xu

    Abstract: Recently, the no-box adversarial attack, in which the attacker lacks access to the model's architecture, weights, and training data, become the most practical and challenging attack setup. However, there is an unawareness of the potential and flexibility inherent in the surrogate model selection process on no-box setting. Inspired by the burgeoning interest in utilizing foundational models to addr… ▽ More

    Submitted 13 July, 2023; v1 submitted 13 July, 2023; originally announced July 2023.

  12. arXiv:2306.07307  [pdf, other

    cs.LG cs.AI

    Online Prototype Alignment for Few-shot Policy Transfer

    Authors: Qi Yi, Rui Zhang, Shaohui Peng, Jiaming Guo, Yunkai Gao, Kaizhao Yuan, Ruizhi Chen, Siming Lan, Xing Hu, Zidong Du, Xishan Zhang, Qi Guo, Yunji Chen

    Abstract: Domain adaptation in reinforcement learning (RL) mainly deals with the changes of observation when transferring the policy to a new environment. Many traditional approaches of domain adaptation in RL manage to learn a mapping function between the source and target domain in explicit or implicit ways. However, they typically require access to abundant data from the target domain. Besides, they ofte… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: This paper has been accepted at ICML2023

  13. arXiv:2303.05069  [pdf, other

    cs.LG

    Conceptual Reinforcement Learning for Language-Conditioned Tasks

    Authors: Shaohui Peng, Xing Hu, Rui Zhang, Jiaming Guo, Qi Yi, Ruizhi Chen, Zidong Du, Ling Li, Qi Guo, Yunji Chen

    Abstract: Despite the broad application of deep reinforcement learning (RL), transferring and adapting the policy to unseen but similar environments is still a significant challenge. Recently, the language-conditioned policy is proposed to facilitate policy transfer through learning the joint representation of observation and text that catches the compact and invariant information across environments. Exist… ▽ More

    Submitted 9 March, 2023; originally announced March 2023.

    Comments: Accepted by AAAI 2023

  14. arXiv:2301.01217  [pdf, other

    cs.CR cs.LG

    Unlearnable Clusters: Towards Label-agnostic Unlearnable Examples

    Authors: Jiaming Zhang, Xingjun Ma, Qi Yi, Jitao Sang, Yu-Gang Jiang, Yaowei Wang, Changsheng Xu

    Abstract: There is a growing interest in developing unlearnable examples (UEs) against visual privacy leaks on the Internet. UEs are training samples added with invisible but unlearnable noise, which have been found can prevent unauthorized training of machine learning models. UEs typically are generated via a bilevel optimization framework with a surrogate model to remove (minimize) errors from the origina… ▽ More

    Submitted 23 March, 2023; v1 submitted 30 December, 2022; originally announced January 2023.

    Comments: CVPR2023

  15. arXiv:2210.07802  [pdf, other

    cs.LG cs.AI

    Object-Category Aware Reinforcement Learning

    Authors: Qi Yi, Rui Zhang, Shaohui Peng, Jiaming Guo, Xing Hu, Zidong Du, Xishan Zhang, Qi Guo, Yunji Chen

    Abstract: Object-oriented reinforcement learning (OORL) is a promising way to improve the sample efficiency and generalization ability over standard RL. Recent works that try to solve OORL tasks without additional feature engineering mainly focus on learning the object representations and then solving tasks via reasoning based on these object representations. However, none of these works tries to explicitly… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: This paper is to be published on NeurIPS 2022

  16. arXiv:2210.06964  [pdf, other

    cs.LG

    Causality-driven Hierarchical Structure Discovery for Reinforcement Learning

    Authors: Shaohui Peng, Xing Hu, Rui Zhang, Ke Tang, Jiaming Guo, Qi Yi, Ruizhi Chen, Xishan Zhang, Zidong Du, Ling Li, Qi Guo, Yunji Chen

    Abstract: Hierarchical reinforcement learning (HRL) effectively improves agents' exploration efficiency on tasks with sparse reward, with the guide of high-quality hierarchical structures (e.g., subgoals or options). However, how to automatically discover high-quality hierarchical structures is still a great challenge. Previous HRL methods can hardly discover the hierarchical structures in complex environme… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: Accepted by NeurIPS 2022

  17. arXiv:2206.09410  [pdf, other

    cs.CV eess.IV

    Low-Mid Adversarial Perturbation against Unauthorized Face Recognition System

    Authors: Jiaming Zhang, Qi Yi, Dongyuan Lu, Jitao Sang

    Abstract: In light of the growing concerns regarding the unauthorized use of facial recognition systems and its implications on individual privacy, the exploration of adversarial perturbations as a potential countermeasure has gained traction. However, challenges arise in effectively deploying this approach against unauthorized facial recognition systems due to the effects of JPEG compression on image distr… ▽ More

    Submitted 2 September, 2023; v1 submitted 19 June, 2022; originally announced June 2022.

    Comments: published in Information Sciences

  18. arXiv:2206.09391  [pdf, other

    cs.LG cs.CL cs.CV cs.MM

    Towards Adversarial Attack on Vision-Language Pre-training Models

    Authors: Jiaming Zhang, Qi Yi, Jitao Sang

    Abstract: While vision-language pre-training model (VLP) has shown revolutionary improvements on various vision-language (V+L) tasks, the studies regarding its adversarial robustness remain largely unexplored. This paper studied the adversarial attack on popular VLP models and V+L tasks. First, we analyzed the performance of adversarial attacks under different settings. By examining the influence of differe… ▽ More

    Submitted 19 October, 2022; v1 submitted 19 June, 2022; originally announced June 2022.

    Comments: Accepted by ACM MM2022. Code is available in GitHub

  19. arXiv:2206.05475  [pdf, other

    cs.LG cs.MM

    Reducing Capacity Gap in Knowledge Distillation with Review Mechanism for Crowd Counting

    Authors: Yunxin Liu, Qiaosi Yi, Jinshan Zeng

    Abstract: The lightweight crowd counting models, in particular knowledge distillation (KD) based models, have attracted rising attention in recent years due to their superiority on computational efficiency and hardware requirement. However, existing KD based models usually suffer from the capacity gap issue, resulting in the performance of the student network being limited by the teacher network. In this pa… ▽ More

    Submitted 11 June, 2022; originally announced June 2022.

  20. arXiv:2204.13873  [pdf, other

    cs.CV eess.IV

    Multiple Degradation and Reconstruction Network for Single Image Denoising via Knowledge Distillation

    Authors: Juncheng Li, Hanhui Yang, Qiaosi Yi, Faming Fang, Guangwei Gao, Tieyong Zeng, Guixu Zhang

    Abstract: Single image denoising (SID) has achieved significant breakthroughs with the development of deep learning. However, the proposed methods are often accompanied by plenty of parameters, which greatly limits their application scenarios. Different from previous works that blindly increase the depth of the network, we explore the degradation mechanism of the noisy image and propose a lightweight Multip… ▽ More

    Submitted 29 April, 2022; originally announced April 2022.

    Comments: Accepted by CVPR Workshop 2022

  21. arXiv:2112.15443  [pdf

    cs.AR

    FPGA Based Accelerator for Neural Networks Computation with Flexible Pipelining

    Authors: Qingyang Yi, Heming Sun, Masahiro Fujita

    Abstract: FPGA is appropriate for fix-point neural networks computing due to high power efficiency and configurability. However, its design must be intensively refined to achieve high performance using limited hardware resources. We present an FPGA-based neural networks accelerator and its optimization framework, which can achieve optimal efficiency for various CNN models and FPGA resources. Targeting high… ▽ More

    Submitted 28 December, 2021; originally announced December 2021.

    Comments: 6 pages

  22. arXiv:2111.15200  [pdf, other

    eess.IV cs.CV

    Contrastive Learning for Local and Global Learning MRI Reconstruction

    Authors: Qiaosi Yi, Jinhao Liu, Le Hu, Faming Fang, Guixu Zhang

    Abstract: Magnetic Resonance Imaging (MRI) is an important medical imaging modality, while it requires a long acquisition time. To reduce the acquisition time, various methods have been proposed. However, these methods failed to reconstruct images with a clear structure for two main reasons. Firstly, similar patches widely exist in MR images, while most previous deep learning-based methods ignore this prope… ▽ More

    Submitted 30 November, 2021; originally announced November 2021.

  23. arXiv:2109.11703  [pdf, other

    cs.LG

    An Improved Frequent Directions Algorithm for Low-Rank Approximation via Block Krylov Iteration

    Authors: Chenhao Wang, Qianxin Yi, Xiuwu Liao, Yao Wang

    Abstract: Frequent Directions, as a deterministic matrix sketching technique, has been proposed for tackling low-rank approximation problems. This method has a high degree of accuracy and practicality, but experiences a lot of computational cost for large-scale data. Several recent works on the randomized version of Frequent Directions greatly improve the computational efficiency, but unfortunately sacrific… ▽ More

    Submitted 4 March, 2022; v1 submitted 23 September, 2021; originally announced September 2021.

  24. arXiv:2108.10129  [pdf, other

    cs.LG stat.ML

    Effective Streaming Low-tubal-rank Tensor Approximation via Frequent Directions

    Authors: Qianxin Yi, Chenhao Wang, Kaidong Wang, Yao Wang

    Abstract: Low-tubal-rank tensor approximation has been proposed to analyze large-scale and multi-dimensional data. However, finding such an accurate approximation is challenging in the streaming setting, due to the limited computational resources. To alleviate this issue, this paper extends a popular matrix sketching technique, namely Frequent Directions, for constructing an efficient and accurate low-tubal… ▽ More

    Submitted 23 August, 2021; originally announced August 2021.

  25. arXiv:2108.09079  [pdf, other

    cs.CV

    Structure-Preserving Deraining with Residue Channel Prior Guidance

    Authors: Qiaosi Yi, Juncheng Li, Qinyan Dai, Faming Fang, Guixu Zhang, Tieyong Zeng

    Abstract: Single image deraining is important for many high-level computer vision tasks since the rain streaks can severely degrade the visibility of images, thereby affecting the recognition and analysis of the image. Recently, many CNN-based methods have been proposed for rain removal. Although these methods can remove part of the rain streaks, it is difficult for them to adapt to real-world scenarios and… ▽ More

    Submitted 20 August, 2021; originally announced August 2021.

  26. arXiv:2107.12216  [pdf, other

    cs.LG cs.AI

    Hindsight Value Function for Variance Reduction in Stochastic Dynamic Environment

    Authors: Jiaming Guo, Rui Zhang, Xishan Zhang, Shaohui Peng, Qi Yi, Zidong Du, Xing Hu, Qi Guo, Yunji Chen

    Abstract: Policy gradient methods are appealing in deep reinforcement learning but suffer from high variance of gradient estimate. To reduce the variance, the state value function is applied commonly. However, the effect of the state value function becomes limited in stochastic dynamic environments, where the unexpected state dynamics and rewards will increase the variance. In this paper, we propose to repl… ▽ More

    Submitted 5 August, 2021; v1 submitted 26 July, 2021; originally announced July 2021.

    Comments: Accepted by IJCAI2021

  27. arXiv:2106.15779  [pdf, other

    cs.IR

    Dual Adversarial Variational Embedding for Robust Recommendation

    Authors: Qiaomin Yi, Ning Yang, Philip S. Yu

    Abstract: Robust recommendation aims at capturing true preference of users from noisy data, for which there are two lines of methods have been proposed. One is based on noise injection, and the other is to adopt the generative model Variational Auto-encoder (VAE). However, the existing works still face two challenges. First, the noise injection based methods often draw the noise from a fixed noise distribut… ▽ More

    Submitted 29 June, 2021; originally announced June 2021.

  28. arXiv:2106.10989  [pdf, other

    cs.CV cs.AI

    ImageNet Pre-training also Transfers Non-Robustness

    Authors: Jiaming Zhang, Jitao Sang, Qi Yi, Yunfan Yang, Huiwen Dong, Jian Yu

    Abstract: ImageNet pre-training has enabled state-of-the-art results on many tasks. In spite of its recognized contribution to generalization, we observed in this study that ImageNet pre-training also transfers adversarial non-robustness from pre-trained model into fine-tuned model in the downstream classification tasks. We first conducted experiments on various datasets and network backbones to uncover the… ▽ More

    Submitted 5 December, 2022; v1 submitted 21 June, 2021; originally announced June 2021.

    Comments: Accepted by AAAI2023

  29. arXiv:2106.00985  [pdf, other

    cs.CV

    Feedback Network for Mutually Boosted Stereo Image Super-Resolution and Disparity Estimation

    Authors: Qinyan Dai, Juncheng Li, Qiaosi Yi, Faming Fang, Guixu Zhang

    Abstract: Under stereo settings, the problem of image super-resolution (SR) and disparity estimation are interrelated that the result of each problem could help to solve the other. The effective exploitation of correspondence between different views facilitates the SR performance, while the high-resolution (HR) features with richer details benefit the correspondence estimation. According to this motivation,… ▽ More

    Submitted 2 June, 2021; originally announced June 2021.

  30. arXiv:2102.12135  [pdf, other

    cs.CV

    Efficient and Accurate Multi-scale Topological Network for Single Image Dehazing

    Authors: Qiaosi Yi, Juncheng Li, Faming Fang, Aiwen Jiang, Guixu Zhang

    Abstract: Single image dehazing is a challenging ill-posed problem that has drawn significant attention in the last few years. Recently, convolutional neural networks have achieved great success in image dehazing. However, it is still difficult for these increasingly complex models to recover accurate details from the hazy image. In this paper, we pay attention to the feature extraction and utilization of t… ▽ More

    Submitted 24 February, 2021; originally announced February 2021.

  31. arXiv:2101.01479  [pdf, other

    cs.CV

    Scale-Aware Network with Regional and Semantic Attentions for Crowd Counting under Cluttered Background

    Authors: Qiaosi Yi, Yunxing Liu, Aiwen Jiang, Juncheng Li, Kangfu Mei, Mingwen Wang

    Abstract: Crowd counting is an important task that shown great application value in public safety-related fields, which has attracted increasing attention in recent years. In the current research, the accuracy of counting numbers and crowd density estimation are the main concerns. Although the emergence of deep learning has greatly promoted the development of this field, crowd counting under cluttered backg… ▽ More

    Submitted 7 January, 2021; v1 submitted 5 January, 2021; originally announced January 2021.

  32. arXiv:2006.13511  [pdf, other

    cs.CV

    Disentangle Perceptual Learning through Online Contrastive Learning

    Authors: Kangfu Mei, Yao Lu, Qiaosi Yi, Haoyu Wu, Juncheng Li, Rui Huang

    Abstract: Pursuing realistic results according to human visual perception is the central concern in the image transformation tasks. Perceptual learning approaches like perceptual loss are empirically powerful for such tasks but they usually rely on the pre-trained classification network to provide features, which are not necessarily optimal in terms of visual perception of image transformation. In this pape… ▽ More

    Submitted 24 June, 2020; originally announced June 2020.

    Comments: 12 pages, 8 figures