Skip to main content

Showing 1–14 of 14 results for author: Cui, F

  1. arXiv:2404.11576  [pdf, other

    cs.CV

    State-space Decomposition Model for Video Prediction Considering Long-term Motion Trend

    Authors: Fei Cui, Jiaojiao Fang, Xiaojiang Wu, Zelong Lai, Mengke Yang, Menghan Jia, Guizhong Liu

    Abstract: Stochastic video prediction enables the consideration of uncertainty in future motion, thereby providing a better reflection of the dynamic nature of the environment. Stochastic video prediction methods based on image auto-regressive recurrent models need to feed their predictions back into the latent space. Conversely, the state-space models, which decouple frame synthesis and temporal prediction… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  2. arXiv:2306.17484  [pdf, other

    cs.LG

    Landmark Guided Active Exploration with State-specific Balance Coefficient

    Authors: Fei Cui, Jiaojiao Fang, Mengke Yang, Guizhong Liu

    Abstract: Goal-conditioned hierarchical reinforcement learning (GCHRL) decomposes long-horizon tasks into sub-tasks through a hierarchical framework and it has demonstrated promising results across a variety of domains. However, the high-level policy's action space is often excessively large, presenting a significant challenge to effective exploration and resulting in potentially inefficient training. In th… ▽ More

    Submitted 17 April, 2024; v1 submitted 30 June, 2023; originally announced June 2023.

  3. arXiv:2303.10912  [pdf, other

    cs.SD cs.CL cs.LG eess.AS

    Exploring Representation Learning for Small-Footprint Keyword Spotting

    Authors: Fan Cui, Liyong Guo, Quandong Wang, Peng Gao, Yujun Wang

    Abstract: In this paper, we investigate representation learning for low-resource keyword spotting (KWS). The main challenges of KWS are limited labeled data and limited available device resources. To address those challenges, we explore representation learning for KWS by self-supervised contrastive learning and self-training with pretrained model. First, local-global contrastive siamese networks (LGCSiam) a… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

  4. arXiv:2303.10897  [pdf, other

    cs.SD cs.CL eess.AS q-bio.NC

    Relate auditory speech to EEG by shallow-deep attention-based network

    Authors: Fan Cui, Liyong Guo, Lang He, Jiyao Liu, ErCheng Pei, Yujun Wang, Dongmei Jiang

    Abstract: Electroencephalography (EEG) plays a vital role in detecting how brain responses to different stimulus. In this paper, we propose a novel Shallow-Deep Attention-based Network (SDANet) to classify the correct auditory stimulus evoking the EEG signal. It adopts the Attention-based Correlation Module (ACM) to discover the connection between auditory speech and EEG from global aspect, and the Shallow-… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

  5. arXiv:2303.05678  [pdf, other

    cs.SD cs.LG eess.AS

    Improving Weakly Supervised Sound Event Detection with Causal Intervention

    Authors: Yifei Xin, Dongchao Yang, Fan Cui, Yujun Wang, Yuexian Zou

    Abstract: Existing weakly supervised sound event detection (WSSED) work has not explored both types of co-occurrences simultaneously, i.e., some sound events often co-occur, and their occurrences are usually accompanied by specific background sounds, so they would be inevitably entangled, causing misclassification and biased localization results with only clip-level supervision. To tackle this issue, we fir… ▽ More

    Submitted 9 March, 2023; originally announced March 2023.

    Comments: Accepted by ICASSP2023

  6. arXiv:2211.00508  [pdf, other

    eess.AS cs.CL cs.SD

    Predicting Multi-Codebook Vector Quantization Indexes for Knowledge Distillation

    Authors: Liyong Guo, Xiaoyu Yang, Quandong Wang, Yuxiang Kong, Zengwei Yao, Fan Cui, Fangjun Kuang, Wei Kang, Long Lin, Mingshuang Luo, Piotr Zelasko, Daniel Povey

    Abstract: Knowledge distillation(KD) is a common approach to improve model performance in automatic speech recognition (ASR), where a student model is trained to imitate the output behaviour of a teacher model. However, traditional KD methods suffer from teacher label storage issue, especially when the training corpora are large. Although on-the-fly teacher label generation tackles this issue, the training… ▽ More

    Submitted 31 October, 2022; originally announced November 2022.

    Comments: Submitted to ICASSP 2022

  7. arXiv:2205.11078  [pdf, other

    stat.ML cs.LG math.ST

    Beyond EM Algorithm on Over-specified Two-Component Location-Scale Gaussian Mixtures

    Authors: Tongzheng Ren, Fuheng Cui, Sujay Sanghavi, Nhat Ho

    Abstract: The Expectation-Maximization (EM) algorithm has been predominantly used to approximate the maximum likelihood estimation of the location-scale Gaussian mixtures. However, when the models are over-specified, namely, the chosen number of components to fit the data is larger than the unknown true number of components, EM needs a polynomial number of iterations in terms of the sample size to reach the… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

    Comments: 38 pages, 4 figures. Tongzheng Ren and Fuheng Cui contributed equally to this work

  8. arXiv:2112.10153  [pdf, other

    cs.SD eess.AS

    Detect what you want: Target Sound Detection

    Authors: Dongchao Yang, Helin Wang, Yuexian Zou, Fan Cui, Yujun Wang

    Abstract: Human beings can perceive a target sound type from a multi-source mixture signal by the selective auditory attention, however, such functionality was hardly ever explored in machine hearing. This paper addresses the target sound detection (TSD) task, which aims to detect the target sound signal from a mixture audio when a target sound's reference audio is given. We present a novel target sound det… ▽ More

    Submitted 7 July, 2022; v1 submitted 19 December, 2021; originally announced December 2021.

    Comments: Submitted to DCASE workshop2022

  9. arXiv:2110.07810  [pdf, other

    cs.LG math.ST stat.ML

    Towards Statistical and Computational Complexities of Polyak Step Size Gradient Descent

    Authors: Tongzheng Ren, Fuheng Cui, Alexia Atsidakou, Sujay Sanghavi, Nhat Ho

    Abstract: We study the statistical and computational complexities of the Polyak step size gradient descent algorithm under generalized smoothness and Lojasiewicz conditions of the population loss function, namely, the limit of the empirical loss function when the sample size goes to infinity, and the stability between the gradients of the empirical and population loss functions, namely, the polynomial growt… ▽ More

    Submitted 14 October, 2021; originally announced October 2021.

    Comments: First three authors contributed equally. 40 pages, 4 figures

  10. arXiv:2106.15057  [pdf, other

    cs.LG

    Cross-domain error minimization for unsupervised domain adaptation

    Authors: Yuntao Du, Yinghao Chen, Fengli Cui, Xiaowen Zhang, Chongjun Wang

    Abstract: Unsupervised domain adaptation aims to transfer knowledge from a labeled source domain to an unlabeled target domain. Previous methods focus on learning domain-invariant features to decrease the discrepancy between the feature distributions as well as minimizing the source error and have made remarkable progress. However, a recently proposed theory reveals that such a strategy is not sufficient fo… ▽ More

    Submitted 28 June, 2021; originally announced June 2021.

    Comments: Accepted by DASFAA 2021

  11. Predicting Biomedical Interactions with Higher-Order Graph Convolutional Networks

    Authors: Kishan KC, Rui Li, Feng Cui, Anne Haake

    Abstract: Biomedical interaction networks have incredible potential to be useful in the prediction of biologically meaningful interactions, identification of network biomarkers of disease, and the discovery of putative drug targets. Recently, graph neural networks have been proposed to effectively learn representations for biomedical entities and achieved state-of-the-art results in biomedical interaction p… ▽ More

    Submitted 16 October, 2020; originally announced October 2020.

  12. arXiv:2010.08514  [pdf, other

    cs.LG q-bio.QM

    Interpretable Structured Learning with Sparse Gated Sequence Encoder for Protein-Protein Interaction Prediction

    Authors: Kishan KC, Feng Cui, Anne Haake, Rui Li

    Abstract: Predicting protein-protein interactions (PPIs) by learning informative representations from amino acid sequences is a challenging yet important problem in biology. Although various deep learning models in Siamese architecture have been proposed to model PPIs from sequences, these methods are computationally expensive for a large number of PPIs due to the pairwise encoding process. Furthermore, the… ▽ More

    Submitted 16 October, 2020; originally announced October 2020.

  13. arXiv:1702.07673  [pdf, ps, other

    cs.OH

    Modulation and Multiple Access for 5G Networks

    Authors: Yunlong Cai, Zhijin Qin, Fangyu Cui, Geoffrey Ye Li, Julie A. McCann

    Abstract: Fifth generation (5G) wireless networks face various challenges in order to support large-scale heterogeneous traffic and users, therefore new modulation and multiple access (MA) schemes are being developed to meet the changing demands. As this research space is ever increasing, it becomes more important to analyze the various approaches, therefore in this article we present a comprehensive overvi… ▽ More

    Submitted 21 February, 2017; originally announced February 2017.

  14. arXiv:1611.06306  [pdf, ps, other

    cs.LG

    Cross-model convolutional neural network for multiple modality data representation

    Authors: Yanbin Wu, Li Wang, Fan Cui, Hongbin Zhai, Baoming Dong, Jim Jing-Yan Wang

    Abstract: A novel data representation method of convolutional neural net- work (CNN) is proposed in this paper to represent data of different modalities. We learn a CNN model for the data of each modality to map the data of differ- ent modalities to a common space, and regularize the new representations in the common space by a cross-model relevance matrix. We further impose that the class label of data poi… ▽ More

    Submitted 19 November, 2016; originally announced November 2016.