Showing 1–41 of 41 results for author: Bian, W

  1. arXiv:2406.17804  [pdf, ps, other]

    physics.med-ph cs.CV eess.IV

    A Review of Electromagnetic Elimination Methods for low-field portable MRI scanner

    Authors: Wanyu Bian

    Abstract: This paper presents a comprehensive analysis of both conventional and deep learning methods for eliminating electromagnetic interference (EMI) in MRI systems. We explore the underlying principles and implementation of traditional analytical and adaptive EMI elimination techniques, as well as cutting-edge deep learning approaches. Through a detailed comparison, the strengths and limitations of each…

    Submitted 22 June, 2024; originally announced June 2024.

  2. arXiv:2406.02626  [pdf, ps, other]

    eess.IV cs.CV math.OC

    A Brief Overview of Optimization-Based Algorithms for MRI Reconstruction Using Deep Learning

    Authors: Wanyu Bian

    Abstract: Magnetic resonance imaging (MRI) is renowned for its exceptional soft tissue contrast and high spatial resolution, making it a pivotal tool in medical imaging. The integration of deep learning algorithms offers significant potential for optimizing MRI reconstruction processes. Despite the growing body of research in this area, a comprehensive survey of optimization-based deep learning models tailo…

    Submitted 3 June, 2024; originally announced June 2024.

  3. arXiv:2405.18407  [pdf, other]

    cs.LG cs.CV

    Phased Consistency Model

    Authors: Fu-Yun Wang, Zhaoyang Huang, Alexander William Bergman, Dazhong Shen, Peng Gao, Michael Lingelbach, Keqiang Sun, Weikang Bian, Guanglu Song, Yu Liu, Hongsheng Li, Xiaogang Wang

    Abstract: The consistency model (CM) has recently made significant progress in accelerating the generation of diffusion models. However, its application to high-resolution, text-conditioned image generation in the latent space (a.k.a., LCM) remains unsatisfactory. In this paper, we identify three key flaws in the current design of LCM. We investigate the reasons behind these limitations and propose the Phas…

    Submitted 28 May, 2024; originally announced May 2024.

  4. arXiv:2404.14700  [pdf, other]

    eess.AS cs.AI cs.CL cs.LG cs.SD

    FlashSpeech: Efficient Zero-Shot Speech Synthesis

    Authors: Zhen Ye, Zeqian Ju, Haohe Liu, Xu Tan, Jianyi Chen, Yiwen Lu, Peiwen Sun, Jiahao Pan, Weizhen Bian, Shulin He, Qifeng Liu, Yike Guo, Wei Xue

    Abstract: Recent progress in large-scale zero-shot speech synthesis has been significantly advanced by language models and diffusion models. However, the generation process of both methods is slow and computationally intensive. Efficient speech synthesis using a lower computing budget to achieve quality on par with previous work remains a significant challenge. In this paper, we present FlashSpeech, a large…

    Submitted 24 April, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: Efficient zero-shot speech synthesis

  5. arXiv:2404.14409  [pdf, other]

    cs.CV

    CrossScore: Towards Multi-View Image Evaluation and Scoring

    Authors: Zirui Wang, Wenjing Bian, Omkar Parkhi, Yuheng Ren, Victor Adrian Prisacariu

    Abstract: We introduce a novel cross-reference image quality assessment method that effectively fills the gap in the image assessment landscape, complementing the array of established evaluation schemes -- ranging from full-reference metrics like SSIM, no-reference metrics such as NIQE, to general-reference metrics including FID, and Multi-modal-reference metrics, e.g., CLIPScore. Utilising a neural network…

    Submitted 14 July, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: Accepted at ECCV 2024. Project page see https://crossscore.active.vision

  6. arXiv:2403.19966  [pdf, other]

    eess.IV cs.CV math.OC

    Multi-task Magnetic Resonance Imaging Reconstruction using Meta-learning

    Authors: Wanyu Bian, Albert Jang, Fang Liu

    Abstract: Using single-task deep learning methods to reconstruct Magnetic Resonance Imaging (MRI) data acquired with different imaging sequences is inherently challenging. The trained deep learning model typically lacks generalizability, and the dissimilarity among image datasets with different types of contrast leads to suboptimal learning performance. This paper proposes a meta-learning approach to effici…

    Submitted 21 April, 2024; v1 submitted 29 March, 2024; originally announced March 2024.

  7. arXiv:2403.12839  [pdf, other]

    cs.CV

    Global-guided Focal Neural Radiance Field for Large-scale Scene Rendering

    Authors: Mingqi Shao, Feng Xiong, Hang Zhang, Shuang Yang, Mu Xu, Wei Bian, Xueqian Wang

    Abstract: Neural radiance fields (NeRF) have recently been applied to render large-scale scenes. However, their limited model capacity typically results in blurred rendering results. Existing large-scale NeRFs primarily address this limitation by partitioning the scene into blocks, which are subsequently handled by separate sub-NeRFs. These sub-NeRFs, trained from scratch and processed independently, lead t…

    Submitted 19 March, 2024; originally announced March 2024.

  8. arXiv:2402.18178  [pdf, other]

    cs.CV

    Reflection Removal Using Recurrent Polarization-to-Polarization Network

    Authors: Wenjiao Bian, Yusuke Monno, Masatoshi Okutomi

    Abstract: This paper addresses reflection removal, which is the task of separating reflection components from a captured image and deriving the image with only transmission components. Considering that the existence of the reflection changes the polarization state of a scene, some existing methods have exploited polarized images for reflection removal. While these methods apply polarized images as the input…

    Submitted 28 February, 2024; originally announced February 2024.

    Journal ref: ICASSP 2024

  9. arXiv:2402.00769  [pdf, other]

    cs.CV cs.LG

    AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning

    Authors: Fu-Yun Wang, Zhaoyang Huang, Xiaoyu Shi, Weikang Bian, Guanglu Song, Yu Liu, Hongsheng Li

    Abstract: Video diffusion models have been gaining increasing attention for their ability to produce videos that are both coherent and of high fidelity. However, the iterative denoising process makes them computationally intensive and time-consuming, thus limiting their applications. Inspired by the Consistency Model (CM) that distills pretrained image diffusion models to accelerate the sampling with minimal steps…

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: Project Page: https://animatelcm.github.io/

  10. arXiv:2401.15977  [pdf, other]

    cs.CV

    Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling

    Authors: Xiaoyu Shi, Zhaoyang Huang, Fu-Yun Wang, Weikang Bian, Dasong Li, Yi Zhang, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li

    Abstract: We introduce Motion-I2V, a novel framework for consistent and controllable image-to-video generation (I2V). In contrast to previous methods that directly learn the complicated image-to-video mapping, Motion-I2V factorizes I2V into two stages with explicit motion modeling. For the first stage, we propose a diffusion-based motion field predictor, which focuses on deducing the trajectories of the ref…

    Submitted 31 January, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: Project page: https://xiaoyushi97.github.io/Motion-I2V/

  11. arXiv:2310.07449  [pdf, other]

    cs.CV

    PoRF: Pose Residual Field for Accurate Neural Surface Reconstruction

    Authors: Jia-Wang Bian, Wenjing Bian, Victor Adrian Prisacariu, Philip Torr

    Abstract: Neural surface reconstruction is sensitive to the camera pose noise, even if state-of-the-art pose estimators like COLMAP or ARKit are used. More importantly, existing Pose-NeRF joint optimisation methods have struggled to improve pose accuracy in challenging real-world scenarios. To overcome the challenges, we introduce the pose residual field (PoRF), a novel implicit representation that uses an…

    Submitted 12 March, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: Accepted to ICLR 2024. Find the project page at https://porf.active.vision/

  12. arXiv:2309.00783  [pdf, other]

    cs.LG

    Diffusion Modeling with Domain-conditioned Prior Guidance for Accelerated MRI and qMRI Reconstruction

    Authors: Wanyu Bian, Albert Jang, Fang Liu

    Abstract: This study introduces a novel approach for image reconstruction based on a diffusion model conditioned on the native data domain. Our method is applied to multi-coil MRI and quantitative MRI reconstruction, leveraging the domain-conditioned diffusion model within the frequency and parameter domains. The prior MRI physics are used as embeddings in the diffusion model, enforcing data consistency to…

    Submitted 1 September, 2023; originally announced September 2023.

  13. arXiv:2306.02000  [pdf, other]

    cs.CV

    Context-PIPs: Persistent Independent Particles Demands Spatial Context Features

    Authors: Weikang Bian, Zhaoyang Huang, Xiaoyu Shi, Yitong Dong, Yijin Li, Hongsheng Li

    Abstract: We tackle the problem of Persistent Independent Particles (PIPs), also called Tracking Any Point (TAP), in videos, which specifically aims at estimating persistent long-term trajectories of query points in videos. Previous methods attempted to estimate these trajectories independently to incorporate longer image sequences, therefore, ignoring the potential benefits of incorporating spatial context…

    Submitted 5 December, 2023; v1 submitted 3 June, 2023; originally announced June 2023.

    Comments: Project Page: https://wkbian.github.io/Projects/Context-PIPs/

  14. arXiv:2303.08340  [pdf, other]

    cs.CV

    VideoFlow: Exploiting Temporal Cues for Multi-frame Optical Flow Estimation

    Authors: Xiaoyu Shi, Zhaoyang Huang, Weikang Bian, Dasong Li, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li

    Abstract: We introduce VideoFlow, a novel optical flow estimation framework for videos. In contrast to previous methods that learn to estimate optical flow from two frames, VideoFlow concurrently estimates bi-directional optical flows for multiple frames that are available in videos by sufficiently exploiting temporal cues. We first propose a TRi-frame Optical Flow (TROF) module that estimates bi-directiona…

    Submitted 20 August, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

  15. arXiv:2303.01515  [pdf, other]

    math.OC cs.CV

    Optimization-Based Deep learning methods for Magnetic Resonance Imaging Reconstruction and Synthesis

    Authors: Wanyu Bian

    Abstract: This dissertation is devoted to providing advanced nonconvex nonsmooth variational models of Magnetic Resonance Imaging (MRI) reconstruction, efficient learnable image reconstruction algorithms and parameter training algorithms that improve the accuracy and robustness of the optimization-based deep learning methods for compressed sensing MRI reconstruction and synthesis. The first part introduces a no…

    Submitted 2 March, 2023; originally announced March 2023.

    Comments: PhD thesis, 145 pages

  16. arXiv:2212.07388  [pdf, other]

    cs.CV

    NoPe-NeRF: Optimising Neural Radiance Field with No Pose Prior

    Authors: Wenjing Bian, Zirui Wang, Kejie Li, Jia-Wang Bian, Victor Adrian Prisacariu

    Abstract: Training a Neural Radiance Field (NeRF) without pre-computed camera poses is challenging. Recent advances in this direction demonstrate the possibility of jointly optimising a NeRF and camera poses in forward-facing scenes. However, these methods still face difficulties during dramatic camera movement. We tackle this challenging problem by incorporating undistorted monocular depth priors. These pr…

    Submitted 14 April, 2023; v1 submitted 14 December, 2022; originally announced December 2022.

  17. arXiv:2209.08896  [pdf, other]

    cs.CV

    NeuralMarker: A Framework for Learning General Marker Correspondence

    Authors: Zhaoyang Huang, Xiaokun Pan, Weihong Pan, Weikang Bian, Yan Xu, Ka Chun Cheung, Guofeng Zhang, Hongsheng Li

    Abstract: We tackle the problem of estimating correspondences from a general marker, such as a movie poster, to an image that captures such a marker. Conventionally, this problem is addressed by fitting a homography model based on sparse feature matching. However, they are only able to handle plane-like markers and the sparse features do not sufficiently utilize appearance information. In this paper, we pro…

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: Accepted by ToG (SIGGRAPH Asia 2022). Project Page: https://drinkingcoder.github.io/publication/neuralmarker/

  18. arXiv:2208.10174  [pdf, other]

    cs.IR cs.AI

    KEEP: An Industrial Pre-Training Framework for Online Recommendation via Knowledge Extraction and Plugging

    Authors: Yujing Zhang, Zhangming Chan, Shuhao Xu, Weijie Bian, Shuguang Han, Hongbo Deng, Bo Zheng

    Abstract: An industrial recommender system generally presents a hybrid list that contains results from multiple subsystems. In practice, each subsystem is optimized with its own feedback data to avoid the disturbance among different subsystems. However, we argue that such data usage may lead to sub-optimal online performance because of the data sparsity. To alleviate this issue, we propose to extra…

    Submitted 22 August, 2022; originally announced August 2022.

    Comments: Accepted at CIKM 2022, 10 pages. Yujing Zhang and Zhangming Chan contributed equally to this work

  19. arXiv:2204.06747  [pdf, other]

    cs.CV

    Unsupervised Domain Adaptation with Implicit Pseudo Supervision for Semantic Segmentation

    Authors: Wanyu Xu, Zengmao Wang, Wei Bian

    Abstract: Pseudo-labelling is a popular technique in unsupervised domain adaptation for semantic segmentation. However, pseudo labels are noisy and inevitably have confirmation bias due to the discrepancy between source and target domains and training process. In this paper, we train the model by the pseudo labels which are implicitly produced by itself to learn new complementary knowledge about target dom…

    Submitted 14 April, 2022; originally announced April 2022.

  20. arXiv:2204.03804  [pdf, other]

    eess.IV cs.CV cs.LG math.OC

    A Learnable Variational Model for Joint Multimodal MRI Reconstruction and Synthesis

    Authors: Wanyu Bian, Qingchao Zhang, Xiaojing Ye, Yunmei Chen

    Abstract: Generating multi-contrasts/modal MRI of the same anatomy enriches diagnostic information but is limited in practice due to excessive data acquisition time. In this paper, we propose a novel deep-learning model for joint reconstruction and synthesis of multi-modal MRI using incomplete k-space data of several source modalities as inputs. The output of our model includes reconstructed images of the s…

    Submitted 28 June, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

    Comments: Provisional Accepted by MICCAI2022

  21. arXiv:2112.11136  [pdf, other]

    cs.IR cs.LG

    Adversarial Gradient Driven Exploration for Deep Click-Through Rate Prediction

    Authors: Kailun Wu, Zhangming Chan, Weijie Bian, Lejian Ren, Shiming Xiang, Shuguang Han, Hongbo Deng, Bo Zheng

    Abstract: Exploration-Exploitation (E&E) algorithms are commonly adopted to deal with the feedback-loop issue in large-scale online recommender systems. Most existing studies believe that high uncertainty can be a good indicator of potential reward, and thus primarily focus on the estimation of model uncertainty. We argue that such an approach overlooks the subsequent effect of exploration on model tr…

    Submitted 30 May, 2022; v1 submitted 21 December, 2021; originally announced December 2021.

    Comments: This paper is accepted by the KDD2022

  22. arXiv:2110.00715  [pdf, other]

    cs.CV math.OC

    An Optimization-Based Meta-Learning Model for MRI Reconstruction with Diverse Dataset

    Authors: Wanyu Bian, Yunmei Chen, Xiaojing Ye, Qingchao Zhang

    Abstract: Purpose: This work aims at developing a generalizable MRI reconstruction model in the meta-learning framework. The standard benchmarks in meta-learning are challenged by learning on diverse task distributions. The proposed network learns the regularization function in a variational model and reconstructs MR images with various under-sampling ratios or patterns that may or may not be seen in the tr…

    Submitted 1 October, 2021; originally announced October 2021.

    Comments: 27 pages

  23. arXiv:2109.09738  [pdf, other]

    eess.IV cs.CV cs.LG math.OC

    An Optimal Control Framework for Joint-channel Parallel MRI Reconstruction without Coil Sensitivities

    Authors: Wanyu Bian, Yunmei Chen, Xiaojing Ye

    Abstract: Goal: This work aims at developing a novel calibration-free fast parallel MRI (pMRI) reconstruction method incorporated with a discrete-time optimal control framework. The reconstruction model is designed to learn a regularization that combines channels and extracts features by leveraging the information sharing among channels of multi-coil images. We propose to recover both magnitude and phase infor…

    Submitted 23 January, 2022; v1 submitted 20 September, 2021; originally announced September 2021.

    Comments: 13 pages

  24. arXiv:2107.01899  [pdf, other]

    cs.CV

    Ray-ONet: Efficient 3D Reconstruction From A Single RGB Image

    Authors: Wenjing Bian, Zirui Wang, Kejie Li, Victor Adrian Prisacariu

    Abstract: We propose Ray-ONet to reconstruct detailed 3D models from monocular images efficiently. By predicting a series of occupancy probabilities along a ray that is back-projected from a pixel in the camera coordinate, our method Ray-ONet improves the reconstruction accuracy in comparison with Occupancy Networks (ONet), while reducing the network inference complexity to O($N^2$). As a result, Ray-ONet a…

    Submitted 22 October, 2021; v1 submitted 5 July, 2021; originally announced July 2021.

    Comments: accepted in BMVC 2021

  25. arXiv:2011.05625  [pdf, other]

    cs.IR stat.ML

    CAN: Feature Co-Action for Click-Through Rate Prediction

    Authors: Weijie Bian, Kailun Wu, Lejian Ren, Qi Pi, Yujing Zhang, Can Xiao, Xiang-Rong Sheng, Yong-Nan Zhu, Zhangming Chan, Na Mou, Xinchen Luo, Shiming Xiang, Guorui Zhou, Xiaoqiang Zhu, Hongbo Deng

    Abstract: Feature interaction has been recognized as an important problem in machine learning, which is also very essential for click-through rate (CTR) prediction tasks. In recent years, Deep Neural Networks (DNNs) can automatically learn implicit nonlinear interactions from original sparse features, and therefore have been widely used in industrial CTR prediction tasks. However, the implicit feature inter…

    Submitted 7 December, 2021; v1 submitted 11 November, 2020; originally announced November 2020.

    Comments: WSDM 2022

    MSC Class: Machine Learning (stat.ML); Information Retrieval (cs.IR); Machine Learning (cs.LG) ACM Class: I.2.6

  26. arXiv:2008.01410  [pdf, other]

    eess.IV cs.CV

    Deep Parallel MRI Reconstruction Network Without Coil Sensitivities

    Authors: Wanyu Bian, Yunmei Chen, Xiaojing Ye

    Abstract: We propose a novel deep neural network architecture by mapping the robust proximal gradient scheme for fast image reconstruction in parallel MRI (pMRI) with regularization function trained from data. The proposed network learns to adaptively combine the multi-coil images from incomplete pMRI data into a single image with homogeneous contrast, which is then passed to a nonlinear encoder to efficien…

    Submitted 18 August, 2020; v1 submitted 4 August, 2020; originally announced August 2020.

    Comments: Accepted by MICCAI international workshop MLMIR 2020

  27. arXiv:2004.03112  [pdf, other]

    cs.LG stat.ML

    Repulsive Mixture Models of Exponential Family PCA for Clustering

    Authors: Maoying Qiao, Tongliang Liu, Jun Yu, Wei Bian, Dacheng Tao

    Abstract: The mixture extension of exponential family principal component analysis (EPCA) was designed to encode much more structural information about data distribution than the traditional EPCA does. For example, due to the linearity of EPCA's essential form, nonlinear cluster structures cannot be easily handled, but they are explicitly modeled by the mixing extensions. However, the traditional mixture of…

    Submitted 7 April, 2020; originally announced April 2020.

  28. arXiv:2004.02842  [pdf, other]

    cs.LG stat.ML

    Detecting Communities in Heterogeneous Multi-Relational Networks: A Message Passing based Approach

    Authors: Maoying Qiao, Jun Yu, Wei Bian, Dacheng Tao

    Abstract: Community is a common characteristic of networks including social networks, biological networks, computer and information networks, to name a few. Community detection is a basic step for exploring and analysing these network data. Typically, a homogeneous network is a type of network that consists of only one type of objects with one type of links connecting them. There has been a large body of dev…

    Submitted 6 April, 2020; originally announced April 2020.

  29. arXiv:2004.00858  [pdf, other]

    cs.NE math.OC

    Projection Neural Network for a Class of Sparse Regression Problems with Cardinality Penalty

    Authors: Wenjing Li, Wei Bian

    Abstract: In this paper, we consider a class of sparse regression problems, whose objective function is the summation of a convex loss function and a cardinality penalty. By constructing a smoothing function for the cardinality function, we propose a projected neural network and design a correction method for solving this problem. The solution of the proposed neural network is unique, global existent, bound…

    Submitted 10 June, 2021; v1 submitted 2 April, 2020; originally announced April 2020.

  30. arXiv:1906.12324  [pdf]

    cs.SI cs.LG cs.MM

    Cross-Platform Modeling of Users' Behavior on Social Media

    Authors: Haiqian Gu, Jie Wang, Ziwen Wang, Bojin Zhuang, Wenhao Bian, Fei Su

    Abstract: With the booming development and popularity of mobile applications, different verticals accumulate abundant data of user information and social behavior, which are spontaneous, genuine and diversified. However, each platform describes a user's portrait in only certain aspects, making it difficult to combine those internet footprints. In our research, we proposed a modeling approach t…

    Submitted 23 June, 2019; originally announced June 2019.

    Comments: Published in IEEE International Conference on Data Mining Workshops (ICDMW) 2018

    Journal ref: 2018 IEEE International Conference on Data Mining Workshops (ICDMW) (2018): 183-190

  31. arXiv:1906.11620  [pdf]

    eess.AS cs.MM cs.SD

    Audio-Based Music Classification with DenseNet And Data Augmentation

    Authors: Wenhao Bian, Jie Wang, Bojin Zhuang, Jiankui Yang, Shaojun Wang, Jing Xiao

    Abstract: In recent years, deep learning techniques have received intense attention owing to their great success in image recognition. A trend of adopting deep learning in various information processing fields has formed, including music information retrieval (MIR). In this paper, we conduct a comprehensive study on music audio classification with improved convolutional neural networks (CNNs). To the best…

    Submitted 15 June, 2019; originally announced June 2019.

    Comments: accepted by The 16th Pacific Rim International Conference on AI

  32. arXiv:1906.10304  [pdf, other]

    stat.ML cs.LG

    Res-embedding for Deep Learning Based Click-Through Rate Prediction Modeling

    Authors: Guorui Zhou, Kailun Wu, Weijie Bian, Zhao Yang, Xiaoqiang Zhu, Kun Gai

    Abstract: Recently, click-through rate (CTR) prediction models have evolved from shallow methods to deep neural networks. Most deep CTR models follow an Embedding&MLP paradigm, that is, first mapping discrete id features, e.g. user visited items, into low dimensional vectors with an embedding module, then learning a multi-layer perceptron (MLP) to fit the target. In this way, the embedding module performs as the…

    Submitted 24 June, 2019; originally announced June 2019.

  33. Practice on Long Sequential User Behavior Modeling for Click-Through Rate Prediction

    Authors: Qi Pi, Weijie Bian, Guorui Zhou, Xiaoqiang Zhu, Kun Gai

    Abstract: Click-through rate (CTR) prediction is critical for industrial applications such as recommender system and online advertising. Practically, it plays an important role for CTR modeling in these applications by mining user interest from rich historical behavior data. Driven by the development of deep learning, deep CTR models with ingeniously designed architecture for user interest modeling have bee…

    Submitted 23 May, 2019; v1 submitted 22 May, 2019; originally announced May 2019.

    Comments: 9 pages. Accepted by KDD 2019

  34. Lifelong Sequential Modeling with Personalized Memorization for User Response Prediction

    Authors: Kan Ren, Jiarui Qin, Yuchen Fang, Weinan Zhang, Lei Zheng, Weijie Bian, Guorui Zhou, Jian Xu, Yong Yu, Xiaoqiang Zhu, Kun Gai

    Abstract: User response prediction, which models the user preference w.r.t. the presented items, plays a key role in online services. With two-decade rapid development, nowadays the cumulated user behavior sequences on mature Internet service platforms have become extremely long since the user's first registration. Each user not only has intrinsic tastes, but also keeps changing her personal interests durin…

    Submitted 12 May, 2019; v1 submitted 2 May, 2019; originally announced May 2019.

    Comments: SIGIR 2019. Reproducible codes and datasets: https://github.com/alimamarankgroup/HPMN

  35. arXiv:1904.08098  [pdf, other]

    cs.CV cs.LG

    Correlated Logistic Model With Elastic Net Regularization for Multilabel Image Classification

    Authors: Qiang Li, Bo Xie, Jane You, Wei Bian, Dacheng Tao

    Abstract: In this paper, we present correlated logistic (CorrLog) model for multilabel image classification. CorrLog extends conventional logistic regression model into multilabel cases, via explicitly modeling the pairwise correlation between labels. In addition, we propose to learn the model parameters of CorrLog with elastic net regularization, which helps exploit the sparsity in feature selection and la…

    Submitted 17 April, 2019; originally announced April 2019.

  36. arXiv:1904.05335  [pdf, other]

    cs.SI cs.LG stat.ML

    Adapting Stochastic Block Models to Power-Law Degree Distributions

    Authors: Maoying Qiao, Jun Yu, Wei Bian, Qiang Li, Dacheng Tao

    Abstract: Stochastic block models (SBMs) have been playing an important role in modeling clusters or community structures of network data. However, they are incapable of handling several complex features ubiquitously exhibited in real-world networks, one of which is the power-law degree characteristic. To this end, we propose a new variant of SBM, termed power-law degree SBM (PLD-SBM), by introducing degree decay…

    Submitted 5 April, 2019; originally announced April 2019.

    Comments: 13 pages, 13 figures

    Journal ref: IEEE Transactions on Cybernetics, 49 (2019) 626-637

  37. Diversified Hidden Markov Models for Sequential Labeling

    Authors: Maoying Qiao, Wei Bian, Richard Yida Xu, Dacheng Tao

    Abstract: Labeling of sequential data is a prevalent meta-problem for a wide range of real world applications. While the first-order Hidden Markov Model (HMM) provides a fundamental approach for unsupervised sequential labeling, the basic model does not show satisfying performance when it is directly applied to real world problems, such as part-of-speech tagging (PoS tagging) and optical character recognit…

    Submitted 5 April, 2019; originally announced April 2019.

    Comments: 14 pages, 12 figures

    Journal ref: IEEE Transactions on Knowledge and Data Engineering, 27 (2015) 2947 - 2960

  38. arXiv:1809.03672  [pdf, other]

    stat.ML cs.IR cs.LG

    Deep Interest Evolution Network for Click-Through Rate Prediction

    Authors: Guorui Zhou, Na Mou, Ying Fan, Qi Pi, Weijie Bian, Chang Zhou, Xiaoqiang Zhu, Kun Gai

    Abstract: Click-through rate (CTR) prediction, whose goal is to estimate the probability of the user clicks, has become one of the core tasks in advertising systems. For CTR prediction model, it is necessary to capture the latent user interest behind the user behavior data. Besides, considering the changing of the external environment and the internal cognition, user interest evolves over time dynamically.…

    Submitted 16 November, 2018; v1 submitted 10 September, 2018; originally announced September 2018.

    Comments: 9 pages. Accepted by AAAI 2019

    ACM Class: I.2.6

  39. arXiv:1708.04106  [pdf, other]

    stat.ML cs.LG

    Rocket Launching: A Universal and Efficient Framework for Training Well-performing Light Net

    Authors: Guorui Zhou, Ying Fan, Runpeng Cui, Weijie Bian, Xiaoqiang Zhu, Kun Gai

    Abstract: Models applied to real-time response tasks, such as click-through rate (CTR) prediction models, require high accuracy and rigorous response time. Therefore, top-performing deep models of high depth and complexity are not well suited for these applications with the limitations on the inference time. In order to further improve the neural networks' performance given the time and computational limitations…

    Submitted 14 March, 2018; v1 submitted 14 August, 2017; originally announced August 2017.

    Comments: 10 pages, AAAI2018

    ACM Class: I.2.6

  40. arXiv:1608.04198  [pdf, ps, other]

    cs.IT

    Scaled VIP Algorithms for Joint Dynamic Forwarding and Caching in Named Data Networks

    Authors: Fan Lai, Feng Qiu, Wenjie Bian, Ying Cui, Edmund Yeh

    Abstract: Emerging Information-Centric Networking (ICN) architectures seek to optimally utilize both bandwidth and storage for efficient content distribution over the network. The Virtual Interest Packet (VIP) framework has been proposed to enable joint design of forwarding and caching within the Named Data Networking (NDN) architecture. The virtual plane of the VIP framework captures the measured demand fo…

    Submitted 15 August, 2016; originally announced August 2016.

    Comments: to appear in ICN 2016. arXiv admin note: substantial text overlap with arXiv:1607.03270, arXiv:1310.5569

  41. arXiv:1208.3030  [pdf, ps, other]

    stat.ML cs.LG

    Asymptotic Generalization Bound of Fisher's Linear Discriminant Analysis

    Authors: Wei Bian, Dacheng Tao

    Abstract: Fisher's linear discriminant analysis (FLDA) is an important dimension reduction method in statistical pattern recognition. It has been shown that FLDA is asymptotically Bayes optimal under the homoscedastic Gaussian assumption. However, this classical result has the following two major limitations: 1) it holds only for a fixed dimensionality $D$, and thus does not apply when $D$ and the training…

    Submitted 22 April, 2013; v1 submitted 15 August, 2012; originally announced August 2012.