Skip to main content

Showing 1–50 of 67 results for author: Deng, Q

  1. arXiv:2406.03138  [pdf, other

    cs.SD eess.AS

    A Frame-based Attention Interpretation Method for Relevant Acoustic Feature Extraction in Long Speech Depression Detection

    Authors: Qingkun Deng, Saturnino Luz, Sofia de la Fuente Garcia

    Abstract: Speech-based depression detection tools could help early screening of depression. Here, we address two issues that may hinder the clinical practicality of such tools: segment-level labelling noise and a lack of model interpretability. We propose a speech-level Audio Spectrogram Transformer to avoid segment-level labelling. We observe that the proposed model significantly outperforms a segment-leve… ▽ More

    Submitted 7 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

    Comments: 5 pages, 3 figures. arXiv admin note: substantial text overlap with arXiv:2309.13476

  2. arXiv:2405.00344  [pdf, other

    cs.MM

    Expert Insight-Enhanced Follow-up Chest X-Ray Summary Generation

    Authors: Zhichuan Wang, Kinhei Lee, Qiao Deng, Tiffany Y. So, Wan Hang Chiu, Yeung Yu Hui, Bingjing Zhou, Edward S. Hui

    Abstract: A chest X-ray radiology report describes abnormal findings not only from X-ray obtained at current examination, but also findings on disease progression or change in device placement with reference to the X-ray from previous examination. Majority of the efforts on automatic generation of radiology report pertain to reporting the former, but not the latter, type of findings. To the best of the auth… ▽ More

    Submitted 6 May, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

    Comments: accepted by 22nd International Conference on Artificial Intelligence in medicine (AIME2024)

    ACM Class: I.2.1

  3. arXiv:2404.18081  [pdf, other

    cs.SD cs.AI cs.CL cs.LG cs.MM eess.AS

    ComposerX: Multi-Agent Symbolic Music Composition with LLMs

    Authors: Qixin Deng, Qikai Yang, Ruibin Yuan, Yipeng Huang, Yi Wang, Xubo Liu, Zeyue Tian, Jiahao Pan, Ge Zhang, Hanfeng Lin, Yizhi Li, Yinghao Ma, Jie Fu, Chenghua Lin, Emmanouil Benetos, Wenwu Wang, Guangyu Xia, Wei Xue, Yike Guo

    Abstract: Music composition represents the creative side of humanity, and itself is a complex task that requires abilities to understand and generate information with long dependency and harmony constraints. While demonstrating impressive capabilities in STEM subjects, current LLMs easily fail in this task, generating ill-written music even when equipped with modern techniques like In-Context-Learning and C… ▽ More

    Submitted 30 April, 2024; v1 submitted 28 April, 2024; originally announced April 2024.

  4. arXiv:2404.14750  [pdf, other

    cs.CV cs.AI

    Grounded Knowledge-Enhanced Medical VLP for Chest X-Ray

    Authors: Qiao Deng, Zhongzhen Huang, Yunqi Wang, Zhichuan Wang, Zhao Wang, Xiaofan Zhang, Qi Dou, Yeung Yu Hui, Edward S. Hui

    Abstract: Medical vision-language pre-training has emerged as a promising approach for learning domain-general representations of medical image and text. Current algorithms that exploit the global and local alignment between medical image and text could however be marred by the redundant information in medical data. To address this issue, we propose a grounded knowledge-enhanced medical vision-language pre-… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  5. arXiv:2404.13983  [pdf, other

    cs.CV

    Structure-Aware Human Body Reshaping with Adaptive Affinity-Graph Network

    Authors: Qiwen Deng, Yangcen Liu, Wen Li, Guoqing Wang

    Abstract: Given a source portrait, the automatic human body reshaping task aims at editing it to an aesthetic body shape. As the technology has been widely used in media, several methods have been proposed mainly focusing on generating optical flow to warp the body shape. However, those previous works only consider the local transformation of different body parts (arms, torso, and legs), ignoring the global… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 11 pages;

  6. arXiv:2404.10004  [pdf

    cs.LG physics.soc-ph stat.AP

    A Strategy Transfer and Decision Support Approach for Epidemic Control in Experience Shortage Scenarios

    Authors: X. Xiao, P. Chen, X. Cao, K. Liu, L. Deng, D. Zhao, Z. Chen, Q. Deng, F. Yu, H. Zhang

    Abstract: Epidemic outbreaks can cause critical health concerns and severe global economic crises. For countries or regions with new infectious disease outbreaks, it is essential to generate preventive strategies by learning lessons from others with similar risk profiles. A Strategy Transfer and Decision Support Approach (STDSA) is proposed based on the profile similarity evaluation. There are four steps in… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 20 pages, 9 figures

  7. arXiv:2404.06769  [pdf

    cs.NE

    Solving the Food-Energy-Water Nexus Problem via Intelligent Optimization Algorithms

    Authors: Qi Deng, Zheng Fan, Zhi Li, Xinna Pan, Qi Kang, MengChu Zhou

    Abstract: The application of evolutionary algorithms (EAs) to multi-objective optimization problems has been widespread. However, the EA research community has not paid much attention to large-scale multi-objective optimization problems arising from real-world applications. Especially, Food-Energy-Water systems are intricately linked among food, energy and water that impact each other. They usually involve… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  8. arXiv:2402.18070  [pdf, other

    cs.AR eess.SP

    A Hierarchical Dataflow-Driven Heterogeneous Architecture for Wireless Baseband Processing

    Authors: Limin Jiang, Yi Shi, Haiqin Hu, Qingyu Deng, Siyi Xu, Yintao Liu, Feng Yuan, Si Wang, Yihao Shen, Fangfang Ye, Shan Cao, Zhiyuan Jiang

    Abstract: Wireless baseband processing (WBP) is a key element of wireless communications, with a series of signal processing modules to improve data throughput and counter channel fading. Conventional hardware solutions, such as digital signal processors (DSPs) and more recently, graphic processing units (GPUs), provide various degrees of parallelism, yet they both fail to take into account the cyclical and… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: 7 pages, 7 figures, conference

  9. arXiv:2402.11202  [pdf, other

    cs.IR

    Towards Scalability and Extensibility of Query Reformulation Modeling in E-commerce Search

    Authors: Ziqi Zhang, Yupin Huang, Quan Deng, Jinghui Xiao, Vivek Mittal, Jingyuan Deng

    Abstract: Customer behavioral data significantly impacts e-commerce search systems. However, in the case of less common queries, the associated behavioral data tends to be sparse and noisy, offering inadequate support to the search mechanism. To address this challenge, the concept of query reformulation has been introduced. It suggests that less common queries could utilize the behavior patterns of their po… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

  10. arXiv:2401.13971  [pdf, other

    math.OC cs.LG

    Stochastic Weakly Convex Optimization Beyond Lipschitz Continuity

    Authors: Wenzhi Gao, Qi Deng

    Abstract: This paper considers stochastic weakly convex optimization without the standard Lipschitz continuity assumption. Based on new adaptive regularization (stepsize) strategies, we show that a wide class of stochastic algorithms, including the stochastic subgradient method, preserve the $\mathcal{O} ( 1 / \sqrt{K})$ convergence rate with constant failure rate. Our analyses rest on rather weak assumptio… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

  11. arXiv:2311.11572  [pdf, other

    cs.ET

    Cryogenic quasi-static embedded DRAM for energy-efficient compute-in-memory applications

    Authors: Yuhao Shu, Hongtu Zhang, Hao Sun, Mengru Zhang, Wenfeng Zhao, Qi Deng, Zhidong Tang, Yumeng Yuan, Yongqi Hu, Yu Gu, Xufeng Kou, Yajun Ha

    Abstract: Compute-in-memory (CIM) presents an attractive approach for energy-efficient computing in data-intensive applications. However, the development of suitable memory designs to achieve high-performance CIM remains a challenging task. Here, we propose a cryogenic quasi-static embedded DRAM to address the logic-memory mismatch of CIM. Guided by the re-calibrated cryogenic device model, the designed fou… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  12. arXiv:2310.11973  [pdf, other

    math.OC cs.DC

    Decentralized Gradient-Free Methods for Stochastic Non-Smooth Non-Convex Optimization

    Authors: Zhenwei Lin, Jingfan Xia, Qi Deng, Luo Luo

    Abstract: We consider decentralized gradient-free optimization of minimizing Lipschitz continuous functions that satisfy neither smoothness nor convexity assumption. We propose two novel gradient-free algorithms, the Decentralized Gradient-Free Method (DGFM) and its variant, the Decentralized Gradient-Free Method$^+$ (DGFM$^{+}$). Based on the techniques of randomized smoothing and gradient tracking, DGFM r… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

  13. arXiv:2309.13476  [pdf, other

    cs.CL cs.SD eess.AS

    Hierarchical attention interpretation: an interpretable speech-level transformer for bi-modal depression detection

    Authors: Qingkun Deng, Saturnino Luz, Sofia de la Fuente Garcia

    Abstract: Depression is a common mental disorder. Automatic depression detection tools using speech, enabled by machine learning, help early screening of depression. This paper addresses two limitations that may hinder the clinical implementations of such tools: noise resulting from segment-level labelling and a lack of model interpretability. We propose a bi-modal speech-level transformer to avoid segment-… ▽ More

    Submitted 6 October, 2023; v1 submitted 23 September, 2023; originally announced September 2023.

    Comments: 5 pages, 3 figures, submitted to IEEE International Conference on Acoustics, Speech, and Signal Processing

    ACM Class: F.2.2; I.2.7

  14. arXiv:2309.00753  [pdf, ps, other

    cs.IT eess.SP

    Jamming Suppression Via Resource Hopping in High-Mobility OTFS-SCMA Systems

    Authors: Qinwen Deng, Yao Ge, Zhi Ding

    Abstract: This letter studies the mechanism of uplink multiple access and jamming suppression in an OTFS system. Specifically, we propose a novel resource hopping mechanism for orthogonal time frequency space (OTFS) systems with delay or Doppler partitioned sparse code multiple access (SCMA) to mitigate the effect of jamming in controlled multiuser uplink. We analyze the non-uniform impact of classic jammin… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

  15. arXiv:2308.10767  [pdf, other

    cs.LG

    GBM-based Bregman Proximal Algorithms for Constrained Learning

    Authors: Zhenwei Lin, Qi Deng

    Abstract: As the complexity of learning tasks surges, modern machine learning encounters a new constrained learning paradigm characterized by more intricate and data-driven function constraints. Prominent applications include Neyman-Pearson classification (NPC) and fairness classification, which entail specific risk constraints that render standard projection-based training algorithms unsuitable. Gradient b… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

  16. arXiv:2308.10630  [pdf, other

    math.OC cs.LG

    A Homogenization Approach for Gradient-Dominated Stochastic Optimization

    Authors: Jiyuan Tan, Chenyu Xue, Chuwen Zhang, Qi Deng, Dongdong Ge, Yinyu Ye

    Abstract: Gradient dominance property is a condition weaker than strong convexity, yet sufficiently ensures global convergence even in non-convex optimization. This property finds wide applications in machine learning, reinforcement learning (RL), and operations management. In this paper, we propose the stochastic homogeneous second-order descent method (SHSODM) for stochastic functions enjoying gradient do… ▽ More

    Submitted 29 May, 2024; v1 submitted 21 August, 2023; originally announced August 2023.

    Comments: Accepted by UAI`24

  17. arXiv:2305.13823  [pdf, other

    cs.AI

    XRoute Environment: A Novel Reinforcement Learning Environment for Routing

    Authors: Zhanwen Zhou, Hankz Hankui Zhuo, Xiaowu Zhang, Qiyuan Deng

    Abstract: Routing is a crucial and time-consuming stage in modern design automation flow for advanced technology nodes. Great progress in the field of reinforcement learning makes it possible to use those approaches to improve the routing quality and efficiency. However, the scale of the routing problems solved by reinforcement learning-based methods in recent studies is too small for these methods to be us… ▽ More

    Submitted 5 June, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: arXiv admin note: text overlap with arXiv:1907.11180 by other authors

  18. arXiv:2305.10181  [pdf, other

    cs.LG

    Exploring the cloud of feature interaction scores in a Rashomon set

    Authors: Sichao Li, Rong Wang, Quanling Deng, Amanda Barnard

    Abstract: Interactions among features are central to understanding the behavior of machine learning models. Recent research has made significant strides in detecting and quantifying feature interactions in single predictive models. However, we argue that the feature interactions extracted from a single pre-specified model may not be trustworthy since: a well-trained predictive model may not preserve the tru… ▽ More

    Submitted 11 February, 2024; v1 submitted 17 May, 2023; originally announced May 2023.

  19. arXiv:2305.06802  [pdf, other

    cond-mat.dis-nn cs.LG math.NA

    Physics-Informed Neural Networks for Discovering Localised Eigenstates in Disordered Media

    Authors: Liam Harcombe, Quanling Deng

    Abstract: The Schrödinger equation with random potentials is a fundamental model for understanding the behaviour of particles in disordered systems. Disordered media are characterised by complex potentials that lead to the localisation of wavefunctions, also called Anderson localisation. These wavefunctions may have similar scales of eigenenergies which poses difficulty in their discovery. It has been a lon… ▽ More

    Submitted 12 July, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

  20. arXiv:2304.04778  [pdf, ps, other

    math.OC cs.LG

    First-order methods for Stochastic Variational Inequality problems with Function Constraints

    Authors: Digvijay Boob, Qi Deng, Mohammad Khalafi

    Abstract: The monotone Variational Inequality (VI) is a general model with important applications in various engineering and scientific domains. In numerous instances, the VI problems are accompanied by function constraints that can be data-driven, making the usual projection operator challenging to compute. This paper presents novel first-order methods for the function-constrained Variational Inequality (F… ▽ More

    Submitted 24 May, 2024; v1 submitted 10 April, 2023; originally announced April 2023.

  21. arXiv:2303.15132  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    Cross-utterance ASR Rescoring with Graph-based Label Propagation

    Authors: Srinath Tankasala, Long Chen, Andreas Stolcke, Anirudh Raju, Qianli Deng, Chander Chandak, Aparna Khare, Roland Maas, Venkatesh Ravichandran

    Abstract: We propose a novel approach for ASR N-best hypothesis rescoring with graph-based label propagation by leveraging cross-utterance acoustic similarity. In contrast to conventional neural language model (LM) based ASR rescoring/reranking models, our approach focuses on acoustic information and conducts the rescoring collaboratively among utterances, instead of individually. Experiments on the VCTK da… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: To appear in IEEE ICASSP 2023

    Journal ref: Proc. IEEE ICASSP, June 2023

  22. arXiv:2303.03590  [pdf, other

    cs.LG cs.IT

    Research on Efficient Fuzzy Clustering Method Based on Local Fuzzy Granular balls

    Authors: Jiang Xie, Qiao Deng, Shuyin Xia, Yangzhou Zhao, Guoyin Wang, Xinbo Gao

    Abstract: In recent years, the problem of fuzzy clustering has been widely concerned. The membership iteration of existing methods is mostly considered globally, which has considerable problems in noisy environments, and iterative calculations for clusters with a large number of different sample sizes are not accurate and efficient. In this paper, starting from the strategy of large-scale priority, the data… ▽ More

    Submitted 8 March, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

  23. arXiv:2302.08902  [pdf, other

    cs.CV

    Fashion Image Retrieval with Multi-Granular Alignment

    Authors: Jinkuan Zhu, Hao Huang, Qiao Deng, Xiyao Li

    Abstract: Fashion image retrieval task aims to search relevant clothing items of a query image from the gallery. The previous recipes focus on designing different distance-based loss functions, pulling relevant pairs to be close and pushing irrelevant images apart. However, these methods ignore fine-grained features (e.g. neckband, cuff) of clothing images. In this paper, we propose a novel fashion image re… ▽ More

    Submitted 7 March, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

  24. arXiv:2302.08869  [pdf, other

    eess.SP cs.IT

    OTFS Signaling for SCMA With Coordinated Multi-Point Vehicle Communications

    Authors: Yao Ge, Qinwen Deng, David González G., Yong Liang Guan, Zhi Ding

    Abstract: This paper investigates an uplink coordinated multi-point (CoMP) coverage scenario, in which multiple mobile users are grouped for sparse code multiple access (SCMA), and served by the remote radio head (RRH) in front of them and the RRH behind them simultaneously. We apply orthogonal time frequency space (OTFS) modulation for each user to exploit the degrees of freedom arising from both the delay… ▽ More

    Submitted 17 February, 2023; originally announced February 2023.

    Comments: 15 pages, 12 figures, accepted by IEEE Transactions on Vehicular Technology

  25. arXiv:2301.12713  [pdf, other

    math.OC cs.DC

    Delayed Stochastic Algorithms for Distributed Weakly Convex Optimization

    Authors: Wenzhi Gao, Qi Deng

    Abstract: This paper studies delayed stochastic algorithms for weakly convex optimization in a distributed network with workers connected to a master node. Recently, Xu et al. 2022 showed that an inertial stochastic subgradient method converges at a rate of $\mathcal{O}(τ_{\text{max}}/\sqrt{K})$ which depends on the maximum information delay $τ_{\text{max}}$. In this work, we show that the delayed stochasti… ▽ More

    Submitted 1 November, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

  26. arXiv:2301.12174  [pdf, other

    math.OC cs.LG

    Stochastic Dimension-reduced Second-order Methods for Policy Optimization

    Authors: Jinsong Liu, Chenghan Xie, Qi Deng, Dongdong Ge, Yinyu Ye

    Abstract: In this paper, we propose several new stochastic second-order algorithms for policy optimization that only require gradient and Hessian-vector product in each iteration, making them computationally efficient and comparable to policy gradient methods. Specifically, we propose a dimension-reduced second-order method (DR-SOPO) which repeatedly solves a projected two-dimensional trust region subproble… ▽ More

    Submitted 28 January, 2023; originally announced January 2023.

  27. arXiv:2212.11143  [pdf, other

    math.OC cs.LG

    Efficient First-order Methods for Convex Optimization with Strongly Convex Function Constraints

    Authors: Zhenwei Lin, Qi Deng

    Abstract: In this paper, we introduce faster first-order primal-dual algorithms for minimizing a convex function subject to strongly convex function constraints. Before our work, the best complexity bound was $\mathcal{O}(1/{\varepsilon})$, and it remains unclear how to improve this result by leveraging the strong convexity assumption. We address this issue by developing novel techniques to progressively es… ▽ More

    Submitted 5 November, 2023; v1 submitted 21 December, 2022; originally announced December 2022.

    Comments: new experiments

    MSC Class: 90C25; 90C30; 90C06

  28. arXiv:2212.06643  [pdf, other

    cs.CV

    Boosting Semi-Supervised Learning with Contrastive Complementary Labeling

    Authors: Qinyi Deng, Yong Guo, Zhibang Yang, Haolin Pan, Jian Chen

    Abstract: Semi-supervised learning (SSL) has achieved great success in leveraging a large amount of unlabeled data to learn a promising classifier. A popular approach is pseudo-labeling that generates pseudo labels only for those unlabeled data with high-confidence predictions. As for the low-confidence ones, existing methods often simply discard them because these unreliable pseudo labels may mislead the m… ▽ More

    Submitted 27 December, 2022; v1 submitted 13 December, 2022; originally announced December 2022.

    Comments: typos corrected, 5 figures, 3 tables,

  29. arXiv:2211.13116  [pdf, other

    cs.LG cs.CR stat.ML

    Fed-TDA: Federated Tabular Data Augmentation on Non-IID Data

    Authors: Shaoming Duan, Chuanyi Liu, Peiyi Han, Tianyu He, Yifeng Xu, Qiyuan Deng

    Abstract: Non-independent and identically distributed (non-IID) data is a key challenge in federated learning (FL), which usually hampers the optimization convergence and the performance of FL. Existing data augmentation methods based on federated generative models or raw data sharing strategies for solving the non-IID problem still suffer from low performance, privacy protection concerns, and high communic… ▽ More

    Submitted 12 January, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

  30. arXiv:2210.12683  [pdf, other

    cs.CV

    GAN-based Facial Attribute Manipulation

    Authors: Yunfan Liu, Qi Li, Qiyao Deng, Zhenan Sun, Ming-Hsuan Yang

    Abstract: Facial Attribute Manipulation (FAM) aims to aesthetically modify a given face image to render desired attributes, which has received significant attention due to its broad practical applications ranging from digital entertainment to biometric forensics. In the last decade, with the remarkable success of Generative Adversarial Networks (GANs) in synthesizing realistic images, numerous GAN-based mod… ▽ More

    Submitted 23 October, 2022; originally announced October 2022.

  31. arXiv:2208.04740  [pdf, other

    cs.CV

    Aesthetic Language Guidance Generation of Images Using Attribute Comparison

    Authors: Xin Jin, Qiang Deng, Jianwen Lv, Heng Huang, Hao Lou, Chaoen Xiao

    Abstract: With the vigorous development of mobile photography technology, major mobile phone manufacturers are scrambling to improve the shooting ability of equipments and the photo beautification algorithm of software. However, the improvement of intelligent equipments and algorithms cannot replace human subjective photography technology. In this paper, we propose the aesthetic language guidance of image (… ▽ More

    Submitted 9 August, 2022; originally announced August 2022.

    Comments: 13 pages, 18 figures, on going research

  32. arXiv:2208.04517  [pdf, other

    cs.CV

    Attribute Controllable Beautiful Caucasian Face Generation by Aesthetics Driven Reinforcement Learning

    Authors: Xin Jin, Shu Zhao, Le Zhang, Xin Zhao, Qiang Deng, Chaoen Xiao

    Abstract: In recent years, image generation has made great strides in improving the quality of images, producing high-fidelity ones. Also, quite recently, there are architecture designs, which enable GAN to unsupervisedly learn the semantic attributes represented in different layers. However, there is still a lack of research on generating face images more consistent with human aesthetics. Based on EigenGAN… ▽ More

    Submitted 8 August, 2022; originally announced August 2022.

    Comments: 13 pages, 5 figures. ACM Multimedia 2022 Technical Demos and Videos Program

  33. arXiv:2208.00238  [pdf, other

    cs.CV

    Improving Fine-tuning of Self-supervised Models with Contrastive Initialization

    Authors: Haolin Pan, Yong Guo, Qinyi Deng, Haomin Yang, Yiqun Chen, Jian Chen

    Abstract: Self-supervised learning (SSL) has achieved remarkable performance in pretraining the models that can be further used in downstream tasks via fine-tuning. However, these self-supervised models may not capture meaningful semantic information since the images belonging to the same class are always regarded as negative pairs in the contrastive loss. Consequently, the images of the same class are ofte… ▽ More

    Submitted 30 July, 2022; originally announced August 2022.

    Comments: 22 pages, 4 figures

  34. arXiv:2207.01806  [pdf, other

    cs.CV

    Aesthetic Attribute Assessment of Images Numerically on Mixed Multi-attribute Datasets

    Authors: Xin Jin, Xinning Li, Hao Lou, Chenyu Fan, Qiang Deng, Chaoen Xiao, Shuai Cui, Amit Kumar Singh

    Abstract: With the continuous development of social software and multimedia technology, images have become a kind of important carrier for spreading information and socializing. How to evaluate an image comprehensively has become the focus of recent researches. The traditional image aesthetic assessment methods often adopt single numerical overall assessment scores, which has certain subjectivity and can no… ▽ More

    Submitted 5 July, 2022; originally announced July 2022.

    Comments: 7 pages, 9figures, to appear: ACM Transactions on Multimedia Computing Communications and Applications (TOMM)

  35. arXiv:2206.09946  [pdf

    cs.CY cs.CV

    Short Video Uprising: How #BlackLivesMatter Content on TikTok Challenges the Protest Paradigm

    Authors: Yanru Jiang, Xin Jin, Qinhao Deng

    Abstract: This study uses TikTok (N = 8,173) to examine how short-form video platforms challenge the protest paradigm in the recent Black Lives Matter movement. A computer-mediated visual analysis, computer vision, is employed to identify the presence of four visual frames of protest (riot, confrontation, spectacle, and debate) in multimedia content. Results of descriptive statistics and the t-test indicate… ▽ More

    Submitted 20 June, 2022; originally announced June 2022.

    Comments: Workshop Proceedings of the 16th International AAAI Conference on Web and Social Media

  36. Investigating Accuracy-Novelty Performance for Graph-based Collaborative Filtering

    Authors: Minghao Zhao, Le Wu, Yile Liang, Lei Chen, Jian Zhang, Qilin Deng, Kai Wang, Xudong Shen, Tangjie Lv, Runze Wu

    Abstract: Recent years have witnessed the great accuracy performance of graph-based Collaborative Filtering (CF) models for recommender systems. By taking the user-item interaction behavior as a graph, these graph-based CF models borrow the success of Graph Neural Networks (GNN), and iteratively perform neighborhood aggregation to propagate the collaborative signals. While conventional CF models are known f… ▽ More

    Submitted 27 April, 2022; v1 submitted 26 April, 2022; originally announced April 2022.

    Comments: To appear in SIGIR 2022

  37. arXiv:2203.15334  [pdf, other

    cs.CV

    AnyFace: Free-style Text-to-Face Synthesis and Manipulation

    Authors: Jianxin Sun, Qiyao Deng, Qi Li, Muyi Sun, Min Ren, Zhenan Sun

    Abstract: Existing text-to-image synthesis methods generally are only applicable to words in the training dataset. However, human faces are so variable to be described with limited words. So this paper proposes the first free-style text-to-face method namely AnyFace enabling much wider open world applications such as metaverse, social media, cosmetics, forensics, etc. AnyFace has a novel two-stream framewor… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

  38. arXiv:2203.12373  [pdf, ps, other

    math.NA cs.CE

    A boundary-penalized isogeometric analysis for second-order hyperbolic equations

    Authors: Quanling Deng, Pouria Behnoudfar, Victor Calo

    Abstract: Explicit time-marching schemes are popular for solving time-dependent partial differential equations; one of the biggest challenges these methods suffer is increasing the critical time-marching step size that guarantees numerical stability. In general, there are two ways to increase the critical step size. One is to reduce the stiffness of the spatially discretized system, while the other is to de… ▽ More

    Submitted 23 March, 2022; originally announced March 2022.

  39. arXiv:2111.15018  [pdf, other

    cs.CV eess.SP

    Hyperspectral Image Segmentation based on Graph Processing over Multilayer Networks

    Authors: Songyang Zhang, Qinwen Deng, Zhi Ding

    Abstract: Hyperspectral imaging is an important sensing technology with broad applications and impact in areas including environmental science, weather, and geo/space exploration. One important task of hyperspectral image (HSI) processing is the extraction of spectral-spatial features. Leveraging on the recent-developed graph signal processing over multilayer networks (M-GSP), this work proposes several app… ▽ More

    Submitted 29 November, 2021; originally announced November 2021.

  40. arXiv:2111.00203  [pdf, other

    cs.CV cs.GR cs.MM

    Imitating Arbitrary Talking Style for Realistic Audio-DrivenTalking Face Synthesis

    Authors: Haozhe Wu, Jia Jia, Haoyu Wang, Yishun Dou, Chao Duan, Qingshan Deng

    Abstract: People talk with diversified styles. For one piece of speech, different talking styles exhibit significant differences in the facial and head pose movements. For example, the "excited" style usually talks with the mouth wide open, while the "solemn" style is more standardized and seldomly exhibits exaggerated motions. Due to such huge differences between different styles, it is necessary to incorp… ▽ More

    Submitted 30 October, 2021; originally announced November 2021.

    Comments: Accepted by MM2021, code available at https://github.com/wuhaozhe/style_avatar

    ACM Class: I.1.4

  41. arXiv:2110.11073  [pdf, other

    cs.IR cs.LG

    RL4RS: A Real-World Dataset for Reinforcement Learning based Recommender System

    Authors: Kai Wang, Zhene Zou, Minghao Zhao, Qilin Deng, Yue Shang, Yile Liang, Runze Wu, Xudong Shen, Tangjie Lyu, Changjie Fan

    Abstract: Reinforcement learning based recommender systems (RL-based RS) aim at learning a good policy from a batch of collected data, by casting recommendations to multi-step decision-making tasks. However, current RL-based RS research commonly has a large reality gap. In this paper, we introduce the first open-source real-world dataset, RL4RS, hoping to replace the artificial datasets and semi-simulated R… ▽ More

    Submitted 17 April, 2023; v1 submitted 18 October, 2021; originally announced October 2021.

    Comments: 4-th version, SIGIR2023

  42. arXiv:2108.03562  [pdf, other

    cs.DC cs.DB cs.NI cs.PF eess.SY

    Master Graduation Thesis: A Lightweight and Distributed Container-based Framework

    Authors: Qifan Deng, Rajkumar Buyya

    Abstract: Edge/Fog computing is a novel computing paradigm that provides resource-limited Internet of Things (IoT) devices with scalable computing and storage resources. Compared to cloud computing, edge/fog servers have fewer resources, but they can be accessed with higher bandwidth and less communication latency. Thus, integrating edge/fog and cloud infrastructures can support the execution of diverse lat… ▽ More

    Submitted 7 August, 2021; originally announced August 2021.

    Comments: https://github.com/cloudslab/fogbus2

  43. arXiv:2108.00591  [pdf, other

    cs.DC cs.NI cs.PF eess.SY

    Resource Management in Edge and Fog Computing using FogBus2 Framework

    Authors: Mohammad Goudarzi, Qifan Deng, Rajkumar Buyya

    Abstract: Edge/Fog computing is a novel computing paradigm that provides resource-limited Internet of Things (IoT) devices with scalable computing and storage resources. Compared to cloud computing, edge/fog servers have fewer resources, but they can be accessed with higher bandwidth and less communication latency. Thus, integrating edge/fog and cloud infrastructures can support the execution of diverse lat… ▽ More

    Submitted 1 August, 2021; originally announced August 2021.

    Comments: Software Availability: The source code of the FogBus2 framework and newly implemented IoT applications and scheduling policies are accessible from the CLOUDS Laboratory GitHub webpage: https://github.com/Cloudslab/FogBus2

  44. arXiv:2106.03034  [pdf, other

    math.OC cs.LG

    Minibatch and Momentum Model-based Methods for Stochastic Weakly Convex Optimization

    Authors: Qi Deng, Wenzhi Gao

    Abstract: Stochastic model-based methods have received increasing attention lately due to their appealing robustness to the stepsize selection and provable efficiency guarantee. We make two important extensions for improving model-based methods on stochastic weakly convex optimization. First, we propose new minibatch model-based methods by involving a set of samples to approximate the model function in each… ▽ More

    Submitted 12 November, 2021; v1 submitted 6 June, 2021; originally announced June 2021.

    Comments: 39 pages, 9 figures

  45. arXiv:2104.05307  [pdf, other

    cs.IR cs.LG

    Personalized Bundle Recommendation in Online Games

    Authors: Qilin Deng, Kai Wang, Minghao Zhao, Zhene Zou, Runze Wu, Jianrong Tao, Changjie Fan, Liang Chen

    Abstract: In business domains, \textit{bundling} is one of the most important marketing strategies to conduct product promotions, which is commonly used in online e-commerce and offline retailers. Existing recommender systems mostly focus on recommending individual items that users may be interested in. In this paper, we target at a practical but less explored recommendation problem named bundle recommendat… ▽ More

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: 8 pages, 10 figures, accepted paper on CIKM 2020

  46. arXiv:2104.02981  [pdf, other

    cs.IR cs.LG

    Reinforcement Learning with a Disentangled Universal Value Function for Item Recommendation

    Authors: Kai Wang, Zhene Zou, Qilin Deng, Runze Wu, Jianrong Tao, Changjie Fan, Liang Chen, Peng Cui

    Abstract: In recent years, there are great interests as well as challenges in applying reinforcement learning (RL) to recommendation systems (RS). In this paper, we summarize three key practical challenges of large-scale RL-based recommender systems: massive state and action spaces, high-variance environment, and the unspecific reward setting in recommendation. All these problems remain largely unexplored i… ▽ More

    Submitted 11 April, 2021; v1 submitted 7 April, 2021; originally announced April 2021.

    Comments: 9 pages, 4 figures, to be published in Proceedings of the AAAI Conference on Artificial Intelligence 2021

  47. arXiv:2104.01975  [pdf, other

    eess.IV cs.CV

    Cascaded Robust Learning at Imperfect Labels for Chest X-ray Segmentation

    Authors: Cheng Xue, Qiao Deng, Xiaomeng Li, Qi Dou, Pheng Ann Heng

    Abstract: The superior performance of CNN on medical image analysis heavily depends on the annotation quality, such as the number of labeled image, the source of image, and the expert experience. The annotation requires great expertise and labour. To deal with the high inter-rater variability, the study of imperfect label has great significance in medical image segmentation tasks. In this paper, we present… ▽ More

    Submitted 5 April, 2021; originally announced April 2021.

    Comments: 9pages, 4 figures. MICCAI 2020

  48. An Efficient Hypergraph Approach to Robust Point Cloud Resampling

    Authors: Qinwen Deng, Songyang Zhang, Zhi Ding

    Abstract: Efficient processing and feature extraction of largescale point clouds are important in related computer vision and cyber-physical systems. This work investigates point cloud resampling based on hypergraph signal processing (HGSP) to better explore the underlying relationship among different cloud points and to extract contour-enhanced features. Specifically, we design hypergraph spectral filters… ▽ More

    Submitted 11 March, 2021; originally announced March 2021.

  49. arXiv:2103.00657  [pdf, other

    cs.CV

    Achieving Competitive Play Through Bottom-Up Approach in Semantic Segmentation

    Authors: E. Pryzant, Q. Deng, B. Mei, E. Shrestha

    Abstract: With the renaissance of neural networks, object detection has slowly shifted from a bottom-up recognition problem to a top-down approach. Best in class algorithms enumerate a near-complete list of objects and classify each into object/not object. In this paper, we show that strong performance can still be achieved using a bottom-up approach for vision-based object recognition tasks and achieve com… ▽ More

    Submitted 28 February, 2021; originally announced March 2021.

  50. arXiv:2102.04682  [pdf, other

    cs.IT eess.SP

    OTFS Signaling for Uplink NOMA of Heterogeneous Mobility Users

    Authors: Yao Ge, Qinwen Deng, P. C. Ching, Zhi Ding

    Abstract: We investigate a coded uplink non-orthogonal multiple access (NOMA) configuration in which groups of co-channel users are modulated in accordance with orthogonal time frequency space (OTFS). We take advantage of OTFS characteristics to achieve NOMA spectrum sharing in the delay-Doppler domain between stationary and mobile users. We develop an efficient iterative turbo receiver based on the princip… ▽ More

    Submitted 9 February, 2021; originally announced February 2021.

    Comments: 31 pages, 10 figures, accepted by IEEE Transactions on Communications