Showing 1–50 of 229 results for author: Meng, D

  1. arXiv:2407.06633  [pdf, other]

    eess.IV cs.CV

    Variational Zero-shot Multispectral Pansharpening

    Authors: Xiangyu Rui, Xiangyong Cao, Yining Li, Deyu Meng

    Abstract: Pansharpening aims to generate a high spatial resolution multispectral image (HRMS) by fusing a low spatial resolution multispectral image (LRMS) and a panchromatic image (PAN). The most challenging issue for this task is that only the to-be-fused LRMS and PAN are available, and the existing deep learning-based methods are unsuitable since they rely on many training pairs. Traditional variational…

    Submitted 9 July, 2024; originally announced July 2024.

  2. arXiv:2407.02283  [pdf, other]

    cs.CV cs.AI

    A Refreshed Similarity-based Upsampler for Direct High-Ratio Feature Upsampling

    Authors: Minghao Zhou, Hong Wang, Yefeng Zheng, Deyu Meng

    Abstract: Feature upsampling is a fundamental and indispensable ingredient of almost all current network structures for image segmentation tasks. Recently, a popular similarity-based feature upsampling pipeline has been proposed, which utilizes a high-resolution feature as guidance to help upsample the low-resolution deep feature based on their local similarity. Albeit achieving promising performance, this…

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Codes are available at https://github.com/zmhhmz/ReSFU

  3. arXiv:2407.00132  [pdf, other]

    cs.SE cs.AI

    ShortcutsBench: A Large-Scale Real-world Benchmark for API-based Agents

    Authors: Haiyang Shen, Yue Li, Desong Meng, Dongqi Cai, Sheng Qi, Li Zhang, Mengwei Xu, Yun Ma

    Abstract: Recent advancements in integrating large language models (LLMs) with application programming interfaces (APIs) have gained significant interest in both academia and industry. These API-based agents, leveraging the strong autonomy and planning capabilities of LLMs, can efficiently solve problems requiring multi-step actions. However, their ability to handle multi-dimensional difficulty levels, dive…

    Submitted 28 June, 2024; originally announced July 2024.

  4. arXiv:2406.05936  [pdf, ps, other]

    cs.IT

    Multi-UAV Trajectory Design for Fair and Secure Communication

    Authors: Hongjiang Lei, Dongyang Meng, Haoxiang Ran, Ki-Hong Park, Gaofeng Pan, Mohamed-Slim Alouini

    Abstract: Unmanned aerial vehicles (UAVs) play an essential role in future wireless communication networks due to their high mobility, low cost, and on-demand deployment. In air-to-ground links, UAVs are widely used to enhance the performance of wireless communication systems due to the presence of high-probability line-of-sight (LoS) links. However, the high probability of LoS links also increases the risk…

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: 14 pages, 10 figures, submitted to IEEE Journal for review

  5. arXiv:2405.20044  [pdf, other]

    cs.CV

    A Point-Neighborhood Learning Framework for Nasal Endoscope Image Segmentation

    Authors: Pengyu Jie, Wanquan Liu, Chenqiang Gao, Yihui Wen, Rui He, Pengcheng Li, Jintao Zhang, Deyu Meng

    Abstract: Lesion segmentation on endoscopic images is challenging due to their complex and ambiguous features. Fully supervised deep learning segmentation methods can achieve good performance on entirely pixel-level labeled datasets but greatly increase experts' labeling burden. Semi-supervised and weakly supervised methods can ease the labeling burden but greatly increase the learning difficulty. To…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 10 pages, 10 figures

  6. arXiv:2405.17241  [pdf, other]

    cs.CV eess.IV

    NeurTV: Total Variation on the Neural Domain

    Authors: Yisi Luo, Xile Zhao, Kai Ye, Deyu Meng

    Abstract: Recently, we have witnessed the success of total variation (TV) for many imaging applications. However, traditional TV is defined on the original pixel domain, which limits its potential. In this work, we suggest a new TV regularization defined on the neural domain. Concretely, the discrete data is continuously and implicitly represented by a deep neural network (DNN), and we use the derivatives o…

    Submitted 27 May, 2024; originally announced May 2024.

    MSC Class: 94A08; 68U10; 68T45

  7. arXiv:2405.04788  [pdf, other]

    cs.CV

    DiffMatch: Visual-Language Guidance Makes Better Semi-supervised Change Detector

    Authors: Kaiyu Li, Xiangyong Cao, Yupeng Deng, Junmin Liu, Deyu Meng, Zhi Wang

    Abstract: Change Detection (CD) aims to identify pixels with semantic changes between images. However, annotating massive numbers of pixel-level images is labor-intensive and costly, especially for multi-temporal images, which require pixel-wise comparisons by human experts. Considering the excellent performance of visual language models (VLMs) for zero-shot, open-vocabulary, etc. with prompt-based reasonin…

    Submitted 22 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

    Comments: 13 pages, 5 figures

  8. arXiv:2404.10353  [pdf, other]

    cs.LG cs.SI

    Rethinking the Graph Polynomial Filter via Positive and Negative Coupling Analysis

    Authors: Haodong Wen, Bodong Du, Ruixun Liu, Deyu Meng, Xiangyong Cao

    Abstract: Recently, the optimization of polynomial filters within Spectral Graph Neural Networks (GNNs) has emerged as a prominent research focus. Existing spectral GNNs mainly emphasize polynomial properties in filter design, introducing computational overhead and neglecting the integration of crucial graph structure information. We argue that incorporating graph information into basis construction can enh…

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: 13 pages, 8 figures, 6 tables

  9. arXiv:2404.09172  [pdf, other]

    cs.CV cs.AI

    LoopAnimate: Loopable Salient Object Animation

    Authors: Fanyi Wang, Peng Liu, Haotian Hu, Dan Meng, Jingwen Su, Jinjin Xu, Yanhao Zhang, Xiaoming Ren, Zhiwang Zhang

    Abstract: Research on diffusion model-based video generation has advanced rapidly. However, limitations in object fidelity and generation length hinder its practical applications. Additionally, specific domains like animated wallpapers require seamless looping, where the first and last frames of the video match seamlessly. To address these challenges, this paper proposes LoopAnimate, a novel method for gene…

    Submitted 16 April, 2024; v1 submitted 14 April, 2024; originally announced April 2024.

  10. arXiv:2404.04941  [pdf, other]

    cs.CL

    Prompting Large Language Models for Zero-shot Essay Scoring via Multi-trait Specialization

    Authors: Sanwoo Lee, Yida Cai, Desong Meng, Ziyang Wang, Yunfang Wu

    Abstract: Advances in automated essay scoring (AES) have traditionally relied on labeled essays, requiring tremendous cost and expertise for their acquisition. Recently, large language models (LLMs) have achieved great success in various tasks, but their potential is less explored in AES. In this paper, we propose Multi Trait Specialization (MTS), a zero-shot prompting framework to elicit essay scoring capa…

    Submitted 7 April, 2024; originally announced April 2024.

  11. arXiv:2403.11614  [pdf, other]

    cs.CV

    CRS-Diff: Controllable Generative Remote Sensing Foundation Model

    Authors: Datao Tang, Xiangyong Cao, Xingsong Hou, Zhongyuan Jiang, Deyu Meng

    Abstract: The emergence of generative models has revolutionized the field of remote sensing (RS) image generation. Despite generating high-quality images, existing methods are limited in relying mainly on text control conditions and thus don't always generate images accurately and stably. In this paper, we propose CRS-Diff, a new RS generative foundation framework specifically tailored for RS image genera…

    Submitted 11 June, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

  12. arXiv:2403.10188  [pdf, other]

    cs.CR cs.AR

    Taiyi: A high-performance CKKS accelerator for Practical Fully Homomorphic Encryption

    Authors: Shengyu Fan, Xianglong Deng, Zhuoyu Tian, Zhicheng Hu, Liang Chang, Rui Hou, Dan Meng, Mingzhe Zhang

    Abstract: Fully Homomorphic Encryption (FHE), a novel cryptographic theory enabling computation directly on ciphertext data, offers significant security benefits but is hampered by substantial performance overhead. In recent years, a series of accelerator designs have significantly enhanced the performance of FHE applications, bringing them closer to real-world applicability. However, these accelerators fac…

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: 14 pages, 15 figures

  13. arXiv:2403.09993  [pdf, other]

    cs.CV eess.IV

    TRG-Net: An Interpretable and Controllable Rain Generator

    Authors: Zhiqiang Pang, Hong Wang, Qi Xie, Deyu Meng, Zongben Xu

    Abstract: Exploring and modeling the rain generation mechanism is critical for augmenting paired data to ease training of rainy image processing models. For this task, this study proposes a novel deep learning based rain generator, which fully takes the physical generation mechanism underlying rains into consideration and well encodes the learning of the fundamental rain factors (i.e., shape, orientation, l…

    Submitted 29 April, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

  14. arXiv:2403.06737  [pdf, other]

    cs.IR

    Post-Training Attribute Unlearning in Recommender Systems

    Authors: Chaochao Chen, Yizhao Zhang, Yuyuan Li, Dan Meng, Jun Wang, Xiaoli Zheng, Jianwei Yin

    Abstract: With the growing privacy concerns in recommender systems, recommendation unlearning is getting increasing attention. Existing studies predominantly use training data, i.e., model inputs, as the unlearning target. However, attackers can extract private information from the model even if it has not been explicitly encountered during training. We name this unseen information as "attribute" and tre…

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: text overlap with arXiv:2310.05847

  15. arXiv:2403.02901  [pdf, other]

    cs.AI

    A Comprehensive Survey on Process-Oriented Automatic Text Summarization with Exploration of LLM-Based Methods

    Authors: Hanlei Jin, Yang Zhang, Dan Meng, Jun Wang, Jinghua Tan

    Abstract: Automatic Text Summarization (ATS), utilizing Natural Language Processing (NLP) algorithms, aims to create concise and accurate summaries, thereby significantly reducing the human effort required in processing large volumes of text. ATS has drawn considerable interest in both academic and industrial circles. Many studies have been conducted in the past to survey ATS methods; however, they generall…

    Submitted 5 March, 2024; originally announced March 2024.

  16. arXiv:2403.02818  [pdf, other]

    cs.CV

    Are Dense Labels Always Necessary for 3D Object Detection from Point Cloud?

    Authors: Chenqiang Gao, Chuandong Liu, Jun Shu, Fangcen Liu, Jiang Liu, Luyu Yang, Xinbo Gao, Deyu Meng

    Abstract: Current state-of-the-art (SOTA) 3D object detection methods often require a large amount of 3D bounding box annotations for training. However, collecting such large-scale densely-supervised datasets is notoriously costly. To reduce the cumbersome data annotation process, we propose a novel sparsely-annotated framework, in which we just annotate one 3D object per scene. Such a sparse annotation str…

    Submitted 5 March, 2024; originally announced March 2024.

  17. arXiv:2403.00326  [pdf, other]

    cs.CV

    DAMSDet: Dynamic Adaptive Multispectral Detection Transformer with Competitive Query Selection and Adaptive Feature Fusion

    Authors: Junjie Guo, Chenqiang Gao, Fangcen Liu, Deyu Meng, Xinbo Gao

    Abstract: Infrared-visible object detection aims to achieve robust, even full-day, object detection by fusing the complementary information of infrared and visible images. However, highly dynamically variable complementary characteristics and commonly existing modality misalignment make the fusion of complementary information difficult. In this paper, we propose a Dynamic Adaptive Multispectral Detection Tran…

    Submitted 7 March, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

  18. arXiv:2402.15865  [pdf, other]

    cs.CV eess.IV

    HIR-Diff: Unsupervised Hyperspectral Image Restoration Via Improved Diffusion Models

    Authors: Li Pang, Xiangyu Rui, Long Cui, Hongzhong Wang, Deyu Meng, Xiangyong Cao

    Abstract: Hyperspectral image (HSI) restoration aims at recovering clean images from degraded observations and plays a vital role in downstream tasks. Existing model-based methods have limitations in accurately modeling the complex image characteristics with handcrafted priors, and deep learning-based methods suffer from poor generalization ability. To alleviate these issues, this paper proposes an unsupervis…

    Submitted 24 February, 2024; originally announced February 2024.

  19. arXiv:2402.15297  [pdf, other]

    cs.CV cs.LG

    Semi-supervised Counting via Pixel-by-pixel Density Distribution Modelling

    Authors: Hui Lin, Zhiheng Ma, Rongrong Ji, Yaowei Wang, Zhou Su, Xiaopeng Hong, Deyu Meng

    Abstract: This paper focuses on semi-supervised crowd counting, where only a small portion of the training data are labeled. We formulate the pixel-wise density value to regress as a probability distribution, instead of a single deterministic value. On this basis, we propose a semi-supervised crowd-counting model. Firstly, we design a pixel-wise distribution matching loss to measure the differences in the p…

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: This is the technical report of a paper that was submitted to IEEE Transactions and is now under review

  20. arXiv:2402.11438  [pdf, other]

    cs.CR cs.AR

    The Road to Trust: Building Enclaves within Confidential VMs

    Authors: Wenhao Wang, Linke Song, Benshan Mei, Shuang Liu, Shijun Zhao, Shoumeng Yan, XiaoFeng Wang, Dan Meng, Rui Hou

    Abstract: Integrity is critical for maintaining system security, as it ensures that only genuine software is loaded onto a machine. Although confidential virtual machines (CVMs) function within isolated environments separate from the host, it is important to recognize that users still encounter challenges in maintaining control over the integrity of the code running within the trusted execution environments…

    Submitted 31 May, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

  21. arXiv:2402.10983  [pdf, other]

    cs.LG cs.CR quant-ph

    Quantum-Inspired Analysis of Neural Network Vulnerabilities: The Role of Conjugate Variables in System Attacks

    Authors: Jun-Jie Zhang, Deyu Meng

    Abstract: Neural networks demonstrate inherent vulnerability to small, non-random perturbations, emerging as adversarial attacks. Such attacks, born from the gradient of the loss function relative to the input, are discerned as input conjugates, revealing a systemic fragility within the network structure. Intriguingly, a mathematical congruence manifests between this mechanism and the quantum physics' uncer…

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: 13 pages, 3 figures

  22. arXiv:2402.00407  [pdf, other]

    cs.CV

    InfMAE: A Foundation Model in Infrared Modality

    Authors: Fangcen Liu, Chenqiang Gao, Yaming Zhang, Junjie Guo, Jinhao Wang, Deyu Meng

    Abstract: In recent years, foundation models have swept the computer vision field and facilitated the development of various tasks within different modalities. However, it remains an open question how to design an infrared foundation model. In this paper, we propose InfMAE, a foundation model in infrared modality. We release an infrared dataset, called Inf30, to address the problem of lacking large-sc…

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 8 pages, 2 figures

  23. arXiv:2401.12392  [pdf, other]

    cs.RO cs.AI

    Evaluating Roadside Perception for Autonomous Vehicles: Insights from Field Testing

    Authors: Rusheng Zhang, Depu Meng, Shengyin Shen, Tinghan Wang, Tai Karir, Michael Maile, Henry X. Liu

    Abstract: Roadside perception systems are increasingly crucial in enhancing traffic safety and facilitating cooperative driving for autonomous vehicles. Despite rapid technological advancements, a major challenge persists for this newly arising field: the absence of standardized evaluation methods and benchmarks for these systems. This limitation hampers the ability to effectively assess and compare the per…

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: 6 figures, 8 tables, 14 pages

  24. arXiv:2401.03870  [pdf, other]

    cs.CV

    Gramformer: Learning Crowd Counting via Graph-Modulated Transformer

    Authors: Hui Lin, Zhiheng Ma, Xiaopeng Hong, Qinnan Shangguan, Deyu Meng

    Abstract: Transformer has been popular in recent crowd counting work since it breaks the limited receptive field of traditional CNNs. However, since crowd images always contain a large number of similar patches, the self-attention mechanism in Transformer tends to find a homogenized solution where the attention maps of almost all patches are identical. In this paper, we address this problem by proposing Gra…

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: This is the accepted version of the paper and supplemental material to appear in AAAI 2024. Please cite the final published version. Code is available at https://github.com/LoraLinH/Gramformer

  25. arXiv:2401.00708  [pdf, other]

    cs.CV eess.IV

    Revisiting Nonlocal Self-Similarity from Continuous Representation

    Authors: Yisi Luo, Xile Zhao, Deyu Meng

    Abstract: Nonlocal self-similarity (NSS) is an important prior that has been successfully applied in multi-dimensional data processing tasks, e.g., image and video recovery. However, existing NSS-based methods are solely suitable for meshgrid data such as images and videos, but are not suitable for emerging off-meshgrid data, e.g., point cloud and climate data. In this work, we revisit the NSS from the cont…

    Submitted 1 January, 2024; originally announced January 2024.

  26. arXiv:2312.15701  [pdf, other]

    eess.IV cs.CV cs.LG

    Rotation Equivariant Proximal Operator for Deep Unfolding Methods in Image Restoration

    Authors: Jiahong Fu, Qi Xie, Deyu Meng, Zongben Xu

    Abstract: The deep unfolding approach has attracted significant attention in computer vision tasks, which well connects conventional image processing modeling manners with more recent deep learning techniques. Specifically, by establishing a direct correspondence between algorithm operators at each implementation step and network modules within each layer, one can rationally construct an almost ``white box'…

    Submitted 25 December, 2023; originally announced December 2023.

  27. arXiv:2312.08853  [pdf, other]

    cs.CV

    Guided Image Restoration via Simultaneous Feature and Image Guided Fusion

    Authors: Xinyi Liu, Qian Zhao, Jie Liang, Hui Zeng, Deyu Meng, Lei Zhang

    Abstract: Guided image restoration (GIR), such as guided depth map super-resolution and pan-sharpening, aims to enhance a target image using guidance information from another image of the same scene. Currently, joint image filtering-inspired deep learning-based methods represent the state-of-the-art for GIR tasks. Those methods either deal with GIR in an end-to-end way by elaborately designing filtering-ori…

    Submitted 14 December, 2023; originally announced December 2023.

  28. arXiv:2312.01163  [pdf, other]

    cs.CV

    A New Learning Paradigm for Foundation Model-based Remote Sensing Change Detection

    Authors: Kaiyu Li, Xiangyong Cao, Deyu Meng

    Abstract: Change detection (CD) is a critical task to observe and analyze dynamic processes of land cover. Although numerous deep learning-based CD models have performed excellently, their further performance improvements are constrained by the limited knowledge extracted from the given labelled data. On the other hand, the foundation models that emerged recently contain a huge amount of knowledge by scalin…

    Submitted 11 February, 2024; v1 submitted 2 December, 2023; originally announced December 2023.

  29. arXiv:2310.12436  [pdf, other]

    math.ST

    Learning prediction function of prior measures for statistical inverse problems of partial differential equations

    Authors: Junxiong Jia, Deyu Meng, Zongben Xu, Fang Yao

    Abstract: In this paper, we view the statistical inverse problems of partial differential equations (PDEs) as PDE-constrained regression and focus on learning the prediction function of the prior probability measures. From this perspective, we propose general generalization bounds for learning infinite-dimensionally defined prior measures in the style of the probability approximately correct Bayesian learni…

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: 57 pages

    MSC Class: 62F15; 65N21

  30. arXiv:2310.05847  [pdf, other]

    cs.LG cs.AI cs.CR cs.IR

    Making Users Indistinguishable: Attribute-wise Unlearning in Recommender Systems

    Authors: Yuyuan Li, Chaochao Chen, Xiaolin Zheng, Yizhao Zhang, Zhongxuan Han, Dan Meng, Jun Wang

    Abstract: With the growing privacy concerns in recommender systems, recommendation unlearning, i.e., forgetting the impact of specific learned targets, is getting increasing attention. Existing studies predominantly use training data, i.e., model inputs, as the unlearning target. However, we find that attackers can extract private information, i.e., gender, race, and age, from a trained model even if it has…

    Submitted 6 October, 2023; originally announced October 2023.

    Journal ref: Proceedings of the 31st ACM International Conference on Multimedia (MM '23), October 29–November 3, 2023, Ottawa, ON, Canada

  31. arXiv:2310.05290  [pdf, other]

    cs.CV cs.RO eess.IV

    MSight: An Edge-Cloud Infrastructure-based Perception System for Connected Automated Vehicles

    Authors: Rusheng Zhang, Depu Meng, Shengyin Shen, Zhengxia Zou, Houqiang Li, Henry X. Liu

    Abstract: As vehicular communication and networking technologies continue to advance, infrastructure-based roadside perception emerges as a pivotal tool for connected automated vehicle (CAV) applications. Due to their elevated positioning, roadside sensors, including cameras and lidars, often enjoy unobstructed views with diminished object occlusion. This provides them a distinct advantage over onboard perc…

    Submitted 8 October, 2023; originally announced October 2023.

    Comments: Submitted to IEEE T-ITS

  32. arXiv:2310.02543  [pdf, other]

    cs.LG

    Provable Tensor Completion with Graph Information

    Authors: Kaidong Wang, Yao Wang, Xiuwu Liao, Shaojie Tang, Can Yang, Deyu Meng

    Abstract: Graphs, depicting the interrelations between variables, have been widely used as effective side information for accurate data recovery in various matrix/tensor recovery related applications. In this paper, we study the tensor completion problem with graph information. Current research on graph-regularized tensor completion tends to be task-specific, lacking generality and systematic approaches. Fur…

    Submitted 3 October, 2023; originally announced October 2023.

  33. arXiv:2309.15638  [pdf, other]

    eess.IV cs.CV cs.LG

    FRS-Nets: Fourier Parameterized Rotation and Scale Equivariant Networks for Retinal Vessel Segmentation

    Authors: Zihong Sun, Qi Xie, Deyu Meng

    Abstract: With translation equivariance, convolutional neural networks (CNNs) have achieved great success in retinal vessel segmentation. However, some other symmetries of the vascular morphology are not characterized by CNNs, such as rotation and scale symmetries. To embed more equivariance into CNNs and achieve the accuracy requirement for retinal vessel segmentation, we construct a novel convolution operat…

    Submitted 27 September, 2023; originally announced September 2023.

  34. arXiv:2309.01627  [pdf, other]

    cs.CV

    Cross-Consistent Deep Unfolding Network for Adaptive All-In-One Video Restoration

    Authors: Yuanshuo Cheng, Mingwen Shao, Yecong Wan, Yuanjian Qiao, Wangmeng Zuo, Deyu Meng

    Abstract: Existing Video Restoration (VR) methods always necessitate the individual deployment of models for each adverse weather to remove diverse adverse weather degradations, lacking the capability for adaptive processing of degradations. Such limitation amplifies the complexity and deployment costs in practical applications. To overcome this deficiency, in this paper, we propose a Cross-consistent Deep…

    Submitted 10 December, 2023; v1 submitted 4 September, 2023; originally announced September 2023.

    Comments: 16 pages, 13 figures

  35. arXiv:2309.01483  [pdf, other]

    cs.CV cs.LG

    CA2: Class-Agnostic Adaptive Feature Adaptation for One-class Classification

    Authors: Zilong Zhang, Zhibin Zhao, Deyu Meng, Xingwu Zhang, Xuefeng Chen

    Abstract: One-class classification (OCC), i.e., identifying whether an example belongs to the same distribution as the training data, is essential for deploying machine learning models in the real world. Adapting the pre-trained features on the target dataset has proven to be a promising paradigm for improving OCC performance. Existing methods are constrained by assumptions about the number of classes. This…

    Submitted 4 September, 2023; originally announced September 2023.

    Comments: Submitted to AAAI 2024

  36. arXiv:2308.16612  [pdf, other]

    cs.CV eess.IV

    Neural Gradient Regularizer

    Authors: Shuang Xu, Yifan Wang, Zixiang Zhao, Jiangjun Peng, Xiangyong Cao, Deyu Meng, Yulun Zhang, Radu Timofte, Luc Van Gool

    Abstract: Owing to its significant success, the prior imposed on gradient maps has consistently been a subject of great interest in the field of image processing. Total variation (TV), one of the most representative regularizers, is known for its ability to capture the intrinsic sparsity prior underlying gradient maps. Nonetheless, TV and its variants often underestimate the gradient maps, leading to the we…

    Submitted 13 September, 2023; v1 submitted 31 August, 2023; originally announced August 2023.

  37. arXiv:2308.07537  [pdf, other]

    cs.CV

    AttMOT: Improving Multiple-Object Tracking by Introducing Auxiliary Pedestrian Attributes

    Authors: Yunhao Li, Zhen Xiao, Lin Yang, Dan Meng, Xin Zhou, Heng Fan, Libo Zhang

    Abstract: Multi-object tracking (MOT) is a fundamental problem in computer vision with numerous applications, such as intelligent surveillance and automated driving. Despite the significant progress made in MOT, pedestrian attributes, such as gender, hairstyle, body shape, and clothing features, which contain rich and high-level information, have been less explored. To address this gap, we propose a simple,…

    Submitted 14 August, 2023; originally announced August 2023.

  38. arXiv:2308.07155  [pdf, other]

    astro-ph.CO gr-qc hep-th

    Full analysis of the scalar-induced gravitational waves for the curvature perturbation with local-type non-Gaussianities

    Authors: Chen Yuan, De-Shuang Meng, Qing-Guo Huang

    Abstract: Primordial black holes (PBHs) are supposed to form through the gravitational collapse of regions with large density fluctuations. The formation of PBHs inevitably leads to the emission of scalar-induced gravitational wave (SIGW) signals, offering a unique opportunity to test the hypothesis of PBHs as a constituent of dark matter (DM). Previous studies have calculated the energy spectrum of SIGWs i…

    Submitted 17 February, 2024; v1 submitted 14 August, 2023; originally announced August 2023.

    Comments: 21 pages, 2 figures; version accepted for publication in JCAP

  39. arXiv:2308.06925  [pdf, other]

    cs.LG cs.CV

    CBA: Improving Online Continual Learning via Continual Bias Adaptor

    Authors: Quanziang Wang, Renzhen Wang, Yichen Wu, Xixi Jia, Deyu Meng

    Abstract: Online continual learning (CL) aims to learn new knowledge and consolidate previously learned knowledge from non-stationary data streams. Due to the time-varying training setting, the model learned from a changing distribution easily forgets the previously learned knowledge and biases toward the newly received task. To address this problem, we propose a Continual Bias Adaptor (CBA) module to augme…

    Submitted 14 August, 2023; originally announced August 2023.

    Comments: Accepted by ICCV 2023

  40. arXiv:2308.06774  [pdf, other]

    cs.CV cs.AI

    Dual Meta-Learning with Longitudinally Generalized Regularization for One-Shot Brain Tissue Segmentation Across the Human Lifespan

    Authors: Yongheng Sun, Fan Wang, Jun Shu, Haifeng Wang, Li Wang, Deyu Meng, Chunfeng Lian

    Abstract: Brain tissue segmentation is essential for neuroscience and clinical studies. However, segmentation on longitudinal data is challenging due to dynamic brain changes across the lifespan. Previous research mainly focuses on self-supervision with regularizations and loses longitudinal generalization when fine-tuning on a specific age group. In this paper, we propose a dual meta-learning paradigm…

    Submitted 13 August, 2023; originally announced August 2023.

    Comments: ICCV 2023

  41. arXiv:2306.17799  [pdf, other]

    cs.CV cs.SD eess.AS

    A Low-rank Matching Attention based Cross-modal Feature Fusion Method for Conversational Emotion Recognition

    Authors: Yuntao Shou, Xiangyong Cao, Deyu Meng, Bo Dong, Qinghua Zheng

    Abstract: Conversational emotion recognition (CER) is an important research topic in human-computer interactions. Although deep learning (DL) based CER approaches have achieved excellent performance, existing cross-modal feature fusion methods used in these DL-based approaches either ignore the intra-modal and inter-modal emotional interaction or have high computational complexity. To address these issues,…

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: 10 pages, 4 figures

  42. arXiv:2306.17798  [pdf, other]

    cs.CV

    Masked Contrastive Graph Representation Learning for Age Estimation

    Authors: Yuntao Shou, Xiangyong Cao, Deyu Meng

    Abstract: Age estimation of face images is a crucial task with various practical applications in areas such as video surveillance and Internet access control. While deep learning-based age estimation frameworks, e.g., convolutional neural network (CNN), multi-layer perceptrons (MLP), and transformers have shown remarkable performance, they have limitations when modelling complex or irregular objects in an i…

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: 10 pages, 7 figures

  43. arXiv:2306.17797  [pdf, other]

    cs.CV eess.IV

    HIDFlowNet: A Flow-Based Deep Network for Hyperspectral Image Denoising

    Authors: Li Pang, Weizhen Gu, Xiangyong Cao, Xiangyu Rui, Jiangjun Peng, Shuang Xu, Gang Yang, Deyu Meng

    Abstract: Hyperspectral image (HSI) denoising is essentially ill-posed since a noisy HSI can be degraded from multiple clean HSIs. However, current deep learning-based approaches ignore this fact and restore the clean image with deterministic mapping (i.e., the network receives a noisy HSI and outputs a clean HSI). To alleviate this issue, this paper proposes a flow-based HSI denoising network (HIDFlowNet)…

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: 10 pages, 8 figures

  44. arXiv:2306.17417  [pdf, other]

    cs.DC

    Hashing-Based Distributed Clustering for Massive High-Dimensional Data

    Authors: Yifeng Xiao, Jiang Xue, Deyu Meng

    Abstract: Clustering analysis is of substantial significance for data mining. The properties of big data raise higher demand for more efficient and economical distributed clustering methods. However, existing distributed clustering methods mainly focus on the size of data but ignore possible problems caused by data dimension. To solve this problem, we propose a new distributed algorithm, referred to as Hash…

    Submitted 30 June, 2023; originally announced June 2023.

    Comments: 12 pages, 6 figures, 50 references, submitted to TBD

    ACM Class: I.2.11

  45. arXiv:2306.17302  [pdf, other]

    cs.CV cs.RO eess.IV

    Robust Roadside Perception: an Automated Data Synthesis Pipeline Minimizing Human Annotation

    Authors: Rusheng Zhang, Depu Meng, Lance Bassett, Shengyin Shen, Zhengxia Zou, Henry X. Liu

    Abstract: Recently, advancements in vehicle-to-infrastructure communication technologies have elevated the significance of infrastructure-based roadside perception systems for cooperative driving. This paper delves into one of its most pivotal challenges: data insufficiency. The lack of high-quality labeled roadside sensor data with high diversity leads to low robustness and low transferability of curr…

    Submitted 8 February, 2024; v1 submitted 29 June, 2023; originally announced June 2023.

    Comments: Accepted by IEEE Transactions on Intelligent Vehicles

  46. arXiv:2305.10925  [pdf, other]

    cs.CV eess.IV

    Unsupervised Hyperspectral Pansharpening via Low-rank Diffusion Model

    Authors: Xiangyu Rui, Xiangyong Cao, Li Pang, Zeyu Zhu, Zongsheng Yue, Deyu Meng

    Abstract: Hyperspectral pansharpening is a process of merging a high-resolution panchromatic (PAN) image and a low-resolution hyperspectral (LRHS) image to create a single high-resolution hyperspectral (HRHS) image. Existing Bayesian-based HS pansharpening methods require designing handcrafted image priors to characterize the image features, and deep learning-based HS pansharpening methods usually require a la…

    Submitted 19 November, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

  47. arXiv:2305.07892  [pdf, other]

    cs.LG cs.AI cs.CV

    DAC-MR: Data Augmentation Consistency Based Meta-Regularization for Meta-Learning

    Authors: Jun Shu, Xiang Yuan, Deyu Meng, Zongben Xu

    Abstract: Meta-learning has recently been heavily researched and has helped advance contemporary machine learning. However, achieving a well-performing meta-learning model requires a large number of training tasks with high-quality meta-data representing the underlying task generalization goal, which is sometimes difficult and expensive to obtain for real applications. Current meta-data-driven meta-learning a…

    Submitted 13 May, 2023; originally announced May 2023.

    Comments: 27 pages

  48. arXiv:2305.07774  [pdf, other]

    cs.CV eess.IV

    PanFlowNet: A Flow-Based Deep Network for Pan-sharpening

    Authors: Gang Yang, Xiangyong Cao, Wenzhe Xiao, Man Zhou, Aiping Liu, Xun Chen, Deyu Meng

    Abstract: Pan-sharpening aims to generate a high-resolution multispectral (HRMS) image by integrating the spectral information of a low-resolution multispectral (LRMS) image with the texture details of a high-resolution panchromatic (PAN) image. It essentially inherits the ill-posed nature of the super-resolution (SR) task, in that diverse HRMS images can degrade into the same LRMS image. However, existing deep learn…

    Submitted 16 May, 2023; v1 submitted 12 May, 2023; originally announced May 2023.

  49. T-former: An Efficient Transformer for Image Inpainting

    Authors: Ye Deng, Siqi Hui, Sanping Zhou, Deyu Meng, Jinjun Wang

    Abstract: Benefiting from powerful convolutional neural networks (CNNs), learning-based image inpainting methods have made significant breakthroughs over the years. However, some properties of CNNs (e.g., local priors, spatially shared parameters) limit performance on broken images with diverse and complex forms. Recently, a class of attention-based network architectures, called transformers, has s…

    Submitted 18 May, 2023; v1 submitted 12 May, 2023; originally announced May 2023.

    Journal ref: ACM Multimedia 2022

  50. arXiv:2304.05610  [pdf]

    cs.RO

    Vehicle Trajectory Prediction based Predictive Collision Risk Assessment for Autonomous Driving in Highway Scenarios

    Authors: Dejian Meng, Wei Xiao, Lijun Zhang, Zhuang Zhang, Zihao Liu

    Abstract: For driving safely and efficiently in highway scenarios, autonomous vehicles (AVs) must be able to predict future behaviors of surrounding object vehicles (OVs), and assess collision risk accurately for reasonable decision-making. Aiming at autonomous driving in highway scenarios, a predictive collision risk assessment method based on trajectory prediction of OVs is proposed in this paper. Firstly…

    Submitted 12 April, 2023; originally announced April 2023.

    Comments: manuscript submitted to IEEE Transactions on Intelligent Transportation Systems