Skip to main content

Showing 1–50 of 210 results for author: Deng, W

  1. arXiv:2407.02886  [pdf, other

    cs.CR

    A Wolf in Sheep's Clothing: Practical Black-box Adversarial Attacks for Evading Learning-based Windows Malware Detection in the Wild

    Authors: Xiang Ling, Zhiyu Wu, Bin Wang, Wei Deng, Jingzheng Wu, Shouling Ji, Tianyue Luo, Yanjun Wu

    Abstract: Given the remarkable achievements of existing learning-based malware detection in both academia and industry, this paper presents MalGuise, a practical black-box adversarial attack framework that evaluates the security risks of existing learning-based Windows malware detection systems under the black-box setting. MalGuise first employs a novel semantics-preserving transformation of call-based redi… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: This paper has been accepted by 33rd USENIX Security Symposium 2024

  2. arXiv:2406.18957  [pdf, other

    cs.DC cs.GT

    A Treatment of EIP-1559: Enhancing Transaction Fee Mechanism through Nth-Price Auction

    Authors: Kun Li, Guangpeng Qi, Guangyong Shang, Wanli Deng, Minghui Xu, Xiuzhen Cheng

    Abstract: With the widespread adoption of blockchain technology, the transaction fee mechanism (TFM) in blockchain systems has become a prominent research topic. An ideal TFM should satisfy user incentive compatibility (UIC), miner incentive compatibility (MIC), and miner-user side contract proofness ($c$-SCP). However, state-of-the-art works either fail to meet these three properties simultaneously or only… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  3. arXiv:2406.11147  [pdf, other

    cs.SE cs.AI

    Vul-RAG: Enhancing LLM-based Vulnerability Detection via Knowledge-level RAG

    Authors: Xueying Du, Geng Zheng, Kaixin Wang, Jiayi Feng, Wentai Deng, Mingwei Liu, Bihuan Chen, Xin Peng, Tao Ma, Yiling Lou

    Abstract: Vulnerability detection is essential for software quality assurance. In recent years, deep learning models (especially large language models) have shown promise in vulnerability detection. In this work, we propose a novel LLM-based vulnerability detection technique Vul-RAG, which leverages knowledge-level retrieval-augmented generation (RAG) framework to detect vulnerability for the given code in… ▽ More

    Submitted 19 June, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

  4. arXiv:2406.09908  [pdf, other

    cs.LG cs.CV

    What Does Softmax Probability Tell Us about Classifiers Ranking Across Diverse Test Conditions?

    Authors: Weijie Tu, Weijian Deng, Liang Zheng, Tom Gedeon

    Abstract: This work aims to develop a measure that can accurately rank the performance of various classifiers when they are tested on unlabeled data from out-of-distribution (OOD) distributions. We commence by demonstrating that conventional uncertainty metrics, notably the maximum Softmax prediction probability, possess inherent utility in forecasting model generalization across certain OOD contexts. Build… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: TMLR 2024 (https://openreview.net/forum?id=vtiDUgGjyx)

  5. arXiv:2406.08772  [pdf, other

    cs.CV cs.CL

    MMFakeBench: A Mixed-Source Multimodal Misinformation Detection Benchmark for LVLMs

    Authors: Xuannan Liu, Zekun Li, Peipei Li, Shuhan Xia, Xing Cui, Linzhi Huang, Huaibo Huang, Weihong Deng, Zhaofeng He

    Abstract: Current multimodal misinformation detection (MMD) methods often assume a single source and type of forgery for each sample, which is insufficient for real-world scenarios where multiple forgery sources coexist. The lack of a benchmark for mixed-source misinformation has hindered progress in this field. To address this, we introduce MMFakeBench, the first comprehensive benchmark for mixed-source MM… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  6. arXiv:2405.18979  [pdf, other

    cs.LG stat.ML

    MANO: Exploiting Matrix Norm for Unsupervised Accuracy Estimation Under Distribution Shifts

    Authors: Renchunzi Xie, Ambroise Odonnat, Vasilii Feofanov, Weijian Deng, Jianfeng Zhang, Bo An

    Abstract: Leveraging the models' outputs, specifically the logits, is a common approach to estimating the test accuracy of a pre-trained neural network on out-of-distribution (OOD) samples without requiring access to the corresponding ground truth labels. Despite their ease of implementation and computational efficiency, current logit-based methods are vulnerable to overconfidence issues, leading to predict… ▽ More

    Submitted 24 June, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

    Comments: The three first authors contributed equally

  7. arXiv:2405.14280  [pdf, other

    cs.IR

    ASI++: Towards Distributionally Balanced End-to-End Generative Retrieval

    Authors: Yuxuan Liu, Tianchi Yang, Zihan Zhang, Minghui Song, Haizhen Huang, Weiwei Deng, Feng Sun, Qi Zhang

    Abstract: Generative retrieval, a promising new paradigm in information retrieval, employs a seq2seq model to encode document features into parameters and decode relevant document identifiers (IDs) based on search queries. Existing generative retrieval solutions typically rely on a preprocessing stage to pre-define document IDs, which can suffer from a semantic gap between these IDs and the retrieval task.… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  8. arXiv:2405.12130  [pdf, other

    cs.CL cs.LG

    MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning

    Authors: Ting Jiang, Shaohan Huang, Shengyue Luo, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang, Deqing Wang, Fuzhen Zhuang

    Abstract: Low-rank adaptation is a popular parameter-efficient fine-tuning method for large language models. In this paper, we analyze the impact of low-rank updating, as implemented in LoRA. Our findings suggest that the low-rank updating mechanism may limit the ability of LLMs to effectively learn and memorize new knowledge. Inspired by this observation, we propose a new method called MoRA, which employs… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: Work in Progress

  9. arXiv:2405.07839  [pdf, other

    cs.LG cs.AI stat.ML

    Constrained Exploration via Reflected Replica Exchange Stochastic Gradient Langevin Dynamics

    Authors: Haoyang Zheng, Hengrong Du, Qi Feng, Wei Deng, Guang Lin

    Abstract: Replica exchange stochastic gradient Langevin dynamics (reSGLD) is an effective sampler for non-convex learning in large-scale datasets. However, the simulation may encounter stagnation issues when the high-temperature chain delves too deeply into the distribution tails. To tackle this issue, we propose reflected reSGLD (r2SGLD): an algorithm tailored for constrained non-convex exploration by util… ▽ More

    Submitted 3 June, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

    Comments: 28 pages, 13 figures

  10. arXiv:2405.04795  [pdf, other

    cs.LG

    Variational Schrödinger Diffusion Models

    Authors: Wei Deng, Weijian Luo, Yixin Tan, Marin Biloš, Yu Chen, Yuriy Nevmyvaka, Ricky T. Q. Chen

    Abstract: Schrödinger bridge (SB) has emerged as the go-to method for optimizing transportation plans in diffusion models. However, SB requires estimating the intractable forward score functions, inevitably resulting in the costly implicit training loss based on simulated trajectories. To improve the scalability while preserving efficient transportation plans, we leverage variational inference to linearize… ▽ More

    Submitted 19 June, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: ICML 2024

  11. arXiv:2405.02241  [pdf, other

    cs.RO

    WeightedPose: Generalizable Cross-Pose Estimation via Weighted SVD

    Authors: Xuxin Cheng, Heng Yu, Harry Zhang, Wenxing Deng

    Abstract: We introduce a new approach for robotic manipulation tasks in human settings that necessitates understanding the 3D geometric connections between a pair of objects. Conventional end-to-end training approaches, which convert pixel observations directly into robot actions, often fail to effectively understand complex pose relationships and do not easily adapt to new object configurations. To overcom… ▽ More

    Submitted 21 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: text overlap with arXiv:2211.09325

  12. arXiv:2404.17227  [pdf, other

    econ.GN cs.CE cs.CR cs.CY q-fin.RM

    Trust Dynamics and Market Behavior in Cryptocurrency: A Comparative Study of Centralized and Decentralized Exchanges

    Authors: Xintong Wu, Wanling Deng, Yuotng Quan, Luyao Zhang

    Abstract: In the evolving landscape of digital finance, the transition from centralized to decentralized trust mechanisms, primarily driven by blockchain technology, plays a critical role in shaping the cryptocurrency ecosystem. This paradigm shift raises questions about the traditional reliance on centralized trust and introduces a novel, decentralized trust framework built upon distributed networks. Our r… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  13. arXiv:2404.16484  [pdf, other

    cs.CV eess.IV

    Real-Time 4K Super-Resolution of Compressed AVIF Images. AIS 2024 Challenge Survey

    Authors: Marcos V. Conde, Zhijun Lei, Wen Li, Cosmin Stejerean, Ioannis Katsavounidis, Radu Timofte, Kihwan Yoon, Ganzorig Gankhuyag, Jiangtao Lv, Long Sun, Jinshan Pan, Jiangxin Dong, Jinhui Tang, Zhiyuan Li, Hao Wei, Chenyang Ge, Dongyang Zhang, Tianle Liu, Huaian Chen, Yi Jin, Menghan Zhou, Yiqiang Yan, Si Gao, Biao Wu, Shaoli Liu , et al. (50 additional authors not shown)

    Abstract: This paper introduces a novel benchmark as part of the AIS 2024 Real-Time Image Super-Resolution (RTSR) Challenge, which aims to upscale compressed images from 540p to 4K resolution (4x factor) in real-time on commercial GPUs. For this, we use a diverse test set containing a variety of 4K images ranging from digital art to gaming and photography. The images are compressed using the modern AVIF cod… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: CVPR 2024, AI for Streaming (AIS) Workshop

  14. arXiv:2404.14248  [pdf, other

    cs.CV

    NTIRE 2024 Challenge on Low Light Image Enhancement: Methods and Results

    Authors: Xiaoning Liu, Zongwei Wu, Ao Li, Florin-Alexandru Vasluianu, Yulun Zhang, Shuhang Gu, Le Zhang, Ce Zhu, Radu Timofte, Zhi Jin, Hongjun Wu, Chenxi Wang, Haitao Ling, Yuanhao Cai, Hao Bian, Yuxin Zheng, Jing Lin, Alan Yuille, Ben Shao, Jin Guo, Tianli Liu, Mohao Wu, Yixu Feng, Shuo Hou, Haotian Lin , et al. (87 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2024 low light image enhancement challenge, highlighting the proposed solutions and results. The aim of this challenge is to discover an effective network design or solution capable of generating brighter, clearer, and visually appealing results when dealing with a variety of conditions, including ultra-high resolution (4K and beyond), non-uniform illumination, backlig… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: NTIRE 2024 Challenge Report

  15. arXiv:2404.00563  [pdf, other

    cs.CV

    Exploiting Inter-sample and Inter-feature Relations in Dataset Distillation

    Authors: Wenxiao Deng, Wenbin Li, Tianyu Ding, Lei Wang, Hongguang Zhang, Kuihua Huang, Jing Huo, Yang Gao

    Abstract: Dataset distillation has emerged as a promising approach in deep learning, enabling efficient training with small synthetic datasets derived from larger real ones. Particularly, distribution matching-based distillation methods attract attention thanks to its effectiveness and low computational cost. However, these methods face two primary limitations: the dispersed feature distribution within the… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: Accepted to CVPR 2024

  16. arXiv:2403.19322  [pdf, other

    cs.CV cs.CL

    Plug-and-Play Grounding of Reasoning in Multimodal Large Language Models

    Authors: Jiaxing Chen, Yuxuan Liu, Dehu Li, Xiang An, Weimo Deng, Ziyong Feng, Yongle Zhao, Yin Xie

    Abstract: The rise of Multimodal Large Language Models (MLLMs), renowned for their advanced instruction-following and reasoning capabilities, has significantly propelled the field of visual reasoning. However, due to limitations in their image tokenization processes, most MLLMs struggle to capture fine details of text and objects in images, especially in high-resolution samples. To overcome this limitation,… ▽ More

    Submitted 18 June, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: 15 pages, 8 figures

  17. arXiv:2403.17752  [pdf, other

    cs.CL

    Can multiple-choice questions really be useful in detecting the abilities of LLMs?

    Authors: Wangyue Li, Liangzhi Li, Tong Xiang, Xiao Liu, Wei Deng, Noa Garcia

    Abstract: Multiple-choice questions (MCQs) are widely used in the evaluation of large language models (LLMs) due to their simplicity and efficiency. However, there are concerns about whether MCQs can truly measure LLM's capabilities, particularly in knowledge-intensive scenarios where long-form generation (LFG) answers are required. The misalignment between the task and the evaluation method demands a thoug… ▽ More

    Submitted 23 May, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: LREC-COLING 2024

  18. arXiv:2403.14760  [pdf, other

    cs.CV

    Can 3D Vision-Language Models Truly Understand Natural Language?

    Authors: Weipeng Deng, Jihan Yang, Runyu Ding, Jiahui Liu, Yijiang Li, Xiaojuan Qi, Edith Ngai

    Abstract: Rapid advancements in 3D vision-language (3D-VL) tasks have opened up new avenues for human interaction with embodied agents or robots using natural language. Despite this progress, we find a notable limitation: existing 3D-VL models exhibit sensitivity to the styles of language input, struggling to understand sentences with the same semantic meaning but written in different variants. This observa… ▽ More

    Submitted 3 July, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: https://github.com/VincentDENGP/3D-LR

  19. arXiv:2403.10873  [pdf, other

    cs.IT eess.SP

    CSI Transfer From Sub-6G to mmWave: Reduced-Overhead Multi-User Hybrid Beamforming

    Authors: Weicao Deng, Min Li, Ming-Min Zhao, Min-Jian Zhao, Osvaldo Simeone

    Abstract: Hybrid beamforming is vital in modern wireless systems, especially for massive MIMO and millimeter-wave deployments, offering efficient directional transmission with reduced hardware complexity. However, effective beamforming in multi-user scenarios relies heavily on accurate channel state information, the acquisition of which often incurs excessive pilot overhead, degrading system performance. To… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: 13 pages, 12 figures, submitted

  20. arXiv:2403.09500  [pdf, other

    cs.CV

    Faceptor: A Generalist Model for Face Perception

    Authors: Lixiong Qin, Mei Wang, Xuannan Liu, Yuhang Zhang, Wei Deng, Xiaoshuai Song, Weiran Xu, Weihong Deng

    Abstract: With the comprehensive research conducted on various face analysis tasks, there is a growing interest among researchers to develop a unified approach to face perception. Existing methods mainly discuss unified representation and training, which lack task extensibility and application efficiency. To tackle this issue, we focus on the unified model structure, exploring a face generalist model. As an… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  21. arXiv:2403.06529  [pdf, other

    cs.CV

    Confidence-Aware RGB-D Face Recognition via Virtual Depth Synthesis

    Authors: Zijian Chen, Mei Wang, Weihong Deng, Hongzhi Shi, Dongchao Wen, Yingjie Zhang, Xingchen Cui, Jian Zhao

    Abstract: 2D face recognition encounters challenges in unconstrained environments due to varying illumination, occlusion, and pose. Recent studies focus on RGB-D face recognition to improve robustness by incorporating depth information. However, collecting sufficient paired RGB-D training data is expensive and time-consuming, hindering wide deployment. In this work, we first construct a diverse depth datase… ▽ More

    Submitted 16 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: 9 pages, 5 figures

  22. arXiv:2403.06104  [pdf, other

    cs.CV

    Universal Debiased Editing on Foundation Models for Fair Medical Image Classification

    Authors: Ruinan Jin, Wenlong Deng, Minghui Chen, Xiaoxiao Li

    Abstract: In the era of Foundation Models' (FMs) rising prominence in AI, our study addresses the challenge of biases in medical images while using FM API, particularly spurious correlations between pixels and sensitive attributes. Traditional methods for bias mitigation face limitations due to the restricted access to web-hosted FMs and difficulties in addressing the underlying bias encoded within the FM A… ▽ More

    Submitted 16 March, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

  23. arXiv:2403.05523  [pdf, other

    cs.CV

    Beyond Finite Data: Towards Data-free Out-of-distribution Generalization via Extrapolation

    Authors: Yijiang Li, Sucheng Ren, Weipeng Deng, Yuzhi Xu, Ying Gao, Edith Ngai, Haohan Wang

    Abstract: Out-of-distribution (OOD) generalization is a favorable yet challenging property for deep neural networks. The core challenges lie in the limited availability of source domains that help models learn an invariant representation from the spurious features. Various domain augmentation have been proposed but largely rely on interpolating existing domains and frequently face difficulties in creating t… ▽ More

    Submitted 11 March, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Comments: Preprint. Paper under review

  24. arXiv:2403.01988  [pdf, other

    cs.CL

    FakeNewsGPT4: Advancing Multimodal Fake News Detection through Knowledge-Augmented LVLMs

    Authors: Xuannan Liu, Peipei Li, Huaibo Huang, Zekun Li, Xing Cui, Jiahao Liang, Lixiong Qin, Weihong Deng, Zhaofeng He

    Abstract: The massive generation of multimodal fake news exhibits substantial distribution discrepancies, prompting the need for generalized detectors. However, the insulated nature of training within specific domains restricts the capability of classical detectors to obtain open-world facts. In this paper, we propose FakeNewsGPT4, a novel framework that augments Large Vision-Language Models (LVLMs) with fo… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  25. arXiv:2402.18039  [pdf, other

    cs.CL cs.AI

    ResLoRA: Identity Residual Mapping in Low-Rank Adaption

    Authors: Shuhua Shi, Shaohan Huang, Minghui Song, Zhoujun Li, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang

    Abstract: As one of the most popular parameter-efficient fine-tuning (PEFT) methods, low-rank adaptation (LoRA) is commonly applied to fine-tune large language models (LLMs). However, updating the weights of LoRA blocks effectively and expeditiously is challenging due to the long calculation path in the original model. To address this, we propose ResLoRA, an improved framework of LoRA. By adding residual pa… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 14 pages, 7 figures

  26. arXiv:2402.15754  [pdf, other

    cs.CL

    HD-Eval: Aligning Large Language Model Evaluators Through Hierarchical Criteria Decomposition

    Authors: Yuxuan Liu, Tianchi Yang, Shaohan Huang, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang

    Abstract: Large language models (LLMs) have emerged as a promising alternative to expensive human evaluations. However, the alignment and coverage of LLM-based evaluations are often limited by the scope and potential bias of the evaluation prompts and criteria. To address this challenge, we propose HD-Eval, a novel framework that iteratively aligns LLM-based evaluators with human preference via Hierarchical… ▽ More

    Submitted 24 February, 2024; originally announced February 2024.

    Comments: 20 pages, 13 figures

  27. arXiv:2402.14843  [pdf, other

    cs.CL cs.AI cs.LG

    Text Diffusion with Reinforced Conditioning

    Authors: Yuxuan Liu, Tianchi Yang, Shaohan Huang, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang

    Abstract: Diffusion models have demonstrated exceptional capability in generating high-quality images, videos, and audio. Due to their adaptiveness in iterative refinement, they provide a strong potential for achieving better non-autoregressive sequence generation. However, existing text diffusion models still fall short in their performance due to a challenge in handling the discreteness of language. This… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: 9 pages, 3 figures

  28. arXiv:2402.14208  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    LLM-Assisted Content Conditional Debiasing for Fair Text Embedding

    Authors: Wenlong Deng, Blair Chen, Beidi Zhao, Chiyu Zhang, Xiaoxiao Li, Christos Thrampoulidis

    Abstract: Mitigating biases in machine learning models has become an increasing concern in Natural Language Processing (NLP), particularly in developing fair text embeddings, which are crucial yet challenging for real-world applications like search engines. In response, this paper proposes a novel method for learning fair text embeddings. First, we define a novel content-conditional equal distance (CCED) fa… ▽ More

    Submitted 24 June, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

  29. arXiv:2402.13874  [pdf, other

    cs.CL

    $Se^2$: Sequential Example Selection for In-Context Learning

    Authors: Haoyu Liu, Jianfeng Liu, Shaohan Huang, Yuefeng Zhan, Hao Sun, Weiwei Deng, Furu Wei, Qi Zhang

    Abstract: The remarkable capability of large language models (LLMs) for in-context learning (ICL) needs to be activated by demonstration examples. Prior work has extensively explored the selection of examples for ICL, predominantly following the "select then organize" paradigm, such approaches often neglect the internal relationships between examples and exist an inconsistency between the training and infer… ▽ More

    Submitted 6 June, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: Accepted by ACL 2024 Findings

  30. arXiv:2402.10797  [pdf, other

    cs.MS cs.LG stat.CO stat.ML

    BlackJAX: Composable Bayesian inference in JAX

    Authors: Alberto Cabezas, Adrien Corenflos, Junpeng Lao, Rémi Louf, Antoine Carnec, Kaustubh Chaudhari, Reuben Cohn-Gordon, Jeremie Coullon, Wei Deng, Sam Duffield, Gerardo Durán-Martín, Marcin Elantkowski, Dan Foreman-Mackey, Michele Gregori, Carlos Iguaran, Ravin Kumar, Martin Lysy, Kevin Murphy, Juan Camilo Orduz, Karm Patel, Xi Wang, Rob Zinkov

    Abstract: BlackJAX is a library implementing sampling and variational inference algorithms commonly used in Bayesian computation. It is designed for ease of use, speed, and modularity by taking a functional approach to the algorithms' implementation. BlackJAX is written in Python, using JAX to compile and run NumpPy-like samplers and variational methods on CPUs, GPUs, and TPUs. The library integrates well w… ▽ More

    Submitted 22 February, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: Companion paper for the library https://github.com/blackjax-devs/blackjax Update: minor changes and updated the list of authors to include technical contributors

  31. arXiv:2402.07417  [pdf, other

    cs.CV cs.LG

    An Empirical Study Into What Matters for Calibrating Vision-Language Models

    Authors: Weijie Tu, Weijian Deng, Dylan Campbell, Stephen Gould, Tom Gedeon

    Abstract: Vision-Language Models (VLMs) have emerged as the dominant approach for zero-shot recognition, adept at handling diverse scenarios and significant distribution changes. However, their deployment in risk-sensitive areas requires a deeper understanding of their uncertainty estimation capabilities, a relatively uncharted area. In this study, we explore the calibration properties of VLMs across differ… ▽ More

    Submitted 14 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: ICML 2024 Camera Ready

  32. arXiv:2402.07410  [pdf, other

    cs.CV cs.LG

    A Closer Look at the Robustness of Contrastive Language-Image Pre-Training (CLIP)

    Authors: Weijie Tu, Weijian Deng, Tom Gedeon

    Abstract: Contrastive Language-Image Pre-training (CLIP) models have demonstrated remarkable generalization capabilities across multiple challenging distribution shifts. However, there is still much to be explored in terms of their robustness to the variations of specific visual factors. In real-world applications, reliable and safe systems must consider other safety objectives beyond classification accurac… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: Accepted by NeurIPS 2023

  33. arXiv:2401.16729  [pdf, other

    cs.LG

    Widely Linear Matched Filter: A Lynchpin towards the Interpretability of Complex-valued CNNs

    Authors: Qingchen Wang, Zhe Li, Zdenka Babic, Wei Deng, Ljubiša Stanković, Danilo P. Mandic

    Abstract: A recent study on the interpretability of real-valued convolutional neural networks (CNNs) {Stankovic_Mandic_2023CNN} has revealed a direct and physically meaningful link with the task of finding features in data through matched filters. However, applying this paradigm to illuminate the interpretability of complex-valued CNNs meets a formidable obstacle: the extension of matched filtering to a gen… ▽ More

    Submitted 31 January, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

  34. arXiv:2401.15897  [pdf, other

    cs.CY cs.HC cs.LG

    Red-Teaming for Generative AI: Silver Bullet or Security Theater?

    Authors: Michael Feffer, Anusha Sinha, Wesley Hanwen Deng, Zachary C. Lipton, Hoda Heidari

    Abstract: In response to rising concerns surrounding the safety, security, and trustworthiness of Generative AI (GenAI) models, practitioners and regulators alike have pointed to AI red-teaming as a key component of their strategies for identifying and mitigating these risks. However, despite AI red-teaming's central role in policy discussions and corporate messaging, significant questions remain about what… ▽ More

    Submitted 15 May, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

  35. arXiv:2401.13154  [pdf, other

    cs.OS

    Nomad: Non-Exclusive Memory Tiering via Transactional Page Migration

    Authors: Lingfeng Xiang, Zhen Lin, Weishu Deng, Hui Lu, Jia Rao, Yifan Yuan, Ren Wang

    Abstract: With the advent of byte-addressable memory devices, such as CXL memory, persistent memory, and storage-class memory, tiered memory systems have become a reality. Page migration is the de facto method within operating systems for managing tiered memory. It aims to bring hot data whenever possible into fast memory to optimize the performance of data accesses while using slow memory to accommodate da… ▽ More

    Submitted 17 June, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

  36. arXiv:2401.12507  [pdf, other

    cs.CV

    Open-Set Facial Expression Recognition

    Authors: Yuhang Zhang, Yue Yao, Xuannan Liu, Lixiong Qin, Wenjing Wang, Weihong Deng

    Abstract: Facial expression recognition (FER) models are typically trained on datasets with a fixed number of seven basic classes. However, recent research works point out that there are far more expressions than the basic ones. Thus, when these models are deployed in the real world, they may encounter unknown classes, such as compound expressions that cannot be classified into existing basic classes. To ad… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: Accepted by AAAI2024

  37. arXiv:2401.11665  [pdf, other

    stat.ML cs.AI cs.LG

    Accelerating Approximate Thompson Sampling with Underdamped Langevin Monte Carlo

    Authors: Haoyang Zheng, Wei Deng, Christian Moya, Guang Lin

    Abstract: Approximate Thompson sampling with Langevin Monte Carlo broadens its reach from Gaussian posterior sampling to encompass more general smooth posteriors. However, it still encounters scalability issues in high-dimensional problems when demanding high accuracy. To address this, we propose an approximate Thompson sampling strategy, utilizing underdamped Langevin Monte Carlo, where the latter is the g… ▽ More

    Submitted 20 June, 2024; v1 submitted 21 January, 2024; originally announced January 2024.

    Comments: 52 pages, 2 figures

  38. arXiv:2401.09064  [pdf, other

    cs.IT eess.SP

    Performance Bounds and Optimization for CSI-Ratio based Bi-static Doppler Sensing in ISAC Systems

    Authors: Yanmo Hu, Kai Wu, J. Andrew Zhang, Weibo Deng, Y. Jay Guo

    Abstract: Bi-static sensing is crucial for exploring the potential of networked sensing capabilities in integrated sensing and communications (ISAC). However, it suffers from the challenging clock asynchronism issue. CSI ratio-based sensing is an effective means to address the issue. Its performance bounds, particular for Doppler sensing, have not been fully understood yet. This work endeavors to fill the r… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: 14 pages, 15 figures, journal paper

  39. arXiv:2401.07284  [pdf, other

    cs.CL

    Improving Domain Adaptation through Extended-Text Reading Comprehension

    Authors: Ting Jiang, Shaohan Huang, Shengyue Luo, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang, Deqing Wang, Fuzhen Zhuang

    Abstract: To enhance the domain-specific capabilities of large language models, continued pre-training on a domain-specific corpus is a prevalent method. Recent work demonstrates that adapting models using reading comprehension data formatted by regex-based patterns can significantly improve performance on domain-specific tasks. However, regex-based patterns are incapable of parsing raw corpora using domain… ▽ More

    Submitted 18 January, 2024; v1 submitted 14 January, 2024; originally announced January 2024.

    Comments: Work in Progress

  40. arXiv:2401.03228  [pdf, other

    stat.ML cs.LG

    Reflected Schrödinger Bridge for Constrained Generative Modeling

    Authors: Wei Deng, Yu Chen, Nicole Tianjiao Yang, Hengrong Du, Qi Feng, Ricky T. Q. Chen

    Abstract: Diffusion models have become the go-to method for large-scale generative models in real-world applications. These applications often involve data distributions confined within bounded domains, typically requiring ad-hoc thresholding techniques for boundary enforcement. Reflected diffusion models (Lou23) aim to enhance generalizability by generating the data distribution through a backward process… ▽ More

    Submitted 6 January, 2024; originally announced January 2024.

  41. arXiv:2401.02150  [pdf, other

    cs.CV

    Marginal Debiased Network for Fair Visual Recognition

    Authors: Mei Wang, Weihong Deng, Sen Su

    Abstract: Deep neural networks (DNNs) are often prone to learn the spurious correlations between target classes and bias attributes, like gender and race, inherent in a major portion of training data (bias-aligned samples), thus showing unfair behavior and arising controversy in the modern pluralistic and egalitarian society. In this paper, we propose a novel marginal debiased network (MDN) to learn debiase… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

  42. arXiv:2401.01575  [pdf, other

    cs.CV

    Enhancing Generalization of Invisible Facial Privacy Cloak via Gradient Accumulation

    Authors: Xuannan Liu, Yaoyao Zhong, Weihong Deng, Hongzhi Shi, Xingchen Cui, Yunfeng Yin, Dongchao Wen

    Abstract: The blooming of social media and face recognition (FR) systems has increased people's concern about privacy and security. A new type of adversarial privacy cloak (class-universal) can be applied to all the images of regular users, to prevent malicious FR systems from acquiring their identity information. In this work, we discover the optimization dilemma in the existing methods -- the local optima… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

  43. arXiv:2401.00921  [pdf, other

    cs.CV

    Skeleton2vec: A Self-supervised Learning Framework with Contextualized Target Representations for Skeleton Sequence

    Authors: Ruizhuo Xu, Linzhi Huang, Mei Wang, Jiani Hu, Weihong Deng

    Abstract: Self-supervised pre-training paradigms have been extensively explored in the field of skeleton-based action recognition. In particular, methods based on masked prediction have pushed the performance of pre-training to a new height. However, these methods take low-level features, such as raw joint coordinates or temporal motion, as prediction targets for the masked regions, which is suboptimal. In… ▽ More

    Submitted 1 January, 2024; originally announced January 2024.

    Comments: Submitted to CVPR 2024

  44. arXiv:2401.00719  [pdf, other

    cs.CV cs.AI

    Depth Map Denoising Network and Lightweight Fusion Network for Enhanced 3D Face Recognition

    Authors: Ruizhuo Xu, Ke Wang, Chao Deng, Mei Wang, Xi Chen, Wenhui Huang, Junlan Feng, Weihong Deng

    Abstract: With the increasing availability of consumer depth sensors, 3D face recognition (FR) has attracted more and more attention. However, the data acquired by these sensors are often coarse and noisy, making them impractical to use directly. In this paper, we introduce an innovative Depth map denoising network (DMDNet) based on the Denoising Implicit Image Function (DIIF) to reduce noise and enhance th… ▽ More

    Submitted 1 January, 2024; originally announced January 2024.

    Comments: Accepted by Pattern Recognition

  45. arXiv:2312.16418  [pdf, other

    cs.LG cs.AI cs.SI

    Refining Latent Homophilic Structures over Heterophilic Graphs for Robust Graph Convolution Networks

    Authors: Chenyang Qiu, Guoshun Nan, Tianyu Xiong, Wendi Deng, Di Wang, Zhiyang Teng, Lijuan Sun, Qimei Cui, Xiaofeng Tao

    Abstract: Graph convolution networks (GCNs) are extensively utilized in various graph tasks to mine knowledge from spatial data. Our study marks the pioneering attempt to quantitatively investigate the GCN robustness over omnipresent heterophilic graphs for node classification. We uncover that the predominant vulnerability is caused by the structural out-of-distribution (OOD) issue. This finding motivates u… ▽ More

    Submitted 27 December, 2023; originally announced December 2023.

    Comments: To be appeared in the proceedings of AAAI-2024

  46. arXiv:2312.14407  [pdf, other

    cs.CV

    AdvCloak: Customized Adversarial Cloak for Privacy Protection

    Authors: Xuannan Liu, Yaoyao Zhong, Xing Cui, Yuhang Zhang, Peipei Li, Weihong Deng

    Abstract: With extensive face images being shared on social media, there has been a notable escalation in privacy concerns. In this paper, we propose AdvCloak, an innovative framework for privacy protection using generative models. AdvCloak is designed to automatically customize class-wise adversarial masks that can maintain superior image-level naturalness while providing enhanced feature-level generalizat… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

  47. arXiv:2312.11898  [pdf, other

    cs.LG eess.SP

    Short-Term Multi-Horizon Line Loss Rate Forecasting of a Distribution Network Using Attention-GCN-LSTM

    Authors: Jie Liu, Yijia Cao, Yong Li, Yixiu Guo, Wei Deng

    Abstract: Accurately predicting line loss rates is vital for effective line loss management in distribution networks, especially over short-term multi-horizons ranging from one hour to one week. In this study, we propose Attention-GCN-LSTM, a novel method that combines Graph Convolutional Networks (GCN), Long Short-Term Memory (LSTM), and a three-level attention mechanism to address this challenge. By captu… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

  48. arXiv:2312.07934  [pdf, other

    eess.IV cs.CV

    Toward Real World Stereo Image Super-Resolution via Hybrid Degradation Model and Discriminator for Implied Stereo Image Information

    Authors: Yuanbo Zhou, Yuyang Xue, Jiang Bi, Wenlin He, Xinlin Zhang, Jiajun Zhang, Wei Deng, Ruofeng Nie, Junlin Lan, Qinquan Gao, Tong Tong

    Abstract: Real-world stereo image super-resolution has a significant influence on enhancing the performance of computer vision systems. Although existing methods for single-image super-resolution can be applied to improve stereo images, these methods often introduce notable modifications to the inherent disparity, resulting in a loss in the consistency of disparity between the original and the enhanced ster… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

  49. arXiv:2312.06075  [pdf, other

    cs.CV

    Oracle Character Recognition using Unsupervised Discriminative Consistency Network

    Authors: Mei Wang, Weihong Deng, Sen Su

    Abstract: Ancient history relies on the study of ancient characters. However, real-world scanned oracle characters are difficult to collect and annotate, posing a major obstacle for oracle character recognition (OrCR). Besides, serious abrasion and inter-class similarity also make OrCR more challenging. In this paper, we propose a novel unsupervised domain adaptation method for OrCR, which enables to transf… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

    Comments: Accepted by Pattern Recognition

  50. arXiv:2312.02434  [pdf, other

    cs.CV

    FINER: Flexible spectral-bias tuning in Implicit NEural Representation by Variable-periodic Activation Functions

    Authors: Zhen Liu, Hao Zhu, Qi Zhang, Jingde Fu, Weibing Deng, Zhan Ma, Yanwen Guo, Xun Cao

    Abstract: Implicit Neural Representation (INR), which utilizes a neural network to map coordinate inputs to corresponding attributes, is causing a revolution in the field of signal processing. However, current INR techniques suffer from a restricted capability to tune their supported frequency set, resulting in imperfect performance when representing complex signals with multiple frequencies. We have identi… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: 10 pages, 9 figures