Skip to main content

Showing 1–50 of 201 results for author: Liao, Q

  1. arXiv:2406.08723  [pdf, other

    cs.CL

    ECBD: Evidence-Centered Benchmark Design for NLP

    Authors: Yu Lu Liu, Su Lin Blodgett, Jackie Chi Kit Cheung, Q. Vera Liao, Alexandra Olteanu, Ziang Xiao

    Abstract: Benchmarking is seen as critical to assessing progress in NLP. However, creating a benchmark involves many design decisions (e.g., which datasets to include, which metrics to use) that often rely on tacit, untested assumptions about what the benchmark is intended to measure or is actually measuring. There is currently no principled way of analyzing these decisions and how they impact the validity… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  2. arXiv:2405.19609  [pdf, other

    cs.CV cs.GR

    SMPLX-Lite: A Realistic and Drivable Avatar Benchmark with Rich Geometry and Texture Annotations

    Authors: Yujiao Jiang, Qingmin Liao, Zhaolong Wang, Xiangru Lin, Zongqing Lu, Yuxi Zhao, Hanqing Wei, Jingrui Ye, Yu Zhang, Zhijing Shao

    Abstract: Recovering photorealistic and drivable full-body avatars is crucial for numerous applications, including virtual reality, 3D games, and tele-presence. Most methods, whether reconstruction or generation, require large numbers of human motion sequences and corresponding textured meshes. To easily learn a drivable avatar, a reasonable parametric body model with unified topology is paramount. However,… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: ICME 2024;Project page: https://alex-jyj.github.io/SMPLX-Lite/

  3. arXiv:2405.17037  [pdf, other

    cs.CV

    BDC-Occ: Binarized Deep Convolution Unit For Binarized Occupancy Network

    Authors: Zongkai Zhang, Zidong Xu, Wenming Yang, Qingmin Liao, Jing-Hao Xue

    Abstract: Existing 3D occupancy networks demand significant hardware resources, hindering the deployment of edge devices. Binarized Neural Networks (BNN) offer substantially reduced computational and memory requirements. However, their performance decreases notably compared to full-precision networks. Moreover, it is challenging to enhance the performance of binarized models by increasing the number of bina… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 19 pages, 8 figures

  4. Modeling User Fatigue for Sequential Recommendation

    Authors: Nian Li, Xin Ban, Cheng Ling, Chen Gao, Lantao Hu, Peng Jiang, Kun Gai, Yong Li, Qingmin Liao

    Abstract: Recommender systems filter out information that meets user interests. However, users may be tired of the recommendations that are too similar to the content they have been exposed to in a short historical period, which is the so-called user fatigue. Despite the significance for a better user experience, user fatigue is seldom explored by existing recommenders. In fact, there are three main challen… ▽ More

    Submitted 22 May, 2024; v1 submitted 19 May, 2024; originally announced May 2024.

    Comments: SIGIR 2024

  5. arXiv:2405.11233  [pdf, other

    cs.SE

    Bridge and Hint: Extending Pre-trained Language Models for Long-Range Code

    Authors: Yujia Chen, Cuiyun Gao, Zezhou Yang, Hongyu Zhang, Qing Liao

    Abstract: In the field of code intelligence, effectively modeling long-range code poses a significant challenge. Existing pre-trained language models (PLMs) such as UniXcoder have achieved remarkable success, but they still face difficulties with long code inputs. This is mainly due to their limited capacity to maintain contextual continuity and memorize the key information over long-range code. To alleviat… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

    Comments: Accepted by ISSTA 2024

  6. "I'm Not Sure, But...": Examining the Impact of Large Language Models' Uncertainty Expression on User Reliance and Trust

    Authors: Sunnie S. Y. Kim, Q. Vera Liao, Mihaela Vorvoreanu, Stephanie Ballard, Jennifer Wortman Vaughan

    Abstract: Widely deployed large language models (LLMs) can produce convincing yet incorrect outputs, potentially misleading users who may rely on them as if they were correct. To reduce such overreliance, there have been calls for LLMs to communicate their uncertainty to end users. However, there has been little empirical work examining how users perceive and act upon LLMs' expressions of uncertainty. We ex… ▽ More

    Submitted 15 May, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

    Comments: Accepted to FAccT 2024. This version includes the appendix

  7. arXiv:2404.08449  [pdf, other

    cs.CV

    OccGaussian: 3D Gaussian Splatting for Occluded Human Rendering

    Authors: Jingrui Ye, Zongkai Zhang, Yujiao Jiang, Qingmin Liao, Wenming Yang, Zongqing Lu

    Abstract: Rendering dynamic 3D human from monocular videos is crucial for various applications such as virtual reality and digital entertainment. Most methods assume the people is in an unobstructed scene, while various objects may cause the occlusion of body parts in real-life scenarios. Previous method utilizing NeRF for surface rendering to recover the occluded areas, but it requiring more than one day t… ▽ More

    Submitted 14 April, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

  8. arXiv:2403.17320  [pdf, other

    cs.RO

    Leveraging Symmetry in RL-based Legged Locomotion Control

    Authors: Zhi Su, Xiaoyu Huang, Daniel Ordoñez-Apraez, Yunfei Li, Zhongyu Li, Qiayuan Liao, Giulio Turrisi, Massimiliano Pontil, Claudio Semini, Yi Wu, Koushil Sreenath

    Abstract: Model-free reinforcement learning is a promising approach for autonomously solving challenging robotics control problems, but faces exploration difficulty without information of the robot's kinematics and dynamics morphology. The under-exploration of multiple modalities with symmetric states leads to behaviors that are often unnatural and sub-optimal. This issue becomes particularly pronounced in… ▽ More

    Submitted 26 March, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

  9. arXiv:2403.11589  [pdf, other

    cs.CV

    UV Gaussians: Joint Learning of Mesh Deformation and Gaussian Textures for Human Avatar Modeling

    Authors: Yujiao Jiang, Qingmin Liao, Xiaoyu Li, Li Ma, Qi Zhang, Chaopeng Zhang, Zongqing Lu, Ying Shan

    Abstract: Reconstructing photo-realistic drivable human avatars from multi-view image sequences has been a popular and challenging topic in the field of computer vision and graphics. While existing NeRF-based methods can achieve high-quality novel view rendering of human models, both training and inference processes are time-consuming. Recent approaches have utilized 3D Gaussians to represent the human body… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  10. arXiv:2403.09284  [pdf, other

    cs.LG cs.DC

    DA-PFL: Dynamic Affinity Aggregation for Personalized Federated Learning

    Authors: Xu Yang, Jiyuan Feng, Songyue Guo, Ye Wang, Ye Ding, Binxing Fang, Qing Liao

    Abstract: Personalized federated learning becomes a hot research topic that can learn a personalized learning model for each client. Existing personalized federated learning models prefer to aggregate similar clients with similar data distribution to improve the performance of learning models. However, similaritybased personalized federated learning methods may exacerbate the class imbalanced problem. In th… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  11. arXiv:2403.02630  [pdf, other

    cs.LG cs.IR cs.SI

    FedHCDR: Federated Cross-Domain Recommendation with Hypergraph Signal Decoupling

    Authors: Hongyu Zhang, Dongyi Zheng, Lin Zhong, Xu Yang, Jiyuan Feng, Yunqing Feng, Qing Liao

    Abstract: In recent years, Cross-Domain Recommendation (CDR) has drawn significant attention, which utilizes user data from multiple domains to enhance the recommendation performance. However, current CDR methods require sharing user data across domains, thereby violating the General Data Protection Regulation (GDPR). Consequently, numerous approaches have been proposed for Federated Cross-Domain Recommenda… ▽ More

    Submitted 10 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: 16 pages, 5 figures

  12. arXiv:2403.01428  [pdf, other

    cs.RO eess.SP

    Localization matters too: How localization error affects UAV flight

    Authors: Suquan Zhang, Yuanfan Xu, Shu'ang Yu, Qingmin Liao, Jincheng Yu, Yu Wang

    Abstract: The maximum safe flight speed of a Unmanned Aerial Vehicle (UAV) is an important indicator for measuring its efficiency in completing various tasks. This indicator is influenced by numerous parameters such as UAV localization error, perception range, and system latency. However, in terms of localization errors, although there have been many studies dedicated to improving the localization capabilit… ▽ More

    Submitted 7 March, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

    Comments: 8 pages,8 figures

  13. arXiv:2402.05880  [pdf, other

    cs.CL cs.AI cs.HC

    Generative Echo Chamber? Effects of LLM-Powered Search Systems on Diverse Information Seeking

    Authors: Nikhil Sharma, Q. Vera Liao, Ziang Xiao

    Abstract: Large language models (LLMs) powered conversational search systems have already been used by hundreds of millions of people, and are believed to bring many benefits over conventional search. However, while decades of research and public discourse interrogated the risk of search systems in increasing selective exposure and creating echo chambers -- limiting exposure to diverse opinions and leading… ▽ More

    Submitted 10 February, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: Accepted in CHI'24. Supplementary material will be available online with the official submission in CHI 2024

  14. arXiv:2402.02060  [pdf, other

    cs.CV

    DiffVein: A Unified Diffusion Network for Finger Vein Segmentation and Authentication

    Authors: Yanjun Liu, Wenming Yang, Qingmin Liao

    Abstract: Finger vein authentication, recognized for its high security and specificity, has become a focal point in biometric research. Traditional methods predominantly concentrate on vein feature extraction for discriminative modeling, with a limited exploration of generative approaches. Suffering from verification failure, existing methods often fail to obtain authentic vein patterns by segmentation. To… ▽ More

    Submitted 3 February, 2024; originally announced February 2024.

  15. arXiv:2401.15843  [pdf, other

    cs.SE

    APIGen: Generative API Method Recommendation

    Authors: Yujia Chen, Cuiyun Gao, Muyijie Zhu, Qing Liao, Yong Wang, Guoai Xu

    Abstract: Automatic API method recommendation is an essential task of code intelligence, which aims to suggest suitable APIs for programming queries. Existing approaches can be categorized into two primary groups: retrieval-based and learning-based approaches. Although these approaches have achieved remarkable success, they still come with notable limitations. The retrieval-based approaches rely on the text… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

    Comments: To appear in the proceedings of the 31st IEEE International Conference on Software Analysis, Evolution, and Reengineering (SANER 2024)

  16. arXiv:2401.13169  [pdf, other

    cs.CR cs.SE

    ReposVul: A Repository-Level High-Quality Vulnerability Dataset

    Authors: Xinchen Wang, Ruida Hu, Cuiyun Gao, Xin-Cheng Wen, Yujia Chen, Qing Liao

    Abstract: Open-Source Software (OSS) vulnerabilities bring great challenges to the software security and pose potential risks to our society. Enormous efforts have been devoted into automated vulnerability detection, among which deep learning (DL)-based approaches have proven to be the most effective. However, the current labeled data present the following limitations: (1) Tangled Patches: Developers may su… ▽ More

    Submitted 8 February, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Comments: Accepted by ICSE 2024 Industry Challenge Track

  17. arXiv:2401.11731  [pdf, other

    cs.NI cs.AI cs.LG

    Fast and Scalable Network Slicing by Integrating Deep Learning with Lagrangian Methods

    Authors: Tianlun Hu, Qi Liao, Qiang Liu, Antonio Massaro, Georg Carle

    Abstract: Network slicing is a key technique in 5G and beyond for efficiently supporting diverse services. Many network slicing solutions rely on deep learning to manage complex and high-dimensional resource allocation problems. However, deep learning models suffer limited generalization and adaptability to dynamic slicing configurations. In this paper, we propose a novel framework that integrates constrain… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: 6 pages, 5 figures, IEEE Global Communications Conference 2023

  18. arXiv:2401.09051  [pdf, other

    cs.HC

    Canvil: Designerly Adaptation for LLM-Powered User Experiences

    Authors: K. J. Kevin Feng, Q. Vera Liao, Ziang Xiao, Jennifer Wortman Vaughan, Amy X. Zhang, David W. McDonald

    Abstract: Advancements in large language models (LLMs) are poised to spark a proliferation of LLM-powered user experiences. In product teams, designers are often tasked with crafting user experiences that align with user needs. To involve designers and leverage their user-centered perspectives to create effective and responsible LLM-powered products, we introduce the practice of designerly adaptation for en… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

  19. arXiv:2401.08360  [pdf, other

    cs.IT

    AdaSem: Adaptive Goal-Oriented Semantic Communications for End-to-End Camera Relocalization

    Authors: Qi Liao, Tze-Yang Tung

    Abstract: Recently, deep autoencoders have gained traction as a powerful method for implementing goal-oriented semantic communications systems. The idea is to train a mapping from the source domain directly to channel symbols, and vice versa. However, prior studies often focused on rate-distortion tradeoff and transmission delay, at the cost of increasing end-to-end complexity and thus latency. Moreover, th… ▽ More

    Submitted 24 May, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: IEEE INFOCOM 2024

  20. arXiv:2401.08131  [pdf, other

    cs.SE cs.CR

    Game Rewards Vulnerabilities: Software Vulnerability Detection with Zero-Sum Game and Prototype Learning

    Authors: Xin-Cheng Wen, Cuiyun Gao, Xinchen Wang, Ruiqi Wang, Tao Zhang, Qing Liao

    Abstract: Recent years have witnessed a growing focus on automated software vulnerability detection. Notably, deep learning (DL)-based methods, which employ source code for the implicit acquisition of vulnerability patterns, have demonstrated superior performance compared to other approaches. However, the DL-based approaches are still hard to capture the vulnerability-related information from the whole code… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: 17 pages, 8 figures

  21. arXiv:2401.08083  [pdf, other

    cs.CV

    UV-SAM: Adapting Segment Anything Model for Urban Village Identification

    Authors: Xin Zhang, Yu Liu, Yuming Lin, Qingmin Liao, Yong Li

    Abstract: Urban villages, defined as informal residential areas in or around urban centers, are characterized by inadequate infrastructures and poor living conditions, closely related to the Sustainable Development Goals (SDGs) on poverty, adequate housing, and sustainable cities. Traditionally, governments heavily depend on field survey methods to monitor the urban villages, which however are time-consumin… ▽ More

    Submitted 1 February, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

    Comments: Accepted by AAAI 2024

  22. arXiv:2312.16805  [pdf, other

    cs.CV cs.AI

    DarkShot: Lighting Dark Images with Low-Compute and High-Quality

    Authors: Jiazhang Zheng, Lei Li, Qiuping Liao, Cheng Li, Li Li, Yangxing Liu

    Abstract: Nighttime photography encounters escalating challenges in extremely low-light conditions, primarily attributable to the ultra-low signal-to-noise ratio. For real-world deployment, a practical solution must not only produce visually appealing results but also require minimal computation. However, most existing methods are either focused on improving restoration performance or employ lightweight mod… ▽ More

    Submitted 9 January, 2024; v1 submitted 27 December, 2023; originally announced December 2023.

    Comments: Accepted by IEEE ICASSP 2024

  23. arXiv:2312.15224  [pdf, other

    cs.AI cs.HC

    LLM-Powered Hierarchical Language Agent for Real-time Human-AI Coordination

    Authors: Jijia Liu, Chao Yu, Jiaxuan Gao, Yuqing Xie, Qingmin Liao, Yi Wu, Yu Wang

    Abstract: AI agents powered by Large Language Models (LLMs) have made significant advances, enabling them to assist humans in diverse complex tasks and leading to a revolution in human-AI coordination. LLM-powered agents typically require invoking LLM APIs and employing artificially designed complex prompts, which results in high inference latency. While this paradigm works well in scenarios with minimal in… ▽ More

    Submitted 9 January, 2024; v1 submitted 23 December, 2023; originally announced December 2023.

    Comments: This paper is accpeted by AAMAS 2024. More demonstrations can be seen on our website https://sites.google.com/view/overcooked-hla/

  24. arXiv:2312.13701  [pdf, ps, other

    cs.IT

    Infinite families $2$-designs from binary projective three-weight codes

    Authors: Canze Zhu, Qunying Liao, Haibo Liu

    Abstract: Combinatorial designs are closely related to linear codes. In recent year, there are a lot of $t$-designs constructed from certain linear codes. In this paper, we aim to construct $2$-designs from binary three-weight codes. For any binary three-weight code $\mathcal{C}$ with length $n$, let $A_{n}(\mathcal{C})$ be the number of codewords in $\mathcal{C}$ with Hamming weight $n$, then we show that… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

  25. arXiv:2312.01536  [pdf, other

    cs.CV

    CalliPaint: Chinese Calligraphy Inpainting with Diffusion Model

    Authors: Qisheng Liao, Zhinuo Wang, Muhammad Abdul-Mageed, Gus Xia

    Abstract: Chinese calligraphy can be viewed as a unique form of visual art. Recent advancements in computer vision hold significant potential for the future development of generative models in the realm of Chinese calligraphy. Nevertheless, methods of Chinese calligraphy inpainting, which can be effectively used in the art and education fields, remain relatively unexplored. In this paper, we introduce a new… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

    Comments: Accepted as a Machine Learning for Creativity and Design(ML4CD) workshop paper at NeruaIPS 2023. https://neurips.cc/virtual/2023/workshop/66545#wse-detail-75063

  26. arXiv:2311.09919  [pdf, other

    cs.CV cs.AI

    DSR-Diff: Depth Map Super-Resolution with Diffusion Model

    Authors: Yuan Shi, Bin Xia, Rui Zhu, Qingmin Liao, Wenming Yang

    Abstract: Color-guided depth map super-resolution (CDSR) improve the spatial resolution of a low-quality depth map with the corresponding high-quality color map, benefiting various applications such as 3D reconstruction, virtual reality, and augmented reality. While conventional CDSR methods typically rely on convolutional neural networks or transformers, diffusion models (DMs) have demonstrated notable eff… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

  27. arXiv:2311.09696  [pdf, other

    cs.CL

    Fumbling in Babel: An Investigation into ChatGPT's Language Identification Ability

    Authors: Wei-Rui Chen, Ife Adebara, Khai Duy Doan, Qisheng Liao, Muhammad Abdul-Mageed

    Abstract: ChatGPT has recently emerged as a powerful NLP tool that can carry out a variety of tasks. However, the range of languages ChatGPT can handle remains largely a mystery. To uncover which languages ChatGPT `knows', we investigate its language identification (LID) abilities. For this purpose, we compile Babel-670, a benchmark comprising 670 languages representing 24 language families spoken in five c… ▽ More

    Submitted 8 April, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: Accepted to NAACL 2024 Findings

  28. arXiv:2310.14557  [pdf, other

    cs.CL

    The Skipped Beat: A Study of Sociopragmatic Understanding in LLMs for 64 Languages

    Authors: Chiyu Zhang, Khai Duy Doan, Qisheng Liao, Muhammad Abdul-Mageed

    Abstract: Instruction tuned large language models (LLMs), such as ChatGPT, demonstrate remarkable performance in a wide range of tasks. Despite numerous recent studies that examine the performance of instruction-tuned LLMs on various NLP benchmarks, there remains a lack of comprehensive investigation into their ability to understand cross-lingual sociopragmatic meaning (SM), i.e., meaning embedded within so… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: Accepted by EMNLP 2023 Main conference

  29. arXiv:2310.10436  [pdf, other

    cs.AI

    EconAgent: Large Language Model-Empowered Agents for Simulating Macroeconomic Activities

    Authors: Nian Li, Chen Gao, Mingyu Li, Yong Li, Qingmin Liao

    Abstract: The advent of artificial intelligence has led to a growing emphasis on data-driven modeling in macroeconomics, with agent-based modeling (ABM) emerging as a prominent bottom-up simulation paradigm. In ABM, agents (e.g., households, firms) interact within a macroeconomic environment, collectively generating market dynamics. Existing agent modeling typically employs predetermined rules or learning-b… ▽ More

    Submitted 23 May, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: ACL 2024 (main conference)

  30. arXiv:2309.09496  [pdf, other

    cs.CV cs.AI

    CLIP-based Synergistic Knowledge Transfer for Text-based Person Retrieval

    Authors: Yating Liu, Yaowei Li, Zimo Liu, Wenming Yang, Yaowei Wang, Qingmin Liao

    Abstract: Text-based Person Retrieval (TPR) aims to retrieve the target person images given a textual query. The primary challenge lies in bridging the substantial gap between vision and language modalities, especially when dealing with limited large-scale datasets. In this paper, we introduce a CLIP-based Synergistic Knowledge Transfer (CSKT) approach for TPR. Specifically, to explore the CLIP's knowledge… ▽ More

    Submitted 2 January, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: ICASSP2024(accepted). minor typos revision compared to version 1 in arxiv

  31. FedDCSR: Federated Cross-domain Sequential Recommendation via Disentangled Representation Learning

    Authors: Hongyu Zhang, Dongyi Zheng, Xu Yang, Jiyuan Feng, Qing Liao

    Abstract: Cross-domain Sequential Recommendation (CSR) which leverages user sequence data from multiple domains has received extensive attention in recent years. However, the existing CSR methods require sharing origin user data across domains, which violates the General Data Protection Regulation (GDPR). Thus, it is necessary to combine federated learning (FL) and CSR to fully utilize knowledge from differ… ▽ More

    Submitted 16 January, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

  32. arXiv:2309.06169  [pdf, other

    cs.LG cs.CV

    Elucidating the solution space of extended reverse-time SDE for diffusion models

    Authors: Qinpeng Cui, Xinyi Zhang, Zongqing Lu, Qingmin Liao

    Abstract: Diffusion models (DMs) demonstrate potent image generation capabilities in various generative modeling tasks. Nevertheless, their primary limitation lies in slow sampling speed, requiring hundreds or thousands of sequential function evaluations through large neural networks to generate high-quality images. Sampling from DMs can be seen alternatively as solving corresponding stochastic differential… ▽ More

    Submitted 26 September, 2023; v1 submitted 12 September, 2023; originally announced September 2023.

  33. arXiv:2308.03283  [pdf, other

    quant-ph cs.LG

    High-rate discretely-modulated continuous-variable quantum key distribution using quantum machine learning

    Authors: Qin Liao, Jieyu Liu, Anqi Huang, Lei Huang, Zhuoying Fei, Xiquan Fu

    Abstract: We propose a high-rate scheme for discretely-modulated continuous-variable quantum key distribution (DM CVQKD) using quantum machine learning technologies, which divides the whole CVQKD system into three parts, i.e., the initialization part that is used for training and estimating quantum classifier, the prediction part that is used for generating highly correlated raw keys, and the data-postproce… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: 18 pages, 17 figures

  34. arXiv:2307.05276  [pdf, other

    cs.CV

    Unbiased Scene Graph Generation via Two-stage Causal Modeling

    Authors: Shuzhou Sun, Shuaifeng Zhi, Qing Liao, Janne Heikkilä, Li Liu

    Abstract: Despite the impressive performance of recent unbiased Scene Graph Generation (SGG) methods, the current debiasing literature mainly focuses on the long-tailed distribution problem, whereas it overlooks another source of bias, i.e., semantic confusion, which makes the SGG model prone to yield false predictions for similar relationships. In this paper, we explore a debiasing procedure for the SGG ta… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

    Comments: 17 pages, 9 figures. Accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence

  35. arXiv:2307.03476  [pdf, other

    cs.LG cs.CV

    Unpaired Multi-View Graph Clustering with Cross-View Structure Matching

    Authors: Yi Wen, Siwei Wang, Qing Liao, Weixuan Liang, Ke Liang, Xinhang Wan, Xinwang Liu

    Abstract: Multi-view clustering (MVC), which effectively fuses information from multiple views for better performance, has received increasing attention. Most existing MVC methods assume that multi-view data are fully paired, which means that the mappings of all corresponding samples between views are pre-defined or given in advance. However, the data correspondence is often incomplete in real-world applica… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

    Comments: 15 pages

  36. Inter-Cell Network Slicing With Transfer Learning Empowered Multi-Agent Deep Reinforcement Learning

    Authors: Tianlun Hu, Qi Liao, Qiang Liu, Georg Carle

    Abstract: Network slicing enables operators to efficiently support diverse applications on a common physical infrastructure. The ever-increasing densification of network deployment leads to complex and non-trivial inter-cell interference, which requires more than inaccurate analytic models to dynamically optimize resource management for network slices. In this paper, we develop a DIRP algorithm with multipl… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: 14 pages, 14 figures, IEEE Open Journal of the Communications Society

    Journal ref: Volume 4, 2023, Pages 1141 - 1155

  37. arXiv:2306.06935  [pdf, other

    cs.SE cs.AI

    LIVABLE: Exploring Long-Tailed Classification of Software Vulnerability Types

    Authors: Xin-Cheng Wen, Cuiyun Gao, Feng Luo, Haoyu Wang, Ge Li, Qing Liao

    Abstract: Prior studies generally focus on software vulnerability detection and have demonstrated the effectiveness of Graph Neural Network (GNN)-based approaches for the task. Considering the various types of software vulnerabilities and the associated different degrees of severity, it is also beneficial to determine the type of each vulnerable code for developers. In this paper, we observe that the distri… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

  38. arXiv:2306.03100  [pdf, other

    cs.HC cs.AI

    Rethinking Model Evaluation as Narrowing the Socio-Technical Gap

    Authors: Q. Vera Liao, Ziang Xiao

    Abstract: The recent development of generative and large language models (LLMs) poses new challenges for model evaluation that the research community and industry are grappling with. While the versatile capabilities of these models ignite excitement, they also inevitably make a leap toward homogenization: powering a wide range of applications with a single, often referred to as ``general-purpose'', model. I… ▽ More

    Submitted 28 June, 2023; v1 submitted 31 May, 2023; originally announced June 2023.

  39. arXiv:2306.01941  [pdf, other

    cs.HC cs.AI cs.CY

    AI Transparency in the Age of LLMs: A Human-Centered Research Roadmap

    Authors: Q. Vera Liao, Jennifer Wortman Vaughan

    Abstract: The rise of powerful large language models (LLMs) brings about tremendous opportunities for innovation but also looming risks for individuals and society at large. We have reached a pivotal moment for ensuring that LLMs and LLM-infused applications are developed and deployed responsibly. However, a central pillar of responsible AI -- transparency -- is largely missing from the current discourse ar… ▽ More

    Submitted 7 August, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

  40. arXiv:2305.19124  [pdf, other

    cs.CV

    Calliffusion: Chinese Calligraphy Generation and Style Transfer with Diffusion Modeling

    Authors: Qisheng Liao, Gus Xia, Zhinuo Wang

    Abstract: In this paper, we propose Calliffusion, a system for generating high-quality Chinese calligraphy using diffusion models. Our model architecture is based on DDPM (Denoising Diffusion Probabilistic Models), and it is capable of generating common characters in five different scripts and mimicking the styles of famous calligraphers. Experiments demonstrate that our model can generate calligraphy that… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: 5pages, International Conference on Computational Creativity, ICCC

  41. arXiv:2305.14889  [pdf, other

    cs.CL cs.AI

    Evaluating Evaluation Metrics: A Framework for Analyzing NLG Evaluation Metrics using Measurement Theory

    Authors: Ziang Xiao, Susu Zhang, Vivian Lai, Q. Vera Liao

    Abstract: We address a fundamental challenge in Natural Language Generation (NLG) model evaluation -- the design and evaluation of evaluation metrics. Recognizing the limitations of existing automatic metrics and noises from how current human evaluation was conducted, we propose MetricEval, a framework informed by measurement theory, the foundation of educational test design, for conceptualizing and evaluat… ▽ More

    Submitted 22 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023

  42. arXiv:2304.14613  [pdf, other

    cs.AI cs.CR

    Deep Intellectual Property Protection: A Survey

    Authors: Yuchen Sun, Tianpeng Liu, Panhe Hu, Qing Liao, Shaojing Fu, Nenghai Yu, Deke Guo, Yongxiang Liu, Li Liu

    Abstract: Deep Neural Networks (DNNs), from AlexNet to ResNet to ChatGPT, have made revolutionary progress in recent years, and are widely used in various fields. The high performance of DNNs requires a huge amount of high-quality data, expensive computing hardware, and excellent DNN architectures that are costly to obtain. Therefore, trained DNNs are becoming valuable assets and must be considered the Inte… ▽ More

    Submitted 17 June, 2023; v1 submitted 27 April, 2023; originally announced April 2023.

    Comments: 37 pages, 19 figures

  43. arXiv:2304.14339  [pdf, other

    cs.CL cs.AI cs.LG cs.NE

    MarsEclipse at SemEval-2023 Task 3: Multi-Lingual and Multi-Label Framing Detection with Contrastive Learning

    Authors: Qisheng Liao, Meiting Lai, Preslav Nakov

    Abstract: This paper describes our system for SemEval-2023 Task 3 Subtask 2 on Framing Detection. We used a multi-label contrastive loss for fine-tuning large pre-trained language models in a multi-lingual setting, achieving very competitive results: our system was ranked first on the official test set and on the official shared task leaderboard for five of the six languages for which we had training data a… ▽ More

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: framing, contrastive learning, SemEval-2023 task 3

    MSC Class: 68T50 ACM Class: F.2.2; I.2.7

    Journal ref: SemEval-2023

  44. arXiv:2304.10548  [pdf, other

    cs.CL cs.AI cs.HC

    Supporting Qualitative Analysis with Large Language Models: Combining Codebook with GPT-3 for Deductive Coding

    Authors: Ziang Xiao, Xingdi Yuan, Q. Vera Liao, Rania Abdelghani, Pierre-Yves Oudeyer

    Abstract: Qualitative analysis of textual contents unpacks rich and valuable information by assigning labels to the data. However, this process is often labor-intensive, particularly when working with large datasets. While recent AI-based tools demonstrate utility, researchers may not have readily available AI resources and expertise, let alone be challenged by the limited generalizability of those task-spe… ▽ More

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: 28th International Conference on Intelligent User Interfaces (IUI '23 Companion), March 27--31, 2023, Sydney, NSW, Australia

  45. arXiv:2304.08366  [pdf, other

    cs.HC cs.AI

    Why is AI not a Panacea for Data Workers? An Interview Study on Human-AI Collaboration in Data Storytelling

    Authors: Haotian Li, Yun Wang, Q. Vera Liao, Huamin Qu

    Abstract: Data storytelling plays an important role in data workers' daily jobs since it boosts team collaboration and public communication. However, to make an appealing data story, data workers spend tremendous efforts on various tasks, including outlining and styling the story. Recently, a growing research trend has been exploring how to assist data storytelling with advanced artificial intelligence (AI)… ▽ More

    Submitted 17 April, 2023; originally announced April 2023.

  46. arXiv:2304.08101  [pdf, other

    cs.CV

    LLA-FLOW: A Lightweight Local Aggregation on Cost Volume for Optical Flow Estimation

    Authors: Jiawei Xu, Zongqing Lu, Qingmin Liao

    Abstract: Lack of texture often causes ambiguity in matching, and handling this issue is an important challenge in optical flow estimation. Some methods insert stacked transformer modules that allow the network to use global information of cost volume for estimation. But the global information aggregation often incurs serious memory and time costs during training and inference, which hinders model deploymen… ▽ More

    Submitted 18 July, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

  47. arXiv:2304.01507  [pdf, other

    cs.LG cs.AI

    RARE: Robust Masked Graph Autoencoder

    Authors: Wenxuan Tu, Qing Liao, Sihang Zhou, Xin Peng, Chuan Ma, Zhe Liu, Xinwang Liu, Zhiping Cai

    Abstract: Masked graph autoencoder (MGAE) has emerged as a promising self-supervised graph pre-training (SGP) paradigm due to its simplicity and effectiveness. However, existing efforts perform the mask-then-reconstruct operation in the raw data space as is done in computer vision (CV) and natural language processing (NLP) areas, while neglecting the important non-Euclidean property of graph data. As a resu… ▽ More

    Submitted 6 April, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

  48. arXiv:2303.10126  [pdf, other

    cs.CV

    IRGen: Generative Modeling for Image Retrieval

    Authors: Yidan Zhang, Ting Zhang, Dong Chen, Yujing Wang, Qi Chen, Xing Xie, Hao Sun, Weiwei Deng, Qi Zhang, Fan Yang, Mao Yang, Qingmin Liao, Baining Guo

    Abstract: While generative modeling has been ubiquitous in natural language processing and computer vision, its application to image retrieval remains unexplored. In this paper, we recast image retrieval as a form of generative modeling by employing a sequence-to-sequence model, contributing to the current unified theme. Our framework, IRGen, is a unified model that enables end-to-end differentiable search,… ▽ More

    Submitted 28 June, 2023; v1 submitted 17 March, 2023; originally announced March 2023.

  49. arXiv:2303.06794  [pdf, other

    cs.HC

    Sensing Wellbeing in the Workplace, Why and For Whom? Envisioning Impacts with Organizational Stakeholders

    Authors: Anna Kawakami, Shreya Chowdhary, Shamsi T. Iqbal, Q. Vera Liao, Alexandra Olteanu, Jina Suh, Koustuv Saha

    Abstract: With the heightened digitization of the workplace, alongside the rise of remote and hybrid work prompted by the pandemic, there is growing corporate interest in using passive sensing technologies for workplace wellbeing. Existing research on these technologies often focus on understanding or improving interactions between an individual user and the technology. Workplace settings can, however, intr… ▽ More

    Submitted 6 June, 2023; v1 submitted 12 March, 2023; originally announced March 2023.

  50. arXiv:2303.00573  [pdf, other

    stat.ML cs.LG

    Dimension-reduced KRnet maps for high-dimensional Bayesian inverse problems

    Authors: Yani Feng, Kejun Tang, Xiaoliang Wan, Qifeng Liao

    Abstract: We present a dimension-reduced KRnet map approach (DR-KRnet) for high-dimensional Bayesian inverse problems, which is based on an explicit construction of a map that pushes forward the prior measure to the posterior measure in the latent space. Our approach consists of two main components: data-driven VAE prior and density approximation of the posterior of the latent variable. In reality, it may n… ▽ More

    Submitted 8 March, 2023; v1 submitted 1 March, 2023; originally announced March 2023.