Skip to main content

Showing 1–50 of 135 results for author: Xing, Z

  1. arXiv:2407.01494  [pdf, other

    cs.CV cs.SD eess.AS

    FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds

    Authors: Yiming Zhang, Yicheng Gu, Yanhong Zeng, Zhening Xing, Yuancheng Wang, Zhizheng Wu, Kai Chen

    Abstract: We study Neural Foley, the automatic generation of high-quality sound effects synchronizing with videos, enabling an immersive audio-visual experience. Despite its wide range of applications, existing approaches encounter limitations when it comes to simultaneously synthesizing high-quality and video-aligned (i.e.,, semantic relevant and temporal synchronized) sounds. To overcome these limitations… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Project page: https://foleycrafter.github.io/

  2. arXiv:2406.17431  [pdf, other

    cs.SE

    A Large-scale Investigation of Semantically Incompatible APIs behind Compatibility Issues in Android Apps

    Authors: Shidong Pan, Tianchen Guo, Lihong Zhang, Pei Liu, Zhenchang Xing, Xiaoyu Sun

    Abstract: Application Programming Interface (API) incompatibility is a long-standing issue in Android application development. The rapid evolution of Android APIs results in a significant number of API additions, removals, and changes between adjacent versions. Unfortunately, this high frequency of alterations may lead to compatibility issues, often without adequate notification to developers regarding thes… ▽ More

    Submitted 26 June, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

  3. arXiv:2406.09397  [pdf, other

    cs.CV cs.AI

    Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms

    Authors: Miaosen Zhang, Yixuan Wei, Zhen Xing, Yifei Ma, Zuxuan Wu, Ji Li, Zheng Zhang, Qi Dai, Chong Luo, Xin Geng, Baining Guo

    Abstract: Modern vision models are trained on very large noisy datasets. While these models acquire strong capabilities, they may not follow the user's intent to output the desired results in certain aspects, e.g., visual aesthetic, preferred style, and responsibility. In this paper, we target the realm of visual aesthetics and aim to align vision models with human aesthetic standards in a retrieval system.… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 28 pages, 26 figures, under review

  4. arXiv:2406.07411  [pdf, other

    cs.SE cs.CL

    VersiCode: Towards Version-controllable Code Generation

    Authors: Tongtong Wu, Weigang Wu, Xingyu Wang, Kang Xu, Suyu Ma, Bo Jiang, Ping Yang, Zhenchang Xing, Yuan-Fang Li, Gholamreza Haffari

    Abstract: Significant research has focused on improving the performance of large language model on code-related tasks due to their practical importance. Although performance is typically evaluated using public benchmark datasets, the existing datasets do not account for the concept of \emph{version}, which is crucial in professional software development. In this paper, we introduce VersiCode, the first comp… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  5. arXiv:2406.06465  [pdf, other

    cs.CV cs.AI cs.CL cs.LG cs.MM

    AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction

    Authors: Zhen Xing, Qi Dai, Zejia Weng, Zuxuan Wu, Yu-Gang Jiang

    Abstract: Text-guided video prediction (TVP) involves predicting the motion of future frames from the initial frame according to an instruction, which has wide applications in virtual reality, robotics, and content creation. Previous TVP methods make significant breakthroughs by adapting Stable Diffusion for this task. However, they struggle with frame consistency and temporal stability primarily due to the… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  6. Refactoring to Pythonic Idioms: A Hybrid Knowledge-Driven Approach Leveraging Large Language Models

    Authors: Zejun Zhang, Zhenchang Xing, Xiaoxue Ren, Qinghua Lu, Xiwei Xu

    Abstract: Pythonic idioms are highly valued and widely used in the Python programming community. However, many Python users find it challenging to use Pythonic idioms. Adopting a rule-based approach or LLM-only approach is not sufficient to overcome three persistent challenges of code idiomatization including code miss, wrong detection and wrong refactoring. Motivated by the determinism of rules and adaptab… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Accepted by FSE 2024,22 pages

  7. arXiv:2406.01587  [pdf, other

    cs.RO

    PlanAgent: A Multi-modal Large Language Agent for Closed-loop Vehicle Motion Planning

    Authors: Yupeng Zheng, Zebin Xing, Qichao Zhang, Bu Jin, Pengfei Li, Yuhang Zheng, Zhongpu Xia, Kun Zhan, Xianpeng Lang, Yaran Chen, Dongbin Zhao

    Abstract: Vehicle motion planning is an essential component of autonomous driving technology. Current rule-based vehicle motion planning methods perform satisfactorily in common scenarios but struggle to generalize to long-tailed situations. Meanwhile, learning-based methods have yet to achieve superior performance over rule-based approaches in large-scale closed-loop scenarios. To address these issues, we… ▽ More

    Submitted 4 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: This work has been submitted to the IEEE for possible publication

  8. arXiv:2406.01080  [pdf, other

    cs.CR cs.DC cs.LG

    No Vandalism: Privacy-Preserving and Byzantine-Robust Federated Learning

    Authors: Zhibo Xing, Zijian Zhang, Zi'ang Zhang, Jiamou Liu, Liehuang Zhu, Giovanni Russello

    Abstract: Federated learning allows several clients to train one machine learning model jointly without sharing private data, providing privacy protection. However, traditional federated learning is vulnerable to poisoning attacks, which can not only decrease the model performance, but also implant malicious backdoors. In addition, direct submission of local model parameters can also lead to the privacy lea… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  9. arXiv:2405.18731  [pdf, other

    eess.SP cs.AI physics.comp-ph

    VBIM-Net: Variational Born Iterative Network for Inverse Scattering Problems

    Authors: Ziqing Xing, Zhaoyang Zhang, Zirui Chen, Yusong Wang, Haoran Ma, Zhun Wei, Gang Bao

    Abstract: Recently, studies have shown the potential of integrating field-type iterative methods with deep learning (DL) techniques in solving inverse scattering problems (ISPs). In this article, we propose a novel Variational Born Iterative Network, namely, VBIM-Net, to solve the full-wave ISPs with significantly improved flexibility and inversion quality. The proposed VBIM-Net emulates the alternating upd… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 14 pages, 21 figures

  10. arXiv:2405.07430  [pdf, other

    cs.SE cs.CR

    Don't Chase Your Tail! Missing Key Aspects Augmentation in Textual Vulnerability Descriptions of Long-tail Software through Feature Inference

    Authors: Linyi Han, Shidong Pan, Zhenchang Xing, Jiamou Sun, Sofonias Yitagesu, Xiaowang Zhang, Zhiyong Feng

    Abstract: Augmenting missing key aspects in Textual Vulnerability Descriptions (TVDs) for software with a large user base (referred to as non-long-tail software) has greatly advanced vulnerability analysis and software security research. However, these methods often overlook software instances that have a limited user base (referred to as long-tail software) due to limited TVDs, variations in software featu… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

  11. arXiv:2404.05388  [pdf, other

    cs.SE cs.AI cs.CY cs.LG

    An AI System Evaluation Framework for Advancing AI Safety: Terminology, Taxonomy, Lifecycle Mapping

    Authors: Boming Xia, Qinghua Lu, Liming Zhu, Zhenchang Xing

    Abstract: The advent of advanced AI underscores the urgent need for comprehensive safety evaluations, necessitating collaboration across communities (i.e., AI, software engineering, and governance). However, divergent practices and terminologies across these communities, combined with the complexity of AI systems-of which models are only a part-and environmental affordances (e.g., access to tools), obstruct… ▽ More

    Submitted 15 May, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

    Comments: 1st ACM International Conference on AI-powered Software (AIware)

  12. arXiv:2403.10242  [pdf, other

    cs.CV

    FDGaussian: Fast Gaussian Splatting from Single Image via Geometric-aware Diffusion Model

    Authors: Qijun Feng, Zhen Xing, Zuxuan Wu, Yu-Gang Jiang

    Abstract: Reconstructing detailed 3D objects from single-view images remains a challenging task due to the limited information available. In this paper, we introduce FDGaussian, a novel two-stage framework for single-image 3D reconstruction. Recent methods typically utilize pre-trained 2D diffusion models to generate plausible novel views from the input image, yet they encounter issues with either multi-vie… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  13. arXiv:2402.14544  [pdf, other

    cs.CR cs.SE

    {A New Hope}: Contextual Privacy Policies for Mobile Applications and An Approach Toward Automated Generation

    Authors: Shidong Pan, Zhen Tao, Thong Hoang, Dawen Zhang, Tianshi Li, Zhenchang Xing, Sherry Xu, Mark Staples, Thierry Rakotoarivelo, David Lo

    Abstract: Privacy policies have emerged as the predominant approach to conveying privacy notices to mobile application users. In an effort to enhance both readability and user engagement, the concept of contextual privacy policies (CPPs) has been proposed by researchers. The aim of CPPs is to fragment privacy policies into concise snippets, displaying them only within the corresponding contexts within the a… ▽ More

    Submitted 10 March, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: USENIX Security 2024. arXiv admin note: text overlap with arXiv:2307.01691

  14. arXiv:2401.15234  [pdf, other

    cs.SE

    Moving beyond Deletions: Program Simplification via Diverse Program Transformations

    Authors: Haibo Wang, Zezhong Xing, Zheng Wang, Chengnian Sun, Shin Hwei Tan

    Abstract: To reduce the complexity of software, Developers manually simplify program (known as developer-induced program simplification in this paper) to reduce its code size yet preserving its functionality but manual simplification is time-consuming and error-prone. To reduce manual effort, rule-based approaches (e.g., refactoring) and deletion-based approaches (e.g., delta debugging) can be potentially a… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

  15. arXiv:2401.14268  [pdf, other

    cs.HC

    GPTVoiceTasker: LLM-Powered Virtual Assistant for Smartphone

    Authors: Minh Duc Vu, Han Wang, Zhuang Li, Jieshan Chen, Shengdong Zhao, Zhenchang Xing, Chunyang Chen

    Abstract: Virtual assistants have the potential to play an important role in helping users achieves different tasks. However, these systems face challenges in their real-world usability, characterized by inefficiency and struggles in grasping user intentions. Leveraging recent advances in Large Language Models (LLMs), we introduce GptVoiceTasker, a virtual assistant poised to enhance user experiences and ta… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

  16. arXiv:2401.14168  [pdf, other

    cs.CV

    Vivim: a Video Vision Mamba for Medical Video Object Segmentation

    Authors: Yijun Yang, Zhaohu Xing, Chunwang Huang, Lei Zhu

    Abstract: Traditional convolutional neural networks have a limited receptive field while transformer-based networks are mediocre in constructing long-term dependency from the perspective of computational complexity. Such the bottleneck poses a significant challenge when processing long sequences in video analysis tasks. Very recently, the state space models (SSMs) with efficient hardware-aware designs, famo… ▽ More

    Submitted 12 March, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

  17. arXiv:2401.13560  [pdf, other

    cs.CV

    SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation

    Authors: Zhaohu Xing, Tian Ye, Yijun Yang, Guang Liu, Lei Zhu

    Abstract: The Transformer architecture has shown a remarkable ability in modeling global relationships. However, it poses a significant computational challenge when processing high-dimensional medical images. This hinders its development and widespread adoption in this task. Mamba, as a State Space Model (SSM), recently emerged as a notable manner for long-range dependencies in sequential modeling, excellin… ▽ More

    Submitted 25 February, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: Code has released

  18. arXiv:2312.13964  [pdf, other

    cs.CV cs.AI

    PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models

    Authors: Yiming Zhang, Zhening Xing, Yanhong Zeng, Youqing Fang, Kai Chen

    Abstract: Recent advancements in personalized text-to-image (T2I) models have revolutionized content creation, empowering non-experts to generate stunning images with unique styles. While promising, adding realistic motions into these personalized images by text poses significant challenges in preserving distinct styles, high-fidelity details, and achieving motion controllability by text. In this paper, we… ▽ More

    Submitted 25 March, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: Project page: https://pi-animator.github.io/

  19. arXiv:2312.13752  [pdf

    eess.IV cs.AI cs.CV

    Hunting imaging biomarkers in pulmonary fibrosis: Benchmarks of the AIIB23 challenge

    Authors: Yang Nan, Xiaodan Xing, Shiyi Wang, Zeyu Tang, Federico N Felder, Sheng Zhang, Roberta Eufrasia Ledda, Xiaoliu Ding, Ruiqi Yu, Weiping Liu, Feng Shi, Tianyang Sun, Zehong Cao, Minghui Zhang, Yun Gu, Hanxiao Zhang, Jian Gao, Pingyu Wang, Wen Tang, Pengxin Yu, Han Kang, Junqiang Chen, Xing Lu, Boyu Zhang, Michail Mamalakis , et al. (16 additional authors not shown)

    Abstract: Airway-related quantitative imaging biomarkers are crucial for examination, diagnosis, and prognosis in pulmonary diseases. However, the manual delineation of airway trees remains prohibitively time-consuming. While significant efforts have been made towards enhancing airway modelling, current public-available datasets concentrate on lung diseases with moderate morphological variations. The intric… ▽ More

    Submitted 16 April, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: 19 pages

  20. arXiv:2312.09576  [pdf, other

    eess.IV cs.CV

    SegRap2023: A Benchmark of Organs-at-Risk and Gross Tumor Volume Segmentation for Radiotherapy Planning of Nasopharyngeal Carcinoma

    Authors: Xiangde Luo, Jia Fu, Yunxin Zhong, Shuolin Liu, Bing Han, Mehdi Astaraki, Simone Bendazzoli, Iuliana Toma-Dasu, Yiwen Ye, Ziyang Chen, Yong Xia, Yanzhou Su, Jin Ye, Junjun He, Zhaohu Xing, Hongqiu Wang, Lei Zhu, Kaixiang Yang, Xin Fang, Zhiwei Wang, Chan Woong Lee, Sang Joon Park, Jaehee Chun, Constantin Ulrich, Klaus H. Maier-Hein , et al. (17 additional authors not shown)

    Abstract: Radiation therapy is a primary and effective NasoPharyngeal Carcinoma (NPC) treatment strategy. The precise delineation of Gross Tumor Volumes (GTVs) and Organs-At-Risk (OARs) is crucial in radiation treatment, directly impacting patient prognosis. Previously, the delineation of GTVs and OARs was performed by experienced radiation oncologists. Recently, deep learning has achieved promising results… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: A challenge report of SegRap2023 (organized in conjunction with MICCAI2023)

  21. arXiv:2312.07755  [pdf, other

    cs.HC

    Designing with Language: Wireframing UI Design Intent with Generative Large Language Models

    Authors: Sidong Feng, Mingyue Yuan, Jieshan Chen, Zhenchang Xing, Chunyang Chen

    Abstract: Wireframing is a critical step in the UI design process. Mid-fidelity wireframes offer more impactful and engaging visuals compared to low-fidelity versions. However, their creation can be time-consuming and labor-intensive, requiring the addition of actual content and semantic icons. In this paper, we introduce a novel solution WireGen, to automatically generate mid-fidelity wireframes with just… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  22. arXiv:2312.05772  [pdf, other

    cs.SE

    A^3-CodGen: A Repository-Level Code Generation Framework for Code Reuse with Local-Aware, Global-Aware, and Third-Party-Library-Aware

    Authors: Dianshu Liao, Shidong Pan, Xiaoyu Sun, Xiaoxue Ren, Qing Huang, Zhenchang Xing, Huan Jin, Qinying Li

    Abstract: Code generation tools are essential to help developers in the software development process. Existing tools often disconnect with the working context, i.e., the code repository, causing the generated code to be not similar to human developers. In this paper, we propose a novel code generation framework, dubbed A^3-CodGen, to harness information within the code repository to generate code with fewer… ▽ More

    Submitted 5 March, 2024; v1 submitted 10 December, 2023; originally announced December 2023.

  23. arXiv:2311.18837  [pdf, other

    cs.CV cs.AI cs.LG cs.MM

    VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models

    Authors: Zhen Xing, Qi Dai, Zihao Zhang, Hui Zhang, Han Hu, Zuxuan Wu, Yu-Gang Jiang

    Abstract: Diffusion models have achieved significant success in image and video generation. This motivates a growing interest in video editing tasks, where videos are edited according to provided text descriptions. However, most existing approaches only focus on video editing for short clips and rely on time-consuming tuning or inference. We are the first to propose Video Instruction Diffusion (VIDiff), a u… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

  24. arXiv:2311.18252  [pdf, other

    cs.SE cs.AI cs.CY cs.LG

    Navigating Privacy and Copyright Challenges Across the Data Lifecycle of Generative AI

    Authors: Dawen Zhang, Boming Xia, Yue Liu, Xiwei Xu, Thong Hoang, Zhenchang Xing, Mark Staples, Qinghua Lu, Liming Zhu

    Abstract: The advent of Generative AI has marked a significant milestone in artificial intelligence, demonstrating remarkable capabilities in generating realistic images, texts, and data patterns. However, these advancements come with heightened concerns over data privacy and copyright infringement, primarily due to the reliance on vast datasets for model training. Traditional approaches like differential p… ▽ More

    Submitted 10 January, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: Accepted by 2024 IEEE/ACM 3rd International Conference on AI Engineering - Software Engineering for AI (CAIN)

  25. arXiv:2311.14768  [pdf, other

    cs.CV cs.AI

    AdaDiff: Adaptive Step Selection for Fast Diffusion

    Authors: Hui Zhang, Zuxuan Wu, Zhen Xing, Jie Shao, Yu-Gang Jiang

    Abstract: Diffusion models, as a type of generative models, have achieved impressive results in generating images and videos conditioned on textual conditions. However, the generation process of diffusion models involves denoising for dozens of steps to produce photorealistic images/videos, which is computationally expensive. Unlike previous methods that design ``one-size-fits-all'' approaches for speed up,… ▽ More

    Submitted 24 November, 2023; originally announced November 2023.

    Comments: 10 pages, 5 figures

  26. arXiv:2311.13158  [pdf, other

    cs.SE

    Towards a Responsible AI Metrics Catalogue: A Collection of Metrics for AI Accountability

    Authors: Boming Xia, Qinghua Lu, Liming Zhu, Sung Une Lee, Yue Liu, Zhenchang Xing

    Abstract: Artificial Intelligence (AI), particularly through the advent of large-scale generative AI (GenAI) models such as Large Language Models (LLMs), has become a transformative element in contemporary technology. While these models have unlocked new possibilities, they simultaneously present significant challenges, such as concerns over data privacy and the propensity to generate misleading or fabricat… ▽ More

    Submitted 17 January, 2024; v1 submitted 21 November, 2023; originally announced November 2023.

  27. arXiv:2311.13148  [pdf, other

    cs.AI cs.SE

    Towards Responsible Generative AI: A Reference Architecture for Designing Foundation Model based Agents

    Authors: Qinghua Lu, Liming Zhu, Xiwei Xu, Zhenchang Xing, Stefan Harrer, Jon Whittle

    Abstract: Foundation models, such as large language models (LLMs), have been widely recognised as transformative AI technologies due to their capabilities to understand and generate content, including plans with reasoning capabilities. Foundation model based agents derive their autonomy from the capabilities of foundation models, which enable them to autonomously break down a given goal into a set of manage… ▽ More

    Submitted 2 April, 2024; v1 submitted 21 November, 2023; originally announced November 2023.

  28. arXiv:2311.01266  [pdf, other

    cs.SE

    Let's Discover More API Relations: A Large Language Model-based AI Chain for Unsupervised API Relation Inference

    Authors: Qing Huang, Yanbang Sun, Zhenchang Xing, Yuanlong Cao, Jieshan Chen, Xiwei Xu, Huan Jin, Jiaxing Lu

    Abstract: APIs have intricate relations that can be described in text and represented as knowledge graphs to aid software engineering tasks. Existing relation extraction methods have limitations, such as limited API text corpus and affected by the characteristics of the input text.To address these limitations, we propose utilizing large language models (LLMs) (e.g., GPT-3.5) as a neural knowledge base for A… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

  29. arXiv:2310.14848  [pdf, other

    cs.LG cs.CR

    Zero-knowledge Proof Meets Machine Learning in Verifiability: A Survey

    Authors: Zhibo Xing, Zijian Zhang, Jiamou Liu, Ziang Zhang, Meng Li, Liehuang Zhu, Giovanni Russello

    Abstract: With the rapid advancement of artificial intelligence technology, the usage of machine learning models is gradually becoming part of our daily lives. High-quality models rely not only on efficient optimization algorithms but also on the training and learning processes built upon vast amounts of data and computational power. However, in practice, due to various challenges such as limited computatio… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

  30. arXiv:2310.10647  [pdf, other

    cs.CV cs.AI cs.LG

    A Survey on Video Diffusion Models

    Authors: Zhen Xing, Qijun Feng, Haoran Chen, Qi Dai, Han Hu, Hang Xu, Zuxuan Wu, Yu-Gang Jiang

    Abstract: The recent wave of AI-generated content (AIGC) has witnessed substantial success in computer vision, with the diffusion model playing a crucial role in this achievement. Due to their impressive generative capabilities, diffusion models are gradually superseding methods based on GANs and auto-regressive Transformers, demonstrating exceptional performance not only in image generation and editing, bu… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  31. arXiv:2310.07915  [pdf, other

    cs.NI cs.CY cs.SI

    Tag Your Fish in the Broken Net: A Responsible Web Framework for Protecting Online Privacy and Copyright

    Authors: Dawen Zhang, Boming Xia, Yue Liu, Xiwei Xu, Thong Hoang, Zhenchang Xing, Mark Staples, Qinghua Lu, Liming Zhu

    Abstract: The World Wide Web, a ubiquitous source of information, serves as a primary resource for countless individuals, amassing a vast amount of data from global internet users. However, this online data, when scraped, indexed, and utilized for activities like web crawling, search engine indexing, and, notably, AI model training, often diverges from the original intent of its contributors. The ascent of… ▽ More

    Submitted 5 November, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: added some information on how to deal with CDN in the design section; minor fixes on writing

  32. arXiv:2309.16134  [pdf, other

    cs.SE

    Let's Chat to Find the APIs: Connecting Human, LLM and Knowledge Graph through AI Chain

    Authors: Qing Huang, Zhenyu Wan, Zhenchang Xing, Changjing Wang, Jieshan Chen, Xiwei Xu, Qinghua Lu

    Abstract: API recommendation methods have evolved from literal and semantic keyword matching to query expansion and query clarification. The latest query clarification method is knowledge graph (KG)-based, but limitations include out-of-vocabulary (OOV) failures and rigid question templates. To address these limitations, we propose a novel knowledge-guided query clarification approach for API recommendation… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: Accepted on ASE'2023

  33. arXiv:2309.15606  [pdf, other

    cs.SE

    From Misuse to Mastery: Enhancing Code Generation with Knowledge-Driven AI Chaining

    Authors: Xiaoxue Ren, Xinyuan Ye, Dehai Zhao, Zhenchang Xing, Xiaohu Yang

    Abstract: Large Language Models (LLMs) have shown promising results in automatic code generation by improving coding efficiency to a certain extent. However, generating high-quality and reliable code remains a formidable task because of LLMs' lack of good programming practice, especially in exception handling. In this paper, we first conduct an empirical study and summarise three crucial challenges of LLMs… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: Accepted by 38th IEEE/ACM International Conference on Automated Software Engineering (ASE 2023)

  34. arXiv:2309.08273  [pdf, other

    cs.CV

    A Generative Framework for Self-Supervised Facial Representation Learning

    Authors: Ruian He, Zhen Xing, Weimin Tan, Bo Yan

    Abstract: Self-supervised representation learning has gained increasing attention for strong generalization ability without relying on paired datasets. However, it has not been explored sufficiently for facial representation. Self-supervised facial representation learning remains unsolved due to the coupling of facial identities, expressions, and external factors like pose and light. Prior methods primarily… ▽ More

    Submitted 22 May, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

  35. arXiv:2309.07804  [pdf, other

    cs.SE cs.CL

    Pop Quiz! Do Pre-trained Code Models Possess Knowledge of Correct API Names?

    Authors: Terry Yue Zhuo, Xiaoning Du, Zhenchang Xing, Jiamou Sun, Haowei Quan, Li Li, Liming Zhu

    Abstract: Recent breakthroughs in pre-trained code models, such as CodeBERT and Codex, have shown their superior performance in various downstream tasks. The correctness and unambiguity of API usage among these code models are crucial for achieving desirable program functionalities, requiring them to learn various API fully qualified names structurally and semantically. Recent studies reveal that even state… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

  36. arXiv:2309.05619  [pdf, other

    cs.CL

    Effective Proxy for Human Labeling: Ensemble Disagreement Scores in Large Language Models for Industrial NLP

    Authors: Wei Du, Laksh Advani, Yashmeet Gambhir, Daniel J Perry, Prashant Shiralkar, Zhengzheng Xing, Aaron Colak

    Abstract: Large language models (LLMs) have demonstrated significant capability to generalize across a large number of NLP tasks. For industry applications, it is imperative to assess the performance of the LLM on unlabeled production data from time to time to validate for a real-world setting. Human labeling to assess model error requires considerable expense and time delay. Here we demonstrate that ensemb… ▽ More

    Submitted 19 November, 2023; v1 submitted 11 September, 2023; originally announced September 2023.

    Comments: Camera ready version for 2023 EMNLP (The Third Workshop on Natural Language Generation, Evaluation, and Metrics (GEM))

  37. arXiv:2309.01386  [pdf, other

    cs.CR

    SemProtector: A Unified Framework for Semantic Protection in Deep Learning-based Semantic Communication Systems

    Authors: Xinghan Liu, Guoshun Nan, Qimei Cui, Zeju Li, Peiyuan Liu, Zebin Xing, Hanqing Mu, Xiaofeng Tao, Tony Q. S. Quek

    Abstract: Recently proliferated semantic communications (SC) aim at effectively transmitting the semantics conveyed by the source and accurately interpreting the meaning at the destination. While such a paradigm holds the promise of making wireless communications more intelligent, it also suffers from severe semantic security issues, such as eavesdropping, privacy leaking, and spoofing, due to the open natu… ▽ More

    Submitted 4 September, 2023; originally announced September 2023.

    Comments: Accepted by Communications Magazine

  38. arXiv:2308.16759  [pdf, other

    cs.LG eess.SP

    Constructing Indoor Region-based Radio Map without Location Labels

    Authors: Zheng Xing, Junting Chen

    Abstract: Radio map construction requires a large amount of radio measurement data with location labels, which imposes a high deployment cost. This paper develops a region-based radio map from received signal strength (RSS) measurements without location labels. The construction is based on a set of blindly collected RSS measurement data from a device that visits each region in an indoor area exactly once, w… ▽ More

    Submitted 22 February, 2024; v1 submitted 31 August, 2023; originally announced August 2023.

  39. arXiv:2308.14726  [pdf, other

    cs.CV cs.AI

    PanoSwin: a Pano-style Swin Transformer for Panorama Understanding

    Authors: Zhixin Ling, Zhen Xing, Xiangdong Zhou, Manliang Cao, Guichun Zhou

    Abstract: In panorama understanding, the widely used equirectangular projection (ERP) entails boundary discontinuity and spatial distortion. It severely deteriorates the conventional CNNs and vision Transformers on panoramas. In this paper, we propose a simple yet effective architecture named PanoSwin to learn panorama representations with ERP. To deal with the challenges brought by equirectangular projecti… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

    Comments: CVPR 2023

  40. arXiv:2308.11237  [pdf, other

    cs.SE

    Distinguishing Look-Alike Innocent and Vulnerable Code by Subtle Semantic Representation Learning and Explanation

    Authors: Chao Ni, Xin Yin, Kaiwen Yang, Dehai Zhao, Zhenchang Xing, Xin Xia

    Abstract: Though many deep learning (DL)-based vulnerability detection approaches have been proposed and indeed achieved remarkable performance, they still have limitations in the generalization as well as the practical usage. More precisely, existing DL-based approaches (1) perform negatively on prediction tasks among functions that are lexically similar but have contrary semantics; (2) provide no intuitiv… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

    Comments: Accepted By FSE'23

  41. arXiv:2308.09710  [pdf, other

    cs.CV cs.AI

    SimDA: Simple Diffusion Adapter for Efficient Video Generation

    Authors: Zhen Xing, Qi Dai, Han Hu, Zuxuan Wu, Yu-Gang Jiang

    Abstract: The recent wave of AI-generated content has witnessed the great development and success of Text-to-Image (T2I) technologies. By contrast, Text-to-Video (T2V) still falls short of expectations though attracting increasing interests. Existing works either train from scratch or adapt large T2I model to videos, both of which are computation and resource expensive. In this work, we propose a Simple Dif… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

  42. arXiv:2308.09568  [pdf, other

    cs.CV cs.CL

    PUMGPT: A Large Vision-Language Model for Product Understanding

    Authors: Wei Xue, Zongyi Guo, Baoliang Cui, Zheng Xing, Xiaoyi Zeng, Xiufei Wang, Shuhui Wu, Weiming Lu

    Abstract: E-commerce platforms benefit from accurate product understanding to enhance user experience and operational efficiency. Traditional methods often focus on isolated tasks such as attribute extraction or categorization, posing adaptability issues to evolving tasks and leading to usability challenges with noisy data from the internet. Current Large Vision Language Models (LVLMs) lack domain-specific… ▽ More

    Submitted 16 June, 2024; v1 submitted 18 August, 2023; originally announced August 2023.

  43. arXiv:2308.05898  [pdf, other

    cs.HC

    Unveiling the Tricks: Automated Detection of Dark Patterns in Mobile Applications

    Authors: Jieshan Chen, Jiamou Sun, Sidong Feng, Zhenchang Xing, Qinghua Lu, Xiwei Xu, Chunyang Chen

    Abstract: Mobile apps bring us many conveniences, such as online shopping and communication, but some use malicious designs called dark patterns to trick users into doing things that are not in their best interest. Many works have been done to summarize the taxonomy of these patterns and some have tried to mitigate the problems through various techniques. However, these techniques are either time-consuming,… ▽ More

    Submitted 10 August, 2023; originally announced August 2023.

    Comments: 20 pages, 9 figures, accepted by UIST 2023

  44. arXiv:2308.04788  [pdf, other

    cs.SE

    Adaptive Intellect Unleashed: The Feasibility of Knowledge Transfer in Large Language Models

    Authors: Qing Huang, Yishun Wu, Zhenchang Xing, He Jiang, Yu Cheng, Huan Jin

    Abstract: We conduct the first empirical study on using knowledge transfer to improve the generalization ability of large language models (LLMs) in software engineering tasks, which often require LLMs to generalize beyond their training data. Our proposed general knowledge transfer approach guides the LLM towards a similar and familiar API or code snippet it has encountered before, improving the model's gen… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

  45. arXiv:2308.03252  [pdf, other

    cs.HC

    Video2Action: Reducing Human Interactions in Action Annotation of App Tutorial Videos

    Authors: Sidong Feng, Chunyang Chen, Zhenchang Xing

    Abstract: Tutorial videos of mobile apps have become a popular and compelling way for users to learn unfamiliar app features. To make the video accessible to the users, video creators always need to annotate the actions in the video, including what actions are performed and where to tap. However, this process can be time-consuming and labor-intensive. In this paper, we introduce a lightweight approach Video… ▽ More

    Submitted 6 August, 2023; originally announced August 2023.

  46. arXiv:2308.01046  [pdf, other

    cs.NI

    Flexible Coherent Optical Access: Architectures, Algorithms, and Demonstrations

    Authors: Ji Zhou, Zhenping Xing, Haide Wang, Kuo Zhang, Xi Chen, Qiguang Feng, Keshuang Zheng, Yijia Zhao, Zhen Dong, Tao Gui, Zhicheng Ye, Liangchuan Li

    Abstract: To cope with the explosive bandwidth demand, significant progress has been made in the ITU-T standardization sector to define a higher-speed passive optical network (PON) with a 50Gb/s line rate. Recently, 50G PON becomes mature gradually, which means it is time to discuss beyond 50G PON. For ensuring an acceptable optical power budget, beyond 50G PON will potentially use coherent technologies, wh… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

    Comments: The paper has been submitted to the Journal of Lightwave Technology

  47. arXiv:2307.11305  [pdf, other

    cs.SE

    Quantum Software Analytics: Opportunities and Challenges

    Authors: Thong Hoang, Hoa Khanh Dam, Tingting Bi, Qinghua Lu, Zhenchang Xing, Liming Zhu, Lam Duc Nguyen, Shiping Chen

    Abstract: Quantum computing systems depend on the principles of quantum mechanics to perform multiple challenging tasks more efficiently than their classical counterparts. In classical software engineering, the software life cycle is used to document and structure the processes of design, implementation, and maintenance of software applications. It helps stakeholders understand how to build an application.… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

  48. arXiv:2307.09885  [pdf, other

    cs.CY cs.AI cs.CL cs.HC

    Test-takers have a say: understanding the implications of the use of AI in language tests

    Authors: Dawen Zhang, Thong Hoang, Shidong Pan, Yongquan Hu, Zhenchang Xing, Mark Staples, Xiwei Xu, Qinghua Lu, Aaron Quigley

    Abstract: Language tests measure a person's ability to use a language in terms of listening, speaking, reading, or writing. Such tests play an integral role in academic, professional, and immigration domains, with entities such as educational institutions, professional accreditation bodies, and governments using them to assess candidate language proficiency. Recent advances in Artificial Intelligence (AI) a… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.

  49. arXiv:2307.03941  [pdf, other

    cs.CY cs.AI cs.CL

    Right to be Forgotten in the Era of Large Language Models: Implications, Challenges, and Solutions

    Authors: Dawen Zhang, Pamela Finckenberg-Broman, Thong Hoang, Shidong Pan, Zhenchang Xing, Mark Staples, Xiwei Xu

    Abstract: The Right to be Forgotten (RTBF) was first established as the result of the ruling of Google Spain SL, Google Inc. v AEPD, Mario Costeja González, and was later included as the Right to Erasure under the General Data Protection Regulation (GDPR) of European Union to allow individuals the right to request personal data be deleted by organizations. Specifically for search engines, individuals can se… ▽ More

    Submitted 4 June, 2024; v1 submitted 8 July, 2023; originally announced July 2023.

    Comments: Revised the writing

  50. arXiv:2307.02088  [pdf, other

    cs.SE

    Trust in Software Supply Chains: Blockchain-Enabled SBOM and the AIBOM Future

    Authors: Boming Xia, Dawen Zhang, Yue Liu, Qinghua Lu, Zhenchang Xing, Liming Zhu

    Abstract: The robustness of critical infrastructure systems is contingent upon the integrity and transparency of their software supply chains. A Software Bill of Materials (SBOM) is pivotal in this regard, offering an exhaustive inventory of components and dependencies crucial to software development. However, prevalent challenges in SBOM sharing, such as data tampering risks and vendors' reluctance to full… ▽ More

    Submitted 18 January, 2024; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: Accepted by The Joint International Workshop on Engineering and Security of Critical Systems and Software Vulnerability Management (EnCyCriS/SVM 2024), Co-located with ICSE 2024