Skip to main content

Showing 1–50 of 188 results for author: Lyu, M

  1. arXiv:2407.10423  [pdf, other

    cs.PF cs.ET

    Assessing the Impact of Network Quality-of-Service on Metaverse Virtual Reality User Experience

    Authors: Rahul Dev Tripathi, Minzhao Lyu, Vijay Sivaraman

    Abstract: Metaverse virtual reality (VR) applications enable users to socialise, work, entertain, and study online with immersive experiences beyond the classic PC-based interactions. While the 360-degree immersion enables users to be fully engaged in a virtual scenario, suboptimal Quality-of-Experience (QoE) like poorly displayed 3D graphics, disruptive loading time, or motion lagging caused by degraded ne… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: Accepted in Proc. IEEE MetaCom, Hong Kong, China, Aug 2024

  2. arXiv:2406.19708  [pdf, other

    cs.NE cs.AI cs.CE q-bio.NC

    A Differentiable Approach to Multi-scale Brain Modeling

    Authors: Chaoming Wang, Muyang Lyu, Tianqiu Zhang, Sichao He, Si Wu

    Abstract: We present a multi-scale differentiable brain modeling workflow utilizing BrainPy, a unique differentiable brain simulator that combines accurate brain simulation with powerful gradient-based optimization. We leverage this capability of BrainPy across different brain scales. At the single-neuron level, we implement differentiable neuron models and employ gradient methods to optimize their fit to e… ▽ More

    Submitted 1 July, 2024; v1 submitted 28 June, 2024; originally announced June 2024.

    Comments: 2nd Differentiable Almost Everything Workshop at ICML 2024

  3. arXiv:2406.16386  [pdf, other

    cs.SE cs.AI

    Automatically Generating UI Code from Screenshot: A Divide-and-Conquer-Based Approach

    Authors: Yuxuan Wan, Chaozheng Wang, Yi Dong, Wenxuan Wang, Shuqing Li, Yintong Huo, Michael R. Lyu

    Abstract: Websites are critical in today's digital world, with over 1.11 billion currently active and approximately 252,000 new sites launched daily. Converting website layout design into functional UI code is a time-consuming yet indispensable step of website development. Manual methods of converting visual designs into functional code present significant challenges, especially for non-experts. To explore… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  4. Less Cybersickness, Please: Demystifying and Detecting Stereoscopic Visual Inconsistencies in VR Apps

    Authors: Shuqing Li, Cuiyun Gao, Jianping Zhang, Yujia Zhang, Yepang Liu, Jiazhen Gu, Yun Peng, Michael R. Lyu

    Abstract: The quality of Virtual Reality (VR) apps is vital, particularly the rendering quality of the VR Graphical User Interface (GUI). Different from traditional 2D apps, VR apps create a 3D digital scene for users, by rendering two distinct 2D images for the user's left and right eyes, respectively. Stereoscopic visual inconsistency (denoted as "SVI") issues, however, undermine the rendering process of… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: This work has been accepted at the ACM International Conference on the Foundations of Software Engineering (FSE) 2024, Porto de Galinhas, Brazil. DOI: https://doi.org/10.1145/3660803

  5. arXiv:2406.07174  [pdf, other

    cs.SE

    ULog: Unsupervised Log Parsing with Large Language Models through Log Contrastive Units

    Authors: Junjie Huang, Zhihan Jiang, Zhuangbin Chen, Michael R. Lyu

    Abstract: Log parsing serves as an essential prerequisite for various log analysis tasks. Recent advancements in this field have improved parsing accuracy by leveraging the semantics in logs through fine-tuning large language models (LLMs) or learning from in-context demonstrations. However, these methods heavily depend on labeled examples to achieve optimal performance. In practice, collecting sufficient l… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  6. arXiv:2406.06975  [pdf, other

    cs.DC cs.SE

    TraceMesh: Scalable and Streaming Sampling for Distributed Traces

    Authors: Zhuangbin Chen, Zhihan Jiang, Yuxin Su, Michael R. Lyu, Zibin Zheng

    Abstract: Distributed tracing serves as a fundamental element in the monitoring of cloud-based and datacenter systems. It provides visibility into the full lifecycle of a request or operation across multiple services, which is essential for understanding system dependencies and performance bottlenecks. To mitigate computational and storage overheads, most tracing frameworks adopt a uniform sampling strategy… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted by The 2024 IEEE 17th International Conference on Cloud Computing (CLOUD)

  7. arXiv:2405.02213  [pdf, other

    cs.SE cs.AI cs.LG

    Automatic Programming: Large Language Models and Beyond

    Authors: Michael R. Lyu, Baishakhi Ray, Abhik Roychoudhury, Shin Hwei Tan, Patanamon Thongtanunam

    Abstract: Automatic programming has seen increasing popularity due to the emergence of tools like GitHub Copilot which rely on Large Language Models (LLMs). At the same time, automatically generated code faces challenges during deployment due to concerns around quality and trust. In this article, we study automated coding in a general sense and study the concerns around code quality, security and related is… ▽ More

    Submitted 15 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

  8. arXiv:2404.19368  [pdf, other

    cs.SE

    Exploring Multi-Lingual Bias of Large Code Models in Code Generation

    Authors: Chaozheng Wang, Zongjie Li, Cuiyun Gao, Wenxuan Wang, Ting Peng, Hailiang Huang, Yuetang Deng, Shuai Wang, Michael R. Lyu

    Abstract: Code generation aims to synthesize code and fulfill functional requirements based on natural language (NL) specifications, which can greatly improve development efficiency. In the era of large language models (LLMs), large code models (LCMs) have been recently proposed to generate source code. LCMs can generate highly feasible solutions for programming problems described in natural language. Despi… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: 12 pages

  9. arXiv:2404.17153  [pdf, other

    cs.SE

    A Unified Debugging Approach via LLM-Based Multi-Agent Synergy

    Authors: Cheryl Lee, Chunqiu Steven Xia, Jen-tse Huang, Zhouruixin Zhu, Lingming Zhang, Michael R. Lyu

    Abstract: Tremendous efforts have been devoted to automating software debugging, a time-consuming process involving fault localization and repair generation. Recently, Large Language Models (LLMs) have shown great potential in automated debugging. However, we identified three challenges posed to traditional and LLM-based debugging tools: 1) the upstream imperfection of fault localization affects the downstr… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  10. arXiv:2404.13957  [pdf, other

    cs.CL

    How Well Can LLMs Echo Us? Evaluating AI Chatbots' Role-Play Ability with ECHO

    Authors: Man Tik Ng, Hui Tung Tse, Jen-tse Huang, Jingjing Li, Wenxuan Wang, Michael R. Lyu

    Abstract: The role-play ability of Large Language Models (LLMs) has emerged as a popular research direction. However, existing studies focus on imitating well-known public figures or fictional characters, overlooking the potential for simulating ordinary individuals. Such an oversight limits the potential for advancements in digital human clones and non-player characters in video games. To bridge this gap,… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 9 pages

  11. arXiv:2403.19096  [pdf, other

    cs.SE cs.CR

    SCALE: Constructing Structured Natural Language Comment Trees for Software Vulnerability Detection

    Authors: Xin-Cheng Wen, Cuiyun Gao, Shuzheng Gao, Yang Xiao, Michael R. Lyu

    Abstract: Recently, there has been a growing interest in automatic software vulnerability detection. Pre-trained model-based approaches have demonstrated superior performance than other Deep Learning (DL)-based approaches in detecting vulnerabilities. However, the existing pre-trained model-based approaches generally employ code sequences as input during prediction, and may ignore vulnerability-related stru… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: Accepted by ISSTA 2024

  12. arXiv:2403.18252  [pdf, other

    cs.CV cs.AI cs.CL cs.LG cs.MM

    Beyond Embeddings: The Promise of Visual Table in Visual Reasoning

    Authors: Yiwu Zhong, Zi-Yuan Hu, Michael R. Lyu, Liwei Wang

    Abstract: Visual representation learning has been a cornerstone in computer vision, involving typical forms such as visual embeddings, structural symbols, and text-based representations. Despite the success of CLIP-type visual embeddings, they often lack access to world knowledge critical for visual reasoning. In this work, we propose Visual Table, a novel form of visual representation tailored for visual r… ▽ More

    Submitted 17 June, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: Project page: https://github.com/LaVi-Lab/Visual-Table

  13. arXiv:2403.17574  [pdf, other

    cs.SE cs.DC

    SPES: Towards Optimizing Performance-Resource Trade-Off for Serverless Functions

    Authors: Cheryl Lee, Zhouruixin Zhu, Tianyi Yang, Yintong Huo, Yuxin Su, Pinjia He, Michael R. Lyu

    Abstract: As an emerging cloud computing deployment paradigm, serverless computing is gaining traction due to its efficiency and ability to harness on-demand cloud resources. However, a significant hurdle remains in the form of the cold start problem, causing latency when launching new function instances from scratch. Existing solutions tend to use over-simplistic strategies for function pre-loading/unloadi… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 12 pages, accepted by ICDE 2024 (40th IEEE International Conference on Data Engineering)

  14. arXiv:2403.13089  [pdf

    cs.CL

    Automatic Summarization of Doctor-Patient Encounter Dialogues Using Large Language Model through Prompt Tuning

    Authors: Mengxian Lyu, Cheng Peng, Xiaohan Li, Patrick Balian, Jiang Bian, Yonghui Wu

    Abstract: Automatic text summarization (ATS) is an emerging technology to assist clinicians in providing continuous and coordinated care. This study presents an approach to summarize doctor-patient dialogues using generative large language models (LLMs). We developed prompt-tuning algorithms to instruct generative LLMs to summarize clinical text. We examined the prompt-tuning strategies, the size of soft pr… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  15. arXiv:2403.11807  [pdf, other

    cs.AI cs.CL

    How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments

    Authors: Jen-tse Huang, Eric John Li, Man Ho Lam, Tian Liang, Wenxuan Wang, Youliang Yuan, Wenxiang Jiao, Xing Wang, Zhaopeng Tu, Michael R. Lyu

    Abstract: Decision-making, a complicated task requiring various types of abilities, presents an excellent framework for assessing Large Language Models (LLMs). Our research investigates LLMs' decision-making capabilities through the lens of a well-established field, Game Theory. We focus specifically on games that support the participation of more than two agents simultaneously. Subsequently, we introduce o… ▽ More

    Submitted 25 April, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: 16 pages of main text. 11 pages of appendices. 15 figures, 9 tables. Updated scoring scheme

  16. arXiv:2403.06485  [pdf, other

    cs.SE cs.CL cs.LG

    Knowledge-aware Alert Aggregation in Large-scale Cloud Systems: a Hybrid Approach

    Authors: Jinxi Kuang, Jinyang Liu, Junjie Huang, Renyi Zhong, Jiazhen Gu, Lan Yu, Rui Tan, Zengyin Yang, Michael R. Lyu

    Abstract: Due to the scale and complexity of cloud systems, a system failure would trigger an "alert storm", i.e., massive correlated alerts. Although these alerts can be traced back to a few root causes, the overwhelming number makes it infeasible for manual handling. Alert aggregation is thus critical to help engineers concentrate on the root cause and facilitate failure resolution. Existing methods typic… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: Accepted by Proceedings of the 46th International Conference on Software Engineering: Software Engineering in Practice (ICSE SEIP 2024)

  17. arXiv:2403.05245  [pdf, other

    eess.IV cs.AI cs.CV

    Noise Level Adaptive Diffusion Model for Robust Reconstruction of Accelerated MRI

    Authors: Shoujin Huang, Guanxiong Luo, Xi Wang, Ziran Chen, Yuwan Wang, Huaishui Yang, Pheng-Ann Heng, Lingyan Zhang, Mengye Lyu

    Abstract: In general, diffusion model-based MRI reconstruction methods incrementally remove artificially added noise while imposing data consistency to reconstruct the underlying images. However, real-world MRI acquisitions already contain inherent noise due to thermal fluctuations. This phenomenon is particularly notable when using ultra-fast, high-resolution imaging sequences for advanced research, or usi… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  18. arXiv:2402.17583  [pdf, other

    cs.SE cs.CL cs.LG

    FaultProfIT: Hierarchical Fault Profiling of Incident Tickets in Large-scale Cloud Systems

    Authors: Junjie Huang, Jinyang Liu, Zhuangbin Chen, Zhihan Jiang, Yichen LI, Jiazhen Gu, Cong Feng, Zengyin Yang, Yongqiang Yang, Michael R. Lyu

    Abstract: Postmortem analysis is essential in the management of incidents within cloud systems, which provides valuable insights to improve system's reliability and robustness. At CloudA, fault pattern profiling is performed during the postmortem phase, which involves the classification of incidents' faults into unique categories, referred to as fault pattern. By aggregating and analyzing these fault patter… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted by Proceedings of the 46th International Conference on Software Engineering: Software Engineering in Practice (ICSE SEIP 2024)

  19. arXiv:2402.12958  [pdf, other

    cs.SE

    Go Static: Contextualized Logging Statement Generation

    Authors: Yichen Li, Yintong Huo, Renyi Zhong, Zhihan Jiang, Jinyang Liu, Junjie Huang, Jiazhen Gu, Pinjia He, Michael R. Lyu

    Abstract: Logging practices have been extensively investigated to assist developers in writing appropriate logging statements for documenting software behaviors. Although numerous automatic logging approaches have been proposed, their performance remains unsatisfactory due to the constraint of the single-method input, without informative programming context outside the method. Specifically, we identify thre… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: This paper was accepted by The ACM International Conference on the Foundations of Software Engineering (FSE 2024)

  20. arXiv:2402.11217  [pdf, other

    cs.CL cs.CV

    Asclepius: A Spectrum Evaluation Benchmark for Medical Multi-Modal Large Language Models

    Authors: Wenxuan Wang, Yihang Su, Jingyuan Huan, Jie Liu, Wenting Chen, Yudi Zhang, Cheng-Yi Li, Kao-Jung Chang, Xiaohan Xin, Linlin Shen, Michael R. Lyu

    Abstract: The significant breakthroughs of Medical Multi-Modal Large Language Models (Med-MLLMs) renovate modern healthcare with robust information synthesis and medical decision support. However, these models are often evaluated on benchmarks that are unsuitable for the Med-MLLMs due to the intricate nature of the real-world diagnostic frameworks, which encompass diverse medical specialties and involve com… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

    Comments: 20 pages, 15 figures

  21. MetaVRadar: Measuring Metaverse Virtual Reality Network Activity

    Authors: Minzhao Lyu, Rahul Dev Tripathi, Vijay Sivaraman

    Abstract: The "metaverse", wherein users can enter virtual worlds to work, study, play, shop, socialize, and entertain, is fast becoming a reality, attracting billions of dollars in investment from companies such as Meta, Microsoft, and Clipo Labs. Further, virtual reality (VR) headsets from entities like Oculus, HTC, and Microsoft are rapidly maturing to provide fully immersive experiences to metaverse use… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: This paper is accepted at ACM SIGMETRICS/IFIP PERFORMANCE 2024 and is published by the Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS)

    Journal ref: Proc. ACM Meas. Anal. Comput. Syst. 7, 3, Article 55 (December 2023), 29 pages

  22. arXiv:2402.03630  [pdf, other

    cs.SE cs.AI

    Enhancing LLM-Based Coding Tools through Native Integration of IDE-Derived Static Context

    Authors: Yichen Li, Yun Peng, Yintong Huo, Michael R. Lyu

    Abstract: Large Language Models (LLMs) have achieved remarkable success in code completion, as evidenced by their essential roles in developing code assistant services such as Copilot. Being trained on in-file contexts, current LLMs are quite effective in completing code for single source files. However, it is challenging for them to conduct repository-level code completion for large software projects that… ▽ More

    Submitted 19 February, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  23. Network Anatomy and Real-Time Measurement of Nvidia GeForce NOW Cloud Gaming

    Authors: Minzhao Lyu, Sharat Chandra Madanapalli, Arun Vishwanath, Vijay Sivaraman

    Abstract: Cloud gaming, wherein game graphics is rendered in the cloud and streamed back to the user as real-time video, expands the gaming market to billions of users who do not have gaming consoles or high-power graphics PCs. Companies like Nvidia, Amazon, Sony and Microsoft are investing in building cloud gaming platforms to tap this large unserved market. However, cloud gaming requires the user to have… ▽ More

    Submitted 13 February, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

    Comments: This paper is accepted at Passive and Active Measurement (PAM) conference Mar 2024

    Journal ref: M. Lyu, S. C. Madanapalli, A. Vishwanath, and V. Sivaraman, "Network Anatomy and Real-Time Measurement of Nvidia GeForce NOW Cloud Gaming", in Proc. PAM, Virtual Event, Mar 2024

  24. arXiv:2401.06175  [pdf, other

    cs.SE cs.AI cs.LG

    MTAD: Tools and Benchmarks for Multivariate Time Series Anomaly Detection

    Authors: Jinyang Liu, Wenwei Gu, Zhuangbin Chen, Yichen Li, Yuxin Su, Michael R. Lyu

    Abstract: Key Performance Indicators (KPIs) are essential time-series metrics for ensuring the reliability and stability of many software systems. They faithfully record runtime states to facilitate the understanding of anomalous system behaviors and provide informative clues for engineers to pinpoint the root causes. The unprecedented scale and complexity of modern software systems, however, make the volum… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: The code and datasets are available at https://github.com/OpsPAI/MTAD

  25. Learning in the Wild: Towards Leveraging Unlabeled Data for Effectively Tuning Pre-trained Code Models

    Authors: Shuzheng Gao, Wenxin Mao, Cuiyun Gao, Li Li, Xing Hu, Xin Xia, Michael R. Lyu

    Abstract: Pre-trained code models have recently achieved substantial improvements in many code intelligence tasks. These models are first pre-trained on large-scale unlabeled datasets in a task-agnostic manner using self-supervised learning, and then fine-tuned on labeled datasets in downstream tasks. However, the labeled datasets are usually limited in size (i.e., human intensive efforts), which may hinder… ▽ More

    Submitted 2 January, 2024; originally announced January 2024.

    Comments: Accepted by ICSE 2024

  26. arXiv:2401.00763  [pdf, other

    cs.SE cs.AI cs.CL cs.CV cs.MM

    New Job, New Gender? Measuring the Social Bias in Image Generation Models

    Authors: Wenxuan Wang, Haonan Bai, Jen-tse Huang, Yuxuan Wan, Youliang Yuan, Haoyi Qiu, Nanyun Peng, Michael R. Lyu

    Abstract: Image generation models can generate or edit images from a given text. Recent advancements in image generation technology, exemplified by DALL-E and Midjourney, have been groundbreaking. These advanced models, despite their impressive capabilities, are often trained on massive Internet datasets, making them susceptible to generating content that perpetuates social stereotypes and biases, which can… ▽ More

    Submitted 1 January, 2024; originally announced January 2024.

  27. arXiv:2401.00761  [pdf, other

    cs.SE cs.AI cs.CL

    The Earth is Flat? Unveiling Factual Errors in Large Language Models

    Authors: Wenxuan Wang, Juluan Shi, Zhaopeng Tu, Youliang Yuan, Jen-tse Huang, Wenxiang Jiao, Michael R. Lyu

    Abstract: Large Language Models (LLMs) like ChatGPT are foundational in various applications due to their extensive knowledge from pre-training and fine-tuning. Despite this, they are prone to generating factual and commonsense errors, raising concerns in critical areas like healthcare, journalism, and education to mislead users. Current methods for evaluating LLMs' veracity are limited by test data leakage… ▽ More

    Submitted 1 January, 2024; originally announced January 2024.

  28. arXiv:2401.00757  [pdf, other

    cs.SE cs.AI cs.CL cs.LO

    A & B == B & A: Triggering Logical Reasoning Failures in Large Language Models

    Authors: Yuxuan Wan, Wenxuan Wang, Yiliu Yang, Youliang Yuan, Jen-tse Huang, Pinjia He, Wenxiang Jiao, Michael R. Lyu

    Abstract: Recent advancements in large language models (LLMs) have propelled Artificial Intelligence (AI) to new heights, enabling breakthroughs in various tasks such as writing assistance, code generation, and machine translation. A significant distinction of advanced LLMs, such as ChatGPT, is their demonstrated ability to "reason." However, evaluating the reasoning ability of LLMs remains a challenge as m… ▽ More

    Submitted 1 January, 2024; originally announced January 2024.

  29. Realizing Open and Decentralized Marketplace for Exchanging Data of Expected IoT Behaviors

    Authors: Song Guo, Minzhao Lyu, Hassan Habibi Gharakheili

    Abstract: With rising concerns about the security of IoT devices, network operators need better ways to handle potential risks. Luckily, IoT devices show consistent patterns in how they communicate. But despite previous efforts, it remains unclear how knowledge of these patterns can be made available. As data marketplaces become popular in different domains, this paper1 proposes creating a special marketpla… ▽ More

    Submitted 29 December, 2023; originally announced January 2024.

    Comments: This manuscript is the full version of our paper [1] accepted to the IEEE/IFIP NOMS 2024 conference. IEEE/IFIP NOMS, Seoul, South Korea, May 2024

    Journal ref: NOMS 2024-2024 IEEE Network Operations and Management Symposium, Seoul, Korea, Republic of, 2024, pp. 1-5

  30. arXiv:2312.16145  [pdf, other

    cs.CV cs.AI cs.LG

    One-Dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications

    Authors: Mengyao Lyu, Yuhong Yang, Haiwen Hong, Hui Chen, Xuan Jin, Yuan He, Hui Xue, Jungong Han, Guiguang Ding

    Abstract: The prevalent use of commercial and open-source diffusion models (DMs) for text-to-image generation prompts risk mitigation to prevent undesired behaviors. Existing concept erasing methods in academia are all based on full parameter or specification-based fine-tuning, from which we observe the following issues: 1) Generation alternation towards erosion: Parameter drift during target elimination ca… ▽ More

    Submitted 11 March, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

    Comments: CVPR 2024

  31. arXiv:2312.10813  [pdf, other

    cs.CV cs.CL cs.LG

    Re-parameterized Low-rank Prompt: Generalize a Vision-Language Model within 0.5K Parameters

    Authors: Tianxiang Hao, Mengyao Lyu, Hui Chen, Sicheng Zhao, Jungong Han, Guiguang Ding

    Abstract: With the development of large pre-trained vision-language models, how to effectively transfer the knowledge of such foundational models to downstream tasks becomes a hot topic, especially in a data-deficient scenario. Recently, prompt tuning has become a popular solution. When adapting the vision-language models, researchers freeze the parameters in the backbone and only design and tune the prompt… ▽ More

    Submitted 11 January, 2024; v1 submitted 17 December, 2023; originally announced December 2023.

  32. arXiv:2310.12598  [pdf, other

    cs.SE

    Less is More? An Empirical Study on Configuration Issues in Python PyPI Ecosystem

    Authors: Yun Peng, Ruida Hu, Ruoke Wang, Cuiyun Gao, Shuqing Li, Michael R. Lyu

    Abstract: Python is widely used in the open-source community, largely owing to the extensive support from diverse third-party libraries within the PyPI ecosystem. Nevertheless, the utilization of third-party libraries can potentially lead to conflicts in dependencies, prompting researchers to develop dependency conflict detectors. Moreover, endeavors have been made to automatically infer dependencies. These… ▽ More

    Submitted 4 January, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: This paper has been accepted by ICSE 2024

  33. arXiv:2310.12481  [pdf, other

    cs.CL cs.AI

    Not All Countries Celebrate Thanksgiving: On the Cultural Dominance in Large Language Models

    Authors: Wenxuan Wang, Wenxiang Jiao, Jingyuan Huang, Ruyi Dai, Jen-tse Huang, Zhaopeng Tu, Michael R. Lyu

    Abstract: This paper identifies a cultural dominance issue within large language models (LLMs) due to the predominant use of English data in model training (e.g., ChatGPT). LLMs often provide inappropriate English-culture-related answers that are not relevant to the expected culture when users ask in non-English languages. To systematically evaluate the cultural dominance issue, we build a benchmark of conc… ▽ More

    Submitted 16 February, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

  34. arXiv:2310.01796  [pdf, other

    cs.SE

    LILAC: Log Parsing using LLMs with Adaptive Parsing Cache

    Authors: Zhihan Jiang, Jinyang Liu, Zhuangbin Chen, Yichen Li, Junjie Huang, Yintong Huo, Pinjia He, Jiazhen Gu, Michael R. Lyu

    Abstract: Log parsing transforms log messages into structured formats, serving as the prerequisite step for various log analysis tasks. Although a variety of log parsing approaches have been proposed, their performance on complicated log data remains compromised due to the use of human-crafted rules or learning-based models with limited training data. The recent emergence of powerful large language models (… ▽ More

    Submitted 22 March, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: This paper was accepted by The ACM International Conference on the Foundations of Software Engineering (FSE 2024)

  35. arXiv:2310.01386  [pdf, other

    cs.CL

    Who is ChatGPT? Benchmarking LLMs' Psychological Portrayal Using PsychoBench

    Authors: Jen-tse Huang, Wenxuan Wang, Eric John Li, Man Ho Lam, Shujie Ren, Youliang Yuan, Wenxiang Jiao, Zhaopeng Tu, Michael R. Lyu

    Abstract: Large Language Models (LLMs) have recently showcased their remarkable capacities, not only in natural language processing tasks but also across diverse domains such as clinical medicine, legal consultation, and education. LLMs become more than mere applications, evolving into assistants capable of addressing diverse user requests. This narrows the distinction between human beings and artificial in… ▽ More

    Submitted 22 January, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Accepted for ICLR 2024 Oral Presentation. 15 pages (main text) and 5 pages (appendix)

  36. arXiv:2310.00905  [pdf, other

    cs.CL cs.AI

    All Languages Matter: On the Multilingual Safety of Large Language Models

    Authors: Wenxuan Wang, Zhaopeng Tu, Chang Chen, Youliang Yuan, Jen-tse Huang, Wenxiang Jiao, Michael R. Lyu

    Abstract: Safety lies at the core of developing and deploying large language models (LLMs). However, previous safety benchmarks only concern the safety in one language, e.g. the majority language in the pretraining data such as English. In this work, we build the first multilingual safety benchmark for LLMs, XSafety, in response to the global deployment of LLMs in practice. XSafety covers 14 kinds of common… ▽ More

    Submitted 20 June, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Accepted by ACL 2024 Findings. The first multilingual safety benchmark for large language models

  37. arXiv:2310.00677  [pdf, other

    cs.SE

    A Roadmap towards Intelligent Operations for Reliable Cloud Computing Systems

    Authors: Yintong Huo, Cheryl Lee, Jinyang Liu, Tianyi Yang, Michael R. Lyu

    Abstract: The increasing complexity and usage of cloud systems have made it challenging for service providers to ensure reliability. This paper highlights two main challenges, namely internal and external factors, that affect the reliability of cloud microservices. Afterward, we discuss the data-driven approach that can resolve these challenges from four key aspects: ticket management, log management, multi… ▽ More

    Submitted 1 October, 2023; originally announced October 2023.

    Comments: This paper has been accepted by ICDM AIOPS workshop

  38. arXiv:2309.16102  [pdf, other

    cs.AI cs.DB

    Discovering Utility-driven Interval Rules

    Authors: Chunkai Zhang, Maohua Lyu, Huaijin Hao, Wensheng Gan, Philip S. Yu

    Abstract: For artificial intelligence, high-utility sequential rule mining (HUSRM) is a knowledge discovery method that can reveal the associations between events in the sequences. Recently, abundant methods have been proposed to discover high-utility sequence rules. However, the existing methods are all related to point-based sequences. Interval events that persist for some time are common. Traditional int… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: Preprint. 11 figures, 5 tables

  39. arXiv:2309.12167  [pdf, other

    cs.SE

    Revealing Performance Issues in Server-side WebAssembly Runtimes via Differential Testing

    Authors: Shuyao Jiang, Ruiying Zeng, Zihao Rao, Jiazhen Gu, Yangfan Zhou, Michael R. Lyu

    Abstract: WebAssembly (Wasm) is a bytecode format originally serving as a compilation target for Web applications. It has recently been used increasingly on the server side, e.g., providing a safer, faster, and more portable alternative to Linux containers. With the popularity of server-side Wasm applications, it is essential to study performance issues (i.e., abnormal latency) in Wasm runtimes, as they may… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: Accepted by the 38th IEEE/ACM International Conference on Automated Software Engineering (ASE 2023)

  40. Ditto: An Elastic and Adaptive Memory-Disaggregated Caching System

    Authors: Jiacheng Shen, Pengfei Zuo, Xuchuan Luo, Yuxin Su, Jiazhen Gu, Hao Feng, Yangfan Zhou, Michael R. Lyu

    Abstract: In-memory caching systems are fundamental building blocks in cloud services. However, due to the coupled CPU and memory on monolithic servers, existing caching systems cannot elastically adjust resources in a resource-efficient and agile manner. To achieve better elasticity, we propose to port in-memory caching systems to the disaggregated memory (DM) architecture, where compute and memory resourc… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

  41. arXiv:2309.08115  [pdf, other

    cs.SE

    REEF: A Framework for Collecting Real-World Vulnerabilities and Fixes

    Authors: Chaozheng Wang, Zongjie Li, Yun Peng, Shuzheng Gao, Sirong Chen, Shuai Wang, Cuiyun Gao, Michael R. Lyu

    Abstract: Software plays a crucial role in our daily lives, and therefore the quality and security of software systems have become increasingly important. However, vulnerabilities in software still pose a significant threat, as they can have serious consequences. Recent advances in automated program repair have sought to automatically detect and fix bugs using data-driven techniques. Sophisticated deep lear… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: Accepted by ASE 2023 Industry Challenge(Competition) Track

  42. Your Code Secret Belongs to Me: Neural Code Completion Tools Can Memorize Hard-Coded Credentials

    Authors: Yizhan Huang, Yichen Li, Weibin Wu, Jianping Zhang, Michael R. Lyu

    Abstract: Neural Code Completion Tools (NCCTs) have reshaped the field of software engineering, which are built upon the language modeling technique and can accurately suggest contextually relevant code snippets. However, language models may emit the training data verbatim during inference with appropriate prompts. This memorization property raises privacy concerns of NCCTs about hard-coded credential leaka… ▽ More

    Submitted 20 May, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

    Comments: Accepted by FSE '24

  43. arXiv:2308.10828  [pdf, other

    cs.SE

    A Large-Scale Evaluation for Log Parsing Techniques: How Far Are We?

    Authors: Zhihan Jiang, Jinyang Liu, Junjie Huang, Yichen Li, Yintong Huo, Jiazhen Gu, Zhuangbin Chen, Jieming Zhu, Michael R. Lyu

    Abstract: Log data have facilitated various tasks of software development and maintenance, such as testing, debugging and diagnosing. Due to the unstructured nature of logs, log parsing is typically required to transform log messages into structured data for automated log analysis. Given the abundance of log parsers that employ various techniques, evaluating these tools to comprehend their characteristics a… ▽ More

    Submitted 22 March, 2024; v1 submitted 21 August, 2023; originally announced August 2023.

    Comments: This paper was accepted by 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA 2024)

  44. arXiv:2308.09937  [pdf, other

    cs.SE cs.LG

    Practical Anomaly Detection over Multivariate Monitoring Metrics for Online Services

    Authors: Jinyang Liu, Tianyi Yang, Zhuangbin Chen, Yuxin Su, Cong Feng, Zengyin Yang, Michael R. Lyu

    Abstract: As modern software systems continue to grow in terms of complexity and volume, anomaly detection on multivariate monitoring metrics, which profile systems' health status, becomes more and more critical and challenging. In particular, the dependency between different metrics and their historical patterns plays a critical role in pursuing prompt and accurate anomaly detection. Existing approaches fa… ▽ More

    Submitted 19 August, 2023; originally announced August 2023.

    Comments: This paper has been accepted by the 34th IEEE International Symposium on Software Reliability Engineering (ISSRE'2023)

  45. arXiv:2308.09810  [pdf, other

    cs.SE cs.AI cs.CL cs.CV

    An Image is Worth a Thousand Toxic Words: A Metamorphic Testing Framework for Content Moderation Software

    Authors: Wenxuan Wang, Jingyuan Huang, Jen-tse Huang, Chang Chen, Jiazhen Gu, Pinjia He, Michael R. Lyu

    Abstract: The exponential growth of social media platforms has brought about a revolution in communication and content dissemination in human society. Nevertheless, these platforms are being increasingly misused to spread toxic content, including hate speech, malicious advertising, and pornography, leading to severe negative consequences such as harm to teenagers' mental health. Despite tremendous efforts i… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: Accepted by ASE 2023. arXiv admin note: substantial text overlap with arXiv:2302.05706

  46. arXiv:2308.09804  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    VL-PET: Vision-and-Language Parameter-Efficient Tuning via Granularity Control

    Authors: Zi-Yuan Hu, Yanyang Li, Michael R. Lyu, Liwei Wang

    Abstract: As the model size of pre-trained language models (PLMs) grows rapidly, full fine-tuning becomes prohibitively expensive for model training and storage. In vision-and-language (VL), parameter-efficient tuning (PET) techniques are proposed to integrate modular modifications (e.g., Adapter and LoRA) into encoder-decoder PLMs. By tuning a small set of trainable parameters, these techniques perform on… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: ICCV 2023 (17 pages, 6 figures, 22 tables)

  47. arXiv:2308.09324  [pdf, other

    cs.SE

    AutoLog: A Log Sequence Synthesis Framework for Anomaly Detection

    Authors: Yintong Huo, Yichen Li, Yuxin Su, Pinjia He, Zifan Xie, Michael R. Lyu

    Abstract: The rapid progress of modern computing systems has led to a growing interest in informative run-time logs. Various log-based anomaly detection techniques have been proposed to ensure software reliability. However, their implementation in the industry has been limited due to the lack of high-quality public log resources as training datasets. While some log datasets are available for anomaly detec… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: The paper has been accepted by ASE 2023 (Research Track)

  48. arXiv:2308.07676  [pdf, other

    cs.SE

    Maat: Performance Metric Anomaly Anticipation for Cloud Services with Conditional Diffusion

    Authors: Cheryl Lee, Tianyi Yang, Zhuangbin Chen, Yuxin Su, Michael R. Lyu

    Abstract: Ensuring the reliability and user satisfaction of cloud services necessitates prompt anomaly detection followed by diagnosis. Existing techniques for anomaly detection focus solely on real-time detection, meaning that anomaly alerts are issued as soon as anomalies occur. However, anomalies can propagate and escalate into failures, making faster-than-real-time anomaly detection highly desirable… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

    Comments: This paper has been accepted by the Research track of the 38th IEEE/ACM International Conference on Automated Software Engineering (ASE 2023)

  49. arXiv:2308.07638  [pdf, other

    cs.SE

    Prism: Revealing Hidden Functional Clusters from Massive Instances in Cloud Systems

    Authors: Jinyang Liu, Zhihan Jiang, Jiazhen Gu, Junjie Huang, Zhuangbin Chen, Cong Feng, Zengyin Yang, Yongqiang Yang, Michael R. Lyu

    Abstract: Ensuring the reliability of cloud systems is critical for both cloud vendors and customers. Cloud systems often rely on virtualization techniques to create instances of hardware resources, such as virtual machines. However, virtualization hinders the observability of cloud systems, making it challenging to diagnose platform-level issues. To improve system observability, we propose to infer functio… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

    Comments: The paper was accepted by the 38th IEEE/ACM International Conference on Automated Software Engineering (ASE 2023)

  50. arXiv:2308.06783  [pdf, other

    cs.SE cs.HC

    Towards Modeling Software Quality of Virtual Reality Applications from Users' Perspectives

    Authors: Shuqing Li, Lili Wei, Yepang Liu, Cuiyun Gao, Shing-Chi Cheung, Michael R. Lyu

    Abstract: Virtual Reality (VR) technology has become increasingly popular in recent years as a key enabler of the Metaverse. VR applications have unique characteristics, including the revolutionized human-computer interaction mechanisms, that distinguish them from traditional software. Hence, user expectations for the software quality of VR applications diverge from those for traditional software. Investiga… ▽ More

    Submitted 13 August, 2023; originally announced August 2023.

    ACM Class: D.2.9; H.5.1