Skip to main content

Showing 1–50 of 104 results for author: Wong, D

  1. arXiv:2407.08733  [pdf, other

    cs.CL

    Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist

    Authors: Zihao Zhou, Shudong Liu, Maizhen Ning, Wei Liu, Jindong Wang, Derek F. Wong, Xiaowei Huang, Qiufeng Wang, Kaizhu Huang

    Abstract: Exceptional mathematical reasoning ability is one of the key features that demonstrate the power of large language models (LLMs). How to comprehensively define and evaluate the mathematical abilities of LLMs, and even reflect the user experience in real-world scenarios, has emerged as a critical issue. Current benchmarks predominantly concentrate on problem-solving capabilities, which presents a s… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 35 pages, 10 figures, preprint

  2. arXiv:2407.04879  [pdf, other

    cs.SD eess.AS

    All Neural Low-latency Directional Speech Extraction

    Authors: Ashutosh Pandey, Sanha Lee, Juan Azcarreta, Daniel Wong, Buye Xu

    Abstract: We introduce a novel all neural model for low-latency directional speech extraction. The model uses direction of arrival (DOA) embeddings from a predefined spatial grid, which are transformed and fused into a recurrent neural network based speech extraction model. This process enables the model to effectively extract speech from a specified DOA. Unlike previous methods that relied on hand-crafted… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: Accepted for publication at INTERSPEECH 2024

  3. arXiv:2406.16405  [pdf, ps, other

    cs.DM cs.DS math.CO

    Greedy Gray Codes for some Restricted Classes of Binary Words

    Authors: Nathanaël Hassler, Vincent Vajnovszki, Dennis Wong

    Abstract: We investigate the existence of greedy Gray codes, based on the choice of the first element in the code, for two classes of binary words: generalized Fibonacci words and generalized Dyck words.

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: In Proceedings GASCom 2024, arXiv:2406.14588

    Journal ref: EPTCS 403, 2024, pp. 108-112

  4. arXiv:2406.11432  [pdf, other

    cs.CV cs.AI

    AnyTrans: Translate AnyText in the Image with Large Scale Models

    Authors: Zhipeng Qian, Pei Zhang, Baosong Yang, Kai Fan, Yiwei Ma, Derek F. Wong, Xiaoshuai Sun, Rongrong Ji

    Abstract: This paper introduces AnyTrans, an all-encompassing framework for the task-Translate AnyText in the Image (TATI), which includes multilingual text translation and text fusion within images. Our framework leverages the strengths of large-scale models, such as Large Language Models (LLMs) and text-guided diffusion models, to incorporate contextual cues from both textual and visual elements during tr… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  5. arXiv:2406.07054  [pdf, other

    cs.CL cs.AI

    CoEvol: Constructing Better Responses for Instruction Finetuning through Multi-Agent Cooperation

    Authors: Renhao Li, Minghuan Tan, Derek F. Wong, Min Yang

    Abstract: In recent years, instruction fine-tuning (IFT) on large language models (LLMs) has garnered considerable attention to enhance model performance on unseen tasks. Attempts have been made on automatic construction and effective selection for IFT data. However, we posit that previous methods have not fully harnessed the potential of LLMs for enhancing data quality. The responses within IFT data could… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  6. arXiv:2406.05221  [pdf, other

    cs.DC

    GCAPS: GPU Context-Aware Preemptive Priority-based Scheduling for Real-Time Tasks

    Authors: Yidi Wang, Cong Liu, Daniel Wong, Hyoseung Kim

    Abstract: Scheduling real-time tasks that utilize GPUs with analyzable guarantees poses a significant challenge due to the intricate interaction between CPU and GPU resources, as well as the complex GPU hardware and software stack. While much research has been conducted in the real-time research community, several limitations persist, including the absence or limited availability of GPU-level preemption, ex… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Accepted by ECRTS 2024. arXiv admin note: substantial text overlap with arXiv:2401.16529

  7. arXiv:2406.03450  [pdf, other

    cs.CL cs.AI

    What is the Best Way for ChatGPT to Translate Poetry?

    Authors: Shanshan Wang, Derek F. Wong, Jingming Yao, Lidia S. Chao

    Abstract: Machine translation (MT) has historically faced significant challenges when applied to literary works, particularly in the domain of poetry translation. The advent of Large Language Models such as ChatGPT holds potential for innovation in this field. This study examines ChatGPT's capabilities in English-Chinese poetry translation tasks, utilizing targeted prompts and small sample scenarios to asce… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 19 pages, 1 figure. The paper has been accepted by ACL 2024(Main Conference)

  8. arXiv:2406.00839  [pdf, other

    cs.CL cs.AI

    FOCUS: Forging Originality through Contrastive Use in Self-Plagiarism for Language Models

    Authors: Kaixin Lan, Tao Fang, Derek F. Wong, Yabo Xu, Lidia S. Chao, Cecilia G. Zhao

    Abstract: Pre-trained Language Models (PLMs) have shown impressive results in various Natural Language Generation (NLG) tasks, such as powering chatbots and generating stories. However, an ethical concern arises due to their potential to produce verbatim copies of paragraphs from their training data. This is problematic as PLMs are trained on corpora constructed by human authors. As such, there is a pressin… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: 16 pages, 8 figures. The paper has been accepted by ACL 2024 (Findings), with Kaixin Lan and Tao Fang contributing equally, and Derek F. Wong serving as the corresponding author

  9. arXiv:2405.14039  [pdf, other

    cs.CL cs.AI cs.LG

    Trajectory Volatility for Out-of-Distribution Detection in Mathematical Reasoning

    Authors: Yiming Wang, Pei Zhang, Baosong Yang, Derek F. Wong, Zhuosheng Zhang, Rui Wang

    Abstract: Real-world data deviating from the independent and identically distributed (i.i.d.) assumption of in-distribution training data poses security threats to deep networks, thus advancing out-of-distribution (OOD) detection algorithms. Detection methods in generative language models (GLMs) mainly focus on uncertainty estimation and embedding distance measurement, with the latter proven to be most effe… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 27 pages, 6 figures, 12 tables

  10. arXiv:2405.04286  [pdf, other

    cs.CL

    Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore

    Authors: Junchao Wu, Runzhe Zhan, Derek F. Wong, Shu Yang, Xuebo Liu, Lidia S. Chao, Min Zhang

    Abstract: The efficacy of an large language model (LLM) generated text detector depends substantially on the availability of sizable training data. White-box zero-shot detectors, which require no such data, are nonetheless limited by the accessibility of the source model of the LLM-generated text. In this paper, we propose an simple but effective black-box zero-shot detection approach, predicated on the obs… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  11. arXiv:2405.02925  [pdf, other

    cs.CL

    A Two-Stage Prediction-Aware Contrastive Learning Framework for Multi-Intent NLU

    Authors: Guanhua Chen, Yutong Yao, Derek F. Wong, Lidia S. Chao

    Abstract: Multi-intent natural language understanding (NLU) presents a formidable challenge due to the model confusion arising from multiple intents within a single utterance. While previous works train the model contrastively to increase the margin between different multi-intent labels, they are less suited to the nuances of multi-intent NLU. They ignore the rich information between the shared intents, whi… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

    Comments: LREC-COLING 2024

  12. arXiv:2404.18413  [pdf, other

    cs.CV cs.AI

    3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset

    Authors: Xinyu Ma, Xuebo Liu, Derek F. Wong, Jun Rao, Bei Li, Liang Ding, Lidia S. Chao, Dacheng Tao, Min Zhang

    Abstract: Multimodal machine translation (MMT) is a challenging task that seeks to improve translation quality by incorporating visual information. However, recent studies have indicated that the visual information provided by existing MMT datasets is insufficient, causing models to disregard it and overestimate their capabilities. This issue presents a significant obstacle to the development of MMT researc… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  13. arXiv:2404.16766  [pdf, other

    cs.CL cs.AI

    Prefix Text as a Yarn: Eliciting Non-English Alignment in Foundation Language Model

    Authors: Runzhe Zhan, Xinyi Yang, Derek F. Wong, Lidia S. Chao, Yue Zhang

    Abstract: While supervised fine-tuning (SFT) has been a straightforward approach for tailoring the output of foundation large language model (LLM) to specific preferences, concerns have been raised about the depth of this alignment, with some critiques suggesting it is merely "superficial". We critically examine this hypothesis within the scope of cross-lingual generation tasks, proposing that the effective… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

  14. arXiv:2403.15012  [pdf, other

    cs.LG stat.ML

    Empirical investigation of multi-source cross-validation in clinical machine learning

    Authors: Tuija Leinonen, David Wong, Ali Wahab, Ramesh Nadarajah, Matti Kaisti, Antti Airola

    Abstract: Traditionally, machine learning-based clinical prediction models have been trained and evaluated on patient data from a single source, such as a hospital. Cross-validation methods can be used to estimate the accuracy of such models on new patients originating from the same source, by repeated random splitting of the data. However, such estimates tend to be highly overoptimistic when compared to ac… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 14 pages, 3 figures

  15. arXiv:2403.12381  [pdf, other

    cs.CE

    Explainable AutoML (xAutoML) with adaptive modeling for yield enhancement in semiconductor smart manufacturing

    Authors: Weihong Zhai, Xiupeng Shi, Yiik Diew Wong, Qing Han, Lisheng Chen

    Abstract: Enhancing yield is recognized as a paramount driver to reducing production costs in semiconductor smart manufacturing. However, optimizing and ensuring high yield rates is a highly complex and technical challenge, especially while maintaining reliable yield diagnosis and prognosis, and this shall require understanding all the confounding factors in a complex condition. This study proposes a domain… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  16. arXiv:2403.11621  [pdf, other

    cs.CL

    Let's Focus on Neuron: Neuron-Level Supervised Fine-tuning for Large Language Model

    Authors: Haoyun Xu, Runzhe Zhan, Derek F. Wong, Lidia S. Chao

    Abstract: Large Language Models (LLMs) are composed of neurons that exhibit various behaviors and roles, which become increasingly diversified as models scale. Recent studies have revealed that not all neurons are active across different datasets, and this sparsity correlates positively with the task-specific ability, leading to advancements in model pruning and training efficiency. Traditional fine-tuning… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  17. arXiv:2402.16705  [pdf, other

    cs.CL cs.AI cs.LG

    SelectIT: Selective Instruction Tuning for Large Language Models via Uncertainty-Aware Self-Reflection

    Authors: Liangxin Liu, Xuebo Liu, Derek F. Wong, Dongfang Li, Ziyi Wang, Baotian Hu, Min Zhang

    Abstract: Instruction tuning (IT) is crucial to tailoring large language models (LLMs) towards human-centric interactions. Recent advancements have shown that the careful selection of a small, high-quality subset of IT data can significantly enhance the performance of LLMs. Despite this, common approaches often rely on additional models or data sets, which increases costs and limits widespread adoption. In… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  18. arXiv:2402.15566  [pdf

    eess.IV cs.CV cs.LG

    Closing the AI generalization gap by adjusting for dermatology condition distribution differences across clinical settings

    Authors: Rajeev V. Rikhye, Aaron Loh, Grace Eunhae Hong, Preeti Singh, Margaret Ann Smith, Vijaytha Muralidharan, Doris Wong, Rory Sayres, Michelle Phung, Nicolas Betancourt, Bradley Fong, Rachna Sahasrabudhe, Khoban Nasim, Alec Eschholz, Basil Mustafa, Jan Freyberg, Terry Spitz, Yossi Matias, Greg S. Corrado, Katherine Chou, Dale R. Webster, Peggy Bui, Yuan Liu, Yun Liu, Justin Ko , et al. (1 additional authors not shown)

    Abstract: Recently, there has been great progress in the ability of artificial intelligence (AI) algorithms to classify dermatological conditions from clinical photographs. However, little is known about the robustness of these algorithms in real-world settings where several factors can lead to a loss of generalizability. Understanding and overcoming these limitations will permit the development of generali… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

  19. arXiv:2402.15293  [pdf, other

    cs.CR

    SoK: What don't we know? Understanding Security Vulnerabilities in SNARKs

    Authors: Stefanos Chaliasos, Jens Ernstberger, David Theodore, David Wong, Mohammad Jahanara, Benjamin Livshits

    Abstract: Zero-knowledge proofs (ZKPs) have evolved from being a theoretical concept providing privacy and verifiability to having practical, real-world implementations, with SNARKs (Succinct Non-Interactive Argument of Knowledge) emerging as one of the most significant innovations. Prior work has mainly focused on designing more efficient SNARK systems and providing security proofs for them. Many think of… ▽ More

    Submitted 11 July, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

  20. arXiv:2402.07616  [pdf, other

    cs.CL cs.AI

    Anchor-based Large Language Models

    Authors: Jianhui Pang, Fanghua Ye, Derek Fai Wong, Xin He, Wanshun Chen, Longyue Wang

    Abstract: Large language models (LLMs) predominantly employ decoder-only transformer architectures, necessitating the retention of keys/values information for historical tokens to provide contextual information and avoid redundant computation. However, the substantial size and parameter volume of these LLMs require massive GPU memory. This memory demand increases with the length of the input text, leading t… ▽ More

    Submitted 1 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: The paper has been accepted by the ACL2024 conference. Work was done when Jianhui Pang and Fanghua Ye were interning at Tencent AI Lab

  21. Quantum-Secure Hybrid Blockchain System for DID-based Verifiable Random Function with NTRU Linkable Ring Signature

    Authors: Bong Gon Kim, Dennis Wong, Yoon Seok Yang

    Abstract: In this study, we present a secure smart contract-based Verifiable Random Function (VRF) model, addressing the shortcomings of existing systems. As quantum computing emerges, conventional public key cryptography faces potential vulnerabilities. To enhance our VRF's robustness, we employ post-quantum Ring-LWE encryption for generating pseudo-random sequences. Given the computational intensity of th… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: 25 pages, 5 figures, 2023 International Journal on Cryptography and Information Security (IJCIS). arXiv admin note: text overlap with arXiv:2311.11734

    Journal ref: Volume 13, Number 4, December 2023

  22. arXiv:2401.16529  [pdf, other

    cs.DC cs.PF

    Unleashing the Power of Preemptive Priority-based Scheduling for Real-Time GPU Tasks

    Authors: Yidi Wang, Cong Liu, Daniel Wong, Hyoseung Kim

    Abstract: Scheduling real-time tasks that utilize GPUs with analyzable guarantees poses a significant challenge due to the intricate interaction between CPU and GPU resources, as well as the complex GPU hardware and software stack. While much research has been conducted in the real-time research community, several limitations persist, including the absence or limited availability of preemption, extended blo… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  23. arXiv:2401.12794  [pdf, other

    cs.CL

    Benchmarking LLMs via Uncertainty Quantification

    Authors: Fanghua Ye, Mingming Yang, Jianhui Pang, Longyue Wang, Derek F. Wong, Emine Yilmaz, Shuming Shi, Zhaopeng Tu

    Abstract: The proliferation of open-source Large Language Models (LLMs) from various institutions has highlighted the urgent need for comprehensive evaluation methods. However, current evaluation platforms, such as the widely recognized HuggingFace open LLM leaderboard, neglect a crucial aspect -- uncertainty, which is vital for thoroughly assessing LLMs. To bridge this gap, we introduce a new benchmarking… ▽ More

    Submitted 25 April, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Comments: 25 pages, preprints

  24. arXiv:2401.08350  [pdf, other

    cs.CL

    Salute the Classic: Revisiting Challenges of Machine Translation in the Age of Large Language Models

    Authors: Jianhui Pang, Fanghua Ye, Longyue Wang, Dian Yu, Derek F. Wong, Shuming Shi, Zhaopeng Tu

    Abstract: The evolution of Neural Machine Translation (NMT) has been significantly influenced by six core challenges (Koehn and Knowles, 2017), which have acted as benchmarks for progress in this field. This study revisits these challenges, offering insights into their ongoing relevance in the context of advanced Large Language Models (LLMs): domain mismatch, amount of parallel data, rare word prediction, t… ▽ More

    Submitted 17 January, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: 17 pages. Longyue Wang is the Corresponding Author

  25. arXiv:2401.07882  [pdf, other

    cs.SD eess.AS

    On the Importance of Neural Wiener Filter for Resource Efficient Multichannel Speech Enhancement

    Authors: Tsun-An Hsieh, Jacob Donley, Daniel Wong, Buye Xu, Ashutosh Pandey

    Abstract: We introduce a time-domain framework for efficient multichannel speech enhancement, emphasizing low latency and computational efficiency. This framework incorporates two compact deep neural networks (DNNs) surrounding a multichannel neural Wiener filter (NWF). The first DNN enhances the speech signal to estimate NWF coefficients, while the second DNN refines the output from the NWF. The NWF, while… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: Accepted for publication at ICASSP

  26. Private and Secure Post-Quantum Verifiable Random Function with NIZK Proof and Ring-LWE Encryption in Blockchain

    Authors: Bong Gon Kim, Dennis Wong, Yoon Seok Yang

    Abstract: We present a secure and private blockchain-based Verifiable Random Function (VRF) scheme addressing some limitations of classical VRF constructions. Given the imminent quantum computing adversarial scenario, conventional cryptographic methods face vulnerabilities. To enhance our VRF's secure randomness, we adopt post-quantum Ring-LWE encryption for synthesizing pseudo-random sequences. Considering… ▽ More

    Submitted 7 February, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

    Comments: 21 pages, 5 figures, In the 2023 Proceedings of International Conference on Cryptography and Blockchain

    Journal ref: Proceedings of International Conference on Cryptography and Blockchain, 13(21), 47-67 (2023)

  27. arXiv:2311.03032  [pdf, other

    cs.RO

    Reconfigurable, Transformable Soft Pneumatic Actuator with Tunable 3D Deformations for Dexterous Soft Robotics Applications

    Authors: Dickson Chiu Yu Wong, Mingtan Li, Shijie Kang, Lifan Luo, Hongyu Yu

    Abstract: Numerous soft actuators based on PneuNet design have already been proposed and extensively employed across various soft robotics applications in recent years. Despite their widespread use, a common limitation of most existing designs is that their action is pre-determined during the fabrication process, thereby restricting the ability to modify or alter their function during operation. To address… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: Submitted to Soft Robotics Journal. 12 pages, 10 figures

  28. arXiv:2310.14724  [pdf, other

    cs.CL cs.AI

    A Survey on LLM-Generated Text Detection: Necessity, Methods, and Future Directions

    Authors: Junchao Wu, Shu Yang, Runzhe Zhan, Yulin Yuan, Derek F. Wong, Lidia S. Chao

    Abstract: The powerful ability to understand, follow, and generate complex language emerging from large language models (LLMs) makes LLM-generated text flood many areas of our daily lives at an incredible speed and is widely accepted by humans. As LLMs continue to expand, there is an imperative need to develop detectors that can detect LLM-generated text. This is crucial to mitigate potential misuse of LLMs… ▽ More

    Submitted 19 April, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

  29. arXiv:2310.08908  [pdf, other

    cs.CL

    Human-in-the-loop Machine Translation with Large Language Model

    Authors: Xinyi Yang, Runzhe Zhan, Derek F. Wong, Junchao Wu, Lidia S. Chao

    Abstract: The large language model (LLM) has garnered significant attention due to its in-context learning mechanisms and emergent capabilities. The research community has conducted several pilot studies to apply LLMs to machine translation tasks and evaluate their performance from diverse perspectives. However, previous research has primarily focused on the LLM itself and has not explored human interventio… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: Accepted to MT Summit 2023

  30. arXiv:2305.19847  [pdf, other

    cs.CL cs.AI

    How Does Pretraining Improve Discourse-Aware Translation?

    Authors: Zhihong Huang, Longyue Wang, Siyou Liu, Derek F. Wong

    Abstract: Pretrained language models (PLMs) have produced substantial improvements in discourse-aware neural machine translation (NMT), for example, improved coherence in spoken language translation. However, the underlying reasons for their strong performance have not been well explained. To bridge this gap, we introduce a probing task to interpret the ability of PLMs to capture discourse relation knowledg… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: Interspeech 2023

  31. arXiv:2305.01951  [pdf, other

    cs.CL

    Can LMs Generalize to Future Data? An Empirical Analysis on Text Summarization

    Authors: Chi Seng Cheang, Hou Pong Chan, Derek F. Wong, Xuebo Liu, Zhaocong Li, Yanming Sun, Shudong Liu, Lidia S. Chao

    Abstract: Recent pre-trained language models (PLMs) achieve promising results in existing abstractive summarization datasets. However, existing summarization benchmarks overlap in time with the standard pre-training corpora and finetuning datasets. Hence, the strong performance of PLMs may rely on the parametric knowledge that is memorized during pre-training and fine-tuning. Moreover, the knowledge memoriz… ▽ More

    Submitted 2 November, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

    Comments: Accepted at EMNLP 2023

  32. arXiv:2305.01181  [pdf, other

    cs.CL

    A Paradigm Shift: The Future of Machine Translation Lies with Large Language Models

    Authors: Chenyang Lyu, Zefeng Du, Jitao Xu, Yitao Duan, Minghao Wu, Teresa Lynn, Alham Fikri Aji, Derek F. Wong, Siyou Liu, Longyue Wang

    Abstract: Machine Translation (MT) has greatly advanced over the years due to the developments in deep neural networks. However, the emergence of Large Language Models (LLMs) like GPT-4 and ChatGPT is introducing a new phase in the MT domain. In this context, we believe that the future of MT is intricately tied to the capabilities of LLMs. These models not only offer vast linguistic understandings but also… ▽ More

    Submitted 1 April, 2024; v1 submitted 1 May, 2023; originally announced May 2023.

    Comments: Accepted to LREC-COLING 2024

  33. arXiv:2304.14937  [pdf, other

    cs.CV

    Contactless hand tremor amplitude measurement using smartphones: development and pilot evaluation

    Authors: James Bungay, Osasenaga Emokpae, Samuel D. Relton, Jane Alty, Stefan Williams, Hui Fang, David C. Wong

    Abstract: Background: Physiological tremor is defined as an involuntary and rhythmic shaking. Tremor of the hand is a key symptom of multiple neurological diseases, and its frequency and amplitude differs according to both disease type and disease progression. In routine clinical practice, tremor frequency and amplitude are assessed by expert rating using a 0 to 4 integer scale. Such ratings are subjective… ▽ More

    Submitted 28 April, 2023; originally announced April 2023.

    Comments: Accepted to IEEE EMBC 2023, Sydney (pre-refereed version)

  34. arXiv:2304.07943  [pdf

    cs.CR

    Detecting Domain-Generation Algorithm (DGA) Based Fully-Qualified Domain Names (FQDNs) with Shannon Entropy

    Authors: Adam Dorian Wong

    Abstract: Domain Name System (DNS) is the backbone of the Internet. However, threat actors have abused the antiquated protocol to facilitate command-and-control (C2) actions, to tunnel, or to exfiltrate sensitive information in novel ways. The FireEye breach and Solarwinds intrusions of late 2020 demonstrated the sophistication of hacker groups. Researchers were eager to reverse-engineer the malware and eag… ▽ More

    Submitted 16 April, 2023; originally announced April 2023.

  35. arXiv:2304.01746  [pdf, other

    cs.CL

    Is ChatGPT a Highly Fluent Grammatical Error Correction System? A Comprehensive Evaluation

    Authors: Tao Fang, Shu Yang, Kaixin Lan, Derek F. Wong, Jinpeng Hu, Lidia S. Chao, Yue Zhang

    Abstract: ChatGPT, a large-scale language model based on the advanced GPT-3.5 architecture, has shown remarkable potential in various Natural Language Processing (NLP) tasks. However, there is currently a dearth of comprehensive study exploring its potential in the area of Grammatical Error Correction (GEC). To showcase its capabilities in GEC, we design zero-shot chain-of-thought (CoT) and few-shot CoT set… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

  36. arXiv:2303.12723  [pdf, other

    cs.CV

    AdaOPC: A Self-Adaptive Mask Optimization Framework For Real Design Patterns

    Authors: Wenqian Zhao, Xufeng Yao, Ziyang Yu, Guojin Chen, Yuzhe Ma, Bei Yu, Martin D. F. Wong

    Abstract: Optical proximity correction (OPC) is a widely-used resolution enhancement technique (RET) for printability optimization. Recently, rigorous numerical optimization and fast machine learning are the research focus of OPC in both academia and industry, each of which complements the other in terms of robustness or efficiency. We inspect the pattern distribution on a design layer and find that differe… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

  37. A High-Performance Accelerator for Super-Resolution Processing on Embedded GPU

    Authors: Wenqian Zhao, Qi Sun, Yang Bai, Wenbo Li, Haisheng Zheng, Bei Yu, Martin D. F. Wong

    Abstract: Recent years have witnessed impressive progress in super-resolution (SR) processing. However, its real-time inference requirement sets a challenge not only for the model design but also for the on-chip implementation. In this paper, we implement a full-stack SR acceleration framework on embedded GPU devices. The special dictionary learning algorithm used in SR models was analyzed in detail and acc… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

  38. arXiv:2303.08435  [pdf, other

    cs.CV cs.LG eess.IV

    Physics-Informed Optical Kernel Regression Using Complex-valued Neural Fields

    Authors: Guojin Chen, Zehua Pei, Haoyu Yang, Yuzhe Ma, Bei Yu, Martin D. F. Wong

    Abstract: Lithography is fundamental to integrated circuit fabrication, necessitating large computation overhead. The advancement of machine learning (ML)-based lithography models alleviates the trade-offs between manufacturing process expense and capability. However, all previous methods regard the lithography system as an image-to-image black box mapping, utilizing network parameters to learn by rote mapp… ▽ More

    Submitted 9 April, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: Accepted by DAC23

  39. arXiv:2302.08975  [pdf, other

    cs.CL

    Towards Fine-Grained Information: Identifying the Type and Location of Translation Errors

    Authors: Keqin Bao, Yu Wan, Dayiheng Liu, Baosong Yang, Wenqiang Lei, Xiangnan He, Derek F. Wong, Jun Xie

    Abstract: Fine-grained information on translation errors is helpful for the translation evaluation community. Existing approaches can not synchronously consider error position and type, failing to integrate the error information of both. In this paper, we propose Fine-Grained Translation Error Detection (FG-TED) task, aiming at identifying both the position and the type of translation errors on given source… ▽ More

    Submitted 17 February, 2023; originally announced February 2023.

  40. arXiv:2301.04320  [pdf, other

    cs.SD cs.LG eess.AS

    Rethinking complex-valued deep neural networks for monaural speech enhancement

    Authors: Haibin Wu, Ke Tan, Buye Xu, Anurag Kumar, Daniel Wong

    Abstract: Despite multiple efforts made towards adopting complex-valued deep neural networks (DNNs), it remains an open question whether complex-valued DNNs are generally more effective than real-valued DNNs for monaural speech enhancement. This work is devoted to presenting a critical assessment by systematically examining complex-valued DNNs against their real-valued counterparts. Specifically, we investi… ▽ More

    Submitted 11 January, 2023; originally announced January 2023.

  41. arXiv:2212.10179  [pdf, other

    cs.CL

    Toward Human-Like Evaluation for Natural Language Generation with Error Analysis

    Authors: Qingyu Lu, Liang Ding, Liping Xie, Kanjian Zhang, Derek F. Wong, Dacheng Tao

    Abstract: The state-of-the-art language model-based automatic metrics, e.g. BARTScore, benefiting from large-scale contextualized pre-training, have been successfully used in a wide range of natural language generation (NLG) tasks, including machine translation, text summarization, and data-to-text. Recent studies show that considering both major errors (e.g. mistranslated tokens) and minor errors (e.g. imp… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

    Comments: work in progress

  42. arXiv:2212.04262  [pdf, other

    cs.CL cs.AI cs.LG

    ConsistTL: Modeling Consistency in Transfer Learning for Low-Resource Neural Machine Translation

    Authors: Zhaocong Li, Xuebo Liu, Derek F. Wong, Lidia S. Chao, Min Zhang

    Abstract: Transfer learning is a simple and powerful method that can be used to boost model performance of low-resource neural machine translation (NMT). Existing transfer learning methods for NMT are static, which simply transfer knowledge from a parent model to a child model once via parameter initialization. In this paper, we propose a novel transfer learning method for NMT, namely ConsistTL, which can c… ▽ More

    Submitted 8 December, 2022; originally announced December 2022.

    Comments: Accepted to EMNLP 2022

  43. arXiv:2211.08624  [pdf, ps, other

    cs.SD cs.LG eess.AS

    Leveraging Heteroscedastic Uncertainty in Learning Complex Spectral Mapping for Single-channel Speech Enhancement

    Authors: Kuan-Lin Chen, Daniel D. E. Wong, Ke Tan, Buye Xu, Anurag Kumar, Vamsi Krishna Ithapu

    Abstract: Most speech enhancement (SE) models learn a point estimate and do not make use of uncertainty estimation in the learning process. In this paper, we show that modeling heteroscedastic uncertainty by minimizing a multivariate Gaussian negative log-likelihood (NLL) improves SE performance at no extra cost. During training, our approach augments a model learning complex spectral mapping with a tempora… ▽ More

    Submitted 8 March, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

    Comments: 5 pages. Accepted at ICASSP 2023

  44. arXiv:2210.10049  [pdf, other

    cs.CL

    Alibaba-Translate China's Submission for WMT 2022 Quality Estimation Shared Task

    Authors: Keqin Bao, Yu Wan, Dayiheng Liu, Baosong Yang, Wenqiang Lei, Xiangnan He, Derek F. Wong, Jun Xie

    Abstract: In this paper, we present our submission to the sentence-level MQM benchmark at Quality Estimation Shared Task, named UniTE (Unified Translation Evaluation). Specifically, our systems employ the framework of UniTE, which combined three types of input formats during training with a pre-trained language model. First, we apply the pseudo-labeled data examples for the continuously pre-training phase.… ▽ More

    Submitted 17 February, 2023; v1 submitted 18 October, 2022; originally announced October 2022.

    Comments: WMT 2022 QE Shared Task. arXiv admin note: text overlap with arXiv:2210.09683

  45. arXiv:2210.09683  [pdf, other

    cs.CL

    Alibaba-Translate China's Submission for WMT 2022 Metrics Shared Task

    Authors: Yu Wan, Keqin Bao, Dayiheng Liu, Baosong Yang, Derek F. Wong, Lidia S. Chao, Wenqiang Lei, Jun Xie

    Abstract: In this report, we present our submission to the WMT 2022 Metrics Shared Task. We build our system based on the core idea of UNITE (Unified Translation Evaluation), which unifies source-only, reference-only, and source-reference-combined evaluation scenarios into one single model. Specifically, during the model pre-training phase, we first apply the pseudo-labeled data examples to continuously pre… ▽ More

    Submitted 17 February, 2023; v1 submitted 18 October, 2022; originally announced October 2022.

    Comments: WMT 2022 Metrics Shared Task

  46. BLADERUNNER: Rapid Countermeasure for Synthetic (AI-Generated) StyleGAN Faces

    Authors: Adam Dorian Wong

    Abstract: StyleGAN is the open-sourced TensorFlow implementation made by NVIDIA. It has revolutionized high quality facial image generation. However, this democratization of Artificial Intelligence / Machine Learning (AI/ML) algorithms has enabled hostile threat actors to establish cyber personas or sock-puppet accounts in social media platforms. These ultra-realistic synthetic faces. This report surveys th… ▽ More

    Submitted 28 October, 2022; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: 29 pages

  47. arXiv:2209.01766  [pdf, ps, other

    cs.SE cs.PL

    Exploring the Verifiability of Code Generated by GitHub Copilot

    Authors: Dakota Wong, Austin Kothig, Patrick Lam

    Abstract: GitHub's Copilot generates code quickly. We investigate whether it generates good code. Our approach is to identify a set of problems, ask Copilot to generate solutions, and attempt to formally verify these solutions with Dafny. Our formal verification is with respect to hand-crafted specifications. We have carried out this process on 6 problems and succeeded in formally verifying 4 of the created… ▽ More

    Submitted 27 October, 2022; v1 submitted 5 September, 2022; originally announced September 2022.

    Comments: HATRA workshop at SPLASH 2022

  48. arXiv:2207.07572  [pdf, other

    cs.LG cs.HC eess.SP

    Outlier detection of vital sign trajectories from COVID-19 patients

    Authors: Sara Summerton, Ann Tivey, Rohan Shotton, Gavin Brown, Oliver C. Redfern, Rachel Oakley, John Radford, David C. Wong

    Abstract: In this work, we present a novel trajectory comparison algorithm to identify abnormal vital sign trends, with the aim of improving recognition of deteriorating health. There is growing interest in continuous wearable vital sign sensors for monitoring patients remotely at home. These monitors are usually coupled to an alerting system, which is triggered when vital sign measurements fall outside a… ▽ More

    Submitted 20 April, 2023; v1 submitted 15 July, 2022; originally announced July 2022.

    Comments: 4 pages, 4 figures, 1 table. Accepted to EMBC 2023, to be indexed in IEEE Xplore and PubMed Medline

  49. arXiv:2207.07195  [pdf

    cs.LG cs.MA eess.SY

    COOR-PLT: A hierarchical control model for coordinating adaptive platoons of connected and autonomous vehicles at signal-free intersections based on deep reinforcement learning

    Authors: Duowei Li, Jianping Wu, Feng Zhu, Tianyi Chen, Yiik Diew Wong

    Abstract: Platooning and coordination are two implementation strategies that are frequently proposed for traffic control of connected and autonomous vehicles (CAVs) at signal-free intersections instead of using conventional traffic signals. However, few studies have attempted to integrate both strategies to better facilitate the CAV control at signal-free intersections. To this end, this study proposes a hi… ▽ More

    Submitted 30 June, 2022; originally announced July 2022.

    Comments: This paper has been submitted to Transportation Research Part C: Emerging Technologies and is currently under review

    Journal ref: Transportation Research Part C: Emerging Technologies 146 (2023): 103933

  50. arXiv:2207.03522  [pdf, other

    cs.LG cs.NE cs.SI physics.soc-ph stat.ML

    TF-GNN: Graph Neural Networks in TensorFlow

    Authors: Oleksandr Ferludin, Arno Eigenwillig, Martin Blais, Dustin Zelle, Jan Pfeifer, Alvaro Sanchez-Gonzalez, Wai Lok Sibon Li, Sami Abu-El-Haija, Peter Battaglia, Neslihan Bulut, Jonathan Halcrow, Filipe Miguel Gonçalves de Almeida, Pedro Gonnet, Liangze Jiang, Parth Kothari, Silvio Lattanzi, André Linhares, Brandon Mayer, Vahab Mirrokni, John Palowitch, Mihir Paradkar, Jennifer She, Anton Tsitsulin, Kevin Villela, Lisa Wang , et al. (2 additional authors not shown)

    Abstract: TensorFlow-GNN (TF-GNN) is a scalable library for Graph Neural Networks in TensorFlow. It is designed from the bottom up to support the kinds of rich heterogeneous graph data that occurs in today's information ecosystems. In addition to enabling machine learning researchers and advanced developers, TF-GNN offers low-code solutions to empower the broader developer community in graph learning. Many… ▽ More

    Submitted 23 July, 2023; v1 submitted 7 July, 2022; originally announced July 2022.