Skip to main content

Showing 1–50 of 142 results for author: Lin, P

  1. arXiv:2407.08990  [pdf, other

    cs.AR cs.AI cs.ET cs.NE

    Dynamic neural network with memristive CIM and CAM for 2D and 3D vision

    Authors: Yue Zhang, Woyu Zhang, Shaocong Wang, Ning Lin, Yifei Yu, Yangu He, Bo Wang, Hao Jiang, Peng Lin, Xiaoxin Xu, Xiaojuan Qi, Zhongrui Wang, Xumeng Zhang, Dashan Shang, Qi Liu, Kwang-Ting Cheng, Ming Liu

    Abstract: The brain is dynamic, associative and efficient. It reconfigures by associating the inputs with past experiences, with fused memory and processing. In contrast, AI models are static, unable to associate inputs with past experiences, and run on digital computers with physically separated memory and processing. We propose a hardware-software co-design, a semantic memory-based dynamic neural network… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: In press

  2. arXiv:2407.00436  [pdf, other

    cs.CL

    A Recipe of Parallel Corpora Exploitation for Multilingual Large Language Models

    Authors: Peiqin Lin, André F. T. Martins, Hinrich Schütze

    Abstract: Recent studies have highlighted the potential of exploiting parallel corpora to enhance multilingual large language models, improving performance in both bilingual tasks, e.g., machine translation, and general-purpose tasks, e.g., text classification. Building upon these findings, our comprehensive study aims to identify the most effective strategies for leveraging parallel corpora. We investigate… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  3. arXiv:2406.12041  [pdf

    cs.CR

    Outer Space Cyberattacks: Generating Novel Scenarios to Avoid Surprise

    Authors: Patrick Lin, Keith Abney, Bruce DeBruhl, Kira Abercromby, Henry Danielson, Ryan Jenkins

    Abstract: Though general awareness around it may be low, space cyberattacks are an increasingly urgent problem given the vital role that space systems play in the modern world. Open-source or public discussions about it typically revolve around only a couple generic scenarios, namely satellite hacking and signals jamming or spoofing. But there are so many more possibilities. The report offers a scenario-p… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: A 95-page report, funded by the US National Science Foundation, award no. 2208458

  4. arXiv:2406.00761  [pdf, other

    cs.LG cs.AI

    Shared-unique Features and Task-aware Prioritized Sampling on Multi-task Reinforcement Learning

    Authors: Po-Shao Lin, Jia-Fong Yeh, Yi-Ting Chen, Winston H. Hsu

    Abstract: We observe that current state-of-the-art (SOTA) methods suffer from the performance imbalance issue when performing multi-task reinforcement learning (MTRL) tasks. While these methods may achieve impressive performance on average, they perform extremely poorly on a few tasks. To address this, we propose a new and effective method called STARS, which consists of two novel strategies: a shared-uniqu… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: The first two authors contribute equally

  5. arXiv:2405.11459  [pdf, other

    eess.SP cs.CL q-bio.NC

    Du-IN: Discrete units-guided mask modeling for decoding speech from Intracranial Neural signals

    Authors: Hui Zheng, Hai-Teng Wang, Wei-Bang Jiang, Zhong-Tao Chen, Li He, Pei-Yang Lin, Peng-Hu Wei, Guo-Guang Zhao, Yun-Zhe Liu

    Abstract: Invasive brain-computer interfaces have garnered significant attention due to their high performance. The current intracranial stereoElectroEncephaloGraphy (sEEG) foundation models typically build univariate representations based on a single channel. Some of them further use Transformer to model the relationship among channels. However, due to the locality and specificity of brain computation, the… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

  6. arXiv:2405.05409  [pdf, other

    cs.LG

    Initialization is Critical to Whether Transformers Fit Composite Functions by Inference or Memorizing

    Authors: Zhongwang Zhang, Pengxiao Lin, Zhiwei Wang, Yaoyu Zhang, Zhi-Qin John Xu

    Abstract: Transformers have shown impressive capabilities across various tasks, but their performance on compositional problems remains a topic of debate. In this work, we investigate the mechanisms of how transformers behave on unseen compositional tasks. We discover that the parameter initialization scale plays a critical role in determining whether the model learns inferential solutions, which capture th… ▽ More

    Submitted 24 May, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

  7. arXiv:2405.05116  [pdf, other

    cs.CL

    XAMPLER: Learning to Retrieve Cross-Lingual In-Context Examples

    Authors: Peiqin Lin, André F. T. Martins, Hinrich Schütze

    Abstract: Recent studies indicate that leveraging off-the-shelf or fine-tuned retrievers, capable of retrieving relevant in-context examples tailored to the input query, enhances few-shot in-context learning of English. However, adapting these methods to other languages, especially low-resource ones, poses challenges due to the scarcity of cross-lingual retrievers and annotated data. Thus, we introduce XAMP… ▽ More

    Submitted 29 June, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

  8. arXiv:2405.04503  [pdf, other

    cs.RO

    Physics-data hybrid dynamic model of a multi-axis manipulator for sensorless dexterous manipulation and high-performance motion planning

    Authors: Wu-Te Yang, Jyun-Ming Liao, Pei-Chun Lin

    Abstract: We report on the development of an implementable physics-data hybrid dynamic model for an articulated manipulator to plan and operate in various scenarios. Meanwhile, the physics-based and data-driven dynamic models are studied in this research to select the best model for planning. The physics-based model is constructed using the Lagrangian method, and the loss terms include inertia loss, viscous… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 26 pages, 16 figures

  9. arXiv:2404.18264  [pdf, other

    cs.CL cs.AI

    Modeling Orthographic Variation Improves NLP Performance for Nigerian Pidgin

    Authors: Pin-Jie Lin, Merel Scholman, Muhammed Saeed, Vera Demberg

    Abstract: Nigerian Pidgin is an English-derived contact language and is traditionally an oral language, spoken by approximately 100 million people. No orthographic standard has yet been adopted, and thus the few available Pidgin datasets that exist are characterised by noise in the form of orthographic variations. This contributes to under-performance of models in critical NLP tasks. The current work is the… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: Accepted to LREC-COLING 2024 Main Conference

  10. arXiv:2404.00270  [pdf, other

    cs.DC cs.DS

    Engineering A Workload-balanced Push-Relabel Algorithm for Massive Graphs on GPUs

    Authors: Chou-Ying Hsieh, Po-Chieh Lin, Sy-Yen Kuo

    Abstract: The push-relabel algorithm is an efficient algorithm that solves the maximum flow/ minimum cut problems of its affinity to parallelization. As the size of graphs grows exponentially, researchers have used Graphics Processing Units (GPUs) to accelerate the computation of the push-relabel algorithm further. However, prior works need to handle the significant memory consumption to represent a massive… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

  11. arXiv:2403.13251  [pdf, ps, other

    cs.RO

    A Rule-Compliance Path Planner for Lane-Merge Scenarios Based on Responsibility-Sensitive Safety

    Authors: Pengfei Lin, Ehsan Javanmardi, Yuze Jiang, Manabu Tsukada

    Abstract: Lane merging is one of the critical tasks for self-driving cars, and how to perform lane-merge maneuvers effectively and safely has become one of the important standards in measuring the capability of autonomous driving systems. However, due to the ambiguity in driving intentions and right-of-way issues, the lane merging process in autonomous driving remains deficient in terms of maintaining or ce… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Submitted to IEEE IROS 2024

  12. PPM : A Pre-trained Plug-in Model for Click-through Rate Prediction

    Authors: Yuanbo Gao, Peng Lin, Dongyue Wang, Feng Mei, Xiwei Zhao, Sulong Xu, Jinghe Hu

    Abstract: Click-through rate (CTR) prediction is a core task in recommender systems. Existing methods (IDRec for short) rely on unique identities to represent distinct users and items that have prevailed for decades. On one hand, IDRec often faces significant performance degradation on cold-start problem; on the other hand, IDRec cannot use longer training data due to constraints imposed by iteration effici… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: Accepted by ACM Web Conference 2024 (WWW'24)

    Report number: ip6417

  13. arXiv:2402.17179  [pdf, other

    cs.LG q-bio.BM

    Dual-Space Optimization: Improved Molecule Sequence Design by Latent Prompt Transformer

    Authors: Deqian Kong, Yuhao Huang, Jianwen Xie, Edouardo Honig, Ming Xu, Shuanghong Xue, Pei Lin, Sanping Zhou, Sheng Zhong, Nanning Zheng, Ying Nian Wu

    Abstract: Designing molecules with desirable properties, such as drug-likeliness and high binding affinities towards protein targets, is a challenging problem. In this paper, we propose the Dual-Space Optimization (DSO) method that integrates latent space sampling and data space selection to solve this problem. DSO iteratively updates a latent space generative model and a synthetic dataset in an optimizatio… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  14. arXiv:2402.15075  [pdf

    cs.AI

    Stacking Factorizing Partitioned Expressions in Hybrid Bayesian Network Models

    Authors: Peng Lin, Martin Neil, Norman Fenton

    Abstract: Hybrid Bayesian networks (HBN) contain complex conditional probabilistic distributions (CPD) specified as partitioned expressions over discrete and continuous variables. The size of these CPDs grows exponentially with the number of parent nodes when using discrete inference, resulting in significant inefficiency. Normally, an effective way to reduce the CPD size is to use a binary factorization (B… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  15. arXiv:2401.14265  [pdf, other

    cs.IT

    Worst-Case Per-User Error Bound for Asynchronous Unsourced Multiple Access

    Authors: Jyun-Sian Wu, Pin-Hsun Lin, Marcel A. Mross, Eduard A. Jorswieck

    Abstract: This work considers an asynchronous $\textsf{K}_\text{a}$-active-user unsourced multiple access channel (AUMAC) with the worst-case asynchronicity. The transmitted messages must be decoded within $n$ channel uses, while some codewords are not completely received due to asynchronicities. We consider a constraint of the largest allowed delay of the transmission. The AUMAC lacks the permutation-invar… ▽ More

    Submitted 30 January, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

  16. arXiv:2401.13303  [pdf, other

    cs.CL

    MaLA-500: Massive Language Adaptation of Large Language Models

    Authors: Peiqin Lin, Shaoxiong Ji, Jörg Tiedemann, André F. T. Martins, Hinrich Schütze

    Abstract: Large language models (LLMs) have advanced the state of the art in natural language processing. However, their predominant design for English or a limited set of languages creates a substantial gap in their effectiveness for low-resource languages. To bridge this gap, we introduce MaLA-500, a novel large language model designed to cover an extensive range of 534 languages. To train MaLA-500, we em… ▽ More

    Submitted 3 April, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

  17. arXiv:2401.11592  [pdf, other

    cs.LG cs.CR cs.DC

    Differentially-Private Hierarchical Federated Learning

    Authors: Frank Po-Chen Lin, Christopher Brinton

    Abstract: While federated learning (FL) eliminates the transmission of raw data over a network, it is still vulnerable to privacy breaches from the communicated model parameters. In this work, we propose \underline{H}ierarchical \underline{F}ederated Learning with \underline{H}ierarchical \underline{D}ifferential \underline{P}rivacy ({\tt H$^2$FDP}), a DP-enhanced FL methodology for jointly optimizing priva… ▽ More

    Submitted 15 May, 2024; v1 submitted 21 January, 2024; originally announced January 2024.

  18. arXiv:2312.17582  [pdf, other

    cs.NE cs.AR

    Darwin3: A large-scale neuromorphic chip with a Novel ISA and On-Chip Learning

    Authors: De Ma, Xiaofei Jin, Shichun Sun, Yitao Li, Xundong Wu, Youneng Hu, Fangchao Yang, Huajin Tang, Xiaolei Zhu, Peng Lin, Gang Pan

    Abstract: Spiking Neural Networks (SNNs) are gaining increasing attention for their biological plausibility and potential for improved computational efficiency. To match the high spatial-temporal dynamics in SNNs, neuromorphic chips are highly desired to execute SNNs in hardware-based neuron and synapse circuits directly. This paper presents a large-scale neuromorphic chip named Darwin3 with a novel instruc… ▽ More

    Submitted 29 December, 2023; originally announced December 2023.

  19. arXiv:2312.09262  [pdf, other

    cs.LG cs.AR

    Random resistive memory-based deep extreme point learning machine for unified visual processing

    Authors: Shaocong Wang, Yizhao Gao, Yi Li, Woyu Zhang, Yifei Yu, Bo Wang, Ning Lin, Hegan Chen, Yue Zhang, Yang Jiang, Dingchen Wang, Jia Chen, Peng Dai, Hao Jiang, Peng Lin, Xumeng Zhang, Xiaojuan Qi, Xiaoxin Xu, Hayden So, Zhongrui Wang, Dashan Shang, Qi Liu, Kwang-Ting Cheng, Ming Liu

    Abstract: Visual sensors, including 3D LiDAR, neuromorphic DVS sensors, and conventional frame cameras, are increasingly integrated into edge-side intelligent machines. Realizing intensive multi-sensory data analysis directly on edge intelligent machines is crucial for numerous emerging edge applications, such as augmented and virtual reality and unmanned aerial vehicles, which necessitates unified data rep… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

  20. arXiv:2312.07948  [pdf, other

    cs.NI cs.CR

    Zero-Knowledge Proof of Traffic: A Deterministic and Privacy-Preserving Cross Verification Mechanism for Cooperative Perception Data

    Authors: Ye Tao, Ehsan Javanmardi, Pengfei Lin, Jin Nakazato, Yuze Jiang, Manabu Tsukada, Hiroshi Esaki

    Abstract: Cooperative perception is crucial for connected automated vehicles in intelligent transportation systems (ITSs); however, ensuring the authenticity of perception data remains a challenge as the vehicles cannot verify events that they do not witness independently. Various studies have been conducted on establishing the authenticity of data, such as trust-based statistical methods and plausibility-b… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

  21. arXiv:2312.04867  [pdf, other

    cs.CV

    HandDiffuse: Generative Controllers for Two-Hand Interactions via Diffusion Models

    Authors: Pei Lin, Sihang Xu, Hongdi Yang, Yiran Liu, Xin Chen, Jingya Wang, Jingyi Yu, Lan Xu

    Abstract: Existing hands datasets are largely short-range and the interaction is weak due to the self-occlusion and self-similarity of hands, which can not yet fit the need for interacting hands motion generation. To rescue the data scarcity, we propose HandDiffuse12.5M, a novel dataset that consists of temporal sequences with strong two-hand interactions. HandDiffuse12.5M has the largest scale and richest… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

  22. arXiv:2311.12833  [pdf, other

    cs.DC cs.AI cs.CL

    HPC-GPT: Integrating Large Language Model for High-Performance Computing

    Authors: Xianzhong Ding, Le Chen, Murali Emani, Chunhua Liao, Pei-Hung Lin, Tristan Vanderbruggen, Zhen Xie, Alberto E. Cerpa, Wan Du

    Abstract: Large Language Models (LLMs), including the LLaMA model, have exhibited their efficacy across various general-domain natural language processing (NLP) tasks. However, their performance in high-performance computing (HPC) domain tasks has been less than optimal due to the specialized expertise required to interpret the model responses. In response to this challenge, we propose HPC-GPT, a novel LLaM… ▽ More

    Submitted 2 October, 2023; originally announced November 2023.

    Comments: 9 pages

  23. arXiv:2311.09711  [pdf, other

    cs.IT

    Second-order Rate Analysis of a Two-user Gaussian Interference Channel with Heterogeneous Blocklength Constraints

    Authors: Kailun Dong, Pin-Hsun Lin, Marcel Mross, Eduard A. Jorswieck

    Abstract: We consider a two-user Gaussian interference channel with heterogeneous blocklength constraints (HB-GIC), strong interference, and two private messages. We propose to apply the successive interference cancellation with early decoding, i.e., decoding a message with a number of received symbols less than the blocklength at the receiver. We determine the necessary number of received symbols to achiev… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: 4 figures

  24. arXiv:2311.09122  [pdf, other

    cs.CL

    Universal NER: A Gold-Standard Multilingual Named Entity Recognition Benchmark

    Authors: Stephen Mayhew, Terra Blevins, Shuheng Liu, Marek Šuppa, Hila Gonen, Joseph Marvin Imperial, Börje F. Karlsson, Peiqin Lin, Nikola Ljubešić, LJ Miranda, Barbara Plank, Arij Riabi, Yuval Pinter

    Abstract: We introduce Universal NER (UNER), an open, community-driven project to develop gold-standard NER benchmarks in many languages. The overarching goal of UNER is to provide high-quality, cross-lingually consistent annotations to facilitate and standardize multilingual NER research. UNER v1 contains 18 datasets annotated with named entities in a cross-lingual consistent schema across 12 diverse langu… ▽ More

    Submitted 29 June, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: NAACL 2024 Camera-ready

  25. arXiv:2311.08849  [pdf, other

    cs.CL

    OFA: A Framework of Initializing Unseen Subword Embeddings for Efficient Large-scale Multilingual Continued Pretraining

    Authors: Yihong Liu, Peiqin Lin, Mingyang Wang, Hinrich Schütze

    Abstract: Instead of pretraining multilingual language models from scratch, a more efficient method is to adapt existing pretrained language models (PLMs) to new languages via vocabulary extension and continued pretraining. However, this method usually randomly initializes the embeddings of new subwords and introduces substantially more embedding parameters to the model, thus weakening the efficiency. To ad… ▽ More

    Submitted 25 March, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: NAACL 2024 Findings

  26. arXiv:2311.07164  [pdf, other

    cs.ET cs.AI cs.AR

    Pruning random resistive memory for optimizing analogue AI

    Authors: Yi Li, Songqi Wang, Yaping Zhao, Shaocong Wang, Woyu Zhang, Yangu He, Ning Lin, Binbin Cui, Xi Chen, Shiming Zhang, Hao Jiang, Peng Lin, Xumeng Zhang, Xiaojuan Qi, Zhongrui Wang, Xiaoxin Xu, Dashan Shang, Qi Liu, Kwang-Ting Cheng, Ming Liu

    Abstract: The rapid advancement of artificial intelligence (AI) has been marked by the large language models exhibiting human-like intelligence. However, these models also present unprecedented challenges to energy consumption and environmental sustainability. One promising solution is to revisit analogue computing, a technique that predates digital computing and exploits emerging analogue electronic device… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  27. arXiv:2311.01288  [pdf, other

    cs.DC physics.plasm-ph

    Unraveling Diffusion in Fusion Plasma: A Case Study of In Situ Processing and Particle Sorting

    Authors: Junmin Gu, Paul Lin, Kesheng Wu, Seung-Hoe Ku, C. S. Chang, R. Michael Churchill, Jong Choi, Norbert Podhorszki, Scott Klasky

    Abstract: This work starts an in situ processing capability to study a certain diffusion process in magnetic confinement fusion. This diffusion process involves plasma particles that are likely to escape confinement. Such particles carry a significant amount of energy from the burning plasma inside the tokamak to the diverter and damaging the diverter plate. This study requires in situ processing because of… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

  28. MAAIG: Motion Analysis And Instruction Generation

    Authors: Wei-Hsin Yeh, Pei Hsin Lin, Yu-An Su, Wen Hsiang Cheng, Lun-Wei Ku

    Abstract: Many people engage in self-directed sports training at home but lack the real-time guidance of professional coaches, making them susceptible to injuries or the development of incorrect habits. In this paper, we propose a novel application framework called MAAIG(Motion Analysis And Instruction Generation). It can generate embedding vectors for each frame based on user-provided sports action videos.… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: Accepted to the ACM Multimedia Asia 2023 Workshop on Intelligent Sports Technologies (WIST)

    ACM Class: I.2.10; I.2.7

  29. arXiv:2311.00897  [pdf, other

    cs.SD cs.CL eess.AS

    On The Open Prompt Challenge In Conditional Audio Generation

    Authors: Ernie Chang, Sidd Srinivasan, Mahi Luthra, Pin-Jie Lin, Varun Nagaraja, Forrest Iandola, Zechun Liu, Zhaoheng Ni, Changsheng Zhao, Yangyang Shi, Vikas Chandra

    Abstract: Text-to-audio generation (TTA) produces audio from a text description, learning from pairs of audio samples and hand-annotated text. However, commercializing audio generation is challenging as user-input prompts are often under-specified when compared to text descriptions used to train TTA models. In this work, we treat TTA models as a ``blackbox'' and address the user prompt challenge with two ke… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: 5 pages, 3 figures, 4 tables

  30. arXiv:2311.00895  [pdf, other

    cs.SD cs.CL eess.AS

    In-Context Prompt Editing For Conditional Audio Generation

    Authors: Ernie Chang, Pin-Jie Lin, Yang Li, Sidd Srinivasan, Gael Le Lan, David Kant, Yangyang Shi, Forrest Iandola, Vikas Chandra

    Abstract: Distributional shift is a central challenge in the deployment of machine learning models as they can be ill-equipped for real-world data. This is particularly evident in text-to-audio generation where the encoded representations are easily undermined by unseen prompts, which leads to the degradation of generated audio -- the limited set of the text-audio pairs remains inadequate for conditional au… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: 5 pages, 3 figures, 2 tables

  31. arXiv:2309.16457  [pdf, other

    cs.LG eess.SP q-bio.NC

    SI-SD: Sleep Interpreter through awake-guided cross-subject Semantic Decoding

    Authors: Hui Zheng, Zhong-Tao Chen, Hai-Teng Wang, Jian-Yang Zhou, Lin Zheng, Pei-Yang Lin, Yun-Zhe Liu

    Abstract: Understanding semantic content from brain activity during sleep represents a major goal in neuroscience. While studies in rodents have shown spontaneous neural reactivation of memories during sleep, capturing the semantic content of human sleep poses a significant challenge due to the absence of well-annotated sleep datasets and the substantial differences in neural patterns between wakefulness an… ▽ More

    Submitted 19 May, 2024; v1 submitted 28 September, 2023; originally announced September 2023.

  32. arXiv:2309.04475  [pdf, other

    cond-mat.mtrl-sci cs.LG

    Crystal Structure Prediction by Joint Equivariant Diffusion

    Authors: Rui Jiao, Wenbing Huang, Peijia Lin, Jiaqi Han, Pin Chen, Yutong Lu, Yang Liu

    Abstract: Crystal Structure Prediction (CSP) is crucial in various scientific disciplines. While CSP can be addressed by employing currently-prevailing generative models (e.g. diffusion models), this task encounters unique challenges owing to the symmetric geometry of crystal structures -- the invariance of translation, rotation, and periodicity. To incorporate the above symmetries, this paper proposes Diff… ▽ More

    Submitted 6 March, 2024; v1 submitted 30 July, 2023; originally announced September 2023.

    Comments: NeurIPS 2023

  33. arXiv:2308.14763  [pdf, other

    eess.AS cs.CL cs.SD

    VoiceBank-2023: A Multi-Speaker Mandarin Speech Corpus for Constructing Personalized TTS Systems for the Speech Impaired

    Authors: Jia-Jyu Su, Pang-Chen Liao, Yen-Ting Lin, Wu-Hao Li, Guan-Ting Liou, Cheng-Che Kao, Wei-Cheng Chen, Jen-Chieh Chiang, Wen-Yang Chang, Pin-Han Lin, Chen-Yu Chiang

    Abstract: Services of personalized TTS systems for the Mandarin-speaking speech impaired are rarely mentioned. Taiwan started the VoiceBanking project in 2020, aiming to build a complete set of services to deliver personalized Mandarin TTS systems to amyotrophic lateral sclerosis patients. This paper reports the corpus design, corpus recording, data purging and correction for the corpus, and evaluations of… ▽ More

    Submitted 27 August, 2023; originally announced August 2023.

    Comments: submitted to 26th International Conference of the ORIENTAL-COCOSDA

  34. arXiv:2308.10049  [pdf, other

    cs.RO

    Clothoid Curve-based Emergency-Stopping Path Planning with Adaptive Potential Field for Autonomous Vehicles

    Authors: Pengfei Lin, Ehsan Javanmardi, Manabu Tsukada

    Abstract: The Potential Field (PF)-based path planning method is widely adopted for autonomous vehicles (AVs) due to its real-time efficiency and simplicity. PF often creates a rigid road boundary, and while this ensures that the ego vehicle consistently operates within the confines of the road, it also brings a lurking peril in emergency scenarios. If nearby vehicles suddenly switch lanes, the AV has to ve… ▽ More

    Submitted 19 August, 2023; originally announced August 2023.

    Comments: 14 pages, 20 figures, journal paper in submission

  35. arXiv:2308.08649  [pdf, other

    cs.NE cs.AI

    Towards Zero Memory Footprint Spiking Neural Network Training

    Authors: Bin Lei, Sheng Lin, Pei-Hung Lin, Chunhua Liao, Caiwen Ding

    Abstract: Biologically-inspired Spiking Neural Networks (SNNs), processing information using discrete-time events known as spikes rather than continuous values, have garnered significant attention due to their hardware-friendly and energy-efficient characteristics. However, the training of SNNs necessitates a considerably large memory footprint, given the additional storage requirements for spikes or events… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

  36. arXiv:2308.08614  [pdf, other

    cs.LG cs.AI cs.CL

    Boosting Logical Reasoning in Large Language Models through a New Framework: The Graph of Thought

    Authors: Bin Lei, pei-Hung Lin, Chunhua Liao, Caiwen Ding

    Abstract: Recent advancements in large-scale models, such as GPT-4, have showcased remarkable capabilities in addressing standard queries. However, when facing complex problems that require multi-step logical reasoning, their accuracy dramatically decreases. Current research has explored the realm of \textit{prompting engineering} to bolster the inferential capacities of these models. Our paper unveils a pi… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

  37. arXiv:2308.08473  [pdf, other

    cs.SE

    DataRaceBench V1.4.1 and DataRaceBench-ML V0.1: Benchmark Suites for Data Race Detection

    Authors: Le Chen, Wenhao Wu, Stephen F. Siegel, Pei-Hung Lin, Chunhua Liao

    Abstract: Data races pose a significant threat in multi-threaded parallel applications due to their negative impact on program correctness. DataRaceBench, an open-source benchmark suite, is specifically crafted to assess these data race detection tools in a systematic and measurable manner. Machine learning techniques have recently demonstrated considerable potential in high-performance computing (HPC) prog… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

  38. Data Race Detection Using Large Language Models

    Authors: Le Chen, Xianzhong Ding, Murali Emani, Tristan Vanderbruggen, Pei-hung Lin, Chuanhua Liao

    Abstract: Large language models (LLMs) are demonstrating significant promise as an alternate strategy to facilitate analyses and optimizations of high-performance computing programs, circumventing the need for resource-intensive manual tool creation. In this paper, we explore a novel LLM-based data race detection approach combining prompting engineering and fine-tuning techniques. We create a dedicated data… ▽ More

    Submitted 3 October, 2023; v1 submitted 14 August, 2023; originally announced August 2023.

  39. arXiv:2308.04156  [pdf, other

    cs.CV cs.MM eess.IV

    Towards Top-Down Stereo Image Quality Assessment via Stereo Attention

    Authors: Huilin Zhang, Sumei Li, Haoxiang Chang, Peiming Lin

    Abstract: Stereo image quality assessment (SIQA) plays a crucial role in evaluating and improving the visual experience of 3D content. Existing visual properties-based methods for SIQA have achieved promising performance. However, these approaches ignore the top-down philosophy, leading to a lack of a comprehensive grasp of the human visual system (HVS) and SIQA. This paper presents a novel Stereo AttenTion… ▽ More

    Submitted 14 November, 2023; v1 submitted 8 August, 2023; originally announced August 2023.

    Comments: 12 pages, 5 figures

  40. arXiv:2307.13294  [pdf, other

    cs.CV cs.AI

    Imperceptible Physical Attack against Face Recognition Systems via LED Illumination Modulation

    Authors: Junbin Fang, Canjian Jiang, You Jiang, Puxi Lin, Zhaojie Chen, Yujing Sun, Siu-Ming Yiu, Zoe L. Jiang

    Abstract: Although face recognition starts to play an important role in our daily life, we need to pay attention that data-driven face recognition vision systems are vulnerable to adversarial attacks. However, the current two categories of adversarial attacks, namely digital attacks and physical attacks both have drawbacks, with the former ones impractical and the latter one conspicuous, high-computational… ▽ More

    Submitted 7 August, 2023; v1 submitted 25 July, 2023; originally announced July 2023.

  41. arXiv:2307.07815  [pdf, other

    cs.CR

    HyperGo: Probability-based Directed Hybrid Fuzzing

    Authors: Peihong Lin, Pengfei Wang, Xu Zhou, Wei Xie, Kai Lu, Gen Zhang

    Abstract: Directed grey-box fuzzing (DGF) is a target-guided fuzzing intended for testing specific targets (e.g., the potential buggy code). Despite numerous techniques proposed to enhance directedness, the existing DGF techniques still face challenges, such as taking into account the difficulty of reaching different basic blocks when designing the fitness metric, and promoting the effectiveness of symbolic… ▽ More

    Submitted 15 July, 2023; originally announced July 2023.

    Comments: 16 pages

  42. arXiv:2307.07686  [pdf, other

    cs.SE cs.AI cs.LG

    Creating a Dataset for High-Performance Computing Code Translation using LLMs: A Bridge Between OpenMP Fortran and C++

    Authors: Bin Lei, Caiwen Ding, Le Chen, Pei-Hung Lin, Chunhua Liao

    Abstract: In this study, we present a novel dataset for training machine learning models translating between OpenMP Fortran and C++ code. To ensure reliability and applicability, the dataset is created from a range of representative open-source OpenMP benchmarks. It is also refined using a meticulous code similarity test. The effectiveness of our dataset is assessed using both quantitative (CodeBLEU) and qu… ▽ More

    Submitted 18 September, 2023; v1 submitted 14 July, 2023; originally announced July 2023.

    Comments: This paper was accepted by the HPEC 2023 conference and received the Outstanding Student Paper Award

  43. arXiv:2307.00771  [pdf, other

    cs.ET

    Resistive memory-based zero-shot liquid state machine for multimodal event data learning

    Authors: Ning Lin, Shaocong Wang, Yi Li, Bo Wang, Shuhui Shi, Yangu He, Woyu Zhang, Yifei Yu, Yue Zhang, Xiaojuan Qi, Xiaoming Chen, Hao Jiang, Xumeng Zhang, Peng Lin, Xiaoxin Xu, Qi Liu, Zhongrui Wang, Dashan Shang, Ming Liu

    Abstract: The human brain is a complex spiking neural network (SNN) that learns multimodal signals in a zero-shot manner by generalizing existing knowledge. Remarkably, the brain achieves this with minimal power consumption, using event-based signals that propagate within its structure. However, mimicking the human brain in neuromorphic hardware presents both hardware and software challenges. Hardware limit… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

  44. arXiv:2307.00382  [pdf, other

    cs.CL

    Low-Resource Cross-Lingual Adaptive Training for Nigerian Pidgin

    Authors: Pin-Jie Lin, Muhammed Saeed, Ernie Chang, Merel Scholman

    Abstract: Developing effective spoken language processing systems for low-resource languages poses several challenges due to the lack of parallel data and limited resources for fine-tuning models. In this work, we target on improving upon both text classification and translation of Nigerian Pidgin (Naija) by collecting a large-scale parallel English-Pidgin corpus and further propose a framework of cross-lin… ▽ More

    Submitted 1 July, 2023; originally announced July 2023.

    Comments: To appear in INTERSPEECH 2023

  45. arXiv:2307.00374  [pdf, other

    cs.CL

    Revisiting Sample Size Determination in Natural Language Understanding

    Authors: Ernie Chang, Muhammad Hassan Rashid, Pin-Jie Lin, Changsheng Zhao, Vera Demberg, Yangyang Shi, Vikas Chandra

    Abstract: Knowing exactly how many data points need to be labeled to achieve a certain model performance is a hugely beneficial step towards reducing the overall budgets for annotation. It pertains to both active learning and traditional data annotation, and is particularly beneficial for low resource scenarios. Nevertheless, it remains a largely under-explored area of research in NLP. We therefore explored… ▽ More

    Submitted 1 July, 2023; originally announced July 2023.

    Comments: Accepted to ACL 2023

  46. LM4HPC: Towards Effective Language Model Application in High-Performance Computing

    Authors: Le Chen, Pei-Hung Lin, Tristan Vanderbruggen, Chunhua Liao, Murali Emani, Bronis de Supinski

    Abstract: In recent years, language models (LMs), such as GPT-4, have been widely used in multiple domains, including natural language processing, visualization, and so on. However, applying them for analyzing and optimizing high-performance computing (HPC) software is still challenging due to the lack of HPC-specific support. In this paper, we design the LM4HPC framework to facilitate the research and deve… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

  47. arXiv:2306.10196  [pdf, other

    cs.CL cs.AI cs.FL cs.LG

    Structured Thoughts Automaton: First Formalized Execution Model for Auto-Regressive Language Models

    Authors: Tristan Vanderbruggen, Chunhua Liao, Peter Pirkelbauer, Pei-Hung Lin

    Abstract: In recent months, Language Models (LMs) have become a part of daily discourse, with focus on OpenAI and the potential of Artificial General Intelligence (AGI). Furthermore, the leaking of LLama's weights to the public has led to an influx of innovations demonstrating the impressive capabilities of generative LMs. While we believe that AGI is still a distant goal, we recognize the potential of LMs… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: Submitted to CGO-24

  48. arXiv:2306.06993  [pdf, ps, other

    cs.RO

    Occlusion-Aware Path Planning for Collision Avoidance: Leveraging Potential Field Method with Responsibility-Sensitive Safety

    Authors: Pengfei Lin, Ehsan Javanmardi, Jin Nakazato, Manabu Tsukada

    Abstract: Collision avoidance (CA) has always been the foremost task for autonomous vehicles (AVs) under safety criteria. And path planning is directly responsible for generating a safe path to accomplish CA while satisfying other commands. Due to the real-time computation and simple structure, the potential field (PF) has emerged as one of the mainstream path-planning algorithms. However, the current PF is… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: Submitted to IEEE ITSC 2023

  49. arXiv:2306.06987  [pdf, ps, other

    cs.RO

    Potential Field-based Path Planning with Interactive Speed Optimization for Autonomous Vehicles

    Authors: Pengfei Lin, Ehsan Javanmardi, Jin Nakazato, Manabu Tsukada

    Abstract: Path planning is critical for autonomous vehicles (AVs) to determine the optimal route while considering constraints and objectives. The potential field (PF) approach has become prevalent in path planning due to its simple structure and computational efficiency. However, current PF methods used in AVs focus solely on the path generation of the ego vehicle while assuming that the surrounding obstac… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: Submitted to IEEE IECON 2023

  50. arXiv:2306.06981  [pdf, ps, other

    cs.RO

    Time-to-Collision-Aware Lane-Change Strategy Based on Potential Field and Cubic Polynomial for Autonomous Vehicles

    Authors: Pengfei Lin, Ehsan Javanmardi, Ye Tao, Vishal Chauhan, Jin Nakazato, Manabu Tsukada

    Abstract: Making safe and successful lane changes (LCs) is one of the many vitally important functions of autonomous vehicles (AVs) that are needed to ensure safe driving on expressways. Recently, the simplicity and real-time performance of the potential field (PF) method have been leveraged to design decision and planning modules for AVs. However, the LC trajectory planned by the PF method is usually lengt… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: Accepted in IEEE Intelligent Vehicles Symposium (IV) 2023