Skip to main content

Showing 1–45 of 45 results for author: Lai, V

  1. arXiv:2406.19415  [pdf, other

    cs.CL

    An Analysis of Multilingual FActScore

    Authors: Kim Trong Vu, Michael Krumdick, Varshini Reddy, Franck Dernoncourt, Viet Dac Lai

    Abstract: FActScore has gained popularity as a metric to estimate the factuality of long-form texts generated by Large Language Models (LLMs) in English. However, there has not been any work in studying the behavior of FActScore in other languages. This paper studies the limitations of each component in the four-component pipeline of FActScore in the multilingual setting. We introduce a new dataset for FAct… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2406.14394  [pdf, other

    cs.CL

    SEC-QA: A Systematic Evaluation Corpus for Financial QA

    Authors: Viet Dac Lai, Michael Krumdick, Charles Lovering, Varshini Reddy, Craig Schmidt, Chris Tanner

    Abstract: The financial domain frequently deals with large numbers of long documents that are essential for daily operations. Significant effort is put towards automating financial data analysis. However, a persistent challenge, not limited to the finance domain, is the scarcity of datasets that accurately reflect real-world tasks for model evaluation. Existing datasets are often constrained by size, contex… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  3. Masked Graph Transformer for Large-Scale Recommendation

    Authors: Huiyuan Chen, Zhe Xu, Chin-Chia Michael Yeh, Vivian Lai, Yan Zheng, Minghua Xu, Hanghang Tong

    Abstract: Graph Transformers have garnered significant attention for learning graph-structured data, thanks to their superb ability to capture long-range dependencies among nodes. However, the quadratic space and time complexity hinders the scalability of Graph Transformers, particularly for large-scale recommendation. Here we propose an efficient Masked Graph Transformer, named MGFormer, capable of capturi… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  4. arXiv:2403.05565  [pdf, other

    cs.HC cs.AI

    OpenHEXAI: An Open-Source Framework for Human-Centered Evaluation of Explainable Machine Learning

    Authors: Jiaqi Ma, Vivian Lai, Yiming Zhang, Chacha Chen, Paul Hamilton, Davor Ljubenkov, Himabindu Lakkaraju, Chenhao Tan

    Abstract: Recently, there has been a surge of explainable AI (XAI) methods driven by the need for understanding machine learning model behaviors in high-stakes scenarios. However, properly evaluating the effectiveness of the XAI methods inevitably requires the involvement of human subjects, and conducting human-centered benchmarks is challenging in a number of ways: designing and implementing user studies i… ▽ More

    Submitted 20 February, 2024; originally announced March 2024.

  5. arXiv:2402.10487  [pdf, other

    cs.LG cs.AI

    RPMixer: Shaking Up Time Series Forecasting with Random Projections for Large Spatial-Temporal Data

    Authors: Chin-Chia Michael Yeh, Yujie Fan, Xin Dai, Uday Singh Saini, Vivian Lai, Prince Osei Aboagye, Junpeng Wang, Huiyuan Chen, Yan Zheng, Zhongfang Zhuang, Liang Wang, Wei Zhang

    Abstract: Spatial-temporal forecasting systems play a crucial role in addressing numerous real-world challenges. In this paper, we investigate the potential of addressing spatial-temporal forecasting problems using general time series forecasting models, i.e., models that do not leverage the spatial relationships among the nodes. We propose a all-Multi-Layer Perceptron (all-MLP) time series forecasting arch… ▽ More

    Submitted 12 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

  6. arXiv:2401.06915  [pdf, other

    cs.CL cs.AI

    DocFinQA: A Long-Context Financial Reasoning Dataset

    Authors: Varshini Reddy, Rik Koncel-Kedziorski, Viet Dac Lai, Michael Krumdick, Charles Lovering, Chris Tanner

    Abstract: For large language models (LLMs) to be effective in the financial domain -- where each decision can have a significant impact -- it is necessary to investigate realistic tasks and data. Financial professionals often interact with documents that are hundreds of pages long, but most financial research datasets only deal with short excerpts from these documents. To address this, we introduce a long-d… ▽ More

    Submitted 29 February, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

    Comments: 13 pages

  7. Towards Mitigating Dimensional Collapse of Representations in Collaborative Filtering

    Authors: Huiyuan Chen, Vivian Lai, Hongye Jin, Zhimeng Jiang, Mahashweta Das, Xia Hu

    Abstract: Contrastive Learning (CL) has shown promising performance in collaborative filtering. The key idea is to generate augmentation-invariant embeddings by maximizing the Mutual Information between different augmented views of the same instance. However, we empirically observe that existing CL models suffer from the \textsl{dimensional collapse} issue, where user/item embeddings only span a low-dimensi… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

  8. arXiv:2311.06602  [pdf, other

    cs.CL

    BizBench: A Quantitative Reasoning Benchmark for Business and Finance

    Authors: Rik Koncel-Kedziorski, Michael Krumdick, Viet Lai, Varshini Reddy, Charles Lovering, Chris Tanner

    Abstract: Answering questions within business and finance requires reasoning, precision, and a wide-breadth of technical knowledge. Together, these requirements make this domain difficult for large language models (LLMs). We introduce BizBench, a benchmark for evaluating models' ability to reason about realistic financial problems. BizBench comprises eight quantitative reasoning tasks, focusing on question-… ▽ More

    Submitted 12 March, 2024; v1 submitted 11 November, 2023; originally announced November 2023.

    Comments: Work in progress

  9. Highly Significant Detection of X-Ray Polarization from the Brightest Accreting Neutron Star Sco X-1

    Authors: Fabio La Monaca, Alessandro Di Marco, Juri Poutanen, Matteo Bachetti, Sara E. Motta, Alessandro Papitto, Maura Pilia, Fei Xie, Stefano Bianchi, Anna Bobrikova, Enrico Costa, Wei Deng, Mingyu Ge, Giulia Illiano, Shu-Mei Jia, Henric Krawczynski, Eleonora V. Lai, Kuan Liu, Guglielmo Mastroserio, Fabio Muleri, John Rankin, Paolo Soffitta, Alexandra Veledina, Filippo Ambrosino, Melania Del Santo , et al. (94 additional authors not shown)

    Abstract: The Imaging X-ray Polarimetry Explorer (IXPE) measured with high significance the X-ray polarization of the brightest Z-source Scorpius X-1, resulting in the nominal 2-8 keV energy band in a polarization degree of 1.0(0.2)% and a polarization angle of 8(6)° at 90% of confidence level. This observation was strictly simultaneous with observations performed by NICER, NuSTAR, and Insight-HXMT, which a… ▽ More

    Submitted 24 January, 2024; v1 submitted 10 November, 2023; originally announced November 2023.

    Journal ref: ApJL 960 L11 (2024)

  10. arXiv:2311.02561  [pdf, other

    cs.LG cs.AI

    Ego-Network Transformer for Subsequence Classification in Time Series Data

    Authors: Chin-Chia Michael Yeh, Huiyuan Chen, Yujie Fan, Xin Dai, Yan Zheng, Vivian Lai, Junpeng Wang, Zhongfang Zhuang, Liang Wang, Wei Zhang, Eamonn Keogh

    Abstract: Time series classification is a widely studied problem in the field of time series data mining. Previous research has predominantly focused on scenarios where relevant or foreground subsequences have already been extracted, with each subsequence corresponding to a single label. However, real-world time series data often contain foreground subsequences that are intertwined with background subsequen… ▽ More

    Submitted 5 November, 2023; originally announced November 2023.

  11. arXiv:2311.02560  [pdf, other

    cs.IR cs.LG

    Temporal Treasure Hunt: Content-based Time Series Retrieval System for Discovering Insights

    Authors: Chin-Chia Michael Yeh, Huiyuan Chen, Xin Dai, Yan Zheng, Yujie Fan, Vivian Lai, Junpeng Wang, Audrey Der, Zhongfang Zhuang, Liang Wang, Wei Zhang

    Abstract: Time series data is ubiquitous across various domains such as finance, healthcare, and manufacturing, but their properties can vary significantly depending on the domain they originate from. The ability to perform Content-based Time Series Retrieval (CTSR) is crucial for identifying unknown time series examples. However, existing CTSR works typically focus on retrieving time series from a single d… ▽ More

    Submitted 5 November, 2023; originally announced November 2023.

  12. arXiv:2310.03919  [pdf, other

    cs.IR cs.AI cs.LG

    An Efficient Content-based Time Series Retrieval System

    Authors: Chin-Chia Michael Yeh, Huiyuan Chen, Xin Dai, Yan Zheng, Junpeng Wang, Vivian Lai, Yujie Fan, Audrey Der, Zhongfang Zhuang, Liang Wang, Wei Zhang, Jeff M. Phillips

    Abstract: A Content-based Time Series Retrieval (CTSR) system is an information retrieval system for users to interact with time series emerged from multiple domains, such as finance, healthcare, and manufacturing. For example, users seeking to learn more about the source of a time series can submit the time series as a query to the CTSR system and retrieve a list of relevant time series with associated met… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

  13. arXiv:2310.03916  [pdf, other

    cs.LG cs.AI

    Toward a Foundation Model for Time Series Data

    Authors: Chin-Chia Michael Yeh, Xin Dai, Huiyuan Chen, Yan Zheng, Yujie Fan, Audrey Der, Vivian Lai, Zhongfang Zhuang, Junpeng Wang, Liang Wang, Wei Zhang

    Abstract: A foundation model is a machine learning model trained on a large and diverse set of data, typically using self-supervised learning-based pre-training techniques, that can be adapted to various downstream tasks. However, current research on time series pre-training has mostly focused on models pre-trained solely on data from a single domain, resulting in a lack of knowledge about other types of ti… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

  14. arXiv:2309.09400  [pdf, other

    cs.CL cs.AI

    CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages

    Authors: Thuat Nguyen, Chien Van Nguyen, Viet Dac Lai, Hieu Man, Nghia Trung Ngo, Franck Dernoncourt, Ryan A. Rossi, Thien Huu Nguyen

    Abstract: The driving factors behind the development of large language models (LLMs) with impressive learning capabilities are their colossal model sizes and extensive training datasets. Along with the progress in natural language processing, LLMs have been frequently made accessible to the public to foster deeper investigation and applications. However, when it comes to training datasets for these LLMs, es… ▽ More

    Submitted 17 September, 2023; originally announced September 2023.

    Comments: Ongoing Work

  15. Adversarial Collaborative Filtering for Free

    Authors: Huiyuan Chen, Xiaoting Li, Vivian Lai, Chin-Chia Michael Yeh, Yujie Fan, Yan Zheng, Mahashweta Das, Hao Yang

    Abstract: Collaborative Filtering (CF) has been successfully used to help users discover the items of interest. Nevertheless, existing CF methods suffer from noisy data issue, which negatively impacts the quality of recommendation. To tackle this problem, many prior studies leverage adversarial learning to regularize the representations of users/items, which improves both generalizability and robustness. Th… ▽ More

    Submitted 20 August, 2023; originally announced August 2023.

  16. Enhancing Transformers without Self-supervised Learning: A Loss Landscape Perspective in Sequential Recommendation

    Authors: Vivian Lai, Huiyuan Chen, Chin-Chia Michael Yeh, Minghua Xu, Yiwei Cai, Hao Yang

    Abstract: Transformer and its variants are a powerful class of architectures for sequential recommendation, owing to their ability of capturing a user's dynamic interests from their past interactions. Despite their success, Transformer-based models often require the optimization of a large number of parameters, making them difficult to train from sparse data in sequential recommendation. To address the prob… ▽ More

    Submitted 20 August, 2023; originally announced August 2023.

  17. arXiv:2307.16039  [pdf, other

    cs.CL cs.LG

    Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback

    Authors: Viet Dac Lai, Chien Van Nguyen, Nghia Trung Ngo, Thuat Nguyen, Franck Dernoncourt, Ryan A. Rossi, Thien Huu Nguyen

    Abstract: A key technology for the development of large language models (LLMs) involves instruction tuning that helps align the models' responses with human expectations to realize impressive learning abilities. Two major approaches for instruction tuning characterize supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF), which are currently applied to produce the best commercia… ▽ More

    Submitted 1 August, 2023; v1 submitted 29 July, 2023; originally announced July 2023.

  18. arXiv:2307.12949  [pdf, ps, other

    cs.CL

    Boosting Punctuation Restoration with Data Generation and Reinforcement Learning

    Authors: Viet Dac Lai, Abel Salinas, Hao Tan, Trung Bui, Quan Tran, Seunghyun Yoon, Hanieh Deilamsalehy, Franck Dernoncourt, Thien Huu Nguyen

    Abstract: Punctuation restoration is an important task in automatic speech recognition (ASR) which aim to restore the syntactic structure of generated ASR texts to improve readability. While punctuated texts are abundant from written documents, the discrepancy between written punctuated texts and ASR texts limits the usability of written texts in training punctuation restoration systems for ASR texts. This… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: Accepted at INTERSPEECH 2023, 6 pages

  19. arXiv:2307.08910  [pdf, other

    cs.LG cs.IR

    Sharpness-Aware Graph Collaborative Filtering

    Authors: Huiyuan Chen, Chin-Chia Michael Yeh, Yujie Fan, Yan Zheng, Junpeng Wang, Vivian Lai, Mahashweta Das, Hao Yang

    Abstract: Graph Neural Networks (GNNs) have achieved impressive performance in collaborative filtering. However, GNNs tend to yield inferior performance when the distributions of training and test data are not aligned well. Also, training GNNs requires optimizing non-convex neural networks with an abundance of local and global minima, which may differ widely in their performance at test time. Thus, it is es… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

  20. arXiv:2305.14889  [pdf, other

    cs.CL cs.AI

    Evaluating Evaluation Metrics: A Framework for Analyzing NLG Evaluation Metrics using Measurement Theory

    Authors: Ziang Xiao, Susu Zhang, Vivian Lai, Q. Vera Liao

    Abstract: We address a fundamental challenge in Natural Language Generation (NLG) model evaluation -- the design and evaluation of evaluation metrics. Recognizing the limitations of existing automatic metrics and noises from how current human evaluation was conducted, we propose MetricEval, a framework informed by measurement theory, the foundation of educational test design, for conceptualizing and evaluat… ▽ More

    Submitted 22 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023

  21. arXiv:2304.05613  [pdf, other

    cs.CL cs.AI

    ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large Language Models in Multilingual Learning

    Authors: Viet Dac Lai, Nghia Trung Ngo, Amir Pouran Ben Veyseh, Hieu Man, Franck Dernoncourt, Trung Bui, Thien Huu Nguyen

    Abstract: Over the last few years, large language models (LLMs) have emerged as the most important breakthroughs in natural language processing (NLP) that fundamentally transform research and developments in the field. ChatGPT represents one of the most exciting LLM systems developed recently to showcase impressive skills for language generation and highly attract public attention. Among various exciting ap… ▽ More

    Submitted 12 April, 2023; originally announced April 2023.

  22. arXiv:2301.09656  [pdf, other

    cs.AI cs.CL cs.HC cs.LG

    Selective Explanations: Leveraging Human Input to Align Explainable AI

    Authors: Vivian Lai, Yiming Zhang, Chacha Chen, Q. Vera Liao, Chenhao Tan

    Abstract: While a vast collection of explainable AI (XAI) algorithms have been developed in recent years, they are often criticized for significant gaps with how humans produce and consume explanations. As a result, current XAI techniques are often found to be hard to use and lack effectiveness. In this work, we attempt to close these gaps by making AI explanations selective -- a fundamental property of hum… ▽ More

    Submitted 7 August, 2023; v1 submitted 23 January, 2023; originally announced January 2023.

    Comments: 21 pages, 25 figures

  23. arXiv:2210.03419  [pdf, other

    cs.CL cs.IR cs.LG

    Event Extraction: A Survey

    Authors: Viet Dac Lai

    Abstract: Extracting the reported events from text is one of the key research themes in natural language processing. This process includes several tasks such as event detection, argument extraction, role labeling. As one of the most important topics in natural language processing and natural language understanding, the applications of event extraction spans across a wide range of domains such as newswire, b… ▽ More

    Submitted 10 October, 2022; v1 submitted 7 October, 2022; originally announced October 2022.

    Comments: 20 pages

  24. arXiv:2206.06383  [pdf, other

    cs.CL cs.AI cs.HC

    An Exploration of Post-Editing Effectiveness in Text Summarization

    Authors: Vivian Lai, Alison Smith-Renner, Ke Zhang, Ruijia Cheng, Wenjuan Zhang, Joel Tetreault, Alejandro Jaimes

    Abstract: Automatic summarization methods are efficient but can suffer from low quality. In comparison, manual summarization is expensive but produces higher quality. Can humans and AI collaborate to improve summarization performance? In similar text generation tasks (e.g., machine translation), human-AI collaboration in the form of "post-editing" AI-generated text reduces human workload and improves the qu… ▽ More

    Submitted 13 June, 2022; originally announced June 2022.

    Comments: 18 pages, 21 figures

  25. arXiv:2204.12070  [pdf, other

    cs.CL

    Symlink: A New Dataset for Scientific Symbol-Description Linking

    Authors: Viet Dac Lai, Amir Pouran Ben Veyseh, Franck Dernoncourt, Thien Huu Nguyen

    Abstract: Mathematical symbols and descriptions appear in various forms across document section boundaries without explicit markup. In this paper, we present a new large-scale dataset that emphasizes extracting symbols and descriptions in scientific documents. Symlink annotates scientific papers of 5 different domains (i.e., computer science, biology, physics, mathematics, and economics). Our experiments on… ▽ More

    Submitted 26 April, 2022; originally announced April 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2202.09695

  26. arXiv:2204.11788  [pdf, other

    cs.AI cs.HC cs.LG

    Human-AI Collaboration via Conditional Delegation: A Case Study of Content Moderation

    Authors: Vivian Lai, Samuel Carton, Rajat Bhatnagar, Q. Vera Liao, Yunfeng Zhang, Chenhao Tan

    Abstract: Despite impressive performance in many benchmark datasets, AI models can still make mistakes, especially among out-of-distribution examples. It remains an open question how such imperfect models can be used effectively in collaboration with humans. Prior work has focused on AI assistance that helps people make individual high-stakes decisions, which is not scalable for a large amount of relatively… ▽ More

    Submitted 25 April, 2022; originally announced April 2022.

    Comments: 18 pages, 44 figures

  27. arXiv:2202.09695  [pdf, other

    cs.CL cs.CV

    SemEval 2022 Task 12: Symlink- Linking Mathematical Symbols to their Descriptions

    Authors: Viet Dac Lai, Amir Pouran Ben Veyseh, Franck Dernoncourt, Thien Huu Nguyen

    Abstract: Given the increasing number of livestreaming videos, automatic speech recognition and post-processing for livestreaming video transcripts are crucial for efficient data management as well as knowledge mining. A key step in this process is punctuation restoration which restores fundamental text structures such as phrase and sentence boundaries from the video transcripts. This work presents a new hu… ▽ More

    Submitted 24 April, 2022; v1 submitted 19 February, 2022; originally announced February 2022.

    Comments: SemEval 2022 Task 12

  28. The X-ray spectral-timing contribution of the stellar wind in the hard state of Cyg X-1

    Authors: E. V. Lai, B. De Marco, A. A. Zdziarski, T. M. Belloni, S. Mondal, P. Uttley, V. Grinberg, J. Wilms, A. Różańska

    Abstract: The clumpy stellar wind from the companion star in high mass X-ray binaries causes variable, partial absorption of the emission from the X-ray source. We studied XMM-Newton observations from the 7.22 d-long "Cyg X-1 Hard state Observations of a Complete Binary Orbit in X-rays" (CHOCBOX) monitoring campaign, in order to constrain the effects of the stellar wind on the short-timescale X-ray spectral… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

    Comments: 16 pages, 13 figures

  29. arXiv:2112.11471  [pdf, other

    cs.AI cs.CL cs.CY cs.HC cs.LG

    Towards a Science of Human-AI Decision Making: A Survey of Empirical Studies

    Authors: Vivian Lai, Chacha Chen, Q. Vera Liao, Alison Smith-Renner, Chenhao Tan

    Abstract: As AI systems demonstrate increasingly strong predictive performance, their adoption has grown in numerous domains. However, in high-stakes domains such as criminal justice and healthcare, full automation is often not desirable due to safety, ethical, and legal concerns, yet fully manual approaches can be inaccurate and time consuming. As a result, there is growing interest in the research communi… ▽ More

    Submitted 21 December, 2021; originally announced December 2021.

    Comments: 36 pages, 2 figures, see https://haidecisionmaking.github.io for website

  30. arXiv:2105.07949  [pdf, other

    cs.CY cs.CL

    Using Transformers to Provide Teachers with Personalized Feedback on their Classroom Discourse: The TalkMoves Application

    Authors: Abhijit Suresh, Jennifer Jacobs, Vivian Lai, Chenhao Tan, Wayne Ward, James H. Martin, Tamara Sumner

    Abstract: TalkMoves is an innovative application designed to support K-12 mathematics teachers to reflect on, and continuously improve their instructional practices. This application combines state-of-the-art natural language processing capabilities with automated speech recognition to automatically analyze classroom recordings and provide teachers with personalized feedback on their use of specific types o… ▽ More

    Submitted 29 April, 2021; originally announced May 2021.

    Comments: Presented at the AAAI 2021 Spring Symposium on Artificial Intelligence for K-12 Education

  31. arXiv:2103.09330  [pdf, other

    cs.CL

    Cross-Task Instance Representation Interactions and Label Dependencies for Joint Information Extraction with Graph Convolutional Networks

    Authors: Minh Van Nguyen, Viet Dac Lai, Thien Huu Nguyen

    Abstract: Existing works on information extraction (IE) have mainly solved the four main tasks separately (entity mention recognition, relation extraction, event trigger detection, and argument extraction), thus failing to benefit from inter-dependencies between tasks. This paper presents a novel deep learning model to simultaneously solve the four tasks of IE in a single model (called FourIE). Compared to… ▽ More

    Submitted 26 March, 2021; v1 submitted 16 March, 2021; originally announced March 2021.

    Comments: Accepted at NAACL-HLT 2021

  32. The inner flow geometry in MAXI J1820+070 during hard and hard-intermediate states

    Authors: B. De Marco, A. A. Zdziarski, G. Ponti, G. Migliori, T. M. Belloni, A. Segovia Otero, M. Dziełak, E. V. Lai

    Abstract: [Abridged] Context: We present a systematic X-ray spectral-timing study of the recently discovered, exceptionally bright black hole X-ray binary system MAXI J1820+070. Our analysis focuses on the first part of the 2018 outburst, covering the rise throughout the hard state, the bright hard and hard-intermediate states, and the transition to the soft-intermediate state. Aims: We address the issue of… ▽ More

    Submitted 6 August, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

    Comments: Accepted for publication in Astronomy & Astrophysics, matches published version

    Journal ref: A&A 654, A14 (2021)

  33. arXiv:2101.05303  [pdf, other

    cs.AI cs.CY cs.HC cs.LG

    Understanding the Effect of Out-of-distribution Examples and Interactive Explanations on Human-AI Decision Making

    Authors: Han Liu, Vivian Lai, Chenhao Tan

    Abstract: Although AI holds promise for improving human decision making in societally critical domains, it remains an open question how human-AI teams can reliably outperform AI alone and human alone in challenging prediction tasks (also known as complementary performance). We explore two directions to understand the gaps in achieving complementary performance. First, we argue that the typical experimental… ▽ More

    Submitted 5 October, 2021; v1 submitted 13 January, 2021; originally announced January 2021.

    Comments: 45 pages, 24 figures, accepted to CSCW 2021

  34. arXiv:2101.03289  [pdf, other

    cs.CL

    Trankit: A Light-Weight Transformer-based Toolkit for Multilingual Natural Language Processing

    Authors: Minh Van Nguyen, Viet Dac Lai, Amir Pouran Ben Veyseh, Thien Huu Nguyen

    Abstract: We introduce Trankit, a light-weight Transformer-based Toolkit for multilingual Natural Language Processing (NLP). It provides a trainable pipeline for fundamental NLP tasks over 100 languages, and 90 pretrained pipelines for 56 languages. Built on a state-of-the-art pretrained language model, Trankit significantly outperforms prior multilingual NLP pipelines over sentence segmentation, part-of-sp… ▽ More

    Submitted 14 October, 2021; v1 submitted 8 January, 2021; originally announced January 2021.

    Comments: Camera-ready version for EACL 2021 Demo

  35. arXiv:2010.14123  [pdf, ps, other

    cs.CL

    Event Detection: Gate Diversity and Syntactic Importance Scoresfor Graph Convolution Neural Networks

    Authors: Viet Dac Lai, Tuan Ngo Nguyen, Thien Huu Nguyen

    Abstract: Recent studies on event detection (ED) haveshown that the syntactic dependency graph canbe employed in graph convolution neural net-works (GCN) to achieve state-of-the-art per-formance. However, the computation of thehidden vectors in such graph-based models isagnostic to the trigger candidate words, po-tentially leaving irrelevant information for thetrigger candidate for event prediction. In addi… ▽ More

    Submitted 27 October, 2020; originally announced October 2020.

    Comments: EMNLP 2020

  36. An extreme Ultraluminous X-ray source X-1 in NGC 5055

    Authors: Samaresh Mondal, Agata Rozanska, Eleonora Veronica Lai, Barbara De Marco

    Abstract: Aims. We analyzed multi-epoch X-ray data of the Ultraluminous X-ray source (ULX) NGC 5055 X-1, with luminosity up to $2.32\times10^{40}\ \rm erg\ s^{-1}$, in order to constrain the physical parameters of the source. Methods. We performed timing and spectral analysis of Chandra and XMM-Newton observations. We used spectral models which assume the emission is from an accreting black hole system. We… ▽ More

    Submitted 5 August, 2020; originally announced August 2020.

    Comments: 8 pages, 10 figures, Accepted for publication in A&A

    Journal ref: A&A 642, A94 (2020)

  37. arXiv:2006.10093  [pdf, ps, other

    cs.CL

    Extensively Matching for Few-shot Learning Event Detection

    Authors: Viet Dac Lai, Franck Dernoncourt, Thien Huu Nguyen

    Abstract: Current event detection models under super-vised learning settings fail to transfer to newevent types. Few-shot learning has not beenexplored in event detection even though it al-lows a model to perform well with high gener-alization on new event types. In this work, weformulate event detection as a few-shot learn-ing problem to enable to extend event detec-tion to new event types. We propose two… ▽ More

    Submitted 17 June, 2020; originally announced June 2020.

    Comments: 1st Joint Workshop on Narrative Understanding, Storylines, and Events (NUSE) @ ACL 2020

  38. arXiv:2003.07370  [pdf, ps, other

    cs.HC cs.AI cs.CL cs.CY

    Harnessing Explanations to Bridge AI and Humans

    Authors: Vivian Lai, Samuel Carton, Chenhao Tan

    Abstract: Machine learning models are increasingly integrated into societally critical applications such as recidivism prediction and medical diagnosis, thanks to their superior predictive power. In these applications, however, full automation is often not desired due to ethical and legal concerns. The research community has thus ventured into developing interpretable methods that explain machine prediction… ▽ More

    Submitted 16 March, 2020; originally announced March 2020.

    Comments: 4 pages, CHI 2020 Fair & Responsible AI Workshop

  39. arXiv:2002.05295  [pdf, ps, other

    cs.CL cs.LG stat.ML

    Exploiting the Matching Information in the Support Set for Few Shot Event Classification

    Authors: Viet Dac Lai, Franck Dernoncourt, Thien Huu Nguyen

    Abstract: The existing event classification (EC) work primarily focuseson the traditional supervised learning setting in which models are unableto extract event mentions of new/unseen event types. Few-shot learninghas not been investigated in this area although it enables EC models toextend their operation to unobserved event types. To fill in this gap, inthis work, we investigate event classification under… ▽ More

    Submitted 19 June, 2020; v1 submitted 12 February, 2020; originally announced February 2020.

    Comments: Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD) 2020

  40. arXiv:2001.05871  [pdf, other

    cs.HC cs.AI cs.CL cs.CY cs.LG

    "Why is 'Chicago' deceptive?" Towards Building Model-Driven Tutorials for Humans

    Authors: Vivian Lai, Han Liu, Chenhao Tan

    Abstract: To support human decision making with machine learning models, we often need to elucidate patterns embedded in the models that are unsalient, unknown, or counterintuitive to humans. While existing approaches focus on explaining machine predictions with real-time assistance, we explore model-driven tutorials to help humans understand these patterns in a training phase. We consider both tutorials wi… ▽ More

    Submitted 14 January, 2020; originally announced January 2020.

    Comments: 26 pages, 48 figures, CHI 2020

  41. arXiv:1910.11368  [pdf, ps, other

    cs.LG cs.CL stat.ML

    Extending Event Detection to New Types with Learning from Keywords

    Authors: Viet Dac Lai, Thien Huu Nguyen

    Abstract: Traditional event detection classifies a word or a phrase in a given sentence for a set of predefined event types. The limitation of such predefined set is that it prevents the adaptation of the event detection models to new event types. We study a novel formulation of event detection that describes types via several keywords to match the contexts in documents. This facilitates the operation of th… ▽ More

    Submitted 24 October, 2019; originally announced October 2019.

  42. arXiv:1910.08534  [pdf, other

    cs.CL cs.CY cs.HC cs.LG

    Many Faces of Feature Importance: Comparing Built-in and Post-hoc Feature Importance in Text Classification

    Authors: Vivian Lai, Jon Z. Cai, Chenhao Tan

    Abstract: Feature importance is commonly used to explain machine predictions. While feature importance can be derived from a machine learning model with a variety of methods, the consistency of feature importance via different methods remains understudied. In this work, we systematically compare feature importance from built-in mechanisms in a model such as attention values and post-hoc methods that approxi… ▽ More

    Submitted 18 October, 2019; originally announced October 2019.

    Comments: 17 pages, 18 figures, EMNLP 2019, the code is available at https://vivlai.github.io/

  43. arXiv:1906.05398  [pdf

    physics.app-ph cond-mat.mtrl-sci

    Self-driving laboratory for accelerated discovery of thin-film materials

    Authors: Benjamin P. MacLeod, Fraser G. L. Parlane, Thomas D. Morrissey, Florian Häse, Loïc M. Roch, Kevan E. Dettelbach, Raphaell Moreira, Lars P. E. Yunker, Michael B. Rooney, Joseph R. Deeth, Veronica Lai, Gordon J. Ng, Henry Situ, Ray H. Zhang, Michael S. Elliott, Ted H. Haley, David J. Dvorak, Alán Aspuru-Guzik, Jason E. Hein, Curtis P. Berlinguette

    Abstract: Discovering and optimizing commercially viable materials for clean energy applications typically takes over a decade. Self-driving laboratories that iteratively design, execute, and learn from material science experiments in a fully autonomous loop present an opportunity to accelerate this research. We report here a modular robotic platform driven by a model-based optimization algorithm capable of… ▽ More

    Submitted 10 March, 2020; v1 submitted 12 June, 2019; originally announced June 2019.

    Comments: 43 pages, 9 figures

  44. arXiv:1811.07901  [pdf, other

    cs.AI cs.CL cs.CY physics.soc-ph stat.ML

    On Human Predictions with Explanations and Predictions of Machine Learning Models: A Case Study on Deception Detection

    Authors: Vivian Lai, Chenhao Tan

    Abstract: Humans are the final decision makers in critical tasks that involve ethical and legal concerns, ranging from recidivism prediction, to medical diagnosis, to fighting against fake news. Although machine learning models can sometimes achieve impressive performance in these tasks, these tasks are not amenable to full automation. To realize the potential of machine learning for improving human decisio… ▽ More

    Submitted 8 January, 2019; v1 submitted 19 November, 2018; originally announced November 2018.

    Comments: 17 pages, 19 figures, in Proceedings of ACM FAT* 2019, dataset & demo available at https://deception.machineintheloop.com

  45. arXiv:1611.05339  [pdf

    cs.CY

    CareerMapper: An Automated Resume Evaluation Tool

    Authors: Vivian Lai, Kyong Jin Shim, Richard J. Oentaryo, Philips K. Prasetyo, Casey Vu, Ee-Peng Lim, David Lo

    Abstract: The advent of the Web brought about major changes in the way people search for jobs and companies look for suitable candidates. As more employers and recruitment firms turn to the Web for job candidate search, an increasing number of people turn to the Web for uploading and creating their online resumes. Resumes are often the first source of information about candidates and also the first item of… ▽ More

    Submitted 16 November, 2016; originally announced November 2016.

    Journal ref: Proceedings of the IEEE International Conference on Big Data (IEEE BigData 2016)