Skip to main content

Showing 1–50 of 54 results for author: Choo, E

  1. arXiv:2406.14155  [pdf, other

    cs.CL

    Aligning Large Language Models with Diverse Political Viewpoints

    Authors: Dominik Stammbach, Philine Widmer, Eunjung Cho, Caglar Gulcehre, Elliott Ash

    Abstract: Large language models such as ChatGPT often exhibit striking political biases. If users query them about political information, they might take a normative stance and reinforce such biases. To overcome this, we align LLMs with diverse political viewpoints from 100,000 comments written by candidates running for national parliament in Switzerland. Such aligned models are able to generate more accura… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2406.13474  [pdf, other

    cs.LG cs.AI

    Attention-aware Post-training Quantization without Backpropagation

    Authors: Junhan Kim, Ho-young Kim, Eulrang Cho, Chungman Lee, Joonyoung Kim, Yongkweon Jeon

    Abstract: Quantization is a promising solution for deploying large-scale language models (LLMs) on resource-constrained devices. Existing quantization approaches, however, rely on gradient-based optimization, regardless of it being post-training quantization (PTQ) or quantization-aware training (QAT), which becomes problematic for hyper-scale LLMs with billions of parameters. This overhead can be alleviated… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 20 pages, under review

  3. arXiv:2406.13144  [pdf, other

    cs.CL cs.AI

    DialSim: A Real-Time Simulator for Evaluating Long-Term Dialogue Understanding of Conversational Agents

    Authors: Jiho Kim, Woosog Chay, Hyeonji Hwang, Daeun Kyung, Hyunseung Chung, Eunbyeol Cho, Yohan Jo, Edward Choi

    Abstract: Recent advancements in Large Language Models (LLMs) have significantly enhanced the capabilities of conversational agents, making them applicable to various fields (e.g., education). Despite their progress, the evaluation of the agents often overlooks the complexities of real-world conversations, such as real-time interactions, multi-party dialogues, and extended contextual dependencies. To bridge… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  4. arXiv:2405.19598  [pdf, other

    cs.CR

    Evaluating the Effectiveness and Robustness of Visual Similarity-based Phishing Detection Models

    Authors: Fujiao Ji, Kiho Lee, Hyungjoon Koo, Wenhao You, Euijin Choo, Hyoungshick Kim, Doowon Kim

    Abstract: Phishing attacks pose a significant threat to Internet users, with cybercriminals elaborately replicating the visual appearance of legitimate websites to deceive victims. Visual similarity-based detection systems have emerged as an effective countermeasure, but their effectiveness and robustness in real-world scenarios have been unexplored. In this paper, we comprehensively scrutinize and evaluate… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 12 pages

  5. arXiv:2404.09041  [pdf, other

    cs.CY

    Three Disclaimers for Safe Disclosure: A Cardwriter for Reporting the Use of Generative AI in Writing Process

    Authors: Won Ik Cho, Eunjung Cho, Hyeonji Shin

    Abstract: Generative artificial intelligence (AI) and large language models (LLMs) are increasingly being used in the academic writing process. This is despite the current lack of unified framework for reporting the use of machine assistance. In this work, we propose "Cardwriter", an intuitive interface that produces a short report for authors to declare their use of generative AI in their writing process.… ▽ More

    Submitted 13 April, 2024; originally announced April 2024.

    Comments: 6 pages; an implementation version of PaperCard project

  6. arXiv:2404.05687  [pdf, other

    cs.CV

    Retrieval-Augmented Open-Vocabulary Object Detection

    Authors: Jooyeon Kim, Eulrang Cho, Sehyung Kim, Hyunwoo J. Kim

    Abstract: Open-vocabulary object detection (OVD) has been studied with Vision-Language Models (VLMs) to detect novel objects beyond the pre-trained categories. Previous approaches improve the generalization ability to expand the knowledge of the detector, using 'positive' pseudo-labels with additional 'class' names, e.g., sock, iPod, and alligator. To extend the previous methods in two aspects, we propose R… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: Accepted paper at CVPR 2024

  7. arXiv:2404.05431  [pdf, other

    cs.CR

    Simplifying MBA Expression Using E-Graphs

    Authors: Seoksu Lee, Hyeongchang Jeon, Eun-Sun Cho

    Abstract: Code obfuscation involves the addition of meaningless code or the complication of existing code in order to make a program difficult to reverse engineer. In recent years, MBA (Mixed Boolean Arithmetic) obfuscation has been applied to virus and malware code to impede expert analysis. Among the various obfuscation techniques, Mixed Boolean Arithmetic (MBA) obfuscation is considered the most challeng… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  8. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  9. arXiv:2403.15370  [pdf, other

    cs.CV cs.LG cs.RO

    Augmented Reality based Simulated Data (ARSim) with multi-view consistency for AV perception networks

    Authors: Aqeel Anwar, Tae Eun Choe, Zian Wang, Sanja Fidler, Minwoo Park

    Abstract: Detecting a diverse range of objects under various driving scenarios is essential for the effectiveness of autonomous driving systems. However, the real-world data collected often lacks the necessary diversity presenting a long-tail distribution. Although synthetic data has been utilized to overcome this issue by generating virtual scenes, it faces hurdles such as a significant domain gap and the… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 17 pages, 15 figures, 7 tables

  10. arXiv:2403.02786  [pdf, other

    cs.LG cs.AI

    Semi-Supervised Graph Representation Learning with Human-centric Explanation for Predicting Fatty Liver Disease

    Authors: So Yeon Kim, Sehee Wang, Eun Kyung Choe

    Abstract: Addressing the challenge of limited labeled data in clinical settings, particularly in the prediction of fatty liver disease, this study explores the potential of graph representation learning within a semi-supervised learning framework. Leveraging graph neural networks (GNNs), our approach constructs a subject similarity graph to identify risk patterns from health checkup data. The effectiveness… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: Paper accepted in Human-Centric Representation Learning workshop at AAAI 2024 (https://hcrl-workshop.github.io/2024/)

  11. arXiv:2311.08788  [pdf, other

    cs.CL cs.AI cs.LG

    X-Eval: Generalizable Multi-aspect Text Evaluation via Augmented Instruction Tuning with Auxiliary Evaluation Aspects

    Authors: Minqian Liu, Ying Shen, Zhiyang Xu, Yixin Cao, Eunah Cho, Vaibhav Kumar, Reza Ghanadan, Lifu Huang

    Abstract: Natural Language Generation (NLG) typically involves evaluating the generated text in various aspects (e.g., consistency and naturalness) to obtain a comprehensive assessment. However, multi-aspect evaluation remains challenging as it may require the evaluator to generalize to any given evaluation aspect even if it's absent during training. In this paper, we introduce X-Eval, a two-stage instructi… ▽ More

    Submitted 13 April, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: NAACL 2024 Main Conference. 20 pages, 6 figures, 17 tables

  12. arXiv:2310.18652  [pdf, other

    cs.CL cs.AI cs.CV

    EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray Images

    Authors: Seongsu Bae, Daeun Kyung, Jaehee Ryu, Eunbyeol Cho, Gyubok Lee, Sunjun Kweon, Jungwoo Oh, Lei Ji, Eric I-Chao Chang, Tackeun Kim, Edward Choi

    Abstract: Electronic Health Records (EHRs), which contain patients' medical histories in various multi-modal formats, often overlook the potential for joint reasoning across imaging and table modalities underexplored in current EHR Question Answering (QA) systems. In this paper, we introduce EHRXQA, a novel multi-modal question answering dataset combining structured EHRs and chest X-ray images. To develop o… ▽ More

    Submitted 25 December, 2023; v1 submitted 28 October, 2023; originally announced October 2023.

    Comments: Accepted at NeurIPS 2023 Datasets and Benchmarks Track (10 pages for main text, 4 pages for references, 39 pages for supplementary materials)

  13. arXiv:2310.05791  [pdf, other

    cs.CL

    Problem-Solving Guide: Predicting the Algorithm Tags and Difficulty for Competitive Programming Problems

    Authors: Juntae Kim, Eunjung Cho, Dongwoo Kim, Dongbin Na

    Abstract: The recent program development industries have required problem-solving abilities for engineers, especially application developers. However, AI-based education systems to help solve computer algorithm problems have not yet attracted attention, while most big tech companies require the ability to solve algorithm problems including Google, Meta, and Amazon. The most useful guide to solving algorithm… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: 8 pages

  14. arXiv:2310.04824  [pdf, other

    cs.CY

    PaperCard for Reporting Machine Assistance in Academic Writing

    Authors: Won Ik Cho, Eunjung Cho, Kyunghyun Cho

    Abstract: Academic writing process has benefited from various technological developments over the years including search engines, automatic translators, and editing tools that review grammar and spelling mistakes. They have enabled human writers to become more efficient in writing academic papers, for example by helping with finding relevant literature more effectively and polishing texts. While these devel… ▽ More

    Submitted 7 October, 2023; originally announced October 2023.

    Comments: Accepted at EAAMO'23 as a poster presentation

  15. arXiv:2309.03406  [pdf, other

    cs.CV

    Distribution-Aware Prompt Tuning for Vision-Language Models

    Authors: Eulrang Cho, Jooyeon Kim, Hyunwoo J. Kim

    Abstract: Pre-trained vision-language models (VLMs) have shown impressive performance on various downstream tasks by utilizing knowledge learned from large data. In general, the performance of VLMs on target tasks can be further improved by prompt tuning, which adds context to the input image or text. By leveraging data from target tasks, various prompt-tuning methods have been studied in the literature. A… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

    Comments: Accepted to ICCV2023

  16. arXiv:2309.00237  [pdf, other

    cs.CL cs.AI

    Publicly Shareable Clinical Large Language Model Built on Synthetic Clinical Notes

    Authors: Sunjun Kweon, Junu Kim, Jiyoun Kim, Sujeong Im, Eunbyeol Cho, Seongsu Bae, Jungwoo Oh, Gyubok Lee, Jong Hak Moon, Seng Chan You, Seungjin Baek, Chang Hoon Han, Yoon Bin Jung, Yohan Jo, Edward Choi

    Abstract: The development of large language models tailored for handling patients' clinical notes is often hindered by the limited accessibility and usability of these notes due to strict privacy regulations. To address these challenges, we first create synthetic large-scale clinical notes using publicly available case reports extracted from biomedical literature. We then use these synthetic notes to train… ▽ More

    Submitted 13 June, 2024; v1 submitted 1 September, 2023; originally announced September 2023.

    Comments: ACL 2024 (Findings)

  17. arXiv:2308.14296  [pdf, other

    cs.IR cs.AI

    RecMind: Large Language Model Powered Agent For Recommendation

    Authors: Yancheng Wang, Ziyan Jiang, Zheng Chen, Fan Yang, Yingxue Zhou, Eunah Cho, Xing Fan, Xiaojiang Huang, Yanbin Lu, Yingzhen Yang

    Abstract: While the recommendation system (RS) has advanced significantly through deep learning, current RS approaches usually train and fine-tune models on task-specific datasets, limiting their generalizability to new recommendation tasks and their ability to leverage external knowledge due to model scale and data size constraints. Thus, we designed an LLM-powered autonomous recommender agent, RecMind, wh… ▽ More

    Submitted 20 March, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: Accepted by NAACL 2024 (Findings)

  18. arXiv:2305.14449  [pdf, other

    cs.AI cs.IR cs.LG

    Graph Meets LLM: A Novel Approach to Collaborative Filtering for Robust Conversational Understanding

    Authors: Zheng Chen, Ziyan Jiang, Fan Yang, Eunah Cho, Xing Fan, Xiaojiang Huang, Yanbin Lu, Aram Galstyan

    Abstract: Conversational AI systems such as Alexa need to understand defective queries to ensure robust conversational understanding and reduce user friction. These defective queries often arise from user ambiguities, mistakes, or errors in automatic speech recognition (ASR) and natural language understanding (NLU). Personalized query rewriting is an approach that focuses on reducing defects in queries by… ▽ More

    Submitted 19 June, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    ACM Class: F.2.2; I.2.7

  19. arXiv:2305.07622  [pdf, other

    cs.IR cs.AI cs.CL

    PALR: Personalization Aware LLMs for Recommendation

    Authors: Fan Yang, Zheng Chen, Ziyan Jiang, Eunah Cho, Xiaojiang Huang, Yanbin Lu

    Abstract: Large language models (LLMs) have recently received significant attention for their exceptional capabilities. Despite extensive efforts in developing general-purpose LLMs that can be utilized in various natural language processing (NLP) tasks, there has been less research exploring their potential in recommender systems. In this paper, we propose a novel framework, named PALR, which aiming to comb… ▽ More

    Submitted 7 June, 2023; v1 submitted 12 May, 2023; originally announced May 2023.

    ACM Class: I.2.6; I.2.7

  20. arXiv:2304.02260  [pdf, ps, other

    cs.CR

    Feature Engineering Using File Layout for Malware Detection

    Authors: Jeongwoo Kim, Eun-Sun Cho, Joon-Young Paik

    Abstract: Malware detection on binary executables provides a high availability to even binaries which are not disassembled or decompiled. However, a binary-level approach could cause ambiguity problems. In this paper, we propose a new feature engineering technique that use minimal knowledge about the internal layout on a binary. The proposed feature avoids the ambiguity problems by integrating the informati… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: 2pages, no figures, This manuscript was presented in the poster session of The Annual Computer Security Applications Conference (ACSAC) 2020

  21. arXiv:2303.08290  [pdf, other

    cs.LG cs.CL

    Rediscovery of CNN's Versatility for Text-based Encoding of Raw Electronic Health Records

    Authors: Eunbyeol Cho, Min Jae Lee, Kyunghoon Hur, Jiyoun Kim, Jinsung Yoon, Edward Choi

    Abstract: Making the most use of abundant information in electronic health records (EHR) is rapidly becoming an important topic in the medical domain. Recent work presented a promising framework that embeds entire features in raw EHR data regardless of its form and medical code standards. The framework, however, only focuses on encoding EHR with minimal preprocessing and fails to consider how to learn effic… ▽ More

    Submitted 10 May, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

    Comments: Accepted to CHIL 2023

  22. arXiv:2303.07547  [pdf, other

    cs.CV

    HazardNet: Road Debris Detection by Augmentation of Synthetic Models

    Authors: Tae Eun Choe, Jane Wu, Xiaolin Lin, Karen Kwon, Minwoo Park

    Abstract: We present an algorithm to detect unseen road debris using a small set of synthetic models. Early detection of road debris is critical for safe autonomous or assisted driving, yet the development of a robust road debris detection model has not been widely discussed. There are two main challenges to building a road debris detector: first, data collection of road debris is challenging since hazardou… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: 11 pages

    MSC Class: ACM-class: I.1.4

  23. arXiv:2302.10454  [pdf, other

    cs.CL cs.LG

    KG-ECO: Knowledge Graph Enhanced Entity Correction for Query Rewriting

    Authors: Jinglun Cai, Mingda Li, Ziyan Jiang, Eunah Cho, Zheng Chen, Yang Liu, Xing Fan, Chenlei Guo

    Abstract: Query Rewriting (QR) plays a critical role in large-scale dialogue systems for reducing frictions. When there is an entity error, it imposes extra challenges for a dialogue system to produce satisfactory responses. In this work, we propose KG-ECO: Knowledge Graph enhanced Entity COrrection for query rewriting, an entity correction system with corrupt entity span detection and entity retrieval/re-r… ▽ More

    Submitted 22 February, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

  24. arXiv:2302.06819  [pdf, other

    cs.CR

    L4 Pointer: An efficient pointer extension for spatial memory safety support without hardware extension

    Authors: Seong-Kyun Mok, Eun-Sun Cho

    Abstract: Since buffer overflow has long been a frequently occurring, high-risk vulnerability, various methods have been developed to support spatial memory safety and prevent buffer overflow. However, every proposed method, although effective in part, has its limitations. Due to expensive bound-checking or large memory in taking for metadata, the software-only support for spatial memory safety inherently e… ▽ More

    Submitted 13 February, 2023; originally announced February 2023.

  25. arXiv:2211.08082  [pdf, other

    cs.LG cs.NE

    UniHPF : Universal Healthcare Predictive Framework with Zero Domain Knowledge

    Authors: Kyunghoon Hur, Jungwoo Oh, Junu Kim, Jiyoun Kim, Min Jae Lee, Eunbyeol Cho, Seong-Eun Moon, Young-Hak Kim, Edward Choi

    Abstract: Despite the abundance of Electronic Healthcare Records (EHR), its heterogeneity restricts the utilization of medical data in building predictive models. To address this challenge, we propose Universal Healthcare Predictive Framework (UniHPF), which requires no medical domain knowledge and minimal pre-processing for multiple prediction tasks. Experimental results demonstrate that UniHPF is capable… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2022, November 28th, 2022, New Orleans, United States & Virtual, http://www.ml4h.cc, 19 pages(main paper 6 pages). arXiv admin note: substantial text overlap with arXiv:2207.09858

  26. arXiv:2209.02903  [pdf

    cs.CY cs.CL cs.HC

    Taking a Language Detour: How International Migrants Speaking a Minority Language Seek COVID-Related Information in Their Host Countries

    Authors: Ge Gao, Jian Zheng, Eun Kyoung Choe, Naomi Yamashita

    Abstract: Information seeking is crucial for people's self-care and wellbeing in times of public crises. Extensive research has investigated empirical understandings as well as technical solutions to facilitate information seeking by domestic citizens of affected regions. However, limited knowledge is established to support international migrants who need to survive a crisis in their host countries. The cur… ▽ More

    Submitted 27 September, 2022; v1 submitted 6 September, 2022; originally announced September 2022.

    Journal ref: PACM on Human-Computer Interaction, Vol.6, No.CSCW2, Article 542, Publication date: November 2022

  27. arXiv:2208.05612  [pdf, other

    cs.CR cs.PL

    SSLEM: A Simplifier for MBA Expressions based on Semi-linear MBA Expressions and Program Synthesis

    Authors: Seong-Kyun Mok, Seoyeon Kang, Jeongwoo Kim, Eun-Sun Cho, Seokwoo Choi

    Abstract: MBA (mixed boolean and arithmetic) expressions are hard to simplify, so used for malware obfuscation to hinder analysts' diagnosis. Some MBA simplification methods with high performance have been developed, but they narrowed the target to "linear" MBA expressions, which allows efficient solutions based on logic/term-rewriting. However such restrictions are not appropriate for general forms of MBA… ▽ More

    Submitted 15 August, 2022; v1 submitted 10 August, 2022; originally announced August 2022.

  28. GenHPF: General Healthcare Predictive Framework with Multi-task Multi-source Learning

    Authors: Kyunghoon Hur, Jungwoo Oh, Junu Kim, Jiyoun Kim, Min Jae Lee, Eunbyeol Cho, Seong-Eun Moon, Young-Hak Kim, Louis Atallah, Edward Choi

    Abstract: Despite the remarkable progress in the development of predictive models for healthcare, applying these algorithms on a large scale has been challenging. Algorithms trained on a particular task, based on specific data formats available in a set of medical records, tend to not generalize well to other tasks or databases in which the data fields may differ. To address this challenge, we propose Gener… ▽ More

    Submitted 15 November, 2023; v1 submitted 20 July, 2022; originally announced July 2022.

    Comments: Accepted by IEEE Journal of Biomedical and Health Informatics

    Journal ref: IEEE Journal of Biomedical and Health Informatics 2024

  29. arXiv:2205.13155  [pdf, other

    cs.CR

    A Large Scale Study and Classification of VirusTotal Reports on Phishing and Malware URLs

    Authors: Euijin Choo, Mohamed Nabeel, Ravindu De Silva, Ting Yu, Issa Khalil

    Abstract: VirusTotal (VT) provides aggregated threat intelligence on various entities including URLs, IP addresses, and binaries. It is widely used by researchers and practitioners to collect ground truth and evaluate the maliciousness of entities. In this work, we provide a comprehensive analysis of VT URL scanning reports containing the results of 95 scanners for 1.577 Billion URLs over two years. Individ… ▽ More

    Submitted 26 May, 2022; originally announced May 2022.

  30. arXiv:2205.08290  [pdf, other

    cs.SE

    Literature Review to Collect Conceptual Variables of Scenario Methods for Establishing a Conceptual Scenario Framework

    Authors: Young-Min Baek, Esther Cho, Donghwan Shin, Doo-Hwan Bae

    Abstract: Over recent decades, scenarios and scenario-based software/system engineering have been actively employed as essential tools to handle intricate problems, validate requirements, and support stakeholders' communication. However, despite the widespread use of scenarios, there have been several challenges for engineers to more willingly utilize scenario-based engineering approaches (i.e., scenario me… ▽ More

    Submitted 17 May, 2022; originally announced May 2022.

    Comments: 22 pages, 7 figures

    MSC Class: 68M99 ACM Class: D.2.1

  31. Alexa as an Active Listener: How Backchanneling Can Elicit Self-Disclosure and Promote User Experience

    Authors: Eugene Cho, Nasim Motalebi, S. Shyam Sundar, Saeed Abdullah

    Abstract: Active listening is a well-known skill applied in human communication to build intimacy and elicit self-disclosure to support a wide variety of cooperative tasks. When applied to conversational UIs, active listening from machines can also elicit greater self-disclosure by signaling to the users that they are being heard, which can have positive outcomes. However, it takes considerable engineering… ▽ More

    Submitted 22 September, 2022; v1 submitted 21 April, 2022; originally announced April 2022.

    Comments: To appear in Proceedings of the ACM on Human-Computer Interaction (PACM HCI). The paper will be presented in CSCW 2022 (https://cscw.acm.org/2022)

  32. MyMove: Facilitating Older Adults to Collect In-Situ Activity Labels on a Smartwatch with Speech

    Authors: Young-Ho Kim, Diana Chou, Bongshin Lee, Margaret Danilovich, Amanda Lazar, David E. Conroy, Hernisa Kacorri, Eun Kyoung Choe

    Abstract: Current activity tracking technologies are largely trained on younger adults' data, which can lead to solutions that are not well-suited for older adults. To build activity trackers for older adults, it is crucial to collect training data with them. To this end, we examine the feasibility and challenges with older adults in collecting activity labels by leveraging speech. Specifically, we built My… ▽ More

    Submitted 31 March, 2022; originally announced April 2022.

    Comments: To appear at ACM CHI 2022. 21 pages, 3 figures, 7 tables. For the NSF funded project, visit https://mymove-collective.github.io

    ACM Class: H.5.2; H.5.1; I.2.1

  33. arXiv:2109.04655  [pdf, other

    cs.CL

    Zero-Shot Dialogue State Tracking via Cross-Task Transfer

    Authors: Zhaojiang Lin, Bing Liu, Andrea Madotto, Seungwhan Moon, Paul Crook, Zhenpeng Zhou, Zhiguang Wang, Zhou Yu, Eunjoon Cho, Rajen Subba, Pascale Fung

    Abstract: Zero-shot transfer learning for dialogue state tracking (DST) enables us to handle a variety of task-oriented dialogue domains without the expense of collecting in-domain data. In this work, we propose to transfer the \textit{cross-task} knowledge from general question answering (QA) corpora for the zero-shot DST task. Specifically, we propose TransferQA, a transferable generative QA model that se… ▽ More

    Submitted 9 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021

  34. arXiv:2105.04222  [pdf, other

    cs.CL

    Leveraging Slot Descriptions for Zero-Shot Cross-Domain Dialogue State Tracking

    Authors: Zhaojiang Lin, Bing Liu, Seungwhan Moon, Paul Crook, Zhenpeng Zhou, Zhiguang Wang, Zhou Yu, Andrea Madotto, Eunjoon Cho, Rajen Subba

    Abstract: Zero-shot cross-domain dialogue state tracking (DST) enables us to handle task-oriented dialogue in unseen domains without the expense of collecting in-domain data. In this paper, we propose a slot description enhanced generative approach for zero-shot cross-domain DST. Specifically, our model first encodes dialogue context and slots with a pre-trained self-attentive encoder, and generates slot va… ▽ More

    Submitted 10 May, 2021; originally announced May 2021.

    Comments: NAACL 2021

  35. arXiv:2104.05979  [pdf, other

    cs.HC

    Investigating Opportunities to Support Kids' Agency and Well-being: A Review of Kids' Wearables

    Authors: Rachael Zehrung, Lily Huang, Bongshin Lee, Eun Kyoung Choe

    Abstract: Wearable devices hold great potential for promoting children's health and well-being. However, research on kids' wearables is sparse and often focuses on their use in the context of parental surveillance. To gain insight into the current landscape of kids' wearables, we surveyed 47 wearable devices marketed for children. We collected rich data on the functionality of these devices and assessed how… ▽ More

    Submitted 13 April, 2021; originally announced April 2021.

    Comments: 20 pages, 1 figure, 5 tables

  36. Data@Hand: Fostering Visual Exploration of Personal Data on Smartphones Leveraging Speech and Touch Interaction

    Authors: Young-Ho Kim, Bongshin Lee, Arjun Srinivasan, Eun Kyoung Choe

    Abstract: Most mobile health apps employ data visualization to help people view their health and activity data, but these apps provide limited support for visual data exploration. Furthermore, despite its huge potential benefits, mobile visualization research in the personal data context is sparse. This work aims to empower people to easily navigate and compare their personal health data on smartphones by e… ▽ More

    Submitted 15 January, 2021; originally announced January 2021.

    Comments: To appear in ACM CHI 2021 Conference on Human Factors in Computing Systems; 16 pages, 6 figures, 5 tables

    ACM Class: H.5.2

    Journal ref: In CHI Conference on Human Factors in Computing Systems (CHI '21), May 8-13, 2021, Yokohama, Japan

  37. arXiv:2012.15504  [pdf, other

    cs.CL cs.AI

    Continual Learning in Task-Oriented Dialogue Systems

    Authors: Andrea Madotto, Zhaojiang Lin, Zhenpeng Zhou, Seungwhan Moon, Paul Crook, Bing Liu, Zhou Yu, Eunjoon Cho, Zhiguang Wang

    Abstract: Continual learning in task-oriented dialogue systems can allow us to add new domains and functionalities through time without incurring the high cost of a whole system retraining. In this paper, we propose a continual learning benchmark for task-oriented dialogue systems with 37 domains to be learned continuously in four settings, such as intent recognition, state tracking, natural language genera… ▽ More

    Submitted 31 December, 2020; originally announced December 2020.

    Comments: 9 pages

  38. arXiv:2012.13971  [pdf, other

    cs.LG cs.CR

    Time-Window Group-Correlation Support vs. Individual Features: A Detection of Abnormal Users

    Authors: Lun-Pin Yuan, Euijin Choo, Ting Yu, Issa Khalil, Sencun Zhu

    Abstract: Autoencoder-based anomaly detection methods have been used in identifying anomalous users from large-scale enterprise logs with the assumption that adversarial activities do not follow past habitual patterns. Most existing approaches typically build models by reconstructing single-day and individual-user behaviors. However, without capturing long-term signals and group-correlation signals, the mod… ▽ More

    Submitted 27 December, 2020; originally announced December 2020.

  39. arXiv:2010.12757  [pdf, other

    cs.CL

    Adding Chit-Chat to Enhance Task-Oriented Dialogues

    Authors: Kai Sun, Seungwhan Moon, Paul Crook, Stephen Roller, Becka Silvert, Bing Liu, Zhiguang Wang, Honglei Liu, Eunjoon Cho, Claire Cardie

    Abstract: Existing dialogue corpora and models are typically designed under two disjoint motives: while task-oriented systems focus on achieving functional goals (e.g., booking hotels), open-domain chatbots aim at making socially engaging conversations. In this work, we propose to integrate both types of systems by Adding Chit-Chat to ENhance Task-ORiented dialogues (ACCENTOR), with the goal of making virtu… ▽ More

    Submitted 1 May, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: To appear in NAACL-HLT 2021

  40. arXiv:2006.01460  [pdf, other

    cs.CL cs.AI cs.HC cs.LG

    Situated and Interactive Multimodal Conversations

    Authors: Seungwhan Moon, Satwik Kottur, Paul A. Crook, Ankita De, Shivani Poddar, Theodore Levin, David Whitney, Daniel Difranco, Ahmad Beirami, Eunjoon Cho, Rajen Subba, Alborz Geramifard

    Abstract: Next generation virtual assistants are envisioned to handle multimodal inputs (e.g., vision, memories of previous interactions, in addition to the user's utterances), and perform multimodal actions (e.g., displaying a route in addition to generating the system's utterance). We introduce Situated Interactive MultiModal Conversations (SIMMC) as a new direction aimed at training agents that take mult… ▽ More

    Submitted 10 November, 2020; v1 submitted 2 June, 2020; originally announced June 2020.

    Comments: 20 pages, 5 figures, 11 tables, accepted to COLING 2020

  41. Gen-LaneNet: A Generalized and Scalable Approach for 3D Lane Detection

    Authors: Yuliang Guo, Guang Chen, Peitao Zhao, Weide Zhang, Jinghao Miao, Jingao Wang, Tae Eun Choe

    Abstract: We present a generalized and scalable method, called Gen-LaneNet, to detect 3D lanes from a single image. The method, inspired by the latest state-of-the-art 3D-LaneNet, is a unified framework solving image encoding, spatial transform of features and 3D lane prediction in a single network. However, we propose unique designs for Gen-LaneNet in two folds. First, we introduce a new geometry-guided la… ▽ More

    Submitted 24 March, 2020; originally announced March 2020.

  42. arXiv:2003.09891  [pdf, other

    eess.AS cs.CL cs.SD

    Low Latency ASR for Simultaneous Speech Translation

    Authors: Thai Son Nguyen, Jan Niehues, Eunah Cho, Thanh-Le Ha, Kevin Kilgour, Markus Muller, Matthias Sperber, Sebastian Stueker, Alex Waibel

    Abstract: User studies have shown that reducing the latency of our simultaneous lecture translation system should be the most important goal. We therefore have worked on several techniques for reducing the latency for both components, the automatic speech recognition and the speech translation module. Since the commonly used commitment latency is not appropriate in our case of continuous stream decoding, we… ▽ More

    Submitted 22 March, 2020; originally announced March 2020.

  43. arXiv:2003.02245  [pdf, other

    cs.CL cs.LG

    Data Augmentation using Pre-trained Transformer Models

    Authors: Varun Kumar, Ashutosh Choudhary, Eunah Cho

    Abstract: Language model based pre-trained models such as BERT have provided significant gains across different NLP tasks. In this paper, we study different types of transformer based pre-trained models such as auto-regressive models (GPT-2), auto-encoder models (BERT), and seq2seq models (BART) for conditional data augmentation. We show that prepending the class labels to text sequences provides a simple y… ▽ More

    Submitted 31 January, 2021; v1 submitted 4 March, 2020; originally announced March 2020.

    Comments: In Proceedings of the 2nd Workshop on Life-long Learning for Spoken Language Systems @ AACL 2020; Code: https://github.com/varinf/TransformersDataAugmentation

  44. arXiv:1911.12080  [pdf, other

    cs.CR

    DeviceWatch: Identifying Compromised Mobile Devices through Network Traffic Analysis and Graph Inference

    Authors: Euijin Choo, Mohamed Nabeel, Mashael Alsabah, Issa Khalil, Ting Yu, Wei Wang

    Abstract: In this paper, we propose to identify compromised mobile devices from a network administrator's point of view. Intuitively, inadvertent users (and thus their devices) who download apps through untrustworthy markets are often allured to install malicious apps through in-app advertisement or phishing. We thus hypothesize that devices sharing a similar set of apps will have a similar probability of b… ▽ More

    Submitted 27 November, 2019; originally announced November 2019.

  45. arXiv:1910.04196  [pdf, other

    cs.CL

    Efficient Semi-Supervised Learning for Natural Language Understanding by Optimizing Diversity

    Authors: Eunah Cho, He Xie, John P. Lalor, Varun Kumar, William M. Campbell

    Abstract: Expanding new functionalities efficiently is an ongoing challenge for single-turn task-oriented dialogue systems. In this work, we explore functionality-specific semi-supervised learning via self-training. We consider methods that augment training data automatically from unlabeled data sets in a functionality-targeted manner. In addition, we examine multiple techniques for efficient selection of a… ▽ More

    Submitted 9 October, 2019; originally announced October 2019.

    Comments: IEEE Copyright. To appear at ASRU 2019

  46. A Comparative Evaluation of Animation and Small Multiples for Trend Visualization on Mobile Phones

    Authors: Matthew Brehmer, Bongshin Lee, Petra Isenberg, Eun Kyoung Choe

    Abstract: We compare the efficacy of animated and small multiples variants of scatterplots on mobile phones for comparing trends in multivariate datasets. Visualization is increasingly prevalent in mobile applications and mobile-first websites, yet there is little prior visualization research dedicated to small displays. In this paper, we build upon previous experimental research carried out on larger displ… ▽ More

    Submitted 12 October, 2019; v1 submitted 8 July, 2019; originally announced July 2019.

    Comments: Accepted for presentation at IEEE VIS 2019, October 20-25 in Vancouver, Canada. To appear in IEEE Transactions on Visualization and Computer Graphics

  47. arXiv:1708.00993  [pdf, other

    cs.CL

    Exploiting Linguistic Resources for Neural Machine Translation Using Multi-task Learning

    Authors: Jan Niehues, Eunah Cho

    Abstract: Linguistic resources such as part-of-speech (POS) tags have been extensively used in statistical machine translation (SMT) frameworks and have yielded better performances. However, usage of such linguistic annotations in neural machine translation (NMT) systems has been left under-explored. In this work, we show that multi-task learning is a successful and a easy approach to introduce an additio… ▽ More

    Submitted 3 August, 2017; originally announced August 2017.

    Comments: 9 pages, Second Conference on Machine Translation(WMT17)

  48. arXiv:1708.00563  [pdf, other

    cs.CL

    Analyzing Neural MT Search and Model Performance

    Authors: Jan Niehues, Eunah Cho, Thanh-Le Ha, Alex Waibel

    Abstract: In this paper, we offer an in-depth analysis about the modeling and search performance. We address the question if a more complex search algorithm is necessary. Furthermore, we investigate the question if more complex models which might only be applicable during rescoring are promising. By separating the search space and the modeling using $n$-best list reranking, we analyze the influence of bot… ▽ More

    Submitted 1 August, 2017; originally announced August 2017.

    Comments: 7 pages, First Workshop on Neural Machine Translation

  49. arXiv:1706.00180  [pdf, ps, other

    math.CO cs.IT

    A spectral characterisation of t-designs and its applications

    Authors: Eun-Kyung Cho, Cunsheng Ding, Jong Yoon Hyun

    Abstract: There are two standard approaches to the construction of $t$-designs. The first one is based on permutation group actions on certain base blocks. The second one is based on coding theory. The objective of this paper is to give a spectral characterisation of all $t$-designs by introducing a characteristic Boolean function of a $t$-design. The spectra of the characteristic functions of $(n-2)/2$-… ▽ More

    Submitted 9 June, 2018; v1 submitted 1 June, 2017; originally announced June 2017.

    MSC Class: 05B05; 51E10; 94B15

  50. arXiv:1610.05243  [pdf, other

    cs.CL

    Pre-Translation for Neural Machine Translation

    Authors: Jan Niehues, Eunah Cho, Thanh-Le Ha, Alex Waibel

    Abstract: Recently, the development of neural machine translation (NMT) has significantly improved the translation quality of automatic machine translation. While most sentences are more accurate and fluent than translations by statistical machine translation (SMT)-based systems, in some cases, the NMT system produces translations that have a completely different meaning. This is especially the case when ra… ▽ More

    Submitted 17 October, 2016; originally announced October 2016.

    Comments: 9 pages. To appear in COLING 2016