Skip to main content

Showing 1–50 of 180 results for author: Choi, D

  1. arXiv:2406.19634  [pdf, other

    cs.RO

    CLOi-Mapper: Consistent, Lightweight, Robust, and Incremental Mapper With Embedded Systems for Commercial Robot Services

    Authors: DongKi Noh, Hyungtae Lim, Gyuho Eoh, Duckyu Choi, Jeongsik Choi, Hyunjun Lim, SeungMin Baek, Hyun Myung

    Abstract: In commercial autonomous service robots with several form factors, simultaneous localization and mapping (SLAM) is an essential technology for providing proper services such as cleaning and guidance. Such robots require SLAM algorithms suitable for specific applications and environments. Hence, several SLAM frameworks have been proposed to address various requirements in the past decade. However,… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Journal ref: IEEE Robotics and Automation Letters, 2024

  2. arXiv:2406.14546  [pdf, other

    cs.CL cs.AI cs.LG

    Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data

    Authors: Johannes Treutlein, Dami Choi, Jan Betley, Cem Anil, Samuel Marks, Roger Baker Grosse, Owain Evans

    Abstract: One way to address safety risks from large language models (LLMs) is to censor dangerous knowledge from their training data. While this removes the explicit information, implicit information can remain scattered across various training documents. Could an LLM infer the censored knowledge by piecing together these implicit hints? As a step towards answering this question, we study inductive out-of-… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  3. arXiv:2406.09138  [pdf, other

    cs.CL

    Leveraging Explicit Reasoning for Inference Integration in Commonsense-Augmented Dialogue Models

    Authors: Sarah E. Finch, Jinho D. Choi

    Abstract: Open-domain dialogue systems need to grasp social commonsense to understand and respond effectively to human users. Commonsense-augmented dialogue models have been proposed that aim to infer commonsense knowledge from dialogue contexts in order to improve response quality. However, existing approaches to commonsense-augmented dialogue rely on implicit reasoning to integrate commonsense inferences… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  4. arXiv:2406.07800  [pdf, other

    cs.LG cs.DC

    Regularizing and Aggregating Clients with Class Distribution for Personalized Federated Learning

    Authors: Gyuejeong Lee, Daeyoung Choi

    Abstract: Personalized federated learning (PFL) enables customized models for clients with varying data distributions. However, existing PFL methods often incur high computational and communication costs, limiting their practical application. This paper proposes a novel PFL method, Class-wise Federated Averaging (cwFedAVG), that performs Federated Averaging (FedAVG) class-wise, creating multiple global mode… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  5. arXiv:2406.03663  [pdf

    eess.IV cs.LG q-bio.QM

    A Hybrid Deep Learning Classification of Perimetric Glaucoma Using Peripapillary Nerve Fiber Layer Reflectance and Other OCT Parameters from Three Anatomy Regions

    Authors: Ou Tan, David S. Greenfield, Brian A. Francis, Rohit Varma, Joel S. Schuman, David Huang, Dongseok Choi

    Abstract: Precis: A hybrid deep-learning model combines NFL reflectance and other OCT parameters to improve glaucoma diagnosis. Objective: To investigate if a deep learning model could be used to combine nerve fiber layer (NFL) reflectance and other OCT parameters for glaucoma diagnosis. Patients and Methods: This is a prospective observational study where of 106 normal subjects and 164 perimetric glaucoma… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 12 pages

  6. arXiv:2405.12856  [pdf, other

    stat.ML cs.CL cs.LG

    LLM Processes: Numerical Predictive Distributions Conditioned on Natural Language

    Authors: James Requeima, John Bronskill, Dami Choi, Richard E. Turner, David Duvenaud

    Abstract: Machine learning practitioners often face significant challenges in formally integrating their prior knowledge and beliefs into predictive models, limiting the potential for nuanced and context-aware analyses. Moreover, the expertise needed to integrate this prior knowledge into probabilistic modeling typically limits the application of these models to specialists. Our goal is to build a regressio… ▽ More

    Submitted 25 May, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

  7. arXiv:2405.12468  [pdf, other

    cs.CL

    Diverse and Effective Synthetic Data Generation for Adaptable Zero-Shot Dialogue State Tracking

    Authors: James D. Finch, Jinho D. Choi

    Abstract: We demonstrate substantial performance gains in zero-shot dialogue state tracking (DST) by enhancing training data diversity through synthetic data generation. Existing DST datasets are severely limited in the number of application domains and slot types they cover due to the high costs of data collection, restricting their adaptability to new domains. This work addresses this challenge with a nov… ▽ More

    Submitted 13 June, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

  8. arXiv:2405.11178  [pdf, other

    cs.CL

    Automating PTSD Diagnostics in Clinical Interviews: Leveraging Large Language Models for Trauma Assessments

    Authors: Sichang Tu, Abigail Powers, Natalie Merrill, Negar Fani, Sierra Carter, Stephen Doogan, Jinho D. Choi

    Abstract: The shortage of clinical workforce presents significant challenges in mental healthcare, limiting access to formal diagnostics and services. We aim to tackle this shortage by integrating a customized large language model (LLM) into the workflow, thus promoting equity in mental healthcare for the general population. Although LLMs have showcased their capability in clinical decision-making, their ad… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

  9. arXiv:2405.04497  [pdf, other

    cs.HC

    Unveiling Disparities in Web Task Handling Between Human and Web Agent

    Authors: Kihoon Son, Jinhyeon Kwon, DaEun Choi, Tae Soo Kim, Young-Ho Kim, Sangdoo Yun, Juho Kim

    Abstract: With the advancement of Large-Language Models (LLMs) and Large Vision-Language Models (LVMs), agents have shown significant capabilities in various tasks, such as data analysis, gaming, or code generation. Recently, there has been a surge in research on web agents, capable of performing tasks within the web environment. However, the web poses unforeseeable scenarios, challenging the generalizabili… ▽ More

    Submitted 8 May, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

  10. arXiv:2405.00523  [pdf, other

    cs.AI cs.CL

    CookingSense: A Culinary Knowledgebase with Multidisciplinary Assertions

    Authors: Donghee Choi, Mogan Gim, Donghyeon Park, Mujeen Sung, Hyunjae Kim, Jaewoo Kang, Jihun Choi

    Abstract: This paper introduces CookingSense, a descriptive collection of knowledge assertions in the culinary domain extracted from various sources, including web data, scientific papers, and recipes, from which knowledge covering a broad range of aspects is acquired. CookingSense is constructed through a series of dictionary-based filtering and language model-based semantic filtering techniques, which res… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: LREC-COLING 2024 Accepted

  11. arXiv:2404.06621  [pdf, other

    cs.CL

    What is Your Favorite Gender, MLM? Gender Bias Evaluation in Multilingual Masked Language Models

    Authors: Jeongrok Yu, Seong Ug Kim, Jacob Choi, Jinho D. Choi

    Abstract: Bias is a disproportionate prejudice in favor of one side against another. Due to the success of transformer-based Masked Language Models (MLMs) and their impact on many NLP tasks, a systematic evaluation of bias in these models is needed more than ever. While many studies have evaluated gender bias in English MLMs, only a few works have been conducted for the task in other languages. This paper p… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  12. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  13. arXiv:2404.00676  [pdf, other

    cs.CV cs.GR

    OmniLocalRF: Omnidirectional Local Radiance Fields from Dynamic Videos

    Authors: Dongyoung Choi, Hyeonjoong Jang, Min H. Kim

    Abstract: Omnidirectional cameras are extensively used in various applications to provide a wide field of vision. However, they face a challenge in synthesizing novel views due to the inevitable presence of dynamic objects, including the photographer, in their wide field of view. In this paper, we introduce a new approach called Omnidirectional Local Radiance Fields (OmniLocalRF) that can render static-only… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024

  14. arXiv:2404.00376  [pdf, other

    cs.CL

    Small Language Models Learn Enhanced Reasoning Skills from Medical Textbooks

    Authors: Hyunjae Kim, Hyeon Hwang, Jiwoo Lee, Sihyeon Park, Dain Kim, Taewhoo Lee, Chanwoong Yoon, Jiwoong Sohn, Donghee Choi, Jaewoo Kang

    Abstract: While recent advancements in commercial large language models (LM) have shown promising results in medical tasks, their closed-source nature poses significant privacy and security concerns, hindering their widespread use in the medical field. Despite efforts to create open-source models, their limited parameters often result in insufficient multi-step reasoning capabilities required for solving co… ▽ More

    Submitted 30 June, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

    Comments: Added new LLaMA-3-based models and experiments on NEJM case challenges

  15. arXiv:2403.14110  [pdf, other

    cs.LG cs.AI

    Heuristic Algorithm-based Action Masking Reinforcement Learning (HAAM-RL) with Ensemble Inference Method

    Authors: Kyuwon Choi, Cheolkyun Rho, Taeyoun Kim, Daewoo Choi

    Abstract: This paper presents a novel reinforcement learning (RL) approach called HAAM-RL (Heuristic Algorithm-based Action Masking Reinforcement Learning) for optimizing the color batching re-sequencing problem in automobile painting processes. The existing heuristic algorithms have limitations in adequately reflecting real-world constraints and accurately predicting logistics performance. Our methodology… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: 7 pages, 8 figures

  16. arXiv:2403.06252  [pdf, other

    cs.HC

    Demystifying Tacit Knowledge in Graphic Design: Characteristics, Instances, Approaches, and Guidelines

    Authors: Kihoon Son, DaEun Choi, Tae Soo Kim, Juho Kim

    Abstract: Despite the growing demand for professional graphic design knowledge, the tacit nature of design inhibits knowledge sharing. However, there is a limited understanding on the characteristics and instances of tacit knowledge in graphic design. In this work, we build a comprehensive set of tacit knowledge characteristics through a literature review. Through interviews with 10 professional graphic des… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

  17. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  18. arXiv:2403.03082  [pdf, other

    cs.LG cs.AI cs.CV

    Recall-Oriented Continual Learning with Generative Adversarial Meta-Model

    Authors: Haneol Kang, Dong-Wan Choi

    Abstract: The stability-plasticity dilemma is a major challenge in continual learning, as it involves balancing the conflicting objectives of maintaining performance on previous tasks while learning new tasks. In this paper, we propose the recall-oriented continual learning framework to address this challenge. Inspired by the human brain's ability to separate the mechanisms responsible for stability and pla… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: Accepted in AAAI-2024 (Oral presentation)

  19. arXiv:2402.14340  [pdf, other

    cs.CV

    TIE-KD: Teacher-Independent and Explainable Knowledge Distillation for Monocular Depth Estimation

    Authors: Sangwon Choi, Daejune Choi, Duksu Kim

    Abstract: Monocular depth estimation (MDE) is essential for numerous applications yet is impeded by the substantial computational demands of accurate deep learning models. To mitigate this, we introduce a novel Teacher-Independent Explainable Knowledge Distillation (TIE-KD) framework that streamlines the knowledge transfer from complex teacher models to compact student networks, eliminating the need for arc… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: 13 pages, 8 figures, under review for a journal

  20. arXiv:2402.12821  [pdf, other

    cs.CL cs.LG

    Identifying Factual Inconsistencies in Summaries: Grounding Model Inference via Task Taxonomy

    Authors: Liyan Xu, Zhenlin Su, Mo Yu, Jin Xu, Jinho D. Choi, Jie Zhou, Fei Liu

    Abstract: Factual inconsistencies pose a significant hurdle for the faithful summarization by generative models. While a major direction to enhance inconsistency detection is to derive stronger Natural Language Inference (NLI) models, we propose an orthogonal aspect that underscores the importance of incorporating task-specific taxonomy into the inference. To this end, we consolidate key error types of inco… ▽ More

    Submitted 19 June, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  21. arXiv:2402.12406  [pdf, other

    cs.LG cs.AI cs.CV

    Teacher as a Lenient Expert: Teacher-Agnostic Data-Free Knowledge Distillation

    Authors: Hyunjune Shin, Dong-Wan Choi

    Abstract: Data-free knowledge distillation (DFKD) aims to distill pretrained knowledge to a student model with the help of a generator without using original data. In such data-free scenarios, achieving stable performance of DFKD is essential due to the unavailability of validation data. Unfortunately, this paper has discovered that existing DFKD methods are quite sensitive to different teacher models, occa… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: Accepted in AAAI-2024

  22. arXiv:2401.15471  [pdf, other

    cs.CL

    ConvoSense: Overcoming Monotonous Commonsense Inferences for Conversational AI

    Authors: Sarah E. Finch, Jinho D. Choi

    Abstract: Mastering commonsense understanding and reasoning is a pivotal skill essential for conducting engaging conversations. While there have been several attempts to create datasets that facilitate commonsense inferences in dialogue contexts, existing datasets tend to lack in-depth details, restate information already present in the conversation, and often fail to capture the multifaceted nature of comm… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

    Comments: accepted to TACL 2024; final author's version of paper; pre-MIT Press publication version

  23. arXiv:2312.15514  [pdf, other

    cs.CV cs.AI

    Towards Reliable AI Model Deployments: Multiple Input Mixup for Out-of-Distribution Detection

    Authors: Dasol Choi, Dongbin Na

    Abstract: Recent remarkable success in the deep-learning industries has unprecedentedly increased the need for reliable model deployment. For example, the model should alert the user if the produced model outputs might not be reliable. Previous studies have proposed various methods to solve the Out-of-Distribution (OOD) detection problem, however, they generally require a burden of resources. In this work,… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

    Comments: Accepted to the AAAI 2024 Workshop on Deployable AI (DAI)

  24. arXiv:2312.15449  [pdf, other

    cs.CV

    iDet3D: Towards Efficient Interactive Object Detection for LiDAR Point Clouds

    Authors: Dongmin Choi, Wonwoo Cho, Kangyeol Kim, Jaegul Choo

    Abstract: Accurately annotating multiple 3D objects in LiDAR scenes is laborious and challenging. While a few previous studies have attempted to leverage semi-automatic methods for cost-effective bounding box annotation, such methods have limitations in efficiently handling numerous multi-class objects. To effectively accelerate 3D annotation pipelines, we propose iDet3D, an efficient interactive 3D object… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

    Comments: Accepted to AAAI 2024

  25. arXiv:2312.14129  [pdf, other

    cs.LG cs.AI cs.IR

    WellFactor: Patient Profiling using Integrative Embedding of Healthcare Data

    Authors: Dongjin Choi, Andy Xiang, Ozgur Ozturk, Deep Shrestha, Barry Drake, Hamid Haidarian, Faizan Javed, Haesun Park

    Abstract: In the rapidly evolving healthcare industry, platforms now have access to not only traditional medical records, but also diverse data sets encompassing various patient interactions, such as those from healthcare web portals. To address this rich diversity of data, we introduce WellFactor: a method that derives patient profiles by integrating information from these sources. Central to our approach… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: 2023 IEEE International Conference on Big Data (IEEE BigData 2023)

  26. arXiv:2312.11949  [pdf, other

    cs.HC

    CreativeConnect: Supporting Reference Recombination for Graphic Design Ideation with Generative AI

    Authors: DaEun Choi, Sumin Hong, Jeongeon Park, John Joon Young Chung, Juho Kim

    Abstract: Graphic designers often get inspiration through the recombination of references. Our formative study (N=6) reveals that graphic designers focus on conceptual keywords during this process, and want support for discovering the keywords, expanding them, and exploring diverse recombination options of them, while still having room for designers' creativity. We propose CreativeConnect, a system with gen… ▽ More

    Submitted 6 March, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

  27. arXiv:2312.06134  [pdf, other

    cs.CL cs.LG

    Order Matters in the Presence of Dataset Imbalance for Multilingual Learning

    Authors: Dami Choi, Derrick Xin, Hamid Dadkhahi, Justin Gilmer, Ankush Garg, Orhan Firat, Chih-Kuan Yeh, Andrew M. Dai, Behrooz Ghorbani

    Abstract: In this paper, we empirically study the optimization dynamics of multi-task learning, particularly focusing on those that govern a collection of tasks with significant data imbalance. We present a simple yet effective method of pre-training on high-resource tasks, followed by fine-tuning on a mixture of high/low-resource tasks. We provide a thorough empirical study and analysis of this method's be… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  28. arXiv:2311.11602   

    cs.CV cs.AI

    A Multi-In-Single-Out Network for Video Frame Interpolation without Optical Flow

    Authors: Jaemin Lee, Minseok Seo, Sangwoo Lee, Hyobin Park, Dong-Geol Choi

    Abstract: In general, deep learning-based video frame interpolation (VFI) methods have predominantly focused on estimating motion vectors between two input frames and warping them to the target time. While this approach has shown impressive performance for linear motion between two input frames, it exhibits limitations when dealing with occlusions and nonlinear movements. Recently, generative models have be… ▽ More

    Submitted 4 December, 2023; v1 submitted 20 November, 2023; originally announced November 2023.

    Comments: Discovering a problem with the manuscript

  29. arXiv:2311.03383  [pdf, other

    cs.LG cs.AI cs.AR cs.HC

    Toward Reinforcement Learning-based Rectilinear Macro Placement Under Human Constraints

    Authors: Tuyen P. Le, Hieu T. Nguyen, Seungyeol Baek, Taeyoun Kim, Jungwoo Lee, Seongjung Kim, Hyunjin Kim, Misu Jung, Daehoon Kim, Seokyong Lee, Daewoo Choi

    Abstract: Macro placement is a critical phase in chip design, which becomes more intricate when involving general rectilinear macros and layout areas. Furthermore, macro placement that incorporates human-like constraints, such as design hierarchy and peripheral bias, has the potential to significantly reduce the amount of additional manual labor required from designers. This study proposes a methodology tha… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: Fast ML for Science @ ICCAD 2023

  30. arXiv:2311.02240  [pdf, other

    cs.CV

    Towards Machine Unlearning Benchmarks: Forgetting the Personal Identities in Facial Recognition Systems

    Authors: Dasol Choi, Dongbin Na

    Abstract: Machine unlearning is a crucial tool for enabling a classification model to forget specific data that are used in the training time. Recently, various studies have presented machine unlearning algorithms and evaluated their methods on several datasets. However, most of the current machine unlearning algorithms have been evaluated solely on traditional computer vision datasets such as CIFAR-10, MNI… ▽ More

    Submitted 24 December, 2023; v1 submitted 3 November, 2023; originally announced November 2023.

    Comments: Accepted to the AAAI 2024 Workshop on Privacy-Preserving Artificial Intelligence (PPAI)

  31. arXiv:2310.16538  [pdf, other

    cs.CL cs.AI cs.LG

    FedTherapist: Mental Health Monitoring with User-Generated Linguistic Expressions on Smartphones via Federated Learning

    Authors: Jaemin Shin, Hyungjun Yoon, Seungjoo Lee, Sungjoon Park, Yunxin Liu, Jinho D. Choi, Sung-Ju Lee

    Abstract: Psychiatrists diagnose mental disorders via the linguistic use of patients. Still, due to data privacy, existing passive mental health monitoring systems use alternative features such as activity, app usage, and location via mobile devices. We propose FedTherapist, a mobile mental health monitoring system that utilizes continuous speech and keyboard input in a privacy-preserving way via federated… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: Accepted to the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)

  32. arXiv:2310.16318  [pdf, other

    cs.LG cs.AI

    Modality-Agnostic Self-Supervised Learning with Meta-Learned Masked Auto-Encoder

    Authors: Huiwon Jang, Jihoon Tack, Daewon Choi, Jongheon Jeong, Jinwoo Shin

    Abstract: Despite its practical importance across a wide range of modalities, recent advances in self-supervised learning (SSL) have been primarily focused on a few well-curated domains, e.g., vision and language, often relying on their domain-specific knowledge. For example, Masked Auto-Encoder (MAE) has become one of the popular architectures in these domains, but less has explored its potential in other… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Accepted to NeurIPS 2023. The first two authors contributed equally

  33. arXiv:2310.04313  [pdf, other

    cs.CL

    KoMultiText: Large-Scale Korean Text Dataset for Classifying Biased Speech in Real-World Online Services

    Authors: Dasol Choi, Jooyoung Song, Eunsun Lee, Jinwoo Seo, Heejune Park, Dongbin Na

    Abstract: With the growth of online services, the need for advanced text classification algorithms, such as sentiment analysis and biased text detection, has become increasingly evident. The anonymous nature of online services often leads to the presence of biased and harmful language, posing challenges to maintaining the health of online communities. This phenomenon is especially relevant in South Korea, w… ▽ More

    Submitted 12 November, 2023; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: Accepted to the NeurIPS 2023 Workshop on Socially Responsible Language Modelling Research (SoLaR)

  34. arXiv:2310.01287  [pdf, other

    cs.HC

    GenQuery: Supporting Expressive Visual Search with Generative Models

    Authors: Kihoon Son, DaEun Choi, Tae Soo Kim, Young-Ho Kim, Juho Kim

    Abstract: Designers rely on visual search to explore and develop ideas in early design stages. However, designers can struggle to identify suitable text queries to initiate a search or to discover images for similarity-based search that can adequately express their intent. We propose GenQuery, a novel system that integrates generative models into the visual search process. GenQuery can automatically elabora… ▽ More

    Submitted 4 March, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: 18 pages and 12 figures

  35. arXiv:2309.12047  [pdf, other

    cs.CV cs.GR eess.IV

    Self-Calibrating, Fully Differentiable NLOS Inverse Rendering

    Authors: Kiseok Choi, Inchul Kim, Dongyoung Choi, Julio Marco, Diego Gutierrez, Min H. Kim

    Abstract: Existing time-resolved non-line-of-sight (NLOS) imaging methods reconstruct hidden scenes by inverting the optical paths of indirect illumination measured at visible relay surfaces. These methods are prone to reconstruction artifacts due to inversion ambiguities and capture noise, which are typically mitigated through the manual selection of filtering functions and parameters. We introduce a fully… ▽ More

    Submitted 25 September, 2023; v1 submitted 21 September, 2023; originally announced September 2023.

    Journal ref: Proceedings of ACM SIGGRAPH Asia 2023 (December 2023)

  36. arXiv:2309.07998  [pdf, other

    cs.CL

    Exploring the Impact of Human Evaluator Group on Chat-Oriented Dialogue Evaluation

    Authors: Sarah E. Finch, James D. Finch, Jinho D. Choi

    Abstract: Human evaluation has been widely accepted as the standard for evaluating chat-oriented dialogue systems. However, there is a significant variation in previous work regarding who gets recruited as evaluators. Evaluator groups such as domain experts, university students, and professional annotators have been used to assess and compare dialogue systems, although it is unclear to what extent the choic… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

  37. arXiv:2309.07677  [pdf, other

    cs.CL

    Aligning Speakers: Evaluating and Visualizing Text-based Diarization Using Efficient Multiple Sequence Alignment (Extended Version)

    Authors: Chen Gong, Peilin Wu, Jinho D. Choi

    Abstract: This paper presents a novel evaluation approach to text-based speaker diarization (SD), tackling the limitations of traditional metrics that do not account for any contextual information in text. Two new metrics are proposed, Text-based Diarization Error Rate and Diarization F1, which perform utterance- and word-level evaluations by aligning tokens in reference and hypothesis transcripts. Our metr… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: Accepted to the 35th IEEE International Conference on Tools with Artificial Intelligence (ICTAI) 2023

  38. arXiv:2309.06490  [pdf, other

    cs.CL

    Leveraging Large Language Models for Automated Dialogue Analysis

    Authors: Sarah E. Finch, Ellie S. Paek, Jinho D. Choi

    Abstract: Developing high-performing dialogue systems benefits from the automatic identification of undesirable behaviors in system responses. However, detecting such behaviors remains challenging, as it draws on a breadth of general knowledge and understanding of conversational practices. Although recent research has focused on building specialized classifiers for detecting specific dialogue behaviors, the… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: Accepted to SIGDIAL 2023

  39. arXiv:2309.06460  [pdf, other

    cs.CL

    Widely Interpretable Semantic Representation: Frameless Meaning Representation for Broader Applicability

    Authors: Lydia Feng, Gregor Williamson, Han He, Jinho D. Choi

    Abstract: This paper presents a novel semantic representation, WISeR, that overcomes challenges for Abstract Meaning Representation (AMR). Despite its strengths, AMR is not easily applied to languages or domains without predefined semantic frames, and its use of numbered arguments results in semantic role labels, which are not directly interpretable and are semantically overloaded for parsers. We examine th… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

  40. Patient Clustering via Integrated Profiling of Clinical and Digital Data

    Authors: Dongjin Choi, Andy Xiang, Ozgur Ozturk, Deep Shrestha, Barry Drake, Hamid Haidarian, Faizan Javed, Haesun Park

    Abstract: We introduce a novel profile-based patient clustering model designed for clinical data in healthcare. By utilizing a method grounded on constrained low-rank approximation, our model takes advantage of patients' clinical data and digital interaction data, including browsing and search, to construct patient profiles. As a result of the method, nonnegative embedding vectors are generated, serving as… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

    Comments: Accepted for the Short Paper track of CIKM'23, October 21-25, 2023, Birmingham, United Kingdom

  41. arXiv:2308.03526  [pdf, other

    cs.LG cs.AI

    AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning

    Authors: Michaël Mathieu, Sherjil Ozair, Srivatsan Srinivasan, Caglar Gulcehre, Shangtong Zhang, Ray Jiang, Tom Le Paine, Richard Powell, Konrad Żołna, Julian Schrittwieser, David Choi, Petko Georgiev, Daniel Toyama, Aja Huang, Roman Ring, Igor Babuschkin, Timo Ewalds, Mahyar Bordbar, Sarah Henderson, Sergio Gómez Colmenarejo, Aäron van den Oord, Wojciech Marian Czarnecki, Nando de Freitas, Oriol Vinyals

    Abstract: StarCraft II is one of the most challenging simulated reinforcement learning environments; it is partially observable, stochastic, multi-agent, and mastering StarCraft II requires strategic planning over long time horizons with real-time low-level execution. It also has an active professional competitive scene. StarCraft II is uniquely suited for advancing offline RL algorithms, both because of it… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: 32 pages, 13 figures, previous version published as a NeurIPS 2021 workshop: https://openreview.net/forum?id=Np8Pumfoty

  42. Distributionally Robust Safety Filter for Learning-Based Control in Active Distribution Systems

    Authors: Hoang Tien Nguyen, Dae-Hyun Choi

    Abstract: Operational constraint violations may occur when deep reinforcement learning (DRL) agents interact with real-world active distribution systems to learn their optimal policies during training. This letter presents a universal distributionally robust safety filter (DRSF) using which any DRL agent can reduce the constraint violations of distribution systems significantly during training while maintai… ▽ More

    Submitted 30 July, 2023; originally announced July 2023.

  43. arXiv:2307.00682  [pdf, other

    cs.LG cs.CR

    Tools for Verifying Neural Models' Training Data

    Authors: Dami Choi, Yonadav Shavit, David Duvenaud

    Abstract: It is important that consumers and regulators can verify the provenance of large neural models to evaluate their capabilities and risks. We introduce the concept of a "Proof-of-Training-Data": any protocol that allows a model trainer to convince a Verifier of the training data that produced a set of model weights. Such protocols could verify the amount and kind of data and compute used to train th… ▽ More

    Submitted 2 July, 2023; originally announced July 2023.

  44. arXiv:2306.12626  [pdf, other

    cs.CV eess.IV

    1st Place Solution to MultiEarth 2023 Challenge on Multimodal SAR-to-EO Image Translation

    Authors: Jingi Ju, Hyeoncheol Noh, Minwoo Kim, Dong-Geol Choi

    Abstract: The Multimodal Learning for Earth and Environment Workshop (MultiEarth 2023) aims to harness the substantial amount of remote sensing data gathered over extensive periods for the monitoring and analysis of Earth's ecosystems'health. The subtask, Multimodal SAR-to-EO Image Translation, involves the use of robust SAR data, even under adverse weather and lighting conditions, transforming it into high… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

  45. arXiv:2305.18350  [pdf, other

    cs.LG cs.CL cs.IR

    Towards Open-World Product Attribute Mining: A Lightly-Supervised Approach

    Authors: Liyan Xu, Chenwei Zhang, Xian Li, Jingbo Shang, Jinho D. Choi

    Abstract: We present a new task setting for attribute mining on e-commerce products, serving as a practical solution to extract open-world attributes without extensive human intervention. Our supervision comes from a high-quality seed attribute set bootstrapped from existing resources, and we aim to expand the attribute vocabulary of existing seed types, and also to discover any new attribute types automati… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL 2023

  46. arXiv:2304.12972  [pdf

    cs.CV physics.chem-ph

    Automated Solubility Analysis System and Method Using Computer Vision and Machine Learning

    Authors: Gahee Kim, Minwoo Jeon, Hyun Do Choi, Jun Ki Cho, Youn-Suk Choi, Hyoseok Hwang

    Abstract: In this study, a novel active solubility sensing device using computer vision is proposed to improve separation purification performance and prevent malfunctions of separation equipment such as preparative liquid chromatographers and evaporators. The proposed device actively measures the solubility by transmitting a solution using a background image. The proposed system is a combination of a devic… ▽ More

    Submitted 7 April, 2023; originally announced April 2023.

    Comments: 20 pages, 6 figures, 3 tables

  47. arXiv:2304.10739  [pdf, other

    cs.CL cs.AI cs.LG math.NA

    KitchenScale: Learning to predict ingredient quantities from recipe contexts

    Authors: Donghee Choi, Mogan Gim, Samy Badreddine, Hajung Kim, Donghyeon Park, Jaewoo Kang

    Abstract: Determining proper quantities for ingredients is an essential part of cooking practice from the perspective of enriching tastiness and promoting healthiness. We introduce KitchenScale, a fine-tuned Pre-trained Language Model (PLM) that predicts a target ingredient's quantity and measurement unit given its recipe context. To effectively train our KitchenScale model, we formulate an ingredient quant… ▽ More

    Submitted 21 April, 2023; originally announced April 2023.

    Comments: Expert Systems with Applications 2023, Demo: http://kitchenscale.korea.ac.kr/

    ACM Class: H.4.m

    Journal ref: Expert Systems with Applications, Volume 224, 15 August 2023, 120041

  48. InterviewBot: Real-Time End-to-End Dialogue System to Interview Students for College Admission

    Authors: Zihao Wang, Nathan Keyes, Terry Crawford, Jinho D. Choi

    Abstract: We present the InterviewBot that dynamically integrates conversation history and customized topics into a coherent embedding space to conduct 10 mins hybrid-domain (open and closed) conversations with foreign students applying to U.S. colleges for assessing their academic and cultural readiness. To build a neural-based end-to-end dialogue model, 7,361 audio recordings of human-to-human interviews… ▽ More

    Submitted 5 September, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

    Journal ref: Information 2023, 14, 460

  49. arXiv:2303.14828  [pdf, other

    cs.CV

    VisDA 2022 Challenge: Domain Adaptation for Industrial Waste Sorting

    Authors: Dina Bashkirova, Samarth Mishra, Diala Lteif, Piotr Teterwak, Donghyun Kim, Fadi Alladkani, James Akl, Berk Calli, Sarah Adel Bargal, Kate Saenko, Daehan Kim, Minseok Seo, YoungJin Jeon, Dong-Geol Choi, Shahaf Ettedgui, Raja Giryes, Shady Abu-Hussein, Binhui Xie, Shuang Li

    Abstract: Label-efficient and reliable semantic segmentation is essential for many real-life applications, especially for industrial settings with high visual diversity, such as waste sorting. In industrial waste sorting, one of the biggest challenges is the extreme diversity of the input stream depending on factors like the location of the sorting facility, the equipment available in the facility, and the… ▽ More

    Submitted 26 March, 2023; originally announced March 2023.

    Comments: Proceedings of Machine Learning Research

  50. arXiv:2303.11606  [pdf, other

    cs.CV

    CAFS: Class Adaptive Framework for Semi-Supervised Semantic Segmentation

    Authors: Jingi Ju, Hyeoncheol Noh, Yooseung Wang, Minseok Seo, Dong-Geol Choi

    Abstract: Semi-supervised semantic segmentation learns a model for classifying pixels into specific classes using a few labeled samples and numerous unlabeled images. The recent leading approach is consistency regularization by selftraining with pseudo-labeling pixels having high confidences for unlabeled images. However, using only highconfidence pixels for self-training may result in losing much of the in… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: 13 pages, 9 figures