Skip to main content

Showing 1–50 of 344 results for author: Choi, D

  1. arXiv:2407.07313  [pdf, other

    cs.CL

    ESM+: Modern Insights into Perspective on Text-to-SQL Evaluation in the Age of Large Language Models

    Authors: Benjamin Ascoli, Ram Kandikonda, Jinho D. Choi

    Abstract: The task of Text-to-SQL enables anyone to retrieve information from SQL databases using natural language. Despite several challenges, recent models have made remarkable advancements in this task using large language models (LLMs). Interestingly, we find that LLM-based models without fine-tuning exhibit distinct natures compared to their fine-tuned counterparts, leading to inadequacies in current e… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  2. arXiv:2406.19634  [pdf, other

    cs.RO

    CLOi-Mapper: Consistent, Lightweight, Robust, and Incremental Mapper With Embedded Systems for Commercial Robot Services

    Authors: DongKi Noh, Hyungtae Lim, Gyuho Eoh, Duckyu Choi, Jeongsik Choi, Hyunjun Lim, SeungMin Baek, Hyun Myung

    Abstract: In commercial autonomous service robots with several form factors, simultaneous localization and mapping (SLAM) is an essential technology for providing proper services such as cleaning and guidance. Such robots require SLAM algorithms suitable for specific applications and environments. Hence, several SLAM frameworks have been proposed to address various requirements in the past decade. However,… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Journal ref: IEEE Robotics and Automation Letters, 2024

  3. arXiv:2406.14546  [pdf, other

    cs.CL cs.AI cs.LG

    Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data

    Authors: Johannes Treutlein, Dami Choi, Jan Betley, Cem Anil, Samuel Marks, Roger Baker Grosse, Owain Evans

    Abstract: One way to address safety risks from large language models (LLMs) is to censor dangerous knowledge from their training data. While this removes the explicit information, implicit information can remain scattered across various training documents. Could an LLM infer the censored knowledge by piecing together these implicit hints? As a step towards answering this question, we study inductive out-of-… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  4. arXiv:2406.09138  [pdf, other

    cs.CL

    Leveraging Explicit Reasoning for Inference Integration in Commonsense-Augmented Dialogue Models

    Authors: Sarah E. Finch, Jinho D. Choi

    Abstract: Open-domain dialogue systems need to grasp social commonsense to understand and respond effectively to human users. Commonsense-augmented dialogue models have been proposed that aim to infer commonsense knowledge from dialogue contexts in order to improve response quality. However, existing approaches to commonsense-augmented dialogue rely on implicit reasoning to integrate commonsense inferences… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  5. arXiv:2406.07800  [pdf, other

    cs.LG cs.DC

    Regularizing and Aggregating Clients with Class Distribution for Personalized Federated Learning

    Authors: Gyuejeong Lee, Daeyoung Choi

    Abstract: Personalized federated learning (PFL) enables customized models for clients with varying data distributions. However, existing PFL methods often incur high computational and communication costs, limiting their practical application. This paper proposes a novel PFL method, Class-wise Federated Averaging (cwFedAVG), that performs Federated Averaging (FedAVG) class-wise, creating multiple global mode… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  6. arXiv:2406.03705  [pdf, other

    cond-mat.mes-hall quant-ph

    Coherent control of a triangular exchange-only spin qubit

    Authors: Edwin Acuna, Joseph D. Broz, Kaushal Shyamsundar, Antonio B. Mei, Colin P. Feeney, Valerie Smetanka, Tiffany Davis, Kangmu Lee, Maxwell D. Choi, Brydon Boyd, June Suh, Wonill D. Ha, Cameron Jennings, Andrew S. Pan, Daniel S. Sanchez, Matthew D. Reed, Jason R. Petta

    Abstract: We demonstrate coherent control of a three-electron exchange-only spin qubit with the quantum dots arranged in a close-packed triangular geometry. The device is tuned to confine one electron in each quantum dot, as evidenced by pairwise charge stability diagrams. Time-domain control of the exchange coupling is demonstrated and qubit performance is characterized using blind randomized benchmarking,… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  7. arXiv:2406.03663  [pdf

    eess.IV cs.LG q-bio.QM

    A Hybrid Deep Learning Classification of Perimetric Glaucoma Using Peripapillary Nerve Fiber Layer Reflectance and Other OCT Parameters from Three Anatomy Regions

    Authors: Ou Tan, David S. Greenfield, Brian A. Francis, Rohit Varma, Joel S. Schuman, David Huang, Dongseok Choi

    Abstract: Precis: A hybrid deep-learning model combines NFL reflectance and other OCT parameters to improve glaucoma diagnosis. Objective: To investigate if a deep learning model could be used to combine nerve fiber layer (NFL) reflectance and other OCT parameters for glaucoma diagnosis. Patients and Methods: This is a prospective observational study where of 106 normal subjects and 164 perimetric glaucoma… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 12 pages

  8. arXiv:2406.00170  [pdf

    q-bio.QM

    Focal Loss Analysis of Peripapillary Nerve Fiber Layer Reflectance for Glaucoma Diagnosis

    Authors: Ou Tan, Dongseok Choi, Aiyin Chen, David S. Greenfield, Brian A. Francis, Rohit Varma, Joel S. Schuman, David Huang, Advanced Imaging for Glaucoma Study Group

    Abstract: Purpose: To evaluate nerve fiber layer (NFL) reflectance for glaucoma diagnosis using a large dataset. Methods: Participants were imaged with 4.9mm ONH scans using spectral-domain optical coherence tomography (OCT). The NFL reflectance map was reconstructed from 13 concentric rings of optic nerve head(ONH) scan, then processed by an azimuthal filter to reduce directional reflectance bias due to va… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

    Comments: 18 pages. arXiv admin note: text overlap with arXiv:2006.13522

  9. arXiv:2406.00168  [pdf

    q-bio.QM

    Reliability for Nerve Fiber Layer Reflectance Using Spectral Domain Optical Coherence Tomography

    Authors: Kabir Hossain, Ou Tan, Po-Han Yeh, Jie Wang, Elizabeth White, Dongseok Choi, David Huang

    Abstract: Purpose: Reliability for Nerve Fiber Layer Reflectance Using Spectral Domain Optical Coherence Tomography (OCT) Methods: The study utilized OCT to scan participants with a cubic 6x6 mm disc scan. NFL reflectance were normalized by the average of bands below NFL and summarized. We selected several reference bands, including the pigment epithelium complex (PPEC), the band between NFL and Bruch's mem… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

    Comments: 13 pages

  10. arXiv:2405.15229  [pdf, other

    cond-mat.mes-hall

    Multi-Orbital Interactions and Spin Polarization in Single Rare-Earth Adatoms

    Authors: Massine Kelai, Stefano Reale, Roberto Robles, Jaehyun Lee, Divya Jyoti, Philippe Ohresser, Edwige Otero, Fadi Choueikani, Fabrice Scheurer, Nicolás Lorente, Deung-Jang Choi, Aparajita Singha, Fabio Donati

    Abstract: Surface-adsorbed rare-earth nanostructures are ideal platforms to investigate the interplay between intra-atomic interactions and multi-orbital spin configurations. However, addressing these properties has posed severe experimental and theoretical challenges. Here, we use the orbital selectivity offered by X-ray absorption spectroscopy to quantify the Coulomb integrals of Nd atoms on conductive su… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  11. arXiv:2405.12856  [pdf, other

    stat.ML cs.CL cs.LG

    LLM Processes: Numerical Predictive Distributions Conditioned on Natural Language

    Authors: James Requeima, John Bronskill, Dami Choi, Richard E. Turner, David Duvenaud

    Abstract: Machine learning practitioners often face significant challenges in formally integrating their prior knowledge and beliefs into predictive models, limiting the potential for nuanced and context-aware analyses. Moreover, the expertise needed to integrate this prior knowledge into probabilistic modeling typically limits the application of these models to specialists. Our goal is to build a regressio… ▽ More

    Submitted 25 May, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

  12. arXiv:2405.12468  [pdf, other

    cs.CL

    Diverse and Effective Synthetic Data Generation for Adaptable Zero-Shot Dialogue State Tracking

    Authors: James D. Finch, Jinho D. Choi

    Abstract: We demonstrate substantial performance gains in zero-shot dialogue state tracking (DST) by enhancing training data diversity through synthetic data generation. Existing DST datasets are severely limited in the number of application domains and slot types they cover due to the high costs of data collection, restricting their adaptability to new domains. This work addresses this challenge with a nov… ▽ More

    Submitted 13 June, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

  13. arXiv:2405.11178  [pdf, other

    cs.CL

    Automating PTSD Diagnostics in Clinical Interviews: Leveraging Large Language Models for Trauma Assessments

    Authors: Sichang Tu, Abigail Powers, Natalie Merrill, Negar Fani, Sierra Carter, Stephen Doogan, Jinho D. Choi

    Abstract: The shortage of clinical workforce presents significant challenges in mental healthcare, limiting access to formal diagnostics and services. We aim to tackle this shortage by integrating a customized large language model (LLM) into the workflow, thus promoting equity in mental healthcare for the general population. Although LLMs have showcased their capability in clinical decision-making, their ad… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

  14. arXiv:2405.04497  [pdf, other

    cs.HC

    Unveiling Disparities in Web Task Handling Between Human and Web Agent

    Authors: Kihoon Son, Jinhyeon Kwon, DaEun Choi, Tae Soo Kim, Young-Ho Kim, Sangdoo Yun, Juho Kim

    Abstract: With the advancement of Large-Language Models (LLMs) and Large Vision-Language Models (LVMs), agents have shown significant capabilities in various tasks, such as data analysis, gaming, or code generation. Recently, there has been a surge in research on web agents, capable of performing tasks within the web environment. However, the web poses unforeseeable scenarios, challenging the generalizabili… ▽ More

    Submitted 8 May, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

  15. arXiv:2405.00523  [pdf, other

    cs.AI cs.CL

    CookingSense: A Culinary Knowledgebase with Multidisciplinary Assertions

    Authors: Donghee Choi, Mogan Gim, Donghyeon Park, Mujeen Sung, Hyunjae Kim, Jaewoo Kang, Jihun Choi

    Abstract: This paper introduces CookingSense, a descriptive collection of knowledge assertions in the culinary domain extracted from various sources, including web data, scientific papers, and recipes, from which knowledge covering a broad range of aspects is acquired. CookingSense is constructed through a series of dictionary-based filtering and language model-based semantic filtering techniques, which res… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: LREC-COLING 2024 Accepted

  16. arXiv:2404.14392  [pdf, other

    cond-mat.mes-hall cond-mat.other

    Direct observation of Floquet-Bloch states in monolayer graphene

    Authors: Dongsung Choi, Masataka Mogi, Umberto De Giovannini, Doron Azoury, Baiqing Lv, Yifan Su, Hannes Hübener, Angel Rubio, Nuh Gedik

    Abstract: Floquet engineering is a novel method of manipulating quantum phases of matter via periodic driving [1, 2]. It has successfully been utilized in different platforms ranging from photonic systems [3] to optical lattice of ultracold atoms [4, 5]. In solids, light can be used as the periodic drive via coherent light-matter interaction. This leads to hybridization of Bloch electrons with photons resul… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  17. arXiv:2404.10860  [pdf, ps, other

    math.AG math.QA

    Line bundles on Contractions of $\overline{\rm{M}}_{0,n}$ via Conformal Block Divisors

    Authors: Daebeom Choi

    Abstract: The moduli space of stable curves of genus $g$ with $n$ marked points, $\overline{\rm{M}}_{g,n}$, is a central object in algebraic geometry, and plays a crucial role in $2$-dimensional conformal field theory. In this paper, we apply the sheaf of coinvariants and conformal block divisors to study the geometry of $\overline{\rm{M}}_{0,n}$. The main theorem characterizes the line bundles on certain c… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: 28 pages, Comments are welcome!

    MSC Class: 14H10; 17B69 (primary); 81R10 (secondary)

  18. arXiv:2404.09182  [pdf, other

    cond-mat.str-el cond-mat.mtrl-sci

    Coexistence of interacting charge density waves in a layered semiconductor

    Authors: B. Q. Lv, Alfred Zong, Dong Wu, Zhengwei Nie, Yifan Su, Dongsung Choi, Batyr Ilyas, Bryan T. Fichera, Jiarui Li, Edoardo Baldini, Masataka Mogi, Y. -B. Huang, Hoi Chun Po, Sheng Meng, Yao Wang, N. L. Wang, Nuh Gedik

    Abstract: Coexisting orders are key features of strongly correlated materials and underlie many intriguing phenomena from unconventional superconductivity to topological orders. Here, we report the coexistence of two interacting charge-density-wave (CDW) orders in EuTe4, a layered crystal that has drawn considerable attention owing to its anomalous thermal hysteresis and a semiconducting CDW state despite t… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: To appear in PRL

    Journal ref: Physical Review Letters 132, 206401 (2024)

  19. arXiv:2404.06764  [pdf

    physics.optics physics.app-ph

    A mid-infrared Brillouin laser using ultra-high-Q on-chip resonators

    Authors: Kiyoung Ko, Daewon Suk, Dohyeong Kim, Soobong Park, Betul Sen, Dae-Gon Kim, Yingying Wang, Shixun Dai, Xunsi Wang, Rongping Wang, Byung Jae Chun, Kwang-Hoon Ko, Peter T. Rakich, Duk-Yong Choi, Hansuek Lee

    Abstract: Ultra-high-Q optical resonators have facilitated recent advancements in on-chip photonics by effectively harnessing nonlinear phenomena providing useful functionalities. While these breakthroughs, primarily focused on the near-infrared region, have extended interest to longer wavelengths holding importance for monitoring and manipulating molecules, the absence of ultra-high-Q resonators in this re… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 10 pages, 5 figures in main script, and 1 figure in methods

  20. arXiv:2404.06621  [pdf, other

    cs.CL

    What is Your Favorite Gender, MLM? Gender Bias Evaluation in Multilingual Masked Language Models

    Authors: Jeongrok Yu, Seong Ug Kim, Jacob Choi, Jinho D. Choi

    Abstract: Bias is a disproportionate prejudice in favor of one side against another. Due to the success of transformer-based Masked Language Models (MLMs) and their impact on many NLP tasks, a systematic evaluation of bias in these models is needed more than ever. While many studies have evaluated gender bias in English MLMs, only a few works have been conducted for the task in other languages. This paper p… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  21. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  22. arXiv:2404.00676  [pdf, other

    cs.CV cs.GR

    OmniLocalRF: Omnidirectional Local Radiance Fields from Dynamic Videos

    Authors: Dongyoung Choi, Hyeonjoong Jang, Min H. Kim

    Abstract: Omnidirectional cameras are extensively used in various applications to provide a wide field of vision. However, they face a challenge in synthesizing novel views due to the inevitable presence of dynamic objects, including the photographer, in their wide field of view. In this paper, we introduce a new approach called Omnidirectional Local Radiance Fields (OmniLocalRF) that can render static-only… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024

  23. arXiv:2404.00376  [pdf, other

    cs.CL

    Small Language Models Learn Enhanced Reasoning Skills from Medical Textbooks

    Authors: Hyunjae Kim, Hyeon Hwang, Jiwoo Lee, Sihyeon Park, Dain Kim, Taewhoo Lee, Chanwoong Yoon, Jiwoong Sohn, Donghee Choi, Jaewoo Kang

    Abstract: While recent advancements in commercial large language models (LM) have shown promising results in medical tasks, their closed-source nature poses significant privacy and security concerns, hindering their widespread use in the medical field. Despite efforts to create open-source models, their limited parameters often result in insufficient multi-step reasoning capabilities required for solving co… ▽ More

    Submitted 30 June, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

    Comments: Added new LLaMA-3-based models and experiments on NEJM case challenges

  24. arXiv:2403.15714  [pdf, ps, other

    math.AP

    Analytic asymptotic formulas for effective parameters of planar elastic composites

    Authors: Daehee Cho, Doosung Choi, Mikyoung Lim

    Abstract: We investigate the effective elastic properties of periodic dilute two-phase composites consisting of an homogeneous isotropic matrix and a periodic array of rigid inclusions. We assume the rigid inclusion in a unit cell is a simply connected, bounded domain so that there exists an exterior conformal mapping corresponding the inclusion. Recently, an analytical series solution method for the elasti… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

  25. arXiv:2403.15713  [pdf, ps, other

    math.AP

    Geometric series solution for the plane elastostatic problem in the presence of a cavity

    Authors: Daehee Cho, Doosung Choi, Mikyoung Lim

    Abstract: This paper presents an analytic series solution method for the elastic inclusion problem in a two-dimensional unbounded isotropic medium with a cavity. Generalizing the work of Mattei and Lim \cite{Mattei:2021:EAS}, this study develops an analytic series solution method for the elastic inclusion problem to encompass a cavity problem. The central mathematical challenge tackled in this research is t… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

  26. arXiv:2403.14110  [pdf, other

    cs.LG cs.AI

    Heuristic Algorithm-based Action Masking Reinforcement Learning (HAAM-RL) with Ensemble Inference Method

    Authors: Kyuwon Choi, Cheolkyun Rho, Taeyoun Kim, Daewoo Choi

    Abstract: This paper presents a novel reinforcement learning (RL) approach called HAAM-RL (Heuristic Algorithm-based Action Masking Reinforcement Learning) for optimizing the color batching re-sequencing problem in automobile painting processes. The existing heuristic algorithms have limitations in adequately reflecting real-world constraints and accurately predicting logistics performance. Our methodology… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: 7 pages, 8 figures

  27. arXiv:2403.06252  [pdf, other

    cs.HC

    Demystifying Tacit Knowledge in Graphic Design: Characteristics, Instances, Approaches, and Guidelines

    Authors: Kihoon Son, DaEun Choi, Tae Soo Kim, Juho Kim

    Abstract: Despite the growing demand for professional graphic design knowledge, the tacit nature of design inhibits knowledge sharing. However, there is a limited understanding on the characteristics and instances of tacit knowledge in graphic design. In this work, we build a comprehensive set of tacit knowledge characteristics through a literature review. Through interviews with 10 professional graphic des… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

  28. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  29. arXiv:2403.03082  [pdf, other

    cs.LG cs.AI cs.CV

    Recall-Oriented Continual Learning with Generative Adversarial Meta-Model

    Authors: Haneol Kang, Dong-Wan Choi

    Abstract: The stability-plasticity dilemma is a major challenge in continual learning, as it involves balancing the conflicting objectives of maintaining performance on previous tasks while learning new tasks. In this paper, we propose the recall-oriented continual learning framework to address this challenge. Inspired by the human brain's ability to separate the mechanisms responsible for stability and pla… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: Accepted in AAAI-2024 (Oral presentation)

  30. TIE-KD: Teacher-Independent and Explainable Knowledge Distillation for Monocular Depth Estimation

    Authors: Sangwon Choi, Daejune Choi, Duksu Kim

    Abstract: Monocular depth estimation (MDE) is essential for numerous applications yet is impeded by the substantial computational demands of accurate deep learning models. To mitigate this, we introduce a novel Teacher-Independent Explainable Knowledge Distillation (TIE-KD) framework that streamlines the knowledge transfer from complex teacher models to compact student networks, eliminating the need for arc… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: 13 pages, 8 figures, under review for a journal

    Journal ref: Image and Vision Computing, 148 (2024), 105110

  31. arXiv:2402.12821  [pdf, other

    cs.CL cs.LG

    Identifying Factual Inconsistencies in Summaries: Grounding Model Inference via Task Taxonomy

    Authors: Liyan Xu, Zhenlin Su, Mo Yu, Jin Xu, Jinho D. Choi, Jie Zhou, Fei Liu

    Abstract: Factual inconsistencies pose a significant hurdle for the faithful summarization by generative models. While a major direction to enhance inconsistency detection is to derive stronger Natural Language Inference (NLI) models, we propose an orthogonal aspect that underscores the importance of incorporating task-specific taxonomy into the inference. To this end, we consolidate key error types of inco… ▽ More

    Submitted 19 June, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  32. arXiv:2402.12406  [pdf, other

    cs.LG cs.AI cs.CV

    Teacher as a Lenient Expert: Teacher-Agnostic Data-Free Knowledge Distillation

    Authors: Hyunjune Shin, Dong-Wan Choi

    Abstract: Data-free knowledge distillation (DFKD) aims to distill pretrained knowledge to a student model with the help of a generator without using original data. In such data-free scenarios, achieving stable performance of DFKD is essential due to the unavailability of validation data. Unfortunately, this paper has discovered that existing DFKD methods are quite sensitive to different teacher models, occa… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: Accepted in AAAI-2024

  33. arXiv:2402.11761  [pdf, ps, other

    math.NT

    The number of automorphic representations of $\mathrm{GL}_2$ with exceptional eigenvalues

    Authors: Dohoon Choi, Min Lee, Youngmin Lee, Subong Lim

    Abstract: We obtain an upper bound for the dimension of the cuspidal automorphic forms for $\mathrm{GL}_2$ over a number field, whose archimedean local representations are not tempered. More precisely, we prove the following result. Let $F$ be a number field and $\mathbb{A}_{F}$ be the ring of adeles of $F$. Let $\mathcal{O}_{F}$ be the ring of integers of $F$. Let $\mathfrak{X}_{F,\mathrm{ex}}$ be the se… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

    MSC Class: 11F72 (Primary); 11F12 (Secondary)

  34. arXiv:2402.01280  [pdf, other

    cond-mat.supr-con cond-mat.mes-hall

    In-gap states induced by magnetic impurities on wide-band s-wave superconductors: self-consistent calculations

    Authors: Divya Jyoti, Deung-Jang Choi, Nicolas Lorente

    Abstract: The role of self-consistency in Bogoliubov-de Gennes equations is frequently underestimated in the investigation of in-gap states created by magnetic impurities in s-wave superconductors. Our research focuses on the impact of self-consistency on the in-gap states produced by magnetic stuctures on superconductors, specifically evaluating the density of states, the in-gap bands, and their topologica… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  35. arXiv:2402.00644  [pdf, other

    cond-mat.supr-con cond-mat.mes-hall

    Two molecular devices for superconducting spintronics

    Authors: Cristina Mier, Alex Fétida, Roberto Robles, Parmenio Boronat, Divya Jyoti, Nicolás Lorente, Laurent Limot, Deung-Jang Choi

    Abstract: We create two molecular devices with superconducting junctions, using nickelocene molecules, single Fe atoms, and Pb electrodes at low temperature. We find contrasting behavior based on the coordination of the Fe atom: one device shows low-bias features in its differential conductance due to the superposition of multiple Andreev reflections (MAR) and Fe-induced in-gap states. The other reveals int… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

  36. arXiv:2401.15471  [pdf, other

    cs.CL

    ConvoSense: Overcoming Monotonous Commonsense Inferences for Conversational AI

    Authors: Sarah E. Finch, Jinho D. Choi

    Abstract: Mastering commonsense understanding and reasoning is a pivotal skill essential for conducting engaging conversations. While there have been several attempts to create datasets that facilitate commonsense inferences in dialogue contexts, existing datasets tend to lack in-depth details, restate information already present in the conversation, and often fail to capture the multifaceted nature of comm… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

    Comments: accepted to TACL 2024; final author's version of paper; pre-MIT Press publication version

  37. arXiv:2312.15514  [pdf, other

    cs.CV cs.AI

    Towards Reliable AI Model Deployments: Multiple Input Mixup for Out-of-Distribution Detection

    Authors: Dasol Choi, Dongbin Na

    Abstract: Recent remarkable success in the deep-learning industries has unprecedentedly increased the need for reliable model deployment. For example, the model should alert the user if the produced model outputs might not be reliable. Previous studies have proposed various methods to solve the Out-of-Distribution (OOD) detection problem, however, they generally require a burden of resources. In this work,… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

    Comments: Accepted to the AAAI 2024 Workshop on Deployable AI (DAI)

  38. arXiv:2312.15449  [pdf, other

    cs.CV

    iDet3D: Towards Efficient Interactive Object Detection for LiDAR Point Clouds

    Authors: Dongmin Choi, Wonwoo Cho, Kangyeol Kim, Jaegul Choo

    Abstract: Accurately annotating multiple 3D objects in LiDAR scenes is laborious and challenging. While a few previous studies have attempted to leverage semi-automatic methods for cost-effective bounding box annotation, such methods have limitations in efficiently handling numerous multi-class objects. To effectively accelerate 3D annotation pipelines, we propose iDet3D, an efficient interactive 3D object… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

    Comments: Accepted to AAAI 2024

  39. arXiv:2312.14129  [pdf, other

    cs.LG cs.AI cs.IR

    WellFactor: Patient Profiling using Integrative Embedding of Healthcare Data

    Authors: Dongjin Choi, Andy Xiang, Ozgur Ozturk, Deep Shrestha, Barry Drake, Hamid Haidarian, Faizan Javed, Haesun Park

    Abstract: In the rapidly evolving healthcare industry, platforms now have access to not only traditional medical records, but also diverse data sets encompassing various patient interactions, such as those from healthcare web portals. To address this rich diversity of data, we introduce WellFactor: a method that derives patient profiles by integrating information from these sources. Central to our approach… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: 2023 IEEE International Conference on Big Data (IEEE BigData 2023)

  40. arXiv:2312.11949  [pdf, other

    cs.HC

    CreativeConnect: Supporting Reference Recombination for Graphic Design Ideation with Generative AI

    Authors: DaEun Choi, Sumin Hong, Jeongeon Park, John Joon Young Chung, Juho Kim

    Abstract: Graphic designers often get inspiration through the recombination of references. Our formative study (N=6) reveals that graphic designers focus on conceptual keywords during this process, and want support for discovering the keywords, expanding them, and exploring diverse recombination options of them, while still having room for designers' creativity. We propose CreativeConnect, a system with gen… ▽ More

    Submitted 6 March, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

  41. arXiv:2312.06134  [pdf, other

    cs.CL cs.LG

    Order Matters in the Presence of Dataset Imbalance for Multilingual Learning

    Authors: Dami Choi, Derrick Xin, Hamid Dadkhahi, Justin Gilmer, Ankush Garg, Orhan Firat, Chih-Kuan Yeh, Andrew M. Dai, Behrooz Ghorbani

    Abstract: In this paper, we empirically study the optimization dynamics of multi-task learning, particularly focusing on those that govern a collection of tasks with significant data imbalance. We present a simple yet effective method of pre-training on high-resource tasks, followed by fine-tuning on a mixture of high/low-resource tasks. We provide a thorough empirical study and analysis of this method's be… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  42. arXiv:2311.11602   

    cs.CV cs.AI

    A Multi-In-Single-Out Network for Video Frame Interpolation without Optical Flow

    Authors: Jaemin Lee, Minseok Seo, Sangwoo Lee, Hyobin Park, Dong-Geol Choi

    Abstract: In general, deep learning-based video frame interpolation (VFI) methods have predominantly focused on estimating motion vectors between two input frames and warping them to the target time. While this approach has shown impressive performance for linear motion between two input frames, it exhibits limitations when dealing with occlusions and nonlinear movements. Recently, generative models have be… ▽ More

    Submitted 4 December, 2023; v1 submitted 20 November, 2023; originally announced November 2023.

    Comments: Discovering a problem with the manuscript

  43. arXiv:2311.06943  [pdf

    physics.flu-dyn

    Friction Tubes to Generate Nanobubble Ozone Water with an Increased Half-Life for Virucidal Activity

    Authors: Suk-Joo Byun, A-Ram You, Tae Seok Park, Chang-Hee Park, Dae-Hyun Choi, Eun-Hee Jun, Young-Ho Yoo, Taekeun Yoo

    Abstract: Nanobubbles and related technologies are expected to be highly utilized in water resource-based industries such as water purification, crops, horticulture, medicine, bio, and sterilization. Ozone, a chemical with high sterilizing power, is known as a natural substance that is reduced to oxygen and water after reacting with pollutants. Ozone water, which is generated by dissolving ozone in water, h… ▽ More

    Submitted 12 November, 2023; originally announced November 2023.

  44. arXiv:2311.05187  [pdf

    physics.optics quant-ph

    Ultrafast all-optical second harmonic wavefront shaping

    Authors: A. Sinelnik, S. H. Lam, F. Coviello, S. Klimmer, G. Della Valle, D. -Y. Choi, T. Pertsch, G. Soavi, I. Staude

    Abstract: Optical communication can be revolutionized by encoding data into the orbital angular momentum of light beams. However, state-of-the-art approaches for dynamic control of complex optical wavefronts are mainly based on liquid crystal spatial light modulators or miniaturized mirrors, which suffer from intrinsically slow response times. Here, we experimentally realize a hybrid meta-optical system tha… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

  45. arXiv:2311.03383  [pdf, other

    cs.LG cs.AI cs.AR cs.HC

    Toward Reinforcement Learning-based Rectilinear Macro Placement Under Human Constraints

    Authors: Tuyen P. Le, Hieu T. Nguyen, Seungyeol Baek, Taeyoun Kim, Jungwoo Lee, Seongjung Kim, Hyunjin Kim, Misu Jung, Daehoon Kim, Seokyong Lee, Daewoo Choi

    Abstract: Macro placement is a critical phase in chip design, which becomes more intricate when involving general rectilinear macros and layout areas. Furthermore, macro placement that incorporates human-like constraints, such as design hierarchy and peripheral bias, has the potential to significantly reduce the amount of additional manual labor required from designers. This study proposes a methodology tha… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: Fast ML for Science @ ICCAD 2023

  46. arXiv:2311.02240  [pdf, other

    cs.CV

    Towards Machine Unlearning Benchmarks: Forgetting the Personal Identities in Facial Recognition Systems

    Authors: Dasol Choi, Dongbin Na

    Abstract: Machine unlearning is a crucial tool for enabling a classification model to forget specific data that are used in the training time. Recently, various studies have presented machine unlearning algorithms and evaluated their methods on several datasets. However, most of the current machine unlearning algorithms have been evaluated solely on traditional computer vision datasets such as CIFAR-10, MNI… ▽ More

    Submitted 24 December, 2023; v1 submitted 3 November, 2023; originally announced November 2023.

    Comments: Accepted to the AAAI 2024 Workshop on Privacy-Preserving Artificial Intelligence (PPAI)

  47. arXiv:2310.16538  [pdf, other

    cs.CL cs.AI cs.LG

    FedTherapist: Mental Health Monitoring with User-Generated Linguistic Expressions on Smartphones via Federated Learning

    Authors: Jaemin Shin, Hyungjun Yoon, Seungjoo Lee, Sungjoon Park, Yunxin Liu, Jinho D. Choi, Sung-Ju Lee

    Abstract: Psychiatrists diagnose mental disorders via the linguistic use of patients. Still, due to data privacy, existing passive mental health monitoring systems use alternative features such as activity, app usage, and location via mobile devices. We propose FedTherapist, a mobile mental health monitoring system that utilizes continuous speech and keyboard input in a privacy-preserving way via federated… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: Accepted to the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)

  48. arXiv:2310.16318  [pdf, other

    cs.LG cs.AI

    Modality-Agnostic Self-Supervised Learning with Meta-Learned Masked Auto-Encoder

    Authors: Huiwon Jang, Jihoon Tack, Daewon Choi, Jongheon Jeong, Jinwoo Shin

    Abstract: Despite its practical importance across a wide range of modalities, recent advances in self-supervised learning (SSL) have been primarily focused on a few well-curated domains, e.g., vision and language, often relying on their domain-specific knowledge. For example, Masked Auto-Encoder (MAE) has become one of the popular architectures in these domains, but less has explored its potential in other… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Accepted to NeurIPS 2023. The first two authors contributed equally

  49. arXiv:2310.16151  [pdf

    physics.flu-dyn

    Generation of high concentration nanobubbles based on friction tubes

    Authors: Taekeun Yoo, Young-Ho Yoo, Suk-Joo Byun, A-Ram You, Chang-Hee Park, Dae-Hyun Choi, Eun-Hee Jun

    Abstract: Nanobubble-related technologies have been confirmed to be useful in various fields such as climate change and the environment as well as water-based industries such as water purification, crops, horticulture, medical care, bio, and sterilization. However, a method of mass production in real time enough to apply nano-bubbles to the industry has not yet been developed. We explored the mechanism of n… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: 24 pages, 24 figures, 6 tables

  50. arXiv:2310.04313  [pdf, other

    cs.CL

    KoMultiText: Large-Scale Korean Text Dataset for Classifying Biased Speech in Real-World Online Services

    Authors: Dasol Choi, Jooyoung Song, Eunsun Lee, Jinwoo Seo, Heejune Park, Dongbin Na

    Abstract: With the growth of online services, the need for advanced text classification algorithms, such as sentiment analysis and biased text detection, has become increasingly evident. The anonymous nature of online services often leads to the presence of biased and harmful language, posing challenges to maintaining the health of online communities. This phenomenon is especially relevant in South Korea, w… ▽ More

    Submitted 12 November, 2023; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: Accepted to the NeurIPS 2023 Workshop on Socially Responsible Language Modelling Research (SoLaR)