Skip to main content

Showing 1–50 of 98 results for author: Cho, W

  1. arXiv:2407.09779  [pdf, other

    cs.CV cs.AI

    Layout-and-Retouch: A Dual-stage Framework for Improving Diversity in Personalized Image Generation

    Authors: Kangyeol Kim, Wooseok Seo, Sehyun Nam, Bodam Kim, Suhyeon Jeong, Wonwoo Cho, Jaegul Choo, Youngjae Yu

    Abstract: Personalized text-to-image (P-T2I) generation aims to create new, text-guided images featuring the personalized subject with a few reference images. However, balancing the trade-off relationship between prompt fidelity and identity preservation remains a critical challenge. To address the issue, we propose a novel P-T2I method called Layout-and-Retouch, consisting of two stages: 1) layout generati… ▽ More

    Submitted 13 July, 2024; originally announced July 2024.

  2. arXiv:2406.12223  [pdf, other

    cs.CL cs.CY

    ToxiCloakCN: Evaluating Robustness of Offensive Language Detection in Chinese with Cloaking Perturbations

    Authors: Yunze Xiao, Yujia Hu, Kenny Tsu Wei Choo, Roy Ka-wei Lee

    Abstract: Detecting hate speech and offensive language is essential for maintaining a safe and respectful digital environment. This study examines the limitations of state-of-the-art large language models (LLMs) in identifying offensive content within systematically perturbed data, with a focus on Chinese, a language particularly susceptible to such perturbations. We introduce \textsf{ToxiCloakCN}, an enhan… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 10 pages,5 Tables, 2 Figures

  3. Towards Understanding Emotions for Engaged Mental Health Conversations

    Authors: Kellie Yu Hui Sim, Kohleen Tijing Fortuno, Kenny Tsu Wei Choo

    Abstract: Providing timely support and intervention is crucial in mental health settings. As the need to engage youth comfortable with texting increases, mental health providers are exploring and adopting text-based media such as chatbots, community-based forums, online therapies with licensed professionals, and helplines operated by trained responders. To support these text-based media for mental health--p… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: 5 pages, 1 figure, to be published in DIS Companion '24

    ACM Class: H.5.2; I.2.7

  4. arXiv:2406.05071  [pdf, other

    cs.AI cs.LG cs.MA

    Massively Multiagent Minigames for Training Generalist Agents

    Authors: Kyoung Whan Choe, Ryan Sullivan, Joseph Suárez

    Abstract: We present Meta MMO, a collection of many-agent minigames for use as a reinforcement learning benchmark. Meta MMO is built on top of Neural MMO, a massively multiagent environment that has been the subject of two previous NeurIPS competitions. Our work expands Neural MMO with several computationally efficient minigames. We explore generalization across Meta MMO by learning to play several minigame… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  5. arXiv:2405.20165  [pdf, other

    stat.ML cs.LG

    Randomized Exploration for Reinforcement Learning with Multinomial Logistic Function Approximation

    Authors: Wooseong Cho, Taehyun Hwang, Joongkyu Lee, Min-hwan Oh

    Abstract: We study reinforcement learning with multinomial logistic (MNL) function approximation where the underlying transition probability kernel of the Markov decision processes (MDPs) is parametrized by an unknown transition core with features of state and action. For the finite horizon episodic setting with inhomogeneous state transitions, we propose provably efficient algorithms with randomized explor… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  6. arXiv:2405.06754  [pdf, other

    cs.NI eess.SP

    Wall-Street: Smart Surface-Enabled 5G mmWave for Roadside Networking

    Authors: Kun Woo Cho, Prasanthi Maddala, Ivan Seskar, Kyle Jamieson

    Abstract: 5G mmWave roadside networks promise high-speed wireless connectivity, but face significant challenges in maintaining reliable connections for users moving at high speed. Frequent handovers, complex beam alignment, and signal attenuation due to obstacles like car bodies lead to service interruptions and degraded performance. We present Wall-Street, a smart surface installed on vehicles to enhance 5… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: 15 pages, 22 figures, under submission

  7. arXiv:2405.01842  [pdf, ps, other

    cs.CL

    SGHateCheck: Functional Tests for Detecting Hate Speech in Low-Resource Languages of Singapore

    Authors: Ri Chi Ng, Nirmalendu Prakash, Ming Shan Hee, Kenny Tsu Wei Choo, Roy Ka-Wei Lee

    Abstract: To address the limitations of current hate speech detection models, we introduce \textsf{SGHateCheck}, a novel framework designed for the linguistic and cultural context of Singapore and Southeast Asia. It extends the functional testing approach of HateCheck and MHC, employing large language models for translation and paraphrasing into Singapore's main languages, and refining these with native ann… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  8. arXiv:2404.14873  [pdf, ps, other

    stat.ML cs.LG math.NA

    Estimating the Distribution of Parameters in Differential Equations with Repeated Cross-Sectional Data

    Authors: Hyeontae Jo, Sung Woong Cho, Hyung Ju Hwang

    Abstract: Differential equations are pivotal in modeling and understanding the dynamics of various systems, offering insights into their future states through parameter estimation fitted to time series data. In fields such as economy, politics, and biology, the observation data points in the time series are often independently obtained (i.e., Repeated Cross-Sectional (RCS) data). With RCS data, we found tha… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 16 pages, 10 figures

    MSC Class: 65L08; 65D17; 68U07

  9. arXiv:2404.11539  [pdf, other

    cs.CL

    Evaluating Span Extraction in Generative Paradigm: A Reflection on Aspect-Based Sentiment Analysis

    Authors: Soyoung Yang, Won Ik Cho

    Abstract: In the era of rapid evolution of generative language models within the realm of natural language processing, there is an imperative call to revisit and reformulate evaluation methodologies, especially in the domain of aspect-based sentiment analysis (ABSA). This paper addresses the emerging challenges introduced by the generative paradigm, which has moderately blurred traditional boundaries betwee… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: 10 pages

  10. arXiv:2404.09041  [pdf, other

    cs.CY

    Three Disclaimers for Safe Disclosure: A Cardwriter for Reporting the Use of Generative AI in Writing Process

    Authors: Won Ik Cho, Eunjung Cho, Hyeonji Shin

    Abstract: Generative artificial intelligence (AI) and large language models (LLMs) are increasingly being used in the academic writing process. This is despite the current lack of unified framework for reporting the use of machine assistance. In this work, we propose "Cardwriter", an intuitive interface that produces a short report for authors to declare their use of generative AI in their writing process.… ▽ More

    Submitted 13 April, 2024; originally announced April 2024.

    Comments: 6 pages; an implementation version of PaperCard project

  11. A Taxonomy for Human-LLM Interaction Modes: An Initial Exploration

    Authors: Jie Gao, Simret Araya Gebreegziabher, Kenny Tsu Wei Choo, Toby Jia-Jun Li, Simon Tangi Perrault, Thomas W. Malone

    Abstract: With ChatGPT's release, conversational prompting has become the most popular form of human-LLM interaction. However, its effectiveness is limited for more complex tasks involving reasoning, creativity, and iteration. Through a systematic analysis of HCI papers published since 2021, we identified four key phases in the human-LLM interaction flow - planning, facilitating, iterating, and testing - to… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: 11 pages, 4 figures, 3 tables. Accepted at CHI Late-Breaking Work 2024

  12. arXiv:2403.05209  [pdf, other

    cs.LG cs.AI cs.CV

    Overcoming Data Inequality across Domains with Semi-Supervised Domain Generalization

    Authors: Jinha Park, Wonguk Cho, Taesup Kim

    Abstract: While there have been considerable advancements in machine learning driven by extensive datasets, a significant disparity still persists in the availability of data across various sources and populations. This inequality across domains poses challenges in modeling for those with limited data, which can lead to profound practical and ethical concerns. In this paper, we address a representative case… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 20 pages, 4 figures

  13. arXiv:2402.08187  [pdf, other

    cs.LG math.NA

    Learning time-dependent PDE via graph neural networks and deep operator network for robust accuracy on irregular grids

    Authors: Sung Woong Cho, Jae Yong Lee, Hyung Ju Hwang

    Abstract: Scientific computing using deep learning has seen significant advancements in recent years. There has been growing interest in models that learn the operator from the parameters of a partial differential equation (PDE) to the corresponding solutions. Deep Operator Network (DeepONet) and Fourier Neural operator, among other models, have been designed with structures suitable for handling functions… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: 25 pages, 11 figures

    MSC Class: 65D17; 68U07

  14. arXiv:2312.15949  [pdf, other

    cs.LG math.NA

    HyperDeepONet: learning operator with complex target function space using the limited resources via hypernetwork

    Authors: Jae Yong Lee, Sung Woong Cho, Hyung Ju Hwang

    Abstract: Fast and accurate predictions for complex physical dynamics are a significant challenge across various applications. Real-time prediction on resource-constrained hardware is even more crucial in real-world problems. The deep operator network (DeepONet) has recently been proposed as a framework for learning nonlinear mappings between function spaces. However, the DeepONet requires many parameters a… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: 26 pages, 13 figures. Published as a conference paper at Eleventh International Conference on Learning Representations (ICLR 2023)

    MSC Class: 65D17; 68U07

  15. arXiv:2312.15449  [pdf, other

    cs.CV

    iDet3D: Towards Efficient Interactive Object Detection for LiDAR Point Clouds

    Authors: Dongmin Choi, Wonwoo Cho, Kangyeol Kim, Jaegul Choo

    Abstract: Accurately annotating multiple 3D objects in LiDAR scenes is laborious and challenging. While a few previous studies have attempted to leverage semi-automatic methods for cost-effective bounding box annotation, such methods have limitations in efficiently handling numerous multi-class objects. To effectively accelerate 3D annotation pipelines, we propose iDet3D, an efficient interactive 3D object… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

    Comments: Accepted to AAAI 2024

  16. arXiv:2312.12467  [pdf, other

    cs.LG cs.AI cs.CE

    Learning Flexible Body Collision Dynamics with Hierarchical Contact Mesh Transformer

    Authors: Youn-Yeol Yu, Jeongwhan Choi, Woojin Cho, Kookjin Lee, Nayong Kim, Kiseok Chang, Chang-Seung Woo, Ilho Kim, Seok-Woo Lee, Joon-Young Yang, Sooyoung Yoon, Noseong Park

    Abstract: Recently, many mesh-based graph neural network (GNN) models have been proposed for modeling complex high-dimensional physical systems. Remarkable achievements have been made in significantly reducing the solving time compared to traditional numerical solvers. These methods are typically designed to i) reduce the computational cost in solving physical dynamics and/or ii) propose techniques to enhan… ▽ More

    Submitted 25 March, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: Accepted at ICLR 2024

  17. arXiv:2312.10274  [pdf, other

    cs.LG cs.AI

    Operator-learning-inspired Modeling of Neural Ordinary Differential Equations

    Authors: Woojin Cho, Seunghyeon Cho, Hyundong Jin, Jinsung Jeon, Kookjin Lee, Sanghyun Hong, Dongeun Lee, Jonghyun Choi, Noseong Park

    Abstract: Neural ordinary differential equations (NODEs), one of the most influential works of the differential equation-based deep learning, are to continuously generalize residual networks and opened a new field. They are currently utilized for various downstream tasks, e.g., image classification, time series classification, image generation, etc. Its key part is how to model the time-derivative of the hi… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

  18. arXiv:2312.09603  [pdf, other

    cs.SD cs.LG eess.AS

    Stethoscope-guided Supervised Contrastive Learning for Cross-domain Adaptation on Respiratory Sound Classification

    Authors: June-Woo Kim, Sangmin Bae, Won-Yang Cho, Byungjo Lee, Ho-Young Jung

    Abstract: Despite the remarkable advances in deep learning technology, achieving satisfactory performance in lung sound classification remains a challenge due to the scarcity of available data. Moreover, the respiratory sound samples are collected from a variety of electronic stethoscopes, which could potentially introduce biases into the trained models. When a significant distribution shift occurs within t… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: accepted to ICASSP 2024

  19. arXiv:2311.03736  [pdf, other

    cs.AI cs.LG cs.MA

    Neural MMO 2.0: A Massively Multi-task Addition to Massively Multi-agent Learning

    Authors: Joseph Suárez, Phillip Isola, Kyoung Whan Choe, David Bloomin, Hao Xiang Li, Nikhil Pinnaparaju, Nishaanth Kanna, Daniel Scott, Ryan Sullivan, Rose S. Shuman, Lucas de Alcântara, Herbie Bradley, Louis Castricato, Kirsty You, Yuhao Jiang, Qimai Li, Jiaxin Chen, Xiaolong Zhu

    Abstract: Neural MMO 2.0 is a massively multi-agent environment for reinforcement learning research. The key feature of this new version is a flexible task system that allows users to define a broad range of objectives and reward signals. We challenge researchers to train agents capable of generalizing to tasks, maps, and opponents never seen during training. Neural MMO features procedurally generated maps… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

  20. arXiv:2310.11551  [pdf, other

    cs.NI eess.SP

    WaveFlex: A Smart Surface for Private CBRS Wireless Cellular Networks

    Authors: Fan Yi, Kun Woo Cho, Yaxiong Xie, Kyle Jamieson

    Abstract: We present the design and implementation of WaveFlex, the first smart surface that enhances Private LTE/5G networks operating under the shared-license framework in the Citizens Broadband Radio Service frequency band. WaveFlex works in the presence of frequency diversity: multiple nearby base stations operating on different frequencies, as dictated by a Spectrum Access System coordinator. It also h… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: 15 pages

  21. arXiv:2310.09528  [pdf, other

    cs.LG math.NA physics.comp-ph

    Hypernetwork-based Meta-Learning for Low-Rank Physics-Informed Neural Networks

    Authors: Woojin Cho, Kookjin Lee, Donsub Rim, Noseong Park

    Abstract: In various engineering and applied science applications, repetitive numerical simulations of partial differential equations (PDEs) for varying input parameters are often required (e.g., aircraft shape optimization over many design parameters) and solvers are required to perform rapid execution. In this study, we suggest a path that potentially opens up a possibility for physics-informed neural net… ▽ More

    Submitted 14 October, 2023; originally announced October 2023.

  22. arXiv:2310.04824  [pdf, other

    cs.CY

    PaperCard for Reporting Machine Assistance in Academic Writing

    Authors: Won Ik Cho, Eunjung Cho, Kyunghyun Cho

    Abstract: Academic writing process has benefited from various technological developments over the years including search engines, automatic translators, and editing tools that review grammar and spelling mistakes. They have enabled human writers to become more efficient in writing academic papers, for example by helping with finding relevant literature more effectively and polishing texts. While these devel… ▽ More

    Submitted 7 October, 2023; originally announced October 2023.

    Comments: Accepted at EAAMO'23 as a poster presentation

  23. arXiv:2309.13858  [pdf, other

    cs.HC

    Impact of Human-AI Interaction on User Trust and Reliance in AI-Assisted Qualitative Coding

    Authors: Jie Gao, Junming Cao, ShunYi Yeo, Kenny Tsu Wei Choo, Zheng Zhang, Toby Jia-Jun Li, Shengdong Zhao, Simon Tangi Perrault

    Abstract: While AI shows promise for enhancing the efficiency of qualitative analysis, the unique human-AI interaction resulting from varied coding strategies makes it challenging to develop a trustworthy AI-assisted qualitative coding system (AIQCs) that supports coding tasks effectively. We bridge this gap by exploring the impact of varying coding strategies on user trust and reliance on AI. We conducted… ▽ More

    Submitted 24 September, 2023; originally announced September 2023.

    Comments: 27 pages with references, 9 figures, 5 tables

  24. arXiv:2309.11017  [pdf, other

    cs.ET

    3SAT on an All-to-All-Connected CMOS Ising Solver Chip

    Authors: Hüsrev Cılasun, Ziqing Zeng, Ramprasath S, Abhimanyu Kumar, Hao Lo, William Cho, Chris H. Kim, Ulya R. Karpuzcu, Sachin S. Sapatnekar

    Abstract: This work solves 3SAT, a classical NP-complete problem, on a CMOS-based Ising hardware chip with all-to-all connectivity. The paper addresses practical issues in going from algorithms to hardware. It considers several degrees of freedom in mapping the 3SAT problem to the chip - using multiple Ising formulations for 3SAT; exploring multiple strategies for decomposing large problems into subproblems… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    ACM Class: B.7

  25. arXiv:2309.01961  [pdf, other

    cs.CV

    NICE: CVPR 2023 Challenge on Zero-shot Image Captioning

    Authors: Taehoon Kim, Pyunghwan Ahn, Sangyun Kim, Sihaeng Lee, Mark Marsden, Alessandra Sala, Seung Hwan Kim, Bohyung Han, Kyoung Mu Lee, Honglak Lee, Kyounghoon Bae, Xiangyu Wu, Yi Gao, Hailiang Zhang, Yang Yang, Weili Guo, Jianfeng Lu, Youngtaek Oh, Jae Won Cho, Dong-jin Kim, In So Kweon, Junmo Kim, Wooyoung Kang, Won Young Jhoo, Byungseok Roh , et al. (17 additional authors not shown)

    Abstract: In this report, we introduce NICE (New frontiers for zero-shot Image Captioning Evaluation) project and share the results and outcomes of 2023 challenge. This project is designed to challenge the computer vision community to develop robust image captioning models that advance the state-of-the-art both in terms of accuracy and fairness. Through the challenge, the image captioning models were tested… ▽ More

    Submitted 10 September, 2023; v1 submitted 5 September, 2023; originally announced September 2023.

    Comments: Tech report, project page https://nice.lgresearch.ai/

  26. arXiv:2307.16887  [pdf

    cs.RO

    Data-Based MHE for Agile Quadrotor Flight

    Authors: Wonoo Choo, Erkan Kayacan

    Abstract: This paper develops a data-based moving horizon estimation (MHE) method for agile quadrotors. Accurate state estimation of the system is paramount for precise trajectory control for agile quadrotors; however, the high level of aerodynamic forces experienced by the quadrotors during high-speed flights make this task extremely challenging. These complex turbulent effects are difficult to model and t… ▽ More

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: 8 pages, accepted in IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2023

  27. arXiv:2305.17680  [pdf, other

    cs.CL cs.AI

    Evaluating GPT-3 Generated Explanations for Hateful Content Moderation

    Authors: Han Wang, Ming Shan Hee, Md Rabiul Awal, Kenny Tsu Wei Choo, Roy Ka-Wei Lee

    Abstract: Recent research has focused on using large language models (LLMs) to generate explanations for hate speech through fine-tuning or prompting. Despite the growing interest in this area, these generated explanations' effectiveness and potential limitations remain poorly understood. A key concern is that these explanations, generated by LLMs, may lead to erroneous judgments about the nature of flagged… ▽ More

    Submitted 30 August, 2023; v1 submitted 28 May, 2023; originally announced May 2023.

    Comments: 9 pages, 2 figures, Accepted by International Joint Conference on Artificial Intelligence(IJCAI)

    ACM Class: I.2.7

  28. arXiv:2305.17254  [pdf

    cs.RO eess.SY

    Computationally Efficient Data-Driven MPC for Agile Quadrotor Flight

    Authors: Wonoo Choo, Erkan Kayacan

    Abstract: This paper develops computationally efficient data-driven model predictive control (MPC) for Agile quadrotor flight. Agile quadrotors in high-speed flights can experience high levels of aerodynamic effects. Modeling these turbulent aerodynamic effects is a cumbersome task and the resulting model may be overly complex and computationally infeasible. Combining Gaussian Process (GP) regression models… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: 6 pages, accepted in ACC 2023 (American Control Conference, 2023)

  29. Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification

    Authors: Sangmin Bae, June-Woo Kim, Won-Yang Cho, Hyerim Baek, Soyoun Son, Byungjo Lee, Changwan Ha, Kyongpil Tae, Sungnyun Kim, Se-Young Yun

    Abstract: Respiratory sound contains crucial information for the early diagnosis of fatal lung diseases. Since the COVID-19 pandemic, there has been a growing interest in contact-free medical care based on electronic stethoscopes. To this end, cutting-edge deep learning models have been developed to diagnose lung diseases; however, it is still challenging due to the scarcity of medical data. In this study,… ▽ More

    Submitted 22 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: INTERSPEECH 2023, Code URL: https://github.com/raymin0223/patch-mix_contrastive_learning

  30. arXiv:2304.05560  [pdf, other

    cs.HC

    CoAIcoder: Examining the Effectiveness of AI-assisted Human-to-Human Collaboration in Qualitative Analysis

    Authors: Jie Gao, Kenny Tsu Wei Choo, Junming Cao, Roy Ka Wei Lee, Simon Perrault

    Abstract: While AI-assisted individual qualitative analysis has been substantially studied, AI-assisted collaborative qualitative analysis (CQA)-a process that involves multiple researchers working together to interpret data-remains relatively unexplored. After identifying CQA practices and design opportunities through formative interviews, we designed and implemented CoAIcoder, a tool leveraging AI to enha… ▽ More

    Submitted 24 July, 2023; v1 submitted 11 April, 2023; originally announced April 2023.

    Comments: Will appear on ACM Transactions on Computer-Human Interaction (TOCHI)

  31. arXiv:2304.00350  [pdf, other

    cs.CL

    When Crowd Meets Persona: Creating a Large-Scale Open-Domain Persona Dialogue Corpus

    Authors: Won Ik Cho, Yoon Kyung Lee, Seoyeon Bae, Jihwan Kim, Sangah Park, Moosung Kim, Sowon Hahn, Nam Soo Kim

    Abstract: Building a natural language dataset requires caution since word semantics is vulnerable to subtle text change or the definition of the annotated concept. Such a tendency can be seen in generative tasks like question-answering and dialogue generation and also in tasks that create a categorization-based corpus, like topic classification or sentiment analysis. Open-domain conversations involve two or… ▽ More

    Submitted 1 April, 2023; originally announced April 2023.

    Comments: Presented at HCOMP 2022 as Works-in-Progress

  32. arXiv:2303.15833  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Complementary Domain Adaptation and Generalization for Unsupervised Continual Domain Shift Learning

    Authors: Wonguk Cho, Jinha Park, Taesup Kim

    Abstract: Continual domain shift poses a significant challenge in real-world applications, particularly in situations where labeled data is not available for new domains. The challenge of acquiring knowledge in this problem setting is referred to as unsupervised continual domain shift learning. Existing methods for domain adaptation and generalization have limitations in addressing this issue, as they focus… ▽ More

    Submitted 13 October, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

    Comments: ICCV 2023

  33. arXiv:2303.11771  [pdf, other

    cs.CV

    Self-Sufficient Framework for Continuous Sign Language Recognition

    Authors: Youngjoon Jang, Youngtaek Oh, Jae Won Cho, Myungchul Kim, Dong-Jin Kim, In So Kweon, Joon Son Chung

    Abstract: The goal of this work is to develop self-sufficient framework for Continuous Sign Language Recognition (CSLR) that addresses key issues of sign language recognition. These include the need for complex multi-scale features such as hands, face, and mouth for understanding, and absence of frame-level annotations. To this end, we propose (1) Divide and Focus Convolution (DFConv) which extracts both ma… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

  34. arXiv:2302.14368  [pdf, other

    cs.CV cs.AI cs.GR

    Towards Enhanced Controllability of Diffusion Models

    Authors: Wonwoong Cho, Hareesh Ravi, Midhun Harikumar, Vinh Khuc, Krishna Kumar Singh, Jingwan Lu, David I. Inouye, Ajinkya Kale

    Abstract: Denoising Diffusion models have shown remarkable capabilities in generating realistic, high-quality and diverse images. However, the extent of controllability during generation is underexplored. Inspired by techniques based on GAN latent space for image manipulation, we train a diffusion model conditioned on two latent codes, a spatial content mask and a flattened style embedding. We rely on the i… ▽ More

    Submitted 15 March, 2023; v1 submitted 28 February, 2023; originally announced February 2023.

    Comments: 28 pages, 28 figures

  35. arXiv:2212.10433  [pdf, other

    cs.DS cs.AI cs.LG

    Scheduling with Predictions

    Authors: Woo-Hyung Cho, Shane Henderson, David Shmoys

    Abstract: There is significant interest in deploying machine learning algorithms for diagnostic radiology, as modern learning techniques have made it possible to detect abnormalities in medical images within minutes. While machine-assisted diagnoses cannot yet reliably replace human reviews of images by a radiologist, they could inform prioritization rules for determining the order by which to review patien… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

  36. arXiv:2212.07663  [pdf, other

    cs.NI eess.SP

    Cross-Link Channel Prediction for Massive IoT Networks

    Authors: Kun Woo Cho, Marco Cominelli, Francesco Gringoli, Joerg Widmer, Kyle Jamieson

    Abstract: Tomorrow's massive-scale IoT sensor networks are poised to drive uplink traffic demand, especially in areas of dense deployment. To meet this demand, however, network designers leverage tools that often require accurate estimates of Channel State Information (CSI), which incurs a high overhead and thus reduces network throughput. Furthermore, the overhead generally scales with the number of client… ▽ More

    Submitted 15 December, 2022; originally announced December 2022.

    Comments: 13 pages, 12 figures

  37. arXiv:2211.00448  [pdf, other

    cs.CV

    Signing Outside the Studio: Benchmarking Background Robustness for Continuous Sign Language Recognition

    Authors: Youngjoon Jang, Youngtaek Oh, Jae Won Cho, Dong-Jin Kim, Joon Son Chung, In So Kweon

    Abstract: The goal of this work is background-robust continuous sign language recognition. Most existing Continuous Sign Language Recognition (CSLR) benchmarks have fixed backgrounds and are filmed in studios with a static monochromatic background. However, signing is not limited only to studios in the real world. In order to analyze the robustness of CSLR models under background shifts, we first evaluate e… ▽ More

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: Our dataset is available at https://github.com/art-jang/Signing-Outside-the-Studio

  38. arXiv:2209.11554  [pdf, other

    cs.NI

    mmWall: A Transflective Metamaterial Surface for mmWave Networks

    Authors: Kun Woo Cho, Mohammad H. Mazaheri, Jeremy Gummeson, Omid Abari, Kyle Jamieson

    Abstract: Mobile operators are poised to leverage millimeter wave technology as 5G evolves, but despite efforts to bolster their reliability indoors and outdoors, mmWave links remain vulnerable to blockage by walls, people, and obstacles. Further, there is significant interest in bringing outdoor mmWave coverage indoors, which for similar reasons remains challenging today. This paper presents the design, ha… ▽ More

    Submitted 25 September, 2022; v1 submitted 23 September, 2022; originally announced September 2022.

    Comments: 18 pages, 18 figures

  39. arXiv:2209.07578  [pdf, other

    cond-mat.mtrl-sci cs.LG

    Pixel-wise classification in graphene-detection with tree-based machine learning algorithms

    Authors: Woon Hyung Cho, Jiseon Shin, Young Duck Kim, George J. Jung

    Abstract: Mechanical exfoliation of graphene and its identification by optical inspection is one of the milestones in condensed matter physics that sparked the field of 2D materials. Finding regions of interest from the entire sample space and identification of layer number is a routine task potentially amenable to automatization. We propose supervised pixel-wise classification methods showing a high perfor… ▽ More

    Submitted 24 August, 2022; originally announced September 2022.

    Comments: 12 pages, 6 figures

  40. arXiv:2208.00690  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Generative Bias for Robust Visual Question Answering

    Authors: Jae Won Cho, Dong-jin Kim, Hyeonggon Ryu, In So Kweon

    Abstract: The task of Visual Question Answering (VQA) is known to be plagued by the issue of VQA models exploiting biases within the dataset to make its final prediction. Various previous ensemble based debiasing methods have been proposed where an additional model is purposefully trained to be biased in order to train a robust target model. However, these methods compute the bias for a model simply from th… ▽ More

    Submitted 22 March, 2023; v1 submitted 1 August, 2022; originally announced August 2022.

    Comments: CVPR 2023

  41. arXiv:2207.11534  [pdf, other

    eess.IV cs.AI cs.CV

    Comparative Validation of AI and non-AI Methods in MRI Volumetry to Diagnose Parkinsonian Syndromes

    Authors: Joomee Song, Juyoung Hahm, Jisoo Lee, Chae Yeon Lim, Myung Jin Chung, Jinyoung Youn, Jin Whan Cho, Jong Hyeon Ahn, Kyung-Su Kim

    Abstract: Automated segmentation and volumetry of brain magnetic resonance imaging (MRI) scans are essential for the diagnosis of Parkinson's disease (PD) and Parkinson's plus syndromes (P-plus). To enhance the diagnostic performance, we adopt deep learning (DL) models in brain segmentation and compared their performance with the gold-standard non-DL method. We collected brain MRI scans of healthy controls… ▽ More

    Submitted 23 July, 2022; originally announced July 2022.

    Comments: Joomee Song and Juyoung Hahm contributed equally to this work as the co-first author. Jong Hyeon Ahn and Kyung-Su Kim (kskim.doc@gmail.com) contributed equally to this work as the co-corresponding author

  42. arXiv:2207.10287  [pdf, other

    cs.CV

    Towards Accurate Open-Set Recognition via Background-Class Regularization

    Authors: Wonwoo Cho, Jaegul Choo

    Abstract: In open-set recognition (OSR), classifiers should be able to reject unknown-class samples while maintaining high closed-set classification accuracy. To effectively solve the OSR problem, previous studies attempted to limit latent feature space and reject data located outside the limited space via offline analyses, e.g., distance-based feature analyses, or complicated network architectures. To cond… ▽ More

    Submitted 20 July, 2022; originally announced July 2022.

    Comments: Accepted to ECCV 2022

  43. STI: Turbocharge NLP Inference at the Edge via Elastic Pipelining

    Authors: Liwei Guo, Wonkyo Choe, Felix Xiaozhu Lin

    Abstract: Natural Language Processing (NLP) inference is seeing increasing adoption by mobile applications, where on-device inference is desirable for crucially preserving user data privacy and avoiding network roundtrips. Yet, the unprecedented size of an NLP model stresses both latency and memory, creating a tension between the two key resources of a mobile device. To meet a target latency, holding the wh… ▽ More

    Submitted 31 January, 2023; v1 submitted 11 July, 2022; originally announced July 2022.

    Comments: ASPLOS'23

  44. arXiv:2207.02286  [pdf, other

    cs.LG cs.AI

    Cooperative Distribution Alignment via JSD Upper Bound

    Authors: Wonwoong Cho, Ziyu Gong, David I. Inouye

    Abstract: Unsupervised distribution alignment estimates a transformation that maps two or more source distributions to a shared aligned distribution given only samples from each distribution. This task has many applications including generative modeling, unsupervised domain adaptation, and socially aware learning. Most prior works use adversarial learning (i.e., min-max optimization), which can be challengi… ▽ More

    Submitted 31 October, 2022; v1 submitted 5 July, 2022; originally announced July 2022.

    Comments: Accepted for publication in Advances in Neural Information Processing Systems 36 (NeurIPS 2022)

  45. arXiv:2206.14939  [pdf, other

    cs.NI

    Towards Dual-band Reconfigurable Metamaterial Surfaces for Satellite Networking

    Authors: Kun Woo Cho, Yasaman Ghasempour, Kyle Jamieson

    Abstract: The first low earth orbit satellite networks for internet service have recently been deployed and are growing in size, yet will face deployment challenges in many practical circumstances of interest. This paper explores how a dual-band, electronically tunable smart surface can enable dynamic beam alignment between the satellite and mobile users, make service possible in urban canyons, and improve… ▽ More

    Submitted 29 June, 2022; originally announced June 2022.

    Comments: 9 pages including references, 9 figures

    ACM Class: C.2.5; C.3

  46. arXiv:2206.09885  [pdf, other

    cs.CV

    KOLOMVERSE: KRISO open large-scale image dataset for object detection in the maritime universe

    Authors: Abhilasha Nanda, Sung Won Cho, Hyeopwoo Lee, Jin Hyoung Park

    Abstract: Over the years, datasets have been developed for various object detection tasks. Object detection in the maritime domain is essential for the safety and navigation of ships. However, there is still a lack of publicly available large-scale datasets in the maritime domain. To overcome this challenge, we present KOLOMVERSE, an open large-scale image dataset for object detection in the maritime domain… ▽ More

    Submitted 20 June, 2022; originally announced June 2022.

    Comments: 13 Pages, 12 figures, submitted to NeurIPS 2022 Datasets and Benchmarks Track (Under Review)

  47. arXiv:2205.01059  [pdf, other

    cs.LG cs.AI math.NA math.OC

    Enhanced Physics-Informed Neural Networks with Augmented Lagrangian Relaxation Method (AL-PINNs)

    Authors: Hwijae Son, Sung Woong Cho, Hyung Ju Hwang

    Abstract: Physics-Informed Neural Networks (PINNs) have become a prominent application of deep learning in scientific computation, as they are powerful approximators of solutions to nonlinear partial differential equations (PDEs). There have been numerous attempts to facilitate the training process of PINNs by adjusting the weight of each component of the loss function, called adaptive loss-balancing algori… ▽ More

    Submitted 30 May, 2023; v1 submitted 29 April, 2022; originally announced May 2022.

  48. arXiv:2204.12687  [pdf, other

    physics.soc-ph cs.SI

    Multiresolution community analysis of international trade networks

    Authors: Wonguk Cho, Daekyung Lee, Beom Jun Kim

    Abstract: The international trade network is a complex system where multiple trade blocs with varying sizes coexist and overlap with each other. However, the resulting structures of community detection in trade networks are often inconsistent and fails to capture the complex landscape of international trade. To address these problems, we propose a multiresolution framework that aggregates all the configurat… ▽ More

    Submitted 26 April, 2022; originally announced April 2022.

    Comments: 19 pages, 5 figures, 1 table

  49. arXiv:2204.02633  [pdf

    cs.CL cs.AI

    DAGAM: Data Augmentation with Generation And Modification

    Authors: Byeong-Cheol Jo, Tak-Sung Heo, Yeongjoon Park, Yongmin Yoo, Won Ik Cho, Kyungsun Kim

    Abstract: Text classification is a representative downstream task of natural language processing, and has exhibited excellent performance since the advent of pre-trained language models based on Transformer architecture. However, in pre-trained language models, under-fitting often occurs due to the size of the model being very large compared to the amount of available training data. Along with significant i… ▽ More

    Submitted 6 April, 2022; originally announced April 2022.

  50. arXiv:2204.00089  [pdf, other

    cs.LG cs.AI cs.CR cs.CV

    Investigating Top-$k$ White-Box and Transferable Black-box Attack

    Authors: Chaoning Zhang, Philipp Benz, Adil Karjauv, Jae Won Cho, Kang Zhang, In So Kweon

    Abstract: Existing works have identified the limitation of top-$1$ attack success rate (ASR) as a metric to evaluate the attack strength but exclusively investigated it in the white-box setting, while our work extends it to a more practical black-box setting: transferable attack. It is widely reported that stronger I-FGSM transfers worse than simple FGSM, leading to a popular belief that transferability is… ▽ More

    Submitted 30 March, 2022; originally announced April 2022.

    Comments: Accepted by CVPR2022