Skip to main content

Showing 1–50 of 103 results for author: Park, W

  1. arXiv:2407.06614  [pdf, other

    eess.IV cs.CV

    Implicit Regression in Subspace for High-Sensitivity CEST Imaging

    Authors: Chu Chen, Yang Liu, Se Weon Park, Jizhou Li, Kannie W. Y. Chan, Raymond H. F. Chan

    Abstract: Chemical Exchange Saturation Transfer (CEST) MRI demonstrates its capability in significantly enhancing the detection of proteins and metabolites with low concentrations through exchangeable protons. The clinical application of CEST, however, is constrained by its low contrast and low signal-to-noise ratio (SNR) in the acquired data. Denoising, as one of the post-processing stages for CEST data, c… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  2. arXiv:2406.15524  [pdf, other

    cs.CL cs.LG

    Rethinking Pruning Large Language Models: Benefits and Pitfalls of Reconstruction Error Minimization

    Authors: Sungbin Shin, Wonpyo Park, Jaeho Lee, Namhoon Lee

    Abstract: This work suggests fundamentally rethinking the current practice of pruning large language models (LLMs). The way it is done is by divide and conquer: split the model into submodels, sequentially prune them, and reconstruct predictions of the dense counterparts on small calibration data one at a time; the final model is obtained simply by putting the resulting sparse submodels together. While this… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  3. arXiv:2406.12016  [pdf, other

    cs.LG cs.CL

    Prefixing Attention Sinks can Mitigate Activation Outliers for Large Language Model Quantization

    Authors: Seungwoo Son, Wonpyo Park, Woohyun Han, Kyuyeun Kim, Jaeho Lee

    Abstract: Despite recent advances in LLM quantization, activation quantization remains to be challenging due to the activation outliers. Conventional remedies, e.g., mixing precisions for different channels, introduce extra overhead and reduce the speedup. In this work, we develop a simple yet effective strategy to facilitate per-tensor activation quantization by preventing the generation of problematic tok… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  4. The CoExplorer Technology Probe: A Generative AI-Powered Adaptive Interface to Support Intentionality in Planning and Running Video Meetings

    Authors: Gun Woo Warren Park, Payod Panda, Lev Tankelevitch, Sean Rintel

    Abstract: Effective meetings are effortful, but traditional videoconferencing systems offer little support for reducing this effort across the meeting lifecycle. Generative AI (GenAI) has the potential to radically redefine meetings by augmenting intentional meeting behaviors. CoExplorer, our novel adaptive meeting prototype, preemptively generates likely phases that meetings would undergo, tools that allow… ▽ More

    Submitted 29 May, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: DIS 2024

  5. arXiv:2405.17633  [pdf, other

    cs.CL

    HEART-felt Narratives: Tracing Empathy and Narrative Style in Personal Stories with LLMs

    Authors: Jocelyn Shen, Joel Mire, Hae Won Park, Cynthia Breazeal, Maarten Sap

    Abstract: Empathy serves as a cornerstone in enabling prosocial behaviors, and can be evoked through sharing of personal experiences in stories. While empathy is influenced by narrative content, intuitively, people respond to the way a story is told as well, through narrative style. Yet the relationship between empathy and narrative style is not fully understood. In this work, we empirically examine and qua… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  6. arXiv:2405.15708  [pdf, other

    cs.CL

    EmpathicStories++: A Multimodal Dataset for Empathy towards Personal Experiences

    Authors: Jocelyn Shen, Yubin Kim, Mohit Hulse, Wazeer Zulfikar, Sharifa Alghowinem, Cynthia Breazeal, Hae Won Park

    Abstract: Modeling empathy is a complex endeavor that is rooted in interpersonal and experiential dimensions of human interaction, and remains an open problem within AI. Existing empathy datasets fall short in capturing the richness of empathy responses, often being confined to in-lab or acted scenarios, lacking longitudinal data, and missing self-reported labels. We introduce a new multimodal dataset for e… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: Accepted to ACL 2024 Findings

  7. arXiv:2405.10536  [pdf, other

    cs.LG cs.AI

    Time-Varying Constraint-Aware Reinforcement Learning for Energy Storage Control

    Authors: Jaeik Jeong, Tai-Yeon Ku, Wan-Ki Park

    Abstract: Energy storage devices, such as batteries, thermal energy storages, and hydrogen systems, can help mitigate climate change by ensuring a more stable and sustainable power supply. To maximize the effectiveness of such energy storage, determining the appropriate charging and discharging amounts for each time period is crucial. Reinforcement learning is preferred over traditional optimization for the… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: ICLR 2024 Workshop: Tackling Climate Change with Machine Learning

  8. arXiv:2404.15155  [pdf, other

    cs.CL cs.AI cs.LG

    Adaptive Collaboration Strategy for LLMs in Medical Decision Making

    Authors: Yubin Kim, Chanwoo Park, Hyewon Jeong, Yik Siu Chan, Xuhai Xu, Daniel McDuff, Cynthia Breazeal, Hae Won Park

    Abstract: Foundation models have become invaluable in advancing the medical field. Despite their promise, the strategic deployment of LLMs for effective utility in complex medical tasks remains an open question. Our novel framework, Medical Decision-making Agents (MDAgents) aims to address this gap by automatically assigning the effective collaboration structure for LLMs. Assigned solo or group collaboratio… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  9. arXiv:2404.07554  [pdf, other

    cs.CV cs.AI

    CAT: Contrastive Adapter Training for Personalized Image Generation

    Authors: Jae Wan Park, Sang Hyun Park, Jun Young Koh, Junha Lee, Min Song

    Abstract: The emergence of various adapters, including Low-Rank Adaptation (LoRA) applied from the field of natural language processing, has allowed diffusion models to personalize image generation at a low cost. However, due to the various challenges including limited datasets and shortage of regularization and computation resources, adapter training often results in unsatisfactory outcomes, leading to the… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: CVPRW 2024

  10. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  11. arXiv:2403.14155  [pdf, other

    cs.CV

    Harmonizing Visual and Textual Embeddings for Zero-Shot Text-to-Image Customization

    Authors: Yeji Song, Jimyeong Kim, Wonhark Park, Wonsik Shin, Wonjong Rhee, Nojun Kwak

    Abstract: In a surge of text-to-image (T2I) models and their customization methods that generate new images of a user-provided subject, current works focus on alleviating the costs incurred by a lengthy per-subject optimization. These zero-shot customization methods encode the image of a specified subject into a visual embedding which is then utilized alongside the textual embedding for diffusion guidance.… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: Project page: https://ldynx.github.io/harmony-zero-t2i/

  12. arXiv:2403.11330  [pdf, other

    cs.CL cs.AI cs.HC cs.LG

    Improving Dialogue Agents by Decomposing One Global Explicit Annotation with Local Implicit Multimodal Feedback

    Authors: Dong Won Lee, Hae Won Park, Yoon Kim, Cynthia Breazeal, Louis-Philippe Morency

    Abstract: We describe an approach for aligning an LLM-based dialogue agent based on global (i.e., dialogue-level) rewards, while also taking into account naturally-occurring multimodal signals. At a high level, our approach (dubbed GELI) learns a local, turn-level reward model by decomposing the human-provided Global Explicit (GE) session-level reward, using Local Implicit (LI) multimodal reward signals to… ▽ More

    Submitted 22 April, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

    Comments: 10 pages, 3 figures, 2 tables

  13. arXiv:2403.04982  [pdf, other

    cs.AR

    A 28.6 mJ/iter Stable Diffusion Processor for Text-to-Image Generation with Patch Similarity-based Sparsity Augmentation and Text-based Mixed-Precision

    Authors: Jiwon Choi, Wooyoung Jo, Seongyon Hong, Beomseok Kwon, Wonhoon Park, Hoi-Jun Yoo

    Abstract: This paper presents an energy-efficient stable diffusion processor for text-to-image generation. While stable diffusion attained attention for high-quality image synthesis results, its inherent characteristics hinder its deployment on mobile platforms. The proposed processor achieves high throughput and energy efficiency with three key features as solutions: 1) Patch similarity-based sparsity augm… ▽ More

    Submitted 14 March, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

    Comments: Accepted at 2024 IEEE International Symposium on Circuits and Systems (ISCAS)

  14. arXiv:2403.01469  [pdf, other

    cs.CL

    KorMedMCQA: Multi-Choice Question Answering Benchmark for Korean Healthcare Professional Licensing Examinations

    Authors: Sunjun Kweon, Byungjin Choi, Minkyu Kim, Rae Woong Park, Edward Choi

    Abstract: We introduce KorMedMCQA, the first Korean multiple-choice question answering (MCQA) benchmark derived from Korean healthcare professional licensing examinations, covering from the year 2012 to year 2023. This dataset consists of a selection of questions from the license examinations for doctors, nurses, and pharmacists, featuring a diverse array of subjects. We conduct baseline experiments on vari… ▽ More

    Submitted 5 March, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

  15. arXiv:2402.02005  [pdf, other

    cs.LG

    Topology-Informed Graph Transformer

    Authors: Yun Young Choi, Sun Woo Park, Minho Lee, Youngho Woo

    Abstract: Transformers have revolutionized performance in Natural Language Processing and Vision, paving the way for their integration with Graph Neural Networks (GNNs). One key challenge in enhancing graph transformers is strengthening the discriminative power of distinguishing isomorphisms of graphs, which plays a crucial role in boosting their predictive performances. To address this challenge, we introd… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  16. arXiv:2401.15894  [pdf, other

    cs.LG cs.AI

    A Gated MLP Architecture for Learning Topological Dependencies in Spatio-Temporal Graphs

    Authors: Yun Young Choi, Minho Lee, Sun Woo Park, Seunghwan Lee, Joohwan Ko

    Abstract: Graph Neural Networks (GNNs) and Transformer have been increasingly adopted to learn the complex vector representations of spatio-temporal graphs, capturing intricate spatio-temporal dependencies crucial for applications such as traffic datasets. Although many existing methods utilize multi-head attention mechanisms and message-passing neural networks (MPNNs) to capture both spatial and temporal r… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  17. arXiv:2401.06866  [pdf, other

    cs.CL cs.AI cs.LG

    Health-LLM: Large Language Models for Health Prediction via Wearable Sensor Data

    Authors: Yubin Kim, Xuhai Xu, Daniel McDuff, Cynthia Breazeal, Hae Won Park

    Abstract: Large language models (LLMs) are capable of many natural language tasks, yet they are far from perfect. In health applications, grounding and interpreting domain-specific and non-linguistic data is crucial. This paper investigates the capacity of LLMs to make inferences about health based on contextual information (e.g. user demographics, health knowledge) and physiological data (e.g. resting hear… ▽ More

    Submitted 27 April, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

  18. Integrating Flow Theory and Adaptive Robot Roles: A Conceptual Model of Dynamic Robot Role Adaptation for the Enhanced Flow Experience in Long-term Multi-person Human-Robot Interactions

    Authors: Huili Chen, Sharifa Alghowinem, Cynthia Breazeal, Hae Won Park

    Abstract: In this paper, we introduce a novel conceptual model for a robot's behavioral adaptation in its long-term interaction with humans, integrating dynamic robot role adaptation with principles of flow experience from psychology. This conceptualization introduces a hierarchical interaction objective grounded in the flow experience, serving as the overarching adaptation goal for the robot. This objectiv… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

    Journal ref: Proceedings of the 2024 ACM/IEEE International Conference on Human - Robot Interaction (HRI24), March 11 - 14, 2024, Boulder, CO, USA

  19. Discordance Minimization-based Imputation Algorithms for Missing Values in Rating Data

    Authors: Young Woong Park, Jinhak Kim, Dan Zhu

    Abstract: Ratings are frequently used to evaluate and compare subjects in various applications, from education to healthcare, because ratings provide succinct yet credible measures for comparing subjects. However, when multiple rating lists are combined or considered together, subjects often have missing ratings, because most rating lists do not rate every subject in the combined list. In this study, we pro… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

  20. arXiv:2310.20187  [pdf, other

    cs.LG cs.AI

    Self-Supervised Pre-Training for Precipitation Post-Processor

    Authors: Sojung An, Junha Lee, Jiyeon Jang, Inchae Na, Wooyeon Park, Sujeong You

    Abstract: Obtaining a sufficient forecast lead time for local precipitation is essential in preventing hazardous weather events. Global warming-induced climate change increases the challenge of accurately predicting severe precipitation events, such as heavy rainfall. In this paper, we propose a deep learning-based precipitation post-processor for numerical weather prediction (NWP) models. The precipitation… ▽ More

    Submitted 19 February, 2024; v1 submitted 31 October, 2023; originally announced October 2023.

    Comments: 7 pages, 3 figures, 1 table, accepted to NeurIPS 2023 Workshop on Tackling Climate Change with Machine Learning at [this http URL](https://www.climatechange.ai/papers/neurips2023/18)

  21. arXiv:2310.16112  [pdf, other

    cs.CV

    Towards long-tailed, multi-label disease classification from chest X-ray: Overview of the CXR-LT challenge

    Authors: Gregory Holste, Yiliang Zhou, Song Wang, Ajay Jaiswal, Mingquan Lin, Sherry Zhuge, Yuzhe Yang, Dongkyun Kim, Trong-Hieu Nguyen-Mau, Minh-Triet Tran, Jaehyup Jeong, Wongi Park, Jongbin Ryu, Feng Hong, Arsh Verma, Yosuke Yamagishi, Changhyun Kim, Hyeryeong Seo, Myungjoo Kang, Leo Anthony Celi, Zhiyong Lu, Ronald M. Summers, George Shih, Zhangyang Wang, Yifan Peng

    Abstract: Many real-world image recognition problems, such as diagnostic medical imaging exams, are "long-tailed" $\unicode{x2013}$ there are a few common findings followed by many more relatively rare conditions. In chest radiography, diagnosis is both a long-tailed and multi-label problem, as patients often present with multiple findings simultaneously. While researchers have begun to study the problem of… ▽ More

    Submitted 1 April, 2024; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: Update after major revision

  22. arXiv:2310.00926  [pdf, other

    cs.LG

    Integration of Graph Neural Network and Neural-ODEs for Tumor Dynamic Prediction

    Authors: Omid Bazgir, Zichen Wang, Ji Won Park, Marc Hafner, James Lu

    Abstract: In anti-cancer drug development, a major scientific challenge is disentangling the complex relationships between high-dimensional genomics data from patient tumor samples, the corresponding tumor's organ of origin, the drug targets associated with given treatments and the resulting treatment response. Furthermore, to realize the aspirations of precision medicine in identifying and adjusting treatm… ▽ More

    Submitted 27 March, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

  23. arXiv:2309.01961  [pdf, other

    cs.CV

    NICE: CVPR 2023 Challenge on Zero-shot Image Captioning

    Authors: Taehoon Kim, Pyunghwan Ahn, Sangyun Kim, Sihaeng Lee, Mark Marsden, Alessandra Sala, Seung Hwan Kim, Bohyung Han, Kyoung Mu Lee, Honglak Lee, Kyounghoon Bae, Xiangyu Wu, Yi Gao, Hailiang Zhang, Yang Yang, Weili Guo, Jianfeng Lu, Youngtaek Oh, Jae Won Cho, Dong-jin Kim, In So Kweon, Junmo Kim, Wooyoung Kang, Won Young Jhoo, Byungseok Roh , et al. (17 additional authors not shown)

    Abstract: In this report, we introduce NICE (New frontiers for zero-shot Image Captioning Evaluation) project and share the results and outcomes of 2023 challenge. This project is designed to challenge the computer vision community to develop robust image captioning models that advance the state-of-the-art both in terms of accuracy and fairness. Through the challenge, the image captioning models were tested… ▽ More

    Submitted 10 September, 2023; v1 submitted 5 September, 2023; originally announced September 2023.

    Comments: Tech report, project page https://nice.lgresearch.ai/

  24. arXiv:2309.01670  [pdf, other

    q-bio.GN cs.LG

    Blind Biological Sequence Denoising with Self-Supervised Set Learning

    Authors: Nathan Ng, Ji Won Park, Jae Hyeon Lee, Ryan Lewis Kelly, Stephen Ra, Kyunghyun Cho

    Abstract: Biological sequence analysis relies on the ability to denoise the imprecise output of sequencing platforms. We consider a common setting where a short sequence is read out repeatedly using a high-throughput long-read platform to generate multiple subreads, or noisy observations of the same sequence. Denoising these subreads with alignment-based approaches often fails when too few subreads are avai… ▽ More

    Submitted 4 September, 2023; originally announced September 2023.

  25. arXiv:2308.05770  [pdf, other

    cs.CV

    Fine-Grained Self-Supervised Learning with Jigsaw Puzzles for Medical Image Classification

    Authors: Wongi Park, Jongbin Ryu

    Abstract: Classifying fine-grained lesions is challenging due to minor and subtle differences in medical images. This is because learning features of fine-grained lesions with highly minor differences is very difficult in training deep neural networks. Therefore, in this paper, we introduce Fine-Grained Self-Supervised Learning(FG-SSL) method for classifying subtle lesions in medical images. The proposed me… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

  26. arXiv:2308.05542  [pdf, other

    cs.CV

    Robust Asymmetric Loss for Multi-Label Long-Tailed Learning

    Authors: Wongi Park, Inhyuk Park, Sungeun Kim, Jongbin Ryu

    Abstract: In real medical data, training samples typically show long-tailed distributions with multiple labels. Class distribution of the medical data has a long-tailed shape, in which the incidence of different diseases is quite varied, and at the same time, it is not unusual for images taken from symptomatic patients to be multi-label diseases. Therefore, in this paper, we concurrently address these two i… ▽ More

    Submitted 10 August, 2023; originally announced August 2023.

    Journal ref: ICCVW 2023

  27. arXiv:2307.15017  [pdf, other

    cs.CR cs.LG

    Samplable Anonymous Aggregation for Private Federated Data Analysis

    Authors: Kunal Talwar, Shan Wang, Audra McMillan, Vojta Jina, Vitaly Feldman, Bailey Basile, Aine Cahill, Yi Sheng Chan, Mike Chatzidakis, Junye Chen, Oliver Chick, Mona Chitnis, Suman Ganta, Yusuf Goren, Filip Granqvist, Kristine Guo, Frederic Jacobs, Omid Javidbakht, Albert Liu, Richard Low, Dan Mascenik, Steve Myers, David Park, Wonhee Park, Gianni Parsa , et al. (11 additional authors not shown)

    Abstract: We revisit the problem of designing scalable protocols for private statistics and private federated learning when each device holds its private data. Our first contribution is to propose a simple primitive that allows for efficient implementation of several commonly used algorithms, and allows for privacy accounting that is close to that in the central setting without requiring the strong trust as… ▽ More

    Submitted 27 July, 2023; originally announced July 2023.

    Comments: 24 pages

  28. arXiv:2306.00344  [pdf, other

    cs.LG stat.ML

    BOtied: Multi-objective Bayesian optimization with tied multivariate ranks

    Authors: Ji Won Park, Nataša Tagasovska, Michael Maser, Stephen Ra, Kyunghyun Cho

    Abstract: Many scientific and industrial applications require the joint optimization of multiple, potentially competing objectives. Multi-objective Bayesian optimization (MOBO) is a sample-efficient framework for identifying Pareto-optimal solutions. At the heart of MOBO is the acquisition function, which determines the next candidate to evaluate by navigating the best compromises among the objectives. In t… ▽ More

    Submitted 7 June, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: 12 pages (+9 appendix), 13 figures. Accepted at ICML 2024

  29. arXiv:2305.19473  [pdf, other

    stat.ML cs.LG stat.CO

    Chain of Log-Concave Markov Chains

    Authors: Saeed Saremi, Ji Won Park, Francis Bach

    Abstract: We introduce a theoretical framework for sampling from unnormalized densities based on a smoothing scheme that uses an isotropic Gaussian kernel with a single fixed noise scale. We prove one can decompose sampling from a density (minimal assumptions made on the density) into a sequence of sampling from log-concave conditional densities via accumulation of noisy measurements with equal noise levels… ▽ More

    Submitted 28 September, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

  30. arXiv:2305.14246  [pdf, other

    cs.CL

    Modeling Empathic Similarity in Personal Narratives

    Authors: Jocelyn Shen, Maarten Sap, Pedro Colon-Hernandez, Hae Won Park, Cynthia Breazeal

    Abstract: The most meaningful connections between people are often fostered through expression of shared vulnerability and emotional experiences in personal narratives. We introduce a new task of identifying similarity in personal stories based on empathic resonance, i.e., the extent to which two people empathize with each others' experiences, as opposed to raw semantic or lexical similarity, as has predomi… ▽ More

    Submitted 6 December, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Published at EMNLP 2023

  31. arXiv:2305.12369  [pdf, other

    cs.CV cs.AI cs.LG

    HIINT: Historical, Intra- and Inter- personal Dynamics Modeling with Cross-person Memory Transformer

    Authors: Yubin Kim, Dong Won Lee, Paul Pu Liang, Sharifa Algohwinem, Cynthia Breazeal, Hae Won Park

    Abstract: Accurately modeling affect dynamics, which refers to the changes and fluctuations in emotions and affective displays during human conversations, is crucial for understanding human interactions. By analyzing affect dynamics, we can gain insights into how people communicate, respond to different situations, and form relationships. However, modeling affect dynamics is challenging due to contextual fa… ▽ More

    Submitted 21 May, 2023; originally announced May 2023.

  32. arXiv:2304.14082  [pdf, other

    cs.LG cs.SE

    JaxPruner: A concise library for sparsity research

    Authors: Joo Hyung Lee, Wonpyo Park, Nicole Mitchell, Jonathan Pilault, Johan Obando-Ceron, Han-Byul Kim, Namhoon Lee, Elias Frantar, Yun Long, Amir Yazdanbakhsh, Shivani Agrawal, Suvinay Subramanian, Xin Wang, Sheng-Chun Kao, Xingyao Zhang, Trevor Gale, Aart Bik, Woohyun Han, Milen Ferev, Zhonglin Han, Hong-Seok Kim, Yann Dauphin, Gintare Karolina Dziugaite, Pablo Samuel Castro, Utku Evci

    Abstract: This paper introduces JaxPruner, an open-source JAX-based pruning and sparse training library for machine learning research. JaxPruner aims to accelerate research on sparse neural networks by providing concise implementations of popular pruning and sparse training algorithms with minimal memory and latency overhead. Algorithms implemented in JaxPruner use a common API and work seamlessly with the… ▽ More

    Submitted 18 December, 2023; v1 submitted 27 April, 2023; originally announced April 2023.

    Comments: Jaxpruner is hosted at http://github.com/google-research/jaxpruner

  33. arXiv:2304.12204  [pdf, other

    cs.CV cs.AI cs.LG

    Multipar-T: Multiparty-Transformer for Capturing Contingent Behaviors in Group Conversations

    Authors: Dong Won Lee, Yubin Kim, Rosalind Picard, Cynthia Breazeal, Hae Won Park

    Abstract: As we move closer to real-world AI systems, AI agents must be able to deal with multiparty (group) conversations. Recognizing and interpreting multiparty behaviors is challenging, as the system must recognize individual behavioral cues, deal with the complexity of multiple streams of data from multiple people, and recognize the subtle contingent social exchanges that take place amongst group membe… ▽ More

    Submitted 19 April, 2023; originally announced April 2023.

    Comments: 7 pages, 4 figures, IJCAI

  34. arXiv:2302.07754  [pdf, other

    cs.LG

    SupSiam: Non-contrastive Auxiliary Loss for Learning from Molecular Conformers

    Authors: Michael Maser, Ji Won Park, Joshua Yao-Yu Lin, Jae Hyeon Lee, Nathan C. Frey, Andrew Watkins

    Abstract: We investigate Siamese networks for learning related embeddings for augmented samples of molecular conformers. We find that a non-contrastive (positive-pair only) auxiliary task aids in supervised training of Euclidean neural networks (E3NNs) and increases manifold smoothness (MS) around point-cloud geometries. We demonstrate this property for multiple drug-activity prediction tasks while maintain… ▽ More

    Submitted 15 February, 2023; originally announced February 2023.

    Comments: Submitted to the MLDD workshop, ICLR 2023

  35. Design, Modeling and Control of a Top-loading Fully-Actuated Cargo Transportation Multirotor

    Authors: Wooyong Park, Xiangyu Wu, Dongjae Lee, Seung Jae Lee

    Abstract: Existing multirotor-based cargo transportation does not maintain a constant cargo attitude due to underactuation; however, fragile payloads may require a consistent posture. The conventional method is also cumbersome when loading cargo, and the size of the cargo to be loaded is limited. To overcome these issues, we propose a new fully-actuated multirotor unmanned aerial vehicle platform capable of… ▽ More

    Submitted 17 January, 2023; originally announced January 2023.

    Comments: 8 pages, 8 figures, submitted to IEEE RA-L

  36. arXiv:2212.14128  [pdf, other

    cs.CV

    Joint Engagement Classification using Video Augmentation Techniques for Multi-person Human-robot Interaction

    Authors: Yubin Kim, Huili Chen, Sharifa Alghowinem, Cynthia Breazeal, Hae Won Park

    Abstract: Affect understanding capability is essential for social robots to autonomously interact with a group of users in an intuitive and reciprocal way. However, the challenge of multi-person affect understanding comes from not only the accurate perception of each user's affective state (e.g., engagement) but also the recognition of the affect interplay between the members (e.g., joint engagement) that p… ▽ More

    Submitted 28 December, 2022; originally announced December 2022.

  37. arXiv:2212.10504  [pdf, other

    cs.CL

    Can Current Task-oriented Dialogue Models Automate Real-world Scenarios in the Wild?

    Authors: Sang-Woo Lee, Sungdong Kim, Donghyeon Ko, Donghoon Ham, Youngki Hong, Shin Ah Oh, Hyunhoon Jung, Wangkyo Jung, Kyunghyun Cho, Donghyun Kwak, Hyungsuk Noh, Woomyoung Park

    Abstract: Task-oriented dialogue (TOD) systems are mainly based on the slot-filling-based TOD (SF-TOD) framework, in which dialogues are broken down into smaller, controllable units (i.e., slots) to fulfill a specific task. A series of approaches based on this framework achieved remarkable success on various TOD benchmarks. However, we argue that the current TOD benchmarks are limited to surrogate real-worl… ▽ More

    Submitted 24 May, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

  38. arXiv:2212.02421  [pdf, other

    cond-mat.mtrl-sci cs.LG physics.atom-ph

    Score-based denoising for atomic structure identification

    Authors: Tim Hsu, Babak Sadigh, Nicolas Bertin, Cheol Woo Park, James Chapman, Vasily Bulatov, Fei Zhou

    Abstract: We propose an effective method for removing thermal vibrations that complicate the task of analyzing complex dynamics in atomistic simulation of condensed matter. Our method iteratively subtracts thermal noises or perturbations in atomic positions using a denoising score function trained on synthetically noised but otherwise perfect crystal lattices. The resulting denoised structures clearly revea… ▽ More

    Submitted 3 May, 2023; v1 submitted 5 December, 2022; originally announced December 2022.

  39. arXiv:2211.10082  [pdf, other

    cs.CR

    Private Federated Statistics in an Interactive Setting

    Authors: Audra McMillan, Omid Javidbakht, Kunal Talwar, Elliot Briggs, Mike Chatzidakis, Junye Chen, John Duchi, Vitaly Feldman, Yusuf Goren, Michael Hesse, Vojta Jina, Anil Katti, Albert Liu, Cheney Lyford, Joey Meyer, Alex Palmer, David Park, Wonhee Park, Gianni Parsa, Paul Pelzl, Rehan Rishi, Congzheng Song, Shan Wang, Shundong Zhou

    Abstract: Privately learning statistics of events on devices can enable improved user experience. Differentially private algorithms for such problems can benefit significantly from interactivity. We argue that an aggregation protocol can enable an interactive private federated statistics system where user's devices maintain control of the privacy assurance. We describe the architecture of such a system, and… ▽ More

    Submitted 18 November, 2022; originally announced November 2022.

  40. arXiv:2211.07807  [pdf, other

    astro-ph.CO cs.LG

    Hierarchical Inference of the Lensing Convergence from Photometric Catalogs with Bayesian Graph Neural Networks

    Authors: Ji Won Park, Simon Birrer, Madison Ueland, Miles Cranmer, Adriano Agnello, Sebastian Wagner-Carena, Philip J. Marshall, Aaron Roodman, the LSST Dark Energy Science Collaboration

    Abstract: We present a Bayesian graph neural network (BGNN) that can estimate the weak lensing convergence ($κ$) from photometric measurements of galaxies along a given line of sight. The method is of particular interest in strong gravitational time delay cosmography (TDC), where characterizing the "external convergence" ($κ_{\rm ext}$) from the lens environment and line of sight is necessary for precise in… ▽ More

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: 15 pages, 8 figures (+ 6 pages, 2 figures in Appendix). Submitted to ApJ. Code at https://github.com/jiwoncpark/node-to-joy

  41. arXiv:2210.11068  [pdf, other

    cs.LG cs.CV cs.SD eess.AS

    Frequency of Interest-based Noise Attenuation Method to Improve Anomaly Detection Performance

    Authors: YeongHyeon Park, Myung Jin Kim, Won Seok Park

    Abstract: Accurately extracting driving events is the way to maximize computational efficiency and anomaly detection performance in the tire frictional nose-based anomaly detection task. This study proposes a concise and highly useful method for improving the precision of the event extraction that is hindered by extra noise such as wind noise, which is difficult to characterize clearly due to its randomness… ▽ More

    Submitted 2 December, 2022; v1 submitted 20 October, 2022; originally announced October 2022.

    Comments: 5 pages, 4 figures, 4 tables

  42. arXiv:2210.09394  [pdf

    cs.AI cs.LG

    Review Learning: Alleviating Catastrophic Forgetting with Generative Replay without Generator

    Authors: Jaesung Yoo, Sunghyuk Choi, Ye Seul Yang, Suhyeon Kim, Jieun Choi, Dongkyeong Lim, Yaeji Lim, Hyung Joon Joo, Dae Jung Kim, Rae Woong Park, Hyeong-Jin Yoon, Kwangsoo Kim

    Abstract: When a deep learning model is sequentially trained on different datasets, it forgets the knowledge acquired from previous data, a phenomenon known as catastrophic forgetting. It deteriorates performance of the deep learning model on diverse datasets, which is critical in privacy-preserving deep learning (PPDL) applications based on transfer learning (TL). To overcome this, we propose review learni… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

  43. arXiv:2210.08750  [pdf, other

    cs.CL cs.AI

    Keep Me Updated! Memory Management in Long-term Conversations

    Authors: Sanghwan Bae, Donghyun Kwak, Soyoung Kang, Min Young Lee, Sungdong Kim, Yuin Jeong, Hyeri Kim, Sang-Woo Lee, Woomyoung Park, Nako Sung

    Abstract: Remembering important information from the past and continuing to talk about it in the present are crucial in long-term conversations. However, previous literature does not deal with cases where the memorized information is outdated, which may cause confusion in later conversations. To address this issue, we present a novel task and a corresponding dataset of memory management in long-term convers… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: Accepted to EMNLP2022 Findings

  44. arXiv:2210.04096  [pdf, other

    cs.LG q-bio.QM

    PropertyDAG: Multi-objective Bayesian optimization of partially ordered, mixed-variable properties for biological sequence design

    Authors: Ji Won Park, Samuel Stanton, Saeed Saremi, Andrew Watkins, Henri Dwyer, Vladimir Gligorijevic, Richard Bonneau, Stephen Ra, Kyunghyun Cho

    Abstract: Bayesian optimization offers a sample-efficient framework for navigating the exploration-exploitation trade-off in the vast design space of biological sequences. Whereas it is possible to optimize the various properties of interest jointly using a multi-objective acquisition function, such as the expected hypervolume improvement (EHVI), this approach does not account for objectives with a hierarch… ▽ More

    Submitted 8 October, 2022; originally announced October 2022.

    Comments: 9 pages, 7 figures. Submitted to NeurIPS 2022 AI4Science Workshop

  45. arXiv:2208.13427  [pdf, other

    cs.LG math.AT

    The PWLR Graph Representation: A Persistent Weisfeiler-Lehman scheme with Random Walks for Graph Classification

    Authors: Sun Woo Park, Yun Young Choi, Dosang Joe, U Jin Choi, Youngho Woo

    Abstract: This paper presents the Persistent Weisfeiler-Lehman Random walk scheme (abbreviated as PWLR) for graph representations, a novel mathematical framework which produces a collection of explainable low-dimensional representations of graphs with discrete and continuous node features. The proposed scheme effectively incorporates normalized Weisfeiler-Lehman procedure, random walks on graphs, and persis… ▽ More

    Submitted 29 August, 2022; originally announced August 2022.

    Comments: Accepted to the ICML 2022 Workshop on Topology, Algebra, and Geometry in Machine Learning

  46. arXiv:2208.11258  [pdf, other

    cs.CV

    Applying Eigencontours to PolarMask-Based Instance Segmentation

    Authors: Wonhui Park, Dongkwon Jin, Chang-Su Kim

    Abstract: Eigencontours are the first data-driven contour descriptors based on singular value decomposition. Based on the implementation of ESE-Seg, eigencontours were applied to the instance segmentation task successfully. In this report, we incorporate eigencontours into the PolarMask network for instance segmentation. Experimental results demonstrate that the proposed algorithm yields better results than… ▽ More

    Submitted 23 August, 2022; originally announced August 2022.

  47. arXiv:2206.07018  [pdf, other

    cs.CV

    Turning a Curse into a Blessing: Enabling In-Distribution-Data-Free Backdoor Removal via Stabilized Model Inversion

    Authors: Si Chen, Yi Zeng, Jiachen T. Wang, Won Park, Xun Chen, Lingjuan Lyu, Zhuoqing Mao, Ruoxi Jia

    Abstract: Many backdoor removal techniques in machine learning models require clean in-distribution data, which may not always be available due to proprietary datasets. Model inversion techniques, often considered privacy threats, can reconstruct realistic training samples, potentially eliminating the need for in-distribution data. Prior attempts to combine backdoor removal and model inversion yielded limit… ▽ More

    Submitted 23 March, 2023; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: Because of an equation and author informational error, this paper has been withdrawn by the submitter

  48. arXiv:2205.04259  [pdf, other

    cs.LG q-bio.BM

    Multi-segment preserving sampling for deep manifold sampler

    Authors: Daniel Berenberg, Jae Hyeon Lee, Simon Kelow, Ji Won Park, Andrew Watkins, Vladimir Gligorijević, Richard Bonneau, Stephen Ra, Kyunghyun Cho

    Abstract: Deep generative modeling for biological sequences presents a unique challenge in reconciling the bias-variance trade-off between explicit biological insight and model flexibility. The deep manifold sampler was recently proposed as a means to iteratively sample variable-length protein sequences by exploiting the gradients from a function predictor. We introduce an alternative approach to this guide… ▽ More

    Submitted 9 May, 2022; originally announced May 2022.

  49. arXiv:2205.00176  [pdf, other

    cs.CL

    Building a Role Specified Open-Domain Dialogue System Leveraging Large-Scale Language Models

    Authors: Sanghwan Bae, Donghyun Kwak, Sungdong Kim, Donghoon Ham, Soyoung Kang, Sang-Woo Lee, Woomyoung Park

    Abstract: Recent open-domain dialogue models have brought numerous breakthroughs. However, building a chat system is not scalable since it often requires a considerable volume of human-human dialogue data, especially when enforcing features such as persona, style, or safety. In this work, we study the challenge of imposing roles on open-domain dialogue systems, with the goal of making the systems maintain c… ▽ More

    Submitted 30 April, 2022; originally announced May 2022.

    Comments: Accepted to NAACL2022 as a long paper

  50. arXiv:2204.13509  [pdf, other

    cs.CL

    On the Effect of Pretraining Corpora on In-context Learning by a Large-scale Language Model

    Authors: Seongjin Shin, Sang-Woo Lee, Hwijeen Ahn, Sungdong Kim, HyoungSeok Kim, Boseop Kim, Kyunghyun Cho, Gichang Lee, Woomyoung Park, Jung-Woo Ha, Nako Sung

    Abstract: Many recent studies on large-scale language models have reported successful in-context zero- and few-shot learning ability. However, the in-depth analysis of when in-context learning occurs is still lacking. For example, it is unknown how in-context learning performance changes as the training corpus varies. Here, we investigate the effects of the source and size of the pretraining corpus on in-co… ▽ More

    Submitted 8 May, 2022; v1 submitted 28 April, 2022; originally announced April 2022.

    Comments: Accepted to NAACL2022 as a long paper. Camera-ready version