Skip to main content

Showing 1–50 of 181 results for author: Ko, J

  1. arXiv:2407.03086  [pdf, other

    cs.LG cs.AI cs.DC

    Effective Heterogeneous Federated Learning via Efficient Hypernetwork-based Weight Generation

    Authors: Yujin Shin, Kichang Lee, Sungmin Lee, You Rim Choi, Hyung-Sin Kim, JeongGil Ko

    Abstract: While federated learning leverages distributed client resources, it faces challenges due to heterogeneous client capabilities. This necessitates allocating models suited to clients' resources and careful parameter aggregation to accommodate this heterogeneity. We propose HypeMeFed, a novel federated learning framework for supporting client heterogeneity by combining a multi-exit network architectu… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  2. arXiv:2406.09728  [pdf, other

    cs.CV cs.GR

    Neural Pose Representation Learning for Generating and Transferring Non-Rigid Object Poses

    Authors: Seungwoo Yoo, Juil Koo, Kyeongmin Yeo, Minhyuk Sung

    Abstract: We propose a novel method for learning representations of poses for 3D deformable objects, which specializes in 1) disentangling pose information from the object's identity, 2) facilitating the learning of pose variations, and 3) transferring pose information to other object identities. Based on these properties, our method enables the generation of 3D deformable objects with diversity in both ide… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  3. arXiv:2406.00920  [pdf, ps, other

    stat.ML cs.LG math.OC

    Demystifying SGD with Doubly Stochastic Gradients

    Authors: Kyurae Kim, Joohwan Ko, Yi-An Ma, Jacob R. Gardner

    Abstract: Optimization objectives in the form of a sum of intractable expectations are rising in importance (e.g., diffusion models, variational autoencoders, and many more), a setting also known as "finite sum with infinite data." For these problems, a popular strategy is to employ SGD with doubly stochastic gradients (doubly SGD): the expectations are estimated using the gradient estimator of each compone… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: Accepted to ICML'24

  4. arXiv:2405.17083  [pdf, other

    cs.CV

    F-3DGS: Factorized Coordinates and Representations for 3D Gaussian Splatting

    Authors: Xiangyu Sun, Joo Chan Lee, Daniel Rho, Jong Hwan Ko, Usman Ali, Eunbyung Park

    Abstract: The neural radiance field (NeRF) has made significant strides in representing 3D scenes and synthesizing novel views. Despite its advancements, the high computational costs of NeRF have posed challenges for its deployment in resource-constrained environments and real-time applications. As an alternative to NeRF-like neural rendering methods, 3D Gaussian Splatting (3DGS) offers rapid rendering spee… ▽ More

    Submitted 28 May, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

    Comments: Our project page including code is available at https://xiangyu1sun.github.io/Factorize-3DGS/

  5. arXiv:2405.11297  [pdf, other

    cs.CL

    Unveiling Key Aspects of Fine-Tuning in Sentence Embeddings: A Representation Rank Analysis

    Authors: Euna Jung, Jaeill Kim, Jungmin Ko, Jinwoo Park, Wonjong Rhee

    Abstract: The latest advancements in unsupervised learning of sentence embeddings predominantly involve employing contrastive learning-based (CL-based) fine-tuning over pre-trained language models. In this study, we analyze the latest sentence embedding methods by adopting representation rank as the primary tool of analysis. We first define Phase 1 and Phase 2 of fine-tuning based on when representation ran… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

  6. arXiv:2405.06208  [pdf, other

    cs.DS

    A Lock-free Binary Trie

    Authors: Jeremy Ko

    Abstract: A binary trie is a sequential data structure for a dynamic set on the universe $\{0,\dots,u-1\}$ supporting Search with $O(1)$ worst-case step complexity, and Insert, Delete, and Predecessor operations with $O(\log u)$ worst-case step complexity. We give a wait-free implementation of a relaxed binary trie, using read, write, CAS, and ($\log u$)-bit AND operations. It supports all operations with… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  7. arXiv:2404.18399  [pdf, other

    cs.CV

    Semantic Line Combination Detector

    Authors: Jinwon Ko, Dongkwon Jin, Chang-Su Kim

    Abstract: A novel algorithm, called semantic line combination detector (SLCD), to find an optimal combination of semantic lines is proposed in this paper. It processes all lines in each line combination at once to assess the overall harmony of the lines. First, we generate various line combinations from reliable lines. Second, we estimate the score of each line combination and determine the best one. Experi… ▽ More

    Submitted 1 May, 2024; v1 submitted 28 April, 2024; originally announced April 2024.

    Comments: CVPR 2024 accepted

  8. arXiv:2404.16012  [pdf, other

    cs.CV cs.MM

    GaussianTalker: Real-Time High-Fidelity Talking Head Synthesis with Audio-Driven 3D Gaussian Splatting

    Authors: Kyusun Cho, Joungbin Lee, Heeji Yoon, Yeobin Hong, Jaehoon Ko, Sangjun Ahn, Seungryong Kim

    Abstract: We propose GaussianTalker, a novel framework for real-time generation of pose-controllable talking heads. It leverages the fast rendering capabilities of 3D Gaussian Splatting (3DGS) while addressing the challenges of directly controlling 3DGS with speech audio. GaussianTalker constructs a canonical 3DGS representation of the head and deforms it in sync with the audio. A key insight is to encode t… ▽ More

    Submitted 25 April, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

    Comments: Project Page: https://ku-cvlab.github.io/GaussianTalker

  9. arXiv:2404.15707  [pdf, other

    cs.CV

    ESR-NeRF: Emissive Source Reconstruction Using LDR Multi-view Images

    Authors: Jinseo Jeong, Junseo Koo, Qimeng Zhang, Gunhee Kim

    Abstract: Existing NeRF-based inverse rendering methods suppose that scenes are exclusively illuminated by distant light sources, neglecting the potential influence of emissive sources within a scene. In this work, we confront this limitation using LDR multi-view images captured with emissive sources turned on and off. Two key issues must be addressed: 1) ambiguity arising from the limited dynamic range alo… ▽ More

    Submitted 6 June, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

    Comments: CVPR 2024

  10. An Optimal MPC Algorithm for Subunit-Monge Matrix Multiplication, with Applications to LIS

    Authors: Jaehyun Koo

    Abstract: We present an $O(1)$-round fully-scalable deterministic massively parallel algorithm for computing the min-plus matrix multiplication of unit-Monge matrices. We use this to derive a $O(\log n)$-round fully-scalable massively parallel algorithm for solving the exact longest increasing subsequence (LIS) problem. For a fully-scalable MPC regime, this result substantially improves the previously known… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: To appear in SPAA 2024

  11. Anarchy in the APSP: Algorithm and Hardness for Incorrect Implementation of Floyd-Warshall

    Authors: Jaehyun Koo

    Abstract: The celebrated Floyd-Warshall algorithm efficiently computes the all-pairs shortest path, and its simplicity made it a staple in computer science classes. Frequently, students discover a variant of this Floyd-Warshall algorithm by mixing up the loop order, ending up with the incorrect APSP matrix. This paper considers a computational problem of computing this incorrect APSP matrix. We will propose… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: To appear in FUN 2024

  12. arXiv:2404.03159  [pdf, other

    cs.CV

    HandDiff: 3D Hand Pose Estimation with Diffusion on Image-Point Cloud

    Authors: Wencan Cheng, Hao Tang, Luc Van Gool, Jong Hwan Ko

    Abstract: Extracting keypoint locations from input hand frames, known as 3D hand pose estimation, is a critical task in various human-computer interaction applications. Essentially, the 3D hand pose estimation can be regarded as a 3D point subset generative problem conditioned on input frames. Thanks to the recent significant progress on diffusion-based generative models, hand pose estimation can also benef… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: Accepted as a conference paper to the Conference on Computer Vision and Pattern Recognition (2024)

  13. arXiv:2404.02252  [pdf, other

    cs.SD eess.AS

    SMITIN: Self-Monitored Inference-Time INtervention for Generative Music Transformers

    Authors: Junghyun Koo, Gordon Wichern, Francois G. Germain, Sameer Khurana, Jonathan Le Roux

    Abstract: We introduce Self-Monitored Inference-Time INtervention (SMITIN), an approach for controlling an autoregressive generative music transformer using classifier probes. These simple logistic regression probes are trained on the output of each attention head in the transformer using a small dataset of audio examples both exhibiting and missing a specific musical trait (e.g., the presence/absence of dr… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

  14. arXiv:2404.00851  [pdf, other

    cs.CV

    Prompt Learning via Meta-Regularization

    Authors: Jinyoung Park, Juyeon Ko, Hyunwoo J. Kim

    Abstract: Pre-trained vision-language models have shown impressive success on various computer vision tasks with their zero-shot generalizability. Recently, prompt learning approaches have been explored to efficiently and effectively adapt the vision-language models to a variety of downstream tasks. However, most existing prompt learning methods suffer from task overfitting since the general knowledge of th… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: CVPR 2024

  15. arXiv:2403.20153  [pdf, other

    cs.CV

    Talk3D: High-Fidelity Talking Portrait Synthesis via Personalized 3D Generative Prior

    Authors: Jaehoon Ko, Kyusun Cho, Joungbin Lee, Heeji Yoon, Sangmin Lee, Sangjun Ahn, Seungryong Kim

    Abstract: Recent methods for audio-driven talking head synthesis often optimize neural radiance fields (NeRF) on a monocular talking portrait video, leveraging its capability to render high-fidelity and 3D-consistent novel-view frames. However, they often struggle to reconstruct complete face geometry due to the absence of comprehensive 3D information in the input monocular videos. In this paper, we introdu… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: Project page: https://ku-cvlab.github.io/Talk3D/

  16. arXiv:2403.16921  [pdf, other

    cs.CV

    PropTest: Automatic Property Testing for Improved Visual Programming

    Authors: Jaywon Koo, Ziyan Yang, Paola Cascante-Bonilla, Baishakhi Ray, Vicente Ordonez

    Abstract: Visual Programming has emerged as an alternative to end-to-end black-box visual reasoning models. This type of methods leverage Large Language Models (LLMs) to decompose a problem and generate the source code for an executable computer program. This strategy has the advantage of offering an interpretable reasoning path and does not require finetuning a model with task-specific data. We propose Pro… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Project Page: https://jaywonkoo17.github.io/PropTest/

  17. arXiv:2403.14370  [pdf, other

    cs.CV

    SyncTweedies: A General Generative Framework Based on Synchronized Diffusions

    Authors: Jaihoon Kim, Juil Koo, Kyeongmin Yeo, Minhyuk Sung

    Abstract: We introduce a general framework for generating diverse visual content, including ambiguous images, panorama images, mesh textures, and Gaussian splat textures, by synchronizing multiple diffusion processes. We present exhaustive investigation into all possible scenarios for synchronizing multiple diffusion processes through a canonical space and analyze their characteristics across applications.… ▽ More

    Submitted 20 June, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: Project page: https://synctweedies.github.io/

  18. arXiv:2403.07189  [pdf, ps, other

    cs.IT cond-mat.dis-nn math-ph math.ST

    A multiscale cavity method for sublinear-rank symmetric matrix factorization

    Authors: Jean Barbier, Justin Ko, Anas A. Rahman

    Abstract: We consider a statistical model for symmetric matrix factorization with additive Gaussian noise in the high-dimensional regime where the rank $M$ of the signal matrix to infer scales with its size $N$ as $M = o(N^{1/10})$. Allowing for a $N$-dependent rank offers new challenges and requires new methods. Working in the Bayesian-optimal setting, we show that whenever the signal has i.i.d. entries th… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  19. arXiv:2403.04234  [pdf, other

    stat.ML cs.LG

    Fundamental limits of Non-Linear Low-Rank Matrix Estimation

    Authors: Pierre Mergny, Justin Ko, Florent Krzakala, Lenka Zdeborová

    Abstract: We consider the task of estimating a low-rank matrix from non-linear and noisy observations. We prove a strong universality result showing that Bayes-optimal performances are characterized by an equivalent Gaussian model with an effective prior, whose parameters are entirely determined by an expansion of the non-linear function. In particular, we show that to reconstruct the signal accurately, one… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 42 pages, 2 figures

  20. arXiv:2403.04134  [pdf, other

    cs.RO

    An Adaptable, Safe, and Portable Robot-Assisted Feeding System

    Authors: Ethan Kroll Gordon, Rajat Kumar Jenamani, Amal Nanavati, Ziang Liu, Haya Bolotski, Raida Karim, Daniel Stabile, Atharva Kashyap, Bernie Hao Zhu, Xilai Dai, Tyler Schrenk, Jonathan Ko, Taylor Kessler Faulkner, Tapomayukh Bhattacharjee, Siddhartha Srinivasa

    Abstract: We demonstrate a robot-assisted feeding system that enables people with mobility impairments to feed themselves. Our system design embodies Safety, Portability, and User Control, with comprehensive full-stack safety checks, the ability to be mounted on and powered by any powered wheelchair, and a custom web-app allowing care-recipients to leverage their own assistive devices for robot control. For… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: HRI 2024 Demo; Corrected inaccurate author ordering in ACM DL which occurred due to formatting issues

  21. arXiv:2403.03695  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG math.PR math.ST

    Spectral Phase Transition and Optimal PCA in Block-Structured Spiked models

    Authors: Pierre Mergny, Justin Ko, Florent Krzakala

    Abstract: We discuss the inhomogeneous spiked Wigner model, a theoretical framework recently introduced to study structured noise in various learning scenarios, through the prism of random matrix theory, with a specific focus on its spectral properties. Our primary objective is to find an optimal spectral method and to extend the celebrated \cite{BBP} (BBP) phase transition criterion -- well-known in the ho… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 26 pages, 2 figures

  22. arXiv:2402.18545  [pdf, other

    cs.CY

    Crowdsourcing Dermatology Images with Google Search Ads: Creating a Real-World Skin Condition Dataset

    Authors: Abbi Ward, Jimmy Li, Julie Wang, Sriram Lakshminarasimhan, Ashley Carrick, Bilson Campana, Jay Hartford, Pradeep Kumar S, Tiya Tiyasirichokchai, Sunny Virmani, Renee Wong, Yossi Matias, Greg S. Corrado, Dale R. Webster, Dawn Siegel, Steven Lin, Justin Ko, Alan Karthikesalingam, Christopher Semturs, Pooja Rao

    Abstract: Background: Health datasets from clinical sources do not reflect the breadth and diversity of disease in the real world, impacting research, medical education, and artificial intelligence (AI) tool development. Dermatology is a suitable area to develop and test a new and scalable method to create representative health datasets. Methods: We used Google Search advertisements to invite contribution… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

  23. arXiv:2402.18293  [pdf, other

    cs.CV

    Continuous Memory Representation for Anomaly Detection

    Authors: Joo Chan Lee, Taejune Kim, Eunbyung Park, Simon S. Woo, Jong Hwan Ko

    Abstract: There have been significant advancements in anomaly detection in an unsupervised manner, where only normal images are available for training. Several recent methods aim to detect anomalies based on a memory, comparing or reconstructing the input with directly stored normal features (or trained features with normal images). However, such memory-based approaches operate on a discrete feature space i… ▽ More

    Submitted 10 March, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: Project page: https://tae-mo.github.io/crad/

  24. arXiv:2402.16506  [pdf, other

    cs.CV

    Stochastic Conditional Diffusion Models for Robust Semantic Image Synthesis

    Authors: Juyeon Ko, Inho Kong, Dogyun Park, Hyunwoo J. Kim

    Abstract: Semantic image synthesis (SIS) is a task to generate realistic images corresponding to semantic maps (labels). However, in real-world applications, SIS often encounters noisy user inputs. To address this, we propose Stochastic Conditional Diffusion Model (SCDM), which is a robust conditional diffusion model that features novel forward and generation processes tailored for SIS with noisy labels. It… ▽ More

    Submitted 3 June, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: ICML 2024

  25. arXiv:2402.15566  [pdf

    eess.IV cs.CV cs.LG

    Closing the AI generalization gap by adjusting for dermatology condition distribution differences across clinical settings

    Authors: Rajeev V. Rikhye, Aaron Loh, Grace Eunhae Hong, Preeti Singh, Margaret Ann Smith, Vijaytha Muralidharan, Doris Wong, Rory Sayres, Michelle Phung, Nicolas Betancourt, Bradley Fong, Rachna Sahasrabudhe, Khoban Nasim, Alec Eschholz, Basil Mustafa, Jan Freyberg, Terry Spitz, Yossi Matias, Greg S. Corrado, Katherine Chou, Dale R. Webster, Peggy Bui, Yuan Liu, Yun Liu, Justin Ko , et al. (1 additional authors not shown)

    Abstract: Recently, there has been great progress in the ability of artificial intelligence (AI) algorithms to classify dermatological conditions from clinical photographs. However, little is known about the robustness of these algorithms in real-world settings where several factors can lead to a loss of generalizability. Understanding and overcoming these limitations will permit the development of generali… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

  26. arXiv:2402.14196  [pdf, other

    cs.CV cs.GR

    Mip-Grid: Anti-aliased Grid Representations for Neural Radiance Fields

    Authors: Seungtae Nam, Daniel Rho, Jong Hwan Ko, Eunbyung Park

    Abstract: Despite the remarkable achievements of neural radiance fields (NeRF) in representing 3D scenes and generating novel view images, the aliasing issue, rendering "jaggies" or "blurry" images at varying camera distances, remains unresolved in most existing approaches. The recently proposed mip-NeRF has addressed this challenge by rendering conical frustums instead of rays. However, it relies on MLP ar… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: Accepted to NeurIPS 2023

  27. arXiv:2402.03898  [pdf, other

    cs.CL cs.AI cs.LG

    DistiLLM: Towards Streamlined Distillation for Large Language Models

    Authors: Jongwoo Ko, Sungnyun Kim, Tianyi Chen, Se-Young Yun

    Abstract: Knowledge distillation (KD) is widely used for compressing a teacher model to a smaller student model, reducing its inference cost and memory footprint while preserving model capabilities. However, current KD methods for auto-regressive sequence models (e.g., large language models) suffer from missing a standardized objective function. Moreover, the recent use of student-generated outputs to addre… ▽ More

    Submitted 3 July, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: ICML 2024; Code is available at https://github.com/jongwooko/distillm

  28. arXiv:2401.15894  [pdf, other

    cs.LG cs.AI

    A Gated MLP Architecture for Learning Topological Dependencies in Spatio-Temporal Graphs

    Authors: Yun Young Choi, Minho Lee, Sun Woo Park, Seunghwan Lee, Joohwan Ko

    Abstract: Graph Neural Networks (GNNs) and Transformer have been increasingly adopted to learn the complex vector representations of spatio-temporal graphs, capturing intricate spatio-temporal dependencies crucial for applications such as traffic datasets. Although many existing methods utilize multi-head attention mechanisms and message-passing neural networks (MPNNs) to capture both spatial and temporal r… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  29. arXiv:2401.10989  [pdf, other

    stat.ML cs.LG stat.CO

    Provably Scalable Black-Box Variational Inference with Structured Variational Families

    Authors: Joohwan Ko, Kyurae Kim, Woo Chang Kim, Jacob R. Gardner

    Abstract: Variational families with full-rank covariance approximations are known not to work well in black-box variational inference (BBVI), both empirically and theoretically. In fact, recent computational complexity results for BBVI have established that full-rank variational families scale poorly with the dimensionality of the problem compared to e.g. mean-field families. This is particularly critical t… ▽ More

    Submitted 1 June, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: Accepted to ICML'24

  30. arXiv:2401.09986  [pdf, other

    cs.LG cs.AI

    Improving Local Training in Federated Learning via Temperature Scaling

    Authors: Kichang Lee, Songkuk Kim, JeongGil Ko

    Abstract: Federated learning is inherently hampered by data heterogeneity: non-i.i.d. training data over local clients. We propose a novel model training approach for federated learning, FLex&Chill, which exploits the Logit Chilling method. Through extensive evaluations, we demonstrate that, in the presence of non-i.i.d. data characteristics inherent in federated learning systems, this approach can expedite… ▽ More

    Submitted 26 June, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: 24 pages

    MSC Class: 68 ACM Class: I.2.11

  31. arXiv:2401.09678  [pdf, other

    cs.SE cs.FL cs.LO eess.SY

    Integrating Graceful Degradation and Recovery through Requirement-driven Adaptation

    Authors: Simon Chu, Justin Koe, David Garlan, Eunsuk Kang

    Abstract: Cyber-physical systems (CPS) are subject to environmental uncertainties such as adverse operating conditions, malicious attacks, and hardware degradation. These uncertainties may lead to failures that put the system in a sub-optimal or unsafe state. Systems that are resilient to such uncertainties rely on two types of operations: (1) graceful degradation, to ensure that the system maintains an acc… ▽ More

    Submitted 8 April, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

    Comments: Pre-print for the SEAMS '24 conference (Software Engineering for Adaptive and Self-Managing Systems Conference)

  32. Transfer-Learning-Based Autotuning Using Gaussian Copula

    Authors: Thomas Randall, Jaehoon Koo, Brice Videau, Michael Kruse, Xingfu Wu, Paul Hovland, Mary Hall, Rong Ge, Prasanna Balaprakash

    Abstract: As diverse high-performance computing (HPC) systems are built, many opportunities arise for applications to solve larger problems than ever before. Given the significantly increased complexity of these HPC systems and application tuning, empirical performance tuning, such as autotuning, has emerged as a promising approach in recent years. Despite its effectiveness, autotuning is often a computatio… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

    Comments: 13 pages, 5 figures, 7 tables, the definitive version of this work is published in the Proceedings of the ACM International Conference on Supercomputing 2023, available at https://dl.acm.org/doi/10.1145/3577193.3593712

    ACM Class: I.2.4; G.3; D.2.8

    Journal ref: Proceedings of the 37th International Conference on Supercomputing (2023) 37-49

  33. arXiv:2401.03650  [pdf, other

    eess.AS cs.SD eess.SP

    DDD: A Perceptually Superior Low-Response-Time DNN-based Declipper

    Authors: Jayeon Yi, Junghyun Koo, Kyogu Lee

    Abstract: Clipping is a common nonlinear distortion that occurs whenever the input or output of an audio system exceeds the supported range. This phenomenon undermines not only the perception of speech quality but also downstream processes utilizing the disrupted signal. Therefore, a real-time-capable, robust, and low-response-time method for speech declipping (SD) is desired. In this work, we introduce DDD… ▽ More

    Submitted 7 January, 2024; originally announced January 2024.

    Comments: To appear, ICASSP 2024. Demo samples at https://stet-stet.github.io/DDD, repo at https://github.com/stet-stet/DDD

  34. arXiv:2312.08847  [pdf, other

    cs.AI cs.LG cs.NE stat.ML

    Knowledge-Driven Modulation of Neural Networks with Attention Mechanism for Next Activity Prediction

    Authors: Ivan Donadello, Jonghyeon Ko, Fabrizio Maria Maggi, Jan Mendling, Francesco Riva, Matthias Weidlich

    Abstract: Predictive Process Monitoring (PPM) aims at leveraging historic process execution data to predict how ongoing executions will continue up to their completion. In recent years, PPM techniques for the prediction of the next activities have matured significantly, mainly thanks to the use of Neural Networks (NNs) as a predictor. While their performance is difficult to beat in the general case, there a… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    MSC Class: 68T20 (Primary) 68T01; 68T05; 68T37 (Secondary) ACM Class: I.2.6; I.2.8; I.2.m

  35. arXiv:2311.15569  [pdf, other

    cs.CV cs.AI

    Improving Adaptability and Generalizability of Efficient Transfer Learning for Vision-Language Models

    Authors: Yongjin Yang, Jongwoo Ko, Se-Young Yun

    Abstract: Vision-Language Models (VLMs) like CLIP have demonstrated remarkable applicability across a variety of downstream tasks, including zero-shot image classification. Recently, the use of prompts or adapters for efficient transfer learning has gained significant attention for effectively adapting to downstream tasks. However, the roles of vision and text prompts, as well as adapters in terms of genera… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: 11 pages (19 pages including supplementary), 10 figures (12 figures including supplementary), 6 tables (17 tables including supplementary)

  36. arXiv:2311.14993  [pdf, other

    cs.CV

    Coordinate-Aware Modulation for Neural Fields

    Authors: Joo Chan Lee, Daniel Rho, Seungtae Nam, Jong Hwan Ko, Eunbyung Park

    Abstract: Neural fields, mapping low-dimensional input coordinates to corresponding signals, have shown promising results in representing various signals. Numerous methodologies have been proposed, and techniques employing MLPs and grid representations have achieved substantial success. MLPs allow compact and high expressibility, yet often suffer from spectral bias and slow convergence speed. On the other h… ▽ More

    Submitted 25 November, 2023; originally announced November 2023.

    Comments: Project page: http://maincold2.github.io/cam/

  37. arXiv:2311.13831  [pdf, other

    cs.CV

    Posterior Distillation Sampling

    Authors: Juil Koo, Chanho Park, Minhyuk Sung

    Abstract: We introduce Posterior Distillation Sampling (PDS), a novel optimization method for parametric image editing based on diffusion models. Existing optimization-based methods, which leverage the powerful 2D prior of diffusion models to handle various parametric images, have mainly focused on generation. Unlike generation, editing requires a balance between conforming to the target attribute and prese… ▽ More

    Submitted 31 March, 2024; v1 submitted 23 November, 2023; originally announced November 2023.

    Comments: Project page: https://posterior-distillation-sampling.github.io/

  38. arXiv:2311.13681  [pdf, other

    cs.CV cs.GR

    Compact 3D Gaussian Representation for Radiance Field

    Authors: Joo Chan Lee, Daniel Rho, Xiangyu Sun, Jong Hwan Ko, Eunbyung Park

    Abstract: Neural Radiance Fields (NeRFs) have demonstrated remarkable potential in capturing complex 3D scenes with high fidelity. However, one persistent challenge that hinders the widespread adoption of NeRFs is the computational bottleneck due to the volumetric rendering. On the other hand, 3D Gaussian splatting (3DGS) has recently emerged as an alternative representation that leverages a 3D Gaussisan-ba… ▽ More

    Submitted 15 February, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

    Comments: Project page: http://maincold2.github.io/c3dgs/

  39. arXiv:2311.09585  [pdf, other

    cs.CL

    LifeTox: Unveiling Implicit Toxicity in Life Advice

    Authors: Minbeom Kim, Jahyun Koo, Hwanhee Lee, Joonsuk Park, Hwaran Lee, Kyomin Jung

    Abstract: As large language models become increasingly integrated into daily life, detecting implicit toxicity across diverse contexts is crucial. To this end, we introduce LifeTox, a dataset designed for identifying implicit toxicity within a broad range of advice-seeking scenarios. Unlike existing safety datasets, LifeTox comprises diverse contexts derived from personal experiences through open-ended ques… ▽ More

    Submitted 18 March, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: 11 pages, 5 figures, NAACL 2024

  40. arXiv:2311.07607  [pdf, other

    cs.AI cs.LG

    Modeling Choice via Self-Attention

    Authors: Joohwan Ko, Andrew A. Li

    Abstract: Models of choice are a fundamental input to many now-canonical optimization problems in the field of Operations Management, including assortment, inventory, and price optimization. Naturally, accurate estimation of these models from data is a critical step in the application of these optimization problems in practice. Concurrently, recent advancements in deep learning have sparked interest in inte… ▽ More

    Submitted 8 February, 2024; v1 submitted 11 November, 2023; originally announced November 2023.

  41. arXiv:2310.20258  [pdf, other

    cs.LG

    Advancing Bayesian Optimization via Learning Correlated Latent Space

    Authors: Seunghun Lee, Jaewon Chu, Sihyeon Kim, Juyeon Ko, Hyunwoo J. Kim

    Abstract: Bayesian optimization is a powerful method for optimizing black-box functions with limited function evaluations. Recent works have shown that optimization in a latent space through deep generative models such as variational autoencoders leads to effective and efficient Bayesian optimization for structured or discrete data. However, as the optimization does not take place in the input space, it lea… ▽ More

    Submitted 19 November, 2023; v1 submitted 31 October, 2023; originally announced October 2023.

  42. arXiv:2310.17668  [pdf, other

    cs.LG

    Fine tuning Pre trained Models for Robustness Under Noisy Labels

    Authors: Sumyeong Ahn, Sihyeon Kim, Jongwoo Ko, Se-Young Yun

    Abstract: The presence of noisy labels in a training dataset can significantly impact the performance of machine learning models. To tackle this issue, researchers have explored methods for Learning with Noisy Labels to identify clean samples and reduce the influence of noisy labels. However, constraining the influence of a certain portion of the training dataset can result in a reduction in overall general… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: 10 pages (17 pages including supplementary)

    MSC Class: Computer Science; Artificial Intelligence

  43. arXiv:2310.15668  [pdf, other

    cs.SI cs.DB

    Hypergraph Motifs and Their Extensions Beyond Binary

    Authors: Geon Lee, Seokbum Yoon, Jihoon Ko, Hyunju Kim, Kijung Shin

    Abstract: Hypergraphs naturally represent group interactions, which are omnipresent in many domains: collaborations of researchers, co-purchases of items, and joint interactions of proteins, to name a few. In this work, we propose tools for answering the following questions: (Q1) what are the structural design principles of real-world hypergraphs? (Q2) how can we compare local structures of hypergraphs of d… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Extended version of VLDB 2020 paper arXiv:2003.01853

  44. arXiv:2310.10054  [pdf, other

    cs.CL cs.AI cs.LG

    NASH: A Simple Unified Framework of Structured Pruning for Accelerating Encoder-Decoder Language Models

    Authors: Jongwoo Ko, Seungjoon Park, Yujin Kim, Sumyeong Ahn, Du-Seong Chang, Euijai Ahn, Se-Young Yun

    Abstract: Structured pruning methods have proven effective in reducing the model size and accelerating inference speed in various network architectures such as Transformers. Despite the versatility of encoder-decoder models in numerous NLP tasks, the structured pruning methods on such models are relatively less explored compared to encoder-only models. In this study, we investigate the behavior of the struc… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Findings of the Association for Computational Linguistics: EMNLP 2023

  45. arXiv:2310.06511  [pdf, other

    cs.LG

    Self-Supervised Dataset Distillation for Transfer Learning

    Authors: Dong Bok Lee, Seanie Lee, Joonho Ko, Kenji Kawaguchi, Juho Lee, Sung Ju Hwang

    Abstract: Dataset distillation methods have achieved remarkable success in distilling a large dataset into a small set of representative samples. However, they are not designed to produce a distilled dataset that can be effectively used for facilitating self-supervised pre-training. To this end, we propose a novel problem of distilling an unlabeled dataset into a set of small synthetic samples for efficient… ▽ More

    Submitted 11 April, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

  46. arXiv:2310.05424  [pdf, other

    cs.CL

    Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding

    Authors: Sangmin Bae, Jongwoo Ko, Hwanjun Song, Se-Young Yun

    Abstract: To tackle the high inference latency exhibited by autoregressive language models, previous studies have proposed an early-exiting framework that allocates adaptive computation paths for each token based on the complexity of generating the subsequent token. However, we observed several shortcomings, including performance degradation caused by a state copying mechanism or numerous exit paths, and se… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 (Long)

  47. arXiv:2310.02823  [pdf, other

    cs.LG stat.ML

    Learning to Scale Logits for Temperature-Conditional GFlowNets

    Authors: Minsu Kim, Joohwan Ko, Taeyoung Yun, Dinghuai Zhang, Ling Pan, Woochang Kim, Jinkyoo Park, Emmanuel Bengio, Yoshua Bengio

    Abstract: GFlowNets are probabilistic models that sequentially generate compositional structures through a stochastic policy. Among GFlowNets, temperature-conditional GFlowNets can introduce temperature-based controllability for exploration and exploitation. We propose \textit{Logit-scaling GFlowNets} (Logit-GFN), a novel architectural design that greatly accelerates the training of temperature-conditional… ▽ More

    Submitted 2 June, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: ICML 2024, 23 pages, 21 figures

  48. arXiv:2310.00109  [pdf, other

    cs.LG cs.DC cs.DL

    FedAIoT: A Federated Learning Benchmark for Artificial Intelligence of Things

    Authors: Samiul Alam, Tuo Zhang, Tiantian Feng, Hui Shen, Zhichao Cao, Dong Zhao, JeongGil Ko, Kiran Somasundaram, Shrikanth S. Narayanan, Salman Avestimehr, Mi Zhang

    Abstract: There is a significant relevance of federated learning (FL) in the realm of Artificial Intelligence of Things (AIoT). However, most existing FL works do not use datasets collected from authentic IoT devices and thus do not capture unique modalities and inherent challenges of IoT data. To fill this critical gap, in this work, we introduce FedAIoT, an FL benchmark for AIoT. FedAIoT includes eight da… ▽ More

    Submitted 19 June, 2024; v1 submitted 29 September, 2023; originally announced October 2023.

  49. arXiv:2309.10310  [pdf, other

    cs.LG

    TensorCodec: Compact Lossy Compression of Tensors without Strong Data Assumptions

    Authors: Taehyung Kwon, Jihoon Ko, Jinhong Jung, Kijung Shin

    Abstract: Many real-world datasets are represented as tensors, i.e., multi-dimensional arrays of numerical values. Storing them without compression often requires substantial space, which grows exponentially with the order. While many tensor compression algorithms are available, many of them rely on strong data assumptions regarding its order, sparsity, rank, and smoothness. In this work, we propose TENSORC… ▽ More

    Submitted 20 September, 2023; v1 submitted 19 September, 2023; originally announced September 2023.

    Comments: Accepted to ICDM 2023 - IEEE International Conference on Data Mining 2023

  50. arXiv:2309.10069  [pdf

    q-bio.NC cs.AI

    Sex-based Disparities in Brain Aging: A Focus on Parkinson's Disease

    Authors: Iman Beheshti, Samuel Booth, Ji Hyun Ko

    Abstract: PD is linked to faster brain aging. Sex is recognized as an important factor in PD, such that males are twice as likely as females to have the disease and have more severe symptoms and a faster progression rate. Despite previous research, there remains a significant gap in understanding the function of sex in the process of brain aging in PD patients. The T1-weighted MRI-driven brain-predicted age… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: 35 pages, 5 figures