Skip to main content

Showing 51–100 of 5,715 results for author: Kim, H

  1. arXiv:2406.15659  [pdf, other

    cs.LG cs.MA

    Contextual Sprint Classification in Soccer Based on Deep Learning

    Authors: Hyunsung Kim, Gun-Hee Joe, Jinsung Yoon, Sang-Ki Ko

    Abstract: The analysis of high-intensity runs (or sprints) in soccer has long been a topic of interest for sports science researchers and practitioners. In particular, recent studies suggested contextualizing sprints based on their tactical purposes to better understand the physical-tactical requirements of modern match-play. However, they have a limitation in scalability, as human experts have to manually… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: Accepted at IJCAI 2024 Workshop on Intelligent Technologies for Precision Sports Science (IT4PSS 2024)

  2. arXiv:2406.14571  [pdf, other

    cs.AR cs.AI cs.LG

    PreSto: An In-Storage Data Preprocessing System for Training Recommendation Models

    Authors: Yunjae Lee, Hyeseong Kim, Minsoo Rhu

    Abstract: Training recommendation systems (RecSys) faces several challenges as it requires the "data preprocessing" stage to preprocess an ample amount of raw data and feed them to the GPU for training in a seamless manner. To sustain high training throughput, state-of-the-art solutions reserve a large fleet of CPU servers for preprocessing which incurs substantial deployment cost and power consumption. Our… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Journal ref: Published at 51th IEEE/ACM International Symposium on Computer Architecture (ISCA-51), 2024

  3. arXiv:2406.13935  [pdf, other

    eess.AS cs.AI cs.SD

    CONMOD: Controllable Neural Frame-based Modulation Effects

    Authors: Gyubin Lee, Hounsu Kim, Junwon Lee, Juhan Nam

    Abstract: Deep learning models have seen widespread use in modelling LFO-driven audio effects, such as phaser and flanger. Although existing neural architectures exhibit high-quality emulation of individual effects, they do not possess the capability to manipulate the output via control parameters. To address this issue, we introduce Controllable Neural Frame-based Modulation Effects (CONMOD), a single blac… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  4. arXiv:2406.13474  [pdf, other

    cs.LG cs.AI

    Attention-aware Post-training Quantization without Backpropagation

    Authors: Junhan Kim, Ho-young Kim, Eulrang Cho, Chungman Lee, Joonyoung Kim, Yongkweon Jeon

    Abstract: Quantization is a promising solution for deploying large-scale language models (LLMs) on resource-constrained devices. Existing quantization approaches, however, rely on gradient-based optimization, regardless of it being post-training quantization (PTQ) or quantization-aware training (QAT), which becomes problematic for hyper-scale LLMs with billions of parameters. This overhead can be alleviated… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 20 pages, under review

  5. arXiv:2406.12721  [pdf

    eess.AS cs.SD

    Sound event detection based on auxiliary decoder and maximum probability aggregation for DCASE Challenge 2024 Task 4

    Authors: Sang Won Son, Jongyeon Park, Hong Kook Kim, Sulaiman Vesal, Jeong Eun Lim

    Abstract: In this report, we propose three novel methods for developing a sound event detection (SED) model for the DCASE 2024 Challenge Task 4. First, we propose an auxiliary decoder attached to the final convolutional block to improve feature extraction capabilities while reducing dependency on embeddings from pre-trained large models. The proposed auxiliary decoder operates independently from the main de… ▽ More

    Submitted 24 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: DCASE 2024 challenge Task4, 4 pages

  6. arXiv:2406.12258  [pdf, other

    cs.CV

    Advancing Cross-Domain Generalizability in Face Anti-Spoofing: Insights, Design, and Metrics

    Authors: Hyojin Kim, Jiyoon Lee, Yonghyun Jeong, Haneol Jang, YoungJoon Yoo

    Abstract: This paper presents a novel perspective for enhancing anti-spoofing performance in zero-shot data domain generalization. Unlike traditional image classification tasks, face anti-spoofing datasets display unique generalization characteristics, necessitating novel zero-shot data domain generalization. One step forward to the previous frame-wise spoofing prediction, we introduce a nuanced metric calc… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 10 pages with 4 figures, Accepted by CVPRW 2024

  7. arXiv:2406.11961  [pdf, other

    hep-ph hep-ex

    Elaborating Higgs to dimuon decay from gluon fusion by decorrelation and jet substructure

    Authors: Subin Han, Hyung Do Kim

    Abstract: Discovery of the Higgs boson decay to dimuon is anticipated soon based on the current evidence. Precise categorization of the events without affecting the invariant mass shape is crucial in the analysis. Decorrelation of the invariant mass and the output of discriminators (the score of discriminators) is essential for consistent and precise analysis. In this paper we use distance correlation as th… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 25 pages, 7 figures, 7 tables

  8. arXiv:2406.11574  [pdf, ps, other

    quant-ph

    Non-unitary Coupled Cluster Enabled by Mid-circuit Measurements on Quantum Computers

    Authors: Alexandre Fleury, James Brown, Erika Lloyd, Maritza Hernandez, Isaac H. Kim

    Abstract: Many quantum algorithms rely on a quality initial state for optimal performance. Preparing an initial state for specific applications can considerably reduce the cost of probabilistic algorithms such as the well studied quantum phase estimation (QPE). Fortunately, in the application space of quantum chemistry, generating approximate wave functions for molecular systems is well studied, and quantum… ▽ More

    Submitted 28 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: 26 pages, 6 figures; title changed, references added

  9. arXiv:2406.11378  [pdf, ps, other

    math.GR math.GT

    Non-freeness of parabolic two-generator groups

    Authors: Philip Choi, Kyeonghee Jo, Hyuk Kim, Junho Lee

    Abstract: A complex number $λ$ is said to be non-free if the subgroup of $SL(2,\bc)$ generated by $$X=\begin{pmatrix} 1& 1\\ 0 & 1 \end{pmatrix} \,\, \text{and}\,\,\,Y_λ=\begin{pmatrix} 1& 0\\ λ& 1 \end{pmatrix}$$ is not a free group of rank 2. In this case the number $λ$ is called a relation number, and it has been a long standing problem to determine the relation numbers. In this paper, we characteriz… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 43 pages, 2 figures

    MSC Class: 20E05; 11B39; 11J70; 30F35; 30F40

  10. arXiv:2406.11313  [pdf, other

    cs.CV

    Semi-Supervised Domain Adaptation Using Target-Oriented Domain Augmentation for 3D Object Detection

    Authors: Yecheol Kim, Junho Lee, Changsoo Park, Hyoung won Kim, Inho Lim, Christopher Chang, Jun Won Choi

    Abstract: 3D object detection is crucial for applications like autonomous driving and robotics. However, in real-world environments, variations in sensor data distribution due to sensor upgrades, weather changes, and geographic differences can adversely affect detection performance. Semi-Supervised Domain Adaptation (SSDA) aims to mitigate these challenges by transferring knowledge from a source domain, abu… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Accepted to IEEE Transactions on Intelligent Vehicles (T-IV). The code is available at: https://github.com/rasd3/TODA

  11. arXiv:2406.11248  [pdf

    eess.AS cs.AI cs.SD

    Performance Improvement of Language-Queried Audio Source Separation Based on Caption Augmentation From Large Language Models for DCASE Challenge 2024 Task 9

    Authors: Do Hyun Lee, Yoonah Song, Hong Kook Kim

    Abstract: We present a prompt-engineering-based text-augmentation approach applied to a language-queried audio source separation (LASS) task. To enhance the performance of LASS, the proposed approach utilizes large language models (LLMs) to generate multiple captions corresponding to each sentence of the training dataset. To this end, we first perform experiments to identify the most effective prompts for c… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: DCASE 2024 Challenge Task 9, 4 pages

  12. arXiv:2406.11244  [pdf, other

    cs.LG cs.AI

    SpoT-Mamba: Learning Long-Range Dependency on Spatio-Temporal Graphs with Selective State Spaces

    Authors: Jinhyeok Choi, Heehyeon Kim, Minhyeong An, Joyce Jiyoung Whang

    Abstract: Spatio-temporal graph (STG) forecasting is a critical task with extensive applications in the real world, including traffic and weather forecasting. Although several recent methods have been proposed to model complex dynamics in STGs, addressing long-range spatio-temporal dependencies remains a significant challenge, leading to limited performance gains. Inspired by a recently proposed state space… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 6 pages, 2 figures, 3 tables. Spatio-Temporal Reasoning and Learning (STRL) Workshop at the 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024)

  13. arXiv:2406.10996  [pdf, other

    cs.CL

    THEANINE: Revisiting Memory Management in Long-term Conversations with Timeline-augmented Response Generation

    Authors: Seo Hyun Kim, Kai Tzu-iunn Ong, Taeyoon Kwon, Namyoung Kim, Keummin Ka, SeongHyeon Bae, Yohan Jo, Seung-won Hwang, Dongha Lee, Jinyoung Yeo

    Abstract: Large language models (LLMs) are capable of processing lengthy dialogue histories during prolonged interaction with users without additional memory modules; however, their responses tend to overlook or incorrectly recall information from the past. In this paper, we revisit memory-augmented response generation in the era of LLMs. While prior work focuses on getting rid of outdated memories, we argu… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: Under Review

  14. arXiv:2406.10671  [pdf

    cs.CL

    Augmenting Biomedical Named Entity Recognition with General-domain Resources

    Authors: Yu Yin, Hyunjae Kim, Xiao Xiao, Chih Hsuan Wei, Jaewoo Kang, Zhiyong Lu, Hua Xu, Meng Fang, Qingyu Chen

    Abstract: Training a neural network-based biomedical named entity recognition (BioNER) model usually requires extensive and costly human annotations. While several studies have employed multi-task learning with multiple BioNER datasets to reduce human effort, this approach does not consistently yield performance improvements and may introduce label ambiguity in different biomedical corpora. We aim to tackle… ▽ More

    Submitted 18 June, 2024; v1 submitted 15 June, 2024; originally announced June 2024.

    Comments: We make data, codes, and models publicly available via https://github.com/qingyu-qc/bioner_gerbera

  15. arXiv:2406.10549  [pdf, other

    eess.AS cs.CL cs.SD

    Lightweight Audio Segmentation for Long-form Speech Translation

    Authors: Jaesong Lee, Soyoon Kim, Hanbyul Kim, Joon Son Chung

    Abstract: Speech segmentation is an essential part of speech translation (ST) systems in real-world scenarios. Since most ST models are designed to process speech segments, long-form audio must be partitioned into shorter segments before translation. Recently, data-driven approaches for the speech segmentation task have been developed. Although the approaches improve overall translation quality, a performan… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: Accepted to Interspeech 2024

  16. arXiv:2406.09905  [pdf, other

    cs.CV cs.GR

    Nymeria: A Massive Collection of Multimodal Egocentric Daily Motion in the Wild

    Authors: Lingni Ma, Yuting Ye, Fangzhou Hong, Vladimir Guzov, Yifeng Jiang, Rowan Postyeni, Luis Pesqueira, Alexander Gamino, Vijay Baiyya, Hyo Jin Kim, Kevin Bailey, David Soriano Fosas, C. Karen Liu, Ziwei Liu, Jakob Engel, Renzo De Nardi, Richard Newcombe

    Abstract: We introduce Nymeria - a large-scale, diverse, richly annotated human motion dataset collected in the wild with multiple multimodal egocentric devices. The dataset comes with a) full-body 3D motion ground truth; b) egocentric multimodal recordings from Project Aria devices with RGB, grayscale, eye-tracking cameras, IMUs, magnetometer, barometer, and microphones; and c) an additional "observer" dev… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  17. arXiv:2406.09698  [pdf, other

    physics.ins-det hep-ex

    Projected background and sensitivity of AMoRE-II

    Authors: A. Agrawal, V. V. Alenkov, P. Aryal, J. Beyer, B. Bhandari, R. S. Boiko, K. Boonin, O. Buzanov, C. R. Byeon, N. Chanthima, M. K. Cheoun, J. S. Choe, Seonho Choi, S. Choudhury, J. S. Chung, F. A. Danevich, M. Djamal, D. Drung, C. Enss, A. Fleischmann, A. M. Gangapshev, L. Gastaldo, Y. M. Gavrilyuk, A. M. Gezhaev, O. Gileva , et al. (81 additional authors not shown)

    Abstract: AMoRE-II aims to search for neutrinoless double beta decay with an array of 423 Li$_2$$^{100}$MoO$_4$ crystals operating in the cryogenic system as the main phase of the Advanced Molybdenum-based Rare process Experiment (AMoRE). AMoRE has been planned to operate in three phases: AMoRE-pilot, AMoRE-I, and AMoRE-II. AMoRE-II is currently being installed at the Yemi Underground Laboratory, located ap… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  18. arXiv:2406.08612  [pdf, other

    astro-ph.HE

    Observation of Declination Dependence in the Cosmic Ray Energy Spectrum

    Authors: The Telescope Array Collaboration, R. U. Abbasi, T. Abu-Zayyad, M. Allen, J. W. Belz, D. R. Bergman, I. Buckland, W. Campbell, B. G. Cheon, K. Endo, A. Fedynitch, T. Fujii, K. Fujisue, K. Fujita, M. Fukushima, G. Furlich, Z. Gerber, N. Globus, W. Hanlon, N. Hayashida, H. He, K. Hibino, R. Higuchi, D. Ikeda, T. Ishii , et al. (101 additional authors not shown)

    Abstract: We report on an observation of the difference between northern and southern skies of the ultrahigh energy cosmic ray energy spectrum with a significance of ${\sim}8σ$. We use measurements from the two largest experiments$\unicode{x2014}$the Telescope Array observing the northern hemisphere and the Pierre Auger Observatory viewing the southern hemisphere. Since the comparison of two measurements fr… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 8 pages, 6 figures

  19. arXiv:2406.08301  [pdf, other

    nucl-ex

    Jet modification via $π^0$-hadron correlations in Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV

    Authors: PHENIX Collaboration, N. J. Abdulameer, U. Acharya, A. Adare, S. Afanasiev, C. Aidala, N. N. Ajitanand, Y. Akiba, H. Al-Bataineh, J. Alexander, M. Alfred, K. Aoki, N. Apadula, L. Aphecetche, J. Asai, H. Asano, E. T. Atomssa, R. Averbeck, T. C. Awes, B. Azmoun, V. Babintsev, M. Bai, G. Baksay, L. Baksay, A. Baldisseri , et al. (510 additional authors not shown)

    Abstract: High-momentum two-particle correlations are a useful tool for studying jet-quenching effects in the quark-gluon plasma. Angular correlations between neutral-pion triggers and charged hadrons with transverse momenta in the range 4--12~GeV/$c$ and 0.5--7~GeV/$c$, respectively, have been measured by the PHENIX experiment in 2014 for Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$~GeV. Suppression is obs… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 534 authors from 83 institutions, 12 pages, 7 figures. v1 is version submitted to Physical Review C. HEPdata tables for the points plotted in figures for this and previous PHENIX publications are (or will be) publicly available at http://www.phenix.bnl.gov/papers.html

  20. arXiv:2406.08176  [pdf, other

    cs.CV cs.RO

    Category-level Neural Field for Reconstruction of Partially Observed Objects in Indoor Environment

    Authors: Taekbeom Lee, Youngseok Jang, H. Jin Kim

    Abstract: Neural implicit representation has attracted attention in 3D reconstruction through various success cases. For further applications such as scene understanding or editing, several works have shown progress towards object compositional reconstruction. Despite their superior performance in observed regions, their performance is still limited in reconstructing objects that are partially observed. To… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: RA-L. 8 pages, 8 figures, 4 tables

  21. arXiv:2406.08140  [pdf

    q-bio.NC

    Functional voxel hierarchy and afferent capacity revealed mental state transition on dynamic correlation resting-state fMRI

    Authors: Dong Soo Lee, Hyun Joo Kim, Youngmin Huh, Yeon Koo Kang, Wonseok Whi, Hyekyoung Lee, Hyejin Kang

    Abstract: Voxel hierarchy on dynamic brain graphs is produced by k core percolation on functional dynamic amplitude correlation of resting-state fMRI. Directed graphs and their afferent/efferent capacities are produced by Markov modeling of the universal cover of undirected graphs simultaneously with the calculation of volume entropy. Positive and unsigned negative brain graphs were analyzed separately on s… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  22. arXiv:2406.07909  [pdf, other

    eess.AS cs.CL cs.SD stat.ML

    Guiding Frame-Level CTC Alignments Using Self-knowledge Distillation

    Authors: Eungbeom Kim, Hantae Kim, Kyogu Lee

    Abstract: Transformer encoder with connectionist temporal classification (CTC) framework is widely used for automatic speech recognition (ASR). However, knowledge distillation (KD) for ASR displays a problem of disagreement between teacher-student models in frame-level alignment which ultimately hinders it from improving the student model's performance. In order to resolve this problem, this paper introduce… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  23. arXiv:2406.07130  [pdf, other

    physics.plasm-ph

    Assessing the Impact of Alpha Particles on Thermal Confinement in JET D-T Plasmas through Global GENE-Tango Simulations

    Authors: A. Di Siena, J. Garcia, R. Bilato, K. Kirov, J. Varela A. Banon Navarro, Hyun-Tae Kim, C. Challis, J. Hobirk, A. Kappatou, E. Lerche, D. Spong, C. Angioni, T. Gorler, E. Poli, M. Bergmann, F. Jenko, JET contributors

    Abstract: The capability of the global, electromagnetic gyrokinetic GENE code interfaced with the transport Tango solver is exploited to address the impact of fusion alpha particles (in their dual role of fast particles and heating source) on plasma profiles and performance at JET in the discharges with the highest quasi-stationary peak fusion power during the DTE2 experimental campaigns. Employing radially… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  24. arXiv:2406.06976  [pdf, other

    cs.LG cs.AI

    Discrete Dictionary-based Decomposition Layer for Structured Representation Learning

    Authors: Taewon Park, Hyun-Chul Kim, Minho Lee

    Abstract: Neuro-symbolic neural networks have been extensively studied to integrate symbolic operations with neural networks, thereby improving systematic generalization. Specifically, Tensor Product Representation (TPR) framework enables neural networks to perform differentiable symbolic operations by encoding the symbolic structure of data within vector spaces. However, TPR-based neural networks often str… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  25. arXiv:2406.06913  [pdf

    cond-mat.str-el

    Frustrated phonon with charge density wave in vanadium Kagome metal

    Authors: Seung-Phil Heo, Choongjae Won, Heemin Lee, Hanbyul Kim, Eunyoung Park, Sung Yun Lee, Junha Hwang, Hyeongi Choi, Sang-Youn Park, Byungjune Lee, Woo-Suk Noh, Hoyoung Jang, Jae-Hoon Park, Dongbin Shin, Changyong Song

    Abstract: Crystals with unique ionic arrangements and strong electronic correlations serve as a fertile ground for the emergence of exotic phases, as evidenced by the coexistence of charge density wave (CDW) and superconductivity in vanadium Kagome metals, specifically AV3Sb5 (where A represents K, Rb, or Cs). The formation of a star of David CDW superstructure, resulting from the coordinated displacements… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Manuscript: 20 pages, 4 figures, SI: 14 pages, 8 figures

  26. arXiv:2406.06704  [pdf, other

    hep-th

    Exploring new constraints on Kahler moduli space of 6d N = 1 Supergravity

    Authors: Hee-Cheol Kim, Cumrun Vafa

    Abstract: We propose new constraints for 6d (1, 0) supergravity theories based on consistency conditions on the Kahler moduli spaces of their 5d reductions. The requirement that both the metric and the BPS string tensions in the Kahler moduli space are positive imposes specific restrictions on the Chern-Simons coefficients in the 5d effective Lagrangians that are derived from the Kaluza-Klein reductions of… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 32 pages

  27. arXiv:2406.06222  [pdf, other

    cond-mat.soft

    Shear thickening in suspensions of particles with dynamic brush layers

    Authors: Hojin Kim, Michael van der Naald, Finn A. Braaten, Thomas A. Witten, Stuart J. Rowan, Heinrich M. Jaeger

    Abstract: Control of frictional interactions among liquid-suspended particles has led to tunable, strikingly non-Newtonian rheology via the formation of strong flow constraints as particles come into close proximity under shear. Typically, these frictional interactions have been in the form of physical contact, controllable via particle shape and surface roughness. We investigate a different route, where mo… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  28. arXiv:2406.06149  [pdf, other

    cs.LG stat.ML

    Decoupled Marked Temporal Point Process using Neural Ordinary Differential Equations

    Authors: Yujee Song, Donghyun Lee, Rui Meng, Won Hwa Kim

    Abstract: A Marked Temporal Point Process (MTPP) is a stochastic process whose realization is a set of event-time data. MTPP is often used to understand complex dynamics of asynchronous temporal events such as money transaction, social media, healthcare, etc. Recent studies have utilized deep neural networks to capture complex temporal dependencies of events and generate embedding that aptly represent the o… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 18 pages, 8 figures, The Twelfth International Conference on Learning Representations (ICLR 2024)

  29. arXiv:2406.06117  [pdf, other

    hep-ex

    Exclusion of the Cosmological Triangle in Reactor-Based Search for Axion-Like Particles

    Authors: Byung Ju Park, Jae Jin Choi, Eunju Jeon, Jinyu Kim, Kyungwon Kim, Sung Hyun Kim, Sun Kee Kim, Yeongduk Kim, Young Ju Ko, Byoung-Cheol Koh, Chang Hyon Ha, Seo Hyun Lee, In Soo Lee, Hyunseok Lee, Hyun Su Lee, Jaison Lee, Yoomin Oh, Doojin Kim

    Abstract: We report new constraints on axion-like particle (ALP) using data corresponding to a sodium iodine target exposure of 3063 kg$\cdot$days from the neutrino elastic scattering observation with NaI (NEON) experiment. A 16.7 kg of thallium-doped sodium iodide target was located 23.7 meters from a 2.8 GW thermal power nuclear reactor. We searched for ALPs produced by high-flux photons by comparing the… ▽ More

    Submitted 11 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  30. arXiv:2406.06072  [pdf, other

    cs.CV cs.LG cs.RO

    Adapting Pretrained ViTs with Convolution Injector for Visuo-Motor Control

    Authors: Dongyoon Hwang, Byungkun Lee, Hojoon Lee, Hyunseung Kim, Jaegul Choo

    Abstract: Vision Transformers (ViT), when paired with large-scale pretraining, have shown remarkable performance across various computer vision tasks, primarily due to their weak inductive bias. However, while such weak inductive bias aids in pretraining scalability, this may hinder the effective adaptation of ViTs for visuo-motor control tasks as a result of the absence of control-centric inductive biases.… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: accepted to ICML 2024

  31. arXiv:2406.05221  [pdf, other

    cs.DC

    GCAPS: GPU Context-Aware Preemptive Priority-based Scheduling for Real-Time Tasks

    Authors: Yidi Wang, Cong Liu, Daniel Wong, Hyoseung Kim

    Abstract: Scheduling real-time tasks that utilize GPUs with analyzable guarantees poses a significant challenge due to the intricate interaction between CPU and GPU resources, as well as the complex GPU hardware and software stack. While much research has been conducted in the real-time research community, several limitations persist, including the absence or limited availability of GPU-level preemption, ex… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Accepted by ECRTS 2024. arXiv admin note: substantial text overlap with arXiv:2401.16529

  32. arXiv:2406.03539  [pdf, other

    hep-ph astro-ph.CO astro-ph.GA

    Astrometric Search for Ultralight Dark Matter

    Authors: Hyungjin Kim

    Abstract: Precision astrometry offers a way to probe new physics. By measuring the angular position of light sources at unprecedented precision, astrometry could probe minuscule fluctuations of underlying spacetime. This work explores the possibility of probing ultralight dark matter candidates using precision astrometry. Through the coherent and stochastic density fluctuations over the scale of its wavelen… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 20 pages, 4 figures

    Report number: DESY-24-063

  33. arXiv:2406.03072  [pdf, other

    cs.LG cs.IT stat.ML

    Local to Global: Learning Dynamics and Effect of Initialization for Transformers

    Authors: Ashok Vardhan Makkuva, Marco Bondaschi, Chanakya Ekbote, Adway Girish, Alliot Nagle, Hyeji Kim, Michael Gastpar

    Abstract: In recent years, transformer-based models have revolutionized deep learning, particularly in sequence modeling. To better understand this phenomenon, there is a growing interest in using Markov input processes to study transformers. However, our current understanding in this regard remains limited with many fundamental questions about how transformers learn Markov chains still unanswered. In this… ▽ More

    Submitted 27 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

  34. arXiv:2406.02893  [pdf, other

    cs.CL

    Language Model Can Do Knowledge Tracing: Simple but Effective Method to Integrate Language Model and Knowledge Tracing Task

    Authors: Unggi Lee, Jiyeong Bae, Dohee Kim, Sookbun Lee, Jaekwon Park, Taekyung Ahn, Gunho Lee, Damji Stratton, Hyeoncheol Kim

    Abstract: Knowledge Tracing (KT) is a critical task in online learning for modeling student knowledge over time. Despite the success of deep learning-based KT models, which rely on sequences of numbers as data, most existing approaches fail to leverage the rich semantic information in the text of questions and concepts. This paper proposes Language model-based Knowledge Tracing (LKT), a novel framework that… ▽ More

    Submitted 9 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: 11 pages, 5 figures, 3 tables

  35. arXiv:2406.02596  [pdf, other

    cs.LG cs.AI

    Slow and Steady Wins the Race: Maintaining Plasticity with Hare and Tortoise Networks

    Authors: Hojoon Lee, Hyeonseo Cho, Hyunseung Kim, Donghu Kim, Dugki Min, Jaegul Choo, Clare Lyle

    Abstract: This study investigates the loss of generalization ability in neural networks, revisiting warm-starting experiments from Ash & Adams. Our empirical analysis reveals that common methods designed to enhance plasticity by maintaining trainability provide limited benefits to generalization. While reinitializing the network can be effective, it also risks losing valuable prior knowledge. To this end, w… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: accepted to ICML 2024

  36. arXiv:2406.02479  [pdf

    cs.LG eess.SP eess.SY

    Applying Fine-Tuned LLMs for Reducing Data Needs in Load Profile Analysis

    Authors: Yi Hu, Hyeonjin Kim, Kai Ye, Ning Lu

    Abstract: This paper presents a novel method for utilizing fine-tuned Large Language Models (LLMs) to minimize data requirements in load profile analysis, demonstrated through the restoration of missing data in power system load profiles. A two-stage fine-tuning strategy is proposed to adapt a pre-trained LLMs, i.e., GPT-3.5, for missing data restoration tasks. Through empirical evaluation, we demonstrate t… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  37. arXiv:2406.02369  [pdf

    stat.ME

    Identifying Sample Size and Accuracy and Precision of the Estimators in Case-Crossover Designs with Distributed Lags of Heteroskedastic Time-Varying Continuous Exposures Measured with Simple or Complex Error

    Authors: Honghyok Kim

    Abstract: Understanding of sample size, statistical power, and the accuracy and precision of the estimator in epidemiological research can facilitate power and bias analyses. However, such understanding can become complicated for several reasons. First, exposures varying spatiotemporally may be heteroskedastic. Second, distributed lags of exposures may be used to identify critical exposure time-windows. Thi… ▽ More

    Submitted 18 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: Submitted for peer-reviewed publication

  38. Measure-Observe-Remeasure: An Interactive Paradigm for Differentially-Private Exploratory Analysis

    Authors: Priyanka Nanayakkara, Hyeok Kim, Yifan Wu, Ali Sarvghad, Narges Mahyar, Gerome Miklau, Jessica Hullman

    Abstract: Differential privacy (DP) has the potential to enable privacy-preserving analysis on sensitive data, but requires analysts to judiciously spend a limited ``privacy loss budget'' $ε$ across queries. Analysts conducting exploratory analyses do not, however, know all queries in advance and seldom have DP expertise. Thus, they are limited in their ability to specify $ε$ allotments across queries prior… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Published in IEEE Symposium on Security and Privacy (SP) 2024

    Journal ref: in 2024 IEEE Symposium on Security and Privacy (SP), San Francisco, CA, USA, 2024 pp. 231-231

  39. arXiv:2406.01920  [pdf, other

    cs.CV cs.AI

    CODE: Contrasting Self-generated Description to Combat Hallucination in Large Multi-modal Models

    Authors: Junho Kim, Hyunjun Kim, Yeonju Kim, Yong Man Ro

    Abstract: Large Multi-modal Models (LMMs) have recently demonstrated remarkable abilities in visual context understanding and coherent response generation. However, alongside these advancements, the issue of hallucinations has emerged as a significant challenge, producing erroneous responses that are unrelated to the visual contents. In this paper, we introduce a novel contrastive-based decoding method, COu… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Project page: https://ivy-lvlm.github.io/CODE/

  40. arXiv:2406.00945  [pdf, other

    astro-ph.HE astro-ph.IM gr-qc

    General relativistic self-gravitating equilibrium disks around rotating neutron stars

    Authors: Yoonsoo Kim, Jinho Kim, Hee Il Kim, Hyung Mok Lee

    Abstract: In modeling a relativistic disk around a compact object, the self-gravity of the disk is often neglected while it needs to be incorporated for more accurate descriptions in several circumstances. Extending the Komatsu-Eriguchi-Hachisu self-consistent field method, we present numerical models of a rapidly rotating neutron star with a self-gravitating disk in stationary equilibrium. In particular, o… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: 15 pages, 12 figures

  41. arXiv:2406.00810  [pdf, other

    cs.CR

    Expanding the Attack Scenarios of SAE J1939: A Comprehensive Analysis of Established and Novel Vulnerabilities in Transport Protocol

    Authors: Hwejae Lee, Hyosun Lee, Saehee Jun, Huy Kang Kim

    Abstract: Following the enactment of the UN Regulation, substantial efforts have been directed toward implementing intrusion detection and prevention systems (IDPSs) and vulnerability analysis in Controller Area Network (CAN). However, Society of Automotive Engineers (SAE) J1939 protocol, despite its extensive application in camping cars and commercial vehicles, has seen limited vulnerability identification… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: 18 pages, 7 figures, 5 tables; This is the accepted version of ESCAR USA 2024

    MSC Class: 68M25 ACM Class: K.6.5

  42. arXiv:2406.00584  [pdf, other

    cs.DB cs.AI

    A Blueprint Architecture of Compound AI Systems for Enterprise

    Authors: Eser Kandogan, Sajjadur Rahman, Nikita Bhutani, Dan Zhang, Rafael Li Chen, Kushan Mitra, Sairam Gurajada, Pouya Pezeshkpour, Hayate Iso, Yanlin Feng, Hannah Kim, Chen Shen, Jin Wang, Estevam Hruschka

    Abstract: Large Language Models (LLMs) have showcased remarkable capabilities surpassing conventional NLP challenges, creating opportunities for use in production use cases. Towards this goal, there is a notable shift to building compound AI systems, wherein LLMs are integrated into an expansive software infrastructure with many components like models, retrievers, databases and tools. In this paper, we intr… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: Compound AI Systems Workshop at the Data+AI Summit 2024

  43. arXiv:2406.00324  [pdf, other

    cs.LG cs.AI

    Do's and Don'ts: Learning Desirable Skills with Instruction Videos

    Authors: Hyunseung Kim, Byungkun Lee, Hojoon Lee, Dongyoon Hwang, Donghu Kim, Jaegul Choo

    Abstract: Unsupervised skill discovery is a learning paradigm that aims to acquire diverse behaviors without explicit rewards. However, it faces challenges in learning complex behaviors and often leads to learning unsafe or undesirable behaviors. For instance, in various continuous control tasks, current unsupervised skill discovery methods succeed in learning basic locomotions like standing but struggle wi… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  44. arXiv:2406.00138  [pdf, ps, other

    hep-th math.QA math.RT

    Mirror Symmetry and Level-rank Duality for 3d $\mathcal{N} = 4$ Rank 0 SCFTs

    Authors: Thomas Creutzig, Niklas Garner, Heeyeon Kim

    Abstract: We introduce a family of 3d $\mathcal{N} = 4$ superconformal field theories that have zero-dimensional Coulomb and Higgs branches and propose that the rational vertex operator algebras $W^{\text{min}}_{k - \scriptstyle{\frac{1}{2}}}(\mathfrak{sp}_{2N})$ and $L_{k}(\mathfrak{osp}_{1|2N})$ model the modular tensor categories of line operators in their topological $A$ and $B$ twists, respectively. Ou… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

    Comments: 29 pages, 1 figure; comments welcome!

  45. arXiv:2406.00014  [pdf, other

    cs.DB cs.AI cs.CL cs.IR

    KU-DMIS at EHRSQL 2024:Generating SQL query via question templatization in EHR

    Authors: Hajung Kim, Chanhwi Kim, Hoonick Lee, Kyochul Jang, Jiwoo Lee, Kyungjae Lee, Gangwoo Kim, Jaewoo Kang

    Abstract: Transforming natural language questions into SQL queries is crucial for precise data retrieval from electronic health record (EHR) databases. A significant challenge in this process is detecting and rejecting unanswerable questions that request information beyond the database's scope or exceed the system's capabilities. In this paper, we introduce a novel text-to-SQL framework that robustly handle… ▽ More

    Submitted 19 June, 2024; v1 submitted 21 May, 2024; originally announced June 2024.

    Comments: Published at ClinicalNLP workshop @ NAACL 2024

  46. arXiv:2405.20574  [pdf, other

    cs.CL cs.AI

    Open Ko-LLM Leaderboard: Evaluating Large Language Models in Korean with Ko-H5 Benchmark

    Authors: Chanjun Park, Hyeonwoo Kim, Dahyun Kim, Seonghwan Cho, Sanghoon Kim, Sukyung Lee, Yungi Kim, Hwalsuk Lee

    Abstract: This paper introduces the Open Ko-LLM Leaderboard and the Ko-H5 Benchmark as vital tools for evaluating Large Language Models (LLMs) in Korean. Incorporating private test sets while mirroring the English Open LLM Leaderboard, we establish a robust evaluation framework that has been well integrated in the Korean LLM community. We perform data leakage analysis that shows the benefit of private test… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Accepted at ACL 2024 Main

  47. arXiv:2405.20448  [pdf, other

    cs.LG

    Knockout: A simple way to handle missing inputs

    Authors: Minh Nguyen, Batuhan K. Karaman, Heejong Kim, Alan Q. Wang, Fengbei Liu, Mert R. Sabuncu

    Abstract: Deep learning models can extract predictive and actionable information from complex inputs. The richer the inputs, the better these models usually perform. However, models that leverage rich inputs (e.g., multi-modality) can be difficult to deploy widely, because some inputs may be missing at inference. Current popular solutions to this problem include marginalization, imputation, and training mul… ▽ More

    Submitted 3 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

  48. arXiv:2405.19598  [pdf, other

    cs.CR

    Evaluating the Effectiveness and Robustness of Visual Similarity-based Phishing Detection Models

    Authors: Fujiao Ji, Kiho Lee, Hyungjoon Koo, Wenhao You, Euijin Choo, Hyoungshick Kim, Doowon Kim

    Abstract: Phishing attacks pose a significant threat to Internet users, with cybercriminals elaborately replicating the visual appearance of legitimate websites to deceive victims. Visual similarity-based detection systems have emerged as an effective countermeasure, but their effectiveness and robustness in real-world scenarios have been unexplored. In this paper, we comprehensively scrutinize and evaluate… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 12 pages

  49. arXiv:2405.18986  [pdf, other

    cs.LG q-bio.BM q-bio.QM

    Robust Optimization in Protein Fitness Landscapes Using Reinforcement Learning in Latent Space

    Authors: Minji Lee, Luiz Felipe Vecchietti, Hyunkyu Jung, Hyun Joo Ro, Meeyoung Cha, Ho Min Kim

    Abstract: Proteins are complex molecules responsible for different functions in nature. Enhancing the functionality of proteins and cellular fitness can significantly impact various industries. However, protein optimization using computational methods remains challenging, especially when starting from low-fitness sequences. We propose LatProtRL, an optimization method to efficiently traverse a latent space… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: ICML 2024

  50. arXiv:2405.18623  [pdf

    cs.HC

    I See You: Teacher Analytics with GPT-4 Vision-Powered Observational Assessment

    Authors: Unggi Lee, Yeil Jeong, Junbo Koh, Gyuri Byun, Yunseo Lee, Hyunwoong Lee, Seunmin Eun, Jewoong Moon, Cheolil Lim, Hyeoncheol Kim

    Abstract: This preliminary study explores the integration of GPT-4 Vision (GPT-4V) technology into teacher analytics, focusing on its applicability in observational assessment to enhance reflective teaching practice. This research is grounded in developing a Video-based Automatic Assessment System (VidAAS) empowered by GPT-4V. Our approach aims to revolutionize teachers' assessment of students' practices by… ▽ More

    Submitted 30 May, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: 27 pages, 5 figures, 4 tables