Skip to main content

Showing 1–50 of 170 results for author: Yoo, J

  1. arXiv:2406.10296  [pdf, other

    cs.CL cs.AI cs.CY

    CLST: Cold-Start Mitigation in Knowledge Tracing by Aligning a Generative Language Model as a Students' Knowledge Tracer

    Authors: Heeseok Jung, Jaesang Yoo, Yohaan Yoon, Yeonju Jang

    Abstract: Knowledge tracing (KT), wherein students' problem-solving histories are used to estimate their current levels of knowledge, has attracted significant interest from researchers. However, most existing KT models were developed with an ID-based paradigm, which exhibits limitations in cold-start performance. These limitations can be mitigated by leveraging the vast quantities of external knowledge pos… ▽ More

    Submitted 17 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

  2. arXiv:2406.09716  [pdf, ps, other

    cs.CR cs.AI cs.DC cs.LG

    Speed-up of Data Analysis with Kernel Trick in Encrypted Domain

    Authors: Joon Soo Yoo, Baek Kyung Song, Tae Min Ahn, Ji Won Heo, Ji Won Yoon

    Abstract: Homomorphic encryption (HE) is pivotal for secure computation on encrypted data, crucial in privacy-preserving data analysis. However, efficiently processing high-dimensional data in HE, especially for machine learning and statistical (ML/STAT) algorithms, poses a challenge. In this paper, we present an effective acceleration method using the kernel method for HE schemes, enhancing time performanc… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Submitted as a preprint

  3. arXiv:2406.04814  [pdf, other

    cs.CV cs.LG

    Online Continual Learning of Video Diffusion Models From a Single Video Stream

    Authors: Jason Yoo, Dylan Green, Geoff Pleiss, Frank Wood

    Abstract: Diffusion models have shown exceptional capabilities in generating realistic videos. Yet, their training has been predominantly confined to offline environments where models can repeatedly train on i.i.d. data to convergence. This work explores the feasibility of training diffusion models from a semantically continuous video stream, where correlated video frames sequentially arrive one at a time.… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  4. arXiv:2406.01045  [pdf, other

    cs.CL cs.AI

    Decompose, Enrich, and Extract! Schema-aware Event Extraction using LLMs

    Authors: Fatemeh Shiri, Van Nguyen, Farhad Moghimifar, John Yoo, Gholamreza Haffari, Yuan-Fang Li

    Abstract: Large Language Models (LLMs) demonstrate significant capabilities in processing natural language data, promising efficient knowledge extraction from diverse textual sources to enhance situational awareness and support decision-making. However, concerns arise due to their susceptibility to hallucination, resulting in contextually inaccurate content. This work focuses on harnessing LLMs for automate… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  5. arXiv:2405.11614  [pdf, other

    cs.CV eess.IV

    Nickel and Diming Your GAN: A Dual-Method Approach to Enhancing GAN Efficiency via Knowledge Distillation

    Authors: Sangyeop Yeo, Yoojin Jang, Jaejun Yoo

    Abstract: In this paper, we address the challenge of compressing generative adversarial networks (GANs) for deployment in resource-constrained environments by proposing two novel methodologies: Distribution Matching for Efficient compression (DiME) and Network Interactive Compression via Knowledge Exchange and Learning (NICKEL). DiME employs foundation models as embedding kernels for efficient distribution… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

  6. arXiv:2405.00646  [pdf, other

    cs.CV cs.LG

    Learning to Compose: Improving Object Centric Learning by Injecting Compositionality

    Authors: Whie Jung, Jaehoon Yoo, Sungjin Ahn, Seunghoon Hong

    Abstract: Learning compositional representation is a key aspect of object-centric learning as it enables flexible systematic generalization and supports complex visual reasoning. However, most of the existing approaches rely on auto-encoding objective, while the compositionality is implicitly imposed by the architectural or algorithmic bias in the encoder. This misalignment between auto-encoding objective a… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  7. arXiv:2404.18066  [pdf, other

    cs.NE cs.AI cs.AR cs.CV q-bio.NC

    Quantized Context Based LIF Neurons for Recurrent Spiking Neural Networks in 45nm

    Authors: Sai Sukruth Bezugam, Yihao Wu, JaeBum Yoo, Dmitri Strukov, Bongjin Kim

    Abstract: In this study, we propose the first hardware implementation of a context-based recurrent spiking neural network (RSNN) emphasizing on integrating dual information streams within the neocortical pyramidal neurons specifically Context- Dependent Leaky Integrate and Fire (CLIF) neuron models, essential element in RSNN. We present a quantized version of the CLIF neuron (qCLIF), developed through a har… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: 7 Pages, 7 Figures, 2 Tables

  8. arXiv:2404.15333  [pdf, other

    eess.SP cs.LG

    EB-GAME: A Game-Changer in ECG Heartbeat Anomaly Detection

    Authors: JuneYoung Park, Da Young Kim, Yunsoo Kim, Jisu Yoo, Tae Joon Kim

    Abstract: Cardiologists use electrocardiograms (ECG) for the detection of arrhythmias. However, continuous monitoring of ECG signals to detect cardiac abnormal-ities requires significant time and human resources. As a result, several deep learning studies have been conducted in advance for the automatic detection of arrhythmia. These models show relatively high performance in supervised learning, but are no… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  9. arXiv:2404.09161  [pdf, other

    cs.CV cs.LG

    Coreset Selection for Object Detection

    Authors: Hojun Lee, Suyoung Kim, Junhoo Lee, Jaeyoung Yoo, Nojun Kwak

    Abstract: Coreset selection is a method for selecting a small, representative subset of an entire dataset. It has been primarily researched in image classification, assuming there is only one object per image. However, coreset selection for object detection is more challenging as an image can contain multiple objects. As a result, much research has yet to be done on this topic. Therefore, we introduce a new… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: Accepted by CVPR 2024: 1st Workshop on Dataset Distillation for Computer Vision

  10. arXiv:2404.03155  [pdf, other

    cs.ET

    TEGRA -- Scaling Up Terascale Graph Processing with Disaggregated Computing

    Authors: William Shaddix, Mahyar Samani, Marjan Fariborz, S. J. Ben Yoo, Jason Lowe-Power, Venkatesh Akella

    Abstract: Graphs are essential for representing relationships in various domains, driving modern AI applications such as graph analytics and neural networks across science, engineering, cybersecurity, transportation, and economics. However, the size of modern graphs are rapidly expanding, posing challenges for traditional CPUs and GPUs in meeting real-time processing demands. As a result, hardware accelerat… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: Presented at the 3rd Workshop on Heterogeneous Composable and Disaggregated Systems (HCDS 2024)

  11. arXiv:2404.02865  [pdf, other

    cs.LG

    End-To-End Self-tuning Self-supervised Time Series Anomaly Detection

    Authors: Boje Deforce, Meng-Chieh Lee, Bart Baesens, Estefanía Serral Asensio, Jaemin Yoo, Leman Akoglu

    Abstract: Time series anomaly detection (TSAD) finds many applications such as monitoring environmental sensors, industry KPIs, patient biomarkers, etc. A two-fold challenge for TSAD is a versatile and unsupervised model that can detect various different types of time series anomalies (spikes, discontinuities, trend shifts, etc.) without any labeled data. Modern neural networks have outstanding ability in m… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

  12. arXiv:2404.01690  [pdf, other

    cs.CV

    RefQSR: Reference-based Quantization for Image Super-Resolution Networks

    Authors: Hongjae Lee, Jun-Sang Yoo, Seung-Won Jung

    Abstract: Single image super-resolution (SISR) aims to reconstruct a high-resolution image from its low-resolution observation. Recent deep learning-based SISR models show high performance at the expense of increased computational costs, limiting their use in resource-constrained environments. As a promising solution for computationally efficient network design, network quantization has been extensively stu… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted by IEEE Transactions on Image Processing (TIP)

  13. arXiv:2404.00995  [pdf, other

    cs.CV

    PosterLlama: Bridging Design Ability of Langauge Model to Contents-Aware Layout Generation

    Authors: Jaejung Seol, Seojun Kim, Jaejun Yoo

    Abstract: Visual layout plays a critical role in graphic design fields such as advertising, posters, and web UI design. The recent trend towards content-aware layout generation through generative models has shown promise, yet it often overlooks the semantic intricacies of layout design by treating it as a simple numerical optimization. To bridge this gap, we introduce PosterLlama, a network designed for gen… ▽ More

    Submitted 2 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

  14. arXiv:2404.00921  [pdf, other

    cs.CV

    Towards Label-Efficient Human Matting: A Simple Baseline for Weakly Semi-Supervised Trimap-Free Human Matting

    Authors: Beomyoung Kim, Myeong Yeon Yi, Joonsang Yu, Young Joon Yoo, Sung Ju Hwang

    Abstract: This paper presents a new practical training method for human matting, which demands delicate pixel-level human region identification and significantly laborious annotations. To reduce the annotation cost, most existing matting approaches often rely on image synthesis to augment the dataset. However, the unnaturalness of synthesized training images brings in a new domain generalization challenge f… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: Preprint, 15 pages, 13 figures

  15. arXiv:2404.00638  [pdf, other

    cs.LG

    HypeBoy: Generative Self-Supervised Representation Learning on Hypergraphs

    Authors: Sunwoo Kim, Shinhwan Kang, Fanchen Bu, Soo Yong Lee, Jaemin Yoo, Kijung Shin

    Abstract: Hypergraphs are marked by complex topology, expressing higher-order interactions among multiple nodes with hyperedges, and better capturing the topology is essential for effective representation learning. Recent advances in generative self-supervised learning (SSL) suggest that hypergraph neural networks learned from generative self supervision have the potential to effectively encode the complex… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: Published as a conference paper at ICLR 2024

  16. arXiv:2403.19724  [pdf

    cs.ET cs.NE physics.optics

    Towards Reverse-Engineering the Brain: Brain-Derived Neuromorphic Computing Approach with Photonic, Electronic, and Ionic Dynamicity in 3D integrated circuits

    Authors: S. J. Ben Yoo, Luis El-Srouji, Suman Datta, Shimeng Yu, Jean Anne Incorvia, Alberto Salleo, Volker Sorger, Juejun Hu, Lionel C Kimerling, Kristofer Bouchard, Joy Geng, Rishidev Chaudhuri, Charan Ranganath, Randall O'Reilly

    Abstract: The human brain has immense learning capabilities at extreme energy efficiencies and scale that no artificial system has been able to match. For decades, reverse engineering the brain has been one of the top priorities of science and technology research. Despite numerous efforts, conventional electronics-based methods have failed to match the scalability, energy efficiency, and self-supervised lea… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: 15 pages, 12 figures

  17. arXiv:2403.15227  [pdf, other

    cs.CV cs.GR

    LeGO: Leveraging a Surface Deformation Network for Animatable Stylized Face Generation with One Example

    Authors: Soyeon Yoon, Kwan Yun, Kwanggyoon Seo, Sihun Cha, Jung Eun Yoo, Junyong Noh

    Abstract: Recent advances in 3D face stylization have made significant strides in few to zero-shot settings. However, the degree of stylization achieved by existing methods is often not sufficient for practical applications because they are mostly based on statistical 3D Morphable Models (3DMM) with limited variations. To this end, we propose a method that can produce a highly stylized 3D face model with de… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 8 pages

    MSC Class: 68T45 ACM Class: I.4.9

  18. arXiv:2403.10906  [pdf, other

    cs.CV

    HourglassNeRF: Casting an Hourglass as a Bundle of Rays for Few-shot Neural Rendering

    Authors: Seunghyeon Seo, Yeonjin Chang, Jayeon Yoo, Seungwoo Lee, Hojun Lee, Nojun Kwak

    Abstract: Recent advancements in the Neural Radiance Field (NeRF) have bolstered its capabilities for novel view synthesis, yet its reliance on dense multi-view training images poses a practical challenge. Addressing this, we propose HourglassNeRF, an effective regularization-based approach with a novel hourglass casting strategy. Our proposed hourglass is conceptualized as a bundle of additional rays withi… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: 21 pages, 11 figures

  19. arXiv:2403.09675  [pdf, other

    cs.CV cs.GR

    Open-Universe Indoor Scene Generation using LLM Program Synthesis and Uncurated Object Databases

    Authors: Rio Aguina-Kang, Maxim Gumin, Do Heon Han, Stewart Morris, Seung Jean Yoo, Aditya Ganeshan, R. Kenny Jones, Qiuhong Anna Wei, Kailiang Fu, Daniel Ritchie

    Abstract: We present a system for generating indoor scenes in response to text prompts. The prompts are not limited to a fixed vocabulary of scene descriptions, and the objects in generated scenes are not restricted to a fixed set of object categories -- we call this setting indoor scene generation. Unlike most prior work on indoor scene generation, our system does not require a large training dataset of ex… ▽ More

    Submitted 4 February, 2024; originally announced March 2024.

    Comments: See ancillary files for link to supplemental material

  20. arXiv:2403.09669  [pdf, other

    cs.CV cs.AI

    STREAM: Spatio-TempoRal Evaluation and Analysis Metric for Video Generative Models

    Authors: Pum Jun Kim, Seojun Kim, Jaejun Yoo

    Abstract: Image generative models have made significant progress in generating realistic and diverse images, supported by comprehensive guidance from various evaluation metrics. However, current video generative models struggle to generate even short video clips, with limited tools that provide insights for improvements. Current video evaluation metrics are simple adaptations of image metrics by switching t… ▽ More

    Submitted 28 March, 2024; v1 submitted 30 January, 2024; originally announced March 2024.

    Comments: Our work is accepted to ICLR 2024

  21. arXiv:2403.01663  [pdf, other

    cs.CV

    PillarGen: Enhancing Radar Point Cloud Density and Quality via Pillar-based Point Generation Network

    Authors: Jisong Kim, Geonho Bang, Kwangjin Choi, Minjae Seong, Jaechang Yoo, Eunjong Pyo, Jun Won Choi

    Abstract: In this paper, we present a novel point generation model, referred to as Pillar-based Point Generation Network (PillarGen), which facilitates the transformation of point clouds from one domain into another. PillarGen can produce synthetic point clouds with enhanced density and quality based on the provided input point clouds. The PillarGen model performs the following three steps: 1) pillar encodi… ▽ More

    Submitted 8 March, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

    Comments: Accepted by IEEE International Conference on Robotics and Automation (ICRA 2024), 8 pages, 3 figures

  22. arXiv:2403.01344  [pdf, other

    cs.LG cs.CV

    Mitigating the Bias in the Model for Continual Test-Time Adaptation

    Authors: Inseop Chung, Kyomin Hwang, Jayeon Yoo, Nojun Kwak

    Abstract: Continual Test-Time Adaptation (CTA) is a challenging task that aims to adapt a source pre-trained model to continually changing target domains. In the CTA setting, a model does not know when the target domain changes, thus facing a drastic change in the distribution of streaming inputs during the test-time. The key challenge is to keep adapting the model to the continually changing target domains… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

  23. arXiv:2402.13729  [pdf, other

    cs.CV

    Hybrid Video Diffusion Models with 2D Triplane and 3D Wavelet Representation

    Authors: Kihong Kim, Haneol Lee, Jihye Park, Seyeon Kim, Kwanghee Lee, Seungryong Kim, Jaejun Yoo

    Abstract: Generating high-quality videos that synthesize desired realistic content is a challenging task due to their intricate high-dimensionality and complexity of videos. Several recent diffusion-based methods have shown comparable performance by compressing videos to a lower-dimensional latent space, using traditional video autoencoder architecture. However, such method that employ standard frame-wise 2… ▽ More

    Submitted 3 April, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: Project page is available at https://hxngiee.github.io/HVDM/

  24. arXiv:2402.11201  [pdf, other

    cs.CV

    A Decoding Scheme with Successive Aggregation of Multi-Level Features for Light-Weight Semantic Segmentation

    Authors: Jiwon Yoo, Jangwon Lee, Gyeonghwan Kim

    Abstract: Multi-scale architecture, including hierarchical vision transformer, has been commonly applied to high-resolution semantic segmentation to deal with computational complexity with minimum performance loss. In this paper, we propose a novel decoding scheme for semantic segmentation in this regard, which takes multi-level features from the encoder with multi-scale architecture. The decoding scheme ba… ▽ More

    Submitted 14 June, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

    Comments: 7 pages, 4 figures, ICIP2024 Accepted paper

  25. arXiv:2402.09542  [pdf, other

    cs.LG

    Layerwise Proximal Replay: A Proximal Point Method for Online Continual Learning

    Authors: Jason Yoo, Yunpeng Liu, Frank Wood, Geoff Pleiss

    Abstract: In online continual learning, a neural network incrementally learns from a non-i.i.d. data stream. Nearly all online continual learning methods employ experience replay to simultaneously prevent catastrophic forgetting and underfitting on past data. Our work demonstrates a limitation of this approach: neural networks trained with experience replay tend to have unstable optimization trajectories, i… ▽ More

    Submitted 7 June, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  26. arXiv:2402.08138  [pdf, other

    cs.CV

    H2O-SDF: Two-phase Learning for 3D Indoor Reconstruction using Object Surface Fields

    Authors: Minyoung Park, Mirae Do, YeonJae Shin, Jaeseok Yoo, Jongkwang Hong, Joongrock Kim, Chul Lee

    Abstract: Advanced techniques using Neural Radiance Fields (NeRF), Signed Distance Fields (SDF), and Occupancy Fields have recently emerged as solutions for 3D indoor scene reconstruction. We introduce a novel two-phase learning approach, H2O-SDF, that discriminates between object and non-object regions within indoor environments. This method achieves a nuanced balance, carefully preserving the geometric in… ▽ More

    Submitted 8 March, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

  27. arXiv:2402.04621  [pdf, other

    cs.LG

    Feature Distribution on Graph Topology Mediates the Effect of Graph Convolution: Homophily Perspective

    Authors: Soo Yong Lee, Sunwoo Kim, Fanchen Bu, Jaemin Yoo, Jiliang Tang, Kijung Shin

    Abstract: How would randomly shuffling feature vectors among nodes from the same class affect graph neural networks (GNNs)? The feature shuffle, intuitively, perturbs the dependence between graph topology and features (A-X dependence) for GNNs to learn from. Surprisingly, we observe a consistent and significant improvement in GNN performance following the feature shuffle. Having overlooked the impact of A-X… ▽ More

    Submitted 6 June, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: published in ICML 2024

  28. Bridging the Domain Gap: A Simple Domain Matching Method for Reference-based Image Super-Resolution in Remote Sensing

    Authors: Jeongho Min, Yejun Lee, Dongyoung Kim, Jaejun Yoo

    Abstract: Recently, reference-based image super-resolution (RefSR) has shown excellent performance in image super-resolution (SR) tasks. The main idea of RefSR is to utilize additional information from the reference (Ref) image to recover the high-frequency components in low-resolution (LR) images. By transferring relevant textures through feature matching, RefSR models outperform existing single image supe… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: Accepted to IEEE GRSL 2023

    Report number: Article Sequence Number: 8000105, Print ISSN: 1545-598X

    Journal ref: Volume: 21, Year: 2023, Page: 1-5

  29. arXiv:2312.08875  [pdf, other

    cs.CV

    What, How, and When Should Object Detectors Update in Continually Changing Test Domains?

    Authors: Jayeon Yoo, Dongkwan Lee, Inseop Chung, Donghyun Kim, Nojun Kwak

    Abstract: It is a well-known fact that the performance of deep learning models deteriorates when they encounter a distribution shift at test time. Test-time adaptation (TTA) algorithms have been proposed to adapt the model online while inferring test data. However, existing research predominantly focuses on classification tasks through the optimization of batch normalization layers or classification heads,… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  30. arXiv:2312.07266  [pdf, other

    cs.CV

    ProxyDet: Synthesizing Proxy Novel Classes via Classwise Mixup for Open-Vocabulary Object Detection

    Authors: Joonhyun Jeong, Geondo Park, Jayeon Yoo, Hyungsik Jung, Heesu Kim

    Abstract: Open-vocabulary object detection (OVOD) aims to recognize novel objects whose categories are not included in the training set. In order to classify these unseen classes during training, many OVOD frameworks leverage the zero-shot capability of largely pretrained vision and language models, such as CLIP. To further improve generalization on the unseen novel classes, several approaches proposed to a… ▽ More

    Submitted 20 February, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: Accepted in AAAI24. Code: https://github.com/clovaai/ProxyDet Project page: https://proxydet.github.io

  31. arXiv:2311.14496  [pdf, other

    cs.CR

    RTPS Attack Dataset Description

    Authors: Dong Young Kim, Dongsung Kim, Yuchan Song, Gang Min Kim, Min Geun Song, Jeong Do Yoo, Huy Kang Kim

    Abstract: This paper explains all about our RTPS datasets. We collect malicious/benign packet data by injecting attack data in an Unmanned Ground Vehicle (UGV) in the normal state. We assembled the testbed, consisting of UGV, Controller, PC, and Router. We collect this dataset in the UGV part of our testbed. We conducted two types of attack "Command Injection" and "Command Injection with ARP Spoofing" on… ▽ More

    Submitted 2 April, 2024; v1 submitted 24 November, 2023; originally announced November 2023.

    Comments: This manuscript is written in Korean. You can download our dataset through our lab: https://ocslab.hksecurity.net/Datasets/rtps-attack-dataset We welcome your comments or feedback. Contact INFO: Dong Young Kim (klgh1256@korea.ac.kr), Huy Kang Kim (cenda@korea.ac.kr)

  32. arXiv:2311.14342  [pdf, other

    cs.CR

    AI-based Attack Graph Generation

    Authors: Sangbeom Park, Jaesung Lee, Jeong Do Yoo, Min Geun Song, Hyosun Lee, Jaewoong Choi, Chaeyeon Sagong, Huy Kang Kim

    Abstract: With the advancement of IoT technology, many electronic devices are interconnected through networks, communicating with each other and performing specific roles. However, as numerous devices join networks, the threat of cyberattacks also escalates. Preventing and detecting cyber threats are crucial, and one method of preventing such threats involves using attack graphs. Attack graphs are widely us… ▽ More

    Submitted 27 November, 2023; v1 submitted 24 November, 2023; originally announced November 2023.

    Comments: in Korean Language, 8 Figures, 14 Pages

  33. arXiv:2311.14327  [pdf, other

    cs.CR

    C-ITS Environment Modeling and Attack Modeling

    Authors: Jaewoong Choi, Min Geun Song, Hyosun Lee, Chaeyeon Sagong, Sangbeom Park, Jaesung Lee, Jeong Do Yoo, Huy Kang Kim

    Abstract: As technology advances, cities are evolving into smart cities, with the ability to process large amounts of data and the increasing complexity and diversification of various elements within urban areas. Among the core systems of a smart city is the Cooperative-Intelligent Transport Systems (C-ITS). C-ITS is a system where vehicles provide real-time information to drivers about surrounding traffic… ▽ More

    Submitted 27 November, 2023; v1 submitted 24 November, 2023; originally announced November 2023.

    Comments: in Korean Language, 14 Figures, 15 Pages

  34. arXiv:2310.08745  [pdf, other

    cs.RO cs.CV

    AcTExplore: Active Tactile Exploration of Unknown Objects

    Authors: Amir-Hossein Shahidzadeh, Seong Jong Yoo, Pavan Mantripragada, Chahat Deep Singh, Cornelia Fermüller, Yiannis Aloimonos

    Abstract: Tactile exploration plays a crucial role in understanding object structures for fundamental robotics tasks such as grasping and manipulation. However, efficiently exploring such objects using tactile sensors is challenging, primarily due to the large-scale unknown environments and limited sensing coverage of these sensors. To this end, we present AcTExplore, an active tactile exploration method dr… ▽ More

    Submitted 20 June, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: 8 pages, 6 figures, Accepted to ICRA 2024

  35. arXiv:2310.02751  [pdf, other

    cs.LG cs.CV

    SHOT: Suppressing the Hessian along the Optimization Trajectory for Gradient-Based Meta-Learning

    Authors: JunHoo Lee, Jayeon Yoo, Nojun Kwak

    Abstract: In this paper, we hypothesize that gradient-based meta-learning (GBML) implicitly suppresses the Hessian along the optimization trajectory in the inner loop. Based on this hypothesis, we introduce an algorithm called SHOT (Suppressing the Hessian along the Optimization Trajectory) that minimizes the distance between the parameters of the target and reference models to suppress the Hessian in the i… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

  36. arXiv:2309.01950  [pdf, other

    cs.CV cs.AI cs.LG cs.SD eess.AS

    RADIO: Reference-Agnostic Dubbing Video Synthesis

    Authors: Dongyeun Lee, Chaewon Kim, Sangjoon Yu, Jaejun Yoo, Gyeong-Moon Park

    Abstract: One of the most challenging problems in audio-driven talking head generation is achieving high-fidelity detail while ensuring precise synchronization. Given only a single reference image, extracting meaningful identity attributes becomes even more challenging, often causing the network to mirror the facial and lip structures too closely. To address these issues, we introduce RADIO, a framework eng… ▽ More

    Submitted 6 November, 2023; v1 submitted 5 September, 2023; originally announced September 2023.

    Comments: Accepted by WACV 2024

  37. arXiv:2309.00349  [pdf

    physics.chem-ph cs.LG

    Bespoke Nanoparticle Synthesis and Chemical Knowledge Discovery Via Autonomous Experimentations

    Authors: Hyuk Jun Yoo, Nayeon Kim, Heeseung Lee, Daeho Kim, Leslie Tiong Ching Ow, Hyobin Nam, Chansoo Kim, Seung Yong Lee, Kwan-Young Lee, Donghun Kim, Sang Soo Han

    Abstract: The optimization of nanomaterial synthesis using numerous synthetic variables is considered to be extremely laborious task because the conventional combinatorial explorations are prohibitively expensive. In this work, we report an autonomous experimentation platform developed for the bespoke design of nanoparticles (NPs) with targeted optical properties. This platform operates in a closed-loop man… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

  38. arXiv:2308.14380  [pdf, other

    cs.LG

    Self-Supervision for Tackling Unsupervised Anomaly Detection: Pitfalls and Opportunities

    Authors: Leman Akoglu, Jaemin Yoo

    Abstract: Self-supervised learning (SSL) is a growing torrent that has recently transformed machine learning and its many real world applications, by learning on massive amounts of unlabeled data via self-generated supervisory signals. Unsupervised anomaly detection (AD) has also capitalized on SSL, by self-generating pseudo-anomalies through various data augmentation functions or external data exposure. In… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

  39. arXiv:2308.11568  [pdf, other

    cs.CV

    SPANet: Frequency-balancing Token Mixer using Spectral Pooling Aggregation Modulation

    Authors: Guhnoo Yun, Juhan Yoo, Kijung Kim, Jeongho Lee, Dong Hwan Kim

    Abstract: Recent studies show that self-attentions behave like low-pass filters (as opposed to convolutions) and enhancing their high-pass filtering capability improves model performance. Contrary to this idea, we investigate existing convolution-based models with spectral analysis and observe that improving the low-pass filtering in convolution operations also leads to performance improvement. To account f… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

    Comments: Accepted paper at ICCV 2023

  40. arXiv:2307.11242  [pdf, other

    cs.NE cs.AI cs.LG

    On-Sensor Data Filtering using Neuromorphic Computing for High Energy Physics Experiments

    Authors: Shruti R. Kulkarni, Aaron Young, Prasanna Date, Narasinga Rao Miniskar, Jeffrey S. Vetter, Farah Fahim, Benjamin Parpillon, Jennet Dickinson, Nhan Tran, Jieun Yoo, Corrinne Mills, Morris Swartz, Petar Maksimovic, Catherine D. Schuman, Alice Bean

    Abstract: This work describes the investigation of neuromorphic computing-based spiking neural network (SNN) models used to filter data from sensor electronics in high energy physics experiments conducted at the High Luminosity Large Hadron Collider. We present our approach for developing a compact neuromorphic model that filters out the sensor data based on the particle's transverse momentum with the goal… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

    Comments: Manuscript accepted at ICONS'23

  41. arXiv:2307.08263  [pdf, other

    cs.CV

    Hierarchical Spatiotemporal Transformers for Video Object Segmentation

    Authors: Jun-Sang Yoo, Hongjae Lee, Seung-Won Jung

    Abstract: This paper presents a novel framework called HST for semi-supervised video object segmentation (VOS). HST extracts image and video features using the latest Swin Transformer and Video Swin Transformer to inherit their inductive bias for the spatiotemporal locality, which is essential for temporally coherent VOS. To take full advantage of the image and video features, HST casts image and video feat… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

  42. arXiv:2307.06534  [pdf, other

    cs.LG

    DSV: An Alignment Validation Loss for Self-supervised Outlier Model Selection

    Authors: Jaemin Yoo, Yue Zhao, Lingxiao Zhao, Leman Akoglu

    Abstract: Self-supervised learning (SSL) has proven effective in solving various problems by generating internal supervisory signals. Unsupervised anomaly detection, which faces the high cost of obtaining true labels, is an area that can greatly benefit from SSL. However, recent literature suggests that tuning the hyperparameters (HP) of data augmentation functions is crucial to the success of SSL-based ano… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: Accepted to ECML PKDD 2023

  43. arXiv:2306.13637  [pdf, other

    math.OC cs.LG

    Constrained optimization of sensor placement for nuclear digital twins

    Authors: Niharika Karnik, Mohammad G. Abdo, Carlos E. Estrada Perez, Jun Soo Yoo, Joshua J. Cogliati, Richard S. Skifton, Pattrick Calderoni, Steven L. Brunton, Krithika Manohar

    Abstract: The deployment of extensive sensor arrays in nuclear reactors is infeasible due to challenging operating conditions and inherent spatial limitations. Strategically placing sensors within defined spatial constraints is essential for the reconstruction of reactor flow fields and the creation of nuclear digital twins. We develop a data-driven technique that incorporates constraints into an optimizati… ▽ More

    Submitted 16 February, 2024; v1 submitted 23 June, 2023; originally announced June 2023.

  44. arXiv:2306.12033  [pdf, other

    cs.LG cs.CV

    End-to-End Augmentation Hyperparameter Tuning for Self-Supervised Anomaly Detection

    Authors: Jaemin Yoo, Lingxiao Zhao, Leman Akoglu

    Abstract: Self-supervised learning (SSL) has emerged as a promising paradigm that presents self-generated supervisory signals to real-world problems, bypassing the extensive manual labeling burden. SSL is especially attractive for unsupervised tasks such as anomaly detection, where labeled anomalies are often nonexistent and costly to obtain. While self-supervised anomaly detection (SSAD) has seen a recent… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

  45. arXiv:2306.11572  [pdf

    cs.ET cond-mat.other physics.app-ph

    Energy-efficient superparamagnetic Ising machine and its application to traveling salesman problems

    Authors: Jia Si, Shuhan Yang, Yunuo Cen, Jiaer Chen, Zhaoyang Yao, Dong-Jun Kim, Kaiming Cai, Jerald Yoo, Xuanyao Fong, Hyunsoo Yang

    Abstract: The growth of artificial intelligence and IoT has created a significant computational load for solving non-deterministic polynomial-time (NP)-hard problems, which are difficult to solve using conventional computers. The Ising computer, based on the Ising model and annealing process, has been highly sought for finding approximate solutions to NP-hard problems by observing the convergence of dynamic… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: 5 figures

  46. arXiv:2306.08013  [pdf, other

    cs.LG cs.AI cs.CV

    TopP&R: Robust Support Estimation Approach for Evaluating Fidelity and Diversity in Generative Models

    Authors: Pum Jun Kim, Yoojin Jang, Jisu Kim, Jaejun Yoo

    Abstract: We propose a robust and reliable evaluation metric for generative models by introducing topological and statistical treatments for rigorous support estimation. Existing metrics, such as Inception Score (IS), Frechet Inception Distance (FID), and the variants of Precision and Recall (P&R), heavily rely on supports that are estimated from sample features. However, the reliability of their estimation… ▽ More

    Submitted 24 January, 2024; v1 submitted 13 June, 2023; originally announced June 2023.

    Comments: Accepted to NeurIPS 2023

  47. Dual policy as self-model for planning

    Authors: Jaesung Yoo, Fernanda de la Torre, Guangyu Robert Yang

    Abstract: Planning is a data efficient decision-making strategy where an agent selects candidate actions by exploring possible future states. To simulate future states when there is a high-dimensional action space, the knowledge of one's decision making strategy must be used to limit the number of actions to be explored. We refer to the model used to simulate one's decisions as the agent's self-model. While… ▽ More

    Submitted 11 June, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

  48. Classification of Edge-dependent Labels of Nodes in Hypergraphs

    Authors: Minyoung Choe, Sunwoo Kim, Jaemin Yoo, Kijung Shin

    Abstract: A hypergraph is a data structure composed of nodes and hyperedges, where each hyperedge is an any-sized subset of nodes. Due to the flexibility in hyperedge size, hypergraphs represent group interactions (e.g., co-authorship by more than two authors) more naturally and accurately than ordinary graphs. Interestingly, many real-world systems modeled as hypergraphs contain edge-dependent node labels,… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted to KDD 2023

  49. arXiv:2306.02376  [pdf, other

    cs.LG cs.AI

    Towards Deep Attention in Graph Neural Networks: Problems and Remedies

    Authors: Soo Yong Lee, Fanchen Bu, Jaemin Yoo, Kijung Shin

    Abstract: Graph neural networks (GNNs) learn the representation of graph-structured data, and their expressiveness can be further enhanced by inferring node relations for propagation. Attention-based GNNs infer neighbor importance to manipulate the weight of its propagation. Despite their popularity, the discussion on deep graph attention and its unique challenges has been limited. In this work, we investig… ▽ More

    Submitted 4 June, 2023; originally announced June 2023.

    Comments: 22 pages, 6 figures, conference paper, published in International Conference on Machine Learning. PMLR, 2023

  50. How Transitive Are Real-World Group Interactions? -- Measurement and Reproduction

    Authors: Sunwoo Kim, Fanchen Bu, Minyoung Choe, Jaemin Yoo, Kijung Shin

    Abstract: Many real-world interactions (e.g., researcher collaborations and email communication) occur among multiple entities. These group interactions are naturally modeled as hypergraphs. In graphs, transitivity is helpful to understand the connections between node pairs sharing a neighbor, and it has extensive applications in various domains. Hypergraphs, an extension of graphs, are designed to represen… ▽ More

    Submitted 25 October, 2023; v1 submitted 4 June, 2023; originally announced June 2023.

    Comments: Published in KDD 2023. 12 pages, 7 figures, and 11 tables