Skip to main content

Showing 1–50 of 128 results for author: Jo, H

  1. arXiv:2406.16275  [pdf, other

    cs.CL

    Investigating the Influence of Prompt-Specific Shortcuts in AI Generated Text Detection

    Authors: Choonghyun Park, Hyuhng Joon Kim, Junyeob Kim, Youna Kim, Taeuk Kim, Hyunsoo Cho, Hwiyeol Jo, Sang-goo Lee, Kang Min Yoo

    Abstract: AI Generated Text (AIGT) detectors are developed with texts from humans and LLMs of common tasks. Despite the diversity of plausible prompt choices, these datasets are generally constructed with a limited number of prompts. The lack of prompt variation can introduce prompt-specific shortcut features that exist in data collected with the chosen prompt, but do not generalize to others. In this paper… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: 19 pages, 3 figures, 13 tables, under review

  2. arXiv:2406.13342  [pdf, other

    cs.CL cs.AI

    ZeroDL: Zero-shot Distribution Learning for Text Clustering via Large Language Models

    Authors: Hwiyeol Jo, Hyunwoo Lee, Taiwoo Park

    Abstract: The recent advancements in large language models (LLMs) have brought significant progress in solving NLP tasks. Notably, in-context learning (ICL) is the key enabling mechanism for LLMs to understand specific tasks and grasping nuances. In this paper, we propose a simple yet effective method to contextualize a task toward a specific LLM, by (1) observing how a given LLM describes (all or a part of… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: ARR Submitted

  3. arXiv:2406.07006  [pdf, other

    cs.CV

    MIPI 2024 Challenge on Few-shot RAW Image Denoising: Methods and Results

    Authors: Xin Jin, Chunle Guo, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yuekun Dai, Peiqing Yang, Chen Change Loy, Ruoqi Li, Chang Liu, Ziyi Wang, Yao Du, Jingjing Yang, Long Bao, Heng Sun, Xiangyu Kong, Xiaoxia Xing, Jinlong Wu, Yuanyang Xue, Hyunhee Park, Sejun Song, Changho Kim, Jingfan Tan , et al. (17 additional authors not shown)

    Abstract: The increasing demand for computational photography and imaging on mobile platforms has led to the widespread development and integration of advanced image sensors with novel algorithms in camera systems. However, the scarcity of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photogra… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: CVPR 2024 Mobile Intelligent Photography and Imaging (MIPI) Workshop--Few-shot RAWImage Denoising Challenge Report. Website: https://mipi-challenge.org/MIPI2024/

  4. arXiv:2406.02657  [pdf, other

    cs.CL cs.AI cs.LG

    Block Transformer: Global-to-Local Language Modeling for Fast Inference

    Authors: Namgyu Ho, Sangmin Bae, Taehyeon Kim, Hyunjik Jo, Yireun Kim, Tal Schuster, Adam Fisch, James Thorne, Se-Young Yun

    Abstract: This paper presents the Block Transformer architecture which adopts hierarchical global-to-local modeling to autoregressive transformers to mitigate the inference bottlenecks of self-attention. To apply self-attention, the key-value (KV) cache of all previous sequences must be retrieved from memory at every decoding step. Thereby, this KV cache IO becomes a significant bottleneck in batch inferenc… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 30 pages, 21 figures, 5 tables

  5. arXiv:2406.01512  [pdf, other

    cs.CL

    MAD: Multi-Alignment MEG-to-Text Decoding

    Authors: Yiqian Yang, Hyejeong Jo, Yiqun Duan, Qiang Zhang, Jinni Zhou, Won Hee Lee, Renjing Xu, Hui Xiong

    Abstract: Deciphering language from brain activity is a crucial task in brain-computer interface (BCI) research. Non-invasive cerebral signaling techniques including electroencephalography (EEG) and magnetoencephalography (MEG) are becoming increasingly popular due to their safety and practicality, avoiding invasive electrode implantation. However, current works under-investigated three points: 1) a predomi… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  6. arXiv:2405.08424  [pdf, other

    cs.LG math.OC

    Tackling Prevalent Conditions in Unsupervised Combinatorial Optimization: Cardinality, Minimum, Covering, and More

    Authors: Fanchen Bu, Hyeonsoo Jo, Soo Yong Lee, Sungsoo Ahn, Kijung Shin

    Abstract: Combinatorial optimization (CO) is naturally discrete, making machine learning based on differentiable optimization inapplicable. Karalias & Loukas (2020) adapted the probabilistic method to incorporate CO into differentiable optimization. Their work ignited the research on unsupervised learning for CO, composed of two main components: probabilistic objectives and derandomization. However, each co… ▽ More

    Submitted 23 May, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

    Comments: ICML 2024

  7. arXiv:2405.06459  [pdf, other

    cs.CL cs.AI

    Are EEG-to-Text Models Working?

    Authors: Hyejeong Jo, Yiqian Yang, Juhyeok Han, Yiqun Duan, Hui Xiong, Won Hee Lee

    Abstract: This work critically analyzes existing models for open-vocabulary EEG-to-Text translation. We identify a crucial limitation: previous studies often employed implicit teacher-forcing during evaluation, artificially inflating performance metrics. Additionally, they lacked a critical benchmark - comparing model performance on pure noise inputs. We propose a methodology to differentiate between models… ▽ More

    Submitted 13 June, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

  8. arXiv:2405.03080  [pdf, other

    cs.SI physics.soc-ph

    Homophilic organization of egocentric communities in ICT services

    Authors: Chandreyee Roy, Hang-Hyun Jo, János Kertész, Kimmo Kaski, János Török

    Abstract: Members of a society can be characterized by a large number of features, such as gender, age, ethnicity, religion, social status, and shared activities. One of the main tie-forming factors between individuals in human societies is homophily, the tendency of being attracted to similar others. Homophily has been mainly studied with focus on one of the features and little is known about the roles of… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

    Comments: 8 pages, 7 figures, 1 table

  9. arXiv:2404.14873  [pdf, ps, other

    stat.ML cs.LG math.NA

    Estimating the Distribution of Parameters in Differential Equations with Repeated Cross-Sectional Data

    Authors: Hyeontae Jo, Sung Woong Cho, Hyung Ju Hwang

    Abstract: Differential equations are pivotal in modeling and understanding the dynamics of various systems, offering insights into their future states through parameter estimation fitted to time series data. In fields such as economy, politics, and biology, the observation data points in the time series are often independently obtained (i.e., Repeated Cross-Sectional (RCS) data). With RCS data, we found tha… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 16 pages, 10 figures

    MSC Class: 65L08; 65D17; 68U07

  10. arXiv:2404.14410  [pdf, other

    cs.CV

    Guess The Unseen: Dynamic 3D Scene Reconstruction from Partial 2D Glimpses

    Authors: Inhee Lee, Byungjun Kim, Hanbyul Joo

    Abstract: In this paper, we present a method to reconstruct the world and multiple dynamic humans in 3D from a monocular video input. As a key idea, we represent both the world and multiple humans via the recently emerging 3D Gaussian Splatting (3D-GS) representation, enabling to conveniently and efficiently compose and render them together. In particular, we address the scenarios with severely limited and… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: The project page is available at https://snuvclab.github.io/gtu/

  11. arXiv:2404.08672  [pdf, other

    cs.IR cs.AI cs.CL cs.CY cs.LG

    Taxonomy and Analysis of Sensitive User Queries in Generative AI Search

    Authors: Hwiyeol Jo, Taiwoo Park, Nayoung Choi, Changbong Kim, Ohjoon Kwon, Donghyeon Jeon, Hyunwoo Lee, Eui-Hyeon Lee, Kyoungho Shin, Sun Suk Lim, Kyungmi Kim, Jihye Lee, Sun Kim

    Abstract: Although there has been a growing interest among industries to integrate generative LLMs into their services, limited experiences and scarcity of resources acts as a barrier in launching and servicing large-scale LLM-based conversational services. In this paper, we share our experiences in developing and operating generative AI models within a national-scale search engine, with a specific focus on… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  12. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  13. arXiv:2403.16444  [pdf, other

    cs.CL

    KIT-19: A Comprehensive Korean Instruction Toolkit on 19 Tasks for Fine-Tuning Korean Large Language Models

    Authors: Dongjun Jang, Sungjoo Byun, Hyemi Jo, Hyopil Shin

    Abstract: Instruction Tuning on Large Language Models is an essential process for model to function well and achieve high performance in specific tasks. Accordingly, in mainstream languages such as English, instruction-based datasets are being constructed and made publicly available. In the case of Korean, publicly available models and datasets all rely on using the output of ChatGPT or translating datasets… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  14. arXiv:2403.01748  [pdf, other

    cs.CL cs.AI

    NeuSpeech: Decode Neural signal as Speech

    Authors: Yiqian Yang, Yiqun Duan, Qiang Zhang, Hyejeong Jo, Jinni Zhou, Won Hee Lee, Renjing Xu, Hui Xiong

    Abstract: Decoding language from brain dynamics is an important open direction in the realm of brain-computer interface (BCI), especially considering the rapid growth of large language models. Compared to invasive-based signals which require electrode implantation surgery, non-invasive neural signals (e.g. EEG, MEG) have attracted increasing attention considering their safety and generality. However, the ex… ▽ More

    Submitted 3 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  15. arXiv:2402.10636  [pdf, other

    cs.CV

    PEGASUS: Personalized Generative 3D Avatars with Composable Attributes

    Authors: Hyunsoo Cha, Byungjun Kim, Hanbyul Joo

    Abstract: We present PEGASUS, a method for constructing a personalized generative 3D face avatar from monocular video sources. Our generative 3D avatar enables disentangled controls to selectively alter the facial attributes (e.g., hair or nose) while preserving the identity. Our approach consists of two stages: synthetic database generation and constructing a personalized generative avatar. We generate a s… ▽ More

    Submitted 2 April, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: Accepted at CVPR 2024, Project Page: https://snuvclab.github.io/pegasus/

  16. arXiv:2401.13872  [pdf, other

    cs.LG

    Edge Conditional Node Update Graph Neural Network for Multi-variate Time Series Anomaly Detection

    Authors: Hayoung Jo, Seong-Whan Lee

    Abstract: With the rapid advancement in cyber-physical systems, the increasing number of sensors has significantly complicated manual monitoring of system states. Consequently, graph-based time-series anomaly detection methods have gained attention due to their ability to explicitly represent relationships between sensors. However, these methods often apply a uniform source node representation across all co… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

  17. arXiv:2401.12979  [pdf, other

    cs.CV

    GALA: Generating Animatable Layered Assets from a Single Scan

    Authors: Taeksoo Kim, Byungjun Kim, Shunsuke Saito, Hanbyul Joo

    Abstract: We present GALA, a framework that takes as input a single-layer clothed 3D human mesh and decomposes it into complete multi-layered 3D assets. The outputs can then be combined with other assets to create novel clothed human avatars with any pose. Existing reconstruction approaches often treat clothed humans as a single-layer of geometry and overlook the inherent compositionality of humans with hai… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: The project page is available at https://snuvclab.github.io/gala/

  18. arXiv:2401.12978  [pdf, other

    cs.CV

    Zero-Shot Learning for the Primitives of 3D Affordance in General Objects

    Authors: Hyeonwoo Kim, Sookwan Han, Patrick Kwon, Hanbyul Joo

    Abstract: One of the major challenges in AI is teaching machines to precisely respond and utilize environmental functionalities, thereby achieving the affordance awareness that humans possess. Despite its importance, the field has been lagging in terms of learning, especially in 3D, as annotating affordance accompanies a laborious process due to the numerous variations of human-object interaction. The low a… ▽ More

    Submitted 24 January, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Comments: Project Page: https://sshowbiz.github.io/ZSP3A/

  19. arXiv:2401.10232  [pdf, other

    cs.CV

    ParaHome: Parameterizing Everyday Home Activities Towards 3D Generative Modeling of Human-Object Interactions

    Authors: Jeonghwan Kim, Jisoo Kim, Jeonghyeon Na, Hanbyul Joo

    Abstract: To enable machines to learn how humans interact with the physical world in our daily activities, it is crucial to provide rich data that encompasses the 3D motion of humans as well as the motion of objects in a learnable 3D representation. Ideally, this data should be collected in a natural setup, capturing the authentic dynamic 3D signals during human-object interactions. To address this challeng… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

  20. Optimizing Dataflow Systems for Scalable Interactive Visualization

    Authors: Junran Yang, Hyekang Kevin Joo, Sai Yerramreddy, Dominik Moritz, Leilani Battle

    Abstract: Supporting the interactive exploration of large datasets is a popular and challenging use case for data management systems. Traditionally, the interface and the back-end system are built and optimized separately, and interface design and system optimization require different skill sets that are difficult for one person to master. To enable analysts to focus on visualization design, we contribute V… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

  21. arXiv:2401.00847  [pdf, other

    cs.CV cs.GR

    Mocap Everyone Everywhere: Lightweight Motion Capture With Smartwatches and a Head-Mounted Camera

    Authors: Jiye Lee, Hanbyul Joo

    Abstract: We present a lightweight and affordable motion capture method based on two smartwatches and a head-mounted camera. In contrast to the existing approaches that use six or more expert-level IMU devices, our approach is much more cost-effective and convenient. Our method can make wearable motion capture accessible to everyone everywhere, enabling 3D full-body motion capture in diverse environments. A… ▽ More

    Submitted 6 May, 2024; v1 submitted 1 January, 2024; originally announced January 2024.

    Comments: Accepted to CVPR 2024; Project page: https://jiyewise.github.io/projects/MocapEvery/

  22. arXiv:2311.18215  [pdf, other

    cs.CL

    Automatic Construction of a Korean Toxic Instruction Dataset for Ethical Tuning of Large Language Models

    Authors: Sungjoo Byun, Dongjun Jang, Hyemi Jo, Hyopil Shin

    Abstract: Caution: this paper may include material that could be offensive or distressing. The advent of Large Language Models (LLMs) necessitates the development of training approaches that mitigate the generation of unethical language and aptly manage toxic user queries. Given the challenges related to human labor and the scarcity of data, we present KoTox, comprising 39K unethical instruction-output pa… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: NeurIPS 2023 Workshop on Instruction Tuning and Instruction Following

  23. arXiv:2311.13784  [pdf, other

    cs.CL

    DaG LLM ver 1.0: Pioneering Instruction-Tuned Language Modeling for Korean NLP

    Authors: Dongjun Jang, Sangah Lee, Sungjoo Byun, Jinwoong Kim, Jean Seo, Minseok Kim, Soyeon Kim, Chaeyoung Oh, Jaeyoon Kim, Hyemi Jo, Hyopil Shin

    Abstract: This paper presents the DaG LLM (David and Goliath Large Language Model), a language model specialized for Korean and fine-tuned through Instruction Tuning across 41 tasks within 13 distinct categories.

    Submitted 22 November, 2023; originally announced November 2023.

  24. arXiv:2311.08735  [pdf, other

    q-bio.NC cs.HC

    Neurophysiological Response Based on Auditory Sense for Brain Modulation Using Monaural Beat

    Authors: Ha-Na Jo, Young-Seok Kweon, Gi-Hwan Shin, Heon-Gyu Kwak, Seong-Whan Lee

    Abstract: Brain modulation is a modification process of brain activity through external stimulations. However, which condition can induce the activation is still unclear. Therefore, we aimed to identify brain activation conditions using 40 Hz monaural beat (MB). Under this stimulation, auditory sense status which is determined by frequency and power range is the condition to consider. Hence, we designed fiv… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: Accepted to EMBC 2023

  25. arXiv:2311.08703  [pdf, other

    q-bio.NC cs.HC

    Impact of Nap on Performance in Different Working Memory Tasks Using EEG

    Authors: Gi-Hwan Shin, Young-Seok Kweon, Heon-Gyu Kwak, Ha-Na Jo, Seong-Whan Lee

    Abstract: Electroencephalography (EEG) has been widely used to study the relationship between naps and working memory, yet the effects of naps on distinct working memory tasks remain unclear. Here, participants performed word-pair and visuospatial working memory tasks pre- and post-nap sessions. We found marked differences in accuracy and reaction time between tasks performed pre- and post-nap. In order to… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: Submitted to 2024 12th IEEE International Winter Conference on Brain-Computer Interface

  26. arXiv:2311.07962  [pdf, other

    q-bio.NC cs.HC

    Relationship Between Mood, Sleepiness, and EEG Functional Connectivity by 40 Hz Monaural Beats

    Authors: Ha-Na Jo, Young-Seok Kweon, Gi-Hwan Shin, Heon-Gyu Kwak, Seong-Whan Lee

    Abstract: The monaural beat is known that it can modulate brain and personal states. However, which changes in brain waves are related to changes in state is still unclear. Therefore, we aimed to investigate the effects of monaural beats and find the relationship between them. Ten participants took part in five separate random sessions, which included a baseline session and four sessions with monaural beats… ▽ More

    Submitted 20 November, 2023; v1 submitted 14 November, 2023; originally announced November 2023.

  27. arXiv:2311.07868  [pdf, other

    cs.LG cs.AI eess.SP

    Multi-Signal Reconstruction Using Masked Autoencoder From EEG During Polysomnography

    Authors: Young-Seok Kweon, Gi-Hwan Shin, Heon-Gyu Kwak, Ha-Na Jo, Seong-Whan Lee

    Abstract: Polysomnography (PSG) is an indispensable diagnostic tool in sleep medicine, essential for identifying various sleep disorders. By capturing physiological signals, including EEG, EOG, EMG, and cardiorespiratory metrics, PSG presents a patient's sleep architecture. However, its dependency on complex equipment and expertise confines its use to specialized clinical settings. Addressing these limitati… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: Proc. 12th IEEE International Winter Conference on Brain-Computer Interface

  28. arXiv:2311.00322  [pdf, other

    cs.LG cs.AI

    Robust Graph Clustering via Meta Weighting for Noisy Graphs

    Authors: Hyeonsoo Jo, Fanchen Bu, Kijung Shin

    Abstract: How can we find meaningful clusters in a graph robustly against noise edges? Graph clustering (i.e., dividing nodes into groups of similar ones) is a fundamental problem in graph analysis with applications in various fields. Recent studies have demonstrated that graph neural network (GNN) based approaches yield promising results for graph clustering. However, we observe that their performance dege… ▽ More

    Submitted 8 November, 2023; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: CIKM '23: Proceedings of the 32nd ACM International Conference on Information and Knowledge Management

  29. arXiv:2309.01166  [pdf, other

    cs.CV cs.AI

    Spatial-temporal Vehicle Re-identification

    Authors: Hye-Geun Kim, YouKyoung Na, Hae-Won Joe, Yong-Hyuk Moon, Yeong-Jun Cho

    Abstract: Vehicle re-identification (ReID) in a large-scale camera network is important in public safety, traffic control, and security. However, due to the appearance ambiguities of vehicle, the previous appearance-based ReID methods often fail to track vehicle across multiple cameras. To overcome the challenge, we propose a spatial-temporal vehicle ReID framework that estimates reliable camera network top… ▽ More

    Submitted 3 September, 2023; originally announced September 2023.

    Comments: 10 pages, 6 figures

  30. arXiv:2308.12288  [pdf, other

    cs.CV cs.AI

    CHORUS: Learning Canonicalized 3D Human-Object Spatial Relations from Unbounded Synthesized Images

    Authors: Sookwan Han, Hanbyul Joo

    Abstract: We present a method for teaching machines to understand and model the underlying spatial common sense of diverse human-object interactions in 3D in a self-supervised way. This is a challenging task, as there exist specific manifolds of the interactions that can be considered human-like and natural, but the human pose and the geometry of objects can vary even for similar interactions. Such diversit… ▽ More

    Submitted 3 September, 2023; v1 submitted 23 August, 2023; originally announced August 2023.

    Comments: Accepted to ICCV 2023 (Oral Presentation). Project Page: https://jellyheadandrew.github.io/projects/chorus

  31. arXiv:2305.14345  [pdf, other

    cs.CV

    NCHO: Unsupervised Learning for Neural 3D Composition of Humans and Objects

    Authors: Taeksoo Kim, Shunsuke Saito, Hanbyul Joo

    Abstract: Deep generative models have been recently extended to synthesizing 3D digital humans. However, previous approaches treat clothed humans as a single chunk of geometry without considering the compositionality of clothing and accessories. As a result, individual items cannot be naturally composed into novel identities, leading to limited expressiveness and controllability of generative 3D avatars. Wh… ▽ More

    Submitted 29 May, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: The project page is available at https://taeksuu.github.io/ncho/

  32. arXiv:2305.11870  [pdf, other

    cs.CV

    Chupa: Carving 3D Clothed Humans from Skinned Shape Priors using 2D Diffusion Probabilistic Models

    Authors: Byungjun Kim, Patrick Kwon, Kwangho Lee, Myunggi Lee, Sookwan Han, Daesik Kim, Hanbyul Joo

    Abstract: We propose a 3D generation pipeline that uses diffusion models to generate realistic human digital avatars. Due to the wide variety of human identities, poses, and stochastic details, the generation of 3D human meshes has been a challenging problem. To address this, we decompose the problem into 2D normal map generation and normal map-based 3D reconstruction. Specifically, we first simultaneously… ▽ More

    Submitted 15 September, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: Project Page: https://snuvclab.github.io/chupa/

  33. arXiv:2305.02622  [pdf, other

    physics.flu-dyn cs.LG

    Critical heat flux diagnosis using conditional generative adversarial networks

    Authors: UngJin Na, Moonhee Choi, HangJin Jo

    Abstract: The critical heat flux (CHF) is an essential safety boundary in boiling heat transfer processes employed in high heat flux thermal-hydraulic systems. Identifying CHF is vital for preventing equipment damage and ensuring overall system safety, yet it is challenging due to the complexity of the phenomena. For an in-depth understanding of the complicated phenomena, various methodologies have been dev… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

  34. arXiv:2301.02667  [pdf, other

    cs.CV cs.GR cs.RO

    Locomotion-Action-Manipulation: Synthesizing Human-Scene Interactions in Complex 3D Environments

    Authors: Jiye Lee, Hanbyul Joo

    Abstract: Synthesizing interaction-involved human motions has been challenging due to the high complexity of 3D environments and the diversity of possible human behaviors within. We present LAMA, Locomotion-Action-MAnipulation, to synthesize natural and plausible long-term human movements in complex indoor environments. The key motivation of LAMA is to build a unified framework to encompass a series of ever… ▽ More

    Submitted 8 September, 2023; v1 submitted 9 January, 2023; originally announced January 2023.

    Comments: Accepted to ICCV 2023

  35. arXiv:2212.05136  [pdf, other

    cs.CV

    CLIP-TSA: CLIP-Assisted Temporal Self-Attention for Weakly-Supervised Video Anomaly Detection

    Authors: Hyekang Kevin Joo, Khoa Vo, Kashu Yamazaki, Ngan Le

    Abstract: Video anomaly detection (VAD) -- commonly formulated as a multiple-instance learning problem in a weakly-supervised manner due to its labor-intensive nature -- is a challenging problem in video surveillance where the frames of anomaly need to be localized in an untrimmed video. In this paper, we first propose to utilize the ViT-encoded visual features from CLIP, in contrast with the conventional C… ▽ More

    Submitted 3 July, 2023; v1 submitted 9 December, 2022; originally announced December 2022.

    Comments: Published at the 30th IEEE International Conference on Image Processing (IEEE ICIP 2023)

  36. arXiv:2211.04755  [pdf, other

    cs.CV cs.AI cs.LG

    Towards Global Crop Maps with Transfer Learning

    Authors: Hyun-Woo Jo, Alkiviadis Koukos, Vasileios Sitokonstantinou, Woo-Kyun Lee, Charalampos Kontoes

    Abstract: The continuous increase in global population and the impact of climate change on crop production are expected to affect the food sector significantly. In this context, there is need for timely, large-scale and precise mapping of crops for evidence-based decision making. A key enabler towards this direction are new satellite missions that freely offer big remote sensing data of high spatio-temporal… ▽ More

    Submitted 10 November, 2022; v1 submitted 9 November, 2022; originally announced November 2022.

    Comments: Accepted for publication at Tackling Climate Change with Machine Learning: workshop at NeurIPS 2022

  37. arXiv:2210.09394  [pdf

    cs.AI cs.LG

    Review Learning: Alleviating Catastrophic Forgetting with Generative Replay without Generator

    Authors: Jaesung Yoo, Sunghyuk Choi, Ye Seul Yang, Suhyeon Kim, Jieun Choi, Dongkyeong Lim, Yaeji Lim, Hyung Joon Joo, Dae Jung Kim, Rae Woong Park, Hyeong-Jin Yoon, Kwangsoo Kim

    Abstract: When a deep learning model is sequentially trained on different datasets, it forgets the knowledge acquired from previous data, a phenomenon known as catastrophic forgetting. It deteriorates performance of the deep learning model on diverse datasets, which is critical in privacy-preserving deep learning (PPDL) applications based on transfer learning (TL). To overcome this, we propose review learni… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

  38. arXiv:2209.02251  [pdf, other

    cs.CL

    External Knowledge Selection with Weighted Negative Sampling in Knowledge-grounded Task-oriented Dialogue Systems

    Authors: Janghoon Han, Joongbo Shin, Hosung Song, Hyunjik Jo, Gyeonghun Kim, Yireun Kim, Stanley Jungkyu Choi

    Abstract: Constructing a robust dialogue system on spoken conversations bring more challenge than written conversation. In this respect, DSTC10-Track2-Task2 is proposed, which aims to build a task-oriented dialogue (TOD) system incorporating unstructured external knowledge on a spoken conversation, extending DSTC9-Track1. This paper introduces our system containing four advanced methods: data construction,… ▽ More

    Submitted 6 September, 2022; originally announced September 2022.

    Comments: 7page, DSTC10-Track2-task2

  39. arXiv:2208.07009  [pdf, other

    physics.soc-ph cs.SI

    Copula-based analysis of the generalized friendship paradox in clustered networks

    Authors: Hang-Hyun Jo, Eun Lee, Young-Ho Eom

    Abstract: A heterogeneous structure of social networks induces various intriguing phenomena. One of them is the friendship paradox, which states that on average your friends have more friends than you do. Its generalization, called the generalized friendship paradox (GFP), states that on average your friends have higher attributes than yours. Despite successful demonstrations of the GFP by empirical analyse… ▽ More

    Submitted 8 December, 2022; v1 submitted 15 August, 2022; originally announced August 2022.

    Comments: 9 pages, 3 figures. arXiv admin note: text overlap with arXiv:2107.05838

  40. arXiv:2205.12685  [pdf, other

    cs.CL cs.AI cs.LG

    Ground-Truth Labels Matter: A Deeper Look into Input-Label Demonstrations

    Authors: Kang Min Yoo, Junyeob Kim, Hyuhng Joon Kim, Hyunsoo Cho, Hwiyeol Jo, Sang-Woo Lee, Sang-goo Lee, Taeuk Kim

    Abstract: Despite recent explosion of interests in in-context learning, the underlying mechanism and the precise impact of the quality of demonstrations remain elusive. Intuitively, ground-truth labels should have as much impact in in-context learning (ICL) as supervised learning, but recent work reported that the input-label correspondence is significantly less important than previously thought. Intrigued… ▽ More

    Submitted 24 October, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: Accepted to EMNLP Long. Kang Min Yoo and Junyeob Kim contributed equally. Kang Min Yoo and Taeuk Kim are the corresponding authors

  41. arXiv:2205.09185  [pdf, other

    physics.ins-det cs.LG hep-ex nucl-ex physics.comp-ph

    AI-assisted Optimization of the ECCE Tracking System at the Electron Ion Collider

    Authors: C. Fanelli, Z. Papandreou, K. Suresh, J. K. Adkins, Y. Akiba, A. Albataineh, M. Amaryan, I. C. Arsene, C. Ayerbe Gayoso, J. Bae, X. Bai, M. D. Baker, M. Bashkanov, R. Bellwied, F. Benmokhtar, V. Berdnikov, J. C. Bernauer, F. Bock, W. Boeglin, M. Borysova, E. Brash, P. Brindza, W. J. Briscoe, M. Brooks, S. Bueltmann , et al. (258 additional authors not shown)

    Abstract: The Electron-Ion Collider (EIC) is a cutting-edge accelerator facility that will study the nature of the "glue" that binds the building blocks of the visible matter in the universe. The proposed experiment will be realized at Brookhaven National Laboratory in approximately 10 years from now, with detector design and R&D currently ongoing. Notably, EIC is one of the first large-scale facilities to… ▽ More

    Submitted 19 May, 2022; v1 submitted 18 May, 2022; originally announced May 2022.

    Comments: 16 pages, 18 figures, 2 appendices, 3 tables

  42. arXiv:2205.02001  [pdf

    cs.CL cs.SD eess.AS

    Design of a novel Korean learning application for efficient pronunciation correction

    Authors: Minjong Cheon, Minseon Kim, Hanseon Joo

    Abstract: The Korean wave, which denotes the global popularity of South Korea's cultural economy, contributes to the increasing demand for the Korean language. However, as there does not exist any application for foreigners to learn Korean, this paper suggested a design of a novel Korean learning application. Speech recognition, speech-to-text, and speech-to-waveform are the three key systems in the propose… ▽ More

    Submitted 4 May, 2022; originally announced May 2022.

  43. arXiv:2205.00069  [pdf, other

    cs.CV

    Birds' Eye View: Measuring Behavior and Posture of Chickens as a Metric for Their Well-Being

    Authors: Kevin Hyekang Joo, Shiyuan Duan, Shawna L. Weimer, Mohammad Nayeem Teli

    Abstract: Chicken well-being is important for ensuring food security and better nutrition for a growing global human population. In this research, we represent behavior and posture as a metric to measure chicken well-being. With the objective of detecting chicken posture and behavior in a pen, we employ two algorithms: Mask R-CNN for instance segmentation and YOLOv4 in combination with ResNet50 for classifi… ▽ More

    Submitted 29 April, 2022; originally announced May 2022.

    Comments: under review at IJCV

  44. arXiv:2204.08451  [pdf, other

    cs.CV

    Learning to Listen: Modeling Non-Deterministic Dyadic Facial Motion

    Authors: Evonne Ng, Hanbyul Joo, Liwen Hu, Hao Li, Trevor Darrell, Angjoo Kanazawa, Shiry Ginosar

    Abstract: We present a framework for modeling interactional communication in dyadic conversations: given multimodal inputs of a speaker, we autoregressively output multiple possibilities of corresponding listener motion. We combine the motion and speech audio of the speaker using a motion-audio cross attention transformer. Furthermore, we enable non-deterministic prediction by learning a discrete latent rep… ▽ More

    Submitted 18 April, 2022; originally announced April 2022.

  45. arXiv:2201.08543  [pdf

    stat.ML cs.LG physics.geo-ph

    Deep Learning-Accelerated 3D Carbon Storage Reservoir Pressure Forecasting Based on Data Assimilation Using Surface Displacement from InSAR

    Authors: Hewei Tang, Pengcheng Fu, Honggeun Jo, Su Jiang, Christopher S. Sherman, François Hamon, Nicholas A. Azzolina, Joseph P. Morris

    Abstract: Fast forecasting of reservoir pressure distribution in geologic carbon storage (GCS) by assimilating monitoring data is a challenging problem. Due to high drilling cost, GCS projects usually have spatially sparse measurements from wells, leading to high uncertainties in reservoir pressure prediction. To address this challenge, we propose to use low-cost Interferometric Synthetic-Aperture Radar (In… ▽ More

    Submitted 26 January, 2022; v1 submitted 21 January, 2022; originally announced January 2022.

  46. Demonstration of VegaPlus: Optimizing Declarative Visualization Languages

    Authors: Junran Yang, Hyekang Kevin Joo, Sai S. Yerramreddy, Siyao Li, Dominik Moritz, Leilani Battle

    Abstract: While many visualization specification languages are user-friendly, they tend to have one critical drawback: they are designed for small data on the client-side and, as a result, perform poorly at scale. We propose a system that takes declarative visualization specifications as input and automatically optimizes the resulting visualization execution plans by offloading computational-intensive opera… ▽ More

    Submitted 8 March, 2022; v1 submitted 17 January, 2022; originally announced January 2022.

  47. arXiv:2112.12761  [pdf, other

    cs.CV cs.GR

    BANMo: Building Animatable 3D Neural Models from Many Casual Videos

    Authors: Gengshan Yang, Minh Vo, Natalia Neverova, Deva Ramanan, Andrea Vedaldi, Hanbyul Joo

    Abstract: Prior work for articulated 3D shape reconstruction often relies on specialized sensors (e.g., synchronized multi-camera systems), or pre-built 3D deformable models (e.g., SMAL or SMPL). Such methods are not able to scale to diverse sets of objects in the wild. We present BANMo, a method that requires neither a specialized sensor nor a pre-defined template shape. BANMo builds high-fidelity, articul… ▽ More

    Submitted 3 April, 2023; v1 submitted 23 December, 2021; originally announced December 2021.

    Comments: CVPR 2022 camera-ready version (last update: May 2022)

  48. arXiv:2112.00903  [pdf, other

    cs.AI cs.RO

    Modeling human intention inference in continuous 3D domains by inverse planning and body kinematics

    Authors: Yingdong Qian, Marta Kryven, Tao Gao, Hanbyul Joo, Josh Tenenbaum

    Abstract: How to build AI that understands human intentions, and uses this knowledge to collaborate with people? We describe a computational framework for evaluating models of goal inference in the domain of 3D motor actions, which receives as input the 3D coordinates of an agent's body, and of possible targets, to produce a continuously updated inference of the intended target. We evaluate our framework in… ▽ More

    Submitted 1 December, 2021; originally announced December 2021.

  49. arXiv:2111.13581  [pdf, other

    physics.geo-ph cs.AI

    Machine learning-based porosity estimation from spectral decomposed seismic data

    Authors: Honggeun Jo, Yongchae Cho, Michael J. Pyrcz, Hewei Tang, Pengcheng Fu

    Abstract: Estimating porosity models via seismic data is challenging due to the signal noise and insufficient resolution of seismic data. Although impedance inversion is often used by combining with well logs, several hurdles remain to retrieve sub-seismic scale porosity. As an alternative, we propose a machine learning-based workflow to convert seismic data to porosity models. A ResUNet++ based workflow is… ▽ More

    Submitted 22 November, 2021; originally announced November 2021.

  50. arXiv:2110.11474  [pdf, other

    cs.CV

    AEI: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposals Generation

    Authors: Khoa Vo, Hyekang Joo, Kashu Yamazaki, Sang Truong, Kris Kitani, Minh-Triet Tran, Ngan Le

    Abstract: Humans typically perceive the establishment of an action in a video through the interaction between an actor and the surrounding environment. An action only starts when the main actor in the video begins to interact with the environment, while it ends when the main actor stops the interaction. Despite the great progress in temporal action proposal generation, most existing works ignore the aforeme… ▽ More

    Submitted 24 October, 2021; v1 submitted 21 October, 2021; originally announced October 2021.

    Comments: Accepted in BMVC 2021 (Oral Session)