Showing 1–50 of 61 results for author: Yoo, Y

  1. arXiv:2406.19848  [pdf, other]

    cs.RO

    3D Operation of Autonomous Excavator based on Reinforcement Learning through Independent Reward for Individual Joints

    Authors: Yoonkyu Yoo, Donghwi Jung, Seong-Woo Kim

    Abstract: In this paper, we propose a control algorithm based on reinforcement learning, employing independent rewards for each joint to control excavators in a 3D space. The aim of this research is to address the challenges associated with achieving precise control of excavators, which are extensively utilized in construction sites but prove challenging to control with precision due to their hydraulic stru… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  2. arXiv:2406.12258  [pdf, other]

    cs.CV

    Advancing Cross-Domain Generalizability in Face Anti-Spoofing: Insights, Design, and Metrics

    Authors: Hyojin Kim, Jiyoon Lee, Yonghyun Jeong, Haneol Jang, YoungJoon Yoo

    Abstract: This paper presents a novel perspective for enhancing anti-spoofing performance in zero-shot data domain generalization. Unlike traditional image classification tasks, face anti-spoofing datasets display unique generalization characteristics, necessitating novel zero-shot data domain generalization. One step forward to the previous frame-wise spoofing prediction, we introduce a nuanced metric calc… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 10 pages with 4 figures, Accepted by CVPRW 2024

  3. arXiv:2404.01954  [pdf, other]

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han, et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  4. arXiv:2404.00921  [pdf, other]

    cs.CV

    Towards Label-Efficient Human Matting: A Simple Baseline for Weakly Semi-Supervised Trimap-Free Human Matting

    Authors: Beomyoung Kim, Myeong Yeon Yi, Joonsang Yu, Young Joon Yoo, Sung Ju Hwang

    Abstract: This paper presents a new practical training method for human matting, which demands delicate pixel-level human region identification and significantly laborious annotations. To reduce the annotation cost, most existing matting approaches often rely on image synthesis to augment the dataset. However, the unnaturalness of synthesized training images brings in a new domain generalization challenge f… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: Preprint, 15 pages, 13 figures

  5. arXiv:2403.12449  [pdf, other]

    cs.RO

    Multi-Object RANSAC: Efficient Plane Clustering Method in a Clutter

    Authors: Seunghyeon Lim, Youngjae Yoo, Jun Ki Lee, Byoung-Tak Zhang

    Abstract: In this paper, we propose a novel method for plane clustering specialized in cluttered scenes using an RGB-D camera and validate its effectiveness through robot grasping experiments. Unlike existing methods, which focus on large-scale indoor structures, our approach -- Multi-Object RANSAC emphasizes cluttered environments that contain a wide range of objects with different scales. It enhances plan… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: 7 pages, 6 figures

  6. arXiv:2403.09490  [pdf, other]

    cs.CL

    Hyper-CL: Conditioning Sentence Representations with Hypernetworks

    Authors: Young Hyun Yoo, Jii Cha, Changhyeon Kim, Taeuk Kim

    Abstract: While the introduction of contrastive learning frameworks in sentence representation learning has significantly contributed to advancements in the field, it still remains unclear whether state-of-the-art sentence embeddings can capture the fine-grained semantics of sentences, particularly when conditioned on specific perspectives. In this paper, we introduce Hyper-CL, an efficient methodology that… ▽ More

    Submitted 6 June, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: ACL 2024

  7. Multimodal Anomaly Detection based on Deep Auto-Encoder for Object Slip Perception of Mobile Manipulation Robots

    Authors: Youngjae Yoo, Chung-Yeon Lee, Byoung-Tak Zhang

    Abstract: Object slip perception is essential for mobile manipulation robots to perform manipulation tasks reliably in the dynamic real-world. Traditional approaches to robot arms' slip perception use tactile or vision sensors. However, mobile robots still have to deal with noise in their sensor signals caused by the robot's movement in a changing environment. To solve this problem, we present an anomaly de… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  8. arXiv:2401.09048  [pdf, other]

    cs.CV

    Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis

    Authors: Jonghyun Lee, Hansam Cho, Youngjoon Yoo, Seoung Bum Kim, Yonghyun Jeong

    Abstract: Addressing the limitations of text as a source of accurate layout representation in text-conditional diffusion models, many works incorporate additional signals to condition certain attributes within a generated image. Although successful, previous works do not account for the specific localization of said attributes extended into the three dimensional plane. In this context, we present a conditio… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: ICLR 2024

  9. Gaussian Mixture Proposals with Pull-Push Learning Scheme to Capture Diverse Events for Weakly Supervised Temporal Video Grounding

    Authors: Sunoh Kim, Jungchan Cho, Joonsang Yu, YoungJoon Yoo, Jin Young Choi

    Abstract: In the weakly supervised temporal video grounding study, previous methods use predetermined single Gaussian proposals which lack the ability to express diverse events described by the sentence query. To enhance the expression ability of a proposal, we propose a Gaussian mixture proposal (GMP) that can depict arbitrary shapes by learning importance, centroid, and range of every Gaussian in the mixt… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: Accepted in AAAI 2024

  10. arXiv:2312.11532  [pdf, other]

    cs.CL cs.AI cs.LG

    Topic-VQ-VAE: Leveraging Latent Codebooks for Flexible Topic-Guided Document Generation

    Authors: YoungJoon Yoo, Jongwon Choi

    Abstract: This paper introduces a novel approach for topic modeling utilizing latent codebooks from Vector-Quantized Variational Auto-Encoder (VQ-VAE), discretely encapsulating the rich information of the pre-trained embeddings such as the pre-trained language model. From the novel interpretation of the latent codebooks and embeddings as conceptual bag-of-words, we propose a new generative topic model calle… ▽ More

    Submitted 21 January, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: Published in the 38th annual AAAI conference on Artificial Intelligence

  11. arXiv:2311.11169  [pdf]

    eess.IV cs.AI cs.LG eess.SP

    Deep Coherence Learning: An Unsupervised Deep Beamformer for High Quality Single Plane Wave Imaging in Medical Ultrasound

    Authors: Hyunwoo Cho, Seongjun Park, Jinbum Kang, Yangmo Yoo

    Abstract: Plane wave imaging (PWI) in medical ultrasound is becoming an important reconstruction method with high frame rates and new clinical applications. Recently, single PWI based on deep learning (DL) has been studied to overcome lowered frame rates of traditional PWI with multiple PW transmissions. However, due to the lack of appropriate ground truth images, DL-based PWI still remains challenging for… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

  12. arXiv:2305.08611  [pdf, other]

    cs.CV

    GeNAS: Neural Architecture Search with Better Generalization

    Authors: Joonhyun Jeong, Joonsang Yu, Geondo Park, Dongyoon Han, YoungJoon Yoo

    Abstract: Neural Architecture Search (NAS) aims to automatically excavate the optimal network architecture with superior test performance. Recent neural architecture search (NAS) approaches rely on validation loss or accuracy to find the superior network for the target data. In this paper, we investigate a new neural architecture search measure for excavating architectures with better generalization. We dem… ▽ More

    Submitted 18 May, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: Accepted by IJCAI2023

  13. arXiv:2303.16767  [pdf, other]

    cs.IR cs.AI cs.SI

    A Novel Patent Similarity Measurement Methodology: Semantic Distance and Technological Distance

    Authors: Yongmin Yoo, Cheonkam Jeong, Sanguk Gim, Junwon Lee, Zachary Schimke, Deaho Seo

    Abstract: Patent similarity analysis plays a crucial role in evaluating the risk of patent infringement. Nonetheless, this analysis is predominantly conducted manually by legal experts, often resulting in a time-consuming process. Recent advances in natural language processing technology offer a promising avenue for automating this process. However, methods for measuring similarity between patents still rel… ▽ More

    Submitted 30 November, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

  14. arXiv:2303.03165  [pdf]

    cs.CL cs.AI

    Multi label classification of Artificial Intelligence related patents using Modified D2SBERT and Sentence Attention mechanism

    Authors: Yongmin Yoo, Tak-Sung Heo, Dongjin Lim, Deaho Seo

    Abstract: Patent classification is an essential task in patent information management and patent knowledge mining. It is very important to classify patents related to artificial intelligence, which is the biggest topic these days. However, artificial intelligence-related patents are very difficult to classify because it is a mixture of complex technologies and legal terms. Moreover, due to the unsatisfactor… ▽ More

    Submitted 3 March, 2023; originally announced March 2023.

  15. arXiv:2211.16090  [pdf, other]

    cs.RO

    Compliant Suction Gripper with Seamless Deployment and Retraction for Robust Picking against Depth and Tilt Errors

    Authors: Yuna Yoo, Jaemin Eom, Min Jo Park, Kyu-Jin Cho

    Abstract: Applying suction grippers in unstructured environments is a challenging task because of depth and tilt errors in vision systems, requiring additional costs in elaborate sensing and control. To reduce additional costs, suction grippers with compliant bodies or mechanisms have been proposed; however, their bulkiness and limited allowable error hinder their use in complex environments with large erro… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: 8 pages, 11 figures

  16. arXiv:2209.12417  [pdf]

    cs.AI cs.CL

    5-Star Hotel Customer Satisfaction Analysis Using Hybrid Methodology

    Authors: Yongmin Yoo, Yeongjoon Park, Dongjin Lim, Deaho Seo

    Abstract: Due to the rapid development of non-face-to-face services due to the corona virus, commerce through the Internet, such as sales and reservations, is increasing very rapidly. Consumers also post reviews, suggestions, or judgments about goods or services on the website. The review data directly used by consumers provides positive feedback and nice impact to consumers, such as creating business value… ▽ More

    Submitted 26 September, 2022; originally announced September 2022.

  17. arXiv:2208.10644  [pdf, other]

    cs.CR eess.SY

    Machine Learning-Enabled Cyber Attack Prediction and Mitigation for EV Charging Stations

    Authors: Mansi Girdhar, Junho Hong, Yongsik Yoo, Tai-Jin Song

    Abstract: Safe and reliable electric vehicle charging stations (EVCSs) have become imperative in an intelligent transportation infrastructure. Over the years, there has been a rapid increase in the deployment of EVCSs to address the upsurging charging demands. However, advances in information and communication technologies (ICT) have rendered this cyber-physical system (CPS) vulnerable to suffering cyber th… ▽ More

    Submitted 22 August, 2022; originally announced August 2022.

    Comments: 5 pages, 4 figures, 11 mathematical equations

  18. arXiv:2204.10825  [pdf, other]

    cs.CL

    Meet Your Favorite Character: Open-domain Chatbot Mimicking Fictional Characters with only a Few Utterances

    Authors: Seungju Han, Beomsu Kim, Jin Yong Yoo, Seokjun Seo, Sangbum Kim, Enkhbayar Erdenee, Buru Chang

    Abstract: In this paper, we consider mimicking fictional characters as a promising direction for building engaging conversation models. To this end, we present a new practical task where only a few utterances of each fictional character are available to generate responses mimicking them. Furthermore, we propose a new method named Pseudo Dialog Prompting (PDP) that generates responses by leveraging the power… ▽ More

    Submitted 22 April, 2022; originally announced April 2022.

    Comments: NAACL2022 (Short)

  19. arXiv:2204.02633  [pdf]

    cs.CL cs.AI

    DAGAM: Data Augmentation with Generation And Modification

    Authors: Byeong-Cheol Jo, Tak-Sung Heo, Yeongjoon Park, Yongmin Yoo, Won Ik Cho, Kyungsun Kim

    Abstract: Text classification is a representative downstream task of natural language processing, and has exhibited excellent performance since the advent of pre-trained language models based on Transformer architecture. However, in pre-trained language models, under-fitting often occurs due to the size of the model being very large compared to the amount of available training data. Along with significant i… ▽ More

    Submitted 6 April, 2022; originally announced April 2022.

  20. arXiv:2204.01209  [pdf, other]

    cs.CV

    EResFD: Rediscovery of the Effectiveness of Standard Convolution for Lightweight Face Detection

    Authors: Joonhyun Jeong, Beomyoung Kim, Joonsang Yu, Youngjoon Yoo

    Abstract: This paper analyzes the design choices of face detection architecture that improve efficiency of computation cost and accuracy. Specifically, we re-examine the effectiveness of the standard convolutional block as a lightweight backbone architecture for face detection. Unlike the current tendency of lightweight architecture design, which heavily utilizes depthwise separable convolution layers, we s… ▽ More

    Submitted 2 November, 2023; v1 submitted 3 April, 2022; originally announced April 2022.

    Comments: Accepted by WACV 2024

  21. arXiv:2203.12445  [pdf, ps, other]

    cs.DC cs.SI

    ShareTrace: Contact Tracing with the Actor Model

    Authors: Ryan Tatton, Erman Ayday, Youngjin Yoo, Anisa Halimi

    Abstract: Proximity-based contact tracing relies on mobile-device interaction to estimate the spread of disease. ShareTrace is one such approach that improves the efficacy of tracking disease spread by considering direct and indirect forms of contact. In this work, we utilize the actor model to provide an efficient and scalable formulation of ShareTrace with asynchronous, concurrent message passing on a tem… ▽ More

    Submitted 18 September, 2022; v1 submitted 23 March, 2022; originally announced March 2022.

    Comments: To be published in IEEE HealthCom 2022 Conference Proceedings; added mathematical detail about message reachability; improved explanations of algorithms and figures, updated conclusion, fixed typos, results unchanged; 6 pages with 3 figures

    ACM Class: F.1.2; G.2.2; J.3; G.3

  22. arXiv:2202.02777  [pdf, other]

    cs.CV cs.LG

    Learning Features with Parameter-Free Layers

    Authors: Dongyoon Han, YoungJoon Yoo, Beomyoung Kim, Byeongho Heo

    Abstract: Trainable layers such as convolutional building blocks are the standard network design choices by learning parameters to capture the global context through successive spatial operations. When designing an efficient network, trainable layers such as the depthwise convolution is the source of efficiency in the number of parameters and FLOPs, but there was little improvement to the model speed in pra… ▽ More

    Submitted 20 March, 2022; v1 submitted 6 February, 2022; originally announced February 2022.

    Comments: ICLR 2022

  23. arXiv:2201.01283  [pdf, other]

    cs.CV

    Self-supervised Learning from 100 Million Medical Images

    Authors: Florin C. Ghesu, Bogdan Georgescu, Awais Mansoor, Youngjin Yoo, Dominik Neumann, Pragneshkumar Patel, R. S. Vishwanath, James M. Balter, Yue Cao, Sasa Grbic, Dorin Comaniciu

    Abstract: Building accurate and robust artificial intelligence systems for medical image assessment requires not only the research and design of advanced deep learning models but also the creation of large and curated sets of annotated training examples. Constructing such datasets, however, is often very costly -- due to the complex nature of annotation tasks and the high level of expertise required for the… ▽ More

    Submitted 4 January, 2022; originally announced January 2022.

  24. arXiv:2112.06278  [pdf, other]

    math.CO cs.DM cs.DS

    Approximating TSP walks in subcubic graphs

    Authors: Michael C. Wigal, Youngho Yoo, Xingxing Yu

    Abstract: We prove that every simple 2-connected subcubic graph on $n$ vertices with $n_2$ vertices of degree 2 has a TSP walk of length at most $\frac{5n+n_2}{4}-1$, confirming a conjecture of Dvořák, Král', and Mohar. This bound is best possible; there are infinitely many subcubic and cubic graphs whose minimum TSP walks have lengths $\frac{5n+n_2}{4}-1$ and $\frac{5n}{4} - 2$ respectively. We characteriz… ▽ More

    Submitted 12 December, 2021; originally announced December 2021.

    Comments: 30 pages

  25. arXiv:2111.11295  [pdf]

    cs.IR cs.AI cs.LG

    Artificial Intelligence Technology analysis using Artificial Intelligence patent through Deep Learning model and vector space model

    Authors: Yongmin Yoo, Dongjin Lim, Kyungsun Kim

    Abstract: Thanks to rapid development of artificial intelligence technology in recent years, the current artificial intelligence technology is contributing to many part of society. Education, environment, medical care, military, tourism, economy, politics, etc. are having a very large impact on society as a whole. For example, in the field of education, there is an artificial intelligence tutoring system th… ▽ More

    Submitted 7 November, 2021; originally announced November 2021.

  26. arXiv:2110.04248  [pdf, other]

    cs.CV cs.LG

    Observations on K-image Expansion of Image-Mixing Augmentation for Classification

    Authors: Joonhyun Jeong, Sungmin Cha, Youngjoon Yoo, Sangdoo Yun, Taesup Moon, Jongwon Choi

    Abstract: Image-mixing augmentations (e.g., Mixup and CutMix), which typically involve mixing two images, have become the de-facto training techniques for image classification. Despite their huge success in image classification, the number of images to be mixed has not been elucidated in the literature: only the naive K-image expansion has been shown to lead to performance degradation. This study derives a… ▽ More

    Submitted 17 March, 2023; v1 submitted 8 October, 2021; originally announced October 2021.

    Comments: Preprint

  27. arXiv:2109.09477  [pdf, other]

    cs.CV

    Beyond Semantic to Instance Segmentation: Weakly-Supervised Instance Segmentation via Semantic Knowledge Transfer and Self-Refinement

    Authors: Beomyoung Kim, Youngjoon Yoo, Chaeeun Rhee, Junmo Kim

    Abstract: Weakly-supervised instance segmentation (WSIS) has been considered as a more challenging task than weakly-supervised semantic segmentation (WSSS). Compared to WSSS, WSIS requires instance-wise localization, which is difficult to extract from image-level labels. To tackle the problem, most WSIS approaches use off-the-shelf proposal techniques that require pre-training with instance or object level… ▽ More

    Submitted 29 March, 2022; v1 submitted 20 September, 2021; originally announced September 2021.

    Comments: CVPR 2022, Accepted

  28. arXiv:2109.08796   

    cs.IR cs.CL

    Solar cell patent classification method based on keyword extraction and deep neural network

    Authors: Yongmin Yoo, Dongjin Lim, Tak-Sung Heo

    Abstract: With the growing impact of ESG on businesses, research related to renewable energy is receiving great attention. Solar cells are one of them, and accordingly, it can be said that the research value of solar cell patent analysis is very high. Patent documents have high research value. Being able to accurately analyze and classify patent documents can reveal several important technical relationships… ▽ More

    Submitted 8 December, 2021; v1 submitted 17 September, 2021; originally announced September 2021.

    Comments: The content and quality of the thesis is too low, and the title and content have been changed and will be uploaded

  29. arXiv:2109.00544  [pdf, other]

    cs.CL cs.AI cs.LG

    Towards Improving Adversarial Training of NLP Models

    Authors: Jin Yong Yoo, Yanjun Qi

    Abstract: Adversarial training, a method for learning robust deep neural networks, constructs adversarial examples during training. However, recent methods for generating NLP adversarial examples involve combinatorial search and expensive sentence encoders for constraining the generated instances. As a result, it remains challenging to use vanilla adversarial training to improve NLP models' performance, and… ▽ More

    Submitted 11 September, 2021; v1 submitted 1 September, 2021; originally announced September 2021.

    Comments: EMNLP Findings 2021

  30. arXiv:2106.11644  [pdf, other]

    cs.CV cs.LG

    NCIS: Neural Contextual Iterative Smoothing for Purifying Adversarial Perturbations

    Authors: Sungmin Cha, Naeun Ko, Youngjoon Yoo, Taesup Moon

    Abstract: We propose a novel and effective purification based adversarial defense method against pre-processor blind white- and black-box attacks. Our method is computationally efficient and trained only with self-supervised learning on general images, without requiring any adversarial training or retraining of the classification model. We first show an empirical analysis on the adversarial noise, defined t… ▽ More

    Submitted 30 December, 2021; v1 submitted 22 June, 2021; originally announced June 2021.

    Comments: Preprint version

  31. arXiv:2106.11562  [pdf, other]

    cs.CV cs.LG

    SSUL: Semantic Segmentation with Unknown Label for Exemplar-based Class-Incremental Learning

    Authors: Sungmin Cha, Beomyoung Kim, Youngjoon Yoo, Taesup Moon

    Abstract: This paper introduces a solid state-of-the-art baseline for a class-incremental semantic segmentation (CISS) problem. While the recent CISS algorithms utilize variants of the knowledge distillation (KD) technique to tackle the problem, they failed to fully address the critical challenges in CISS causing the catastrophic forgetting; the semantic drift of the background class and the multi-label pre… ▽ More

    Submitted 19 November, 2021; v1 submitted 22 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021 camera ready version

  32. arXiv:2106.08918  [pdf, other]

    cs.LG cs.NE eess.SY

    Towards Automatic Actor-Critic Solutions to Continuous Control

    Authors: Jake Grigsby, Jin Yong Yoo, Yanjun Qi

    Abstract: Model-free off-policy actor-critic methods are an efficient solution to complex continuous control tasks. However, these algorithms rely on a number of design tricks and hyperparameters, making their application to new domains difficult and computationally expensive. This paper creates an evolutionary approach that automatically tunes these design decisions and eliminates the RL-specific hyperpara… ▽ More

    Submitted 23 October, 2021; v1 submitted 16 June, 2021; originally announced June 2021.

    Comments: NeurIPS Deep RL Workshop 2021

  33. arXiv:2106.07932  [pdf]

    cs.AI

    Medical Code Prediction from Discharge Summary: Document to Sequence BERT using Sequence Attention

    Authors: Tak-Sung Heo, Yongmin Yoo, Yeongjoon Park, Byeong-Cheol Jo, Kyungsun Kim

    Abstract: Clinical notes are unstructured text generated by clinicians during patient encounters. Clinical notes are usually accompanied by a set of metadata codes from the International Classification of Diseases (ICD). ICD code is an important code used in various operations, including insurance, reimbursement, medical diagnosis, etc. Therefore, it is important to classify ICD codes quickly and accurately.… ▽ More

    Submitted 10 November, 2021; v1 submitted 15 June, 2021; originally announced June 2021.

  34. arXiv:2105.00648  [pdf]

    cs.AI

    A novel hybrid methodology of measuring sentence similarity

    Authors: Yongmin Yoo, Tak-Sung Heo, Yeongjoon Park, Kyungsun Kim

    Abstract: The problem of measuring sentence similarity is an essential issue in the natural language processing (NLP) area. It is necessary to measure the similarity between sentences accurately. There are many approaches to measuring sentence similarity. Deep learning methodology shows a state-of-the-art performance in many natural language processing fields and is used a lot in sentence similarity measure… ▽ More

    Submitted 20 June, 2021; v1 submitted 3 May, 2021; originally announced May 2021.

  35. arXiv:2103.17230  [pdf, other]

    cs.CV cs.LG

    Rainbow Memory: Continual Learning with a Memory of Diverse Samples

    Authors: Jihwan Bang, Heesu Kim, YoungJoon Yoo, Jung-Woo Ha, Jonghyun Choi

    Abstract: Continual learning is a realistic learning scenario for AI models. Prevalent scenario of continual learning, however, assumes disjoint sets of classes as tasks and is less realistic rather artificial. Instead, we focus on 'blurry' task boundary; where tasks shares classes and is more realistic and practical. To address such task, we argue the importance of diversity of samples in an episodic memor… ▽ More

    Submitted 31 March, 2021; originally announced March 2021.

    Comments: Accepted paper at CVPR 2021

  36. arXiv:2101.00200  [pdf, other]

    cs.CV cs.AI

    More than just an auxiliary loss: Anti-spoofing Backbone Training via Adversarial Pseudo-depth Generation

    Authors: Chang Keun Paik, Naeun Ko, Youngjoon Yoo

    Abstract: In this paper, a new method of training pipeline is discussed to achieve significant performance on the task of anti-spoofing with RGB image. We explore and highlight the impact of using pseudo-depth to pre-train a network that will be used as the backbone to the final classifier. While the usage of pseudo-depth for anti-spoofing task is not a new idea on its own, previous endeavours utilize pseud… ▽ More

    Submitted 19 March, 2021; v1 submitted 1 January, 2021; originally announced January 2021.

  37. arXiv:2010.01724  [pdf, other]

    cs.SE

    TextAttack: Lessons learned in designing Python frameworks for NLP

    Authors: John X. Morris, Jin Yong Yoo, Yanjun Qi

    Abstract: TextAttack is an open-source Python toolkit for adversarial attacks, adversarial training, and data augmentation in NLP. TextAttack unites 15+ papers from the NLP adversarial attack literature into a single framework, with many components reused across attacks. This framework allows both researchers and developers to test and study the weaknesses of their NLP models. To build such an open-source N… ▽ More

    Submitted 4 October, 2020; originally announced October 2020.

    Comments: 4 pages

  38. arXiv:2009.06368  [pdf, other]

    cs.CL cs.AI cs.CR cs.LG

    Searching for a Search Method: Benchmarking Search Algorithms for Generating NLP Adversarial Examples

    Authors: Jin Yong Yoo, John X. Morris, Eli Lifland, Yanjun Qi

    Abstract: We study the behavior of several black-box search algorithms used for generating adversarial examples for natural language processing (NLP) tasks. We perform a fine-grained analysis of three elements relevant to search: search algorithm, search space, and search budget. When new search algorithms are proposed in past work, the attack search space is often modified alongside the search algorithm. W… ▽ More

    Submitted 12 October, 2020; v1 submitted 9 September, 2020; originally announced September 2020.

    Comments: 14 pages, 5 figures, 4 tables; Accepted by EMNLP BlackBox NLP Workshop 2020 @ https://blackboxnlp.github.io/cfp.html

  39. arXiv:2007.04258  [pdf, other]

    eess.IV cs.CV

    Quantifying and Leveraging Predictive Uncertainty for Medical Image Assessment

    Authors: Florin C. Ghesu, Bogdan Georgescu, Awais Mansoor, Youngjin Yoo, Eli Gibson, R. S. Vishwanath, Abishek Balachandran, James M. Balter, Yue Cao, Ramandeep Singh, Subba R. Digumarthy, Mannudeep K. Kalra, Sasa Grbic, Dorin Comaniciu

    Abstract: The interpretation of medical images is a challenging task, often complicated by the presence of artifacts, occlusions, limited contrast and more. Most notable is the case of chest radiography, where there is a high inter-rater variability in the detection and classification of abnormalities. This is largely due to inconclusive evidence in the data or subjective definitions of disease appearance.… ▽ More

    Submitted 8 July, 2020; originally announced July 2020.

    Comments: Under review at Medical Image Analysis

  40. arXiv:2007.00992  [pdf, other]

    cs.CV

    Rethinking Channel Dimensions for Efficient Model Design

    Authors: Dongyoon Han, Sangdoo Yun, Byeongho Heo, YoungJoon Yoo

    Abstract: Designing an efficient model within the limited computational cost is challenging. We argue the accuracy of a lightweight model has been further limited by the design convention: a stage-wise configuration of the channel dimensions, which looks like a piecewise linear function of the network stage. In this paper, we study an effective channel dimension configuration towards better performance than… ▽ More

    Submitted 8 June, 2021; v1 submitted 2 July, 2020; originally announced July 2020.

    Comments: 13 pages, 8 figures, CVPR 2021

  41. arXiv:2006.11021  [pdf, other]

    eess.AS cs.LG

    Boosting Active Learning for Speech Recognition with Noisy Pseudo-labeled Samples

    Authors: Jihwan Bang, Heesu Kim, YoungJoon Yoo, Jung-Woo Ha

    Abstract: The cost of annotating transcriptions for large speech corpora becomes a bottleneck to maximally enjoy the potential capacity of deep neural network-based automatic speech recognition models. In this paper, we present a new training pipeline boosting the conventional active learning approach targeting label-efficient learning to resolve the mentioned problem. Existing active learning methods only… ▽ More

    Submitted 5 November, 2020; v1 submitted 19 June, 2020; originally announced June 2020.

    Comments: 8 pages, 4 figures, 2 tables

  42. arXiv:2006.09679  [pdf, other]

    cs.LG cs.CV stat.ML

    FrostNet: Towards Quantization-Aware Network Architecture Search

    Authors: Taehoon Kim, YoungJoon Yoo, Jihoon Yang

    Abstract: INT8 quantization has become one of the standard techniques for deploying convolutional neural networks (CNNs) on edge devices to reduce the memory and computational resource usages. By analyzing quantized performances of existing mobile-target network architectures, we can raise an issue regarding the importance of network architecture for optimal INT8 quantization. In this paper, we present a ne… ▽ More

    Submitted 30 November, 2020; v1 submitted 17 June, 2020; originally announced June 2020.

  43. arXiv:2006.04998  [pdf]

    eess.IV cs.CV cs.LG

    Machine Learning Automatically Detects COVID-19 using Chest CTs in a Large Multicenter Cohort

    Authors: Eduardo Jose Mortani Barbosa Jr., Bogdan Georgescu, Shikha Chaganti, Gorka Bastarrika Aleman, Jordi Broncano Cabrero, Guillaume Chabin, Thomas Flohr, Philippe Grenier, Sasa Grbic, Nakul Gupta, François Mellot, Savvas Nicolaou, Thomas Re, Pina Sanelli, Alexander W. Sauter, Youngjin Yoo, Valentin Ziebandt, Dorin Comaniciu

    Abstract: Objectives: To investigate machine-learning classifiers and interpretable models using chest CT for detection of COVID-19 and differentiation from other pneumonias, ILD and normal CTs. Methods: Our retrospective multi-institutional study obtained 2096 chest CTs from 16 institutions (including 1077 COVID-19 patients). Training/testing cohorts included 927/100 COVID-19, 388/33 ILD, 189/33 other pn…

    Submitted 9 October, 2020; v1 submitted 8 June, 2020; originally announced June 2020.

  44. arXiv:2005.05909  [pdf, other]

    cs.CL cs.AI cs.LG

    TextAttack: A Framework for Adversarial Attacks, Data Augmentation, and Adversarial Training in NLP

    Authors: John X. Morris, Eli Lifland, Jin Yong Yoo, Jake Grigsby, Di Jin, Yanjun Qi

    Abstract: While there has been substantial research using adversarial attacks to analyze NLP models, each attack is implemented in its own code repository. It remains challenging to develop NLP attacks and utilize them to improve model performance. This paper introduces TextAttack, a Python framework for adversarial attacks, data augmentation, and adversarial training in NLP. TextAttack builds attacks from…

    Submitted 4 October, 2020; v1 submitted 29 April, 2020; originally announced May 2020.

    Comments: 6 pages. More details are shared at https://github.com/QData/TextAttack

  45. arXiv:2005.01903  [pdf, other]

    eess.IV cs.CV

    3D Tomographic Pattern Synthesis for Enhancing the Quantification of COVID-19

    Authors: Siqi Liu, Bogdan Georgescu, Zhoubing Xu, Youngjin Yoo, Guillaume Chabin, Shikha Chaganti, Sasa Grbic, Sebastian Piat, Brian Teixeira, Abishek Balachandran, Vishwanath RS, Thomas Re, Dorin Comaniciu

    Abstract: The Coronavirus Disease (COVID-19) has affected 1.8 million people and resulted in more than 110,000 deaths as of April 12, 2020. Several studies have shown that tomographic patterns seen on chest Computed Tomography (CT), such as ground-glass opacities, consolidations, and crazy paving pattern, are correlated with the disease severity and progression. CT imaging can thus emerge as an important mo…

    Submitted 4 May, 2020; originally announced May 2020.

  46. Automated Quantification of CT Patterns Associated with COVID-19 from Chest CT

    Authors: Shikha Chaganti, Abishek Balachandran, Guillaume Chabin, Stuart Cohen, Thomas Flohr, Bogdan Georgescu, Philippe Grenier, Sasa Grbic, Siqi Liu, François Mellot, Nicolas Murray, Savvas Nicolaou, William Parker, Thomas Re, Pina Sanelli, Alexander W. Sauter, Zhoubing Xu, Youngjin Yoo, Valentin Ziebandt, Dorin Comaniciu

    Abstract: Purpose: To present a method that automatically segments and quantifies abnormal CT patterns commonly present in coronavirus disease 2019 (COVID-19), namely ground glass opacities and consolidations. Materials and Methods: In this retrospective study, the proposed method takes as input a non-contrasted chest CT and segments the lesions, lungs, and lobes in three dimensions, based on a dataset of 9…

    Submitted 18 November, 2020; v1 submitted 2 April, 2020; originally announced April 2020.

    Journal ref: Radiology: Artificial Intelligence, Vol. 2, No. 4, 2020

  47. arXiv:2003.03879  [pdf, other]

    cs.CV

    An Empirical Evaluation on Robustness and Uncertainty of Regularization Methods

    Authors: Sanghyuk Chun, Seong Joon Oh, Sangdoo Yun, Dongyoon Han, Junsuk Choe, Youngjoon Yoo

    Abstract: Despite apparent human-level performances of deep neural networks (DNN), they behave fundamentally differently from humans. They easily change predictions when small corruptions such as blur and noise are applied on the input (lack of robustness), and they often produce confident predictions on out-of-distribution samples (improper uncertainty measure). While a number of researches have aimed to a…

    Submitted 8 March, 2020; originally announced March 2020.

    Comments: Accepted at ICML 2019 Workshop on Uncertainty and Robustness in Deep Learning. 7 pages, 1 figure

  48. arXiv:1911.09099  [pdf, other]

    cs.CV

    SINet: Extreme Lightweight Portrait Segmentation Networks with Spatial Squeeze Modules and Information Blocking Decoder

    Authors: Hyojin Park, Lars Lowe Sjösund, YoungJoon Yoo, Nicolas Monet, Jihwan Bang, Nojun Kwak

    Abstract: Designing a lightweight and robust portrait segmentation algorithm is an important task for a wide range of face applications. However, the problem has been considered as a subset of the object segmentation problem and less handled in the semantic segmentation field. Obviously, portrait segmentation has its unique requirements. First, because the portrait segmentation is performed in the middle of…

    Submitted 9 February, 2020; v1 submitted 20 November, 2019; originally announced November 2019.

    Comments: https://github.com/HYOJINPARK/ExtPortraitSeg. arXiv admin note: text overlap with arXiv:1908.03093

  49. arXiv:1910.06705  [pdf, other]

    cs.LG cs.CV stat.ML

    Neural Approximation of an Auto-Regressive Process through Confidence Guided Sampling

    Authors: YoungJoon Yoo, Sanghyuk Chun, Sangdoo Yun, Jung-Woo Ha, Jaejun Yoo

    Abstract: We propose a generic confidence-based approximation that can be plugged in and simplify the auto-regressive generation process with a proved convergence. We first assume that the priors of future samples can be generated in an independently and identically distributed (i.i.d.) manner using an efficient predictor. Given the past samples and future priors, the mother AR model can post-process the pr…

    Submitted 15 October, 2019; originally announced October 2019.

  50. arXiv:1908.08204  [pdf, other]

    eess.IV cs.LG stat.ML

    Convolutional Recurrent Reconstructive Network for Spatiotemporal Anomaly Detection in Solder Paste Inspection

    Authors: Yong-Ho Yoo, Ue-Hwan Kim, Jong-Hwan Kim

    Abstract: Surface mount technology (SMT) is a process for producing printed circuit boards. Solder paste printer (SPP), package mounter, and solder reflow oven are used for SMT. The board on which the solder paste is deposited from the SPP is monitored by solder paste inspector (SPI). If SPP malfunctions due to the printer defects, the SPP produces defective products, and then abnormal patterns are detected…

    Submitted 22 August, 2019; originally announced August 2019.