Skip to main content

Showing 1–43 of 43 results for author: Jang, K

  1. arXiv:2407.09541  [pdf, other

    cs.CL cs.AI cs.CV

    MATE: Meet At The Embedding -- Connecting Images with Long Texts

    Authors: Young Kyun Jang, Junmo Kang, Yong Jae Lee, Donghyun Kim

    Abstract: While advancements in Vision Language Models (VLMs) have significantly improved the alignment of visual and textual data, these models primarily focus on aligning images with short descriptive captions. This focus limits their ability to handle complex text interactions, particularly with longer texts such as lengthy captions or documents, which have not been extensively explored yet. In this pape… ▽ More

    Submitted 26 June, 2024; originally announced July 2024.

  2. arXiv:2407.03563  [pdf, other

    eess.AS cs.CL cs.LG eess.IV

    Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition

    Authors: Sungnyun Kim, Kangwook Jang, Sangmin Bae, Hoirin Kim, Se-Young Yun

    Abstract: Audio-visual speech recognition (AVSR) aims to transcribe human speech using both audio and video modalities. In practical environments with noise-corrupted audio, the role of video information becomes crucial. However, prior works have primarily focused on enhancing audio features in AVSR, overlooking the importance of video features. In this study, we strengthen the video features by learning th… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  3. arXiv:2406.16716  [pdf, other

    eess.AS cs.CR cs.SD

    One-Class Learning with Adaptive Centroid Shift for Audio Deepfake Detection

    Authors: Hyun Myung Kim, Kangwook Jang, Hoirin Kim

    Abstract: As speech synthesis systems continue to make remarkable advances in recent years, the importance of robust deepfake detection systems that perform well in unseen systems has grown. In this paper, we propose a novel adaptive centroid shift (ACS) method that updates the centroid representation by continually shifting as the weighted average of bonafide representations. Our approach uses only bonafid… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  4. arXiv:2406.01192  [pdf, other

    cs.LG stat.ML

    Sparsity-Agnostic Linear Bandits with Adaptive Adversaries

    Authors: Tianyuan Jin, Kyoungseok Jang, Nicolò Cesa-Bianchi

    Abstract: We study stochastic linear bandits where, in each round, the learner receives a set of actions (i.e., feature vectors), from which it chooses an element and obtains a stochastic reward. The expected reward is a fixed but unknown linear function of the chosen action. We study sparse regret bounds, that depend on the number $S$ of non-zero coefficients in the linear reward function. Previous works f… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 25 pages

  5. arXiv:2406.00014  [pdf, other

    cs.DB cs.AI cs.CL cs.IR

    KU-DMIS at EHRSQL 2024:Generating SQL query via question templatization in EHR

    Authors: Hajung Kim, Chanhwi Kim, Hoonick Lee, Kyochul Jang, Jiwoo Lee, Kyungjae Lee, Gangwoo Kim, Jaewoo Kang

    Abstract: Transforming natural language questions into SQL queries is crucial for precise data retrieval from electronic health record (EHR) databases. A significant challenge in this process is detecting and rejecting unanswerable questions that request information beyond the database's scope or exceed the system's capabilities. In this paper, we introduce a novel text-to-SQL framework that robustly handle… ▽ More

    Submitted 19 June, 2024; v1 submitted 21 May, 2024; originally announced June 2024.

    Comments: Published at ClinicalNLP workshop @ NAACL 2024

  6. arXiv:2405.14726  [pdf, other

    cs.CV

    Distilling Vision-Language Pretraining for Efficient Cross-Modal Retrieval

    Authors: Young Kyun Jang, Donghyun Kim, Ser-nam Lim

    Abstract: ``Learning to hash'' is a practical solution for efficient retrieval, offering fast search speed and low storage cost. It is widely applied in various applications, such as image-text cross-modal search. In this paper, we explore the potential of enhancing the performance of learning to hash with the proliferation of powerful large pre-trained models, such as Vision-Language Pre-training (VLP) mod… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  7. arXiv:2405.14715  [pdf, other

    cs.CV cs.AI

    Towards Cross-modal Backward-compatible Representation Learning for Vision-Language Models

    Authors: Young Kyun Jang, Ser-nam Lim

    Abstract: Modern retrieval systems often struggle with upgrading to new and more powerful models due to the incompatibility of embeddings between the old and new models. This necessitates a costly process known as backfilling, which involves re-computing the embeddings for a large number of data samples. In vision, Backward-compatible Training (BT) has been proposed to ensure that the new model aligns with… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  8. arXiv:2405.00571  [pdf, other

    cs.CV cs.AI

    Spherical Linear Interpolation and Text-Anchoring for Zero-shot Composed Image Retrieval

    Authors: Young Kyun Jang, Dat Huynh, Ashish Shah, Wen-Kai Chen, Ser-Nam Lim

    Abstract: Composed Image Retrieval (CIR) is a complex task that retrieves images using a query, which is configured with an image and a caption that describes desired modifications to that image. Supervised CIR approaches have shown strong performance, but their reliance on expensive manually-annotated datasets restricts their scalability and broader applicability. To address these issues, previous studies… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  9. arXiv:2404.15516  [pdf, other

    cs.CV cs.AI

    Visual Delta Generator with Large Multi-modal Models for Semi-supervised Composed Image Retrieval

    Authors: Young Kyun Jang, Donghyun Kim, Zihang Meng, Dat Huynh, Ser-Nam Lim

    Abstract: Composed Image Retrieval (CIR) is a task that retrieves images similar to a query, based on a provided textual modification. Current techniques rely on supervised learning for CIR models using labeled triplets of the reference image, text, target image. These specific triplets are not as commonly available as simple image-text pairs, limiting the widespread use of CIR and its scalability. On the o… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 15 pages

  10. arXiv:2404.05726  [pdf, other

    cs.CV

    MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding

    Authors: Bo He, Hengduo Li, Young Kyun Jang, Menglin Jia, Xuefei Cao, Ashish Shah, Abhinav Shrivastava, Ser-Nam Lim

    Abstract: With the success of large language models (LLMs), integrating the vision model into LLMs to build vision-language foundation models has gained much more interest recently. However, existing LLM-based large multimodal models (e.g., Video-LLaMA, VideoChat) can only take in a limited number of frames for short video understanding. In this study, we mainly focus on designing an efficient and effective… ▽ More

    Submitted 24 April, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

    Comments: Accepted at CVPR 2024. Project Page https://boheumd.github.io/MA-LMM/

  11. arXiv:2402.17050  [pdf, other

    eess.SY cs.RO

    Reinforcement Learning Based Oscillation Dampening: Scaling up Single-Agent RL algorithms to a 100 AV highway field operational test

    Authors: Kathy Jang, Nathan Lichtlé, Eugene Vinitsky, Adit Shah, Matthew Bunting, Matthew Nice, Benedetto Piccoli, Benjamin Seibold, Daniel B. Work, Maria Laura Delle Monache, Jonathan Sprinkle, Jonathan W. Lee, Alexandre M. Bayen

    Abstract: In this article, we explore the technical details of the reinforcement learning (RL) algorithms that were deployed in the largest field test of automated vehicles designed to smooth traffic flow in history as of 2023, uncovering the challenges and breakthroughs that come with developing RL controllers for automated vehicles. We delve into the fundamental concepts behind RL algorithms and their app… ▽ More

    Submitted 14 May, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  12. arXiv:2402.11156  [pdf, other

    stat.ML cs.LG

    Efficient Low-Rank Matrix Estimation, Experimental Design, and Arm-Set-Dependent Low-Rank Bandits

    Authors: Kyoungseok Jang, Chicheng Zhang, Kwang-Sung Jun

    Abstract: We study low-rank matrix trace regression and the related problem of low-rank matrix bandits. Assuming access to the distribution of the covariates, we propose a novel low-rank matrix estimation method called LowPopArt and provide its recovery guarantee that depends on a novel quantity denoted by B(Q) that characterizes the hardness of the problem, where Q is the covariance matrix of the measureme… ▽ More

    Submitted 8 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

  13. arXiv:2402.10429  [pdf, ps, other

    stat.ML cs.LG

    Fixed Confidence Best Arm Identification in the Bayesian Setting

    Authors: Kyoungseok Jang, Junpei Komiyama, Kazutoshi Yamazaki

    Abstract: We consider the fixed-confidence best arm identification (FC-BAI) problem in the Bayesian setting. This problem aims to find the arm of the largest mean with a fixed confidence level when the bandit model has been sampled from the known prior. Most studies on the FC-BAI problem have been conducted in the frequentist setting, where the bandit model is predetermined before the game starts. We show t… ▽ More

    Submitted 22 June, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

  14. arXiv:2402.09201  [pdf, ps, other

    cs.LG stat.ML

    Better-than-KL PAC-Bayes Bounds

    Authors: Ilja Kuzborskij, Kwang-Sung Jun, Yulian Wu, Kyoungseok Jang, Francesco Orabona

    Abstract: Let $f(θ, X_1),$ $ \dots,$ $ f(θ, X_n)$ be a sequence of random elements, where $f$ is a fixed scalar function, $X_1, \dots, X_n$ are independent random variables (data), and $θ$ is a random parameter distributed according to some data-dependent posterior distribution $P_n$. In this paper, we consider the problem of proving concentration inequalities to estimate the mean of the sequence. An exampl… ▽ More

    Submitted 4 April, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  15. arXiv:2401.09666  [pdf, other

    eess.SY cs.AI cs.MA

    Traffic Smoothing Controllers for Autonomous Vehicles Using Deep Reinforcement Learning and Real-World Trajectory Data

    Authors: Nathan Lichtlé, Kathy Jang, Adit Shah, Eugene Vinitsky, Jonathan W. Lee, Alexandre M. Bayen

    Abstract: Designing traffic-smoothing cruise controllers that can be deployed onto autonomous vehicles is a key step towards improving traffic flow, reducing congestion, and enhancing fuel efficiency in mixed autonomy traffic. We bypass the common issue of having to carefully fine-tune a large traffic microsimulator by leveraging real-world trajectory data from the I-24 highway in Tennessee, replayed in a o… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: Accepted to be published as part of the 26th IEEE International Conference on Intelligent Transportation Systems (ITSC) 2023, Bilbao, Spain, September 24-28, 2023

  16. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  17. arXiv:2312.09040  [pdf, other

    cs.SD cs.CL eess.AS

    STaR: Distilling Speech Temporal Relation for Lightweight Speech Self-Supervised Learning Models

    Authors: Kangwook Jang, Sungnyun Kim, Hoirin Kim

    Abstract: Albeit great performance of Transformer-based speech selfsupervised learning (SSL) models, their large parameter size and computational cost make them unfavorable to utilize. In this study, we propose to compress the speech SSL models by distilling speech temporal relation (STaR). Unlike previous works that directly match the representation for each speech frame, STaR distillation transfers tempor… ▽ More

    Submitted 25 April, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: ICASSP 2024 Best Student Paper Awarded. Code URL: https://github.com/sungnyun/ARMHuBERT

  18. arXiv:2312.03777  [pdf, other

    cs.CV

    On the Robustness of Large Multimodal Models Against Image Adversarial Attacks

    Authors: Xuanming Cui, Alejandro Aparcedo, Young Kyun Jang, Ser-Nam Lim

    Abstract: Recent advances in instruction tuning have led to the development of State-of-the-Art Large Multimodal Models (LMMs). Given the novelty of these models, the impact of visual adversarial attacks on LMMs has not been thoroughly examined. We conduct a comprehensive study of the robustness of various LMMs against different adversarial attacks, evaluated across tasks including image classification, ima… ▽ More

    Submitted 8 December, 2023; v1 submitted 5 December, 2023; originally announced December 2023.

  19. arXiv:2308.14815  [pdf, other

    cs.AI cs.LG cs.RO

    Distributionally Robust Statistical Verification with Imprecise Neural Networks

    Authors: Souradeep Dutta, Michele Caprio, Vivian Lin, Matthew Cleaveland, Kuk Jin Jang, Ivan Ruchkin, Oleg Sokolsky, Insup Lee

    Abstract: A particularly challenging problem in AI safety is providing guarantees on the behavior of high-dimensional autonomous systems. Verification approaches centered around reachability analysis fail to scale, and purely statistical approaches are constrained by the distributional assumptions about the sampling process. Instead, we pose a distributionally robust version of the statistical verification… ▽ More

    Submitted 11 December, 2023; v1 submitted 28 August, 2023; originally announced August 2023.

  20. arXiv:2307.06816  [pdf, other

    cs.LG physics.data-an physics.flu-dyn

    Data-driven Nonlinear Parametric Model Order Reduction Framework using Deep Hierarchical Variational Autoencoder

    Authors: SiHun Lee, Sangmin Lee, Kijoo Jang, Haeseong Cho, SangJoon Shin

    Abstract: A data-driven parametric model order reduction (MOR) method using a deep artificial neural network is proposed. The present network, which is the least-squares hierarchical variational autoencoder (LSH-VAE), is capable of performing nonlinear MOR for the parametric interpolation of a nonlinear dynamic system with a significant number of degrees of freedom. LSH-VAE exploits two major changes to the… ▽ More

    Submitted 9 July, 2023; originally announced July 2023.

  21. arXiv:2306.04662  [pdf, other

    cs.LG cs.CY cs.HC cs.SI

    Understanding Place Identity with Generative AI

    Authors: Kee Moon Jang, Junda Chen, Yuhao Kang, Junghwan Kim, Jinhyung Lee, Fábio Duarte

    Abstract: Researchers are constantly leveraging new forms of data with the goal of understanding how people perceive the built environment and build the collective place identity of cities. Latest advancements in generative artificial intelligence (AI) models have enabled the production of realistic representations learned from vast amounts of data. In this study, we aim to test the potential of generative… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: 6 pages, 3 figures, GIScience 2023

  22. Recycle-and-Distill: Universal Compression Strategy for Transformer-based Speech SSL Models with Attention Map Reusing and Masking Distillation

    Authors: Kangwook Jang, Sungnyun Kim, Se-Young Yun, Hoirin Kim

    Abstract: Transformer-based speech self-supervised learning (SSL) models, such as HuBERT, show surprising performance in various speech processing tasks. However, huge number of parameters in speech SSL models necessitate the compression to a more compact model for wider usage in academia or small companies. In this study, we suggest to reuse attention maps across the Transformer layers, so as to remove key… ▽ More

    Submitted 26 October, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: Proceedings of Interspeech 2023. Code URL: https://github.com/sungnyun/ARMHuBERT

  23. arXiv:2302.10341  [pdf, other

    cs.LG cs.CV

    DC4L: Distribution Shift Recovery via Data-Driven Control for Deep Learning Models

    Authors: Vivian Lin, Kuk Jin Jang, Souradeep Dutta, Michele Caprio, Oleg Sokolsky, Insup Lee

    Abstract: Deep neural networks have repeatedly been shown to be non-robust to the uncertainties of the real world, even to naturally occurring ones. A vast majority of current approaches have focused on data-augmentation methods to expand the range of perturbations that the classifier is exposed to while training. A relatively unexplored avenue that is equally promising involves sanitizing an image as a pre… ▽ More

    Submitted 15 May, 2024; v1 submitted 20 February, 2023; originally announced February 2023.

  24. arXiv:2302.09656  [pdf, other

    cs.LG stat.ML

    Credal Bayesian Deep Learning

    Authors: Michele Caprio, Souradeep Dutta, Kuk Jin Jang, Vivian Lin, Radoslav Ivanov, Oleg Sokolsky, Insup Lee

    Abstract: Uncertainty quantification and robustness to distribution shifts are important goals in machine learning and artificial intelligence. Although Bayesian Neural Networks (BNNs) allow for uncertainty in the predictions to be assessed, different sources of uncertainty are indistinguishable. We present Credal Bayesian Deep Learning (CBDL). Heuristically, CBDL allows to train an (uncountably) infinite e… ▽ More

    Submitted 22 February, 2024; v1 submitted 19 February, 2023; originally announced February 2023.

    MSC Class: Primary: 68T37; Secondary: 68T05; 68W25

  25. arXiv:2302.05829  [pdf, other

    cs.LG stat.ML

    Tighter PAC-Bayes Bounds Through Coin-Betting

    Authors: Kyoungseok Jang, Kwang-Sung Jun, Ilja Kuzborskij, Francesco Orabona

    Abstract: We consider the problem of estimating the mean of a sequence of random elements $f(X_1, θ)$ $, \ldots, $ $f(X_n, θ)$ where $f$ is a fixed scalar function, $S=(X_1, \ldots, X_n)$ are independent random variables, and $θ$ is a possibly $S$-dependent parameter. An example of such a problem would be to estimate the generalization error of a neural network trained on $n$ examples where $f$ is a loss fu… ▽ More

    Submitted 11 February, 2023; originally announced February 2023.

  26. arXiv:2210.15345  [pdf, other

    stat.ML cs.LG

    PopArt: Efficient Sparse Regression and Experimental Design for Optimal Sparse Linear Bandits

    Authors: Kyoungseok Jang, Chicheng Zhang, Kwang-Sung Jun

    Abstract: In sparse linear bandits, a learning agent sequentially selects an action and receive reward feedback, and the reward function depends linearly on a few coordinates of the covariates of the actions. This has applications in many real-world sequential decision making problems. In this paper, we propose a simple and computationally efficient sparse linear estimation method called PopArt that enjoys… ▽ More

    Submitted 17 November, 2023; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: 10 pages, 1 figures, published in the 2022 Conference on Neural Information Processing Systems

  27. arXiv:2207.00555  [pdf, other

    eess.AS cs.CL cs.LG

    FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning

    Authors: Yeonghyeon Lee, Kangwook Jang, Jahyun Goo, Youngmoon Jung, Hoirin Kim

    Abstract: Large-scale speech self-supervised learning (SSL) has emerged to the main field of speech processing, however, the problem of computational cost arising from its vast size makes a high entry barrier to academia. In addition, existing distillation techniques of speech SSL models compress the model by reducing layers, which induces performance degradation in linguistic pattern recognition tasks such… ▽ More

    Submitted 1 July, 2022; originally announced July 2022.

    Comments: Accepted to Interspeech 2022

  28. arXiv:2112.08816  [pdf, other

    cs.CV cs.IR

    Deep Hash Distillation for Image Retrieval

    Authors: Young Kyun Jang, Geonmo Gu, Byungsoo Ko, Isaac Kang, Nam Ik Cho

    Abstract: In hash-based image retrieval systems, degraded or transformed inputs usually generate different codes from the original, deteriorating the retrieval accuracy. To mitigate this issue, data augmentation can be applied during training. However, even if augmented samples of an image are similar in real feature space, the quantization can scatter them far away in Hamming space. This results in represe… ▽ More

    Submitted 13 July, 2022; v1 submitted 16 December, 2021; originally announced December 2021.

    Comments: ECCV2022

  29. arXiv:2109.02244  [pdf, other

    cs.CV

    Self-supervised Product Quantization for Deep Unsupervised Image Retrieval

    Authors: Young Kyun Jang, Nam Ik Cho

    Abstract: Supervised deep learning-based hash and vector quantization are enabling fast and large-scale image retrieval systems. By fully exploiting label annotations, they are achieving outstanding retrieval performances compared to the conventional methods. However, it is painstaking to assign labels precisely for a vast amount of training data, and also, the annotation process is error-prone. To tackle t… ▽ More

    Submitted 12 January, 2022; v1 submitted 6 September, 2021; originally announced September 2021.

    Comments: ICCV 2021

  30. arXiv:2107.05025  [pdf, other

    cs.CV cs.IR

    Similarity Guided Deep Face Image Retrieval

    Authors: Young Kyun Jang, Nam Ik Cho

    Abstract: Face image retrieval, which searches for images of the same identity from the query input face image, is drawing more attention as the size of the image database increases rapidly. In order to conduct fast and accurate retrieval, a compact hash code-based methods have been proposed, and recently, deep face image hashing methods with supervised classification training have shown outstanding perform… ▽ More

    Submitted 11 July, 2021; originally announced July 2021.

    Comments: 10 pages, 9 figures

  31. arXiv:2104.07198  [pdf, other

    cs.CL cs.IR

    Ultra-High Dimensional Sparse Representations with Binarization for Efficient Text Retrieval

    Authors: Kyoung-Rok Jang, Junmo Kang, Giwon Hong, Sung-Hyon Myaeng, Joohee Park, Taewon Yoon, Heecheol Seo

    Abstract: The semantic matching capabilities of neural information retrieval can ameliorate synonymy and polysemy problems of symbolic approaches. However, neural models' dense representations are more suitable for re-ranking, due to their inefficiency. Sparse representations, either in symbolic or latent form, are more efficient with an inverted index. Taking the merits of the sparse and dense representati… ▽ More

    Submitted 15 October, 2021; v1 submitted 14 April, 2021; originally announced April 2021.

    Comments: To appear at EMNLP 2021

  32. arXiv:2104.05554  [pdf

    cs.LG

    On Analyzing Churn Prediction in Mobile Games

    Authors: Kihoon Jang, Junwhan Kim, Byunggu Yu

    Abstract: In subscription-based businesses, the churn rate refers to the percentage of customers who discontinue their subscriptions within a given time period. Particularly, in the mobile games industry, the churn rate is often pronounced due to the high competition and cost in customer acquisition; therefore, the process of minimizing the churn rate is crucial. This needs churn prediction, predicting user… ▽ More

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: 8 pages, 10 figures, 2021 6th International Conference on Machine Learning Technologies

    ACM Class: I.2.1

  33. arXiv:2101.10404  [pdf, other

    eess.SY cs.LG cs.RO

    Learning-'N-Flying: A Learning-based, Decentralized Mission Aware UAS Collision Avoidance Scheme

    Authors: Alëna Rodionova, Yash Vardhan Pant, Connor Kurtz, Kuk Jang, Houssam Abbas, Rahul Mangharam

    Abstract: Urban Air Mobility, the scenario where hundreds of manned and Unmanned Aircraft System (UAS) carry out a wide variety of missions (e.g. moving humans and goods within the city), is gaining acceptance as a transportation solution of the future. One of the key requirements for this to happen is safely managing the air traffic in these urban airspaces. Due to the expected density of the airspace, thi… ▽ More

    Submitted 25 January, 2021; originally announced January 2021.

    Comments: to be published in ACM Transactions on Cyber-Physical Systems. arXiv admin note: text overlap with arXiv:2006.13267

  34. arXiv:2010.06900  [pdf

    cs.CV

    Development of Open Informal Dataset Affecting Autonomous Driving

    Authors: Yong-Gu Lee, Seong-Jae Lee, Sang-Jin Lee, Tae-Seung Baek, Dong-Whan Lee, Kyeong-Chan Jang, Ho-Jin Sohn, Jin-Soo Kim

    Abstract: This document is a document that has written procedures and methods for collecting objects and unstructured dynamic data on the road for the development of object recognition technology for self-driving cars, and outlines the methods of collecting data, annotation data, object classifier criteria, and data processing methods. On-road object and unstructured dynamic data were collected in various e… ▽ More

    Submitted 14 October, 2020; originally announced October 2020.

    Comments: 26 pages, 16 figures

  35. arXiv:2008.01825  [pdf, other

    cs.LG cs.MA cs.RO stat.ML

    Robust Reinforcement Learning using Adversarial Populations

    Authors: Eugene Vinitsky, Yuqing Du, Kanaad Parvate, Kathy Jang, Pieter Abbeel, Alexandre Bayen

    Abstract: Reinforcement Learning (RL) is an effective tool for controller design but can struggle with issues of robustness, failing catastrophically when the underlying system dynamics are perturbed. The Robust RL formulation tackles this by adding worst-case adversarial noise to the dynamics and constructing the noise distribution as the solution to a zero-sum minimax game. However, existing work on learn… ▽ More

    Submitted 22 September, 2020; v1 submitted 4 August, 2020; originally announced August 2020.

  36. arXiv:2006.13267  [pdf, other

    eess.SY cs.LG cs.RO

    Learning-to-Fly: Learning-based Collision Avoidance for Scalable Urban Air Mobility

    Authors: Alëna Rodionova, Yash Vardhan Pant, Kuk Jang, Houssam Abbas, Rahul Mangharam

    Abstract: With increasing urban population, there is global interest in Urban Air Mobility (UAM), where hundreds of autonomous Unmanned Aircraft Systems (UAS) execute missions in the airspace above cities. Unlike traditional human-in-the-loop air traffic management, UAM requires decentralized autonomous approaches that scale for an order of magnitude higher aircraft densities and are applicable to urban set… ▽ More

    Submitted 23 June, 2020; originally announced June 2020.

    Comments: To be published in IEEE International Conference on Intelligent Transportation Systems (ITSC), 2020

  37. arXiv:2003.11583  [pdf, other

    physics.optics cs.ET physics.app-ph

    Nanophotonic spin-glass for realization of a coherent Ising machine

    Authors: Yoshitomo Okawachi, Mengjie Yu, Jae K. Jang, Xingchen Ji, Yun Zhao, Bok Young Kim, Michal Lipson, Alexander L. Gaeta

    Abstract: The need for solving optimization problems is prevalent in a wide range of physical applications, including neuroscience, network design, biological systems, socio-economics, and chemical reactions. Many of these are classified as non-deterministic polynomial-time (NP) hard and thus become intractable to solve as the system scales to a large number of elements. Recent research advances in photonic… ▽ More

    Submitted 25 March, 2020; originally announced March 2020.

    Comments: 8 pages, 6 figures

  38. arXiv:2002.11281  [pdf, other

    cs.CV

    Generalized Product Quantization Network for Semi-supervised Image Retrieval

    Authors: Young Kyun Jang, Nam Ik Cho

    Abstract: Image retrieval methods that employ hashing or vector quantization have achieved great success by taking advantage of deep learning. However, these approaches do not meet expectations unless expensive label information is sufficient. To resolve this issue, we propose the first quantization-based semi-supervised image retrieval scheme: Generalized Product Quantization (GPQ) network. We design a nov… ▽ More

    Submitted 11 June, 2020; v1 submitted 25 February, 2020; originally announced February 2020.

    Comments: 10 pages, 10 figures, Computer Vision and Pattern Recognition (CVPR) 2020 accpeted paper

  39. arXiv:1812.06120  [pdf, other

    eess.SY cs.AI cs.RO

    Simulation to Scaled City: Zero-Shot Policy Transfer for Traffic Control via Autonomous Vehicles

    Authors: Kathy Jang, Eugene Vinitsky, Behdad Chalaki, Ben Remer, Logan Beaver, Andreas Malikopoulos, Alexandre Bayen

    Abstract: Using deep reinforcement learning, we train control policies for autonomous vehicles leading a platoon of vehicles onto a roundabout. Using Flow, a library for deep reinforcement learning in micro-simulators, we train two policies, one policy with noise injected into the state and action space and one without any injected noise. In simulation, the autonomous vehicle learns an emergent metering beh… ▽ More

    Submitted 22 February, 2019; v1 submitted 14 December, 2018; originally announced December 2018.

    Comments: To be published at the International Conference on Cyber Physical Systems (ICCPS) 2019. 10 pages, 9 figures

    ACM Class: I.2.1; I.2.4; I.2.6; I.2.10; I.6.5

  40. arXiv:1810.02186  [pdf, other

    cs.DC

    OPERA: Reasoning about continuous common knowledge in asynchronous distributed systems

    Authors: Sang-Min Choi, Jiho Park, Quan Nguyen, Andre Cronje, Kiyoung Jang, Hyunjoon Cheon, Yo-Sub Han, Byung-Ik Ahn

    Abstract: This paper introduces a new family of consensus protocols, namely \emph{Lachesis-class} denoted by $\mathcal{L}$, for distributed networks with guaranteed Byzantine fault tolerance. Each Lachesis protocol $L$ in $\mathcal{L}$ has complete asynchrony, is leaderless, has no round robin, no proof-of-work, and has eventual consensus. The core concept of our technology is the \emph{OPERA chain}, gene… ▽ More

    Submitted 4 October, 2018; originally announced October 2018.

  41. arXiv:1610.04688  [pdf, other

    cs.NI

    ExpressPass: End-to-End Credit-based Congestion Control for Datacenters

    Authors: Inho Cho, Dongsu Han, Keon Jang

    Abstract: As link speeds increase in datacenter networks, existing congestion control algorithms become less effective in providing fast convergence. TCP-based algorithms that probe for bandwidth take a long time to reach the fair-share and lead to long flow completion times. An ideal congestion control algorithms for datacenter must provide 1) zero data loss, 2) fast convergence, and 3) low buffer occupanc… ▽ More

    Submitted 15 October, 2016; originally announced October 2016.

  42. arXiv:1411.3410  [pdf

    cs.CV

    Person Re-identification Based on Color Histogram and Spatial Configuration of Dominant Color Regions

    Authors: Kwangchol Jang, Sokmin Han, Insong Kim

    Abstract: There is a requirement to determine whether a given person of interest has already been observed over a network of cameras in video surveillance systems. A human appearance obtained in one camera is usually different from the ones obtained in another camera due to difference in illumination, pose and viewpoint, camera parameters. Being related to appearance-based approaches for person re-identific… ▽ More

    Submitted 12 November, 2014; originally announced November 2014.

    Comments: 12 pages, 6 figures

  43. Interference Alignment Through User Cooperation for Two-cell MIMO Interfering Broadcast Channels

    Authors: Wonjae Shin, Namyoon Lee, Jong-Bu Lim, Changyong Shin, Kyunghun Jang

    Abstract: This paper focuses on two-cell multiple-input multiple-output (MIMO) Gaussian interfering broadcast channels (MIMO-IFBC) with $K$ cooperating users on the cell-boundary of each BS. It corresponds to a downlink scenario for cellular networks with two base stations (BSs), and $K$ users equipped with Wi-Fi interfaces enabling to cooperate among users on a peer-to-peer basis. In this scenario, we prop… ▽ More

    Submitted 16 November, 2010; originally announced November 2010.

    Comments: This paper will appear in IEEE GLOBECOM 2010