Skip to main content

Showing 1–28 of 28 results for author: Ok, J

  1. arXiv:2406.04625  [pdf, other

    cs.CL cs.AI

    Key-Element-Informed sLLM Tuning for Document Summarization

    Authors: Sangwon Ryu, Heejin Do, Yunsu Kim, Gary Geunbae Lee, Jungseul Ok

    Abstract: Remarkable advances in large language models (LLMs) have enabled high-quality text summarization. However, this capability is currently accessible only through LLMs of substantial size or proprietary LLMs with usage fees. In response, smaller-scale LLMs (sLLMs) of easy accessibility and low costs have been extensively studied, yet they often suffer from missing key information and entities, i.e.,… ▽ More

    Submitted 25 June, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

    Comments: Interspeech 2024

  2. arXiv:2406.00303  [pdf, other

    cs.CL cs.AI

    Multi-Dimensional Optimization for Text Summarization via Reinforcement Learning

    Authors: Sangwon Ryu, Heejin Do, Yunsu Kim, Gary Geunbae Lee, Jungseul Ok

    Abstract: The evaluation of summary quality encompasses diverse dimensions such as consistency, coherence, relevance, and fluency. However, existing summarization methods often target a specific dimension, facing challenges in generating well-balanced summaries across multiple dimensions. In this paper, we propose multi-objective reinforcement learning tailored to generate balanced summaries across all four… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: ACL 2024

  3. arXiv:2404.01123  [pdf, other

    cs.CV cs.GR eess.IV

    CLIPtone: Unsupervised Learning for Text-based Image Tone Adjustment

    Authors: Hyeongmin Lee, Kyoungkook Kang, Jungseul Ok, Sunghyun Cho

    Abstract: Recent image tone adjustment (or enhancement) approaches have predominantly adopted supervised learning for learning human-centric perceptual assessment. However, these approaches are constrained by intrinsic challenges of supervised learning. Primarily, the requirement for expertly-curated or retouched images escalates the data acquisition expenses. Moreover, their coverage of target style is con… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  4. arXiv:2403.19326  [pdf, other

    cs.LG cs.CR cs.CV

    MedBN: Robust Test-Time Adaptation against Malicious Test Samples

    Authors: Hyejin Park, Jeongyeon Hwang, Sunung Mun, Sangdon Park, Jungseul Ok

    Abstract: Test-time adaptation (TTA) has emerged as a promising solution to address performance decay due to unforeseen distribution shifts between training and test data. While recent TTA methods excel in adapting to test data variations, such adaptability exposes a model to vulnerability against malicious examples, an aspect that has received limited attention. Previous studies have uncovered security vul… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024

  5. arXiv:2403.10820  [pdf, other

    cs.CV

    Active Label Correction for Semantic Segmentation with Foundation Models

    Authors: Hoyoung Kim, Sehyun Hwang, Suha Kwak, Jungseul Ok

    Abstract: Training and validating models for semantic segmentation require datasets with pixel-wise annotations, which are notoriously labor-intensive. Although useful priors such as foundation models or crowdsourced datasets are available, they are error-prone. We hence propose an effective framework of active label correction (ALC) based on a design of correction query to rectify pseudo labels of pixels,… ▽ More

    Submitted 4 June, 2024; v1 submitted 16 March, 2024; originally announced March 2024.

  6. arXiv:2309.09319  [pdf, other

    cs.CV cs.AI cs.LG

    Active Learning for Semantic Segmentation with Multi-class Label Query

    Authors: Sehyun Hwang, Sohyun Lee, Hoyoung Kim, Minhyeon Oh, Jungseul Ok, Suha Kwak

    Abstract: This paper proposes a new active learning method for semantic segmentation. The core of our method lies in a new annotation query design. It samples informative local image regions (e.g., superpixels), and for each of such regions, asks an oracle for a multi-hot vector indicating all classes existing in the region. This multi-class labeling strategy is substantially more efficient than existing on… ▽ More

    Submitted 6 November, 2023; v1 submitted 17 September, 2023; originally announced September 2023.

    Comments: NeurIPS 2023 accepted

    MSC Class: 68T07 ACM Class: I.2.10

  7. arXiv:2309.05287  [pdf, other

    cs.SD cs.AI eess.AS

    Addressing Feature Imbalance in Sound Source Separation

    Authors: Jaechang Kim, Jeongyeon Hwang, Soheun Yi, Jaewoong Cho, Jungseul Ok

    Abstract: Neural networks often suffer from a feature preference problem, where they tend to overly rely on specific features to solve a task while disregarding other features, even if those neglected features are essential for the task. Feature preference problems have primarily been investigated in classification task. However, we observe that feature preference occurs in high-dimensional regression task,… ▽ More

    Submitted 4 October, 2023; v1 submitted 11 September, 2023; originally announced September 2023.

  8. arXiv:2303.16817  [pdf, other

    cs.CV

    Adaptive Superpixel for Active Learning in Semantic Segmentation

    Authors: Hoyoung Kim, Minhyeon Oh, Sehyun Hwang, Suha Kwak, Jungseul Ok

    Abstract: Learning semantic segmentation requires pixel-wise annotations, which can be time-consuming and expensive. To reduce the annotation cost, we propose a superpixel-based active learning (AL) framework, which collects a dominant label per superpixel instead. To be specific, it consists of adaptive superpixel and sieving mechanisms, fully dedicated to AL. At each round of AL, we adaptively merge neigh… ▽ More

    Submitted 20 August, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

  9. arXiv:2208.06604  [pdf, other

    cs.LG cs.AI cs.CV

    Combating Label Distribution Shift for Active Domain Adaptation

    Authors: Sehyun Hwang, Sohyun Lee, Sungyeon Kim, Jungseul Ok, Suha Kwak

    Abstract: We consider the problem of active domain adaptation (ADA) to unlabeled target data, of which subset is actively selected and labeled given a budget constraint. Inspired by recent analysis on a critical issue from label distribution mismatch between source and target in domain adaptation, we devise a method that addresses the issue for the first time in ADA. At its heart lies a novel sampling strat… ▽ More

    Submitted 13 August, 2022; originally announced August 2022.

    Comments: ECCV 2022 accepted

    ACM Class: I.2.10

  10. arXiv:2208.05810  [pdf, other

    cs.CV cs.LG

    Towards Sequence-Level Training for Visual Tracking

    Authors: Minji Kim, Seungkwan Lee, Jungseul Ok, Bohyung Han, Minsu Cho

    Abstract: Despite the extensive adoption of machine learning on the task of visual object tracking, recent learning-based approaches have largely overlooked the fact that visual tracking is a sequence-level task in its nature; they rely heavily on frame-level training, which inevitably induces inconsistency between training and testing in terms of both data distributions and task objectives. This work intro… ▽ More

    Submitted 16 October, 2022; v1 submitted 11 August, 2022; originally announced August 2022.

    Comments: ECCV 2022

  11. arXiv:2206.00518  [pdf, other

    cs.LG cs.AI

    Efficient Scheduling of Data Augmentation for Deep Reinforcement Learning

    Authors: Byungchan Ko, Jungseul Ok

    Abstract: In deep reinforcement learning (RL), data augmentation is widely considered as a tool to induce a set of useful priors about semantic consistency and improve sample efficiency and generalization performance. However, even when the prior is useful for generalization, distilling it to RL agent often interferes with RL training and degenerates sample efficiency. Meanwhile, the agent is forgetful of t… ▽ More

    Submitted 1 March, 2023; v1 submitted 1 June, 2022; originally announced June 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2102.08581

    Journal ref: Neurips2022

  12. arXiv:2205.15567  [pdf, ps, other

    cs.LG

    Few-Shot Unlearning by Model Inversion

    Authors: Youngsik Yoon, Jinhwan Nam, Hyojeong Yun, Jaeho Lee, Dongwoo Kim, Jungseul Ok

    Abstract: We consider a practical scenario of machine unlearning to erase a target dataset, which causes unexpected behavior from the trained model. The target dataset is often assumed to be fully identifiable in a standard unlearning scenario. Such a flawless identification, however, is almost impossible if the training dataset is inaccessible at the time of unlearning. Unlike previous approaches requiring… ▽ More

    Submitted 14 March, 2023; v1 submitted 31 May, 2022; originally announced May 2022.

  13. arXiv:2205.15271  [pdf, other

    cs.LG eess.SP

    MetaSSD: Meta-Learned Self-Supervised Detection

    Authors: Moon Jeong Park, Jungseul Ok, Yo-Seb Jeon, Dongwoo Kim

    Abstract: Deep learning-based symbol detector gains increasing attention due to the simple algorithm design than the traditional model-based algorithms such as Viterbi and BCJR. The supervised learning framework is often employed to predict the input symbols, where training symbols are used to train the model. There are two major limitations in the supervised approaches: a) a model needs to be retrained fro… ▽ More

    Submitted 30 May, 2022; originally announced May 2022.

    Comments: Accepted by ISIT 2022

  14. arXiv:2111.00734  [pdf, other

    cs.LG cs.AI stat.ML

    Robust Deep Learning from Crowds with Belief Propagation

    Authors: Hoyoung Kim, Seunghyuk Cho, Dongwoo Kim, Jungseul Ok

    Abstract: Crowdsourcing systems enable us to collect large-scale dataset, but inherently suffer from noisy labels of low-paid workers. We address the inference and learning problems using such a crowdsourced dataset with noise. Due to the nature of sparsity in crowdsourcing, it is critical to exploit both probabilistic model to capture worker prior and neural network to extract task feature despite risks fr… ▽ More

    Submitted 24 February, 2022; v1 submitted 1 November, 2021; originally announced November 2021.

  15. arXiv:2111.00195  [pdf, other

    cs.SD cs.LG eess.AS

    Learning Continuous Representation of Audio for Arbitrary Scale Super Resolution

    Authors: Jaechang Kim, Yunjoo Lee, Seunghoon Hong, Jungseul Ok

    Abstract: Audio super resolution aims to predict the missing high resolution components of the low resolution audio signals. While audio in nature is a continuous signal, current approaches treat it as discrete data (i.e., input is defined on discrete time domain), and consider the super resolution over a fixed scale factor (i.e., it is required to train a new neural network to change output resolution). To… ▽ More

    Submitted 30 March, 2022; v1 submitted 30 October, 2021; originally announced November 2021.

    Comments: Accepted by ICASSP 2022. The source code is available at https://github.com/ml-postech/LISA

  16. arXiv:2110.14962  [pdf, other

    cs.LG

    Gradient Inversion with Generative Image Prior

    Authors: Jinwoo Jeon, Jaechang Kim, Kangwook Lee, Sewoong Oh, Jungseul Ok

    Abstract: Federated Learning (FL) is a distributed learning framework, in which the local data never leaves clients devices to preserve privacy, and the server trains models on the data via accessing only the gradients of those local data. Without further privacy mechanisms such as differential privacy, this leaves the system vulnerable against an attacker who inverts those gradients to reveal clients sensi… ▽ More

    Submitted 28 October, 2021; originally announced October 2021.

    Comments: Accepted to NeurIPS 2021

  17. arXiv:2110.12160  [pdf, ps, other

    cs.LG cs.GT stat.ML

    Multi-armed Bandit Algorithm against Strategic Replication

    Authors: Suho Shin, Seungjoon Lee, Jungseul Ok

    Abstract: We consider a multi-armed bandit problem in which a set of arms is registered by each agent, and the agent receives reward when its arm is selected. An agent might strategically submit more arms with replications, which can bring more reward by abusing the bandit algorithm's exploration-exploitation balance. Our analysis reveals that a standard algorithm indeed fails at preventing replication and… ▽ More

    Submitted 23 October, 2021; originally announced October 2021.

  18. arXiv:2102.08581  [pdf, other

    cs.LG cs.AI

    Efficient Scheduling of Data Augmentation for Deep Reinforcement Learning

    Authors: Byungchan Ko, Jungseul Ok

    Abstract: In deep reinforcement learning (RL), data augmentation is widely considered as a tool to induce a set of useful priors about semantic consistency and improve sample efficiency and generalization performance. However, even when the prior is useful for generalization, distilling it to RL agent often interferes with RL training and degenerates sample efficiency. Meanwhile, the agent is forgetful of t… ▽ More

    Submitted 18 October, 2022; v1 submitted 17 February, 2021; originally announced February 2021.

    Journal ref: Neurips 2022

  19. arXiv:2102.02472  [pdf, other

    cs.LG cs.AI stat.ML

    Transfer Learning in Bandits with Latent Continuity

    Authors: Hyejin Park, Seiyun Shin, Kwang-Sung Jun, Jungseul Ok

    Abstract: Structured stochastic multi-armed bandits provide accelerated regret rates over the standard unstructured bandit problems. Most structured bandits, however, assume the knowledge of the structural parameter such as Lipschitz continuity, which is often not available. To cope with the latent structural parameter, we consider a transfer learning setting in which an agent must learn to transfer the str… ▽ More

    Submitted 25 June, 2021; v1 submitted 4 February, 2021; originally announced February 2021.

  20. arXiv:1910.06002  [pdf, other

    stat.ML cs.LG

    Optimal Clustering from Noisy Binary Feedback

    Authors: Kaito Ariu, Jungseul Ok, Alexandre Proutiere, Se-Young Yun

    Abstract: We study the problem of clustering a set of items from binary user feedback. Such a problem arises in crowdsourcing platforms solving large-scale labeling tasks with minimal effort put on the users. For example, in some of the recent reCAPTCHA systems, users clicks (binary answers) can be used to efficiently label images. In our inference problem, items are grouped into initially unknown non-overl… ▽ More

    Submitted 5 February, 2024; v1 submitted 14 October, 2019; originally announced October 2019.

  21. arXiv:1806.00775  [pdf, other

    cs.LG stat.ML

    Exploration in Structured Reinforcement Learning

    Authors: Jungseul Ok, Alexandre Proutiere, Damianos Tranos

    Abstract: We address reinforcement learning problems with finite state and action spaces where the underlying MDP has some known structure that could be potentially exploited to minimize the exploration rates of suboptimal (state, action) pairs. For any arbitrary structure, we derive problem-specific regret lower bounds satisfied by any learning algorithm. These lower bounds are made explicit for unstructur… ▽ More

    Submitted 29 November, 2018; v1 submitted 3 June, 2018; originally announced June 2018.

  22. arXiv:1805.01685  [pdf, ps, other

    cs.LG stat.ML

    Combinatorial Pure Exploration with Continuous and Separable Reward Functions and Its Applications (Extended Version)

    Authors: Weiran Huang, Jungseul Ok, Liang Li, Wei Chen

    Abstract: We study the Combinatorial Pure Exploration problem with Continuous and Separable reward functions (CPE-CS) in the stochastic multi-armed bandit setting. In a CPE-CS instance, we are given several stochastic arms with unknown distributions, as well as a collection of possible decisions. Each decision has a reward according to the distributions of arms. The goal is to identify the decision with the… ▽ More

    Submitted 4 May, 2018; originally announced May 2018.

    Comments: conference version accepted by IJCAI-ECAI-18

  23. arXiv:1804.03178  [pdf, ps, other

    cs.GT

    Power of Bonus in Pricing for Crowdsourcing

    Authors: Suho Shin, Hoyong Choi, Yung Yi, Jungseul Ok

    Abstract: We consider a simple form of pricing for a crowdsourcing system, where pricing policy is published a priori, and workers then decide their task acceptance. Such a pricing form is widely adopted in practice for its simplicity, e.g., Amazon Mechanical Turk, although additional sophistication to pricing rule can enhance budget efficiency. With the goal of designing efficient and simple pricing rules,… ▽ More

    Submitted 26 October, 2021; v1 submitted 9 April, 2018; originally announced April 2018.

  24. arXiv:1702.08840  [pdf, other

    cs.LG stat.ML

    Iterative Bayesian Learning for Crowdsourced Regression

    Authors: Jungseul Ok, Sewoong Oh, Yunhun Jang, Jinwoo Shin, Yung Yi

    Abstract: Crowdsourcing platforms emerged as popular venues for purchasing human intelligence at low cost for large volume of tasks. As many low-paid workers are prone to give noisy answers, a common practice is to add redundancy by assigning multiple workers to each task and then simply average out these answers. However, to fully harness the wisdom of the crowd, one needs to learn the heterogeneous qualit… ▽ More

    Submitted 8 October, 2018; v1 submitted 28 February, 2017; originally announced February 2017.

  25. arXiv:1602.03619  [pdf, other

    cs.LG stat.ML

    Optimal Inference in Crowdsourced Classification via Belief Propagation

    Authors: Jungseul Ok, Sewoong Oh, Jinwoo Shin, Yung Yi

    Abstract: Crowdsourcing systems are popular for solving large-scale labelling tasks with low-paid workers. We study the problem of recovering the true labels from the possibly erroneous crowdsourced labels under the popular Dawid-Skene model. To address this inference problem, several algorithms have recently been proposed, but the best known guarantee is still significantly larger than the fundamental limi… ▽ More

    Submitted 11 January, 2017; v1 submitted 11 February, 2016; originally announced February 2016.

    Comments: This article is partially based on preliminary results published in the proceeding of the 33rd International Conference on Machine Learning (ICML 2016)

  26. arXiv:1407.0454  [pdf, other

    cs.DB

    AsterixDB: A Scalable, Open Source BDMS

    Authors: Sattam Alsubaiee, Yasser Altowim, Hotham Altwaijry, Alexander Behm, Vinayak Borkar, Yingyi Bu, Michael Carey, Inci Cetindil, Madhusudan Cheelangi, Khurram Faraaz, Eugenia Gabrielova, Raman Grover, Zachary Heilbron, Young-Seok Kim, Chen Li, Guangqiang Li, Ji Mahn Ok, Nicola Onose, Pouria Pirzadeh, Vassilis Tsotras, Rares Vernica, Jian Wen, Till Westmann

    Abstract: AsterixDB is a new, full-function BDMS (Big Data Management System) with a feature set that distinguishes it from other platforms in today's open source Big Data ecosystem. Its features make it well-suited to applications like web data warehousing, social data storage and analysis, and other use cases related to Big Data. AsterixDB has a flexible NoSQL style data model; a query language that suppo… ▽ More

    Submitted 2 July, 2014; originally announced July 2014.

  27. arXiv:1307.7309  [pdf, other

    cs.NI cs.IT

    Optimal Rate Sampling in 802.11 Systems

    Authors: Richard Combes, Alexandre Proutiere, Donggyu Yun, Jungseul Ok, Yung Yi

    Abstract: In 802.11 systems, Rate Adaptation (RA) is a fundamental mechanism allowing transmitters to adapt the coding and modulation scheme as well as the MIMO transmission mode to the radio channel conditions, and in turn, to learn and track the (mode, rate) pair providing the highest throughput. So far, the design of RA mechanisms has been mainly driven by heuristics. In contrast, in this paper, we rigor… ▽ More

    Submitted 20 September, 2013; v1 submitted 27 July, 2013; originally announced July 2013.

    Comments: 52 pages

  28. arXiv:1207.1878  [pdf, ps, other

    cs.NI

    Embedding of Virtual Network Requests over Static Wireless Multihop Networks

    Authors: Donggyu Yun, Jungseul Ok, Bongjhin Shin, Soobum Park, Yung Yi

    Abstract: Network virtualization is a technology of running multiple heterogeneous network architecture on a shared substrate network. One of the crucial components in network virtualization is virtual network embedding, which provides a way to allocate physical network resources (CPU and link bandwidth) to virtual network requests. Despite significant research efforts on virtual network embedding in wired… ▽ More

    Submitted 8 July, 2012; originally announced July 2012.

    Comments: 22 pages