Skip to main content

Showing 1–35 of 35 results for author: Ham, B

  1. arXiv:2407.08113  [pdf, other

    cs.CV

    FYI: Flip Your Images for Dataset Distillation

    Authors: Byunggwan Son, Youngmin Oh, Donghyeon Baek, Bumsub Ham

    Abstract: Dataset distillation synthesizes a small set of images from a large-scale real dataset such that synthetic and real images share similar behavioral properties (e.g, distributions of gradients or features) during a training process. Through extensive analyses on current methods and real datasets, together with empirical observations, we provide in this paper two important things to share for datase… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: Accepted to ECCV 2024

  2. arXiv:2404.19248  [pdf, other

    cs.CV

    Transition Rate Scheduling for Quantization-Aware Training

    Authors: Junghyup lee, Dohyung Kim, Jeimin Jeon, Bumsub Ham

    Abstract: Quantization-aware training (QAT) simulates a quantization process during training to lower bit-precision of weights/activations. It learns quantized weights indirectly by updating latent weights, i.e., full-precision inputs to a quantizer, using gradient-based optimizers. We claim that coupling a user-defined learning rate (LR) with these optimizers is sub-optimal for QAT. Quantized weights trans… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: Submitted to IEEE TPAMI on Apr. 03, 2023

  3. arXiv:2404.00928  [pdf, other

    cs.CV cs.LG

    Instance-Aware Group Quantization for Vision Transformers

    Authors: Jaehyeon Moon, Dohyung Kim, Junyong Cheon, Bumsub Ham

    Abstract: Post-training quantization (PTQ) is an efficient model compression technique that quantizes a pretrained full-precision model using only a small calibration set of unlabeled samples without retraining. PTQ methods for convolutional neural networks (CNNs) provide quantization results comparable to full-precision counterparts. Directly applying them to vision transformers (ViTs), however, incurs sev… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: CVPR 2024

  4. arXiv:2403.19232  [pdf, other

    cs.CV cs.LG

    AZ-NAS: Assembling Zero-Cost Proxies for Network Architecture Search

    Authors: Junghyup Lee, Bumsub Ham

    Abstract: Training-free network architecture search (NAS) aims to discover high-performing networks with zero-cost proxies, capturing network characteristics related to the final performance. However, network rankings estimated by previous training-free NAS methods have shown weak correlations with the performance. To address this issue, we propose AZ-NAS, a novel approach that leverages the ensemble of var… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: CVPR 2024

  5. arXiv:2308.11990  [pdf, other

    cs.CV

    RankMixup: Ranking-Based Mixup Training for Network Calibration

    Authors: Jongyoun Noh, Hyekang Park, Junghyup Lee, Bumsub Ham

    Abstract: Network calibration aims to accurately estimate the level of confidences, which is particularly important for employing deep neural networks in real-world systems. Recent approaches leverage mixup to calibrate the network's predictions during training. However, they do not consider the problem that mixtures of labels in mixup may not accurately represent the actual distribution of augmented sample… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: Accepted at ICCV 2023

  6. arXiv:2308.11911  [pdf, other

    cs.CV

    ACLS: Adaptive and Conditional Label Smoothing for Network Calibration

    Authors: Hyekang Park, Jongyoun Noh, Youngmin Oh, Donghyeon Baek, Bumsub Ham

    Abstract: We address the problem of network calibration adjusting miscalibrated confidences of deep neural networks. Many approaches to network calibration adopt a regularization-based method that exploits a regularization term to smooth the miscalibrated confidences. Although these approaches have shown the effectiveness on calibrating the networks, there is still a lack of understanding on the underlying… ▽ More

    Submitted 24 August, 2023; v1 submitted 23 August, 2023; originally announced August 2023.

    Comments: Accepted to ICCV 2023 (Oral presentation)

  7. arXiv:2308.11901  [pdf, other

    cs.CV

    Camera-Driven Representation Learning for Unsupervised Domain Adaptive Person Re-identification

    Authors: Geon Lee, Sanghoon Lee, Dohyung Kim, Younghoon Shin, Yongsang Yoon, Bumsub Ham

    Abstract: We present a novel unsupervised domain adaption method for person re-identification (reID) that generalizes a model trained on a labeled source domain to an unlabeled target domain. We introduce a camera-driven curriculum learning (CaCL) framework that leverages camera labels of person images to transfer knowledge from source to target domains progressively. To this end, we divide target domain da… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: Accepted to ICCV 2023

  8. arXiv:2307.08169  [pdf, other

    cs.LG cs.HC

    Discovering User Types: Mapping User Traits by Task-Specific Behaviors in Reinforcement Learning

    Authors: L. L. Ankile, B. S. Ham, K. Mao, E. Shin, S. Swaroop, F. Doshi-Velez, W. Pan

    Abstract: When assisting human users in reinforcement learning (RL), we can represent users as RL agents and study key parameters, called \emph{user traits}, to inform intervention design. We study the relationship between user behaviors (policy classes) and user traits. Given an environment, we introduce an intuitive tool for studying the breakdown of "user types": broad sets of traits that result in the s… ▽ More

    Submitted 16 July, 2023; originally announced July 2023.

  9. arXiv:2210.06816  [pdf, other

    cs.CV

    ALIFE: Adaptive Logit Regularizer and Feature Replay for Incremental Semantic Segmentation

    Authors: Youngmin Oh, Donghyeon Baek, Bumsub Ham

    Abstract: We address the problem of incremental semantic segmentation (ISS) recognizing novel object/stuff categories continually without forgetting previous ones that have been learned. The catastrophic forgetting problem is particularly severe in ISS, since pixel-level ground-truth labels are available only for the novel categories at training time. To address the problem, regularization-based methods exp… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: Accepted to NeurIPS 2022

  10. arXiv:2210.05941  [pdf, other

    cs.CV

    Decomposed Knowledge Distillation for Class-Incremental Semantic Segmentation

    Authors: Donghyeon Baek, Youngmin Oh, Sanghoon Lee, Junghyup Lee, Bumsub Ham

    Abstract: Class-incremental semantic segmentation (CISS) labels each pixel of an image with a corresponding object/stuff class continually. To this end, it is crucial to learn novel classes incrementally without forgetting previously learned knowledge. Current CISS methods typically use a knowledge distillation (KD) technique for preserving classifier logits, or freeze a feature extractor, to avoid the forg… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: Accepted to NeurIPS 2022

  11. arXiv:2207.10892  [pdf, other

    cs.CV

    Bi-directional Contrastive Learning for Domain Adaptive Semantic Segmentation

    Authors: Geon Lee, Chanho Eom, Wonkyung Lee, Hyekang Park, Bumsub Ham

    Abstract: We present a novel unsupervised domain adaptation method for semantic segmentation that generalizes a model trained with source images and corresponding ground-truth labels to a target domain. A key to domain adaptive semantic segmentation is to learn domain-invariant and discriminative features without target ground-truth labels. To this end, we propose a bi-directional pixel-prototype contrastiv… ▽ More

    Submitted 22 July, 2022; originally announced July 2022.

    Comments: Accepted to ECCV 2022

  12. arXiv:2207.10320  [pdf, other

    cs.CV

    OIMNet++: Prototypical Normalization and Localization-aware Learning for Person Search

    Authors: Sanghoon Lee, Youngmin Oh, Donghyeon Baek, Junghyup Lee, Bumsub Ham

    Abstract: We address the task of person search, that is, localizing and re-identifying query persons from a set of raw scene images. Recent approaches are typically built upon OIMNet, a pioneer work on person search, that learns joint person representations for performing both detection and person re-identification (reID) tasks. To obtain the representations, they extract features from pedestrian proposals,… ▽ More

    Submitted 21 July, 2022; originally announced July 2022.

    Comments: Accepted to ECCV 2022

  13. arXiv:2108.09039  [pdf, other

    cs.CV

    Video-based Person Re-identification with Spatial and Temporal Memory Networks

    Authors: Chanho Eom, Geon Lee, Junghyup Lee, Bumsub Ham

    Abstract: Video-based person re-identification (reID) aims to retrieve person videos with the same identity as a query person across multiple cameras. Spatial and temporal distractors in person videos, such as background clutter and partial occlusions over frames, respectively, make this task much more challenging than image-based person reID. We observe that spatial distractors appear consistently in a par… ▽ More

    Submitted 20 August, 2021; originally announced August 2021.

    Comments: International Conference on Computer Vision (ICCV) 2021

  14. arXiv:2108.07422  [pdf, other

    cs.CV

    Learning by Aligning: Visible-Infrared Person Re-identification using Cross-Modal Correspondences

    Authors: Hyunjong Park, Sanghoon Lee, Junghyup Lee, Bumsub Ham

    Abstract: We address the problem of visible-infrared person re-identification (VI-reID), that is, retrieving a set of person images, captured by visible or infrared cameras, in a cross-modal setting. Two main challenges in VI-reID are intra-class variations across person images, and cross-modal discrepancies between visible and infrared images. Assuming that the person images are roughly aligned, previous a… ▽ More

    Submitted 16 August, 2021; originally announced August 2021.

    Comments: iccv 2021

  15. arXiv:2108.06983  [pdf, other

    cs.CV

    Distance-aware Quantization

    Authors: Dohyung kim, Junghyup Lee, Bumsub Ham

    Abstract: We address the problem of network quantization, that is, reducing bit-widths of weights and/or activations to lighten network architectures. Quantization methods use a rounding function to map full-precision values to the nearest quantized ones, but this operation is not differentiable. There are mainly two approaches to training quantized networks with gradient-based optimizers. First, a straight… ▽ More

    Submitted 16 August, 2021; originally announced August 2021.

    Comments: ICCV2021

  16. arXiv:2108.06536  [pdf, other

    cs.CV

    Exploiting a Joint Embedding Space for Generalized Zero-Shot Semantic Segmentation

    Authors: Donghyeon Baek, Youngmin Oh, Bumsub Ham

    Abstract: We address the problem of generalized zero-shot semantic segmentation (GZS3) predicting pixel-wise semantic labels for seen and unseen classes. Most GZS3 methods adopt a generative approach that synthesizes visual features of unseen classes from corresponding semantic ones (e.g., word2vec) to train novel classifiers for both seen and unseen classes. Although generative methods show decent performa… ▽ More

    Submitted 14 August, 2021; originally announced August 2021.

    Comments: Accepted to ICCV 2021

  17. arXiv:2104.00905  [pdf, other

    cs.CV

    Background-Aware Pooling and Noise-Aware Loss for Weakly-Supervised Semantic Segmentation

    Authors: Youngmin Oh, Beomjun Kim, Bumsub Ham

    Abstract: We address the problem of weakly-supervised semantic segmentation (WSSS) using bounding box annotations. Although object bounding boxes are good indicators to segment corresponding objects, they do not specify object boundaries, making it hard to train convolutional neural networks (CNNs) for semantic segmentation. We find that background regions are perceptually consistent in part within an image… ▽ More

    Submitted 2 April, 2021; originally announced April 2021.

    Comments: Accepted to CVPR 2021

  18. arXiv:2104.00903  [pdf, other

    cs.CV

    Network Quantization with Element-wise Gradient Scaling

    Authors: Junghyup Lee, Dohyung Kim, Bumsub Ham

    Abstract: Network quantization aims at reducing bit-widths of weights and/or activations, particularly important for implementing deep neural networks with limited hardware resources. Most methods use the straight-through estimator (STE) to train quantized networks, which avoids a zero-gradient problem by replacing a derivative of a discretizer (i.e., a round function) with that of an identity function. Alt… ▽ More

    Submitted 2 April, 2021; originally announced April 2021.

    Comments: Accepted to CVPR 2021

  19. arXiv:2104.00902  [pdf, other

    cs.CV

    HVPR: Hybrid Voxel-Point Representation for Single-stage 3D Object Detection

    Authors: Jongyoun Noh, Sanghoon Lee, Bumsub Ham

    Abstract: We address the problem of 3D object detection, that is, estimating 3D object bounding boxes from point clouds. 3D object detection methods exploit either voxel-based or point-based features to represent 3D objects in a scene. Voxel-based features are efficient to extract, while they fail to preserve fine-grained 3D structures of objects. Point-based features, on the other hand, represent the 3D st… ▽ More

    Submitted 2 April, 2021; originally announced April 2021.

    Comments: Accepted to CVPR 2021

  20. arXiv:2008.05833  [pdf

    quant-ph cs.CR

    Experimental demonstrations of unconditional security in a purely classical regime

    Authors: Byoung S. Ham

    Abstract: So far, unconditional security in key distribution processes has been confined to quantum key distribution (QKD) protocols based on the no-cloning theorem of nonorthogonal bases. Recently, a completely different approach, the unconditionally secured classical key distribution (USCKD), has been proposed for unconditional security in the purely classical regime. Unlike QKD, both classical channels a… ▽ More

    Submitted 13 August, 2020; originally announced August 2020.

    Comments: 8 pages, 4 figures, 1 table

  21. arXiv:2007.07524  [pdf, other

    cs.CV

    Learning with Privileged Information for Efficient Image Super-Resolution

    Authors: Wonkyung Lee, Junghyup Lee, Dohyung Kim, Bumsub Ham

    Abstract: Convolutional neural networks (CNNs) have allowed remarkable advances in single image super-resolution (SISR) over the last decade. Most SR methods based on CNNs have focused on achieving performance gains in terms of quality metrics, such as PSNR and SSIM, over classical approaches. They typically require a large amount of memory and computational units. FSRCNN, consisting of few numbers of convo… ▽ More

    Submitted 15 July, 2020; originally announced July 2020.

    Comments: ECCV-2020

  22. arXiv:2003.13228  [pdf, other

    cs.CV

    Learning Memory-guided Normality for Anomaly Detection

    Authors: Hyunjong Park, Jongyoun Noh, Bumsub Ham

    Abstract: We address the problem of anomaly detection, that is, detecting anomalous events in a video sequence. Anomaly detection methods based on convolutional neural networks (CNNs) typically leverage proxy tasks, such as reconstructing input video frames, to learn models describing normality without seeing anomalous samples at training time, and quantify the extent of abnormalities using the reconstructi… ▽ More

    Submitted 30 March, 2020; originally announced March 2020.

    Comments: Accepted to CVPR 2020

  23. arXiv:1911.12914  [pdf, other

    cs.CV

    Learning Semantic Correspondence Exploiting an Object-level Prior

    Authors: Junghyup Lee, Dohyung Kim, Wonkyung Lee, Jean Ponce, Bumsub Ham

    Abstract: We address the problem of semantic correspondence, that is, establishing a dense flow field between images depicting different instances of the same object or scene category. We propose to use images annotated with binary foreground masks and subjected to synthetic geometric deformations to train a convolutional neural network (CNN) for this task. Using these masks as part of the supervisory signa… ▽ More

    Submitted 21 July, 2020; v1 submitted 28 November, 2019; originally announced November 2019.

    Comments: Accepted to TPAMI. arXiv admin note: substantial text overlap with arXiv:1904.01810

  24. arXiv:1911.09318  [pdf, other

    cs.CV

    Relation Network for Person Re-identification

    Authors: Hyunjong Park, Bumsub Ham

    Abstract: Person re-identification (reID) aims at retrieving an image of the person of interest from a set of images typically captured by multiple cameras. Recent reID methods have shown that exploiting local features describing body parts, together with a global feature of a person image itself, gives robust feature representations, even in the case of missing body parts. However, using the individual par… ▽ More

    Submitted 25 November, 2019; v1 submitted 21 November, 2019; originally announced November 2019.

    Comments: Accepted by AAAI 2020

  25. arXiv:1910.12003  [pdf, other

    cs.CV

    Learning Disentangled Representation for Robust Person Re-identification

    Authors: Chanho Eom, Bumsub Ham

    Abstract: We address the problem of person re-identification (reID), that is, retrieving person images from a large dataset, given a query image of the person of interest. A key challenge is to learn person representations robust to intra-class variations, as different persons can have the same attribute and the same person's appearance looks different with viewpoint changes. Recent reID methods focus on le… ▽ More

    Submitted 1 November, 2019; v1 submitted 26 October, 2019; originally announced October 2019.

    Journal ref: NeurIPS 2019

  26. Deformable Kernel Networks for Joint Image Filtering

    Authors: Beomjun Kim, Jean Ponce, Bumsub Ham

    Abstract: Joint image filters are used to transfer structural details from a guidance picture used as a prior to a target image, in tasks such as enhancing spatial resolution and suppressing noise. Previous methods based on convolutional neural networks (CNNs) combine nonlinear activations of spatially-invariant kernels to estimate structural details and regress the filtering result. In this paper, we inste… ▽ More

    Submitted 20 October, 2020; v1 submitted 17 October, 2019; originally announced October 2019.

    Comments: International Journal of Computer Vision (2020). arXiv admin note: substantial text overlap with arXiv:1903.11286

  27. arXiv:1909.07074  [pdf, other

    cs.CV

    Temporally Consistent Depth Prediction with Flow-Guided Memory Units

    Authors: Chanho Eom, Hyunjong Park, Bumsub Ham

    Abstract: Predicting depth from a monocular video sequence is an important task for autonomous driving. Although it has advanced considerably in the past few years, recent methods based on convolutional neural networks (CNNs) discard temporal coherence in the video sequence and estimate depth independently for each frame, which often leads to undesired inconsistent results over time. To address this problem… ▽ More

    Submitted 16 September, 2019; originally announced September 2019.

    Comments: IEEE Transactions on Intelligent Transportation Systems

  28. arXiv:1904.01810  [pdf, other

    cs.CV

    SFNet: Learning Object-aware Semantic Correspondence

    Authors: Junghyup Lee, Dohyung Kim, Jean Ponce, Bumsub Ham

    Abstract: We address the problem of semantic correspondence, that is, establishing a dense flow field between images depicting different instances of the same object or scene category. We propose to use images annotated with binary foreground masks and subjected to synthetic geometric deformations to train a convolutional neural network (CNN) for this task. Using these masks as part of the supervisory signa… ▽ More

    Submitted 4 April, 2019; v1 submitted 3 April, 2019; originally announced April 2019.

    Comments: cvpr 2019 oral paper

  29. arXiv:1903.11286  [pdf, other

    cs.CV

    Deformable kernel networks for guided depth map upsampling

    Authors: Beomjun Kim, Jean Ponce, Bumsub Ham

    Abstract: We address the problem of upsampling a low-resolution (LR) depth map using a registered high-resolution (HR) color image of the same scene. Previous methods based on convolutional neural networks (CNNs) combine nonlinear activations of spatially-invariant kernels to estimate structural details from LR depth and HR color images, and regress upsampling results directly from the networks. In this pap… ▽ More

    Submitted 27 March, 2019; originally announced March 2019.

    Comments: conference submission

  30. arXiv:1705.04043  [pdf, other

    cs.CV

    SCNet: Learning Semantic Correspondence

    Authors: Kai Han, Rafael S. Rezende, Bumsub Ham, Kwan-Yee K. Wong, Minsu Cho, Cordelia Schmid, Jean Ponce

    Abstract: This paper addresses the problem of establishing semantic correspondences between images depicting different instances of the same object or scene category. Previous approaches focus on either combining a spatial regularizer with hand-crafted features, or learning a correspondence model for appearance only. We propose instead a convolutional neural network architecture, called SCNet, for learning… ▽ More

    Submitted 17 August, 2017; v1 submitted 11 May, 2017; originally announced May 2017.

    Comments: ICCV 2017

  31. arXiv:1703.07144  [pdf, other

    cs.CV

    Proposal Flow: Semantic Correspondences from Object Proposals

    Authors: Bumsub Ham, Minsu Cho, Cordelia Schmid, Jean Ponce

    Abstract: Finding image correspondences remains a challenging problem in the presence of intra-class variations and large changes in scene layout. Semantic flow methods are designed to handle images depicting different instances of the same object or scene category. We introduce a novel approach to semantic flow, dubbed proposal flow, that establishes reliable correspondences using object proposals. Unlike… ▽ More

    Submitted 21 March, 2017; originally announced March 2017.

    Comments: arXiv admin note: text overlap with arXiv:1511.05065

  32. arXiv:1702.00926  [pdf, other

    cs.CV

    FCSS: Fully Convolutional Self-Similarity for Dense Semantic Correspondence

    Authors: Seungryong Kim, Dongbo Min, Bumsub Ham, Sangryul Jeon, Stephen Lin, Kwanghoon Sohn

    Abstract: We present a descriptor, called fully convolutional self-similarity (FCSS), for dense semantic correspondence. To robustly match points among different instances within the same object class, we formulate FCSS using local self-similarity (LSS) within a fully convolutional network. In contrast to existing CNN-based descriptors, FCSS is inherently insensitive to intra-class appearance variations bec… ▽ More

    Submitted 3 February, 2017; originally announced February 2017.

  33. arXiv:1604.07944  [pdf, other

    cs.CV

    DASC: Robust Dense Descriptor for Multi-modal and Multi-spectral Correspondence Estimation

    Authors: Seungryong Kim, Dongbo Min, Bumsub Ham, Minh N. Do, Kwanghoon Sohn

    Abstract: Establishing dense correspondences between multiple images is a fundamental task in many applications. However, finding a reliable correspondence in multi-modal or multi-spectral images still remains unsolved due to their challenging photometric and geometric variations. In this paper, we propose a novel dense descriptor, called dense adaptive self-correlation (DASC), to estimate multi-modal and m… ▽ More

    Submitted 27 April, 2016; originally announced April 2016.

  34. arXiv:1604.07681  [pdf, other

    cs.CV

    Efficient Splitting-based Method for Global Image Smoothing

    Authors: Youngjung Kim, Dongbo Min, Bumsub Ham, Kwanghoon Sohn

    Abstract: Edge-preserving smoothing (EPS) can be formulated as minimizing an objective function that consists of data and prior terms. This global EPS approach shows better smoothing performance than a local one that typically has a form of weighted averaging, at the price of high computational cost. In this paper, we introduce a highly efficient splitting-based method for global EPS that minimizes the obje… ▽ More

    Submitted 26 April, 2016; originally announced April 2016.

  35. arXiv:1511.05065  [pdf, other

    cs.CV

    Proposal Flow

    Authors: Bumsub Ham, Minsu Cho, Cordelia Schmid, Jean Ponce

    Abstract: Finding image correspondences remains a challenging problem in the presence of intra-class variations and large changes in scene layout.~Semantic flow methods are designed to handle images depicting different instances of the same object or scene category. We introduce a novel approach to semantic flow, dubbed proposal flow, that establishes reliable correspondences using object proposals. Unlike… ▽ More

    Submitted 8 July, 2016; v1 submitted 16 November, 2015; originally announced November 2015.