Skip to main content

Showing 1–50 of 118 results for author: Lee, K M

  1. arXiv:2406.00636  [pdf, other

    cs.CV

    T2LM: Long-Term 3D Human Motion Generation from Multiple Sentences

    Authors: Taeryung Lee, Fabien Baradel, Thomas Lucas, Kyoung Mu Lee, Gregory Rogez

    Abstract: In this paper, we address the challenging problem of long-term 3D human motion generation. Specifically, we aim to generate a long sequence of smoothly connected actions from a stream of multiple sentences (i.e., paragraph). Previous long-term motion generating approaches were mostly based on recurrent methods, using previously generated motion chunks as input for the next step. However, this appr… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: CVPR 2024 HuMoGen Workshop

  2. arXiv:2405.20233  [pdf, other

    cs.LG cs.AI

    Grokfast: Accelerated Grokking by Amplifying Slow Gradients

    Authors: Jaerin Lee, Bong Gyun Kang, Kihoon Kim, Kyoung Mu Lee

    Abstract: One puzzling artifact in machine learning dubbed grokking is where delayed generalization is achieved tenfolds of iterations after near perfect overfitting to the training data. Focusing on the long delay itself on behalf of machine learning practitioners, our goal is to accelerate generalization of a model under grokking phenomenon. By regarding a series of gradients of a parameter over training… ▽ More

    Submitted 5 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

    Comments: 17 pages, 13 figures. Typo fixed. Project page: https://jaerinlee.com/research/grokfast

  3. arXiv:2404.11358  [pdf, other

    cs.CV

    DeblurGS: Gaussian Splatting for Camera Motion Blur

    Authors: Jeongtaek Oh, Jaeyoung Chung, Dongwoo Lee, Kyoung Mu Lee

    Abstract: Although significant progress has been made in reconstructing sharp 3D scenes from motion-blurred images, a transition to real-world applications remains challenging. The primary obstacle stems from the severe blur which leads to inaccuracies in the acquisition of initial camera poses through Structure-from-Motion, a critical aspect often overlooked by previous approaches. To address this challeng… ▽ More

    Submitted 17 April, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

  4. arXiv:2404.04819  [pdf, other

    cs.CV

    Joint Reconstruction of 3D Human and Object via Contact-Based Refinement Transformer

    Authors: Hyeongjin Nam, Daniel Sungho Jung, Gyeongsik Moon, Kyoung Mu Lee

    Abstract: Human-object contact serves as a strong cue to understand how humans physically interact with objects. Nevertheless, it is not widely explored to utilize human-object contact information for the joint reconstruction of 3D human and object from a single image. In this work, we present a novel joint 3D human-object reconstruction method (CONTHO) that effectively exploits contact information between… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: Published at CVPR 2024, 19 pages including the supplementary material

  5. arXiv:2404.03296  [pdf, other

    cs.CV eess.IV

    AdaBM: On-the-Fly Adaptive Bit Mapping for Image Super-Resolution

    Authors: Cheeun Hong, Kyoung Mu Lee

    Abstract: Although image super-resolution (SR) problem has experienced unprecedented restoration accuracy with deep neural networks, it has yet limited versatile applications due to the substantial computational costs. Since different input images for SR face different restoration difficulties, adapting computational costs based on the input image, referred to as adaptive inference, has emerged as a promisi… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: CVPR 2024

  6. arXiv:2404.01692  [pdf, other

    cs.CV

    Beyond Image Super-Resolution for Image Recognition with Task-Driven Perceptual Loss

    Authors: Jaeha Kim, Junghun Oh, Kyoung Mu Lee

    Abstract: In real-world scenarios, image recognition tasks, such as semantic segmentation and object detection, often pose greater challenges due to the lack of information available within low-resolution (LR) content. Image super-resolution (SR) is one of the promising solutions for addressing the challenges. However, due to the ill-posed property of SR, it is challenging for typical SR methods to restore… ▽ More

    Submitted 4 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted at CVPR 2024

  7. arXiv:2403.09055  [pdf, other

    cs.CV

    StreamMultiDiffusion: Real-Time Interactive Generation with Region-Based Semantic Control

    Authors: Jaerin Lee, Daniel Sungho Jung, Kanggeon Lee, Kyoung Mu Lee

    Abstract: The enormous success of diffusion models in text-to-image synthesis has made them promising candidates for the next generation of end-user applications for image generation and editing. Previous works have focused on improving the usability of diffusion models by reducing the inference time or increasing user interactivity by allowing new, fine-grained controls such as region-based text prompts. H… ▽ More

    Submitted 1 April, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: 29 pages, 16 figures. v2: typos corrected, references added. Project page: https://jaerinlee.com/research/StreamMultiDiffusion

  8. arXiv:2402.03399  [pdf, other

    eess.IV cs.CV

    Rethinking RGB Color Representation for Image Restoration Models

    Authors: Jaerin Lee, JoonKyu Park, Sungyong Baik, Kyoung Mu Lee

    Abstract: Image restoration models are typically trained with a pixel-wise distance loss defined over the RGB color representation space, which is well known to be a source of blurry and unrealistic textures in the restored images. The reason, we believe, is that the three-channel RGB space is insufficient for supervising the restoration models. To this end, we augment the representation to hold structural… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 31 pages (11 pages main manuscript + 20 pages appendices), 22 figures

  9. arXiv:2401.04143  [pdf, other

    cs.CV

    RHOBIN Challenge: Reconstruction of Human Object Interaction

    Authors: Xianghui Xie, Xi Wang, Nikos Athanasiou, Bharat Lal Bhatnagar, Chun-Hao P. Huang, Kaichun Mo, Hao Chen, Xia Jia, Zerui Zhang, Liangxian Cui, Xiao Lin, Bingqiao Qian, Jie Xiao, Wenfei Yang, Hyeongjin Nam, Daniel Sungho Jung, Kihoon Kim, Kyoung Mu Lee, Otmar Hilliges, Gerard Pons-Moll

    Abstract: Modeling the interaction between humans and objects has been an emerging research direction in recent years. Capturing human-object interaction is however a very challenging task due to heavy occlusion and complex dynamics, which requires understanding not only 3D human pose, and object pose but also the interaction between them. Reconstruction of 3D humans and objects has been two separate resear… ▽ More

    Submitted 7 January, 2024; originally announced January 2024.

    Comments: 14 pages, 5 tables, 7 figure. Technical report of the CVPR'23 workshop: RHOBIN challenge (https://rhobin-challenge.github.io/)

  10. arXiv:2312.09925  [pdf, other

    cs.CV

    CNC-Net: Self-Supervised Learning for CNC Machining Operations

    Authors: Mohsen Yavartanoo, Sangmin Hong, Reyhaneh Neshatavar, Kyoung Mu Lee

    Abstract: CNC manufacturing is a process that employs computer numerical control (CNC) machines to govern the movements of various industrial tools and machinery, encompassing equipment ranging from grinders and lathes to mills and CNC routers. However, the reliance on manual CNC programming has become a bottleneck, and the requirement for expert knowledge can result in significant costs. Therefore, we intr… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

  11. arXiv:2311.13398  [pdf, other

    cs.CV cs.GR

    Depth-Regularized Optimization for 3D Gaussian Splatting in Few-Shot Images

    Authors: Jaeyoung Chung, Jeongtaek Oh, Kyoung Mu Lee

    Abstract: In this paper, we present a method to optimize Gaussian splatting with a limited number of images while avoiding overfitting. Representing a 3D scene by combining numerous Gaussian splats has yielded outstanding visual quality. However, it tends to overfit the training views when only a small number of images are available. To address this issue, we introduce a dense depth map as a geometry guide… ▽ More

    Submitted 4 January, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

    Comments: 10 pages, 5 figures; Project page: robot0321.github.io/DepthRegGS

  12. arXiv:2311.13384  [pdf, other

    cs.CV

    LucidDreamer: Domain-free Generation of 3D Gaussian Splatting Scenes

    Authors: Jaeyoung Chung, Suyoung Lee, Hyeongjin Nam, Jaerin Lee, Kyoung Mu Lee

    Abstract: With the widespread usage of VR devices and contents, demands for 3D scene generation techniques become more popular. Existing 3D scene generation models, however, limit the target scene to specific domain, primarily due to their training strategies using 3D scan dataset that is far from the real-world. To address such limitation, we propose LucidDreamer, a domain-free scene generation pipeline by… ▽ More

    Submitted 23 November, 2023; v1 submitted 22 November, 2023; originally announced November 2023.

    Comments: Project page: https://luciddreamer-cvlab.github.io/

  13. 3DHR-Co: A Collaborative Test-time Refinement Framework for In-the-Wild 3D Human-Body Reconstruction Task

    Authors: Jonathan Samuel Lumentut, Kyoung Mu Lee

    Abstract: The field of 3D human-body reconstruction (abbreviated as 3DHR) that utilizes parametric pose and shape representations has witnessed significant advancements in recent years. However, the application of 3DHR techniques to handle real-world, diverse scenes, known as in-the-wild data, still faces limitations. The primary challenge arises as curating accurate 3D human pose ground truth (GT) for in-t… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: 12 pages, 7 figures

  14. arXiv:2309.08957  [pdf, other

    cs.CV

    ExBluRF: Efficient Radiance Fields for Extreme Motion Blurred Images

    Authors: Dongwoo Lee, Jeongtaek Oh, Jaesung Rim, Sunghyun Cho, Kyoung Mu Lee

    Abstract: We present ExBluRF, a novel view synthesis method for extreme motion blurred images based on efficient radiance fields optimization. Our approach consists of two main components: 6-DOF camera trajectory-based motion blur formulation and voxel-based radiance fields. From extremely blurred images, we optimize the sharp radiance fields by jointly estimating the camera trajectories that generate the b… ▽ More

    Submitted 24 February, 2024; v1 submitted 16 September, 2023; originally announced September 2023.

    Comments: https://github.com/taekkii/ExBluRF/tree/main

  15. arXiv:2309.01961  [pdf, other

    cs.CV

    NICE: CVPR 2023 Challenge on Zero-shot Image Captioning

    Authors: Taehoon Kim, Pyunghwan Ahn, Sangyun Kim, Sihaeng Lee, Mark Marsden, Alessandra Sala, Seung Hwan Kim, Bohyung Han, Kyoung Mu Lee, Honglak Lee, Kyounghoon Bae, Xiangyu Wu, Yi Gao, Hailiang Zhang, Yang Yang, Weili Guo, Jianfeng Lu, Youngtaek Oh, Jae Won Cho, Dong-jin Kim, In So Kweon, Junmo Kim, Wooyoung Kang, Won Young Jhoo, Byungseok Roh , et al. (17 additional authors not shown)

    Abstract: In this report, we introduce NICE (New frontiers for zero-shot Image Captioning Evaluation) project and share the results and outcomes of 2023 challenge. This project is designed to challenge the computer vision community to develop robust image captioning models that advance the state-of-the-art both in terms of accuracy and fairness. Through the challenge, the image captioning models were tested… ▽ More

    Submitted 10 September, 2023; v1 submitted 5 September, 2023; originally announced September 2023.

    Comments: Tech report, project page https://nice.lgresearch.ai/

  16. arXiv:2309.01943  [pdf, other

    cs.CV

    Extract-and-Adaptation Network for 3D Interacting Hand Mesh Recovery

    Authors: JoonKyu Park, Daniel Sungho Jung, Gyeongsik Moon, Kyoung Mu Lee

    Abstract: Understanding how two hands interact with each other is a key component of accurate 3D interacting hand mesh recovery. However, recent Transformer-based methods struggle to learn the interaction between two hands as they directly utilize two hand features as input tokens, which results in distant token problem. The distant token problem represents that input tokens are in heterogeneous spaces, lea… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: Accepted at ICCVW 2023

  17. arXiv:2308.09305  [pdf, other

    cs.CV

    Human Part-wise 3D Motion Context Learning for Sign Language Recognition

    Authors: Taeryung Lee, Yeonguk Oh, Kyoung Mu Lee

    Abstract: In this paper, we propose P3D, the human part-wise motion context learning framework for sign language recognition. Our main contributions lie in two dimensions: learning the part-wise motion context and employing the pose ensemble to utilize 2D and 3D pose jointly. First, our empirical observation implies that part-wise context encoding benefits the performance of sign language recognition. While… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: ICCV 2023

  18. arXiv:2308.06554  [pdf, other

    cs.CV

    Cyclic Test-Time Adaptation on Monocular Video for 3D Human Mesh Reconstruction

    Authors: Hyeongjin Nam, Daniel Sungho Jung, Yeonguk Oh, Kyoung Mu Lee

    Abstract: Despite recent advances in 3D human mesh reconstruction, domain gap between training and test data is still a major challenge. Several prior works tackle the domain gap problem via test-time adaptation that fine-tunes a network relying on 2D evidence (e.g., 2D human keypoints) from test images. However, the high reliance on 2D evidence during adaptation causes two major issues. First, 2D evidence… ▽ More

    Submitted 12 August, 2023; originally announced August 2023.

    Comments: Published at ICCV 2023, 16 pages including the supplementary material

  19. arXiv:2307.13337  [pdf, other

    cs.CV eess.IV

    Overcoming Distribution Mismatch in Quantizing Image Super-Resolution Networks

    Authors: Cheeun Hong, Kyoung Mu Lee

    Abstract: Quantization is a promising approach to reduce the high computational complexity of image super-resolution (SR) networks. However, compared to high-level tasks like image classification, low-bit quantization leads to severe accuracy loss in SR networks. This is because feature distributions of SR networks are significantly divergent for each channel or input image, and is thus difficult to determi… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

  20. arXiv:2307.12751  [pdf, other

    eess.IV cs.CV

    ICF-SRSR: Invertible scale-Conditional Function for Self-Supervised Real-world Single Image Super-Resolution

    Authors: Reyhaneh Neshatavar, Mohsen Yavartanoo, Sanghyun Son, Kyoung Mu Lee

    Abstract: Single image super-resolution (SISR) is a challenging ill-posed problem that aims to up-sample a given low-resolution (LR) image to a high-resolution (HR) counterpart. Due to the difficulty in obtaining real LR-HR training pairs, recent approaches are trained on simulated LR images degraded by simplified down-sampling operators, e.g., bicubic. Such an approach can be problematic in practice becaus… ▽ More

    Submitted 31 August, 2023; v1 submitted 24 July, 2023; originally announced July 2023.

  21. arXiv:2305.01869  [pdf, other

    cs.RO cs.MA

    Decentralised Active Perception in Continuous Action Spaces for the Coordinated Escort Problem

    Authors: Rhett Hull, Ki Myung Brian Lee, Jennifer Wakulicz, Chanyeol Yoo, James McMahon, Bryan Clarke, Stuart Anstee, Jijoong Kim, Robert Fitch

    Abstract: We consider the coordinated escort problem, where a decentralised team of supporting robots implicitly assist the mission of higher-value principal robots. The defining challenge is how to evaluate the effect of supporting robots' actions on the principal robots' mission. To capture this effect, we define two novel auxiliary reward functions for supporting robots called satisfaction improvement an… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

    Comments: 7 pages, 4 figures

  22. The Effect of Robot Skill Level and Communication in Rapid, Proximate Human-Robot Collaboration

    Authors: Kin Man Lee, Arjun Krishna, Zulfiqar Zaidi, Rohan Paleja, Letian Chen, Erin Hedlund-Botti, Mariah Schrum, Matthew Gombolay

    Abstract: As high-speed, agile robots become more commonplace, these robots will have the potential to better aid and collaborate with humans. However, due to the increased agility and functionality of these robots, close collaboration with humans can create safety concerns that alter team dynamics and degrade task performance. In this work, we aim to enable the deployment of safe and trustworthy agile robo… ▽ More

    Submitted 7 April, 2023; originally announced April 2023.

    Journal ref: HRI '23: Proceedings of the 2023 ACM/IEEE International Conference on Human-Robot Interaction

  23. arXiv:2303.15417  [pdf, other

    cs.CV

    Recovering 3D Hand Mesh Sequence from a Single Blurry Image: A New Dataset and Temporal Unfolding

    Authors: Yeonguk Oh, JoonKyu Park, Jaeha Kim, Gyeongsik Moon, Kyoung Mu Lee

    Abstract: Hands, one of the most dynamic parts of our body, suffer from blur due to their active movements. However, previous 3D hand mesh recovery methods have mainly focused on sharp hand images rather than considering blur due to the absence of datasets providing blurry hand images. We first present a novel dataset BlurHand, which contains blurry hand images with 3D groundtruths. The BlurHand is construc… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: Accepted at CVPR 2023

  24. arXiv:2303.05370  [pdf, other

    cs.CV

    Rethinking Self-Supervised Visual Representation Learning in Pre-training for 3D Human Pose and Shape Estimation

    Authors: Hongsuk Choi, Hyeongjin Nam, Taeryung Lee, Gyeongsik Moon, Kyoung Mu Lee

    Abstract: Recently, a few self-supervised representation learning (SSL) methods have outperformed the ImageNet classification pre-training for vision tasks such as object detection. However, its effects on 3D human body pose and shape estimation (3DHPSE) are open to question, whose target is fixed to a unique class, the human, and has an inherent task gap with SSL. We empirically study and analyze the effec… ▽ More

    Submitted 9 March, 2023; originally announced March 2023.

    Comments: Accepted to ICLR 2023, 18 pages including the appendix

  25. arXiv:2303.01979  [pdf, other

    cs.CV

    ACL-SPC: Adaptive Closed-Loop system for Self-Supervised Point Cloud Completion

    Authors: Sangmin Hong, Mohsen Yavartanoo, Reyhaneh Neshatavar, Kyoung Mu Lee

    Abstract: Point cloud completion addresses filling in the missing parts of a partial point cloud obtained from depth sensors and generating a complete point cloud. Although there has been steep progress in the supervised methods on the synthetic point cloud completion task, it is hardly applicable in real-world scenarios due to the domain gap between the synthetic and real-world datasets or the requirement… ▽ More

    Submitted 28 March, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

    Comments: Published at CVPR 2023

    MSC Class: I.5.4

  26. arXiv:2301.09821  [pdf, other

    cs.RO

    Topological Trajectory Prediction with Homotopy Classes

    Authors: Jennifer Wakulicz, Ki Myung Brian Lee, Teresa Vidal-Calleja, Robert Fitch

    Abstract: Trajectory prediction in a cluttered environment is key to many important robotics tasks such as autonomous navigation. However, there are an infinite number of possible trajectories to consider. To simplify the space of trajectories under consideration, we utilise homotopy classes to partition the space into countably many mathematically equivalent classes. All members within a class demonstrate… ▽ More

    Submitted 24 January, 2023; originally announced January 2023.

    Comments: 7 pages, 7 figures, accepted to ICRA 2023

  27. arXiv:2212.08328  [pdf, other

    cs.CV

    MEIL-NeRF: Memory-Efficient Incremental Learning of Neural Radiance Fields

    Authors: Jaeyoung Chung, Kanggeon Lee, Sungyong Baik, Kyoung Mu Lee

    Abstract: Hinged on the representation power of neural networks, neural radiance fields (NeRF) have recently emerged as one of the promising and widely applicable methods for 3D object and scene representation. However, NeRF faces challenges in practical applications, such as large-scale scenes and edge devices with a limited amount of memory, where data needs to be processed sequentially. Under such increm… ▽ More

    Submitted 31 December, 2022; v1 submitted 16 December, 2022; originally announced December 2022.

    Comments: 18 pages. For the project page, see https://robot0321.github.io/meil-nerf/index.html

  28. arXiv:2212.05897  [pdf, other

    cs.CV

    MultiAct: Long-Term 3D Human Motion Generation from Multiple Action Labels

    Authors: Taeryung Lee, Gyeongsik Moon, Kyoung Mu Lee

    Abstract: We tackle the problem of generating long-term 3D human motion from multiple action labels. Two main previous approaches, such as action- and motion-conditioned methods, have limitations to solve this problem. The action-conditioned methods generate a sequence of motion from a single action. Hence, it cannot generate long-term motions composed of multiple actions and transitions between actions. Me… ▽ More

    Submitted 17 February, 2023; v1 submitted 12 December, 2022; originally announced December 2022.

    Comments: AAAI 2023 (Oral presentation)

  29. arXiv:2210.02517  [pdf, other

    cs.RO

    Athletic Mobile Manipulator System for Robotic Wheelchair Tennis

    Authors: Zulfiqar Zaidi, Daniel Martin, Nathaniel Belles, Viacheslav Zakharov, Arjun Krishna, Kin Man Lee, Peter Wagstaff, Sumedh Naik, Matthew Sklar, Sugju Choi, Yoshiki Kakehi, Ruturaj Patil, Divya Mallemadugula, Florian Pesce, Peter Wilson, Wendell Hom, Matan Diamond, Bryan Zhao, Nina Moorman, Rohan Paleja, Letian Chen, Esmaeil Seraj, Matthew Gombolay

    Abstract: Athletics are a quintessential and universal expression of humanity. From French monks who in the 12th century invented jeu de paume, the precursor to modern lawn tennis, back to the K'iche' people who played the Maya Ballgame as a form of religious expression over three thousand years ago, humans have sought to train their minds and bodies to excel in sporting contests. Advances in robotics are o… ▽ More

    Submitted 7 February, 2023; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: 8 pages, accepted at RA-L, will also be presented at IROS 2023

  30. arXiv:2210.00627  [pdf, other

    cs.CV

    MonoNHR: Monocular Neural Human Renderer

    Authors: Hongsuk Choi, Gyeongsik Moon, Matthieu Armando, Vincent Leroy, Kyoung Mu Lee, Gregory Rogez

    Abstract: Existing neural human rendering methods struggle with a single image input due to the lack of information in invisible areas and the depth ambiguity of pixels in visible areas. In this regard, we propose Monocular Neural Human Renderer (MonoNHR), a novel approach that renders robust free-viewpoint images of an arbitrary human given only a single image. MonoNHR is the first method that (i) renders… ▽ More

    Submitted 2 October, 2022; originally announced October 2022.

    Comments: Hongsuk Choi and Gyeongsik Moon contributed equally, 15 pages including the reference and supplementary material

  31. arXiv:2209.04800  [pdf, other

    cs.RO

    Motion planning in task space with Gromov-Hausdorff approximations

    Authors: Fouad Sukkar, Jennifer Wakulicz, Ki Myung Brian Lee, Robert Fitch

    Abstract: Applications of industrial robotic manipulators such as cobots can require efficient online motion planning in environments that have a combination of static and non-static obstacles. Existing general purpose planning methods often produce poor quality solutions when available computation time is restricted, or fail to produce a solution entirely. We propose a new motion planning framework designe… ▽ More

    Submitted 11 September, 2022; originally announced September 2022.

    Comments: Submitted to International Journal of Robotics Research (IJRR). 23 Pages. 19 Figures

  32. Extraction of Coronary Vessels in Fluoroscopic X-Ray Sequences Using Vessel Correspondence Optimization

    Authors: Seung Yeon Shin, Soochahn Lee, Kyoung Jin Noh, Il Dong Yun, Kyoung Mu Lee

    Abstract: We present a method to extract coronary vessels from fluoroscopic x-ray sequences. Given the vessel structure for the source frame, vessel correspondence candidates in the subsequent frame are generated by a novel hierarchical search scheme to overcome the aperture problem. Optimal correspondences are determined within a Markov random field optimization framework. Post-processing is performed to e… ▽ More

    Submitted 27 July, 2022; originally announced July 2022.

    Comments: MICCAI 2016

  33. arXiv:2207.10345  [pdf, other

    cs.CV eess.IV

    CADyQ: Content-Aware Dynamic Quantization for Image Super-Resolution

    Authors: Cheeun Hong, Sungyong Baik, Heewon Kim, Seungjun Nah, Kyoung Mu Lee

    Abstract: Despite breakthrough advances in image super-resolution (SR) with convolutional neural networks (CNNs), SR has yet to enjoy ubiquitous applications due to the high computational complexity of SR networks. Quantization is one of the promising approaches to solve this problem. However, existing methods fail to quantize SR models with a bit-width lower than 8 bits, suffering from severe accuracy loss… ▽ More

    Submitted 30 October, 2022; v1 submitted 21 July, 2022; originally announced July 2022.

    Comments: ECCV 2022

  34. arXiv:2207.10053  [pdf, other

    cs.CV

    3D Clothed Human Reconstruction in the Wild

    Authors: Gyeongsik Moon, Hyeongjin Nam, Takaaki Shiratori, Kyoung Mu Lee

    Abstract: Although much progress has been made in 3D clothed human reconstruction, most of the existing methods fail to produce robust results from in-the-wild images, which contain diverse human poses and appearances. This is mainly due to the large domain gap between training datasets and in-the-wild datasets. The training datasets are usually synthetic ones, which contain rendered images from GT 3D scans… ▽ More

    Submitted 20 July, 2022; originally announced July 2022.

    Comments: Accepted to ECCV 2022, 25 pages including the supplementary material

  35. arXiv:2206.09506  [pdf, other

    cs.RO

    Log-GPIS-MOP: A Unified Representation for Mapping, Odometry and Planning

    Authors: Lan Wu, Ki Myung Brian Lee, Cedric Le Gentil, Teresa Vidal-Calleja

    Abstract: Whereas dedicated scene representations are required for each different task in conventional robotic systems, this paper demonstrates that a unified representation can be used directly for multiple key tasks. We propose the Log-Gaussian Process Implicit Surface for Mapping, Odometry and Planning (Log-GPIS-MOP): a probabilistic framework for surface reconstruction, localisation and navigation based… ▽ More

    Submitted 11 July, 2023; v1 submitted 19 June, 2022; originally announced June 2022.

  36. arXiv:2206.08488  [pdf, other

    cs.CV

    Controllable Image Enhancement

    Authors: Heewon Kim, Kyoung Mu Lee

    Abstract: Editing flat-looking images into stunning photographs requires skill and time. Automated image enhancement algorithms have attracted increased interest by generating high-quality images without user interaction. However, the quality assessment of a photograph is subjective. Even in tone and color adjustments, a single photograph of auto-enhancement is challenging to fit user preferences which are… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

  37. Some performance considerations when using multi-armed bandit algorithms in the presence of missing data

    Authors: Xijin Chen, Kim May Lee, Sofia S. Villar, David S. Robertson

    Abstract: When comparing the performance of multi-armed bandit algorithms, the potential impact of missing data is often overlooked. In practice, it also affects their implementation where the simplest approach to overcome this is to continue to sample according to the original bandit algorithm, ignoring missing outcomes. We investigate the impact on performance of this approach to deal with missing data fo… ▽ More

    Submitted 7 July, 2022; v1 submitted 8 May, 2022; originally announced May 2022.

    Comments: 30 pages, 6 figures

  38. arXiv:2204.12266  [pdf, other

    cs.CV

    Attentive Fine-Grained Structured Sparsity for Image Restoration

    Authors: Junghun Oh, Heewon Kim, Seungjun Nah, Cheeun Hong, Jonghyun Choi, Kyoung Mu Lee

    Abstract: Image restoration tasks have witnessed great performance improvement in recent years by developing large deep models. Despite the outstanding performance, the heavy computation demanded by the deep models has restricted the application of image restoration. To lift the restriction, it is required to reduce the size of the networks while maintaining accuracy. Recently, N:M structured pruning has ap… ▽ More

    Submitted 15 July, 2022; v1 submitted 26 April, 2022; originally announced April 2022.

    Comments: Accepted to CVPR 2022

  39. arXiv:2203.16063  [pdf, other

    cs.CV

    Pay Attention to Hidden States for Video Deblurring: Ping-Pong Recurrent Neural Networks and Selective Non-Local Attention

    Authors: JoonKyu Park, Seungjun Nah, Kyoung Mu Lee

    Abstract: Video deblurring models exploit information in the neighboring frames to remove blur caused by the motion of the camera and the objects. Recurrent Neural Networks~(RNNs) are often adopted to model the temporal dependency between frames via hidden states. When motion blur is strong, however, hidden states are hard to deliver proper information due to the displacement between different frames. While… ▽ More

    Submitted 7 April, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

    Comments: also attached the supplementary material

  40. arXiv:2203.14564  [pdf, other

    cs.CV

    HandOccNet: Occlusion-Robust 3D Hand Mesh Estimation Network

    Authors: JoonKyu Park, Yeonguk Oh, Gyeongsik Moon, Hongsuk Choi, Kyoung Mu Lee

    Abstract: Hands are often severely occluded by objects, which makes 3D hand mesh estimation challenging. Previous works often have disregarded information at occluded regions. However, we argue that occluded regions have strong correlations with hands so that they can provide highly beneficial information for complete 3D hand mesh estimation. Thus, in this work, we propose a novel 3D hand mesh estimation ne… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: also attached the supplementary material

    Journal ref: Computer Vision and Pattern Recognition (CVPR), 2022

  41. arXiv:2203.13009  [pdf, other

    cs.CV

    CVF-SID: Cyclic multi-Variate Function for Self-Supervised Image Denoising by Disentangling Noise from Image

    Authors: Reyhaneh Neshatavar, Mohsen Yavartanoo, Sanghyun Son, Kyoung Mu Lee

    Abstract: Recently, significant progress has been made on image denoising with strong supervision from large-scale datasets. However, obtaining well-aligned noisy-clean training image pairs for each specific scenario is complicated and costly in practice. Consequently, applying a conventional supervised denoising network on in-the-wild noisy inputs is not straightforward. Although several studies have chall… ▽ More

    Submitted 29 March, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

    Comments: Published at CVPR 2022

  42. arXiv:2203.11799  [pdf, other

    cs.CV eess.IV

    AP-BSN: Self-Supervised Denoising for Real-World Images via Asymmetric PD and Blind-Spot Network

    Authors: Wooseok Lee, Sanghyun Son, Kyoung Mu Lee

    Abstract: Blind-spot network (BSN) and its variants have made significant advances in self-supervised denoising. Nevertheless, they are still bound to synthetic noisy inputs due to less practical assumptions like pixel-wise independent noise. Hence, it is challenging to deal with spatially correlated real-world noise using self-supervised BSN. Recently, pixel-shuffle downsampling (PD) has been proposed to r… ▽ More

    Submitted 24 March, 2022; v1 submitted 22 March, 2022; originally announced March 2022.

    Comments: Accepted to CVPR2022

  43. arXiv:2203.06418  [pdf, other

    eess.IV cs.CV

    Recurrence-in-Recurrence Networks for Video Deblurring

    Authors: Joonkyu Park, Seungjun Nah, Kyoung Mu Lee

    Abstract: State-of-the-art video deblurring methods often adopt recurrent neural networks to model the temporal dependency between the frames. While the hidden states play key role in delivering information to the next frame, abrupt motion blur tend to weaken the relevance in the neighbor frames. In this paper, we propose recurrence-in-recurrence network architecture to cope with the limitations of short-ra… ▽ More

    Submitted 12 March, 2022; originally announced March 2022.

    Comments: accepted paper in BMVC 2021

    MSC Class: I.4.5

    Journal ref: The British Machine Vision Conference (BMVC) 2021

  44. Informative Planning for Worst-Case Error Minimisation in Sparse Gaussian Process Regression

    Authors: Jennifer Wakulicz, Ki Myung Brian Lee, Chanyeol Yoo, Teresa Vidal-Calleja, Robert Fitch

    Abstract: We present a planning framework for minimising the deterministic worst-case error in sparse Gaussian process (GP) regression. We first derive a universal worst-case error bound for sparse GP regression with bounded noise using interpolation theory on reproducing kernel Hilbert spaces (RKHSs). By exploiting the conditional independence (CI) assumption central to sparse GP regression, we show that t… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

    Comments: 7 pages, 6 figures, accepted to Proc. of ICRA 2022

  45. arXiv:2202.09533  [pdf, other

    cs.CV eess.IV

    C2N: Practical Generative Noise Modeling for Real-World Denoising

    Authors: Geonwoon Jang, Wooseok Lee, Sanghyun Son, Kyoung Mu Lee

    Abstract: Learning-based image denoising methods have been bounded to situations where well-aligned noisy and clean images are given, or samples are synthesized from predetermined noise models, e.g., Gaussian. While recent generative noise modeling methods aim to simulate the unknown distribution of real-world noise, several limitations still exist. In a practical scenario, a noise generator should learn to… ▽ More

    Submitted 19 February, 2022; originally announced February 2022.

    Journal ref: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 2350-2359

  46. arXiv:2112.01155  [pdf, other

    cs.CV cs.AI

    Batch Normalization Tells You Which Filter is Important

    Authors: Junghun Oh, Heewon Kim, Sungyong Baik, Cheeun Hong, Kyoung Mu Lee

    Abstract: The goal of filter pruning is to search for unimportant filters to remove in order to make convolutional neural networks (CNNs) efficient without sacrificing the performance in the process. The challenge lies in finding information that can help determine how important or relevant each filter is with respect to the final output of neural networks. In this work, we share our observation that the ba… ▽ More

    Submitted 23 April, 2022; v1 submitted 2 December, 2021; originally announced December 2021.

  47. arXiv:2111.01100  [pdf, other

    cs.MA cs.AI cs.LG

    Investigation of Independent Reinforcement Learning Algorithms in Multi-Agent Environments

    Authors: Ken Ming Lee, Sriram Ganapathi Subramanian, Mark Crowley

    Abstract: Independent reinforcement learning algorithms have no theoretical guarantees for finding the best policy in multi-agent settings. However, in practice, prior works have reported good performance with independent algorithms in some domains and bad performance in others. Moreover, a comprehensive study of the strengths and weaknesses of independent algorithms is lacking in the literature. In this pa… ▽ More

    Submitted 1 November, 2021; originally announced November 2021.

    Comments: 15 pages, 7 figures, Accepted for NeurIPS 2021 Deep Reinforcement Learning Workshop

  48. arXiv:2110.12984  [pdf, other

    eess.IV cs.CV cs.LG

    Generative Residual Attention Network for Disease Detection

    Authors: Euyoung Kim, Soochahn Lee, Kyoung Mu Lee

    Abstract: Accurate identification and localization of abnormalities from radiology images serve as a critical role in computer-aided diagnosis (CAD) systems. Building a highly generalizable system usually requires a large amount of data with high-quality annotations, including disease-specific global and localization information. However, in medical images, only a limited number of high-quality images and a… ▽ More

    Submitted 25 October, 2021; originally announced October 2021.

    Comments: The paper is about Pneumonia detection using Generative Modeling. It proposes a novel approach to construct pseudo-pair images and a GAN to generate radio-realistic Chest Xray images. Then, the paper propose to leverage the differences between the input and the generated Xray images as an additional attention-map to boost the performance in Pneumonia detection

  49. arXiv:2110.07882  [pdf, other

    cs.CV

    PolyNet: Polynomial Neural Network for 3D Shape Recognition with PolyShape Representation

    Authors: Mohsen Yavartanoo, Shih-Hsuan Hung, Reyhaneh Neshatavar, Yue Zhang, Kyoung Mu Lee

    Abstract: 3D shape representation and its processing have substantial effects on 3D shape recognition. The polygon mesh as a 3D shape representation has many advantages in computer graphics and geometry processing. However, there are still some challenges for the existing deep neural network (DNN)-based methods on polygon mesh representation, such as handling the variations in the degree and permutations of… ▽ More

    Submitted 15 October, 2021; originally announced October 2021.

    Journal ref: 2021 International Conference on 3D Vision (3DV)

  50. arXiv:2110.03909  [pdf, other

    cs.LG cs.CV

    Meta-Learning with Task-Adaptive Loss Function for Few-Shot Learning

    Authors: Sungyong Baik, Janghoon Choi, Heewon Kim, Dohee Cho, Jaesik Min, Kyoung Mu Lee

    Abstract: In few-shot learning scenarios, the challenge is to generalize and perform well on new unseen examples when only very few labeled examples are available for each task. Model-agnostic meta-learning (MAML) has gained the popularity as one of the representative few-shot learning methods for its flexibility and applicability to diverse problems. However, MAML and its variants often resort to a simple… ▽ More

    Submitted 17 October, 2021; v1 submitted 8 October, 2021; originally announced October 2021.

    Comments: ICCV 2021 (Oral). Code at https://github.com/baiksung/MeTAL