Skip to main content

Showing 1–50 of 79 results for author: Choe, D

  1. arXiv:2407.00927  [pdf, ps, other

    cs.LG cs.CC stat.ML

    Learnability of Parameter-Bounded Bayes Nets

    Authors: Arnab Bhattacharyya, Davin Choo, Sutanu Gayen, Dimitrios Myrisiotis

    Abstract: Bayes nets are extensively used in practice to efficiently represent joint probability distributions over a set of random variables and capture dependency relations. In a seminal paper, Chickering et al. (JMLR 2004) showed that given a distribution $P$, that is defined as the marginal distribution of a Bayes net, it is $\mathsf{NP}$-hard to decide whether there is a parameter-bounded Bayes net tha… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: 15 pages, 2 figures

  2. arXiv:2406.07803  [pdf, other

    cs.SD cs.AI eess.AS

    EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for Controllable Emotional Text-to-Speech

    Authors: Deok-Hyeon Cho, Hyung-Seok Oh, Seung-Bin Kim, Sang-Hoon Lee, Seong-Whan Lee

    Abstract: Despite rapid advances in the field of emotional text-to-speech (TTS), recent studies primarily focus on mimicking the average style of a particular emotion. As a result, the ability to manipulate speech emotion remains constrained to several predefined labels, compromising the ability to reflect the nuanced variations of emotion. In this paper, we propose EmoSphere-TTS, which synthesizes expressi… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted at INTERSPEECH 2024

  3. arXiv:2405.09784  [pdf, other

    cs.LG cs.AI cs.DS stat.ML

    Online bipartite matching with imperfect advice

    Authors: Davin Choo, Themis Gouleakis, Chun Kai Ling, Arnab Bhattacharyya

    Abstract: We study the problem of online unweighted bipartite matching with $n$ offline vertices and $n$ online vertices where one wishes to be competitive against the optimal offline algorithm. While the classic RANKING algorithm of Karp et al. [1990] provably attains competitive ratio of $1-1/e > 1/2$, we show that no learning-augmented method can be both 1-consistent and strictly better than $1/2$-robust… ▽ More

    Submitted 23 May, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

    Comments: Accepted into ICML 2024

  4. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  5. Envy-Free House Allocation with Minimum Subsidy

    Authors: Davin Choo, Yan Hao Ling, Warut Suksompong, Nicholas Teh, Jian Zhang

    Abstract: House allocation refers to the problem where $m$ houses are to be allocated to $n$ agents so that each agent receives one house. Since an envy-free house allocation does not always exist, we consider finding such an allocation in the presence of subsidy. We show that computing an envy-free allocation with minimum subsidy is NP-hard in general, but can be done efficiently if $m$ differs from $n$ by… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

    Journal ref: Operations Research Letters, 54:107103 (2024)

  6. arXiv:2402.08229  [pdf, other

    cs.LG cs.DS stat.ME stat.ML

    Causal Discovery under Off-Target Interventions

    Authors: Davin Choo, Kirankumar Shiragur, Caroline Uhler

    Abstract: Causal graph discovery is a significant problem with applications across various disciplines. However, with observational data alone, the underlying causal graph can only be recovered up to its Markov equivalence class, and further assumptions or interventions are necessary to narrow down the true graph. This work addresses the causal discovery problem under the setting of stochastic interventions… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: Accepted into AISTATS 2024

  7. arXiv:2401.08095  [pdf, other

    cs.SD cs.AI eess.AS

    DurFlex-EVC: Duration-Flexible Emotional Voice Conversion with Parallel Generation

    Authors: Hyung-Seok Oh, Sang-Hoon Lee, Deok-Hyeon Cho, Seong-Whan Lee

    Abstract: Emotional voice conversion (EVC) seeks to modify the emotional tone of a speaker's voice while preserving the original linguistic content and the speaker's unique vocal characteristics. Recent advancements in EVC have involved the simultaneous modeling of pitch and duration, utilizing the potential of sequence-to-sequence (seq2seq) models. To enhance reliability and efficiency in conversion, this… ▽ More

    Submitted 7 March, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

    Comments: 13 pages, 9 figures, 8 tables

  8. arXiv:2312.02819  [pdf, other

    cs.CV

    Deterministic Guidance Diffusion Model for Probabilistic Weather Forecasting

    Authors: Donggeun Yoon, Minseok Seo, Doyi Kim, Yeji Choi, Donghyeon Cho

    Abstract: Weather forecasting requires not only accuracy but also the ability to perform probabilistic prediction. However, deterministic weather forecasting methods do not support probabilistic predictions, and conversely, probabilistic models tend to be less accurate. To address these challenges, in this paper, we introduce the \textbf{\textit{D}}eterministic \textbf{\textit{G}}uidance \textbf{\textit{D}}… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: 16 pages

  9. arXiv:2310.19261  [pdf, other

    cs.LG

    Diversify & Conquer: Outcome-directed Curriculum RL via Out-of-Distribution Disagreement

    Authors: Daesol Cho, Seungjae Lee, H. Jin Kim

    Abstract: Reinforcement learning (RL) often faces the challenges of uninformed search problems where the agent should explore without access to the domain knowledge such as characteristics of the environment or external rewards. To tackle these challenges, this work proposes a new approach for curriculum RL called Diversify for Disagreement & Conquer (D2C). Unlike previous curriculum learning methods, D2C r… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

  10. arXiv:2310.17330  [pdf, other

    cs.LG cs.AI

    CQM: Curriculum Reinforcement Learning with a Quantized World Model

    Authors: Seungjae Lee, Daesol Cho, Jonghae Park, H. Jin Kim

    Abstract: Recent curriculum Reinforcement Learning (RL) has shown notable progress in solving complex tasks by proposing sequences of surrogate tasks. However, the previous approaches often face challenges when they generate curriculum goals in a high-dimensional space. Thus, they usually rely on manually specified goal spaces. To alleviate this limitation and improve the scalability of the curriculum, we p… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: Accepted to NeurIPS 2023

  11. arXiv:2310.06333  [pdf, ps, other

    cs.LG cs.DS math.PR math.ST stat.ML

    Learning bounded-degree polytrees with known skeleton

    Authors: Davin Choo, Joy Qiping Yang, Arnab Bhattacharyya, Clément L. Canonne

    Abstract: We establish finite-sample guarantees for efficient proper learning of bounded-degree polytrees, a rich class of high-dimensional probability distributions and a subclass of Bayesian networks, a widely-studied type of graphical model. Recently, Bhattacharyya et al. (2021) obtained finite-sample guarantees for recovering tree-structured Bayesian networks, i.e., 1-polytrees. We extend their results… ▽ More

    Submitted 21 January, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: Fixed some typos. Added some discussions. Accepted to ALT 2024

  12. arXiv:2306.05781  [pdf, other

    cs.LG cs.AI cs.DS stat.ME stat.ML

    Adaptivity Complexity for Causal Graph Discovery

    Authors: Davin Choo, Kirankumar Shiragur

    Abstract: Causal discovery from interventional data is an important problem, where the task is to design an interventional strategy that learns the hidden ground truth causal graph $G(V,E)$ on $|V| = n$ nodes while minimizing the number of performed interventions. Most prior interventional strategies broadly fall into two categories: non-adaptive and adaptive. Non-adaptive strategies decide on a single fixe… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: Accepted into UAI 2023

  13. arXiv:2305.19588  [pdf, other

    cs.LG cs.AI cs.DS stat.ML

    Active causal structure learning with advice

    Authors: Davin Choo, Themis Gouleakis, Arnab Bhattacharyya

    Abstract: We introduce the problem of active causal structure learning with advice. In the typical well-studied setting, the learning algorithm is given the essential graph for the observational distribution and is asked to recover the underlying causal directed acyclic graph (DAG) $G^*$ while minimizing the number of interventions made. In our setting, we are additionally given side information about… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: Accepted into ICML 2023

  14. arXiv:2305.09943  [pdf, other

    cs.LG cs.AI cs.RO

    Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum

    Authors: Jigang Kim, Daesol Cho, H. Jin Kim

    Abstract: While reinforcement learning (RL) has achieved great success in acquiring complex skills solely from environmental interactions, it assumes that resets to the initial state are readily available at the end of each episode. Such an assumption hinders the autonomous learning of embodied agents due to the time-consuming and cumbersome workarounds for resetting in the physical world. Hence, there has… ▽ More

    Submitted 8 June, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: ICML 2023, first two authors contributed equally

  15. arXiv:2305.09858  [pdf, other

    cs.IR cs.AI cs.CL cs.LG

    Knowledge Graph Completion Models are Few-shot Learners: An Empirical Study of Relation Labeling in E-commerce with LLMs

    Authors: Jiao Chen, Luyi Ma, Xiaohan Li, Nikhil Thakurdesai, Jianpeng Xu, Jason H. D. Cho, Kaushiki Nag, Evren Korpeoglu, Sushant Kumar, Kannan Achan

    Abstract: Knowledge Graphs (KGs) play a crucial role in enhancing e-commerce system performance by providing structured information about entities and their relationships, such as complementary or substitutable relations between products or product types, which can be utilized in recommender systems. However, relation labeling in KGs remains a challenging task due to the dynamic nature of e-commerce domains… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

  16. arXiv:2305.08269  [pdf, other

    cs.CC cs.CG cs.DS

    The Sharp Power Law of Local Search on Expanders

    Authors: Simina Brânzei, Davin Choo, Nicholas Recker

    Abstract: Local search is a powerful heuristic in optimization and computer science, the complexity of which was studied in the white box and black box models. In the black box model, we are given a graph $G = (V,E)$ and oracle access to a function $f : V \to \mathbb{R}$. The local search problem is to find a vertex $v$ that is a local minimum, i.e. with $f(v) \leq f(u)$ for all $(u,v) \in E$, using as few… ▽ More

    Submitted 15 August, 2023; v1 submitted 14 May, 2023; originally announced May 2023.

  17. arXiv:2305.04445  [pdf, other

    cs.LG cs.AI cs.DS stat.ML

    New metrics and search algorithms for weighted causal DAGs

    Authors: Davin Choo, Kirankumar Shiragur

    Abstract: Recovering causal relationships from data is an important problem. Using observational data, one can typically only recover causal graphs up to a Markov equivalence class and additional assumptions or interventional data are needed for complete recovery. In this work, under some standard assumptions, we study causal graph discovery via adaptive interventions with node-dependent interventional cost… ▽ More

    Submitted 29 May, 2023; v1 submitted 7 May, 2023; originally announced May 2023.

    Comments: Accepted into ICML 2023

  18. arXiv:2305.01905  [pdf, other

    cs.CV

    Localization using Multi-Focal Spatial Attention for Masked Face Recognition

    Authors: Yooshin Cho, Hanbyel Cho, Hyeong Gwon Hong, Jaesung Ahn, Dongmin Cho, JungWoo Chang, Junmo Kim

    Abstract: Since the beginning of world-wide COVID-19 pandemic, facial masks have been recommended to limit the spread of the disease. However, these masks hide certain facial attributes. Hence, it has become difficult for existing face recognition systems to perform identity verification on masked faces. In this context, it is necessary to develop masked Face Recognition (MFR) for contactless biometric reco… ▽ More

    Submitted 7 September, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

    Comments: Accepted at FG 2023 - InterID Workshop

  19. arXiv:2304.06818  [pdf, other

    cs.CV

    Soundini: Sound-Guided Diffusion for Natural Video Editing

    Authors: Seung Hyun Lee, Sieun Kim, Innfarn Yoo, Feng Yang, Donghyeon Cho, Youngseo Kim, Huiwen Chang, Jinkyu Kim, Sangpil Kim

    Abstract: We propose a method for adding sound-guided visual effects to specific regions of videos with a zero-shot setting. Animating the appearance of the visual effect is challenging because each frame of the edited video should have visual changes while maintaining temporal consistency. Moreover, existing video editing solutions focus on temporal consistency across frames, ignoring the visual style vari… ▽ More

    Submitted 13 April, 2023; originally announced April 2023.

  20. arXiv:2301.11741  [pdf, other

    cs.LG cs.AI cs.RO

    Outcome-directed Reinforcement Learning by Uncertainty & Temporal Distance-Aware Curriculum Goal Generation

    Authors: Daesol Cho, Seungjae Lee, H. Jin Kim

    Abstract: Current reinforcement learning (RL) often suffers when solving a challenging exploration problem where the desired outcomes or high rewards are rarely observed. Even though curriculum RL, a framework that solves complex tasks by proposing a sequence of surrogate tasks, shows reasonable results, most of the previous works still have difficulty in proposing curriculum due to the absence of a mechani… ▽ More

    Submitted 20 February, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: ICLR 2023 Spotlight. First two authors contributed equally

  21. arXiv:2301.03180  [pdf, other

    cs.LG cs.DS stat.ML

    Subset verification and search algorithms for causal DAGs

    Authors: Davin Choo, Kirankumar Shiragur

    Abstract: Learning causal relationships between variables is a fundamental task in causal inference and directed acyclic graphs (DAGs) are a popular choice to represent the causal relationships. As one can recover a causal graph only up to its Markov equivalence class from observations, interventions are often used for the recovery task. Interventions are costly in general and it is important to design algo… ▽ More

    Submitted 13 February, 2024; v1 submitted 9 January, 2023; originally announced January 2023.

    Comments: Accepted into AISTATS 2023 (https://aistats.org/aistats2023/accepted.html)

  22. arXiv:2211.16465  [pdf, other

    cs.HC

    "I Want to Figure Things Out": Supporting Exploration in Navigation for People with Visual Impairments

    Authors: Gaurav Jain, Yuanyang Teng, Dong Heon Cho, Yunhao Xing, Maryam Aziz, Brian A. Smith

    Abstract: Navigation assistance systems (NASs) aim to help visually impaired people (VIPs) navigate unfamiliar environments. Most of today's NASs support VIPs via turn-by-turn navigation, but a growing body of work highlights the importance of exploration as well. It is unclear, however, how NASs should be designed to help VIPs explore unfamiliar environments. In this paper, we perform a qualitative study t… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: To appear in the Proceedings of the ACM on Human-Computer Interaction, CSCW1, April 2023 issue. To be presented at CSCW 2023

  23. arXiv:2211.13291  [pdf, ps, other

    cs.LG cs.DS math.PR math.ST

    Learning and Testing Latent-Tree Ising Models Efficiently

    Authors: Davin Choo, Yuval Dagan, Constantinos Daskalakis, Anthimos Vardis Kandiros

    Abstract: We provide time- and sample-efficient algorithms for learning and testing latent-tree Ising models, i.e. Ising models that may only be observed at their leaf nodes. On the learning side, we obtain efficient algorithms for learning a tree-structured Ising model whose leaf node distribution is close in Total Variation Distance, improving on the results of prior work. On the testing side, we provide… ▽ More

    Submitted 10 July, 2023; v1 submitted 23 November, 2022; originally announced November 2022.

  24. arXiv:2211.07077  [pdf, other

    cs.CV

    IFQA: Interpretable Face Quality Assessment

    Authors: Byungho Jo, Donghyeon Cho, In Kyu Park, Sungeun Hong

    Abstract: Existing face restoration models have relied on general assessment metrics that do not consider the characteristics of facial regions. Recent works have therefore assessed their methods using human studies, which is not scalable and involves significant effort. This paper proposes a novel face-centric metric based on an adversarial framework where a generator simulates face restoration and a discr… ▽ More

    Submitted 16 November, 2022; v1 submitted 13 November, 2022; originally announced November 2022.

    Comments: WACV 2023, Code: https://github.com/VCLLab/IFQA

  25. arXiv:2210.07760  [pdf, other

    cs.CV

    Lightweight Alpha Matting Network Using Distillation-Based Channel Pruning

    Authors: Donggeun Yoon, Jinsun Park, Donghyeon Cho

    Abstract: Recently, alpha matting has received a lot of attention because of its usefulness in mobile applications such as selfies. Therefore, there has been a demand for a lightweight alpha matting model due to the limited computational resources of commercial portable devices. To this end, we suggest a distillation-based channel pruning method for the alpha matting networks. In the pruning step, we remove… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: Accepted by ACCV2022

  26. arXiv:2209.15256  [pdf, other

    cs.LG cs.CV cs.RO

    S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement Learning

    Authors: Daesol Cho, Dongseok Shim, H. Jin Kim

    Abstract: Offline reinforcement learning (Offline RL) suffers from the innate distributional shift as it cannot interact with the physical environment during training. To alleviate such limitation, state-based offline RL leverages a learned dynamics model from the logged experience and augments the predicted state transition to extend the data distribution. For exploiting such benefit also on the image-base… ▽ More

    Submitted 30 September, 2022; originally announced September 2022.

    Comments: NeurIPS 2022, first two authors contributed equally

  27. arXiv:2209.05968  [pdf, other

    cs.CV

    Weakly-Supervised Stitching Network for Real-World Panoramic Image Generation

    Authors: Dae-Young Song, Geonsoo Lee, HeeKyung Lee, Gi-Mun Um, Donghyeon Cho

    Abstract: Recently, there has been growing attention on an end-to-end deep learning-based stitching model. However, the most challenging point in deep learning-based stitching is to obtain pairs of input images with a narrow field of view and ground truth images with a wide field of view captured from real-world scenes. To overcome this difficulty, we develop a weakly-supervised learning mechanism to train… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: Accepted by ECCV2022 (poster)

  28. arXiv:2208.13068  [pdf, other

    cs.DB cs.DC

    Apiary: A DBMS-Integrated Transactional Function-as-a-Service Framework

    Authors: Peter Kraft, Qian Li, Kostis Kaffes, Athinagoras Skiadopoulos, Deeptaanshu Kumar, Danny Cho, Jason Li, Robert Redmond, Nathan Weckwerth, Brian Xia, Peter Bailis, Michael Cafarella, Goetz Graefe, Jeremy Kepner, Christos Kozyrakis, Michael Stonebraker, Lalith Suresh, Xiangyao Yu, Matei Zaharia

    Abstract: Developers increasingly use function-as-a-service (FaaS) platforms for data-centric applications that perform low-latency and transactional operations on data, such as for microservices or web serving. Unfortunately, existing FaaS platforms support these applications poorly because they physically and logically separate application logic, executed in cloud functions, from data management, done in… ▽ More

    Submitted 30 June, 2023; v1 submitted 27 August, 2022; originally announced August 2022.

    Comments: 14 pages, 13 figures, 3 tables. Preprint

  29. arXiv:2206.15374  [pdf, other

    cs.LG cs.DM cs.DS stat.ML

    Verification and search algorithms for causal DAGs

    Authors: Davin Choo, Kirankumar Shiragur, Arnab Bhattacharyya

    Abstract: We study two problems related to recovering causal graphs from interventional data: (i) $\textit{verification}$, where the task is to check if a purported causal graph is correct, and (ii) $\textit{search}$, where the task is to recover the correct causal graph. For both, we wish to minimize the number of interventions performed. For the first problem, we give a characterization of a minimal sized… ▽ More

    Submitted 9 October, 2022; v1 submitted 30 June, 2022; originally announced June 2022.

  30. arXiv:2206.02607  [pdf, other

    cs.LG cs.CE cs.GR math.NA physics.comp-ph

    CROM: Continuous Reduced-Order Modeling of PDEs Using Implicit Neural Representations

    Authors: Peter Yichen Chen, Jinxu Xiang, Dong Heon Cho, Yue Chang, G A Pershing, Henrique Teles Maia, Maurizio M. Chiaramonte, Kevin Carlberg, Eitan Grinspun

    Abstract: The long runtime of high-fidelity partial differential equation (PDE) solvers makes them unsuitable for time-critical applications. We propose to accelerate PDE solvers using reduced-order modeling (ROM). Whereas prior ROM approaches reduce the dimensionality of discretized vector fields, our continuous reduced-order modeling (CROM) approach builds a low-dimensional embedding of the continuous vec… ▽ More

    Submitted 3 March, 2023; v1 submitted 6 June, 2022; originally announced June 2022.

  31. Unsupervised Reinforcement Learning for Transferable Manipulation Skill Discovery

    Authors: Daesol Cho, Jigang Kim, H. Jin Kim

    Abstract: Current reinforcement learning (RL) in robotics often experiences difficulty in generalizing to new downstream tasks due to the innate task-specific training paradigm. To alleviate it, unsupervised RL, a framework that pre-trains the agent in a task-agnostic manner without access to the task-specific reward, leverages active exploration for distilling diverse experience into essential skills or re… ▽ More

    Submitted 29 April, 2022; originally announced April 2022.

    Comments: 8 pages, 9 figures; accepted for publication in the IEEE Robotics and Automation Letters (RA-L); supplementary video available at https://www.youtube.com/watch?v=bF3Y4WXfM7c&t=9s

    Journal ref: IEEE Robotics and Automation Letters 7 (2022) 7455-7462

  32. Automating Reinforcement Learning with Example-based Resets

    Authors: Jigang Kim, J. hyeon Park, Daesol Cho, H. Jin Kim

    Abstract: Deep reinforcement learning has enabled robots to learn motor skills from environmental interactions with minimal to no prior knowledge. However, existing reinforcement learning algorithms assume an episodic setting, in which the agent resets to a fixed initial state distribution at the end of each episode, to successfully train the agents from repeated trials. Such reset mechanism, while trivial… ▽ More

    Submitted 5 April, 2022; v1 submitted 5 April, 2022; originally announced April 2022.

    Comments: 8 pages, 6 figures; accepted for publication in the IEEE Robotics and Automation Letters (RA-L); source code available at https://github.com/jigangkim/autoreset_rl ; supplementary video available at https://youtu.be/himd0Z5b64A

    Journal ref: IEEE Robotics and Automation Letters 7 (2022) 6606-6613

  33. arXiv:2202.07605  [pdf, other

    cs.LG cs.AI

    UserBERT: Modeling Long- and Short-Term User Preferences via Self-Supervision

    Authors: Tianyu Li, Ali Cevahir, Derek Cho, Hao Gong, DuyKhuong Nguyen, Bjorn Stenger

    Abstract: E-commerce platforms generate vast amounts of customer behavior data, such as clicks and purchases, from millions of unique users every day. However, effectively using this data for behavior understanding tasks is challenging because there are usually not enough labels to learn from all users in a supervised manner. This paper extends the BERT model to e-commerce user data for pre-training represe… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

  34. NEAT: A Label Noise-resistant Complementary Item Recommender System with Trustworthy Evaluation

    Authors: Luyi Ma, Jianpeng Xu, Jason H. D. Cho, Evren Korpeoglu, Sushant Kumar, Kannan Achan

    Abstract: The complementary item recommender system (CIRS) recommends the complementary items for a given query item. Existing CIRS models consider the item co-purchase signal as a proxy of the complementary relationship due to the lack of human-curated labels from the huge transaction records. These methods represent items in a complementary embedding space and model the complementary relationship as a poi… ▽ More

    Submitted 11 February, 2022; originally announced February 2022.

    Comments: 11 pages, 4 figures; Published in: 2021 IEEE International Conference on Big Data (Big Data)

  35. arXiv:2110.15721  [pdf, ps, other

    cs.CL cs.AI

    Paperswithtopic: Topic Identification from Paper Title Only

    Authors: Daehyun Cho, Christian Wallraven

    Abstract: The deep learning field is growing rapidly as witnessed by the exponential growth of papers submitted to journals, conferences, and pre-print servers. To cope with the sheer number of papers, several text mining tools from natural language processing (NLP) have been proposed that enable researchers to keep track of recent findings. In this context, our paper makes two main contributions: first, we… ▽ More

    Submitted 31 March, 2022; v1 submitted 9 October, 2021; originally announced October 2021.

  36. arXiv:2110.03909  [pdf, other

    cs.LG cs.CV

    Meta-Learning with Task-Adaptive Loss Function for Few-Shot Learning

    Authors: Sungyong Baik, Janghoon Choi, Heewon Kim, Dohee Cho, Jaesik Min, Kyoung Mu Lee

    Abstract: In few-shot learning scenarios, the challenge is to generalize and perform well on new unseen examples when only very few labeled examples are available for each task. Model-agnostic meta-learning (MAML) has gained the popularity as one of the representative few-shot learning methods for its flexibility and applicability to diverse problems. However, MAML and its variants often resort to a simple… ▽ More

    Submitted 17 October, 2021; v1 submitted 8 October, 2021; originally announced October 2021.

    Comments: ICCV 2021 (Oral). Code at https://github.com/baiksung/MeTAL

  37. arXiv:2109.09265  [pdf, other

    cs.LG cs.MS stat.ML

    Merlion: A Machine Learning Library for Time Series

    Authors: Aadyot Bhatnagar, Paul Kassianik, Chenghao Liu, Tian Lan, Wenzhuo Yang, Rowan Cassius, Doyen Sahoo, Devansh Arpit, Sri Subramanian, Gerald Woo, Amrita Saha, Arun Kumar Jagota, Gokulakrishnan Gopalakrishnan, Manpreet Singh, K C Krithika, Sukumar Maddineni, Daeki Cho, Bo Zong, Yingbo Zhou, Caiming Xiong, Silvio Savarese, Steven Hoi, Huan Wang

    Abstract: We introduce Merlion, an open-source machine learning library for time series. It features a unified interface for many commonly used models and datasets for anomaly detection and forecasting on both univariate and multivariate time series, along with standard pre/post-processing layers. It has several modules to improve ease-of-use, including visualization, anomaly score calibration to improve in… ▽ More

    Submitted 19 September, 2021; originally announced September 2021.

    Comments: 22 pages, 1 figure, 14 tables

  38. arXiv:2107.10450  [pdf, other

    cs.DS cs.LG math.ST stat.ML

    Learning Sparse Fixed-Structure Gaussian Bayesian Networks

    Authors: Arnab Bhattacharyya, Davin Choo, Rishikesh Gajjala, Sutanu Gayen, Yuhao Wang

    Abstract: Gaussian Bayesian networks (a.k.a. linear Gaussian structural equation models) are widely used to model causal interactions among continuous variables. In this work, we study the problem of learning a fixed-structure Gaussian Bayesian network up to a bounded error in total variation distance. We analyze the commonly used node-wise least squares regression (LeastSquares) and prove that it has a nea… ▽ More

    Submitted 18 October, 2022; v1 submitted 22 July, 2021; originally announced July 2021.

    Comments: 30 pages, 11 figures, acknowledgement added

  39. arXiv:2106.06308  [pdf, other

    cs.LG cs.CC cs.DS math.ST stat.ML

    The Complexity of Sparse Tensor PCA

    Authors: Davin Choo, Tommaso d'Orsi

    Abstract: We study the problem of sparse tensor principal component analysis: given a tensor $\pmb Y = \pmb W + λx^{\otimes p}$ with $\pmb W \in \otimes^p\mathbb{R}^n$ having i.i.d. Gaussian entries, the goal is to recover the $k$-sparse unit vector $x \in \mathbb{R}^n$. The model captures both sparse PCA (in its Wigner form) and tensor PCA. For the highly sparse regime of $k \leq \sqrt{n}$, we present a… ▽ More

    Submitted 2 November, 2021; v1 submitted 11 June, 2021; originally announced June 2021.

  40. arXiv:2102.11660  [pdf, other

    cs.DC cs.DS

    Massively Parallel Correlation Clustering in Bounded Arboricity Graphs

    Authors: Mélanie Cambus, Davin Choo, Havu Miikonen, Jara Uitto

    Abstract: Identifying clusters of similar elements in a set is a common task in data analysis. With the immense growth of data and physical limitations on single processor speed, it is necessary to find efficient parallel algorithms for clustering tasks. In this paper, we study the problem of correlation clustering in bounded arboricity graphs with respect to the Massively Parallel Computation (MPC) model.… ▽ More

    Submitted 6 August, 2021; v1 submitted 23 February, 2021; originally announced February 2021.

  41. arXiv:2008.12258  [pdf, other

    cs.LG

    Learning to Profile: User Meta-Profile Network for Few-Shot Learning

    Authors: Hao Gong, Qifang Zhao, Tianyu Li, Derek Cho, DuyKhuong Nguyen

    Abstract: Meta-learning approaches have shown great success in vision and language domains. However, few studies discuss the practice of meta-learning for large-scale industrial applications. Although e-commerce companies have spent many efforts on learning representations to provide a better user experience, we argue that such efforts cannot be stopped at this step. In addition to learning a strong profile… ▽ More

    Submitted 9 October, 2020; v1 submitted 20 August, 2020; originally announced August 2020.

    Comments: 8-page Applied Research Paper accepted by CIKM 2020

  42. arXiv:2008.05721  [pdf, other

    cs.CV

    Learning Temporally Invariant and Localizable Features via Data Augmentation for Video Recognition

    Authors: Taeoh Kim, Hyeongmin Lee, MyeongAh Cho, Ho Seong Lee, Dong Heon Cho, Sangyoun Lee

    Abstract: Deep-Learning-based video recognition has shown promising improvements along with the development of large-scale datasets and spatiotemporal network architectures. In image recognition, learning spatially invariant features is a key factor in improving recognition performance and robustness. Data augmentation based on visual inductive priors, such as cropping, flipping, rotating, or photometric ji… ▽ More

    Submitted 13 August, 2020; originally announced August 2020.

    Comments: European Conference on Computer Vision (ECCV) 2020, 1st Visual Inductive Priors for Data-Efficient Deep Learning Workshop (Oral)

  43. arXiv:2007.05181  [pdf, other

    cs.LG stat.ML

    Sample-based Regularization: A Transfer Learning Strategy Toward Better Generalization

    Authors: Yunho Jeon, Yongseok Choi, Jaesun Park, Subin Yi, Dongyeon Cho, Jiwon Kim

    Abstract: Training a deep neural network with a small amount of data is a challenging problem as it is vulnerable to overfitting. However, one of the practical difficulties that we often face is to collect many samples. Transfer learning is a cost-effective solution to this problem. By using the source model trained with a large-scale dataset, the target model can alleviate the overfitting originated from t… ▽ More

    Submitted 10 July, 2020; originally announced July 2020.

  44. arXiv:2007.01524  [pdf, other

    cs.CV cs.LG eess.IV

    Domain Adaptation without Source Data

    Authors: Youngeun Kim, Donghyeon Cho, Kyeongtak Han, Priyadarshini Panda, Sungeun Hong

    Abstract: Domain adaptation assumes that samples from source and target domains are freely accessible during a training phase. However, such an assumption is rarely plausible in the real-world and possibly causes data-privacy issues, especially when the label of the source domain can be a sensitive attribute as an identifier. To avoid accessing source data that may contain sensitive information, we introduc… ▽ More

    Submitted 30 August, 2021; v1 submitted 3 July, 2020; originally announced July 2020.

    Comments: 13 pages

  45. arXiv:2003.09972  [pdf, other

    math.PR cs.DC

    Distributed Computation with Continual Population Growth

    Authors: Da-Jung Cho, Matthias Függer, Corbin Hopper, Manish Kushwaha, Thomas Nowak, Quentin Soubeyran

    Abstract: Computing with synthetically engineered bacteria is a vibrant and active field with numerous applications in bio-production, bio-sensing, and medicine. Motivated by the lack of robustness and by resource limitation inside single cells, distributed approaches with communication among bacteria have recently gained in interest. In this paper, we focus on the problem of population growth happening con… ▽ More

    Submitted 14 May, 2020; v1 submitted 22 March, 2020; originally announced March 2020.

  46. arXiv:2003.04721  [pdf, other

    cs.CV

    Restore from Restored: Single Image Denoising with Pseudo Clean Image

    Authors: Seunghwan Lee, Dongkyu Lee, Donghyeon Cho, Jiwon Kim, Tae Hyun Kim

    Abstract: In this study, we propose a simple and effective fine-tuning algorithm called "restore-from-restored", which can greatly enhance the performance of fully pre-trained image denoising networks. Many supervised denoising approaches can produce satisfactory results using large external training datasets. However, these methods have limitations in using internal information available in a given test im… ▽ More

    Submitted 18 November, 2020; v1 submitted 9 March, 2020; originally announced March 2020.

  47. arXiv:2003.04279  [pdf, other

    cs.CV

    Restore from Restored: Video Restoration with Pseudo Clean Video

    Authors: Seunghwan Lee, Donghyeon Cho, Jiwon Kim, Tae Hyun Kim

    Abstract: In this study, we propose a self-supervised video denoising method called "restore-from-restored." This method fine-tunes a pre-trained network by using a pseudo clean video during the test phase. The pseudo clean video is obtained by applying a noisy video to the baseline network. By adopting a fully convolutional neural network (FCN) as the baseline, we can improve video denoising performance wi… ▽ More

    Submitted 15 March, 2021; v1 submitted 9 March, 2020; originally announced March 2020.

    Comments: CVPR 2021

  48. arXiv:2002.07784  [pdf, ps, other

    cs.DS cs.LG

    k-means++: few more steps yield constant approximation

    Authors: Davin Choo, Christoph Grunau, Julian Portmann, Václav Rozhoň

    Abstract: The k-means++ algorithm of Arthur and Vassilvitskii (SODA 2007) is a state-of-the-art algorithm for solving the k-means clustering problem and is known to give an O(log k)-approximation in expectation. Recently, Lattanzi and Sohler (ICML 2019) proposed augmenting k-means++ with O(k log log k) local search steps to yield a constant approximation (in expectation) to the k-means clustering problem. I… ▽ More

    Submitted 18 February, 2020; originally announced February 2020.

  49. arXiv:2001.06933  [pdf, other

    cs.DB cs.CR

    Fides: Managing Data on Untrusted Infrastructure

    Authors: Sujaya Maiyya, Danny Hyun Bum Cho, Divyakant Agrawal, Amr El Abbadi

    Abstract: Significant amounts of data are currently being stored and managed on third-party servers. It is impractical for many small scale enterprises to own their private datacenters, hence renting third-party servers is a viable solution for such businesses. But the increasing number of malicious attacks, both internal and external, as well as buggy software on third-party servers is causing clients to l… ▽ More

    Submitted 19 January, 2020; originally announced January 2020.

    Comments: 14pages, 15 figures/graphs

  50. arXiv:2001.02905  [pdf, other

    cs.CV

    Fast Adaptation to Super-Resolution Networks via Meta-Learning

    Authors: Seobin Park, Jinsu Yoo, Donghyeon Cho, Jiwon Kim, Tae Hyun Kim

    Abstract: Conventional supervised super-resolution (SR) approaches are trained with massive external SR datasets but fail to exploit desirable properties of the given test image. On the other hand, self-supervised SR approaches utilize the internal information within a test image but suffer from computational complexity in run-time. In this work, we observe the opportunity for further improvement of the per… ▽ More

    Submitted 25 August, 2020; v1 submitted 9 January, 2020; originally announced January 2020.