Skip to main content

Showing 1–24 of 24 results for author: Ryu, E

  1. arXiv:2407.10454  [pdf, other

    cs.LG math.OC

    Deflated Dynamics Value Iteration

    Authors: Jongmin Lee, Amin Rakhsha, Ernest K. Ryu, Amir-massoud Farahmand

    Abstract: The Value Iteration (VI) algorithm is an iterative procedure to compute the value function of a Markov decision process, and is the basis of many reinforcement learning (RL) algorithms as well. As the error convergence rate of VI as a function of iteration $k$ is $O(γ^k)$, it is slow when the discount factor $γ$ is close to $1$. To accelerate the computation of the value function, we propose Defla… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

  2. arXiv:2405.03958  [pdf, other

    cs.CV cs.AI cs.LG

    Simple Drop-in LoRA Conditioning on Attention Layers Will Improve Your Diffusion Model

    Authors: Joo Young Choi, Jaesung R. Park, Inkyu Park, Jaewoong Cho, Albert No, Ernest K. Ryu

    Abstract: Current state-of-the-art diffusion models employ U-Net architectures containing convolutional and (qkv) self-attention layers. The U-Net processes images while being conditioned on the time embedding input for each sampling step and the class or caption embedding input corresponding to the desired conditional generation. Such conditioning involves scale-and-shift operations to the convolutional la… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  3. arXiv:2403.17199  [pdf, other

    cs.CL

    Extracting Social Support and Social Isolation Information from Clinical Psychiatry Notes: Comparing a Rule-based NLP System and a Large Language Model

    Authors: Braja Gopal Patra, Lauren A. Lepow, Praneet Kasi Reddy Jagadeesh Kumar, Veer Vekaria, Mohit Manoj Sharma, Prakash Adekkanattu, Brian Fennessy, Gavin Hynes, Isotta Landi, Jorge A. Sanchez-Ruiz, Euijung Ryu, Joanna M. Biernacka, Girish N. Nadkarni, Ardesheer Talati, Myrna Weissman, Mark Olfson, J. John Mann, Alexander W. Charney, Jyotishman Pathak

    Abstract: Background: Social support (SS) and social isolation (SI) are social determinants of health (SDOH) associated with psychiatric outcomes. In electronic health records (EHRs), individual-level SS/SI is typically documented as narrative clinical notes rather than structured coded data. Natural language processing (NLP) algorithms can automate the otherwise labor-intensive process of data extraction.… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 2 figures, 3 tables

  4. arXiv:2403.04616  [pdf, other

    cs.GT

    Modeling reputation-based behavioral biases in school choice

    Authors: Jon Kleinberg, Sigal Oren, Emily Ryu, Éva Tardos

    Abstract: A fundamental component in the theoretical school choice literature is the problem a student faces in deciding which schools to apply to. Recent models have considered a set of schools of different selectiveness and a student who is unsure of their strength and can apply to at most $k$ schools. Such models assume that the student cares solely about maximizing the quality of the school that they at… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 22 pages, 8 figures

  5. arXiv:2403.03937  [pdf, ps, other

    cs.GT

    Settling the Competition Complexity of Additive Buyers over Independent Items

    Authors: Mahsa Derakhshan, Emily Ryu, S. Matthew Weinberg, Eric Xue

    Abstract: The competition complexity of an auction setting is the number of additional bidders needed such that the simple mechanism of selling items separately (with additional bidders) achieves greater revenue than the optimal but complex (randomized, prior-dependent, Bayesian-truthful) optimal mechanism without the additional bidders. Our main result settles the competition complexity of $n$ bidders with… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 50 pages

  6. arXiv:2402.11867  [pdf, other

    cs.LG math.OC

    LoRA Training in the NTK Regime has No Spurious Local Minima

    Authors: Uijeong Jang, Jason D. Lee, Ernest K. Ryu

    Abstract: Low-rank adaptation (LoRA) has become the standard approach for parameter-efficient fine-tuning of large language models (LLM), but our theoretical understanding of LoRA has been limited. In this work, we theoretically analyze LoRA fine-tuning in the neural tangent kernel (NTK) regime with $N$ data points, showing: (i) full fine-tuning (without LoRA) admits a low-rank solution of rank… ▽ More

    Submitted 28 May, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: 23 pages

  7. arXiv:2310.18297  [pdf, other

    cs.CV cs.AI

    Image Clustering Conditioned on Text Criteria

    Authors: Sehyun Kwon, Jaeseung Park, Minkyu Kim, Jaewoong Cho, Ernest K. Ryu, Kangwook Lee

    Abstract: Classical clustering methods do not provide users with direct control of the clustering results, and the clustering results may not be consistent with the relevant criterion that a user has in mind. In this work, we present a new methodology for performing image clustering based on user-specified text criteria by leveraging modern vision-language models and large language models. We call our metho… ▽ More

    Submitted 21 February, 2024; v1 submitted 27 October, 2023; originally announced October 2023.

  8. arXiv:2307.02770  [pdf, other

    cs.CV cs.AI

    Censored Sampling of Diffusion Models Using 3 Minutes of Human Feedback

    Authors: TaeHo Yoon, Kibeom Myoung, Keon Lee, Jaewoong Cho, Albert No, Ernest K. Ryu

    Abstract: Diffusion models have recently shown remarkable success in high-quality image generation. Sometimes, however, a pre-trained diffusion model exhibits partial misalignment in the sense that the model can generate good images, but it sometimes outputs undesirable images. If so, we simply need to prevent the generation of the bad images, and we call this task censoring. In this work, we present censor… ▽ More

    Submitted 30 October, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: Published in NeurIPS 2023

  9. arXiv:2305.16569  [pdf, ps, other

    cs.LG math.OC

    Accelerating Value Iteration with Anchoring

    Authors: Jongmin Lee, Ernest K. Ryu

    Abstract: Value Iteration (VI) is foundational to the theory and practice of modern reinforcement learning, and it is known to converge at a $\mathcal{O}(γ^k)$-rate, where $γ$ is the discount factor. Surprisingly, however, the optimal rate for the VI setup was not known, and finding a general acceleration mechanism has been an open problem. In this paper, we present the first accelerated VI for both the Bel… ▽ More

    Submitted 28 October, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Journal ref: Neural Information Processing System 2023

  10. arXiv:2304.13995  [pdf, other

    cs.CV cs.AI

    Rotation and Translation Invariant Representation Learning with Implicit Neural Representations

    Authors: Sehyun Kwon, Joo Young Choi, Ernest K. Ryu

    Abstract: In many computer vision applications, images are acquired with arbitrary or random rotations and translations, and in such setups, it is desirable to obtain semantic representations disentangled from the image orientation. Examples of such applications include semiconductor wafer defect inspection, plankton microscope images, and inference on single-particle cryo-electron microscopy (cryo-EM) micr… ▽ More

    Submitted 12 June, 2023; v1 submitted 27 April, 2023; originally announced April 2023.

  11. arXiv:2302.03239  [pdf, ps, other

    cs.DS cs.GT cs.SI

    Calibrated Recommendations for Users with Decaying Attention

    Authors: Jon Kleinberg, Emily Ryu, Éva Tardos

    Abstract: Recommendation systems capable of providing diverse sets of results are a focus of increasing importance, with motivations ranging from fairness to novelty and other aspects of optimizing user experience. One form of diversity of recent interest is calibration, the notion that personalized recommendations should reflect the full distribution of a user's interests, rather than a single predominant… ▽ More

    Submitted 12 July, 2024; v1 submitted 6 February, 2023; originally announced February 2023.

    Comments: 31 pages, 1 figure, 17th International Symposium on Algorithmic Game Theory (SAGT 2024). This paper incorporates and supersedes our earlier paper arXiv:2203.00233

  12. arXiv:2203.00233  [pdf, ps, other

    cs.DS cs.GT cs.SI

    Ordered Submodularity and its Applications to Diversifying Recommendations

    Authors: Jon Kleinberg, Emily Ryu, Éva Tardos

    Abstract: A fundamental task underlying many important optimization problems, from influence maximization to sensor placement to content recommendation, is to select the optimal group of $k$ items from a larger set. Submodularity has been very effective in allowing approximation algorithms for such subset selection problems. However, in several applications, we are interested not only in the elements of a s… ▽ More

    Submitted 1 March, 2022; originally announced March 2022.

    Comments: 17 pages

  13. arXiv:2202.11910  [pdf, other

    cs.LG

    Robust Probabilistic Time Series Forecasting

    Authors: TaeHo Yoon, Youngsuk Park, Ernest K. Ryu, Yuyang Wang

    Abstract: Probabilistic time series forecasting has played critical role in decision-making processes due to its capability to quantify uncertainties. Deep forecasting models, however, could be prone to input perturbations, and the notion of such perturbations, together with that of robustness, has not even been completely established in the regime of probabilistic forecasting. In this work, we propose a fr… ▽ More

    Submitted 24 February, 2022; originally announced February 2022.

    Comments: AISTATS 2022 camera ready version

  14. arXiv:2202.02981  [pdf, other

    cs.LG math.OC stat.ML

    Neural Tangent Kernel Analysis of Deep Narrow Neural Networks

    Authors: Jongmin Lee, Joo Young Choi, Ernest K. Ryu, Albert No

    Abstract: The tremendous recent progress in analyzing the training dynamics of overparameterized neural networks has primarily focused on wide networks and therefore does not sufficiently address the role of depth in deep learning. In this work, we present the first trainability guarantee of infinitely deep but narrow neural networks. We study the infinite-depth limit of a multilayer perceptron (MLP) with a… ▽ More

    Submitted 27 June, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

    Journal ref: Published in International Conference on Machine Learning, 2022

  15. arXiv:2201.09077  [pdf, other

    cs.CV cs.AI

    LTC-GIF: Attracting More Clicks on Feature-length Sports Videos

    Authors: Ghulam Mujtaba, Jaehyuk Choi, Eun-Seok Ryu

    Abstract: This paper proposes a lightweight method to attract users and increase views of the video by presenting personalized artistic media -- i.e, static thumbnails and animated GIFs. This method analyzes lightweight thumbnail containers (LTC) using computational resources of the client device to recognize personalized events from full-length sports videos. In addition, instead of processing the entire v… ▽ More

    Submitted 22 January, 2022; originally announced January 2022.

  16. LTC-SUM: Lightweight Client-driven Personalized Video Summarization Framework Using 2D CNN

    Authors: Ghulam Mujtaba, Adeel Malik, Eun-Seok Ryu

    Abstract: This paper proposes a novel lightweight thumbnail container-based summarization (LTC-SUM) framework for full feature-length videos. This framework generates a personalized keyshot summary for concurrent users by using the computational resource of the end-user device. State-of-the-art methods that acquire and process entire video data to generate video summaries are highly computationally intensiv… ▽ More

    Submitted 4 October, 2022; v1 submitted 22 January, 2022; originally announced January 2022.

    Comments: 14

    Journal ref: in IEEE Access, vol. 10, pp. 103041-103055, 2022

  17. arXiv:2112.09379  [pdf

    cs.CV

    Enhanced Frame and Event-Based Simulator and Event-Based Video Interpolation Network

    Authors: Adam Radomski, Andreas Georgiou, Thomas Debrunner, Chenghan Li, Luca Longinotti, Minwon Seo, Moosung Kwak, Chang-Woo Shin, Paul K. J. Park, Hyunsurk Eric Ryu, Kynan Eng

    Abstract: Fast neuromorphic event-based vision sensors (Dynamic Vision Sensor, DVS) can be combined with slower conventional frame-based sensors to enable higher-quality inter-frame interpolation than traditional methods relying on fixed motion approximations using e.g. optical flow. In this work we present a new, advanced event simulator that can produce realistic scenes recorded by a camera rig with an ar… ▽ More

    Submitted 17 December, 2021; originally announced December 2021.

    Comments: 10 pages, 19 figures

  18. arXiv:2104.09644  [pdf

    cs.CL cs.AI cs.IR

    Neural Language Models with Distant Supervision to Identify Major Depressive Disorder from Clinical Notes

    Authors: Bhavani Singh Agnikula Kshatriya, Nicolas A Nunez, Manuel Gardea- Resendez, Euijung Ryu, Brandon J Coombes, Sunyang Fu, Mark A Frye, Joanna M Biernacka, Yanshan Wang

    Abstract: Major depressive disorder (MDD) is a prevalent psychiatric disorder that is associated with significant healthcare burden worldwide. Phenotyping of MDD can help early diagnosis and consequently may have significant advantages in patient management. In prior research MDD phenotypes have been extracted from structured Electronic Health Records (EHR) or using Electroencephalographic (EEG) data with t… ▽ More

    Submitted 19 April, 2021; originally announced April 2021.

  19. arXiv:2102.07541  [pdf, other

    cs.LG math.OC

    WGAN with an Infinitely Wide Generator Has No Spurious Stationary Points

    Authors: Albert No, TaeHo Yoon, Sehyun Kwon, Ernest K. Ryu

    Abstract: Generative adversarial networks (GAN) are a widely used class of deep generative models, but their minimax training dynamics are not understood very well. In this work, we show that GANs with a 2-layer infinite-width generator and a 2-layer finite-width discriminator trained with stochastic gradient ascent-descent have no spurious stationary points. We then show that when the width of the generato… ▽ More

    Submitted 9 June, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

    Comments: Published at ICML 2021

  20. arXiv:1906.12141  [pdf, other

    cs.CG

    MGOS: A Library for Molecular Geometry and its Operating System

    Authors: Deok-Soo Kima, Joonghyun Ryua, Youngsong Choa, Mokwon Leeb, Jehyun Cha, Chanyoung Song, Sangwha Kim, Roman A Laskowskid, Kokichi Sugihara, Jong Bhak, Seong Eon Ryu

    Abstract: The geometry of atomic arrangement underpins the structural understanding of molecules in many fields. However, no general framework of mathematical/computational theory for the geometry of atomic arrangement exists. Here we present "Molecular Geometry (MG)" as a theoretical framework accompanied by "MG Operating System (MGOS)" which consists of callable functions implementing the MG theory. MG al… ▽ More

    Submitted 28 June, 2019; originally announced June 2019.

  21. arXiv:1905.10899  [pdf, other

    cs.LG stat.ML

    ODE Analysis of Stochastic Gradient Methods with Optimism and Anchoring for Minimax Problems

    Authors: Ernest K. Ryu, Kun Yuan, Wotao Yin

    Abstract: Despite remarkable empirical success, the training dynamics of generative adversarial networks (GAN), which involves solving a minimax game using stochastic gradients, is still poorly understood. In this work, we analyze last-iterate convergence of simultaneous gradient descent (simGD) and its variants under the assumption of convex-concavity, guided by a continuous-time analysis with differential… ▽ More

    Submitted 11 October, 2020; v1 submitted 26 May, 2019; originally announced May 2019.

  22. arXiv:1905.05406  [pdf, other

    cs.CV eess.IV

    Plug-and-Play Methods Provably Converge with Properly Trained Denoisers

    Authors: Ernest K. Ryu, Jialin Liu, Sicheng Wang, Xiaohan Chen, Zhangyang Wang, Wotao Yin

    Abstract: Plug-and-play (PnP) is a non-convex framework that integrates modern denoising priors, such as BM3D or deep learning-based denoisers, into ADMM or other proximal algorithms. An advantage of PnP is that one can use pre-trained denoisers when there is not sufficient data for end-to-end training. Although PnP has been recently studied extensively with great empirical success, theoretical analysis add… ▽ More

    Submitted 14 May, 2019; originally announced May 2019.

    Comments: Published in the International Conference on Machine Learning, 2019

  23. arXiv:1511.05498  [pdf, ps, other

    cs.NI

    Feasibility Study of Stochastic Streaming with 4K UHD Video Traces

    Authors: Joongheon Kim, Eun-Seok Ryu

    Abstract: This paper performs the feasibility study of stochastic video streaming algorithms with up-to-date 4K ultra-high-definition (UHD) video traces. In previous work, various stochastic video streaming algorithms were proposed which maximize time-average video streaming quality subject to queue stability based on the information of queue-backlog length. The performance improvements with the stochastic… ▽ More

    Submitted 17 November, 2015; originally announced November 2015.

    Comments: Presented at the International Conference on ICT Convergence (ICTC), Jeju Island, Korea, 28 - 30 October 2015

  24. arXiv:1208.2239  [pdf, other

    cs.SI physics.soc-ph

    Stochastic Kronecker Graph on Vertex-Centric BSP

    Authors: Ernest Ryu, Sean Choi

    Abstract: Recently Stochastic Kronecker Graph (SKG), a network generation model, and vertex-centric BSP, a graph processing framework like Pregel, have attracted much attention in the network analysis community. Unfortunately the two are not very well-suited for each other and thus an implementation of SKG on vertex-centric BSP must either be done serially or in an unnatural manner. In this paper, we pres… ▽ More

    Submitted 10 August, 2012; originally announced August 2012.