Skip to main content

Showing 1–12 of 12 results for author: Woo, J O

  1. arXiv:2405.06424  [pdf, other

    cs.CL cs.AI cs.LG

    Improving Instruction Following in Language Models through Proxy-Based Uncertainty Estimation

    Authors: JoonHo Lee, Jae Oh Woo, Juree Seok, Parisa Hassanzadeh, Wooseok Jang, JuYoun Son, Sima Didari, Baruch Gutow, Heng Hao, Hankyu Moon, Wenjun Hu, Yeong-Dae Kwon, Taehee Lee, Seungjai Min

    Abstract: Assessing response quality to instructions in language models is vital but challenging due to the complexity of human language across different contexts. This complexity often results in ambiguous or inconsistent interpretations, making accurate assessment difficult. To address this issue, we propose a novel Uncertainty-aware Reward Model (URM) that introduces a robust uncertainty estimation for t… ▽ More

    Submitted 19 May, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

    Comments: Accepted to ICML 2024

  2. arXiv:2307.10062  [pdf, other

    cs.CV cs.LG

    Unsupervised Accuracy Estimation of Deep Visual Models using Domain-Adaptive Adversarial Perturbation without Source Samples

    Authors: JoonHo Lee, Jae Oh Woo, Hankyu Moon, Kwonho Lee

    Abstract: Deploying deep visual models can lead to performance drops due to the discrepancies between source and target distributions. Several approaches leverage labeled source data to estimate target domain accuracy, but accessing labeled source data is often prohibitively difficult due to data confidentiality or resource limitations on serving devices. Our work proposes a new framework to estimate model… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.

    Comments: Accepted to ICCV 2023

  3. arXiv:2208.04278  [pdf, other

    cs.CV cs.GR cs.LG

    Self-Supervised Contrastive Representation Learning for 3D Mesh Segmentation

    Authors: Ayaan Haque, Hankyu Moon, Heng Hao, Sima Didari, Jae Oh Woo, Patrick Bangert

    Abstract: 3D deep learning is a growing field of interest due to the vast amount of information stored in 3D formats. Triangular meshes are an efficient representation for irregular, non-uniform 3D objects. However, meshes are often challenging to annotate due to their high geometrical complexity. Specifically, creating segmentation masks for meshes is tedious and time-consuming. Therefore, it is desirable… ▽ More

    Submitted 21 December, 2022; v1 submitted 8 August, 2022; originally announced August 2022.

    Comments: AAAI 2023

  4. arXiv:2201.09815  [pdf, other

    cs.IT cs.LG

    Analytic Mutual Information in Bayesian Neural Networks

    Authors: Jae Oh Woo

    Abstract: Bayesian neural networks have successfully designed and optimized a robust neural network model in many application problems, including uncertainty quantification. However, with its recent success, information-theoretic understanding about the Bayesian neural network is still at an early stage. Mutual information is an example of an uncertainty measure in a Bayesian neural network to quantify epis… ▽ More

    Submitted 18 June, 2022; v1 submitted 24 January, 2022; originally announced January 2022.

  5. arXiv:2106.08599  [pdf, other

    cs.CV cs.AI

    PatchNet: Unsupervised Object Discovery based on Patch Embedding

    Authors: Hankyu Moon, Heng Hao, Sima Didari, Jae Oh Woo, Patrick Bangert

    Abstract: We demonstrate that frequently appearing objects can be discovered by training randomly sampled patches from a small number of images (100 to 200) by self-supervision. Key to this approach is the pattern space, a latent space of patterns that represents all possible sub-images of the given image data. The distance structure in the pattern space captures the co-occurrence of patterns due to the fre… ▽ More

    Submitted 16 June, 2021; originally announced June 2021.

    ACM Class: I.2.10; I.4.10; I.5.3

  6. arXiv:2105.14559  [pdf, other

    cs.LG stat.ML

    Active Learning in Bayesian Neural Networks with Balanced Entropy Learning Principle

    Authors: Jae Oh Woo

    Abstract: Acquiring labeled data is challenging in many machine learning applications with limited budgets. Active learning gives a procedure to select the most informative data points and improve data efficiency by reducing the cost of labeling. The info-max learning principle maximizing mutual information such as BALD has been successful and widely adapted in various active learning applications. However,… ▽ More

    Submitted 15 April, 2023; v1 submitted 30 May, 2021; originally announced May 2021.

    Journal ref: International Conference on Learning Representations 2023

  7. arXiv:2103.05109  [pdf, other

    cs.CV cs.LG eess.IV

    Highly Efficient Representation and Active Learning Framework and Its Application to Imbalanced Medical Image Classification

    Authors: Heng Hao, Hankyu Moon, Sima Didari, Jae Oh Woo, Patrick Bangert

    Abstract: We propose a highly data-efficient active learning framework for image classification. Our novel framework combines: (1) unsupervised representation learning of a Convolutional Neural Network and (2) the Gaussian Process (GP) method, in sequence to achieve highly data and label efficient classifications. Moreover, both elements are less sensitive to the prevalent and challenging class imbalance is… ▽ More

    Submitted 20 June, 2022; v1 submitted 24 February, 2021; originally announced March 2021.

    Comments: Published in NeurIPs Data-Centric AI workshop

  8. arXiv:1712.00913  [pdf, ps, other

    math.CO cs.IT math.PR

    Majorization and Rényi Entropy Inequalities via Sperner Theory

    Authors: Mokshay Madiman, Liyao Wang, Jae Oh Woo

    Abstract: A natural link between the notions of majorization and strongly Sperner posets is elucidated. It is then used to obtain a variety of consequences, including new Rényi entropy inequalities for sums of independent, integer-valued random variables.

    Submitted 13 November, 2018; v1 submitted 4 December, 2017; originally announced December 2017.

    Comments: Introduction was completely rewritten and there are numerous corrections. Expansion of background on Sperner theory, and several references are added

    Journal ref: Discrete Mathematics (AEGT 2017 Special issue edited by S. Cioaba, R. Coulter, E. Fiorini, Q. Xiang, F. Pfender), vol. 342, no. 10, pp. 2911-2923, October 2019

  9. arXiv:1711.00881  [pdf, other

    math.PR cs.SI

    On the Steady State of Continuous Time Stochastic Opinion Dynamics with Power Law Confidence

    Authors: Jae Oh Woo, François Baccelli, Sriram Vishwanath

    Abstract: This paper introduces a class of non-linear and continuous-time opinion dynamics model with additive noise and state dependent interaction rates between agents. The model features interaction rates which are proportional to a negative power of opinion distances. We establish a non-local partial differential equation for the distribution of opinion distances and use Mellin transforms to provide an… ▽ More

    Submitted 12 December, 2020; v1 submitted 2 November, 2017; originally announced November 2017.

  10. arXiv:1710.00812  [pdf, ps, other

    math.CO cs.IT math.NT math.PR

    Entropy Inequalities for Sums in Prime Cyclic Groups

    Authors: Mokshay Madiman, Liyao Wang, Jae Oh Woo

    Abstract: Lower bounds for the Rényi entropies of sums of independent random variables taking values in cyclic groups of prime order under permutations are established. The main ingredients of our approach are extended rearrangement inequalities in prime cyclic groups building on Lev (2001), and notions of stochastic ordering. Several applications are developed, including to discrete entropy power inequalit… ▽ More

    Submitted 26 November, 2020; v1 submitted 2 October, 2017; originally announced October 2017.

    Comments: 25 pages

    Journal ref: SIAM J. Discrete Math., 35(3), pp. 1628-1649, 2021

  11. arXiv:1701.02261  [pdf, other

    cs.IT

    An Analytical Framework for Modeling a Spatially Repulsive Cellular Network

    Authors: Chang-Sik Choi, Jae Oh Woo, Jeffrey G. Andrews

    Abstract: We propose a new cellular network model that captures both deterministic and random aspects of base station deployments. Namely, the base station locations are modeled as the superposition of two independent stationary point processes: a random shifted grid with intensity $λ_g$ and a Poisson point process (PPP) with intensity $λ_p$. Grid and PPP deployments are special cases with $λ_p \to 0$ and… ▽ More

    Submitted 29 September, 2017; v1 submitted 9 January, 2017; originally announced January 2017.

    Comments: Submitted to IEEE Transactions on Communications

  12. Redundancy of Exchangeable Estimators

    Authors: Narayana P. Santhanam, Anand D. Sarwate, Jae Oh Woo

    Abstract: Exchangeable random partition processes are the basis for Bayesian approaches to statistical inference in large alphabet settings. On the other hand, the notion of the pattern of a sequence provides an information-theoretic framework for data compression in large alphabet scenarios. Because data compression and parameter estimation are intimately related, we study the redundancy of Bayes estimator… ▽ More

    Submitted 20 October, 2014; v1 submitted 21 July, 2014; originally announced July 2014.

    Comments: 18 pages