Showing 1–36 of 36 results for author: So, J

  1. arXiv:2406.04175  [pdf, other]

    cs.CL cs.AI

    Confabulation: The Surprising Value of Large Language Model Hallucinations

    Authors: Peiqi Sui, Eamon Duede, Sophie Wu, Richard Jean So

    Abstract: This paper presents a systematic defense of large language model (LLM) hallucinations or 'confabulations' as a potential resource instead of a categorically negative pitfall. The standard view is that confabulations are inherently problematic and AI research should eliminate this flaw. In this paper, we argue and empirically demonstrate that measurable semantic characteristics of LLM confabulation…

    Submitted 25 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

    Comments: Forthcoming at ACL2024 main conference. 1 figure

  2. SMCL: Saliency Masked Contrastive Learning for Long-tailed Recognition

    Authors: Sanglee Park, Seung-won Hwang, Jungmin So

    Abstract: Real-world data often follow a long-tailed distribution with a high imbalance in the number of samples between classes. The problem with training from imbalanced data is that some background features, common to all classes, can be unobserved in classes with scarce samples. As a result, this background correlates to biased predictions into "major" classes. In this paper, we propose saliency masked…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: accepted at ICASSP 2023

  3. arXiv:2406.01801  [pdf, other]

    stat.ML cs.LG

    Fearless Stochasticity in Expectation Propagation

    Authors: Jonathan So, Richard E. Turner

    Abstract: Expectation propagation (EP) is a family of algorithms for performing approximate inference in probabilistic models. The updates of EP involve the evaluation of moments -- expectations of certain functions -- which can be estimated from Monte Carlo (MC) samples. However, the updates are not robust to MC noise when performed naively, and various prior works have attempted to address this issue in d…

    Submitted 3 June, 2024; originally announced June 2024.

  4. arXiv:2405.16088  [pdf, ps, other]

    math.ST cs.LG stat.ML

    Estimating the normal-inverse-Wishart distribution

    Authors: Jonathan So

    Abstract: The normal-inverse-Wishart (NIW) distribution is commonly used as a prior distribution for the mean and covariance parameters of a multivariate normal distribution. The family of NIW distributions is also a minimal exponential family. In this short note we describe a convergent procedure for converting from mean parameters to natural parameters in the NIW family, or -- equivalently -- for performi…

    Submitted 3 June, 2024; v1 submitted 25 May, 2024; originally announced May 2024.

  5. arXiv:2403.17428  [pdf, other]

    cs.AI cs.CL

    Aligning Large Language Models for Enhancing Psychiatric Interviews through Symptom Delineation and Summarization

    Authors: Jae-hee So, Joonhwan Chang, Eunji Kim, Junho Na, JiYeon Choi, Jy-yong Sohn, Byung-Hoon Kim, Sang Hui Chu

    Abstract: Recent advancements in Large Language Models (LLMs) have accelerated their usage in various domains. Given the fact that psychiatric interviews are goal-oriented and structured dialogues between the professional interviewer and the interviewee, it is one of the most underexplored areas where LLMs can contribute substantial value. Here, we explore the use of LLMs for enhancing psychiatric interview…

    Submitted 26 March, 2024; originally announced March 2024.

  6. arXiv:2403.00299  [pdf, ps, other]

    cs.IT cs.AI cs.LG eess.SP

    Universal Auto-encoder Framework for MIMO CSI Feedback

    Authors: Jinhyun So, Hyukjoon Kwon

    Abstract: Existing auto-encoder (AE)-based channel state information (CSI) frameworks have focused on a specific configuration of user equipment (UE) and base station (BS), and thus the input and output sizes of the AE are fixed. However, in the real-world scenario, the input and output sizes may vary depending on the number of antennas of the BS and UE and the allocated resource block in the frequency dime…

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: 7 pages, 11 figures

  7. arXiv:2401.00025  [pdf, other]

    cs.RO cs.CV

    Any-point Trajectory Modeling for Policy Learning

    Authors: Chuan Wen, Xingyu Lin, John So, Kai Chen, Qi Dou, Yang Gao, Pieter Abbeel

    Abstract: Learning from demonstration is a powerful method for teaching robots new skills, and having more demonstration data often improves policy learning. However, the high cost of collecting demonstration data is a significant bottleneck. Videos, as a rich data source, contain knowledge of behaviors, physics, and semantics, but extracting control-specific information from them is challenging due to the…

    Submitted 12 July, 2024; v1 submitted 28 December, 2023; originally announced January 2024.

    Comments: 18 pages, 15 figures

  8. arXiv:2312.03517  [pdf, other]

    cs.CV cs.AI

    FRDiff : Feature Reuse for Universal Training-free Acceleration of Diffusion Models

    Authors: Junhyuk So, Jungwon Lee, Eunhyeok Park

    Abstract: The substantial computational costs of diffusion models, especially due to the repeated denoising steps necessary for high-quality image generation, present a major obstacle to their widespread adoption. While several studies have attempted to address this issue by reducing the number of score function evaluations (NFE) using advanced ODE solvers without fine-tuning, the decreased number of denois…

    Submitted 2 April, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: Work in progress. Project page : https://jungwon-lee.github.io/Project_FRDiff/

  9. arXiv:2311.16849  [pdf, other]

    stat.ML cs.LG

    Identifiable Feature Learning for Spatial Data with Nonlinear ICA

    Authors: Hermanni Hälvä, Jonathan So, Richard E. Turner, Aapo Hyvärinen

    Abstract: Recently, nonlinear ICA has surfaced as a popular alternative to the many heuristic models used in deep representation learning and disentanglement. An advantage of nonlinear ICA is that a sophisticated identifiability theory has been developed; in particular, it has been proven that the original components can be recovered under sufficiently strong latent dependencies. Despite this general theory…

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: Work under review

  10. arXiv:2310.11837  [pdf, other]

    stat.ML cs.LG

    Optimising Distributions with Natural Gradient Surrogates

    Authors: Jonathan So, Richard E. Turner

    Abstract: Natural gradient methods have been used to optimise the parameters of probability distributions in a variety of settings, often resulting in fast-converging procedures. Unfortunately, for many distributions of interest, computing the natural gradient has a number of challenges. In this work we propose a novel technique for tackling such issues, which involves reframing the optimisation as one with…

    Submitted 4 March, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

    Journal ref: PMLR 238 (2024):2224-2232

  11. arXiv:2308.13327  [pdf, other]

    cs.CV

    3D Face Alignment Through Fusion of Head Pose Information and Features

    Authors: Jaehyun So, Youngjoon Han

    Abstract: The ability of humans to infer head poses from face shapes, and vice versa, indicates a strong correlation between the two. Accordingly, recent studies on face alignment have employed head pose information to predict facial landmarks in computer vision tasks. In this study, we propose a novel method that employs head pose information to improve face alignment performance by fusing said information…

    Submitted 25 August, 2023; originally announced August 2023.

  12. arXiv:2307.03567  [pdf, other]

    cs.RO cs.CV

    SpawnNet: Learning Generalizable Visuomotor Skills from Pre-trained Networks

    Authors: Xingyu Lin, John So, Sashwat Mahalingam, Fangchen Liu, Pieter Abbeel

    Abstract: The existing internet-scale image and video datasets cover a wide range of everyday objects and tasks, bringing the potential of learning policies that generalize in diverse scenarios. Prior works have explored visual pre-training with different self-supervised objectives. Still, the generalization capabilities of the learned policies and the advantages over well-tuned baselines remain unclear fro…

    Submitted 21 October, 2023; v1 submitted 7 July, 2023; originally announced July 2023.

  13. arXiv:2306.02316  [pdf, other]

    cs.CV

    Temporal Dynamic Quantization for Diffusion Models

    Authors: Junhyuk So, Jungwon Lee, Daehyun Ahn, Hyungjun Kim, Eunhyeok Park

    Abstract: The diffusion model has gained popularity in vision applications due to its remarkable generative performance and versatility. However, high storage and computation demands, resulting from the model size and iterative generation, hinder its use on mobile devices. Existing quantization techniques struggle to maintain performance even in 8-bit precision due to the diffusion model's unique property o…

    Submitted 11 December, 2023; v1 submitted 4 June, 2023; originally announced June 2023.

  14. arXiv:2210.14721  [pdf, other]

    cs.LG cs.AI

    Sim-to-Real via Sim-to-Seg: End-to-end Off-road Autonomous Driving Without Real Data

    Authors: John So, Amber Xie, Sunggoo Jung, Jeffrey Edlund, Rohan Thakker, Ali Agha-mohammadi, Pieter Abbeel, Stephen James

    Abstract: Autonomous driving is complex, requiring sophisticated 3D scene understanding, localization, mapping, and control. Rather than explicitly modelling and fusing each of these components, we instead consider an end-to-end approach via reinforcement learning (RL). However, collecting exploration driving data in the real world is impractical and dangerous. While training in simulation and deploying vis…

    Submitted 25 October, 2022; originally announced October 2022.

    Comments: CoRL 2022 Paper

  15. arXiv:2208.08812  [pdf, other]

    cs.RO eess.IV

    Automatic laser steering for middle ear surgery

    Authors: Jae-Hun So, Jérôme Szewczyk, Brahim Tamadazte

    Abstract: This paper deals with the control of laser spot in the context of minimally invasive surgery of the middle ear, e.g., cholesteatoma removal. More precisely, our work is concerned with the exhaustive burring of residual infected cells after primary mechanical resection of the pathological tissues since the latter cannot guarantee the treatment of all the infected tissues, the remaining infected cel…

    Submitted 18 August, 2022; originally announced August 2022.

    Comments: 7 pages, 8 figures, conference

  16. arXiv:2206.08743  [pdf, other]

    cs.LG cs.AI cs.CY

    Learning Fair Representation via Distributional Contrastive Disentanglement

    Authors: Changdae Oh, Heeji Won, Junhyuk So, Taero Kim, Yewon Kim, Hosik Choi, Kyungwoo Song

    Abstract: Learning fair representation is crucial for achieving fairness or debiasing sensitive information. Most existing works rely on adversarial representation learning to inject some invariance into representation. However, adversarial learning methods are known to suffer from relatively unstable training, and this might harm the balance between fairness and predictiveness of representation. We propose…

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: Accepted by KDD 2022 (Research Track)

  17. arXiv:2206.00820  [pdf, other]

    cs.LG

    NIPQ: Noise proxy-based Integrated Pseudo-Quantization

    Authors: Juncheol Shin, Junhyuk So, Sein Park, Seungyeop Kang, Sungjoo Yoo, Eunhyeok Park

    Abstract: Straight-through estimator (STE), which enables the gradient flow over the non-differentiable function via approximation, has been favored in studies related to quantization-aware training (QAT). However, STE incurs unstable convergence during QAT, resulting in notable quality degradation in low precision. Recently, pseudoquantization training has been proposed as an alternative approach to updati…

    Submitted 1 July, 2023; v1 submitted 1 June, 2022; originally announced June 2022.

  18. arXiv:2203.03897  [pdf, other]

    cs.CV cs.CL cs.IR cs.LG

    Geodesic Multi-Modal Mixup for Robust Fine-Tuning

    Authors: Changdae Oh, Junhyuk So, Hoyoon Byun, YongTaek Lim, Minchul Shin, Jong-June Jeon, Kyungwoo Song

    Abstract: Pre-trained multi-modal models, such as CLIP, provide transferable embeddings and show promising results in diverse applications. However, the analysis of learned multi-modal embeddings is relatively unexplored, and the embedding transferability can be improved. In this work, we observe that CLIP holds separated embedding subspaces for two different modalities, and then we investigate it through t…

    Submitted 6 November, 2023; v1 submitted 8 March, 2022; originally announced March 2022.

    Comments: To appear at NeurIPS 2023

  19. arXiv:2202.01267  [pdf, other]

    cs.LG cs.DC stat.ML

    FedSpace: An Efficient Federated Learning Framework at Satellites and Ground Stations

    Authors: Jinhyun So, Kevin Hsieh, Behnaz Arzani, Shadi Noghabi, Salman Avestimehr, Ranveer Chandra

    Abstract: Large-scale deployments of low Earth orbit (LEO) satellites collect massive amounts of Earth imagery and sensor data, which can empower machine learning (ML) to address global challenges such as real-time disaster navigation and mitigation. However, it is often infeasible to download all the high-resolution images and train these ML models on the ground because of limited downlink bandwidth, spar…

    Submitted 2 February, 2022; originally announced February 2022.

  20. arXiv:2110.02177  [pdf, other]

    cs.LG cs.CR cs.DC cs.IT stat.ML

    Secure Aggregation for Buffered Asynchronous Federated Learning

    Authors: Jinhyun So, Ramy E. Ali, Başak Güler, A. Salman Avestimehr

    Abstract: Federated learning (FL) typically relies on synchronous training, which is slow due to stragglers. While asynchronous training handles stragglers efficiently, it does not ensure privacy due to the incompatibility with the secure aggregation protocols. A buffered asynchronous training protocol known as FedBuff has been proposed recently which bridges the gap between synchronous and asynchronous tra…

    Submitted 5 October, 2021; originally announced October 2021.

    Comments: arXiv admin note: substantial overlap with arXiv:2109.14236

  21. arXiv:2109.14236  [pdf, other]

    cs.LG cs.CR cs.DC cs.IT stat.ML

    LightSecAgg: a Lightweight and Versatile Design for Secure Aggregation in Federated Learning

    Authors: Jinhyun So, Chaoyang He, Chien-Sheng Yang, Songze Li, Qian Yu, Ramy E. Ali, Basak Guler, Salman Avestimehr

    Abstract: Secure model aggregation is a key component of federated learning (FL) that aims at protecting the privacy of each user's individual model while allowing for their global aggregation. It can be applied to any aggregation-based FL approach for training a global or personalized model. Model aggregation needs to also be resilient against likely user dropouts in FL systems, making its design substanti…

    Submitted 1 February, 2022; v1 submitted 29 September, 2021; originally announced September 2021.

    Comments: This paper is accepted to the 5th MLSys Conference, Santa Clara, CA, USA, 2022

  22. arXiv:2106.09620  [pdf, other]

    stat.ML cs.LG

    Disentangling Identifiable Features from Noisy Data with Structured Nonlinear ICA

    Authors: Hermanni Hälvä, Sylvain Le Corff, Luc Lehéricy, Jonathan So, Yongjie Zhu, Elisabeth Gassiat, Aapo Hyvarinen

    Abstract: We introduce a new general identifiable framework for principled disentanglement referred to as Structured Nonlinear Independent Component Analysis (SNICA). Our contribution is to extend the identifiability theory of deep generative models for a very broad class of structured models. While previous works have shown identifiability for specific classes of time-series models, our theorems extend thi…

    Submitted 27 October, 2021; v1 submitted 17 June, 2021; originally announced June 2021.

    Comments: Accepted for publication at NeurIPS 2021

  23. arXiv:2106.03328  [pdf, other]

    cs.LG cs.CR cs.DC cs.IT

    Securing Secure Aggregation: Mitigating Multi-Round Privacy Leakage in Federated Learning

    Authors: Jinhyun So, Ramy E. Ali, Basak Guler, Jiantao Jiao, Salman Avestimehr

    Abstract: Secure aggregation is a critical component in federated learning (FL), which enables the server to learn the aggregate model of the users without observing their local models. Conventionally, secure aggregation algorithms focus only on ensuring the privacy of individual users in a single training round. We contend that such designs can lead to significant privacy leakages over multiple training ro…

    Submitted 27 July, 2023; v1 submitted 7 June, 2021; originally announced June 2021.

    Journal ref: AAAI 2023

  24. arXiv:2011.05530  [pdf, other]

    cs.LG cs.CR cs.IT

    On Polynomial Approximations for Privacy-Preserving and Verifiable ReLU Networks

    Authors: Ramy E. Ali, Jinhyun So, A. Salman Avestimehr

    Abstract: Outsourcing deep neural networks (DNNs) inference tasks to an untrusted cloud raises data privacy and integrity concerns. While there are many techniques to ensure privacy and integrity for polynomial-based computations, DNNs involve non-polynomial computations. To address these challenges, several privacy-preserving and verifiable inference techniques have been proposed based on replacing the non…

    Submitted 6 February, 2024; v1 submitted 10 November, 2020; originally announced November 2020.

  25. arXiv:2011.01963  [pdf, ps, other]

    cs.LG cs.CR cs.IT stat.ML

    A Scalable Approach for Privacy-Preserving Collaborative Machine Learning

    Authors: Jinhyun So, Basak Guler, A. Salman Avestimehr

    Abstract: We consider a collaborative learning scenario in which multiple data-owners wish to jointly train a logistic regression model, while keeping their individual datasets private from the other parties. We propose COPML, a fully-decentralized training framework that achieves scalability and privacy-protection simultaneously. The key idea of COPML is to securely encode the individual datasets to distri…

    Submitted 3 November, 2020; originally announced November 2020.

  26. arXiv:2010.14282  [pdf, other]

    cs.LG cs.AI cs.IR

    Active Learning for Human-in-the-Loop Customs Inspection

    Authors: Sundong Kim, Tung-Duong Mai, Sungwon Han, Sungwon Park, Thi Nguyen Duc Khanh, Jaechan So, Karandeep Singh, Meeyoung Cha

    Abstract: We study the human-in-the-loop customs inspection scenario, where an AI-assisted algorithm supports customs officers by recommending a set of imported goods to be inspected. If the inspected items are fraudulent, the officers can levy extra duties. The formed logs are then used as additional training data for successive iterations. Choosing to inspect suspicious items first leads to an immediate ga…

    Submitted 23 February, 2022; v1 submitted 27 October, 2020; originally announced October 2020.

    Comments: To Appear at IEEE TKDE

    ACM Class: H.4.0

  27. arXiv:2010.10177  [pdf, other]

    stat.ML cs.LG cs.NE

    Sparse Gaussian Process Variational Autoencoders

    Authors: Matthew Ashman, Jonathan So, Will Tebbutt, Vincent Fortuin, Michael Pearce, Richard E. Turner

    Abstract: Large, multi-dimensional spatio-temporal datasets are omnipresent in modern science and engineering. An effective framework for handling such data is Gaussian process deep generative models (GP-DGMs), which employ GP priors over the latent variables of DGMs. Existing approaches for performing inference in GP-DGMs do not support sparse GP approximations based on inducing points, which are essentia…

    Submitted 23 October, 2020; v1 submitted 20 October, 2020; originally announced October 2020.

    Comments: 19 pages, 6 figures

  28. arXiv:2008.10400  [pdf, other]

    cs.CV cs.LG

    An Ensemble of Simple Convolutional Neural Network Models for MNIST Digit Recognition

    Authors: Sanghyeon An, Minjun Lee, Sanglee Park, Heerin Yang, Jungmin So

    Abstract: We report that a very high accuracy on the MNIST test set can be achieved by using simple convolutional neural network (CNN) models. We use three different models with 3x3, 5x5, and 7x7 kernel size in the convolution layers. Each model consists of a set of convolution layers followed by a single fully connected layer. Every convolution layer uses batch normalization and ReLU activation, and poolin…

    Submitted 4 October, 2020; v1 submitted 12 August, 2020; originally announced August 2020.

    Comments: 10 pages, 12 figures, 7 tables

  29. arXiv:2007.13518  [pdf, other]

    cs.LG stat.ML

    FedML: A Research Library and Benchmark for Federated Machine Learning

    Authors: Chaoyang He, Songze Li, Jinhyun So, Xiao Zeng, Mi Zhang, Hongyi Wang, Xiaoyang Wang, Praneeth Vepakomma, Abhishek Singh, Hang Qiu, Xinghua Zhu, Jianzong Wang, Li Shen, Peilin Zhao, Yan Kang, Yang Liu, Ramesh Raskar, Qiang Yang, Murali Annavaram, Salman Avestimehr

    Abstract: Federated learning (FL) is a rapidly growing research field in machine learning. However, existing FL libraries cannot adequately support diverse algorithmic development; inconsistent dataset and model usage make fair algorithm comparison challenging. In this work, we introduce FedML, an open research library and benchmark to facilitate FL algorithm development and fair performance comparison. Fed…

    Submitted 8 November, 2020; v1 submitted 27 July, 2020; originally announced July 2020.

    Comments: This is FedML white paper V3. Homepage: https://fedml.ai; GitHub: https://github.com/FedML-AI/FedML; In V3, More advanced algorithms and IoT device training are supported, please check here: https://github.com/FedML-AI/FedML/blob/master/fedml_iot/

  30. arXiv:2007.11115  [pdf, ps, other]

    cs.CR cs.DC cs.LG stat.ML

    Byzantine-Resilient Secure Federated Learning

    Authors: Jinhyun So, Basak Guler, A. Salman Avestimehr

    Abstract: Secure federated learning is a privacy-preserving framework to improve machine learning models by training over large volumes of data collected by mobile users. This is achieved through an iterative process where, at each iteration, users update a global model using their local datasets. Each user then masks its local model via random keys, and the masked models are aggregated at a central server…

    Submitted 20 February, 2021; v1 submitted 21 July, 2020; originally announced July 2020.

  31. arXiv:2002.04156  [pdf, ps, other]

    cs.LG cs.CR cs.DC cs.IT stat.ML

    Turbo-Aggregate: Breaking the Quadratic Aggregation Barrier in Secure Federated Learning

    Authors: Jinhyun So, Basak Guler, A. Salman Avestimehr

    Abstract: Federated learning is a distributed framework for training machine learning models over the data residing at mobile devices, while protecting the privacy of individual users. A major bottleneck in scaling federated learning to a large number of users is the overhead of secure model aggregation across many users. In particular, the overhead of the state-of-the-art protocols for secure model aggrega…

    Submitted 20 February, 2021; v1 submitted 10 February, 2020; originally announced February 2020.

  32. arXiv:1902.00641  [pdf, ps, other]

    cs.LG cs.CR cs.IT stat.ML

    CodedPrivateML: A Fast and Privacy-Preserving Framework for Distributed Machine Learning

    Authors: Jinhyun So, Basak Guler, A. Salman Avestimehr

    Abstract: How to train a machine learning model while keeping the data private and secure? We present CodedPrivateML, a fast and scalable approach to this critical problem. CodedPrivateML keeps both the data and the model information-theoretically private, while allowing efficient parallelization of training across distributed workers. We characterize CodedPrivateML's privacy threshold and prove its converg…

    Submitted 20 February, 2021; v1 submitted 2 February, 2019; originally announced February 2019.

  33. arXiv:1808.07617  [pdf, ps, other]

    cs.IT

    Tomlinson-Harashima Precoding-Aided Multi-Antenna Non-Orthogonal Multiple Access

    Authors: Jungho So, Youngchul Sung, Yong H. Lee

    Abstract: In this paper, Tomlinson-Harashima Precoding (THP) is considered for multi-user multiple-input single-output (MU-MISO) non-orthogonal multiple access (NOMA) downlink. Under the hierarchical structure in which multiple clusters each with two users are formed and served in the spatial domain and users in each cluster are served in the power domain, THP is applied to eliminate the inter-cluster inter…

    Submitted 22 August, 2018; originally announced August 2018.

    Comments: 13 pages, 6 figures, double-column, submitted to IEEE J. Sel. Topics in Signal Process

  34. arXiv:1510.07369  [pdf, ps, other]

    cs.IT

    Enhancing Non-Orthogonal Multiple Access By Forming Relaying Broadcast Channels

    Authors: Jungho So, Youngchul Sung

    Abstract: In this paper, using relaying broadcast channels (RBCs) as component channels for non-orthogonal multiple access (NOMA) is proposed to enhance the performance of NOMA in single-input single-output (SISO) cellular downlink systems. To analyze the performance of the proposed scheme, an achievable rate region of an RBC with compress-and-forward (CF) relaying is newly derived based on the recent work o…

    Submitted 26 October, 2015; originally announced October 2015.

    Comments: 29 pages, 5 figures, submitted to IEEE Transactions on Communications

  35. arXiv:1508.04562  [pdf, other]

    cs.CL cs.IR

    Fast, Flexible Models for Discovering Topic Correlation across Weakly-Related Collections

    Authors: Jingwei Zhang, Aaron Gerow, Jaan Altosaar, James Evans, Richard Jean So

    Abstract: Weak topic correlation across document collections with different numbers of topics in individual collections presents challenges for existing cross-collection topic models. This paper introduces two probabilistic topic models, Correlated LDA (C-LDA) and Correlated HDP (C-HDP). These address problems that can arise when analyzing large, asymmetric, and potentially weakly-related collections. Topic…

    Submitted 19 August, 2015; originally announced August 2015.

    Comments: EMNLP 2015

  36. Pilot Signal Design for Massive MIMO Systems: A Received Signal-To-Noise-Ratio-Based Approach

    Authors: Jungho So, Donggun Kim, Yuni Lee, Youngchul Sung

    Abstract: In this paper, the pilot signal design for massive MIMO systems to maximize the training-based received signal-to-noise ratio (SNR) is considered under two channel models: block Gauss-Markov and block independent and identically distributed (i.i.d.) channel models. First, it is shown that under the block Gauss-Markov channel model, the optimal pilot design problem reduces to a semi-definite progra…

    Submitted 12 June, 2014; originally announced June 2014.

    Comments: 5 pages, double column, 1 figure. Submitted to IEEE Signal Processing Letters