Showing 1–27 of 27 results for author: Hsieh, K

  1. arXiv:2406.13578

    cs.CL

    Enhancing Distractor Generation for Multiple-Choice Questions with Retrieval Augmented Pretraining and Knowledge Graph Integration

    Authors: Han-Cheng Yu, Yu-An Shih, Kin-Man Law, Kai-Yu Hsieh, Yu-Chen Cheng, Hsin-Chih Ho, Zih-An Lin, Wen-Chuan Hsu, Yao-Chung Fan

    Abstract: In this paper, we tackle the task of distractor generation (DG) for multiple-choice questions. Our study introduces two key designs. First, we propose retrieval augmented pretraining, which involves refining the language model pretraining to align it more closely with the downstream task of DG. Second, we explore the integration of knowledge graphs to enhance the performance of DG. Throug…

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: Findings at ACL 2024

  2. arXiv:2402.05625

    cs.IT eess.SP

    Coded Many-User Multiple Access via Approximate Message Passing

    Authors: Xiaoqi Liu, Kuan Hsieh, Ramji Venkataramanan

    Abstract: We consider communication over the Gaussian multiple-access channel in the regime where the number of users grows linearly with the codelength. In this regime, schemes based on sparse superposition coding can achieve a near-optimal tradeoff between spectral efficiency and signal-to-noise ratio. However, these schemes are feasible only for small values of user payload. This paper investigates effic…

    Submitted 1 July, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: 23 pages, 8 figures. A shorter version of this paper to appear in the Proceedings of IEEE ISIT 2024

  3. A Versatile Data Fabric for Advanced IoT-Based Remote Health Monitoring

    Authors: Italo Buleje, Vince S. Siu, Kuan Yu Hsieh, Nigel Hinds, Bing Dang, Erhan Bilal, Thanhnha Nguyen, Ellen E. Lee, Colin A. Depp, Jeffrey L. Rogers

    Abstract: This paper presents a data-centric and security-focused data fabric designed for digital health applications. With the increasing interest in digital health research, there has been a surge in the volume of Internet of Things (IoT) data derived from smartphones, wearables, and ambient sensors. Managing this vast amount of data, encompassing diverse data types and varying time scales, is crucial. M…

    Submitted 2 October, 2023; originally announced October 2023.

    Journal ref: 2023 IEEE International Conference on Digital Health (ICDH), Chicago, IL, USA, 2023, pp. 88-90

  4. arXiv:2309.08404

    cs.IT eess.SP

    Bayes-Optimal Estimation in Generalized Linear Models via Spatial Coupling

    Authors: Pablo Pascual Cobo, Kuan Hsieh, Ramji Venkataramanan

    Abstract: We consider the problem of signal estimation in a generalized linear model (GLM). GLMs include many canonical problems in statistical estimation, such as linear regression, phase retrieval, and 1-bit compressed sensing. Recent work has precisely characterized the asymptotic minimum mean-squared error (MMSE) for GLMs with i.i.d. Gaussian sensing matrices. However, in many models there is a signific…

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: 39 pages, 4 figures. A shorter version of this paper appeared in the proceedings of the 2023 IEEE International Symposium on Information Theory

  5. arXiv:2308.06261

    cs.NI cs.AI

    Enhancing Network Management Using Code Generated by Large Language Models

    Authors: Sathiya Kumaran Mani, Yajie Zhou, Kevin Hsieh, Santiago Segarra, Ranveer Chandra, Srikanth Kandula

    Abstract: Analyzing network topologies and communication graphs plays a crucial role in contemporary network management. However, the absence of a cohesive approach leads to a challenging learning curve, heightened errors, and inefficiencies. In this paper, we introduce a novel approach to facilitate a natural-language-based network management experience, utilizing large language models (LLMs) to generate t…

    Submitted 11 August, 2023; originally announced August 2023.

  6. arXiv:2305.13792

    cs.NI

    Mitigating the Performance Impact of Network Failures in Public Clouds

    Authors: Pooria Namyar, Behnaz Arzani, Daniel Crankshaw, Daniel S. Berger, Kevin Hsieh, Srikanth Kandula, Ramesh Govindan

    Abstract: Some faults in data center networks require hours to days to repair because they may need reboots, re-imaging, or manual work by technicians. To reduce traffic impact, cloud providers mitigate the effect of faults, for example, by steering traffic to alternate paths. The state of the art in automatic network mitigations uses simple safety checks and proxy metrics to determine mitigations. SWA…

    Submitted 23 May, 2023; originally announced May 2023.

  7. Health Guardian Platform: A technology stack to accelerate discovery in Digital Health research

    Authors: Bo Wen, Vince S. Siu, Italo Buleje, Kuan Yu Hsieh, Takashi Itoh, Lukas Zimmerli, Nigel Hinds, Elif Eyigoz, Bing Dang, Stefan von Cavallar, Jeffrey L. Rogers

    Abstract: This paper highlights the design philosophy and architecture of the Health Guardian, a platform developed by the IBM Digital Health team to accelerate discoveries of new digital biomarkers and development of digital health technologies. The Health Guardian allows for rapid translation of artificial intelligence (AI) research into cloud-based microservices that can be tested with data from clinical…

    Submitted 10 November, 2022; originally announced November 2022.

    Comments: 6 pages, 3 figures, https://ieeexplore.ieee.org/document/9861047

    Journal ref: IEEE International Conference on Digital Health (ICDH), 2022, pp. 40-46

  8. arXiv:2206.00799

    cs.LG

    Federated Learning under Distributed Concept Drift

    Authors: Ellango Jothimurugesan, Kevin Hsieh, Jianyu Wang, Gauri Joshi, Phillip B. Gibbons

    Abstract: Federated Learning (FL) under distributed concept drift is a largely unexplored area. Although concept drift is itself a well-studied phenomenon, it poses particular challenges for FL, because drifts arise staggered in time and space (across clients). To the best of our knowledge, this work is the first to explicitly study data heterogeneity in both dimensions. We first demonstrate that prior solu…

    Submitted 27 February, 2023; v1 submitted 1 June, 2022; originally announced June 2022.

    Comments: 20 pages. Published in AISTATS 2023

    ACM Class: I.2.6

  9. arXiv:2202.01267

    cs.LG cs.DC stat.ML

    FedSpace: An Efficient Federated Learning Framework at Satellites and Ground Stations

    Authors: Jinhyun So, Kevin Hsieh, Behnaz Arzani, Shadi Noghabi, Salman Avestimehr, Ranveer Chandra

    Abstract: Large-scale deployments of low Earth orbit (LEO) satellites collect massive amounts of Earth imagery and sensor data, which can empower machine learning (ML) to address global challenges such as real-time disaster navigation and mitigation. However, it is often infeasible to download all the high-resolution images and train these ML models on the ground because of limited downlink bandwidth, spar…

    Submitted 2 February, 2022; originally announced February 2022.

  10. arXiv:2110.05554

    cs.NI cs.IT

    Towards a Cost vs. Quality Sweet Spot for Monitoring Networks

    Authors: Nofel Yaseen, Behnaz Arzani, Krishna Chintalapudi, Vaishnavi Ranganathan, Felipe Frujeri, Kevin Hsieh, Daniel Berger, Vincent Liu, Srikanth Kandula

    Abstract: Continuously monitoring a wide variety of performance and fault metrics has become a crucial part of operating large-scale datacenter networks. In this work, we ask whether we can reduce the costs to monitor -- in terms of collection, storage and analysis -- by judiciously controlling how much and which measurements we collect. By positing that we can treat almost all measured signals as sampled t…

    Submitted 11 October, 2021; originally announced October 2021.

  11. arXiv:2102.11267

    cs.LG

    Interpret-able feedback for AutoML systems

    Authors: Behnaz Arzani, Kevin Hsieh, Haoxian Chen

    Abstract: Automated machine learning (AutoML) systems aim to enable training machine learning (ML) models for non-ML experts. A shortcoming of these systems is that when they fail to produce a model with high accuracy, the user has no path to improve the model other than hiring a data scientist or learning ML -- this defeats the purpose of AutoML and limits its adoption. We introduce an interpretable data f…

    Submitted 22 February, 2021; originally announced February 2021.

  12. Near-Optimal Coding for Many-user Multiple Access Channels

    Authors: Kuan Hsieh, Cynthia Rush, Ramji Venkataramanan

    Abstract: This paper considers the Gaussian multiple-access channel (MAC) in the asymptotic regime where the number of users grows linearly with the code length. We propose efficient coding schemes based on random linear models with approximate message passing (AMP) decoding and derive the asymptotic error rate achieved for a given user density, user payload (in bits), and user energy. The tradeoff between…

    Submitted 9 March, 2022; v1 submitted 9 February, 2021; originally announced February 2021.

    Comments: 15 pages, 4 figures. To appear in IEEE Journal on Selected Areas in Information Theory

    Journal ref: IEEE Journal on Selected Areas in Information Theory, vol. 3, no. 1, pp. 21-36, March 2022

  13. arXiv:2012.10557

    cs.DC cs.AI

    Ekya: Continuous Learning of Video Analytics Models on Edge Compute Servers

    Authors: Romil Bhardwaj, Zhengxu Xia, Ganesh Ananthanarayanan, Junchen Jiang, Nikolaos Karianakis, Yuanchao Shu, Kevin Hsieh, Victor Bahl, Ion Stoica

    Abstract: Video analytics applications use edge compute servers for the analytics of the videos (for bandwidth and privacy). Compressed models that are deployed on the edge servers for inference suffer from data drift, where the live video data diverges from the training data. Continuous learning handles data drift by periodically retraining the models on new data. Our work addresses the challenge of jointl…

    Submitted 18 December, 2020; originally announced December 2020.

  14. Drug repurposing for COVID-19 using graph neural network and harmonizing multiple evidence

    Authors: Kanglin Hsieh, Yinyin Wang, Luyao Chen, Zhongming Zhao, Sean Savitz, Xiaoqian Jiang, Jing Tang, Yejin Kim

    Abstract: Amid the pandemic of 2019 novel coronavirus disease (COVID-19) caused by SARS-CoV-2, a vast amount of drug research for prevention and treatment has been quickly conducted, but these efforts have been unsuccessful thus far. Our objective is to prioritize repurposable drugs using a drug repurposing pipeline that systematically integrates multiple SARS-CoV-2 and drug interactions, deep graph neura…

    Submitted 1 February, 2022; v1 submitted 23 September, 2020; originally announced September 2020.

    Comments: 13 pages

    Journal ref: Sci Rep 11, 23179 (2021)

  15. Modulated Sparse Superposition Codes for the Complex AWGN Channel

    Authors: Kuan Hsieh, Ramji Venkataramanan

    Abstract: This paper studies a generalization of sparse superposition codes (SPARCs) for communication over the complex additive white Gaussian noise (AWGN) channel. In a SPARC, the codebook is defined in terms of a design matrix, and each codeword is generated by multiplying the design matrix with a sparse message vector. In the standard SPARC construction, information is encoded in the locations of the…

    Submitted 11 May, 2021; v1 submitted 20 April, 2020; originally announced April 2020.

    Comments: 20 pages, 6 figures. To appear in IEEE Transactions on Information Theory

    Journal ref: IEEE Transactions on Information Theory, vol. 67, no. 7, pp. 4385-4404, July 2021

  16. Capacity-achieving Spatially Coupled Sparse Superposition Codes with AMP Decoding

    Authors: Cynthia Rush, Kuan Hsieh, Ramji Venkataramanan

    Abstract: Sparse superposition codes, also called sparse regression codes (SPARCs), are a class of codes for efficient communication over the AWGN channel at rates approaching the channel capacity. In a standard SPARC, codewords are sparse linear combinations of columns of an i.i.d. Gaussian design matrix, while in a spatially coupled SPARC the design matrix has a block-wise structure, where the variance of…

    Submitted 8 May, 2021; v1 submitted 18 February, 2020; originally announced February 2020.

    Comments: To appear in IEEE Transactions on Information Theory. This version contains proofs of two technical lemmas that were omitted in the journal version

    Journal ref: IEEE Transactions on Information Theory, vol. 67, no. 7, pp. 4446-4484, July 2021

  17. arXiv:1910.08663

    cs.LG cs.DC stat.ML

    Machine Learning Systems for Highly-Distributed and Rapidly-Growing Data

    Authors: Kevin Hsieh

    Abstract: The usability and practicality of any machine learning (ML) application are largely influenced by two critical but hard-to-attain factors: low latency and low cost. Unfortunately, achieving low latency and low cost is very challenging when ML depends on real-world data that are highly distributed and rapidly growing (e.g., data collected by mobile phones and video cameras all over the world). Suc…

    Submitted 18 October, 2019; originally announced October 2019.

  18. arXiv:1910.00189

    cs.LG stat.ML

    The Non-IID Data Quagmire of Decentralized Machine Learning

    Authors: Kevin Hsieh, Amar Phanishayee, Onur Mutlu, Phillip B. Gibbons

    Abstract: Many large-scale machine learning (ML) applications need to perform decentralized learning over datasets generated at different devices and locations. Such datasets pose a significant challenge to decentralized learning because their different contexts result in significant data distribution skew across devices/locations. In this paper, we take a step toward better understanding this challenge by…

    Submitted 18 August, 2020; v1 submitted 30 September, 2019; originally announced October 2019.

    Journal ref: International Conference on Machine Learning (ICML), 2020

  19. arXiv:1805.03154

    cs.AR

    Flexible-Latency DRAM: Understanding and Exploiting Latency Variation in Modern DRAM Chips

    Authors: Kevin K. Chang, Abhijith Kashyap, Hasan Hassan, Saugata Ghose, Kevin Hsieh, Donghyuk Lee, Tianshi Li, Gennady Pekhimenko, Samira Khan, Onur Mutlu

    Abstract: This article summarizes key results of our work on experimental characterization and analysis of latency variation and latency-reliability trade-offs in modern DRAM chips, which was published in SIGMETRICS 2016, and examines the work's significance and future potential. The goal of this work is to (i) experimentally characterize and understand the latency variation across cells within a DRAM chi…

    Submitted 8 May, 2018; originally announced May 2018.

  20. arXiv:1805.02498

    cs.DC

    Decoupling GPU Programming Models from Resource Management for Enhanced Programming Ease, Portability, and Performance

    Authors: Nandita Vijaykumar, Kevin Hsieh, Gennady Pekhimenko, Samira Khan, Ashish Shrestha, Saugata Ghose, Adwait Jog, Phillip B. Gibbons, Onur Mutlu

    Abstract: The application resource specification--a static specification of several parameters such as the number of threads and the scratchpad memory usage per thread block--forms a critical component of modern GPU programming models. This specification determines the parallelism, and hence performance, of the application during execution because the corresponding on-chip hardware resources are allocated a…

    Submitted 2 May, 2018; originally announced May 2018.

    Comments: arXiv admin note: substantial text overlap with arXiv:1802.02573

  21. arXiv:1803.08625

    cs.AI

    A Concept Learning Tool Based On Calculating Version Space Cardinality

    Authors: Kuo-Kai Hsieh, Li-C. Wang

    Abstract: In this paper, we propose VeSC-CoL (Version Space Cardinality based Concept Learning) to deal with concept learning on extremely imbalanced datasets, especially when cross-validation is not a viable option. VeSC-CoL uses version space cardinality as a measure for model quality to replace cross-validation. Instead of naive enumeration of the version space, Ordered Binary Decision Diagram and Boole…

    Submitted 22 March, 2018; originally announced March 2018.

  22. arXiv:1802.02573

    cs.DC cs.AR

    Zorua: Enhancing Programming Ease, Portability, and Performance in GPUs by Decoupling Programming Models from Resource Management

    Authors: Nandita Vijaykumar, Kevin Hsieh, Gennady Pekhimenko, Samira Khan, Ashish Shrestha, Saugata Ghose, Phillip B. Gibbons, Onur Mutlu

    Abstract: The application resource specification--a static specification of several parameters such as the number of threads and the scratchpad memory usage per thread block--forms a critical component of the existing GPU programming models. This specification determines the performance of the application during execution because the corresponding on-chip hardware resources are allocated and managed purely…

    Submitted 7 February, 2018; originally announced February 2018.

    Report number: SAFARI Technical Report 2016-005

  23. arXiv:1802.00320

    cs.AR

    Enabling the Adoption of Processing-in-Memory: Challenges, Mechanisms, Future Research Directions

    Authors: Saugata Ghose, Kevin Hsieh, Amirali Boroumand, Rachata Ausavarungnirun, Onur Mutlu

    Abstract: Poor DRAM technology scaling over the course of many years has caused DRAM-based main memory to increasingly become a larger system bottleneck. A major reason for the bottleneck is that data stored within DRAM must be moved across a pin-limited memory channel to the CPU before any computation can take place. This incurs a high latency and energy overhead, and the data often cannot benefit from c…

    Submitted 1 February, 2018; originally announced February 2018.

  24. arXiv:1801.03493

    cs.DB cs.CV cs.DC

    Focus: Querying Large Video Datasets with Low Latency and Low Cost

    Authors: Kevin Hsieh, Ganesh Ananthanarayanan, Peter Bodik, Paramvir Bahl, Matthai Philipose, Phillip B. Gibbons, Onur Mutlu

    Abstract: Large volumes of videos are continuously recorded from cameras deployed for traffic control and surveillance with the goal of answering "after the fact" queries: identify video frames with objects of certain classes (cars, bags) from many days of recorded video. While advancements in convolutional neural networks (CNNs) have enabled answering such queries with high accuracy, they are too expensive…

    Submitted 10 January, 2018; originally announced January 2018.

  25. arXiv:1801.01796

    cs.IT

    Spatially Coupled Sparse Regression Codes: Design and State Evolution Analysis

    Authors: Kuan Hsieh, Cynthia Rush, Ramji Venkataramanan

    Abstract: We consider the design and analysis of spatially coupled sparse regression codes (SC-SPARCs), which were recently introduced by Barbier et al. for efficient communication over the additive white Gaussian noise channel. SC-SPARCs can be efficiently decoded using an Approximate Message Passing (AMP) decoder, whose performance in each iteration can be predicted via a set of equations called state evo…

    Submitted 26 April, 2018; v1 submitted 5 January, 2018; originally announced January 2018.

    Comments: 8 pages, 6 figures. A shorter version of this paper to appear in ISIT 2018

  26. arXiv:1711.03906

    cs.LG cs.DC cs.NI cs.RO eess.SY

    D-SLATS: Distributed Simultaneous Localization and Time Synchronization

    Authors: Amr Alanwar, Henrique Ferraz, Kevin Hsieh, Rohit Thazhath, Paul Martin, Joao Hespanha, Mani Srivastava

    Abstract: Over the last decade, we have witnessed a surge of Internet of Things (IoT) devices, and with that a greater need to choreograph their actions across both time and space. Although these two problems, namely time synchronization and localization, share many aspects in common, they are traditionally treated separately or combined in centralized approaches that result in an inefficient use of reso…

    Submitted 10 November, 2017; originally announced November 2017.

  27. arXiv:1706.03162

    cs.AR

    LazyPIM: Efficient Support for Cache Coherence in Processing-in-Memory Architectures

    Authors: Amirali Boroumand, Saugata Ghose, Minesh Patel, Hasan Hassan, Brandon Lucia, Nastaran Hajinazar, Kevin Hsieh, Krishna T. Malladi, Hongzhong Zheng, Onur Mutlu

    Abstract: Processing-in-memory (PIM) architectures have seen an increase in popularity recently, as the high internal bandwidth available within 3D-stacked memory provides greater incentive to move some computation into the logic layer of the memory. To maintain program correctness, the portions of a program that are executed in memory must remain coherent with the portions of the program that continue to e…

    Submitted 9 June, 2017; originally announced June 2017.