Showing 1–50 of 114 results for author: Ko, S

  1. arXiv:2407.05683

    eess.IV cs.AI cs.CV

    RadiomicsFill-Mammo: Synthetic Mammogram Mass Manipulation with Radiomics Features

    Authors: Inye Na, Jonghun Kim, Eun Sook Ko, Hyunjin Park

    Abstract: Motivated by the question, "Can we generate tumors with desired attributes?" this study leverages radiomics features to explore the feasibility of generating synthetic tumor images. Characterized by its low-dimensional yet biologically meaningful markers, radiomics bridges the gap between complex medical imaging data and actionable clinical insights. We present RadiomicsFill-Mammo, the first of t…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: Accepted at MICCAI 2024

  2. Lesion-Aware Cross-Phase Attention Network for Renal Tumor Subtype Classification on Multi-Phase CT Scans

    Authors: Kwang-Hyun Uhm, Seung-Won Jung, Sung-Hoo Hong, Sung-Jea Ko

    Abstract: Multi-phase computed tomography (CT) has been widely used for the preoperative diagnosis of kidney cancer due to its non-invasive nature and ability to characterize renal lesions. However, since enhancement patterns of renal lesions across CT phases are different even for the same lesion type, the visual assessment by radiologists suffers from inter-observer variability in clinical practice. Altho…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: This article has been accepted for publication in Computers in Biology and Medicine

    Journal ref: Computers in Biology and Medicine, 108746, 2024

  3. arXiv:2406.15659

    cs.LG cs.MA

    Contextual Sprint Classification in Soccer Based on Deep Learning

    Authors: Hyunsung Kim, Gun-Hee Joe, Jinsung Yoon, Sang-Ki Ko

    Abstract: The analysis of high-intensity runs (or sprints) in soccer has long been a topic of interest for sports science researchers and practitioners. In particular, recent studies suggested contextualizing sprints based on their tactical purposes to better understand the physical-tactical requirements of modern match-play. However, they have a limitation in scalability, as human experts have to manually…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: Accepted at IJCAI 2024 Workshop on Intelligent Technologies for Precision Sports Science (IT4PSS 2024)

  4. arXiv:2406.06287

    math.NA cs.LG

    VS-PINN: A fast and efficient training of physics-informed neural networks using variable-scaling methods for solving PDEs with stiff behavior

    Authors: Seungchan Ko, Sang Hyeon Park

    Abstract: Physics-informed neural networks (PINNs) have recently emerged as a promising way to compute the solutions of partial differential equations (PDEs) using deep neural networks. However, despite their significant success in various fields, it remains unclear in many aspects how to effectively train PINNs if the solutions of PDEs exhibit stiff behaviors or high frequencies. In this paper, we propose…

    Submitted 12 July, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  5. arXiv:2406.01339

    cs.HC cs.OS cs.SE

    Recover as It is Designed to Be: Recovering from Compatibility Mobile App Crashes by Reusing User Flows

    Authors: Donghwi Kim, Hyungjun Yoon, Chang Min Park, Sujin Han, Youngjin Kwon, Steven Y. Ko, Sung-Ju Lee

    Abstract: Android OS is severely fragmented by API updates and device vendors' OS customization, creating a market condition where vastly different OS versions coexist. This gives rise to compatibility crash problems where Android apps crash on certain Android versions but not on others. Although well-known, this problem is extremely challenging for app developers to overcome due to the sheer number of Andr…

    Submitted 3 June, 2024; originally announced June 2024.

  6. arXiv:2405.01974

    cs.LG cs.AI q-bio.QM

    Multitask Extension of Geometrically Aligned Transfer Encoder

    Authors: Sung Moon Ko, Sumin Lee, Dae-Woong Jeong, Hyunseung Kim, Chanhui Lee, Soorin Yim, Sehui Han

    Abstract: Molecular datasets often suffer from a lack of data. It is well-known that gathering data is difficult due to the complexity of experimentation or simulation involved. Here, we leverage mutual information across different tasks in molecular data to address this issue. We extend an algorithm that utilizes the geometric characteristics of the encoding space, known as the Geometrically Aligned Transf…

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: 7 pages, 3 figures, 2 tables

  7. arXiv:2404.17868

    math.NA cs.LG

    Error analysis for finite element operator learning methods for solving parametric second-order elliptic PDEs

    Authors: Youngjoon Hong, Seungchan Ko, Jaeyong Lee

    Abstract: In this paper, we provide a theoretical analysis of a type of operator learning method without data reliance based on the classical finite element approximation, which is called the finite element operator network (FEONet). We first establish the convergence of this method for general second-order linear elliptic PDEs with respect to the parameters for neural network approximation. In this regard,…

    Submitted 27 April, 2024; originally announced April 2024.

  8. arXiv:2404.05144

    cs.CL cs.CV cs.LG

    Enhancing Clinical Efficiency through LLM: Discharge Note Generation for Cardiac Patients

    Authors: HyoJe Jung, Yunha Kim, Heejung Choi, Hyeram Seo, Minkyoung Kim, JiYe Han, Gaeun Kee, Seohyun Park, Soyoung Ko, Byeolhee Kim, Suyeon Kim, Tae Joon Jun, Young-Hak Kim

    Abstract: Medical documentation, including discharge notes, is crucial for ensuring patient care quality, continuity, and effective medical communication. However, the manual creation of these documents is not only time-consuming but also prone to inconsistencies and potential errors. The automation of this documentation process using artificial intelligence (AI) represents a promising area of innovation in…

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: 10 pages, 1 figure, 3 tables, conference

  9. A Study of Vulnerability Repair in JavaScript Programs with Large Language Models

    Authors: Tan Khang Le, Saba Alimadadi, Steven Y. Ko

    Abstract: In recent years, JavaScript has become the most widely used programming language, especially in web development. However, writing secure JavaScript code is not trivial, and programmers often make mistakes that lead to security vulnerabilities in web applications. Large Language Models (LLMs) have demonstrated substantial advancements across multiple domains, and their evolving capabilities indicat…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: camera-ready version accepted to the short paper track at WWW'24

  10. arXiv:2403.02966

    cs.CL cs.AI cs.LG

    Evidence-Focused Fact Summarization for Knowledge-Augmented Zero-Shot Question Answering

    Authors: Sungho Ko, Hyunjin Cho, Hyungjoo Chae, Jinyoung Yeo, Dongha Lee

    Abstract: Recent studies have investigated utilizing Knowledge Graphs (KGs) to enhance Question Answering (QA) performance of Large Language Models (LLMs), yet structured KG verbalization remains challenging. Existing methods, such as triple-form or free-form textual conversion of triple-form facts, encounter several issues. These include reduced evidence density due to duplicated entities or relationships,…

    Submitted 19 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  11. arXiv:2403.02600

    cs.LG cs.SI

    TESTAM: A Time-Enhanced Spatio-Temporal Attention Model with Mixture of Experts

    Authors: Hyunwook Lee, Sungahn Ko

    Abstract: Accurate traffic forecasting is challenging due to the complex dependency on road networks, various types of roads, and abrupt speed changes caused by events. Recent works mainly focus on dynamic spatial modeling with adaptive graph embedding or graph attention, with less consideration for temporal characteristics and in-situ modeling. In this paper, we propose a novel deep learning model nam…

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: 19 pages, 7 figures, Accepted as poster to ICLR 2024. Code: https://github.com/HyunWookL/TESTAM

    Journal ref: International Conference on Learning Representations (ICLR 2024)

  12. arXiv:2402.11925

    cs.LG cs.AI cs.IT

    Energy-Efficient Edge Learning via Joint Data Deepening-and-Prefetching

    Authors: Sujin Kook, Won-Yong Shin, Seong-Lyun Kim, Seung-Woo Ko

    Abstract: The vision of pervasive artificial intelligence (AI) services can be realized by training an AI model on time using real-time data collected by internet of things (IoT) devices. To this end, IoT devices require offloading their data to an edge server in proximity. However, transmitting high-dimensional and voluminous data from energy-constrained IoT devices poses a significant challenge. To addres…

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: accepted for publication in IEEE Transactions on Wireless Communications. arXiv admin note: text overlap with arXiv:2211.07146

  13. arXiv:2401.14625

    cs.CL

    Toward Practical Automatic Speech Recognition and Post-Processing: a Call for Explainable Error Benchmark Guideline

    Authors: Seonmin Koo, Chanjun Park, Jinsung Kim, Jaehyung Seo, Sugyeong Eo, Hyeonseok Moon, Heuiseok Lim

    Abstract: Automatic speech recognition (ASR) outcomes serve as input for downstream tasks, substantially impacting the satisfaction level of end-users. Hence, the diagnosis and enhancement of the vulnerabilities present in the ASR model bear significant importance. However, traditional evaluation methodologies of ASR systems generate a singular, composite quantitative metric, which fails to provide comprehe…

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: Accepted for Data-centric Machine Learning Research (DMLR) Workshop at ICML 2023

  14. arXiv:2401.13586

    cs.LG cs.AI cs.CL

    Instruction Fine-Tuning: Does Prompt Loss Matter?

    Authors: Mathew Huerta-Enochian, Seung Yong Ko

    Abstract: We present a novel study analyzing the effects of various prompt loss token weights (PLW) for supervised instruction fine-tuning (SIFT). While prompt-masking (PLW = 0) is common for SIFT, some fine-tuning APIs support fractional PLWs and suggest that using a small non-zero PLW can help stabilize learning when fine-tuning on short-completion data. However, there has never been a study confirming th…

    Submitted 18 June, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: 8 pages of content. 13 pages of appendices. 45 figures

  15. arXiv:2401.08719

    cs.SE cs.CC

    CodeComplex: A Time-Complexity Dataset for Bilingual Source Codes

    Authors: Seung-Yeop Baik, Mingi Jeon, Joonghyuk Hahn, Jungin Kim, Yo-Sub Han, Sang-Ki Ko

    Abstract: Analyzing the worst-case time complexity of a code is a crucial task in computer science and software engineering for ensuring the efficiency, reliability, and robustness of software systems. However, it is well-known that the problem of determining the worst-case time complexity of a given code written in a general-purpose programming language is theoretically undecidable by the famous Halting prob…

    Submitted 16 January, 2024; originally announced January 2024.

  16. arXiv:2312.05548

    eess.IV cs.CV cs.LG

    A Unified Multi-Phase CT Synthesis and Classification Framework for Kidney Cancer Diagnosis with Incomplete Data

    Authors: Kwang-Hyun Uhm, Seung-Won Jung, Moon Hyung Choi, Sung-Hoo Hong, Sung-Jea Ko

    Abstract: Multi-phase CT is widely adopted for the diagnosis of kidney cancer due to the complementary information among phases. However, the complete set of multi-phase CT is often not available in practical clinical applications. In recent years, there have been some studies to generate the missing modality image from the available data. Nevertheless, the generated images are not guaranteed to be effectiv…

    Submitted 9 December, 2023; originally announced December 2023.

    Comments: This article has been accepted for publication in IEEE Journal of Biomedical and Health Informatics

    Journal ref: JBHI, 2022

  17. arXiv:2312.05528

    eess.IV cs.CV

    Exploring 3D U-Net Training Configurations and Post-Processing Strategies for the MICCAI 2023 Kidney and Tumor Segmentation Challenge

    Authors: Kwang-Hyun Uhm, Hyunjun Cho, Zhixin Xu, Seohoon Lim, Seung-Won Jung, Sung-Hoo Hong, Sung-Jea Ko

    Abstract: In 2023, it is estimated that 81,800 kidney cancer cases will be newly diagnosed, and 14,890 people will die from this cancer in the United States. Preoperative dynamic contrast-enhanced abdominal computed tomography (CT) is often used for detecting lesions. However, there exists inter-observer variability due to subtle differences in the imaging features of kidney and kidney tumors. In this paper…

    Submitted 9 December, 2023; originally announced December 2023.

    Comments: MICCAI 2023, KITS 2023 challenge 2nd place

  18. arXiv:2312.03003

    cs.HC cs.AI cs.CL

    Explore, Select, Derive, and Recall: Augmenting LLM with Human-like Memory for Mobile Task Automation

    Authors: Sunjae Lee, Junyoung Choi, Jungjae Lee, Munim Hasan Wasi, Hojun Choi, Steven Y. Ko, Sangeun Oh, Insik Shin

    Abstract: The advent of large language models (LLMs) has opened up new opportunities in the field of mobile task automation. Their superior language understanding and reasoning capabilities allow users to automate complex and repetitive tasks. However, due to the inherent unreliability and high operational cost of LLMs, their practical applicability is quite limited. To address these issues, this paper intr…

    Submitted 16 March, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

  19. arXiv:2311.10269

    cs.CV cs.AI

    Interpretable pap smear cell representation for cervical cancer screening

    Authors: Yu Ando, Nora Jee-Young Park, Gun Oh Chong, Seokhwan Ko, Donghyeon Lee, Junghwan Cho, Hyungsoo Han

    Abstract: Screening is critical for prevention and early detection of cervical cancer, but it is time-consuming and laborious. Supervised deep convolutional neural networks have been developed to automate pap smear screening and the results are promising. However, the interest in using only normal samples to train deep neural networks has increased owing to class imbalance problems and high labeling costs th…

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: 20 pages, 6 figures

  20. arXiv:2311.08271

    cs.LG cs.IT cs.NI eess.SP

    Mobility-Induced Graph Learning for WiFi Positioning

    Authors: Kyuwon Han, Seung Min Yu, Seong-Lyun Kim, Seung-Woo Ko

    Abstract: Smartphone-based user mobility tracking can be effective in finding the user's location, but the unpredictable error caused by the low specification of built-in inertial measurement units (IMUs) precludes its standalone use and calls for integration with another positioning technique such as WiFi positioning. This paper aims to propose a novel integration technique using a graph neural network ca…

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: submitted to a possible IEEE journal

  21. arXiv:2310.09506

    cs.IT cs.AI cs.LG cs.NI

    Towards Semantic Communication Protocols for 6G: From Protocol Learning to Language-Oriented Approaches

    Authors: Jihong Park, Seung-Woo Ko, Jinho Choi, Seong-Lyun Kim, Mehdi Bennis

    Abstract: The forthcoming 6G systems are expected to address a wide range of non-stationary tasks. This poses challenges to traditional medium access control (MAC) protocols that are static and predefined. In response, data-driven MAC protocols have recently emerged, offering the ability to tailor their signaling messages for specific tasks. This article presents a novel categorization of these data-driven MAC…

    Submitted 14 October, 2023; originally announced October 2023.

    Comments: 11 pages, 13 figures, submitted to IEEE BITS the Information Theory Magazine

  22. arXiv:2310.09394

    cs.LG cs.AI cs.IT cs.NI

    Semantics Alignment via Split Learning for Resilient Multi-User Semantic Communication

    Authors: Jinhyuk Choi, Jihong Park, Seung-Woo Ko, Jinho Choi, Mehdi Bennis, Seong-Lyun Kim

    Abstract: Recent studies on semantic communication commonly rely on neural network (NN) based transceivers such as deep joint source and channel coding (DeepJSCC). Unlike traditional transceivers, these neural transceivers are trainable using actual source data and channels, enabling them to extract and communicate semantics. On the flip side, each neural transceiver is inherently biased towards specific so…

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: 5 pages, 4 figures, 1 table, submitted to the IEEE for possible publication

  23. arXiv:2310.06369

    cs.AI cs.LG

    Geometrically Aligned Transfer Encoder for Inductive Transfer in Regression Tasks

    Authors: Sung Moon Ko, Sumin Lee, Dae-Woong Jeong, Woohyung Lim, Sehui Han

    Abstract: Transfer learning is a crucial technique for handling a small amount of data that is potentially related to other abundant data. However, most of the existing methods are focused on classification tasks using images and language datasets. Therefore, in order to expand the transfer learning scheme to regression tasks, we propose a novel transfer technique based on differential geometry, namely the…

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: 12+11 pages, 6+1 figures, 0+7 tables

  24. arXiv:2309.04062

    cs.LG cs.AI physics.chem-ph

    3D Denoisers are Good 2D Teachers: Molecular Pretraining via Denoising and Cross-Modal Distillation

    Authors: Sungjun Cho, Dae-Woong Jeong, Sung Moon Ko, Jinwoo Kim, Sehui Han, Seunghoon Hong, Honglak Lee, Moontae Lee

    Abstract: Pretraining molecular representations from large unlabeled data is essential for molecular property prediction due to the high cost of obtaining ground-truth labels. While there exist various 2D graph-based molecular pretraining approaches, these methods struggle to show statistically significant gains in predictive performance. Recent works have thus instead proposed 3D conformer-based pretraining…

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: 16 pages, 5 figures

  25. arXiv:2308.04690

    math.NA cs.AI cs.LG physics.comp-ph

    Finite Element Operator Network for Solving Parametric PDEs

    Authors: Jae Yong Lee, Seungchan Ko, Youngjoon Hong

    Abstract: Partial differential equations (PDEs) underlie our understanding and prediction of natural phenomena across numerous fields, including physics, engineering, and finance. However, solving parametric PDEs is a complex task that necessitates efficient numerical methods. In this paper, we propose a novel approach for solving parametric PDEs using a Finite Element Operator Network (FEONet). Our propose…

    Submitted 19 December, 2023; v1 submitted 8 August, 2023; originally announced August 2023.

    Comments: 23 pages, 11 figures

    MSC Class: 65M60; 65N30; 68T20; 68U07 ACM Class: G.1.8

  26. arXiv:2306.14377

    cs.CL cs.AI

    Synthetic Alone: Exploring the Dark Side of Synthetic Data for Grammatical Error Correction

    Authors: Chanjun Park, Seonmin Koo, Seolhwa Lee, Jaehyung Seo, Sugyeong Eo, Hyeonseok Moon, Heuiseok Lim

    Abstract: The data-centric AI approach aims to enhance the model performance without modifying the model and has been shown to impact model performance positively. While recent attention has been given to data-centric AI based on synthetic data, due to its potential for performance improvement, data-centric AI has long been exclusively validated using real-world data and publicly available benchmark datasets. I…

    Submitted 25 June, 2023; originally announced June 2023.

    Comments: Accepted for Data-centric Machine Learning Research (DMLR) Workshop at ICML 2023

  27. Ball Trajectory Inference from Multi-Agent Sports Contexts Using Set Transformer and Hierarchical Bi-LSTM

    Authors: Hyunsung Kim, Han-Jun Choi, Chang Jo Kim, Jinsung Yoon, Sang-Ki Ko

    Abstract: As artificial intelligence spreads out to numerous fields, the application of AI to sports analytics is also in the spotlight. However, one of the major challenges is the difficulty of automated acquisition of continuous movement data during sports matches. In particular, it is a conundrum to reliably track a tiny ball on a wide soccer pitch with obstacles such as occlusion and imitations. Tacklin…

    Submitted 13 June, 2023; originally announced June 2023.

    MSC Class: 68T20 (Primary) 68U35; 68T30 (Secondary)

    Journal ref: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

  28. Towards Visualization Thumbnail Designs that Entice Reading Data-driven Articles

    Authors: Hwiyeon Kim, Joohee Kim, Yunha Han, Hwajung Hong, Oh-Sang Kwon, Young-Woo Park, Niklas Elmqvist, Sungahn Ko, Bum Chul Kwon

    Abstract: As online news increasingly includes data journalism, there is a corresponding increase in the incorporation of visualization in article thumbnail images. However, little research exists on the design rationale for visualization thumbnails, such as resizing, cropping, simplifying, and embellishing charts that appear within the body of the associated article. Therefore, in this paper we aim to under…

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: To appear in IEEE Transactions on Visualization and Computer Graphics, 16 pages, 6 figures, 5 tables. arXiv admin note: text overlap with arXiv:1908.06922

  29. arXiv:2211.08900

    math.NA cs.LG

    Convergence analysis of unsupervised Legendre-Galerkin neural networks for linear second-order elliptic PDEs

    Authors: Seungchan Ko, Seok-Bae Yun, Youngjoon Hong

    Abstract: In this paper, we perform the convergence analysis of unsupervised Legendre-Galerkin neural networks (ULGNet), a deep-learning-based numerical method for solving partial differential equations (PDEs). Unlike existing deep learning-based numerical methods for PDEs, the ULGNet expresses the solution as a spectral expansion with respect to the Legendre basis and predicts the coefficients with deep n…

    Submitted 16 November, 2022; originally announced November 2022.

  30. arXiv:2211.07860

    eess.SY cs.LG

    Enabling AI Quality Control via Feature Hierarchical Edge Inference

    Authors: Jinhyuk Choi, Seong-Lyun Kim, Seung-Woo Ko

    Abstract: With the rise of edge computing, various AI services are expected to be available on the mobile side through inference based on a deep neural network (DNN) operated at the network edge, called edge inference (EI). On the other hand, the resulting AI quality (e.g., mean average precision in object detection) has been regarded as a given factor, and AI quality control has yet to be explored despi…

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: 7 pages, 6 figures, Conference Version

  31. arXiv:2211.07146

    cs.NI

    Joint Data Deepening-and-Prefetching for Energy-Efficient Edge Learning

    Authors: Sujin Kook, Won-Yong Shin, Seong-Lyun Kim, Seung-Woo Ko

    Abstract: The vision of pervasive machine learning (ML) services can be realized by training an ML model on time using real-time data collected by internet of things (IoT) devices. To this end, IoT devices require offloading their data to an edge server in proximity. On the other hand, high dimensional data with a heavy volume causes a significant burden to an IoT device with a limited energy budget. To cop…

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: This work has been submitted to the IEEE for possible publication

  32. arXiv:2211.06225

    cs.IT eess.SP

    Over-the-Air Consensus for Distributed Vehicle Platooning Control (Extended version)

    Authors: Jihoon Lee, Yonghoon Jang, Hansol Kim, Seong-Lyun Kim, Seung-Woo Ko

    Abstract: A distributed control of vehicle platooning is referred to as distributed consensus (DC) since many autonomous vehicles (AVs) reach a consensus to move as one body with the same velocity and inter-distance. For DC control to be stable, other AVs' real-time position information should be inputted to each AV's controller via vehicle-to-vehicle (V2V) communications. On the other hand, too many V2V li…

    Submitted 11 November, 2022; originally announced November 2022.

    Comments: This work has been submitted to the IEEE for possible publication

  33. arXiv:2210.15050

    cs.LG

    TILDE-Q: A Transformation Invariant Loss Function for Time-Series Forecasting

    Authors: Hyunwook Lee, Chunggi Lee, Hongkyu Lim, Sungahn Ko

    Abstract: Time-series forecasting has gained increasing attention in the field of artificial intelligence due to its potential to address real-world problems across various domains, including energy, weather, traffic, and economy. While time-series forecasting is a well-researched field, predicting complex temporal patterns such as sudden changes in sequential data still poses a challenge with current model…

    Submitted 12 March, 2024; v1 submitted 26 October, 2022; originally announced October 2022.

    Comments: 17 pages in total, 8 pages main paper, 3 pages references, and 6 pages appendix. Submitted as conference paper to ICML 2024 and currently under review

  34. arXiv:2210.11643

    cs.GT

    All Politics is Local: Redistricting via Local Fairness

    Authors: Shao-Heng Ko, Erin Taylor, Pankaj K. Agarwal, Kamesh Munagala

    Abstract: In this paper, we propose to use the concept of local fairness for auditing and ranking redistricting plans. Given a redistricting plan, a deviating group is a population-balanced contiguous region in which a majority of individuals are of the same interest and in the minority of their respective districts; such a set of individuals have a justified complaint with how the redistricting plan was dr…

    Submitted 19 November, 2022; v1 submitted 20 October, 2022; originally announced October 2022.

  35. arXiv:2209.08945

    cs.LG cs.IT stat.ML

    A novel approach for wafer defect pattern classification based on topological data analysis

    Authors: Seungchan Ko, Dowan Koo

    Abstract: In semiconductor manufacturing, wafer map defect patterns provide critical information for facility maintenance and yield management, so the classification of defect patterns is one of the most important tasks in the manufacturing process. In this paper, we propose a novel way to represent the shape of the defect pattern as a finite-dimensional vector, which will be used as an input for a neural n…

    Submitted 19 September, 2022; originally announced September 2022.

  36. Grouping-matrix based Graph Pooling with Adaptive Number of Clusters

    Authors: Sung Moon Ko, Sungjun Cho, Dae-Woong Jeong, Sehui Han, Moontae Lee, Honglak Lee

    Abstract: Graph pooling is a crucial operation for encoding hierarchical structures within graphs. Most existing graph pooling approaches formulate the problem as a node clustering task which effectively captures the graph topology. Conventional methods ask users to specify an appropriate number of clusters as a hyperparameter, then assume that all input graphs share the same number of clusters. In inductiv…

    Submitted 7 September, 2022; originally announced September 2022.

    Comments: 10 pages, 3 figures

  37. arXiv:2208.04350

    cs.HC cs.LG

    A Visual Analytics System for Improving Attention-based Traffic Forecasting Models

    Authors: Seungmin Jin, Hyunwook Lee, Cheonbok Park, Hyeshin Chu, Yunwon Tae, Jaegul Choo, Sungahn Ko

    Abstract: With deep learning (DL) outperforming conventional methods for different tasks, much effort has been devoted to utilizing DL in various domains. Researchers and developers in the traffic domain have also designed and improved DL models for forecasting tasks such as estimation of traffic speed and time of arrival. However, there exist many challenges in analyzing DL models due to the black-box prop…

    Submitted 11 August, 2022; v1 submitted 8 August, 2022; originally announced August 2022.

    Comments: 9 pages paper, 2 pages references, and 3 pages appendix. Accepted to IEEE VIS 2022

  38. arXiv:2207.03920

    cs.IT cs.AI cs.LG cs.NI

    Towards Semantic Communication Protocols: A Probabilistic Logic Perspective

    Authors: Sejin Seo, Jihong Park, Seung-Woo Ko, Jinho Choi, Mehdi Bennis, Seong-Lyun Kim

    Abstract: Classical medium access control (MAC) protocols are interpretable, yet their task-agnostic control signaling messages (CMs) are ill-suited for emerging mission-critical applications. By contrast, neural network (NN) based protocol models (NPMs) learn to generate task-specific CMs, but their rationale and impact lack interpretability. To fill this void, in this article we propose, for the first tim…

    Submitted 8 July, 2022; originally announced July 2022.

  39. arXiv:2205.11258

    cs.LG cs.AI cs.FL cs.PL

    Neuro-Symbolic Regex Synthesis Framework via Neural Example Splitting

    Authors: Su-Hyeon Kim, Hyunjoon Cheon, Yo-Sub Han, Sang-Ki Ko

    Abstract: Due to the practical importance of regular expressions (regexes, for short), there has been a lot of research to automatically generate regexes from positive and negative string examples. We tackle the problem of learning regexes faster from positive and negative strings by relying on a novel approach called 'neural example splitting'. Our approach essentially splits up each example string into mul…

    Submitted 20 May, 2022; originally announced May 2022.

  40. arXiv:2205.09185

    physics.ins-det cs.LG hep-ex nucl-ex physics.comp-ph

    AI-assisted Optimization of the ECCE Tracking System at the Electron Ion Collider

    Authors: C. Fanelli, Z. Papandreou, K. Suresh, J. K. Adkins, Y. Akiba, A. Albataineh, M. Amaryan, I. C. Arsene, C. Ayerbe Gayoso, J. Bae, X. Bai, M. D. Baker, M. Bashkanov, R. Bellwied, F. Benmokhtar, V. Berdnikov, J. C. Bernauer, F. Bock, W. Boeglin, M. Borysova, E. Brash, P. Brindza, W. J. Briscoe, M. Brooks, S. Bueltmann, et al. (258 additional authors not shown)

    Abstract: The Electron-Ion Collider (EIC) is a cutting-edge accelerator facility that will study the nature of the "glue" that binds the building blocks of the visible matter in the universe. The proposed experiment will be realized at Brookhaven National Laboratory in approximately 10 years from now, with detector design and R&D currently ongoing. Notably, EIC is one of the first large-scale facilities to…

    Submitted 19 May, 2022; v1 submitted 18 May, 2022; originally announced May 2022.

    Comments: 16 pages, 18 figures, 2 appendices, 3 tables

  41. arXiv:2205.04687

    cs.GT

    Optimal Price Discrimination for Randomized Mechanisms

    Authors: Shao-Heng Ko, Kamesh Munagala

    Abstract: We study the power of price discrimination via an intermediary in bilateral trade, when there is a revenue-maximizing seller selling an item to a buyer with a private value drawn from a prior. Between the seller and the buyer, there is an intermediary that can segment the market by releasing information about the true values to the seller. This is termed signaling, and enables the seller to price… ▽ More

    Submitted 27 April, 2023; v1 submitted 10 May, 2022; originally announced May 2022.

    Comments: Appears in ACM EC 2022

  42. arXiv:2205.00364  [pdf, other]

    cs.CV

    RADNet: A Deep Neural Network Model for Robust Perception in Moving Autonomous Systems

    Authors: Burhan A. Mudassar, Sho Ko, Maojingjing Li, Priyabrata Saha, Saibal Mukhopadhyay

    Abstract: Interactive autonomous applications require robustness of the perception engine to artifacts in unconstrained videos. In this paper, we examine the effect of camera motion on the task of action detection. We develop a novel ranking method to rank videos based on the degree of global camera motion. For the high ranking camera videos we show that the accuracy of action detection is decreased. We pro… ▽ More

    Submitted 30 April, 2022; originally announced May 2022.

  43. arXiv:2204.10221  [pdf, other]

    cs.HC

    A Graphical Workflow Exploration Environment For Visual Analytics

    Authors: Chunggi Lee, Juyoung Oh, Seungmin Jin, Isaac Cho, Sungahn Ko

    Abstract: Graphical history mechanisms have been widely utilized in many domains to support humans' limited working memory, error recovery, collaboration, and presentation in visual analysis. Yet, there are aspects that remain under-explored in designing graphical history systems for visual analytics systems to help analysts who have complicated workflows. In this paper we report on our design study perform… ▽ More

    Submitted 21 April, 2022; originally announced April 2022.

  44. arXiv:2204.09524  [pdf, other]

    cs.HC

    An Empirical Study on the Relationship Between the Number of Coordinated Views and Visual Analysis

    Authors: Juyoung Oh, Chunggi Lee, Hwiyeon Kim, Kihwan Kim, Osang Kwon, Eric D. Ragan, Bum Chul Kwon, Sungahn Ko

    Abstract: Coordinated Multiple views (CMVs) are a visualization technique that simultaneously presents multiple visualizations in separate but linked views. There are many studies that report the advantages (e.g., usefulness for finding hidden relationships) and disadvantages (e.g., cognitive load) of CMVs. But little empirical work exists on the impact of the number of views on visual analysis results and… ▽ More

    Submitted 20 April, 2022; originally announced April 2022.

  45. Image Generation with Self Pixel-wise Normalization

    Authors: Yoon-Jae Yeo, Min-Cheol Sagong, Seung Park, Sung-Jea Ko, Yong-Goo Shin

    Abstract: Region-adaptive normalization (RAN) methods have been widely used in the generative adversarial network (GAN)-based image-to-image translation technique. However, since these approaches need a mask image to infer the pixel-wise affine transformation parameters, they cannot be applied to the general image generation models having no paired mask images. To resolve this problem, this paper presents a… ▽ More

    Submitted 25 January, 2022; originally announced January 2022.

    Comments: 13 pages, 8 figures

  46. arXiv:2112.06899  [pdf, ps, other]

    cs.DS

    Locally Fair Partitioning

    Authors: Pankaj K. Agarwal, Shao-Heng Ko, Kamesh Munagala, Erin Taylor

    Abstract: We model the societal task of redistricting political districts as a partitioning problem: Given a set of $n$ points in the plane, each belonging to one of two parties, and a parameter $k$, our goal is to compute a partition $Π$ of the plane into regions so that each region contains roughly $σ= n/k$ points. $Π$ should satisfy a notion of ''local'' fairness, which is related to the notion of core,… ▽ More

    Submitted 15 December, 2021; v1 submitted 13 December, 2021; originally announced December 2021.

  47. arXiv:2110.10380  [pdf, ps, other]

    cs.LG cs.NE

    Learning to Remember Patterns: Pattern Matching Memory Networks for Traffic Forecasting

    Authors: Hyunwook Lee, Seungmin Jin, Hyeshin Chu, Hongkyu Lim, Sungahn Ko

    Abstract: Traffic forecasting is a challenging problem due to complex road networks and sudden speed changes caused by various events on roads. A number of models have been proposed to solve this challenging problem with a focus on learning spatio-temporal dependencies of roads. In this work, we propose a new perspective of converting the forecasting problem into a pattern matching task, assuming that large… ▽ More

    Submitted 8 March, 2022; v1 submitted 20 October, 2021; originally announced October 2021.

    Comments: 15 pages, Accepted as poster to ICLR 2022

    Journal ref: International Conference on Learning Representations (ICLR 2022)

  48. 6MapNet: Representing soccer players from tracking data by a triplet network

    Authors: Hyunsung Kim, Jihun Kim, Dongwook Chung, Jonghyun Lee, Jinsung Yoon, Sang-Ki Ko

    Abstract: Although the values of individual soccer players have become astronomical, subjective judgments still play a big part in the player analysis. Recently, there have been new attempts to quantitatively grasp players' styles using video-based event stream data. However, they have some limitations in scalability due to high annotation costs and sparsity of event stream data. In this paper, we build a t… ▽ More

    Submitted 10 September, 2021; originally announced September 2021.

    Comments: 12 pages, 4 figures, In 8th Workshop on Machine Learning and Data Mining for Sports Analytics (MLSA21)

    Journal ref: ECML-PKDD Workshop on Machine Learning and Data Mining for Sports Analytics, 2021

  49. arXiv:2109.04650  [pdf, other]

    cs.CL

    What Changes Can Large-scale Language Models Bring? Intensive Study on HyperCLOVA: Billions-scale Korean Generative Pretrained Transformers

    Authors: Boseop Kim, HyoungSeok Kim, Sang-Woo Lee, Gichang Lee, Donghyun Kwak, Dong Hyeon Jeon, Sunghyun Park, Sungju Kim, Seonhoon Kim, Dongpil Seo, Heungsub Lee, Minyoung Jeong, Sungjae Lee, Minsub Kim, Suk Hyun Ko, Seokhun Kim, Taeyong Park, Jinuk Kim, Soyoung Kang, Na-Hyeon Ryu, Kang Min Yoo, Minsuk Chang, Soobin Suh, Sookyo In, Jinseong Park, et al. (12 additional authors not shown)

    Abstract: GPT-3 shows remarkable in-context learning ability of large-scale language models (LMs) trained on hundreds of billion scale data. Here we address some remaining issues less reported by the GPT-3 paper, such as a non-English LM, the performances of different sized models, and the effect of recently introduced prompt optimization on in-context learning. To achieve this, we introduce HyperCLOVA, a K… ▽ More

    Submitted 28 November, 2021; v1 submitted 9 September, 2021; originally announced September 2021.

    Comments: Accepted to EMNLP2021 as a long paper. Fixed some typos

  50. arXiv:2108.05054  [pdf, other]

    cs.CV cs.AI

    Rethinking Coarse-to-Fine Approach in Single Image Deblurring

    Authors: Sung-Jin Cho, Seo-Won Ji, Jun-Pyo Hong, Seung-Won Jung, Sung-Jea Ko

    Abstract: Coarse-to-fine strategies have been extensively used for the architecture design of single image deblurring networks. Conventional methods typically stack sub-networks with multi-scale input images and gradually improve sharpness of images from the bottom sub-network to the top sub-network, yielding inevitably high computational costs. Toward a fast and accurate deblurring network design, we revis… ▽ More

    Submitted 16 September, 2021; v1 submitted 11 August, 2021; originally announced August 2021.

    Comments: Accepted by IEEE International Conference on Computer Vision (ICCV) 2021