Skip to main content

Showing 1–50 of 519 results for author: Hong, S

  1. arXiv:2407.03051  [pdf, other

    cs.CL

    Improving Conversational Abilities of Quantized Large Language Models via Direct Preference Alignment

    Authors: Janghwan Lee, Seongmin Park, Sukjin Hong, Minsoo Kim, Du-Seong Chang, Jungwook Choi

    Abstract: The rapid advancement of large language models (LLMs) has facilitated their transformation into conversational chatbots that can grasp contextual nuances and generate pertinent sentences, closely mirroring human values through advanced techniques such as instruction tuning and reinforcement learning from human feedback (RLHF). However, the computational efficiency required for LLMs, achieved throu… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  2. arXiv:2407.01214  [pdf, other

    cs.LG cs.AI

    Revisiting Random Walks for Learning on Graphs

    Authors: Jinwoo Kim, Olga Zaghen, Ayhan Suleymanzade, Youngmin Ryou, Seunghoon Hong

    Abstract: We revisit a simple idea for machine learning on graphs, where a random walk on a graph produces a machine-readable record, and this record is processed by a deep neural network to directly make vertex-level or graph-level predictions. We refer to these stochastic machines as random walk neural networks, and show that we can design them to be isomorphism invariant while capable of universal approx… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 41 pages, 11 figures

  3. arXiv:2407.00511  [pdf, other

    cs.DS

    Wooly Graphs : A Mathematical Framework For Knitting

    Authors: Kathryn Gray, Brian Bell, Diana Sieper, Stephen Kobourov, Falk Schreiber, Karsten Klein, Seokhee Hong

    Abstract: This paper aims to develop a mathematical foundation to model knitting with graphs. We provide a precise definition for knit objects with a knot theoretic component and propose a simple undirected graph, a simple directed graph, and a directed multigraph model for any arbitrary knit object. Using these models, we propose natural categories related to the complexity of knitting structures. We use t… ▽ More

    Submitted 3 July, 2024; v1 submitted 29 June, 2024; originally announced July 2024.

    Comments: 11 pages, 4 tables, 5 figures

  4. arXiv:2406.17745  [pdf, ps, other

    cs.IR cs.LG

    Light-weight End-to-End Graph Interest Network for CTR Prediction in E-commerce Search

    Authors: Pai Peng, Yunqing Jia, Ziqiang Zhou, Shuang Hong, Zichong Xiao

    Abstract: Click-through-rate (CTR) prediction has an essential impact on improving user experience and revenue in e-commerce search. With the development of deep learning, graph-based methods are well exploited to utilize graph structure extracted from user behaviors and other information to help embedding learning. However, most of the previous graph-based methods mainly focus on recommendation scenarios,… ▽ More

    Submitted 2 July, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

    Comments: 8 pages, 4 figures

    ACM Class: H.3.3

  5. arXiv:2406.16937  [pdf, other

    cs.CL cs.AI

    A Complete Survey on LLM-based AI Chatbots

    Authors: Sumit Kumar Dam, Choong Seon Hong, Yu Qiao, Chaoning Zhang

    Abstract: The past few decades have witnessed an upsurge in data, forming the foundation for data-hungry, learning-based AI technology. Conversational agents, often referred to as AI chatbots, rely heavily on such data to train large language models (LLMs) and generate new content (knowledge) in response to user prompts. With the advent of OpenAI's ChatGPT, LLM-based chatbots have set new standards in the A… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 23 pages, 10 figures

  6. Lesion-Aware Cross-Phase Attention Network for Renal Tumor Subtype Classification on Multi-Phase CT Scans

    Authors: Kwang-Hyun Uhm, Seung-Won Jung, Sung-Hoo Hong, Sung-Jea Ko

    Abstract: Multi-phase computed tomography (CT) has been widely used for the preoperative diagnosis of kidney cancer due to its non-invasive nature and ability to characterize renal lesions. However, since enhancement patterns of renal lesions across CT phases are different even for the same lesion type, the visual assessment by radiologists suffers from inter-observer variability in clinical practice. Altho… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: This article has been accepted for publication in Computers in Biology and Medicine

    Journal ref: Computers in Biology and Medicine, 108746, 2024

  7. arXiv:2406.14953  [pdf, other

    cs.CV cs.AI cs.LG eess.SP

    Deep Imbalanced Regression to Estimate Vascular Age from PPG Data: a Novel Digital Biomarker for Cardiovascular Health

    Authors: Guangkun Nie, Qinghao Zhao, Gongzheng Tang, Jun Li, Shenda Hong

    Abstract: Photoplethysmography (PPG) is emerging as a crucial tool for monitoring human hemodynamics, with recent studies highlighting its potential in assessing vascular aging through deep learning. However, real-world age distributions are often imbalanced, posing significant challenges for deep learning models. In this paper, we introduce a novel, simple, and effective loss function named the Dist Loss t… ▽ More

    Submitted 2 July, 2024; v1 submitted 21 June, 2024; originally announced June 2024.

  8. arXiv:2406.13280  [pdf, other

    cs.NI cs.AI

    Design Optimization of NOMA Aided Multi-STAR-RIS for Indoor Environments: A Convex Approximation Imitated Reinforcement Learning Approach

    Authors: Yu Min Park, Sheikh Salman Hassan, Yan Kyaw Tun, Eui-Nam Huh, Walid Saad, Choong Seon Hong

    Abstract: Sixth-generation (6G) networks leverage simultaneously transmitting and reflecting reconfigurable intelligent surfaces (STAR-RISs) to overcome the limitations of traditional RISs. STAR-RISs offer 360-degree full-space coverage and optimized transmission and reflection for enhanced network performance and dynamic control of the indoor propagation environment. However, deploying STAR-RISs indoors pr… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 37 pages, 11 figures, IEEE Transactions on Communications submitted. arXiv admin note: text overlap with arXiv:2311.08708

  9. arXiv:2406.11672  [pdf, other

    cs.CV

    Effective Rank Analysis and Regularization for Enhanced 3D Gaussian Splatting

    Authors: Junha Hyung, Susung Hong, Sungwon Hwang, Jaeseong Lee, Jaegul Choo, Jin-Hwa Kim

    Abstract: 3D reconstruction from multi-view images is one of the fundamental challenges in computer vision and graphics. Recently, 3D Gaussian Splatting (3DGS) has emerged as a promising technique capable of real-time rendering with high-quality 3D reconstruction. This method utilizes 3D Gaussian representation and tile-based splatting techniques, bypassing the expensive neural field querying. Despite its p… ▽ More

    Submitted 18 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: project page: https://junhahyung.github.io/erankgs.github.io

  10. arXiv:2406.09047  [pdf, other

    cs.CG

    DeepJEB: 3D Deep Learning-based Synthetic Jet Engine Bracket Dataset

    Authors: Seongjun Hong, Yongmin Kwon, Dongju Shin, Jangseop Park, Namwoo Kang

    Abstract: Recent advancements in artificial intelligence (AI) have significantly influenced various fields, including mechanical engineering. Nonetheless, the development of high-quality, diverse datasets for structural analysis still needs to be improved. Although traditional datasets, such as simulated jet engine bracket dataset, are useful, they are constrained by a small number of samples, which must be… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  11. arXiv:2406.08815  [pdf, other

    cs.RO eess.SY

    Deep Reinforcement Learning-based Quadcopter Controller: A Practical Approach and Experiments

    Authors: Truong-Dong Do, Nguyen Xuan Mung, Sung Kyung Hong

    Abstract: Quadcopters have been studied for decades thanks to their maneuverability and capability of operating in a variety of circumstances. However, quadcopters suffer from dynamical nonlinearity, actuator saturation, as well as sensor noise that make it challenging and time consuming to obtain accurate dynamic models and achieve satisfactory control performance. Fortunately, deep reinforcement learning… ▽ More

    Submitted 18 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: 6 pages, 5 figures, 3 tables

  12. arXiv:2406.08098  [pdf, other

    cs.SE

    Scalable Defect Detection via Traversal on Code Graph

    Authors: Zhengyao Liu, Xitong Zhong, Xingjing Deng, Shuo Hong, Xiang Gao, Hailong Sun

    Abstract: Detecting defects and vulnerabilities in the early stage has long been a challenge in software engineering. Static analysis, a technique that inspects code without execution, has emerged as a key strategy to address this challenge. Among recent advancements, the use of graph-based representations, particularly Code Property Graph (CPG), has gained traction due to its comprehensive depiction of cod… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  13. arXiv:2406.07558  [pdf, other

    cs.CY cs.AI cs.CV

    A Large Medical Model based on Visual Physiological Monitoring for Public Health

    Authors: Bin Huang, Changchen Zhao, Zimeng Liu, Shenda Hong, Baochang Zhang, Wenjin Wang, Hui Liu

    Abstract: The widespread outbreak of the COVID-19 pandemic has sounded a warning about the globalization challenges in public health. In this context, the establishment of large-scale public health datasets, of medical models, and of decision-making systems with a human-centric approach holds strategic significance. Recently, groundbreaking advancements have emerged in AI methods for physiological signal mo… ▽ More

    Submitted 21 April, 2024; originally announced June 2024.

    Comments: 17 pages, 7 figures

  14. arXiv:2406.03773  [pdf, other

    cs.IT

    Optimizing Multi-User Semantic Communication via Transfer Learning and Knowledge Distillation

    Authors: Loc X. Nguyen, Kitae Kim, Ye Lin Tun, Sheikh Salman Hassan, Yan Kyaw Tun, Zhu Han, Choong Seon Hong

    Abstract: Semantic communication, notable for ensuring quality of service by jointly optimizing source and channel coding, effectively extracts data semantics, reduces transmission length, and mitigates channel noise. However, most studies overlook multi-user scenarios and resource availability, limiting real-world application. This paper addresses this gap by focusing on downlink communication from a base… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 5 pages, 5 figures

  15. arXiv:2406.02000  [pdf, other

    cs.NI eess.SP

    Advancing Ultra-Reliable 6G: Transformer and Semantic Localization Empowered Robust Beamforming in Millimeter-Wave Communications

    Authors: Avi Deb Raha, Kitae Kim, Apurba Adhikary, Mrityunjoy Gain, Choong Seon Hong

    Abstract: Advancements in 6G wireless technology have elevated the importance of beamforming, especially for attaining ultra-high data rates via millimeter-wave (mmWave) frequency deployment. Although promising, mmWave bands require substantial beam training to achieve precise beamforming. While initial deep learning models that use RGB camera images demonstrated promise in reducing beam training overhead,… ▽ More

    Submitted 21 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  16. arXiv:2406.00431  [pdf, ps, other

    cs.LG cs.AI cs.DC

    SpaFL: Communication-Efficient Federated Learning with Sparse Models and Low computational Overhead

    Authors: Minsu Kim, Walid Saad, Merouane Debbah, Choong Seon Hong

    Abstract: The large communication and computation overhead of federated learning (FL) is one of the main challenges facing its practical deployment over resource-constrained clients and systems. In this work, SpaFL: a communication-efficient FL framework is proposed to optimize sparse model structures with low computational overhead. In SpaFL, a trainable threshold is defined for each filter/neuron to prune… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  17. arXiv:2405.20602  [pdf, other

    cs.LG cs.CL

    Masked Language Modeling Becomes Conditional Density Estimation for Tabular Data Synthesis

    Authors: Seunghwan An, Gyeongdong Woo, Jaesung Lim, ChangHyun Kim, Sungchul Hong, Jong-June Jeon

    Abstract: In this paper, our goal is to generate synthetic data for heterogeneous (mixed-type) tabular datasets with high machine learning utility (MLu). Given that the MLu performance relies on accurately approximating the conditional distributions, we focus on devising a synthetic data generation method based on conditional distribution estimation. We propose a novel synthetic data generation method, MaCo… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  18. arXiv:2405.19771  [pdf, other

    cs.NI eess.SP

    Data Service Maximization in Integrated Terrestrial-Non-Terrestrial 6G Networks: A Deep Reinforcement Learning Approach

    Authors: Nway Nway Ei, Kitae Kim, Yan Kyaw Tun, Choong Seon Hong

    Abstract: Integrating terrestrial and non-terrestrial networks has emerged as a promising paradigm to fulfill the constantly growing demand for connectivity, low transmission delay, and quality of services (QoS). This integration brings together the strengths of terrestrial and non-terrestrial networks, such as the reliability of terrestrial networks, broad coverage, and service continuity of non-terrestria… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 5 pages, 4 figures

  19. arXiv:2405.19757  [pdf, other

    cs.LG cs.AI

    Improving SMOTE via Fusing Conditional VAE for Data-adaptive Noise Filtering

    Authors: Sungchul Hong, Seunghwan An, Jong-June Jeon

    Abstract: Recent advances in a generative neural network model extend the development of data augmentation methods. However, the augmentation methods based on the modern generative models fail to achieve notable performance for class imbalance data compared to the conventional model, the SMOTE. We investigate the problem of the generative model for imbalanced classification and introduce a framework to enha… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  20. arXiv:2405.19704  [pdf, other

    stat.ML cs.LG stat.ME

    Enhancing Sufficient Dimension Reduction via Hellinger Correlation

    Authors: Seungbeom Hong, Ilmun Kim, Jun Song

    Abstract: In this work, we develop a new theory and method for sufficient dimension reduction (SDR) in single-index models, where SDR is a sub-field of supervised dimension reduction based on conditional independence. Our work is primarily motivated by the recent introduction of the Hellinger correlation as a dependency measure. Utilizing this measure, we develop a method capable of effectively detecting th… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  21. arXiv:2405.15230  [pdf, other

    cs.AI cs.LG

    $i$REPO: $i$mplicit Reward Pairwise Difference based Empirical Preference Optimization

    Authors: Long Tan Le, Han Shu, Tung-Anh Nguyen, Choong Seon Hong, Nguyen H. Tran

    Abstract: While astonishingly capable, large Language Models (LLM) can sometimes produce outputs that deviate from human expectations. Such deviations necessitate an alignment phase to prevent disseminating untruthful, toxic, or biased information. Traditional alignment methods based on reinforcement learning often struggle with the identified instability, whereas preference optimization methods are limited… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: Under Review

  22. arXiv:2405.13046  [pdf, other

    cs.CL cs.LG

    LeaPformer: Enabling Linear Transformers for Autoregressive and Simultaneous Tasks via Learned Proportions

    Authors: Victor Agostinelli, Sanghyun Hong, Lizhong Chen

    Abstract: A promising approach to preserving model performance in linearized transformers is to employ position-based re-weighting functions. However, state-of-the-art re-weighting functions rely heavily on target sequence lengths, making it difficult or impossible to apply them to autoregressive and simultaneous tasks, where the target and sometimes even the input sequence length are unknown. To address th… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

    Comments: Submitted and accepted at ICML 2024

  23. arXiv:2405.06078  [pdf, ps, other

    cs.CY cs.HC

    Collaborative Design for Job-Seekers with Autism: A Conceptual Framework for Future Research

    Authors: Sungsoo Ray Hong, Marcos Zampieri, Brittany N. Hand, Vivian Motti, Dongjun Chung, Ozlem Uzuner

    Abstract: The success of employment is highly related to a job seeker's capability of communicating and collaborating with others. While leveraging one's network during the job-seeking process is intuitive to the neurotypical, this can be challenging for people with autism. Recent empirical findings have started to show how facilitating collaboration between people with autism and their social surroundings… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  24. arXiv:2405.06073  [pdf, other

    cs.LG cs.CR

    Hard Work Does Not Always Pay Off: Poisoning Attacks on Neural Architecture Search

    Authors: Zachary Coalson, Huazheng Wang, Qingyun Wu, Sanghyun Hong

    Abstract: In this paper, we study the robustness of "data-centric" approaches to finding neural network architectures (known as neural architecture search) to data distribution shifts. To audit this robustness, we present a data poisoning attack, when injected to the training data used for architecture search that can prevent the victim algorithm from finding an architecture with optimal accuracy. We first… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  25. arXiv:2405.04746  [pdf, other

    cs.IR cs.AI cs.LG

    SVD-AE: Simple Autoencoders for Collaborative Filtering

    Authors: Seoyoung Hong, Jeongwhan Choi, Yeon-Chang Lee, Srijan Kumar, Noseong Park

    Abstract: Collaborative filtering (CF) methods for recommendation systems have been extensively researched, ranging from matrix factorization and autoencoder-based to graph filtering-based methods. Recently, lightweight methods that require almost no training have been recently proposed to reduce overall computation. However, existing methods still have room to improve the trade-offs among accuracy, efficie… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Accepted by IJCAI 2024

  26. arXiv:2405.03929  [pdf, other

    cs.AI physics.ao-ph

    Unicorn: U-Net for Sea Ice Forecasting with Convolutional Neural Ordinary Differential Equations

    Authors: Jaesung Park, Sungchul Hong, Yoonseo Cho, Jong-June Jeon

    Abstract: Sea ice at the North Pole is vital to global climate dynamics. However, accurately forecasting sea ice poses a significant challenge due to the intricate interaction among multiple variables. Leveraging the capability to integrate multiple inputs and powerful performances seamlessly, many studies have turned to neural networks for sea ice forecasting. This paper introduces a novel deep architectur… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  27. arXiv:2405.03239  [pdf, other

    cs.LG cs.AI

    Deep Learning for Detecting and Early Predicting Chronic Obstructive Pulmonary Disease from Spirogram Time Series: A UK Biobank Study

    Authors: Shuhao Mei, Yuxi Zhou, Jiahao Xu, Yuxuan Wan, Shan Cao, Qinghao Zhao, Shijia Geng, Junqing Xie, Shenda Hong

    Abstract: Chronic Obstructive Pulmonary Disease (COPD) is a chronic inflammatory lung condition that causes airflow obstruction. The existing methods can only detect patients who already have COPD based on obvious features shown in the spirogram (In this article, the spirogram specifically involves measuring Volume-Flow curve time series). Early prediction of COPD risk is vital for monitoring COPD disease p… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  28. arXiv:2405.00646  [pdf, other

    cs.CV cs.LG

    Learning to Compose: Improving Object Centric Learning by Injecting Compositionality

    Authors: Whie Jung, Jaehoon Yoo, Sungjin Ahn, Seunghoon Hong

    Abstract: Learning compositional representation is a key aspect of object-centric learning as it enables flexible systematic generalization and supports complex visual reasoning. However, most of the existing approaches rely on auto-encoding objective, while the compositionality is implicitly imposed by the architectural or algorithmic bias in the encoder. This misalignment between auto-encoding objective a… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  29. arXiv:2404.18459  [pdf, other

    cs.CV

    Chameleon: A Data-Efficient Generalist for Dense Visual Prediction in the Wild

    Authors: Donggyun Kim, Seongwoong Cho, Semin Kim, Chong Luo, Seunghoon Hong

    Abstract: Large language models have evolved data-efficient generalists, benefiting from the universal language interface and large-scale pre-training. However, constructing a data-efficient generalist for dense visual prediction presents a distinct challenge due to the variation in label structures across different tasks. Consequently, generalization to unseen dense prediction tasks in the low-data regime… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  30. arXiv:2404.18405  [pdf

    cs.HC

    Understanding and Shaping Human-Technology Assemblages in the Age of Generative AI

    Authors: Josh Andres, Chris Danta, Andrea Bianchi, Sungyeon Hong, Zhuying Li, Eduardo B. Sandoval, Charles Martin, Ned Cooper

    Abstract: Generative AI capabilities are rapidly transforming how we perceive, interact with, and relate to machines. This one-day workshop invites HCI researchers, designers, and practitioners to imaginatively inhabit and explore the possible futures that might emerge from humans combining generative AI capabilities into everyday technologies at massive scale. Workshop participants will craft stories, visu… ▽ More

    Submitted 4 May, 2024; v1 submitted 28 April, 2024; originally announced April 2024.

  31. arXiv:2404.16886  [pdf, other

    cs.LG cs.AI

    Review of Data-centric Time Series Analysis from Sample, Feature, and Period

    Authors: Chenxi Sun, Hongyan Li, Yaliang Li, Shenda Hong

    Abstract: Data is essential to performing time series analysis utilizing machine learning approaches, whether for classic models or today's large language models. A good time-series dataset is advantageous for the model's accuracy, robustness, and convergence, as well as task outcomes and costs. The emergence of data-centric AI represents a shift in the landscape from model refinement to prioritizing data q… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 9 pages, 1 figure

  32. arXiv:2404.16257  [pdf, other

    cs.CL cs.AI

    Translation of Multifaceted Data without Re-Training of Machine Translation Systems

    Authors: Hyeonseok Moon, Seungyoon Lee, Seongtae Hong, Seungjun Lee, Chanjun Park, Heuiseok Lim

    Abstract: Translating major language resources to build minor language resources becomes a widely-used approach. Particularly in translating complex data points composed of multiple components, it is common to translate each component separately. However, we argue that this practice often overlooks the interrelation between components within the same data point. To address this limitation, we propose a nove… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: 19 pages

  33. arXiv:2404.13419  [pdf, other

    cs.SC

    On Modeling Multi-Criteria Decision Making with Uncertain Information using Probabilistic Rules

    Authors: Shengxin Hong, Xiuyi Fan

    Abstract: Decision-making processes often involve dealing with uncertainty, which is traditionally addressed through probabilistic models. However, in practical scenarios, assessing probabilities reliably can be challenging, compounded by diverse perceptions of probabilistic information among decision makers. To address this variability and accommodate diverse preferences regarding uncertainty, we introduce… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

  34. arXiv:2404.09259  [pdf, other

    cs.CV cs.AI

    FedCCL: Federated Dual-Clustered Feature Contrast Under Domain Heterogeneity

    Authors: Yu Qiao, Huy Q. Le, Mengchun Zhang, Apurba Adhikary, Chaoning Zhang, Choong Seon Hong

    Abstract: Federated learning (FL) facilitates a privacy-preserving neural network training paradigm through collaboration between edge clients and a central server. One significant challenge is that the distributed data is not independently and identically distributed (non-IID), typically including both intra-domain and inter-domain heterogeneity. However, recent research is limited to simply using averaged… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

  35. arXiv:2404.06776  [pdf, other

    cs.LG cs.AI cs.CV

    Logit Calibration and Feature Contrast for Robust Federated Learning on Non-IID Data

    Authors: Yu Qiao, Chaoning Zhang, Apurba Adhikary, Choong Seon Hong

    Abstract: Federated learning (FL) is a privacy-preserving distributed framework for collaborative model training on devices in edge networks. However, challenges arise due to vulnerability to adversarial examples (AEs) and the non-independent and identically distributed (non-IID) nature of data distribution among devices, hindering the deployment of adversarially robust and accurate learning models at the e… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  36. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  37. arXiv:2404.01231  [pdf, other

    cs.CR cs.LG

    Privacy Backdoors: Enhancing Membership Inference through Poisoning Pre-trained Models

    Authors: Yuxin Wen, Leo Marchyok, Sanghyun Hong, Jonas Geiping, Tom Goldstein, Nicholas Carlini

    Abstract: It is commonplace to produce application-specific models by fine-tuning large pre-trained models using a small bespoke dataset. The widespread availability of foundation model checkpoints on the web poses considerable risks, including the vulnerability to backdoor attacks. In this paper, we unveil a new vulnerability: the privacy backdoor attack. This black-box privacy attack aims to amplify the p… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  38. arXiv:2403.12469  [pdf, other

    cs.CL cs.LG

    When Do "More Contexts" Help with Sarcasm Recognition?

    Authors: Ojas Nimase, Sanghyun Hong

    Abstract: Sarcasm recognition is challenging because it needs an understanding of the true intention, which is opposite to or different from the literal meaning of the words. Prior work has addressed this challenge by developing a series of methods that provide richer $contexts$, e.g., sentiment or cultural nuances, to models. While shown to be effective individually, no study has systematically evaluated t… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Accepted to LREC-COLING 2024 [Short]

  39. arXiv:2403.11981  [pdf, other

    cs.CR cs.CV cs.LG

    Diffusion Denoising as a Certified Defense against Clean-label Poisoning

    Authors: Sanghyun Hong, Nicholas Carlini, Alexey Kurakin

    Abstract: We present a certified defense to clean-label poisoning attacks. These attacks work by injecting a small number of poisoning samples (e.g., 1%) that contain $p$-norm bounded adversarial perturbations into the training data to induce a targeted misclassification of a test-time input. Inspired by the adversarial robustness achieved by $denoised$ $smoothing$, we show how an off-the-shelf diffusion mo… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  40. arXiv:2403.11120  [pdf, other

    cs.CV

    Unifying Feature and Cost Aggregation with Transformers for Semantic and Visual Correspondence

    Authors: Sunghwan Hong, Seokju Cho, Seungryong Kim, Stephen Lin

    Abstract: This paper introduces a Transformer-based integrative feature and cost aggregation network designed for dense matching tasks. In the context of dense matching, many works benefit from one of two forms of aggregation: feature aggregation, which pertains to the alignment of similar features, or cost aggregation, a procedure aimed at instilling coherence in the flow estimates across neighboring pixel… ▽ More

    Submitted 22 April, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

    Comments: Accepted by ICLR'24

  41. arXiv:2403.06294  [pdf, other

    cs.AI cs.MA cs.SC

    ArgMed-Agents: Explainable Clinical Decision Reasoning with LLM Disscusion via Argumentation Schemes

    Authors: Shengxin Hong, Liang Xiao, Xin Zhang, Jianxia Chen

    Abstract: There are two main barriers to using large language models (LLMs) in clinical reasoning. Firstly, while LLMs exhibit significant promise in Natural Language Processing (NLP) tasks, their performance in complex reasoning and planning falls short of expectations. Secondly, LLMs use uninterpretable methods to make clinical decisions that are fundamentally different from the clinician's cognitive proc… ▽ More

    Submitted 20 June, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

  42. arXiv:2403.05131  [pdf, other

    cs.AI cs.CV

    Sora as an AGI World Model? A Complete Survey on Text-to-Video Generation

    Authors: Joseph Cho, Fachrina Dewi Puspitasari, Sheng Zheng, Jingyao Zheng, Lik-Hang Lee, Tae-Ho Kim, Choong Seon Hong, Chaoning Zhang

    Abstract: The evolution of video generation from text, starting with animating MNIST numbers to simulating the physical world with Sora, has progressed at a breakneck speed over the past seven years. While often seen as a superficial expansion of the predecessor text-to-image generation model, text-to-video generation models are developed upon carefully engineered constituents. Here, we systematically discu… ▽ More

    Submitted 7 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Comments: First complete survey on Text-to-Video Generation, 44 pages, 20 figures

  43. arXiv:2403.04982  [pdf, other

    cs.AR

    A 28.6 mJ/iter Stable Diffusion Processor for Text-to-Image Generation with Patch Similarity-based Sparsity Augmentation and Text-based Mixed-Precision

    Authors: Jiwon Choi, Wooyoung Jo, Seongyon Hong, Beomseok Kwon, Wonhoon Park, Hoi-Jun Yoo

    Abstract: This paper presents an energy-efficient stable diffusion processor for text-to-image generation. While stable diffusion attained attention for high-quality image synthesis results, its inherent characteristics hinder its deployment on mobile platforms. The proposed processor achieves high throughput and energy efficiency with three key features as solutions: 1) Patch similarity-based sparsity augm… ▽ More

    Submitted 14 March, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

    Comments: Accepted at 2024 IEEE International Symposium on Circuits and Systems (ISCAS)

  44. arXiv:2403.02803  [pdf, other

    cs.CV

    Towards Robust Federated Learning via Logits Calibration on Non-IID Data

    Authors: Yu Qiao, Apurba Adhikary, Chaoning Zhang, Choong Seon Hong

    Abstract: Federated learning (FL) is a privacy-preserving distributed management framework based on collaborative model training of distributed devices in edge networks. However, recent studies have shown that FL is vulnerable to adversarial examples (AEs), leading to a significant drop in its performance. Meanwhile, the non-independent and identically distributed (non-IID) challenge of data distribution be… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: Accepted by IEEE NOMS 2024

  45. arXiv:2403.01722  [pdf, other

    cs.HC

    Closing the Knowledge Gap in Designing Data Annotation Interfaces for AI-powered Disaster Management Analytic Systems

    Authors: Zinat Ara, Hossein Salemi, Sungsoo Ray Hong, Yasas Senarath, Steve Peterson, Amanda Lee Hughes, Hemant Purohit

    Abstract: Data annotation interfaces predominantly leverage ground truth labels to guide annotators toward accurate responses. With the growing adoption of Artificial Intelligence (AI) in domain-specific professional tasks, it has become increasingly important to help beginning annotators identify how their early-stage knowledge can lead to inaccurate answers, which in turn, helps to ensure quality annotati… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

  46. arXiv:2403.01715  [pdf, other

    cs.HC

    Collaborative Job Seeking for People with Autism: Challenges and Design Opportunities

    Authors: Zinat Ara, Amrita Ganguly, Donna Peppard, Dongjun Chung, Slobodan Vucetic, Vivian Genaro Motti, Sungsoo Ray Hong

    Abstract: Successful job search results from job seekers' well-shaped social communication. While well-known differences in communication exist between people with autism and neurotypicals, little is known about how people with autism collaborate with their social surroundings to strive in the job market. To better understand the practices and challenges of collaborative job seeking for people with autism,… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

  47. arXiv:2402.18679  [pdf, other

    cs.AI cs.LG

    Data Interpreter: An LLM Agent For Data Science

    Authors: Sirui Hong, Yizhang Lin, Bang Liu, Bangbang Liu, Binhao Wu, Danyang Li, Jiaqi Chen, Jiayi Zhang, Jinlin Wang, Li Zhang, Lingyao Zhang, Min Yang, Mingchen Zhuge, Taicheng Guo, Tuo Zhou, Wei Tao, Wenyi Wang, Xiangru Tang, Xiangtao Lu, Xiawu Zheng, Xinbing Liang, Yaying Fei, Yuheng Cheng, Zongze Xu, Chenglin Wu

    Abstract: Large Language Model (LLM)-based agents have demonstrated remarkable effectiveness. However, their performance can be compromised in data science scenarios that require real-time data adjustment, expertise in optimization due to complex dependencies among various tasks, and the ability to identify logical errors for precise reasoning. In this study, we introduce the Data Interpreter, a solution de… ▽ More

    Submitted 12 March, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  48. arXiv:2402.12721  [pdf, other

    cs.CV cs.AI

    PAC-FNO: Parallel-Structured All-Component Fourier Neural Operators for Recognizing Low-Quality Images

    Authors: Jinsung Jeon, Hyundong Jin, Jonghyun Choi, Sanghyun Hong, Dongeun Lee, Kookjin Lee, Noseong Park

    Abstract: A standard practice in developing image recognition models is to train a model on a specific image resolution and then deploy it. However, in real-world inference, models often encounter images different from the training sets in resolution and/or subject to natural variations such as weather changes, noise types and compression artifacts. While traditional solutions involve training multiple mode… ▽ More

    Submitted 14 March, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: Accepted at ICLR 2024

  49. arXiv:2402.06638  [pdf, other

    q-fin.ST cs.AI cs.DC cs.LG

    Transformers with Attentive Federated Aggregation for Time Series Stock Forecasting

    Authors: Chu Myaet Thwal, Ye Lin Tun, Kitae Kim, Seong-Bae Park, Choong Seon Hong

    Abstract: Recent innovations in transformers have shown their superior performance in natural language processing (NLP) and computer vision (CV). The ability to capture long-range dependencies and interactions in sequential data has also triggered a great interest in time series modeling, leading to the widespread use of transformers in many time series applications. However, being the most common and cruci… ▽ More

    Submitted 22 January, 2024; originally announced February 2024.

    Comments: Published in IEEE ICOIN 2023

  50. arXiv:2402.02972  [pdf, other

    cs.CV cs.LG

    Retrieval-Augmented Score Distillation for Text-to-3D Generation

    Authors: Junyoung Seo, Susung Hong, Wooseok Jang, Inès Hyeonsu Kim, Minseop Kwak, Doyup Lee, Seungryong Kim

    Abstract: Text-to-3D generation has achieved significant success by incorporating powerful 2D diffusion models, but insufficient 3D prior knowledge also leads to the inconsistency of 3D geometry. Recently, since large-scale multi-view datasets have been released, fine-tuning the diffusion model on the multi-view datasets becomes a mainstream to solve the 3D inconsistency problem. However, it has confronted… ▽ More

    Submitted 2 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: Accepted to ICML 2024 / Project Page: https://ku-cvlab.github.io/ReDream/