Showing 1–50 of 210 results for author: Jang, J

  1. arXiv:2407.10646  [pdf, other]

    cs.SD eess.AS

    Towards zero-shot amplifier modeling: One-to-many amplifier modeling via tone embedding control

    Authors: Yu-Hua Chen, Yen-Tung Yeh, Yuan-Chiao Cheng, Jui-Te Wu, Yu-Hsiang Ho, Jyh-Shing Roger Jang, Yi-Hsuan Yang

    Abstract: Replicating analog device circuits through neural audio effect modeling has garnered increasing interest in recent years. Existing work has predominantly focused on a one-to-one emulation strategy, modeling specific devices individually. In this paper, we tackle the less-explored scenario of one-to-many emulation, utilizing conditioning mechanisms to emulate multiple guitar amplifiers through a si…

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: ISMIR 2024

  2. arXiv:2407.07110  [pdf, other]

    cs.LG cs.AI eess.SP

    Foundation Models for Electrocardiograms

    Authors: Junho Song, Jong-Hwan Jang, Byeong Tak Lee, DongGyun Hong, Joon-myoung Kwon, Yong-Yeon Jo

    Abstract: Foundation models, enhanced by self-supervised learning (SSL) techniques, represent a cutting-edge frontier in biomedical signal analysis, particularly for electrocardiograms (ECGs), crucial for cardiac health monitoring and diagnosis. This study conducts a comprehensive analysis of foundation models for ECGs by employing and refining innovative SSL methodologies - namely, generative and contrasti…

    Submitted 25 June, 2024; originally announced July 2024.

    Comments: 27 pages

  3. arXiv:2407.00657  [pdf, other]

    cs.SD cs.LG eess.AS

    Improving Real-Time Music Accompaniment Separation with MMDenseNet

    Authors: Chun-Hsiang Wang, Chung-Che Wang, Jun-You Wang, Jyh-Shing Roger Jang, Yen-Hsun Chu

    Abstract: Music source separation aims to separate polyphonic music into different types of sources. Most existing methods focus on enhancing the quality of separated results by using a larger model structure, rendering them unsuitable for deployment on edge devices. Moreover, these methods may produce low-quality output when the input duration is short, making them impractical for real-time applications. T…

    Submitted 30 June, 2024; originally announced July 2024.

  4. arXiv:2406.15751  [pdf, other]

    cs.SD eess.AS

    Improving Unsupervised Clean-to-Rendered Guitar Tone Transformation Using GANs and Integrated Unaligned Clean Data

    Authors: Yu-Hua Chen, Woosung Choi, Wei-Hsiang Liao, Marco Martínez-Ramírez, Kin Wai Cheuk, Yuki Mitsufuji, Jyh-Shing Roger Jang, Yi-Hsuan Yang

    Abstract: Recent years have seen increasing interest in applying deep learning methods to the modeling of guitar amplifiers or effect pedals. Existing methods are mainly based on the supervised approach, requiring temporally-aligned data pairs of unprocessed and rendered audio. However, this approach does not scale well, due to the complicated process involved in creating the data pairs. A very recent work…

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: Accepted to DAFx 2024

  5. arXiv:2406.09560  [pdf, other]

    cs.CE astro-ph.IM nucl-ex physics.ins-det

    Computational generation of tailored radionuclide libraries for alpha-particle and gamma-ray spectrometry

    Authors: Jaewoong Jang

    Abstract: Radionuclide identification is a radioanalytical method employed in various scientific disciplines that utilize alpha-particle or gamma-ray spectrometric assays, ranging from astrophysics to nuclear medicine. Radionuclide libraries in conventional radionuclide identification systems are crafted in a manual fashion, accompanying labor-intensive and error-prone user tasks and hindering library custo…

    Submitted 13 June, 2024; originally announced June 2024.

  6. arXiv:2406.08528  [pdf, other]

    cs.CV cs.LG

    Adaptive Teaching with Shared Classifier for Knowledge Distillation

    Authors: Jaeyeon Jang, Young-Ik Kim, Jisu Lim, Hyeonseong Lee

    Abstract: Knowledge distillation (KD) is a technique used to transfer knowledge from an overparameterized teacher network to a less-parameterized student network, thereby minimizing the incurred performance loss. KD methods can be categorized into offline and online approaches. Offline KD leverages a powerful pretrained teacher network, while online KD allows the teacher network to be adjusted dynamically t…

    Submitted 14 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

  7. arXiv:2406.05761  [pdf, other]

    cs.CL

    The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models

    Authors: Seungone Kim, Juyoung Suk, Ji Yong Cho, Shayne Longpre, Chaeeun Kim, Dongkeun Yoon, Guijin Son, Yejin Cho, Sheikh Shafayat, Jinheon Baek, Sue Hyun Park, Hyeonbin Hwang, Jinkyung Jo, Hyowon Cho, Haebin Shin, Seongyun Lee, Hanseok Oh, Noah Lee, Namgyu Ho, Se June Joo, Miyoung Ko, Yoonjoo Lee, Hyungjoo Chae, Jamin Shin, Joel Jang, et al. (7 additional authors not shown)

    Abstract: As language models (LMs) become capable of handling a wide range of tasks, their evaluation is becoming as challenging as their development. Most generation benchmarks currently assess LMs using abstract evaluation criteria like helpfulness and harmlessness, which often lack the flexibility and granularity of human assessment. Additionally, these benchmarks tend to focus disproportionately on spec…

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Work in Progress

  8. arXiv:2406.04582  [pdf, other]

    eess.AS cs.SD

    Neural Codec-based Adversarial Sample Detection for Speaker Verification

    Authors: Xuanjun Chen, Jiawei Du, Haibin Wu, Jyh-Shing Roger Jang, Hung-yi Lee

    Abstract: Automatic Speaker Verification (ASV), increasingly used in security-critical applications, faces vulnerabilities from rising adversarial attacks, with few effective defenses available. In this paper, we propose a neural codec-based adversarial sample detection method for ASV. The approach leverages the codec's ability to discard redundant perturbations and retain essential information. Specificall…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  9. arXiv:2405.17821  [pdf, other]

    cs.CV cs.AI

    RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in LVLMs

    Authors: Sangmin Woo, Jaehyuk Jang, Donguk Kim, Yubin Choi, Changick Kim

    Abstract: Recent advancements in Large Vision Language Models (LVLMs) have revolutionized how machines understand and generate textual responses based on visual inputs. Despite their impressive capabilities, they often produce "hallucinatory" outputs that do not accurately reflect the visual information, posing challenges in reliability and trustworthiness. Current methods such as contrastive decoding have…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Project page: https://sangminwoo.github.io/RITUAL/

  10. arXiv:2405.17820  [pdf, other]

    cs.CV cs.AI

    Don't Miss the Forest for the Trees: Attentional Vision Calibration for Large Vision Language Models

    Authors: Sangmin Woo, Donguk Kim, Jaehyuk Jang, Yubin Choi, Changick Kim

    Abstract: This study addresses the issue observed in Large Vision Language Models (LVLMs), where excessive attention on a few image tokens, referred to as blind tokens, leads to hallucinatory responses in tasks requiring fine-grained understanding of visual objects. We found that tokens receiving lower attention weights often hold essential information for identifying nuanced object details -- ranging from…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Project page: https://sangminwoo.github.io/AvisC/

  11. arXiv:2405.15101  [pdf, other]

    cs.RO eess.SY

    Social Zone as a Barrier Function for Socially-Compliant Robot Navigation

    Authors: Junwoo Jang, Maani Ghaffari

    Abstract: This study addresses the challenge of integrating social norms into robot navigation, which is essential for ensuring that robots operate safely and efficiently in human-centric environments. Social norms, often unspoken and implicitly understood among people, are difficult to explicitly define and implement in robotic systems. To overcome this, we derive these norms from real human trajectory dat…

    Submitted 23 May, 2024; originally announced May 2024.

  12. arXiv:2405.11802  [pdf, other]

    cs.HC cs.AI cs.LG

    Counterfactual Explanation-Based Badminton Motion Guidance Generation Using Wearable Sensors

    Authors: Minwoo Seong, Gwangbin Kim, Yumin Kang, Junhyuk Jang, Joseph DelPreto, SeungJun Kim

    Abstract: This study proposes a framework for enhancing the stroke quality of badminton players by generating personalized motion guides, utilizing a multimodal wearable dataset. These guides are based on counterfactual algorithms and aim to reduce the performance gap between novice and expert players. Our approach provides joint-level guidance through visualizable data to assist players in improving their…

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: ICRA Wearable Workshop 2024 - 1st Workshop on Advancing Wearable Devices and Applications through Novel Design, Sensing, Actuation, and AI

  13. arXiv:2405.01314  [pdf, other]

    eess.SY cs.LG

    Non-iterative Optimization of Trajectory and Radio Resource for Aerial Network

    Authors: Hyeonsu Lyu, Jonggyu Jang, Harim Lee, Hyun Jong Yang

    Abstract: We address a joint trajectory planning, user association, resource allocation, and power control problem to maximize proportional fairness in the aerial IoT network, considering practical end-to-end quality-of-service (QoS) and communication schedules. Though the problem is rather ancient, apart from the fact that the previous approaches have never considered user- and time-specific QoS, we point…

    Submitted 2 May, 2024; originally announced May 2024.

  14. arXiv:2404.17799  [pdf, other]

    cs.LG cs.AI

    Personalized Federated Learning via Sequential Layer Expansion in Representation Learning

    Authors: Jaewon Jang, Bonjun Choi

    Abstract: Federated learning ensures the privacy of clients by conducting distributed training on individual client devices and sharing only the model weights with a central server. However, in real-world scenarios, the heterogeneity of data among clients necessitates appropriate personalization methods. In this paper, we aim to address this heterogeneity using a form of parameter decoupling known as repres…

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: 12 pages, 7 figures

  15. arXiv:2404.15686  [pdf, ps, other]

    cs.CR

    Noise Variance Optimization in Differential Privacy: A Game-Theoretic Approach Through Per-Instance Differential Privacy

    Authors: Sehyun Ryu, Jonggyu Jang, Hyun Jong Yang

    Abstract: The concept of differential privacy (DP) can quantitatively measure privacy loss by observing the changes in the distribution caused by the inclusion of individuals in the target dataset. The DP, which is generally used as a constraint, has been prominent in safeguarding datasets in machine learning in industry giants like Apple and Google. A common methodology for guaranteeing DP is incorporating…

    Submitted 27 April, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

  16. arXiv:2404.14687  [pdf, other]

    cs.MM cs.AI cs.CL cs.CV

    Pegasus-v1 Technical Report

    Authors: Raehyuk Jung, Hyojun Go, Jaehyuk Yi, Jiho Jang, Daniel Kim, Jay Suh, Aiden Lee, Cooper Han, Jae Lee, Jeff Kim, Jin-Young Kim, Junwan Kim, Kyle Park, Lucas Lee, Mars Ha, Minjoon Seo, Abraham Jo, Ed Park, Hassan Kianinejad, SJ Kim, Tony Moon, Wade Jeong, Andrei Popescu, Esther Kim, EK Yoon, et al. (19 additional authors not shown)

    Abstract: This technical report introduces Pegasus-1, a multimodal language model specialized in video content understanding and interaction through natural language. Pegasus-1 is designed to address the unique challenges posed by video data, such as interpreting spatiotemporal information, to offer nuanced video content comprehension across various lengths. This technical report overviews Pegasus-1's archi…

    Submitted 22 April, 2024; originally announced April 2024.

  17. arXiv:2404.09443  [pdf, other]

    cs.LG cs.DC

    Hybrid FedGraph: An efficient hybrid federated learning algorithm using graph convolutional neural network

    Authors: Jaeyeon Jang, Diego Klabjan, Veena Mendiratta, Fanfei Meng

    Abstract: Federated learning is an emerging paradigm for decentralized training of machine learning models on distributed clients, without revealing the data to the central server. Most existing works have focused on horizontal or vertical data distributions, where each client possesses different samples with shared features, or each client fully shares only sample indices, respectively. However, the hybrid…

    Submitted 15 April, 2024; originally announced April 2024.

  18. arXiv:2404.02477  [pdf, ps, other]

    eess.SP cs.AI

    Enhancing Sum-Rate Performance in Constrained Multicell Networks: A Low-Information Exchange Approach

    Authors: Youjin Kim, Jonggyu Jang, Hyun Jong Yang

    Abstract: Despite the extensive research on massive MIMO systems for 5G telecommunications and beyond, the reality is that many deployed base stations are equipped with a limited number of antennas rather than supporting massive MIMO configurations. Furthermore, while the cell-less network concept, which eliminates cell boundaries, is under investigation, practical deployments often grapple with significant…

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: 5 pages, 12 figures

  19. arXiv:2404.01954  [pdf, other]

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han, et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t…

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  20. arXiv:2404.01842  [pdf, other]

    cs.CV

    Semi-Supervised Domain Adaptation for Wildfire Detection

    Authors: JooYoung Jang, Youngseo Cha, Jisu Kim, SooHyung Lee, Geonu Lee, Minkook Cho, Young Hwang, Nojun Kwak

    Abstract: Recently, both the frequency and intensity of wildfires have increased worldwide, primarily due to climate change. In this paper, we propose a novel protocol for wildfire detection, leveraging semi-supervised Domain Adaptation for object detection, accompanied by a corresponding dataset designed for use by both academics and industries. Our dataset encompasses 30 times more diverse labeled scenes…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 16 pages, 5 figures, 22 tables

  21. arXiv:2404.00974  [pdf, other]

    cs.CV

    Improving Visual Recognition with Hyperbolical Visual Hierarchy Mapping

    Authors: Hyeongjun Kwon, Jinhyun Jang, Jin Kim, Kwonyoung Kim, Kwanghoon Sohn

    Abstract: Visual scenes are naturally organized in a hierarchy, where a coarse semantic is recursively comprised of several fine details. Exploring such a visual hierarchy is crucial to recognize the complex relations of visual elements, leading to a comprehensive scene understanding. In this paper, we propose a Visual Hierarchy Mapper (Hi-Mapper), a novel approach for enhancing the structured understanding…

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: This paper is accepted to CVPR 2024. The supplementary material is included. The code is available at https://github.com/kwonjunn01/Hi-Mapper

  22. arXiv:2403.09488  [pdf, other]

    cs.CL cs.AI

    Rectifying Demonstration Shortcut in In-Context Learning

    Authors: Joonwon Jang, Sanghwan Jang, Wonbin Kweon, Minjin Jeon, Hwanjo Yu

    Abstract: Large language models (LLMs) are able to solve various tasks with only a few demonstrations utilizing their in-context learning (ICL) abilities. However, LLMs often rely on their pre-trained semantic priors of demonstrations rather than on the input-label relationships to proceed with ICL prediction. In this work, we term this phenomenon as the 'Demonstration Shortcut'. While previous works have p…

    Submitted 15 April, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: NAACL 2024

  23. arXiv:2403.09024  [pdf, other]

    cs.CL cs.AI

    Semiparametric Token-Sequence Co-Supervision

    Authors: Hyunji Lee, Doyoung Kim, Jihoon Jun, Sejune Joo, Joel Jang, Kyoung-Woon On, Minjoon Seo

    Abstract: In this work, we introduce a semiparametric token-sequence co-supervision training method. It trains a language model by simultaneously leveraging supervision from the traditional next token prediction loss which is calculated over the parametric token embedding space and the next sequence prediction loss which is calculated over the nonparametric sequence embedding space. The nonparametric sequen…

    Submitted 13 March, 2024; originally announced March 2024.

  24. arXiv:2403.08299  [pdf, other]

    cs.SE cs.AI

    AutoDev: Automated AI-Driven Development

    Authors: Michele Tufano, Anisha Agarwal, Jinu Jang, Roshanak Zilouchian Moghaddam, Neel Sundaresan

    Abstract: The landscape of software development has witnessed a paradigm shift with the advent of AI-powered assistants, exemplified by GitHub Copilot. However, existing solutions are not leveraging all the potential capabilities available in an IDE such as building, testing, executing code, git operations, etc. Therefore, they are constrained by their limited capabilities, primarily focusing on suggesting…

    Submitted 13 March, 2024; originally announced March 2024.

  25. "It's the only thing I can trust": Envisioning Large Language Model Use by Autistic Workers for Communication Assistance

    Authors: JiWoong Jang, Sanika Moharana, Patrick Carrington, Andrew Begel

    Abstract: Autistic adults often experience stigma and discrimination at work, leading them to seek social communication support from coworkers, friends, and family despite emotional risks. Large language models (LLMs) are increasingly considered an alternative. In this work, we investigate the phenomenon of LLM use by autistic adults at work and explore opportunities and risks of LLMs as a source of social…

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 19 pages, 6 figures, CHI '24 Conference

  26. arXiv:2403.01861  [pdf, other]

    cs.RO cs.AI cs.CV

    AiSDF: Structure-aware Neural Signed Distance Fields in Indoor Scenes

    Authors: Jaehoon Jang, Inha Lee, Minje Kim, Kyungdon Joo

    Abstract: Indoor scenes we are living in are visually homogenous or textureless, while they inherently have structural forms and provide enough structural priors for 3D scene reconstruction. Motivated by this fact, we propose a structure-aware online signed distance fields (SDF) reconstruction framework in indoor scenes, especially under the Atlanta world (AW) assumption. Thus, we dub this incremental SDF r…

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: 8 pages, 6 figures, Accepted to IEEE RA-L (First two authors contributed equally)

    Journal ref: IEEE Robotics and Automation Letters (RA-L), vol. 9, no. 5, pp. 4106-4113, 2024

  27. arXiv:2403.01189  [pdf, other]

    cs.LG cs.CV

    Training Unbiased Diffusion Models From Biased Dataset

    Authors: Yeongmin Kim, Byeonghu Na, Minsang Park, JoonHo Jang, Dongjun Kim, Wanmo Kang, Il-Chul Moon

    Abstract: With significant advancements in diffusion models, addressing the potential risks of dataset bias becomes increasingly important. Since generated outputs directly suffer from dataset bias, mitigating latent bias becomes a key factor in improving sample quality and proportion. This paper proposes time-dependent importance reweighting to mitigate the bias for the diffusion models. We demonstrate tha…

    Submitted 2 March, 2024; originally announced March 2024.

    Comments: International Conference on Learning Representations (ICLR 2024)

  28. arXiv:2402.14261  [pdf, other]

    cs.SE cs.AI

    Copilot Evaluation Harness: Evaluating LLM-Guided Software Programming

    Authors: Anisha Agarwal, Aaron Chan, Shubham Chandel, Jinu Jang, Shaun Miller, Roshanak Zilouchian Moghaddam, Yevhen Mohylevskyy, Neel Sundaresan, Michele Tufano

    Abstract: The integration of Large Language Models (LLMs) into Development Environments (IDEs) has become a focal point in modern software development. LLMs such as OpenAI GPT-3.5/4 and Code Llama offer the potential to significantly augment developer productivity by serving as intelligent, chat-driven programming assistants. However, utilizing LLMs out of the box is unlikely to be optimal for any given sce…

    Submitted 21 February, 2024; originally announced February 2024.

  29. arXiv:2402.13226  [pdf, other]

    eess.IV cs.AI cs.CE eess.SP

    NeRF Solves Undersampled MRI Reconstruction

    Authors: Tae Jun Jang, Chang Min Hyun

    Abstract: This article presents a novel undersampled magnetic resonance imaging (MRI) technique that leverages the concept of Neural Radiance Field (NeRF). With radial undersampling, the corresponding imaging problem can be reformulated into an image modeling task from sparse-view rendered data; therefore, a high dimensional MR image is obtainable from undersampled k-space data by taking advantage of implic…

    Submitted 2 March, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  30. arXiv:2402.13018  [pdf, other]

    eess.AS cs.SD

    EMO-SUPERB: An In-depth Look at Speech Emotion Recognition

    Authors: Haibin Wu, Huang-Cheng Chou, Kai-Wei Chang, Lucas Goncalves, Jiawei Du, Jyh-Shing Roger Jang, Chi-Chun Lee, Hung-Yi Lee

    Abstract: Speech emotion recognition (SER) is a pivotal technology for human-computer interaction systems. However, 80.77% of SER papers yield results that cannot be reproduced. We develop EMO-SUPERB, short for EMOtion Speech Universal PERformance Benchmark, which aims to enhance open-source initiatives for SER. EMO-SUPERB includes a user-friendly codebase to leverage 15 state-of-the-art speech self-supervi…

    Submitted 12 March, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: webpage: https://emosuperb.github.io/

  31. arXiv:2402.01428  [pdf, ps, other]

    cs.LO cs.PL

    Adjoint Natural Deduction (Extended Version)

    Authors: Junyoung Jang, Sophia Roshal, Frank Pfenning, Brigitte Pientka

    Abstract: Adjoint logic is a general approach to combining multiple logics with different structural properties, including linear, affine, strict, and (ordinary) intuitionistic logics, where each proposition has an intrinsic mode of truth. It has been defined in the form of a sequent calculus because the central concept of independence is most clearly understood in this form, and because it permits a proof…

    Submitted 2 February, 2024; originally announced February 2024.

  32. arXiv:2401.17212  [pdf, other]

    cs.CV

    ContactGen: Contact-Guided Interactive 3D Human Generation for Partners

    Authors: Dongjun Gu, Jaehyeok Shim, Jaehoon Jang, Changwoo Kang, Kyungdon Joo

    Abstract: Among various interactions between humans, such as eye contact and gestures, physical interactions by contact can act as an essential moment in understanding human behaviors. Inspired by this fact, given a 3D partner human with the desired interaction label, we introduce a new task of 3D human generation in terms of physical contact. Unlike previous works of interacting with static objects or scen…

    Submitted 3 February, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

    Comments: Accepted by AAAI 2024

  33. arXiv:2401.10695  [pdf, other]

    cs.CL

    LangBridge: Multilingual Reasoning Without Multilingual Supervision

    Authors: Dongkeun Yoon, Joel Jang, Sungdong Kim, Seungone Kim, Sheikh Shafayat, Minjoon Seo

    Abstract: We introduce LangBridge, a zero-shot approach to adapt language models for multilingual reasoning tasks without multilingual supervision. LangBridge operates by bridging two models, each specialized in different aspects: (1) one specialized in understanding multiple languages (e.g., mT5 encoder) and (2) one specialized in reasoning (e.g., MetaMath). LangBridge connects the two models by introducin…

    Submitted 3 June, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: ACL 2024 Main

  34. arXiv:2312.15890  [pdf, other]

    cs.CV

    Towards Robust Multimodal Prompting With Missing Modalities

    Authors: Jaehyuk Jang, Yooseung Wang, Changick Kim

    Abstract: Recently, multimodal prompting, which introduces learnable missing-aware prompts for all missing modality cases, has exhibited impressive performance. However, it encounters two critical issues: 1) The number of prompts grows exponentially as the number of modalities increases; and 2) It lacks robustness in scenarios with different missing modality settings between training and inference. In this…

    Submitted 26 December, 2023; v1 submitted 26 December, 2023; originally announced December 2023.

    Comments: Accepted to ICASSP 2024

  35. arXiv:2312.11042  [pdf, other]

    cs.ET

    VECOM: Variation Resilient Encoding and Offset Compensation Schemes for Reliable ReRAM Based DNN Accelerator

    Authors: Je-Woo Jang, Thai-Hoang Nguyen, Joon-Sung Yang

    Abstract: Resistive Random Access Memory (ReRAM) based Processing In Memory (PIM) Accelerator has emerged as a promising computing architecture for memory intensive applications, such as Deep Neural Networks (DNNs). However, due to its immaturity, ReRAM devices often suffer from various reliability issues, which hinder the practicality of the PIM architecture and lead to a severe degradation in DNN accuracy…

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: 9 pages, 13 figures, This paper was accepted in International Conference on Computer-Aided Design (ICCAD) 2023

  36. arXiv:2312.10486  [pdf, other]

    cs.DB

    Time-Constrained Continuous Subgraph Matching Using Temporal Information for Filtering and Backtracking

    Authors: Seunghwan Min, Jihoon Jang, Kunsoo Park, Dora Giammarresi, Giuseppe F. Italiano, Wook-Shin Han

    Abstract: Real-time analysis of graphs containing temporal information, such as social media streams, Q&A networks, and cyber data sources, plays an important role in various applications. Among them, detecting patterns is one of the fundamental graph analysis problems. In this paper, we study time-constrained continuous subgraph matching, which detects a pattern with a strict partial order on the edge set…

    Submitted 16 December, 2023; originally announced December 2023.

  37. arXiv:2312.07040  [pdf, ps, other]

    cs.AI cs.CR

    Patch-MI: Enhancing Model Inversion Attacks via Patch-Based Reconstruction

    Authors: Jonggyu Jang, Hyeonsu Lyu, Hyun Jong Yang

    Abstract: Model inversion (MI) attacks aim to reveal sensitive information in training datasets by solely accessing model weights. Generative MI attacks, a prominent strand in this field, utilize auxiliary datasets to recreate target data attributes, restricting the images to remain photo-realistic, but their success often depends on the similarity between auxiliary and target datasets. If the distributions…

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: 11 pages

  38. arXiv:2312.05586  [pdf, other]

    cs.LG cs.AI

    Deeper Understanding of Black-box Predictions via Generalized Influence Functions

    Authors: Hyeonsu Lyu, Jonggyu Jang, Sehyun Ryu, Hyun Jong Yang

    Abstract: Influence functions (IFs) elucidate how training data changes model behavior. However, the increasing size and non-convexity in large-scale models make IFs inaccurate. We suspect that the fragility comes from the first-order approximation which may cause nuisance changes in parameters irrelevant to the examined data. However, simply computing influence from the chosen parameters can be misleading,…

    Submitted 6 May, 2024; v1 submitted 9 December, 2023; originally announced December 2023.

    Comments: 16 pages, 6 figures, and 2 tables

    ACM Class: I.2.0

  39. arXiv:2311.17822  [pdf, other]

    cs.AI

    Anomalous Behavior Detection in Trajectory Data of Older Drivers

    Authors: Seyedeh Gol Ara Ghoreishi, Sonia Moshfeghi, Muhammad Tanveer Jan, Joshua Conniff, KwangSoo Yang, Jinwoo Jang, Borko Furht, Ruth Tappen, David Newman, Monica Rosselli, Jiannan Zhai

    Abstract: Given a road network and a set of trajectory data, the anomalous behavior detection (ABD) problem is to identify drivers that show significant directional deviations, hardbrakings, and accelerations in their trips. The ABD problem is important in many societal applications, including Mild Cognitive Impairment (MCI) detection and safe route recommendations for older drivers. The ABD problem is comp…

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: IEEE HONET 2023

  40. arXiv:2311.16267  [pdf, other]

    cs.CL cs.SE

    Novel Preprocessing Technique for Data Embedding in Engineering Code Generation Using Large Language Model

    Authors: Yu-Chen Lin, Akhilesh Kumar, Norman Chang, Wenliang Zhang, Muhammad Zakir, Rucha Apte, Haiyang He, Chao Wang, Jyh-Shing Roger Jang

    Abstract: We present four main contributions to enhance the performance of Large Language Models (LLMs) in generating domain-specific code: (i) utilizing LLM-based data splitting and data renovation techniques to improve the semantic representation of embeddings' space; (ii) introducing the Chain of Density for Renovation Credibility (CoDRC), driven by LLMs, and the Adaptive Text Renovation (ATR) algorithm…

    Submitted 30 January, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

  41. arXiv:2311.12488  [pdf, other]

    eess.AS cs.SD

    Adapting pretrained speech model for Mandarin lyrics transcription and alignment

    Authors: Jun-You Wang, Chon-In Leong, Yu-Chen Lin, Li Su, Jyh-Shing Roger Jang

    Abstract: The tasks of automatic lyrics transcription and lyrics alignment have witnessed significant performance improvements in the past few years. However, most of the previous works only focus on English in which large-scale datasets are available. In this paper, we address lyrics transcription and alignment of polyphonic Mandarin pop music in a low-resource setting. To deal with the data scarcity issue…

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: Accepted by ASRU 2023

  42. arXiv:2311.10702  [pdf, other]

    cs.CL

    Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2

    Authors: Hamish Ivison, Yizhong Wang, Valentina Pyatkin, Nathan Lambert, Matthew Peters, Pradeep Dasigi, Joel Jang, David Wadden, Noah A. Smith, Iz Beltagy, Hannaneh Hajishirzi

    Abstract: Since the release of TÜLU [Wang et al., 2023b], open resources for instruction tuning have developed quickly, from better base models to new finetuning techniques. We test and incorporate a number of these advances into TÜLU, resulting in TÜLU 2, a suite of improved TÜLU models for advancing the understanding and best practices of adapting pretrained language models to downstream tasks and user pr…

    Submitted 19 November, 2023; v1 submitted 17 November, 2023; originally announced November 2023.

    Comments: technical report; fixed zephyr numbers

  43. arXiv:2311.09273  [pdf, other]

    cs.HC cs.AI

    In-vehicle Sensing and Data Analysis for Older Drivers with Mild Cognitive Impairment

    Authors: Sonia Moshfeghi, Muhammad Tanveer Jan, Joshua Conniff, Seyedeh Gol Ara Ghoreishi, Jinwoo Jang, Borko Furht, Kwangsoo Yang, Monica Rosselli, David Newman, Ruth Tappen, Dana Smith

    Abstract: Driving is a complex daily activity indicating age and disease related cognitive declines. Therefore, deficits in driving performance compared with ones without mild cognitive impairment (MCI) can reflect changes in cognitive functioning. There is increasing evidence that unobtrusive monitoring of older adults' driving performance in a daily-life setting may allow us to detect subtle early changes…

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: 6 pages, 5 figures, IEEE HONET Conference

  44. arXiv:2311.09069  [pdf, other]

    cs.CL cs.AI

    How Well Do Large Language Models Truly Ground?

    Authors: Hyunji Lee, Sejune Joo, Chaeeun Kim, Joel Jang, Doyoung Kim, Kyoung-Woon On, Minjoon Seo

    Abstract: To reduce issues like hallucinations and lack of control in Large Language Models (LLMs), a common method is to generate responses by grounding on external contexts given as input, known as knowledge-augmented models. However, previous research often narrowly defines "grounding" as just having the correct answer, which does not ensure the reliability of the entire response. To overcome this, we pr…

    Submitted 29 June, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: published at NAACL 2024

  45. arXiv:2311.02122  [pdf, other]

    cs.CV

    Lost Your Style? Navigating with Semantic-Level Approach for Text-to-Outfit Retrieval

    Authors: Junkyu Jang, Eugene Hwang, Sung-Hyuk Park

    Abstract: Fashion stylists have historically bridged the gap between consumers' desires and perfect outfits, which involve intricate combinations of colors, patterns, and materials. Although recent advancements in fashion recommendation systems have made strides in outfit compatibility prediction and complementary item retrieval, these systems rely heavily on pre-selected customer choices. Therefore, we int…

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: 10 pages, accepted to WACV 2024

  46. arXiv:2310.20187  [pdf, other]

    cs.LG cs.AI

    Self-Supervised Pre-Training for Precipitation Post-Processor

    Authors: Sojung An, Junha Lee, Jiyeon Jang, Inchae Na, Wooyeon Park, Sujeong You

    Abstract: Obtaining a sufficient forecast lead time for local precipitation is essential in preventing hazardous weather events. Global warming-induced climate change increases the challenge of accurately predicting severe precipitation events, such as heavy rainfall. In this paper, we propose a deep learning-based precipitation post-processor for numerical weather prediction (NWP) models. The precipitation…

    Submitted 19 February, 2024; v1 submitted 31 October, 2023; originally announced October 2023.

    Comments: 7 pages, 3 figures, 1 table, accepted to NeurIPS 2023 Workshop on Tackling Climate Change with Machine Learning at [this http URL](https://www.climatechange.ai/papers/neurips2023/18)

  47. arXiv:2310.13420  [pdf, other]

    cs.CL

    Conversation Chronicles: Towards Diverse Temporal and Relational Dynamics in Multi-Session Conversations

    Authors: Jihyoung Jang, Minseong Boo, Hyounghun Kim

    Abstract: In the field of natural language processing, open-domain chatbots have emerged as an important research topic. However, a major limitation of existing open-domain chatbot research is its singular focus on short single-session dialogue, neglecting the potential need for understanding contextual information in multiple consecutive sessions that precede an ongoing dialogue. Among the elements that co…

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 (23 pages); Project website: https://conversation-chronicles.github.io

  48. arXiv:2310.11564  [pdf, other]

    cs.CL

    Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging

    Authors: Joel Jang, Seungone Kim, Bill Yuchen Lin, Yizhong Wang, Jack Hessel, Luke Zettlemoyer, Hannaneh Hajishirzi, Yejin Choi, Prithviraj Ammanabrolu

    Abstract: While Reinforcement Learning from Human Feedback (RLHF) aligns Large Language Models (LLMs) with general, aggregate human preferences, it is suboptimal for learning diverse, individual perspectives. In this work, we study the Reinforcement Learning from Personalized Human Feedback (RLPHF) problem, wherein LLMs are aligned to multiple (sometimes conflicting) preferences by modeling alignment as a Multi…

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: Preprint

  49. arXiv:2310.08491  [pdf, other]

    cs.CL cs.LG

    Prometheus: Inducing Fine-grained Evaluation Capability in Language Models

    Authors: Seungone Kim, Jamin Shin, Yejin Cho, Joel Jang, Shayne Longpre, Hwaran Lee, Sangdoo Yun, Seongjin Shin, Sungdong Kim, James Thorne, Minjoon Seo

    Abstract: Recently, using a powerful proprietary Large Language Model (LLM) (e.g., GPT-4) as an evaluator for long-form responses has become the de facto standard. However, for practitioners with large-scale evaluation tasks and custom criteria in consideration (e.g., child-readability), using proprietary LLMs as an evaluator is unreliable due to the closed-source nature, uncontrolled versioning, and prohib…

    Submitted 9 March, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: ICLR 2024

  50. arXiv:2308.14815  [pdf, other]

    cs.AI cs.LG cs.RO

    Distributionally Robust Statistical Verification with Imprecise Neural Networks

    Authors: Souradeep Dutta, Michele Caprio, Vivian Lin, Matthew Cleaveland, Kuk Jin Jang, Ivan Ruchkin, Oleg Sokolsky, Insup Lee

    Abstract: A particularly challenging problem in AI safety is providing guarantees on the behavior of high-dimensional autonomous systems. Verification approaches centered around reachability analysis fail to scale, and purely statistical approaches are constrained by the distributional assumptions about the sampling process. Instead, we pose a distributionally robust version of the statistical verification…

    Submitted 11 December, 2023; v1 submitted 28 August, 2023; originally announced August 2023.