Showing 1–21 of 21 results for author: Ong, K

  1. arXiv:2406.10996  [pdf, other]

    cs.CL

    THEANINE: Revisiting Memory Management in Long-term Conversations with Timeline-augmented Response Generation

    Authors: Seo Hyun Kim, Kai Tzu-iunn Ong, Taeyoon Kwon, Namyoung Kim, Keummin Ka, SeongHyeon Bae, Yohan Jo, Seung-won Hwang, Dongha Lee, Jinyoung Yeo

    Abstract: Large language models (LLMs) are capable of processing lengthy dialogue histories during prolonged interaction with users without additional memory modules; however, their responses tend to overlook or incorrectly recall information from the past. In this paper, we revisit memory-augmented response generation in the era of LLMs. While prior work focuses on getting rid of outdated memories, we argu…

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: Under Review

  2. arXiv:2406.03880  [pdf, other]

    cs.LG cs.AI

    Memorization in deep learning: A survey

    Authors: Jiaheng Wei, Yanjun Zhang, Leo Yu Zhang, Ming Ding, Chao Chen, Kok-Leong Ong, Jun Zhang, Yang Xiang

    Abstract: Deep Learning (DL) powered by Deep Neural Networks (DNNs) has revolutionized various domains, yet understanding the intricacies of DNN decision-making and learning processes remains a significant challenge. Recent investigations have uncovered an interesting memorization phenomenon in which DNNs tend to memorize specific details from examples rather than learning general patterns, affecting model…

    Submitted 6 June, 2024; originally announced June 2024.

  3. arXiv:2404.02575  [pdf, other]

    cs.CL

    Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models

    Authors: Hyungjoo Chae, Yeonghyeon Kim, Seungone Kim, Kai Tzu-iunn Ong, Beong-woo Kwak, Moohyeon Kim, Seonghwan Kim, Taeyoon Kwon, Jiwan Chung, Youngjae Yu, Jinyoung Yeo

    Abstract: Algorithmic reasoning refers to the ability to understand the complex patterns behind the problem and decompose them into a sequence of reasoning steps towards the solution. Such nature of algorithmic reasoning makes it a challenge for large language models (LLMs), even though they have demonstrated promising performance in other reasoning tasks. Within this context, some recent studies use progra…

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: 38 pages, 4 figures

  4. arXiv:2401.14215  [pdf, other]

    cs.CL cs.AI

    Commonsense-augmented Memory Construction and Management in Long-term Conversations via Context-aware Persona Refinement

    Authors: Hana Kim, Kai Tzu-iunn Ong, Seoyeon Kim, Dongha Lee, Jinyoung Yeo

    Abstract: Memorizing and utilizing speakers' personas is a common practice for response generation in long-term conversations. Yet, human-authored datasets often provide uninformative persona sentences that hinder response quality. This paper presents a novel framework that leverages commonsense-based persona expansion to address such issues in long-term conversation. While prior work focuses on not produci…

    Submitted 12 February, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: Accepted to EACL 2024

  5. arXiv:2401.09495   

    cs.CV

    IPR-NeRF: Ownership Verification meets Neural Radiance Field

    Authors: Win Kent Ong, Kam Woh Ng, Chee Seng Chan, Yi Zhe Song, Tao Xiang

    Abstract: Neural Radiance Field (NeRF) models have gained significant attention in the computer vision community in the recent past with state-of-the-art visual quality and produced impressive demonstrations. Since then, technopreneurs have sought to leverage NeRF models into a profitable business. Therefore, NeRF models make it worth the risk of plagiarizers illegally copying, re-distributing, or misusing…

    Submitted 22 January, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: Error on result tabulation of state of the art method which might cause misleading to readers

  6. arXiv:2312.08764  [pdf, other]

    cs.CV

    CattleEyeView: A Multi-task Top-down View Cattle Dataset for Smarter Precision Livestock Farming

    Authors: Kian Eng Ong, Sivaji Retta, Ramarajulu Srinivasan, Shawn Tan, Jun Liu

    Abstract: Cattle farming is one of the important and profitable agricultural industries. Employing intelligent automated precision livestock farming systems that can count animals, track the animals and their poses will raise productivity and significantly reduce the heavy burden on its already limited labor pool. To achieve such intelligent systems, a large cattle video dataset is essential in developing a…

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: Published at VCIP 2023. Dataset and code available at https://github.com/AnimalEyeQ/CattleEyeView

  7. arXiv:2312.07399  [pdf, other]

    cs.CL cs.AI

    Large Language Models are Clinical Reasoners: Reasoning-Aware Diagnosis Framework with Prompt-Generated Rationales

    Authors: Taeyoon Kwon, Kai Tzu-iunn Ong, Dongjin Kang, Seungjun Moon, Jeong Ryong Lee, Dosik Hwang, Yongsik Sim, Beomseok Sohn, Dongha Lee, Jinyoung Yeo

    Abstract: Machine reasoning has made great progress in recent years owing to large language models (LLMs). In the clinical domain, however, most NLP-driven projects mainly focus on clinical classification or reading comprehension, and under-explore clinical reasoning for disease diagnosis due to the expensive rationale annotation with clinicians. In this work, we present a "reasoning-aware" diagnosis framew…

    Submitted 10 May, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: Accepted to AAAI 2024

  8. arXiv:2311.07215  [pdf, other]

    cs.CL cs.SE

    Coffee: Boost Your Code LLMs by Fixing Bugs with Feedback

    Authors: Seungjun Moon, Hyungjoo Chae, Yongho Song, Taeyoon Kwon, Dongjin Kang, Kai Tzu-iunn Ong, Seung-won Hwang, Jinyoung Yeo

    Abstract: Code editing is an essential step towards reliable program synthesis to automatically correct critical errors generated from code LLMs. Recent studies have demonstrated that closed-source LLMs (i.e., ChatGPT and GPT-4) are capable of generating corrective feedback to edit erroneous inputs. However, it remains challenging for open-source code LLMs to generate feedback for code editing, since these…

    Submitted 23 February, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: Work in progress

  9. arXiv:2310.09343  [pdf, other]

    cs.CL cs.AI

    Dialogue Chain-of-Thought Distillation for Commonsense-aware Conversational Agents

    Authors: Hyungjoo Chae, Yongho Song, Kai Tzu-iunn Ong, Taeyoon Kwon, Minjin Kim, Youngjae Yu, Dongha Lee, Dongyeop Kang, Jinyoung Yeo

    Abstract: Human-like chatbots necessitate the use of commonsense reasoning in order to effectively comprehend and respond to implicit information present within conversations. Achieving such coherence and informativeness in responses, however, is a non-trivial task. Even for large language models (LLMs), the task of identifying and aggregating key evidence within a single hop presents a substantial challeng…

    Submitted 22 October, 2023; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: 25 pages, 8 figures, Accepted to EMNLP 2023

  10. arXiv:2309.07415  [pdf, other]

    cs.CR cs.AI

    Client-side Gradient Inversion Against Federated Learning from Poisoning

    Authors: Jiaheng Wei, Yanjun Zhang, Leo Yu Zhang, Chao Chen, Shirui Pan, Kok-Leong Ong, Jun Zhang, Yang Xiang

    Abstract: Federated Learning (FL) enables distributed participants (e.g., mobile devices) to train a global model without sharing data directly to a central server. Recent studies have revealed that FL is vulnerable to gradient inversion attack (GIA), which aims to reconstruct the original training samples and poses high risk against the privacy of clients in FL. However, most existing GIAs necessitate cont…

    Submitted 13 September, 2023; originally announced September 2023.

  11. arXiv:2303.02563  [pdf, other]

    cs.CL

    FinXABSA: Explainable Finance through Aspect-Based Sentiment Analysis

    Authors: Keane Ong, Wihan van der Heever, Ranjan Satapathy, Erik Cambria, Gianmarco Mengaldo

    Abstract: This paper presents a novel approach for explainability in financial analysis by deriving financially-explainable statistical relationships through aspect-based sentiment analysis, Pearson correlation, Granger causality & uncertainty coefficient. The proposed methodology involves constructing an aspect list from financial literature and applying aspect-based sentiment analysis on social media text…

    Submitted 14 October, 2023; v1 submitted 4 March, 2023; originally announced March 2023.

  12. arXiv:2303.01105  [pdf, other]

    eess.IV cs.CV cs.LG

    Evidence-empowered Transfer Learning for Alzheimer's Disease

    Authors: Kai Tzu-iunn Ong, Hana Kim, Minjin Kim, Jinseong Jang, Beomseok Sohn, Yoon Seong Choi, Dosik Hwang, Seong Jae Hwang, Jinyoung Yeo

    Abstract: Transfer learning has been widely utilized to mitigate the data scarcity problem in the field of Alzheimer's disease (AD). Conventional transfer learning relies on re-using models trained on AD-irrelevant tasks such as natural image classification. However, it often leads to negative transfer due to the discrepancy between the non-medical source and target medical domains. To address this, we pres…

    Submitted 17 April, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

    Comments: Accepted to IEEE International Symposium on Biomedical Imaging (ISBI) 2023. The authorship was changed from co-first authors to a single first author, which was authorized by the adviser/corresponding author Jinyoung Yeo (Apr 18th, 2023)

  13. arXiv:2204.08129  [pdf, other]

    cs.CV

    Animal Kingdom: A Large and Diverse Dataset for Animal Behavior Understanding

    Authors: Xun Long Ng, Kian Eng Ong, Qichen Zheng, Yun Ni, Si Yong Yeo, Jun Liu

    Abstract: Understanding animals' behaviors is significant for a wide range of applications. However, existing animal behavior datasets have limitations in multiple aspects, including limited numbers of animal classes, data samples and provided tasks, and also limited variations in environmental conditions and viewpoints. To address these limitations, we create a large and diverse dataset, Animal Kingdom, th…

    Submitted 3 June, 2022; v1 submitted 17 April, 2022; originally announced April 2022.

    Comments: Accepted by CVPR2022 (Oral). Dataset: https://sutdcv.github.io/Animal-Kingdom

  14. Bidirectional Representation Learning from Transformers using Multimodal Electronic Health Record Data to Predict Depression

    Authors: Yiwen Meng, William Speier, Michael K. Ong, Corey W. Arnold

    Abstract: Advancements in machine learning algorithms have had a beneficial impact on representation learning, classification, and prediction models built using electronic health record (EHR) data. Effort has been put both on increasing models' overall performance as well as improving their interpretability, particularly regarding the decision-making process. In this study, we present a temporal deep learni…

    Submitted 23 March, 2021; v1 submitted 26 September, 2020; originally announced September 2020.

    Comments: in IEEE Journal of Biomedical and Health Informatics (2021)

  15. arXiv:2007.03313  [pdf, other]

    cs.LG cs.AI stat.ML

    Predictive Maintenance for Edge-Based Sensor Networks: A Deep Reinforcement Learning Approach

    Authors: Kevin Shen Hoong Ong, Dusit Niyato, Chau Yuen

    Abstract: Failure of mission-critical equipment interrupts production and results in monetary loss. The risk of unplanned equipment downtime can be minimized through Predictive Maintenance of revenue generating assets to ensure optimal performance and safe operation of equipment. However, the increased sensorization of the equipment generates a data deluge, and existing machine-learning based predictive mod…

    Submitted 7 July, 2020; originally announced July 2020.

    Comments: 6 pages, 5 figures, accepted in IEEE WF-IoT 2020

  16. arXiv:2007.03165  [pdf, ps, other]

    cs.LG cs.NI eess.SP stat.ML

    Cognitive Radio Network Throughput Maximization with Deep Reinforcement Learning

    Authors: Kevin Shen Hoong Ong, Yang Zhang, Dusit Niyato

    Abstract: Radio Frequency powered Cognitive Radio Networks (RF-CRN) are likely to be the eyes and ears of upcoming modern networks such as Internet of Things (IoT), requiring increased decentralization and autonomous operation. To be considered autonomous, the RF-powered network entities need to make decisions locally to maximize the network throughput under the uncertainty of any network environment. Howev…

    Submitted 6 July, 2020; originally announced July 2020.

    Comments: 5 pages, 2 figures, accepted in IEEE VTC-Fall 2019

  17. Talent Flow Analytics in Online Professional Network

    Authors: Richard J. Oentaryo, Ee-Peng Lim, Xavier Jayaraj Siddarth Ashok, Philips Kokoh Prasetyo, Koon Han Ong, Zi Quan Lau

    Abstract: Analyzing job hopping behavior is important for understanding job preference and career progression of working individuals. When analyzed at the workforce population level, job hop analysis helps to gain insights of talent flow among different jobs and organizations. Traditionally, surveys are conducted on job seekers and employers to study job hop behavior. Beyond surveys, job hop behavior can al…

    Submitted 13 August, 2018; v1 submitted 23 July, 2018; originally announced July 2018.

    Comments: arXiv admin note: extension of arXiv:1711.05887, Data Science and Engineering, 2018

  18. arXiv:1506.05628  [pdf]

    cs.IR cs.SI

    Analyzing Web Behavior in Indoor Retail Spaces

    Authors: Yongli Ren, Martin Tomko, Flora Salim, Kevin Ong, Mark Sanderson

    Abstract: We analyze 18 million rows of Wi-Fi access logs collected over a one year period from over 120,000 anonymized users at an inner-city shopping mall. The anonymized dataset gathered from an opt-in system provides users' approximate physical location, as well as Web browsing and some search history. Such data provides a unique opportunity to analyze the interaction between people's behavior in physic…

    Submitted 18 June, 2015; originally announced June 2015.

    MSC Class: 68U35; ACM Class: H.3.3

  19. arXiv:1405.3631  [pdf, other]

    cs.DB

    The SQL++ Query Language: Configurable, Unifying and Semi-structured

    Authors: Kian Win Ong, Yannis Papakonstantinou, Romain Vernoux

    Abstract: NoSQL databases support semi-structured data, typically modeled as JSON. They also provide limited (but expanding) query languages. Their idiomatic, non-SQL language constructs, the many variations, and the lack of formal semantics inhibit deep understanding of the query languages, and also impede progress towards clean, powerful, declarative query languages. This paper specifies the syntax and…

    Submitted 14 December, 2015; v1 submitted 14 May, 2014; originally announced May 2014.

    Comments: 13 pages, [14166]

  20. arXiv:1308.0656  [pdf, other]

    cs.DB cs.SE

    Declarative Ajax Web Applications through SQL++ on a Unified Application State

    Authors: Yupeng Fu, Kian Win Ong, Yannis Papakonstantinou

    Abstract: Implementing even a conceptually simple web application requires an inordinate amount of time. FORWARD addresses three problems that reduce developer productivity: (a) Impedance mismatch across the multiple languages used at different tiers of the application architecture. (b) Distributed data access across the multiple data sources of the application (SQL database, user input of the browser page,…

    Submitted 16 June, 2014; v1 submitted 2 August, 2013; originally announced August 2013.

    Comments: Proceedings of the 14th International Symposium on Database Programming Languages (DBPL 2013), August 30, 2013, Riva del Garda, Trento, Italy

  21. arXiv:cs/0202001  [pdf, ps, other]

    cs.DB cs.AI

    The Deductive Database System LDL++

    Authors: Faiz Arni, KayLiang Ong, Shalom Tsur, Haixun Wang, Carlo Zaniolo

    Abstract: This paper describes the LDL++ system and the research advances that have enabled its design and development. We begin by discussing the new nonmonotonic and nondeterministic constructs that extend the functionality of the LDL++ language, while preserving its model-theoretic and fixpoint semantics. Then, we describe the execution model and the open architecture designed to support these new cons…

    Submitted 1 February, 2002; originally announced February 2002.

    ACM Class: D.3.2