Showing 1–50 of 531 results for author: Ng, S

  1. arXiv:2407.04981

    cs.CL cs.LG

    TRACE: TRansformer-based Attribution using Contrastive Embeddings in LLMs

    Authors: Cheng Wang, Xinyang Lu, See-Kiong Ng, Bryan Kian Hsiang Low

    Abstract: The rapid evolution of large language models (LLMs) represents a substantial leap forward in natural language understanding and generation. However, alongside these advancements come significant challenges related to the accountability and transparency of LLM responses. Reliable source attribution is essential to adhering to stringent legal and regulatory standards, including those set forth by th…

    Submitted 6 July, 2024; originally announced July 2024.

  2. arXiv:2407.03788

    cs.CV cs.CL

    Meta-optimized Angular Margin Contrastive Framework for Video-Language Representation Learning

    Authors: Thong Nguyen, Yi Bin, Xiaobao Wu, Xinshuai Dong, Zhiyuan Hu, Khoi Le, Cong-Duy Nguyen, See-Kiong Ng, Luu Anh Tuan

    Abstract: Data quality stands at the forefront of deciding the effectiveness of video-language representation learning. However, video-text pairs in previous data typically do not align perfectly with each other, which might lead to video-language representations that do not accurately reflect cross-modal semantics. Moreover, previous data also possess an uneven distribution of concepts, thereby hampering t…

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: Accepted to ECCV 2024

  3. arXiv:2406.17649

    cs.LG cs.CR

    Privacy Preserving Reinforcement Learning for Population Processes

    Authors: Samuel Yang-Zhao, Kee Siong Ng

    Abstract: We consider the problem of privacy protection in Reinforcement Learning (RL) algorithms that operate over population processes, a practical but understudied setting that includes, for example, the control of epidemics in large populations of dynamically interacting individuals. In this setting, the RL algorithm interacts with the population over $T$ time steps by receiving population-level statist…

    Submitted 25 June, 2024; originally announced June 2024.

  4. arXiv:2406.17294

    cs.CL

    Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models

    Authors: Wenhao Shi, Zhiqiang Hu, Yi Bin, Junhua Liu, Yang Yang, See-Kiong Ng, Lidong Bing, Roy Ka-Wei Lee

    Abstract: Large language models (LLMs) have demonstrated impressive reasoning capabilities, particularly in textual mathematical problem-solving. However, existing open-source image instruction fine-tuning datasets, containing limited question-answer pairs per image, do not fully exploit visual information to enhance the multimodal mathematical reasoning capabilities of Multimodal LLMs (MLLMs). To bridge th…

    Submitted 26 June, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

    Comments: 8 pages

  5. arXiv:2406.14507

    cs.LG cs.AI

    On Newton's Method to Unlearn Neural Networks

    Authors: Nhung Bui, Xinyang Lu, See-Kiong Ng, Bryan Kian Hsiang Low

    Abstract: Machine unlearning facilitates personal data ownership, including the "right to be forgotten". The proliferation of applications of neural networks (NNs) trained on users' personal data calls for the need to develop algorithms to unlearn an NN. Since retraining is costly, efficiency is often achieved through approximate unlearning which aims to unlearn a trained NN to be close to the retr…

    Submitted 20 June, 2024; originally announced June 2024.

  6. PetalView: Fine-grained Location and Orientation Extraction of Street-view Images via Cross-view Local Search with Supplementary Materials

    Authors: Wenmiao Hu, Yichen Zhang, Yuxuan Liang, Xianjing Han, Yifang Yin, Hannes Kruppa, See-Kiong Ng, Roger Zimmermann

    Abstract: Satellite-based street-view information extraction by cross-view matching refers to a task that extracts the location and orientation information of a given street-view image query by using one or multiple geo-referenced satellite images. Recent work has initiated a new research direction to find accurate information within a local area covered by one satellite image centered at a location prior (…

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: This paper has been accepted by ACM Multimedia 2023. This version contains additional supplementary materials

    Journal ref: Proceedings of the 31st ACM International Conference on Multimedia (2023) 56-66

  7. arXiv:2406.12639

    cs.CL cs.AI

    Ask-before-Plan: Proactive Language Agents for Real-World Planning

    Authors: Xuan Zhang, Yang Deng, Zifeng Ren, See-Kiong Ng, Tat-Seng Chua

    Abstract: The evolution of large language models (LLMs) has enhanced the planning capabilities of language agents in diverse real-world scenarios. Despite these advancements, the potential of LLM-powered agents to comprehend ambiguous user instructions for reasoning and decision-making is still under exploration. In this work, we introduce a new task, Proactive Agent Planning, which requires language agents…

    Submitted 18 June, 2024; originally announced June 2024.

  8. arXiv:2406.11886

    cs.LG cs.AI cs.CE q-fin.CP

    Financial Assets Dependency Prediction Utilizing Spatiotemporal Patterns

    Authors: Haoren Zhu, Pengfei Zhao, Wilfred Siu Hung NG, Dik Lun Lee

    Abstract: Financial assets exhibit complex dependency structures, which are crucial for investors to create diversified portfolios to mitigate risk in volatile financial markets. To explore the financial asset dependencies dynamics, we propose a novel approach that models the dependencies of assets as an Asset Dependency Matrix (ADM) and treats the ADM sequences as image sequences. This allows us to leverag…

    Submitted 13 June, 2024; originally announced June 2024.

  9. arXiv:2406.11232

    cs.SE cs.AI

    A Collaborative Data Analytics System with Recommender for Diverse Users

    Authors: Siu Lung Ng, Hirad Baradaran Rezaei, Fethi Rabhi

    Abstract: This paper presents the SLEGO (Software-Lego) system, a collaborative analytics platform that bridges the gap between experienced developers and novice users using a cloud-based platform with modular, reusable microservices. These microservices enable developers to share their analytical tools and workflows, while a simple graphical user interface (GUI) allows novice users to build comprehensive a…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 11 pages, 10 figures, 5 tables

    ACM Class: D.2.11; I.2.1

  10. arXiv:2406.09076

    cs.CL

    3M: Multi-modal Multi-task Multi-teacher Learning for Game Event Detection

    Authors: Thye Shan Ng, Feiqi Cao, Soyeon Caren Han

    Abstract: Esports has rapidly emerged as a global phenomenon with an ever-expanding audience via platforms, like YouTube. Due to the inherent complexity nature of the game, it is challenging for newcomers to comprehend what the event entails. The chaotic nature of online chat, the fast-paced speech of the game commentator, and the game-specific user interface further compound the difficulty for users in com…

    Submitted 13 June, 2024; originally announced June 2024.

  11. arXiv:2406.05615

    cs.CL

    Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data Perspectives

    Authors: Thong Nguyen, Yi Bin, Junbin Xiao, Leigang Qu, Yicong Li, Jay Zhangjie Wu, Cong-Duy Nguyen, See-Kiong Ng, Luu Anh Tuan

    Abstract: Humans use multiple senses to comprehend the environment. Vision and language are two of the most vital senses since they allow us to easily communicate our thoughts and perceive the world around us. There has been a lot of interest in creating video-language understanding systems with human-like senses since a video-language pair can mimic both our linguistic medium and visual environment with te…

    Submitted 1 July, 2024; v1 submitted 8 June, 2024; originally announced June 2024.

    Comments: Accepted at ACL 2024 (Findings)

  12. arXiv:2405.19723

    cs.CV cs.AI

    Encoding and Controlling Global Semantics for Long-form Video Question Answering

    Authors: Thong Thanh Nguyen, Zhiyuan Hu, Xiaobao Wu, Cong-Duy T Nguyen, See-Kiong Ng, Anh Tuan Luu

    Abstract: Seeking answers effectively for long videos is essential to build video question answering (videoQA) systems. Previous methods adaptively select frames and regions from long videos to save computations. However, this fails to reason over the whole sequence of video, leading to sub-optimal performance. To address this problem, we introduce a state space layer (SSL) into multi-modal Transformer to e…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Work in progress

  13. arXiv:2405.17457

    cs.CV cs.DC cs.LG

    Data-Free Federated Class Incremental Learning with Diffusion-Based Generative Memory

    Authors: Naibo Wang, Yuchen Deng, Wenjie Feng, Jianwei Yin, See-Kiong Ng

    Abstract: Federated Class Incremental Learning (FCIL) is a critical yet largely underexplored issue that deals with the dynamic incorporation of new classes within federated learning (FL). Existing methods often employ generative adversarial networks (GANs) to produce synthetic images to address privacy concerns in FL. However, GANs exhibit inherent instability and high sensitivity, compromising the effecti…

    Submitted 22 May, 2024; originally announced May 2024.

  14. arXiv:2405.17346

    cs.LG cs.AI

    Prompt Optimization with Human Feedback

    Authors: Xiaoqiang Lin, Zhongxiang Dai, Arun Verma, See-Kiong Ng, Patrick Jaillet, Bryan Kian Hsiang Low

    Abstract: Large language models (LLMs) have demonstrated remarkable performances in various tasks. However, the performance of LLMs heavily depends on the input prompt, which has given rise to a number of recent works on prompt optimization. However, previous works often require the availability of a numeric score to assess the quality of every prompt. Unfortunately, when a human user interacts with a black…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: Preprint, 18 pages

  15. arXiv:2405.16122

    cs.AI cs.CL cs.LG stat.ML

    Prompt Optimization with EASE? Efficient Ordering-aware Automated Selection of Exemplars

    Authors: Zhaoxuan Wu, Xiaoqiang Lin, Zhongxiang Dai, Wenyang Hu, Yao Shu, See-Kiong Ng, Patrick Jaillet, Bryan Kian Hsiang Low

    Abstract: Large language models (LLMs) have shown impressive capabilities in real-world applications. The capability of in-context learning (ICL) allows us to adapt an LLM to downstream tasks by including input-label exemplars in the prompt without model fine-tuning. However, the quality of these exemplars in the prompt greatly impacts performance, highlighting the need for an effective automated exemplar s…

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 23 pages, 1 figure, 23 tables

  16. arXiv:2405.15303

    cs.LG

    Trajectory-Based Multi-Objective Hyperparameter Optimization for Model Retraining

    Authors: Wenyu Wang, Zheyi Fan, Szu Hui Ng

    Abstract: Training machine learning models inherently involves a resource-intensive and noisy iterative learning procedure that allows epoch-wise monitoring of the model performance. However, in multi-objective hyperparameter optimization scenarios, the insights gained from the iterative learning procedure typically remain underutilized. We notice that tracking the model performance across multiple epochs u…

    Submitted 24 May, 2024; originally announced May 2024.

  17. arXiv:2405.15285

    cs.LG math.OC

    Minimizing UCB: a Better Local Search Strategy in Local Bayesian Optimization

    Authors: Zheyi Fan, Wenyu Wang, Szu Hui Ng, Qingpei Hu

    Abstract: Local Bayesian optimization is a promising practical approach to solve the high dimensional black-box function optimization problem. Among them is the approximated gradient class of methods, which implements a strategy similar to gradient descent. These methods have achieved good experimental results and theoretical guarantees. However, given the distributional properties of the Gaussian processes…

    Submitted 24 May, 2024; originally announced May 2024.

  18. arXiv:2405.07314

    cs.IR

    Learnable Tokenizer for LLM-based Generative Recommendation

    Authors: Wenjie Wang, Honghui Bao, Xinyu Lin, Jizhi Zhang, Yongqi Li, Fuli Feng, See-Kiong Ng, Tat-Seng Chua

    Abstract: Harnessing Large Language Models (LLMs) for generative recommendation has garnered significant attention due to LLMs' powerful capacities such as rich world knowledge and reasoning. However, a critical challenge lies in transforming recommendation data into the language space of LLMs through effective item tokenization. Existing approaches, such as ID identifiers, textual identifiers, and codebook…

    Submitted 12 May, 2024; originally announced May 2024.

  19. arXiv:2404.16994

    cs.CV

    PLLaVA : Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning

    Authors: Lin Xu, Yilin Zhao, Daquan Zhou, Zhijie Lin, See Kiong Ng, Jiashi Feng

    Abstract: Vision-language pre-training has significantly elevated performance across a wide range of image-language applications. Yet, the pre-training process for video-related tasks demands exceptionally large computational and data resources, which hinders the progress of video-language models. This paper investigates a straight-forward, highly efficient, and resource-light approach to adapting an existi…

    Submitted 29 April, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

  20. arXiv:2404.12384

    cond-mat.quant-gas quant-ph

    Decoupled sound and amplitude modes in trapped dipolar supersolids

    Authors: Jens Hertkorn, Philipp Stürmer, Koushik Mukherjee, Kevin S. H. Ng, Paul Uerlings, Fiona Hellstern, Lucas Lavoine, Stephanie Reimann, Tilman Pfau, Ralf Klemt

    Abstract: We theoretically investigate elementary excitations of dipolar quantum gases across the superfluid to supersolid phase transition in a toroidal trap. We show how decoupled first sound, second sound, and Higgs modes emerge by following their origin from superfluid modes across the transition. The structure of these excitations reveals the interplay between crystal and superfluid oscillations. Our r…

    Submitted 18 April, 2024; originally announced April 2024.

  21. arXiv:2404.12130

    cs.LG cs.CV cs.DC

    One-Shot Sequential Federated Learning for Non-IID Data by Enhancing Local Model Diversity

    Authors: Naibo Wang, Yuchen Deng, Wenjie Feng, Shichen Fan, Jianwei Yin, See-Kiong Ng

    Abstract: Traditional federated learning mainly focuses on parallel settings (PFL), which can suffer significant communication and computation costs. In contrast, one-shot and sequential federated learning (SFL) have emerged as innovative paradigms to alleviate these costs. However, the issue of non-IID (Independent and Identically Distributed) data persists as a significant challenge in one-shot and SFL se…

    Submitted 18 April, 2024; originally announced April 2024.

  22. arXiv:2404.07662

    cs.LG cs.AI physics.comp-ph physics.data-an stat.ML

    PINNACLE: PINN Adaptive ColLocation and Experimental points selection

    Authors: Gregory Kang Ruey Lau, Apivich Hemachandra, See-Kiong Ng, Bryan Kian Hsiang Low

    Abstract: Physics-Informed Neural Networks (PINNs), which incorporate PDEs as soft constraints, train with a composite loss function that contains multiple training point types: different types of collocation points chosen during training to enforce each PDE and initial/boundary conditions, and experimental points which are usually costly to obtain via experiments or simulations. Training PINNs using this l…

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: Accepted to 12th International Conference on Learning Representations (ICLR 2024), 36 pages

  23. arXiv:2403.18423

    cs.CL cs.LG

    SemRoDe: Macro Adversarial Training to Learn Representations That are Robust to Word-Level Attacks

    Authors: Brian Formento, Wenjie Feng, Chuan Sheng Foo, Luu Anh Tuan, See-Kiong Ng

    Abstract: Language models (LMs) are indispensable tools for natural language processing tasks, but their vulnerability to adversarial attacks remains a concern. While current research has explored adversarial training techniques, their improvements to defend against word-level attacks have been limited. In this work, we propose a novel approach called Semantic Robust Defence (SemRoDe), a Macro Adversarial T…

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: Published in NAACL 2024 (Main Track)

  24. arXiv:2403.16180

    quant-ph

    The Road to Near-Capacity CV-QKD Reconciliation: An FEC-Agnostic Design

    Authors: Xin Liu, Chao Xu, Yasir Noori, Soon Xin Ng, Lajos Hanzo

    Abstract: New near-capacity continuous-variable quantum key distribution (CV-QKD) reconciliation schemes are proposed, where both the authenticated classical channel (ClC) and the quantum channel (QuC) for QKD are protected by separate forward error correction (FEC) coding schemes. More explicitly, a new codeword-based - rather than syndrome-based - QKD reconciliation scheme is proposed, where Alice sends a…

    Submitted 24 March, 2024; originally announced March 2024.

  25. arXiv:2403.14237

    physics.optics physics.app-ph

    Bessel-beam direct-write of the etch-mask in a nano-film of alumina for high-efficiency Si solar cells

    Authors: Tomas Katkus, Soon Hock Ng, Haoran Mu, Nguyen Hoai An Le, Dominyka Stonyte, Zahra Khajehsaeidimahabadi, Gediminas Seniutinas, Justas Baltrukonis, Orestas Ulcinas, Mindaugas Mikutis, Vytautas Sabonis, Yoshiaki Nishijima, Michael Rienacker, Jan Krugener, Robby Peibst, Sajeev John, Saulius Juodkazis

    Abstract: Large surface area applications such as high-efficiency > 26% solar cells require surface patterning with 1-10 micrometers periodic patterns at high fidelity over 1-10 cm^2 areas (before up scaling to 1 m^2) to perform at, or exceed, the Lambertian (ray optics) limit of light trapping. Here we show a pathway to high-resolution sub-1 micrometer etch mask patterning by ablation using direct femtosec…

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 12 pages, 7 figures and 8 figures in appendix

  26. arXiv:2403.08130

    econ.EM

    Imputation of Counterfactual Outcomes when the Errors are Predictable

    Authors: Silvia Goncalves, Serena Ng

    Abstract: A crucial input into causal inference is the imputed counterfactual outcome. Imputation error can arise because of sampling uncertainty from estimating the prediction model using the untreated observations, or from out-of-sample information not captured by the model. While the literature has focused on sampling uncertainty, it vanishes with the sample size. Often overlooked is the possibility that…

    Submitted 17 May, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  27. arXiv:2403.04656

    cs.CL

    Chain of Thought Explanation for Dialogue State Tracking

    Authors: Lin Xu, Ningxin Peng, Daquan Zhou, See-Kiong Ng, Jinlan Fu

    Abstract: Dialogue state tracking (DST) aims to record user queries and goals during a conversational interaction achieved by maintaining a predefined set of slots and their corresponding values. Current approaches decide slot values opaquely, while humans usually adopt a more deliberate approach by collecting information from relevant dialogue turns and then reasoning the appropriate values. In this work,…

    Submitted 9 March, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

  28. arXiv:2403.03004

    astro-ph.CO gr-qc hep-ph

    Ultralight vector dark matter search using data from the KAGRA O3GK run

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, H. Abe, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, et al. (1778 additional authors not shown)

    Abstract: Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we prese…

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 20 pages, 5 figures

    Report number: LIGO-P2300250

  29. arXiv:2403.02993

    cs.AI

    Localized Zeroth-Order Prompt Optimization

    Authors: Wenyang Hu, Yao Shu, Zongmin Yu, Zhaoxuan Wu, Xiangqiang Lin, Zhongxiang Dai, See-Kiong Ng, Bryan Kian Hsiang Low

    Abstract: The efficacy of large language models (LLMs) in understanding and generating natural language has aroused a wide interest in developing prompt-based methods to harness the power of black-box LLMs. Existing methodologies usually prioritize a global optimization for finding the global optimum, which however will perform poorly in certain tasks. This thus motivates us to re-think the necessity of fin…

    Submitted 5 March, 2024; originally announced March 2024.

  30. arXiv:2403.02246

    cs.CL

    PHAnToM: Personality Has An Effect on Theory-of-Mind Reasoning in Large Language Models

    Authors: Fiona Anting Tan, Gerard Christopher Yeo, Fanyou Wu, Weijie Xu, Vinija Jain, Aman Chadha, Kokil Jaidka, Yang Liu, See-Kiong Ng

    Abstract: Recent advances in large language models (LLMs) demonstrate that their capabilities are comparable, or even superior, to humans in many tasks in natural language processing. Despite this progress, LLMs are still inadequate at social-cognitive reasoning, which humans are naturally good at. Drawing inspiration from psychological research on the links between certain personality traits and Theory-of-…

    Submitted 18 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  31. arXiv:2403.02061

    astro-ph.HE astro-ph.GA

    X-ray polarization measurement of the gold standard of radio-quiet active galactic nuclei : NGC 1068

    Authors: F. Marin, A. Marinucci, M. Laurenti, D. E. Kim, T. Barnouin, A. Di Marco, F. Ursini, S. Bianchi, S. Ravi, H. L. Marshall, G. Matt, C. -T. Chen, V. E. Gianolli, A. Ingram, W. P. Maksym, C. Panagiotou, J. Podgorny, S. Puccetti, A. Ratheesh, F. Tombesi, I. Agudo, L. A. Antonelli, M. Bachetti, L. Baldini, W. Baumgartner, et al. (80 additional authors not shown)

    Abstract: We used the Imaging X-ray Polarimetry Explorer (IXPE) satellite to measure, for the first time, the 2-8 keV polarization of NGC 1068. We pointed IXPE for a net exposure time of 1.15 Ms on the target, in addition to two ~ 10 ks each Chandra snapshots in order to account for the potential impact of several ultraluminous X-ray source (ULXs) within IXPE's field-of-view. We measured a 2 - 8 keV polariz…

    Submitted 13 May, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: 9 pages, 5 figures, 2 tables, submitted to A&A

    MSC Class: 85-06

    ACM Class: J.2.3; J.2.9

  32. arXiv:2403.01848

    cs.CL

    CET2: Modelling Topic Transitions for Coherent and Engaging Knowledge-Grounded Conversations

    Authors: Lin Xu, Qixian Zhou, Jinlan Fu, See-Kiong Ng

    Abstract: Knowledge-grounded dialogue systems aim to generate coherent and engaging responses based on the dialogue contexts and selected external knowledge. Previous knowledge selection methods tend to rely too heavily on the dialogue contexts or over-emphasize the new information in the selected knowledge, resulting in the selection of repetitious or incongruous knowledge and further generating repetitive…

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: Accepted by TASLP

  33. arXiv:2402.15062

    cs.CL cs.LG

    Gotcha! Don't trick me with unanswerable questions! Self-aligning Large Language Models for Responding to Unknown Questions

    Authors: Yang Deng, Yong Zhao, Moxin Li, See-Kiong Ng, Tat-Seng Chua

    Abstract: Despite the remarkable abilities of Large Language Models (LLMs) to answer questions, they often display a considerable level of overconfidence even when the question does not have a definitive answer. To avoid providing hallucinated answers to these unknown questions, existing studies typically investigate approaches to refusing to answer these questions. In this work, we propose a novel and scal…

    Submitted 22 February, 2024; originally announced February 2024.

  34. arXiv:2402.15057

    cs.CL cs.AI

    On the Multi-turn Instruction Following for Conversational Web Agents

    Authors: Yang Deng, Xuan Zhang, Wenxuan Zhang, Yifei Yuan, See-Kiong Ng, Tat-Seng Chua

    Abstract: Web agents powered by Large Language Models (LLMs) have demonstrated remarkable abilities in planning and executing multi-step interactions within complex web-based environments, fulfilling a wide range of web navigation tasks. Despite these advancements, the potential for LLM-powered agents to effectively engage with sequential user instructions in real-world scenarios has not been fully explored…

    Submitted 22 February, 2024; originally announced February 2024.

  35. arXiv:2402.14310

    cs.CL

    Hint-before-Solving Prompting: Guiding LLMs to Effectively Utilize Encoded Knowledge

    Authors: Jinlan Fu, Shenzhen Huangfu, Hang Yan, See-Kiong Ng, Xipeng Qiu

    Abstract: Large Language Models (LLMs) have recently showcased remarkable generalizability in various domains. Despite their extensive knowledge, LLMs still face challenges in efficiently utilizing encoded knowledge to develop accurate and logical reasoning processes. To mitigate this problem, we introduced Hint-before-Solving Prompting (HSP), which guides the model to generate hints (e.g., specific knowled…

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: 18 pages

  36. arXiv:2402.12761

    cs.LG cs.CR

    FGAD: Self-boosted Knowledge Distillation for An Effective Federated Graph Anomaly Detection Framework

    Authors: Jinyu Cai, Yunhe Zhang, Zhoumin Lu, Wenzhong Guo, See-kiong Ng

    Abstract: Graph anomaly detection (GAD) aims to identify anomalous graphs that significantly deviate from other ones, which has raised growing attention due to the broad existence and complexity of graph-structured data in many real-world scenarios. However, existing GAD methods usually execute with centralized training, which may lead to privacy leakage risk in some sensitive cases, thereby impeding collab…

    Submitted 20 February, 2024; originally announced February 2024.

  37. arXiv:2402.09959

    cs.IR

    LLM-based Federated Recommendation

    Authors: Jujia Zhao, Wenjie Wang, Chen Xu, Zhaochun Ren, See-Kiong Ng, Tat-Seng Chua

    Abstract: Large Language Models (LLMs), with their advanced contextual understanding abilities, have demonstrated considerable potential in enhancing recommendation systems via fine-tuning methods. However, fine-tuning requires users' behavior data, which poses considerable privacy risks due to the incorporation of sensitive user information. The unintended disclosure of such data could infringe upon data p…

    Submitted 16 February, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

  38. arXiv:2402.07844

    cs.SE cs.CL

    Mercury: A Code Efficiency Benchmark for Code Large Language Models

    Authors: Mingzhe Du, Anh Tuan Luu, Bin Ji, Qian Liu, See-Kiong Ng

    Abstract: Amidst the recent strides in evaluating Large Language Models for Code (Code LLMs), existing benchmarks have mainly focused on the functional correctness of generated code, neglecting the importance of their computational efficiency. To fill the gap, we present Mercury, the first code efficiency benchmark for Code LLMs. It comprises 1,889 Python tasks, each accompanied by adequate solutions that s…

    Submitted 11 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

  39. arXiv:2402.07577

    cs.CL

    Topic Modeling as Multi-Objective Contrastive Optimization

    Authors: Thong Nguyen, Xiaobao Wu, Xinshuai Dong, Cong-Duy T Nguyen, See-Kiong Ng, Anh Tuan Luu

    Abstract: Recent representation learning approaches enhance neural topic models by optimizing the weighted linear combination of the evidence lower bound (ELBO) of the log-likelihood and the contrastive learning objective that contrasts pairs of input documents. However, document-level contrastive learning might capture low-level mutual information, such as word ratio, which disturbs topic modeling. Moreove…

    Submitted 9 March, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: Accepted at ICLR 2024 (poster)

  40. arXiv:2402.06642

    q-fin.ST cs.LG

    From GARCH to Neural Network for Volatility Forecast

    Authors: Pengfei Zhao, Haoren Zhu, Wilfred Siu Hung NG, Dik Lun Lee

    Abstract: Volatility, as a measure of uncertainty, plays a crucial role in numerous financial activities such as risk management. The Econometrics and Machine Learning communities have developed two distinct approaches for financial volatility forecasting: the stochastic approach and the neural network (NN) approach. Despite their individual strengths, these methodologies have conventionally evolved in sepa…

    Submitted 29 January, 2024; originally announced February 2024.

    Comments: Accepted by AAAI'24

  41. arXiv:2402.03271

    cs.CL cs.AI cs.LG

    Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models

    Authors: Zhiyuan Hu, Chumin Liu, Xidong Feng, Yilun Zhao, See-Kiong Ng, Anh Tuan Luu, Junxian He, Pang Wei Koh, Bryan Hooi

    Abstract: In the face of uncertainty, the ability to *seek information* is of fundamental importance. In many practical applications, such as medical diagnosis and troubleshooting, the information needed to solve the task is not initially given and has to be actively sought by asking follow-up questions (for example, a doctor asking a patient for more details about their symptoms). In this work, we introduc…

    Submitted 30 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: Update Results

  42. arXiv:2312.16864

    cs.CL

    OmniDialog: An Omnipotent Pre-training Model for Task-Oriented Dialogue System

    Authors: Mingtao Yang, See-Kiong Ng, Jinlan Fu

    Abstract: Pre-trained conversation models (PCMs) have demonstrated remarkable results in task-oriented dialogue (TOD) systems. Many PCMs focus predominantly on dialogue management tasks like dialogue state tracking, dialogue generation tasks like response generation, or both. However, the existing PCMs seldom consider dialogue comprehension tasks, such as dialogue question answering and summarization tasks.…

    Submitted 28 December, 2023; originally announced December 2023.

    Comments: 9 pages

  43. arXiv:2312.16184  [pdf, other]

    cs.AI cs.LG

    Dynamic Knowledge Injection for AIXI Agents

    Authors: Samuel Yang-Zhao, Kee Siong Ng, Marcus Hutter

    Abstract: Prior approximations of AIXI, a Bayesian optimality notion for general reinforcement learning, can only approximate AIXI's Bayesian environment model using an a-priori defined set of models. This is a fundamental source of epistemic uncertainty for the agent in settings where the existence of systematic bias in the predefined model class cannot be resolved by simply collecting more data from the e…

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: 16 pages, 2 figures, extended length version of paper to be published in AAAI2024

  44. arXiv:2312.08683  [pdf, other]

    math.OA

    A twist over a minimal étale groupoid that is topologically nontrivial over the interior of the isotropy

    Authors: Becky Armstrong, Abraham C. S. Ng, Aidan Sims, Yumiao Zhou

    Abstract: We present an example of a twist over a minimal Hausdorff étale groupoid such that the restriction of the twist to the interior of the isotropy is not topologically trivial; that is, the restricted twist is not induced by a continuous 2-cocycle.

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: 14 pages

    MSC Class: 18B40 (primary); 22A22 (secondary)

  45. arXiv:2312.06950  [pdf, other]

    cs.CV cs.CL

    READ-PVLA: Recurrent Adapter with Partial Video-Language Alignment for Parameter-Efficient Transfer Learning in Low-Resource Video-Language Modeling

    Authors: Thong Nguyen, Xiaobao Wu, Xinshuai Dong, Khoi Le, Zhiyuan Hu, Cong-Duy Nguyen, See-Kiong Ng, Luu Anh Tuan

    Abstract: Fully fine-tuning pretrained large-scale transformer models has become a popular paradigm for video-language modeling tasks, such as temporal language grounding and video-language summarization. With a growing number of tasks and limited training data, such full fine-tuning approach leads to costly model storage and unstable training. To overcome these shortcomings, we introduce lightweight adapte…

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: Accepted at AAAI 2024

  46. arXiv:2312.02549  [pdf, other]

    cs.CV cs.CL

    DemaFormer: Damped Exponential Moving Average Transformer with Energy-Based Modeling for Temporal Language Grounding

    Authors: Thong Nguyen, Xiaobao Wu, Xinshuai Dong, Cong-Duy Nguyen, See-Kiong Ng, Luu Anh Tuan

    Abstract: Temporal Language Grounding seeks to localize video moments that semantically correspond to a natural language query. Recent advances employ the attention mechanism to learn the relations between video moments and the text query. However, naive attention might not be able to appropriately capture such relations, resulting in ineffective distributions where target video moments are difficult to sep…

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: Accepted at EMNLP 2023 (Findings)

  47. arXiv:2311.17071  [pdf, other]

    astro-ph.GA

    Globular Clusters Contribute to the Nuclear Star Cluster and Galaxy Center Gamma-Ray Excess, Moderated by Galaxy Assembly History

    Authors: Yuan Gao, Hui Li, Xiaojia Zhang, Meng Su, Stephen Chi Yung Ng

    Abstract: Two unresolved questions at galaxy centers, namely the formation of the nuclear star cluster (NSC) and the origin of the gamma-ray excess in the Milky Way (MW) and Andromeda (M31), are both related to the formation and evolution of globular clusters (GCs). They migrate towards the galaxy center due to dynamical friction, and get tidally disrupted to release the stellar mass content including milli…

    Submitted 26 December, 2023; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: 11 pages, 18 figures. Accepted for publication in MNRAS. Comments welcome!

  48. arXiv:2311.15636  [pdf]

    physics.ins-det physics.med-ph

    Liquid-shaped microlens for scalable production of ultrahigh-resolution OCT microendoscope

    Authors: Chao Xu, Xin Guan, Syeda Aimen Abbasi, Neng Xia, To Ngai, Li Zhang, Ho-Pui Ho, Sze Hang Calvin Ng, Wu Yuan

    Abstract: Endoscopic optical coherence tomography (OCT) is a valuable tool for providing diagnostic images of internal organs and guiding interventions in real time. Miniaturized OCT endoscopes are essential for imaging small and convoluted luminal organs while minimizing invasiveness. However, current methods for fabricating miniature fiber probes have limited ability to correct optical aberrations, leadin…

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: 42 pages, 7 figures in the main text

    MSC Class: 78-05

  49. arXiv:2311.13974  [pdf, other]

    quant-ph

    The Evolution of Quantum Secure Direct Communication: On the Road to the Qinternet

    Authors: Dong Pan, Gui-Lu Long, Liuguo Yin, Yu-Bo Sheng, Dong Ruan, Soon Xin Ng, Jianhua Lu, Lajos Hanzo

    Abstract: Communication security has to evolve to a higher plane in the face of the threat from the massive computing power of the emerging quantum computers. Quantum secure direct communication (QSDC) constitutes a promising branch of quantum communication, which is provably secure and overcomes the threat of quantum computing, whilst conveying secret messages directly via the quantum channel. In this surv…

    Submitted 23 November, 2023; originally announced November 2023.

  50. arXiv:2311.12042  [pdf, other]

    physics.app-ph cond-mat.mes-hall cs.ET quant-ph

    Atomic Defect-Aware Physical Design of Silicon Dangling Bond Logic on the H-Si(100)2x1 Surface

    Authors: Marcel Walter, Jeremiah Croshaw, Samuel Sze Hang Ng, Konrad Walus, Robert Wolkow, Robert Wille

    Abstract: Although fabrication capabilities of Silicon Dangling Bonds have rapidly advanced from manual labor-driven laboratory work to automated manufacturing in just recent years, sub-nanometer substrate defects still pose a hindrance to production due to the need for atomic precision. In essence, unpassivated or missing surface atoms, contaminants, and structural deformations disturb the fabricated logic…

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: 7 pages, 5 figures