Showing 1–50 of 56 results for author: Foo, C

  1. arXiv:2407.04411  [pdf, other]

    cs.CR cs.AI cs.CL

    Waterfall: Framework for Robust and Scalable Text Watermarking

    Authors: Gregory Kang Ruey Lau, Xinyuan Niu, Hieu Dao, Jiangwei Chen, Chuan-Sheng Foo, Bryan Kian Hsiang Low

    Abstract: Protecting intellectual property (IP) of text such as articles and code is increasingly important, especially as sophisticated attacks become possible, such as paraphrasing by large language models (LLMs) or even unauthorized training of LLMs on copyrighted text to infringe such IP. However, existing text watermarking methods are not robust enough against such attacks nor scalable to millions of u…

    Submitted 5 July, 2024; originally announced July 2024.

  2. arXiv:2406.04606  [pdf, other]

    cs.LG cs.AI

    Helpful or Harmful Data? Fine-tuning-free Shapley Attribution for Explaining Language Model Predictions

    Authors: Jingtan Wang, Xiaoqiang Lin, Rui Qiao, Chuan-Sheng Foo, Bryan Kian Hsiang Low

    Abstract: The increasing complexity of foundational models underscores the necessity for explainability, particularly for fine-tuning, the most widely used training method for adapting models to downstream tasks. Instance attribution, one type of explanation, attributes the model prediction to each training example by an instance score. However, the robustness of instance scores, specifically towards datase…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Accepted to ICML 2024

  3. arXiv:2406.02635  [pdf, other]

    cs.LG cs.AI

    Evidentially Calibrated Source-Free Time-Series Domain Adaptation with Temporal Imputation

    Authors: Mohamed Ragab, Peiliang Gong, Emadeldeen Eldele, Wenyu Zhang, Min Wu, Chuan-Sheng Foo, Daoqiang Zhang, Xiaoli Li, Zhenghua Chen

    Abstract: Source-free domain adaptation (SFDA) aims to adapt a model pre-trained on a labeled source domain to an unlabeled target domain without access to source data, preserving the source domain's privacy. While SFDA is prevalent in computer vision, it remains largely unexplored in time series analysis. Existing SFDA methods, designed for visual data, struggle to capture the inherent temporal dynamics of…

    Submitted 12 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  4. arXiv:2405.02954  [pdf, other]

    cs.CV cs.LG

    Source-Free Domain Adaptation Guided by Vision and Vision-Language Pre-Training

    Authors: Wenyu Zhang, Li Shen, Chuan-Sheng Foo

    Abstract: Source-free domain adaptation (SFDA) aims to adapt a source model trained on a fully-labeled source domain to a related but unlabeled target domain. While the source model is a key avenue for acquiring target pseudolabels, the generated pseudolabels may exhibit source bias. In the conventional SFDA pipeline, a large data (e.g. ImageNet) pre-trained feature extractor is used to initialize the sourc…

    Submitted 5 May, 2024; originally announced May 2024.

    Comments: Extension of ICCV paper arXiv:2212.07585, submitted to IJCV

  5. arXiv:2404.11151  [pdf, other]

    cs.CV

    REACTO: Reconstructing Articulated Objects from a Single Video

    Authors: Chaoyue Song, Jiacheng Wei, Chuan-Sheng Foo, Guosheng Lin, Fayao Liu

    Abstract: In this paper, we address the challenge of reconstructing general articulated 3D objects from a single video. Existing works employing dynamic neural radiance fields have advanced the modeling of articulated objects like humans and animals from videos, but face challenges with piece-wise rigid general articulated objects due to limitations in their deformation models. To tackle this, we propose Qu…

    Submitted 17 April, 2024; originally announced April 2024.

  6. arXiv:2403.18423  [pdf, other]

    cs.CL cs.LG

    SemRoDe: Macro Adversarial Training to Learn Representations That are Robust to Word-Level Attacks

    Authors: Brian Formento, Wenjie Feng, Chuan Sheng Foo, Luu Anh Tuan, See-Kiong Ng

    Abstract: Language models (LMs) are indispensable tools for natural language processing tasks, but their vulnerability to adversarial attacks remains a concern. While current research has explored adversarial training techniques, their improvements to defend against word-level attacks have been limited. In this work, we propose a novel approach called Semantic Robust Defence (SemRoDe), a Macro Adversarial T…

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: Published in NAACL 2024 (Main Track)

  7. arXiv:2403.11234  [pdf, other]

    cs.CV

    Universal Semi-Supervised Domain Adaptation by Mitigating Common-Class Bias

    Authors: Wenyu Zhang, Qingmu Liu, Felix Ong Wei Cong, Mohamed Ragab, Chuan-Sheng Foo

    Abstract: Domain adaptation is a critical task in machine learning that aims to improve model performance on a target domain by leveraging knowledge from a related source domain. In this work, we introduce Universal Semi-Supervised Domain Adaptation (UniSSDA), a practical yet challenging setting where the target domain is partially labeled, and the source and target label space may not strictly match. UniSS…

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024

  8. arXiv:2403.09140  [pdf, other]

    cs.CV

    Sculpt3D: Multi-View Consistent Text-to-3D Generation with Sparse 3D Prior

    Authors: Cheng Chen, Xiaofeng Yang, Fan Yang, Chengzeng Feng, Zhoujie Fu, Chuan-Sheng Foo, Guosheng Lin, Fayao Liu

    Abstract: Recent works on text-to-3D generation show that using only 2D diffusion supervision for 3D generation tends to produce results with inconsistent appearances (e.g., faces on the back view) and inaccurate shapes (e.g., animals with extra legs). Existing methods mainly address this issue by retraining diffusion models with images rendered from 3D data to ensure multi-view consistency while struggling…

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024. Project Page: https://stellarcheng.github.io/Sculpt3D/

  9. arXiv:2402.19197  [pdf, other]

    cs.CV cs.AI cs.LG

    Fine Structure-Aware Sampling: A New Sampling Training Scheme for Pixel-Aligned Implicit Models in Single-View Human Reconstruction

    Authors: Kennard Yanting Chan, Fayao Liu, Guosheng Lin, Chuan Sheng Foo, Weisi Lin

    Abstract: Pixel-aligned implicit models, such as PIFu, PIFuHD, and ICON, are used for single-view clothed human reconstruction. These models need to be trained using a sampling training scheme. Existing sampling training schemes either fail to capture thin surfaces (e.g. ears, fingers) or cause noisy artefacts in reconstructed meshes. To address these problems, we introduce Fine Structure-Aware Sampling (F…

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: Accepted in Proceedings of the AAAI Conference on Artificial Intelligence, 2024 (AAAI 2024)

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence, 2024, pp. 964-971

  10. arXiv:2402.18998  [pdf, other]

    cs.CV

    COFT-AD: COntrastive Fine-Tuning for Few-Shot Anomaly Detection

    Authors: Jingyi Liao, Xun Xu, Manh Cuong Nguyen, Adam Goodge, Chuan Sheng Foo

    Abstract: Existing approaches towards anomaly detection (AD) often rely on a substantial amount of anomaly-free data to train representation and density models. However, large anomaly-free datasets may not always be available before the inference stage, in which case an anomaly detection model must be trained with only a handful of normal samples, a.k.a. few-shot anomaly detection (FSAD). In this paper, we…

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: IEEE Transactions on Image Processing

  11. arXiv:2310.00646  [pdf, other]

    cs.LG cs.AI stat.ML

    WASA: WAtermark-based Source Attribution for Large Language Model-Generated Data

    Authors: Jingtan Wang, Xinyang Lu, Zitong Zhao, Zhongxiang Dai, Chuan-Sheng Foo, See-Kiong Ng, Bryan Kian Hsiang Low

    Abstract: The impressive performances of large language models (LLMs) and their immense potential for commercialization have given rise to serious concerns over the intellectual property (IP) of their training data. In particular, the synthetic texts generated by LLMs may infringe the IP of the data being used to train the LLMs. To this end, it is imperative to be able to (a) identify the data provider who…

    Submitted 1 October, 2023; originally announced October 2023.

  12. arXiv:2308.02488  [pdf, other]

    physics.plasm-ph

    Recovering non-Maxwellian particle velocity distribution functions from collective Thomson-scattered spectra

    Authors: Bryan C. Foo, Derek B. Schaeffer, Peter V. Heuer

    Abstract: Collective optical Thomson scattering (TS) is a diagnostic commonly used to characterize plasma parameters. These parameters are typically extracted by a fitting algorithm that minimizes the difference between a measured scattered spectrum and an analytic spectrum calculated from the velocity distribution function (VDF) of the plasma. However, most existing TS analysis algorithms assume the VDFs a…

    Submitted 4 August, 2023; originally announced August 2023.

  13. arXiv:2307.07542  [pdf, other]

    eess.SP cs.AI cs.LG

    Source-Free Domain Adaptation with Temporal Imputation for Time Series Data

    Authors: Mohamed Ragab, Emadeldeen Eldele, Min Wu, Chuan-Sheng Foo, Xiaoli Li, Zhenghua Chen

    Abstract: Source-free domain adaptation (SFDA) aims to adapt a pretrained model from a labeled source domain to an unlabeled target domain without access to the source domain data, preserving source domain privacy. Despite its prevalence in visual applications, SFDA is largely unexplored in time series applications. The existing SFDA methods that are mainly designed for visual applications may fail to handl…

    Submitted 14 July, 2023; originally announced July 2023.

    Comments: Accepted in KDD'23

  14. arXiv:2307.07489  [pdf, other]

    cs.LG cs.CV

    PseudoCal: A Source-Free Approach to Unsupervised Uncertainty Calibration in Domain Adaptation

    Authors: Dapeng Hu, Jian Liang, Xinchao Wang, Chuan-Sheng Foo

    Abstract: Unsupervised domain adaptation (UDA) has witnessed remarkable advancements in improving the accuracy of models for unlabeled target domains. However, the calibration of predictive uncertainty in the target domain, a crucial aspect of the safe deployment of UDA models, has received limited attention. The conventional in-domain calibration method, temperature scaling (TempScal), encounters…

    Submitted 14 July, 2023; originally announced July 2023.

  15. arXiv:2306.05764  [pdf, other]

    cs.LG cs.AI cs.GT cs.MA

    Fair yet Asymptotically Equal Collaborative Learning

    Authors: Xiaoqiang Lin, Xinyi Xu, See-Kiong Ng, Chuan-Sheng Foo, Bryan Kian Hsiang Low

    Abstract: In collaborative learning with streaming data, nodes (e.g., organizations) jointly and continuously learn a machine learning (ML) model by sharing the latest model updates computed from their latest streaming data. For the more resourceful nodes to be willing to share their model updates, they need to be fairly incentivized. This paper explores an incentive design that guarantees fairness so that…

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: Accepted to 40th International Conference on Machine Learning (ICML 2023), 37 pages

  16. arXiv:2305.18080  [pdf, other]

    cond-mat.str-el cond-mat.mes-hall

    Extended magic phase in twisted graphene multilayers

    Authors: D. C. W. Foo, Z. Zhan, Mohammed M. Al Ezzi, L. Peng, S. Adam, F. Guinea

    Abstract: Theoretical and experimental studies have verified the existence of "magic angles" in twisted bilayer graphene, where the twist between layers gives rise to flat bands and consequently highly correlated phases. Narrow bands can also exist in multilayers with alternating twist angles, and recent theoretical work suggests that they can also be found in trilayers with twist angles between neighbori…

    Submitted 7 June, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

    Comments: Main text: 4 pages, 4 figures -- Supplementary Material: 15 pages, 13 figures

    Journal ref: Phys. Rev. Research 6, 013165 (2024)

  17. arXiv:2304.08279  [pdf, other]

    cs.CV

    MoDA: Modeling Deformable 3D Objects from Casual Videos

    Authors: Chaoyue Song, Jiacheng Wei, Tianyi Chen, Yiwen Chen, Chuan Sheng Foo, Fayao Liu, Guosheng Lin

    Abstract: In this paper, we focus on the challenges of modeling deformable 3D objects from casual videos. With the popularity of neural radiance fields (NeRF), many works extend it to dynamic scenes with a canonical NeRF and a deformation model that achieves 3D point transformation between the observation space and the canonical space. Recent works rely on linear blend skinning (LBS) to achieve the canonica…

    Submitted 19 June, 2024; v1 submitted 17 April, 2023; originally announced April 2023.

  18. arXiv:2303.13724  [pdf, other]

    cs.CV

    Harmonizing Base and Novel Classes: A Class-Contrastive Approach for Generalized Few-Shot Segmentation

    Authors: Weide Liu, Zhonghua Wu, Yang Zhao, Yuming Fang, Chuan-Sheng Foo, Jun Cheng, Guosheng Lin

    Abstract: Current methods for few-shot segmentation (FSSeg) have mainly focused on improving the performance of novel classes while neglecting the performance of base classes. To overcome this limitation, the task of generalized few-shot semantic segmentation (GFSSeg) has been introduced, aiming to predict segmentation masks for both base and novel classes. However, the current prototype-based methods do no…

    Submitted 23 March, 2023; originally announced March 2023.

  19. arXiv:2301.13514  [pdf, other]

    cs.CV cs.AI cs.LG

    Fourier Sensitivity and Regularization of Computer Vision Models

    Authors: Kiran Krishnamachari, See-Kiong Ng, Chuan-Sheng Foo

    Abstract: Recent work has empirically shown that deep neural networks latch on to the Fourier statistics of training data and show increased sensitivity to Fourier-basis directions in the input. Understanding and modifying this Fourier-sensitivity of computer vision models may help improve their robustness. Hence, in this paper we study the frequency sensitivity characteristics of deep neural networks using…

    Submitted 31 January, 2023; originally announced January 2023.

    Comments: Published in TMLR, https://openreview.net/forum?id=VmTYgjYloM

    Journal ref: TMLR 2022

  20. arXiv:2212.07585  [pdf, other]

    cs.CV cs.LG

    Rethinking the Role of Pre-Trained Networks in Source-Free Domain Adaptation

    Authors: Wenyu Zhang, Li Shen, Chuan-Sheng Foo

    Abstract: Source-free domain adaptation (SFDA) aims to adapt a source model trained on a fully-labeled source domain to an unlabeled target domain. Large-data pre-trained networks are used to initialize source models during source training, and subsequently discarded. However, source training can cause the model to overfit to source data distribution and lose applicable target domain knowledge. We propose t…

    Submitted 25 August, 2023; v1 submitted 14 December, 2022; originally announced December 2022.

    Comments: Accepted to ICCV 2023

  21. arXiv:2212.04372  [pdf]

    eess.SY

    DECO2 An Open-source Energy System Decarbonisation Planning Software Including Negative Emissions Technologies

    Authors: Purusothmn Nair S. Bhasker Nair, Raymond R. Tan, Dominic C. Y. Foo, Disni Gamaralalage, Michael Short

    Abstract: The deployment of CO2 capture and storage (CCS) and negative emissions technologies (NETs) is crucial to meet the net-zero target by year 2050, as emphasised by the Glasgow Climate Pact. Over the years, several energy planning models have been developed to address the temporal aspects of carbon management. However, limited works have incorporated CCS and NETs for bottom-up energy planning at the…

    Submitted 8 December, 2022; originally announced December 2022.

  22. arXiv:2212.00630  [pdf, other]

    cs.LG cs.CY

    Probably Approximate Shapley Fairness with Applications in Machine Learning

    Authors: Zijian Zhou, Xinyi Xu, Rachael Hwee Ling Sim, Chuan Sheng Foo, Kian Hsiang Low

    Abstract: The Shapley value (SV) is adopted in various scenarios in machine learning (ML), including data valuation, agent valuation, and feature attribution, as it satisfies their fairness requirements. However, as exact SVs are infeasible to compute in practice, SV estimates are approximated instead. This approximation step raises an important question: do the SV estimates preserve the fairness guarantees…

    Submitted 1 December, 2022; originally announced December 2022.

    Comments: 37th AAAI Conference on Artificial Intelligence (AAAI 2023)

  23. arXiv:2209.14624  [pdf, other]

    cs.LG cs.CV

    Is Complexity Required for Neural Network Pruning? A Case Study on Global Magnitude Pruning

    Authors: Manas Gupta, Efe Camci, Vishandi Rudy Keneta, Abhishek Vaidyanathan, Ritwik Kanodia, Chuan-Sheng Foo, Wu Min, Lin Jie

    Abstract: Pruning neural networks has become popular in the last decade when it was shown that a large number of weights can be safely removed from modern neural networks without compromising accuracy. Numerous pruning methods have been proposed since, each claiming to be better than prior art, however, at the cost of increasingly complex pruning methodologies. These methodologies include utilizing importan…

    Submitted 7 January, 2024; v1 submitted 29 September, 2022; originally announced September 2022.

  24. arXiv:2206.07876  [pdf, other]

    cs.LG

    Domain Generalization via Selective Consistency Regularization for Time Series Classification

    Authors: Wenyu Zhang, Mohamed Ragab, Chuan-Sheng Foo

    Abstract: Domain generalization methods aim to learn models robust to domain shift with data from a limited number of source domains and without access to target domain samples during training. Popular domain alignment methods for domain generalization seek to extract domain-invariant features by minimizing the discrepancy between feature distributions across all domains, disregarding inter-domain relations…

    Submitted 15 June, 2022; originally announced June 2022.

    Comments: Accepted to ICPR 2022

  25. arXiv:2205.15234  [pdf, other]

    cs.CV cs.LG

    Few-Shot Adaptation of Pre-Trained Networks for Domain Shift

    Authors: Wenyu Zhang, Li Shen, Wanyue Zhang, Chuan-Sheng Foo

    Abstract: Deep networks are prone to performance degradation when there is a domain shift between the source (training) data and target (test) data. Recent test-time adaptation methods update batch normalization layers of pre-trained source models deployed in new target environments with streaming data to mitigate such performance degradation. Although such methods can adapt on-the-fly without first collect…

    Submitted 22 October, 2022; v1 submitted 30 May, 2022; originally announced May 2022.

    Comments: Accepted to IJCAI 2022

  26. SemiCurv: Semi-Supervised Curvilinear Structure Segmentation

    Authors: Xun Xu, Manh Cuong Nguyen, Yasin Yazici, Kangkang Lu, Hlaing Min, Chuan-Sheng Foo

    Abstract: Recent work on curvilinear structure segmentation has mostly focused on backbone network design and loss engineering. The challenge of collecting labelled data, an expensive and labor intensive process, has been overlooked. While labelled data is expensive to obtain, unlabelled data is often readily available. In this work, we propose SemiCurv, a semi-supervised learning (SSL) framework for curvil…

    Submitted 19 May, 2022; v1 submitted 17 May, 2022; originally announced May 2022.

    Comments: IEEE Transactions on Image Processing

  27. arXiv:2205.03001  [pdf, other]

    cs.CV

    Revisiting Pretraining for Semi-Supervised Learning in the Low-Label Regime

    Authors: Xun Xu, Jingyi Liao, Lile Cai, Manh Cuong Nguyen, Kangkang Lu, Wanyue Zhang, Yasin Yazici, Chuan Sheng Foo

    Abstract: Semi-supervised learning (SSL) addresses the lack of labeled data by exploiting large unlabeled data through pseudolabeling. However, in the extremely low-label regime, pseudo labels could be incorrect, a.k.a. the confirmation bias, and the pseudo labels will in turn harm the network training. Recent studies combined finetuning (FT) from pretrained weights with SSL to mitigate the challenges and c…

    Submitted 5 May, 2022; originally announced May 2022.

  28. arXiv:2205.01006  [pdf, other]

    cs.CV cs.AI

    Open-Set Semi-Supervised Learning for 3D Point Cloud Understanding

    Authors: Xian Shi, Xun Xu, Wanyue Zhang, Xiatian Zhu, Chuan Sheng Foo, Kui Jia

    Abstract: Semantic understanding of 3D point cloud relies on learning models with massively annotated data, which, in many cases, are expensive or difficult to collect. This has led to an emerging research interest in semi-supervised learning (SSL) for 3D point cloud. It is commonly assumed in SSL that the unlabeled data are drawn from the same distribution as that of the labeled ones; this assumption, howe…

    Submitted 2 May, 2022; originally announced May 2022.

  29. ADATIME: A Benchmarking Suite for Domain Adaptation on Time Series Data

    Authors: Mohamed Ragab, Emadeldeen Eldele, Wee Ling Tan, Chuan-Sheng Foo, Zhenghua Chen, Min Wu, Chee-Keong Kwoh, Xiaoli Li

    Abstract: Unsupervised domain adaptation methods aim to generalize well on unlabeled test data that may have a different (shifted) distribution from the training data. Such methods are typically developed on image data, and their application to time series data is less explored. Existing works on time series domain adaptation suffer from inconsistencies in evaluation schemes, datasets, and backbone neural n…

    Submitted 5 May, 2023; v1 submitted 15 March, 2022; originally announced March 2022.

    Comments: Accepted in the ACM Transactions on Knowledge Discovery from Data (TKDD)

  30. arXiv:2202.09072  [pdf, other]

    cond-mat.dis-nn cond-mat.quant-gas cond-mat.str-el

    A stabilization mechanism for many-body localization in two dimensions

    Authors: D. C. W. Foo, N. Swain, P. Sengupta, G. Lemarié, S. Adam

    Abstract: Experiments in cold atom systems see almost identical signatures of many body localization (MBL) in both one-dimensional ($d=1$) and two-dimensional ($d=2$) systems despite the thermal avalanche hypothesis showing that the MBL phase is unstable for $d>1$. Underpinning the thermal avalanche argument is the assumption of exponential localization of local integrals of motion (LIOMs). In this work we…

    Submitted 18 February, 2022; originally announced February 2022.

    Journal ref: Phys. Rev. Research 5, L032011 (2023)

  31. arXiv:2112.09327  [pdf, other]

    cs.LG

    Incentivizing Collaboration in Machine Learning via Synthetic Data Rewards

    Authors: Sebastian Shenghong Tay, Xinyi Xu, Chuan Sheng Foo, Bryan Kian Hsiang Low

    Abstract: This paper presents a novel collaborative generative modeling (CGM) framework that incentivizes collaboration among self-interested parties to contribute data to a pool for training a generative model (e.g., GAN), from which synthetic data are drawn and distributed to the parties as rewards commensurate to their contributions. Distributing synthetic data as rewards (instead of trained models or mo…

    Submitted 17 December, 2021; originally announced December 2021.

    Comments: 36th AAAI Conference on Artificial Intelligence (AAAI 2022), Extended version with derivations, 42 pages

  32. arXiv:2112.06029  [pdf, other]

    cs.CV cs.LG

    On Automatic Data Augmentation for 3D Point Cloud Classification

    Authors: Wanyue Zhang, Xun Xu, Fayao Liu, Le Zhang, Chuan-Sheng Foo

    Abstract: Data augmentation is an important technique to reduce overfitting and improve learning performance, but existing works on data augmentation for 3D point cloud data are based on heuristics. In this work, we instead propose to automatically learn a data augmentation strategy using bilevel optimization. An augmentor is designed in a similar fashion to a conditional generator and is optimized by minim…

    Submitted 18 December, 2021; v1 submitted 11 December, 2021; originally announced December 2021.

    Comments: BMVC 2021

  33. On Representation Knowledge Distillation for Graph Neural Networks

    Authors: Chaitanya K. Joshi, Fayao Liu, Xu Xun, Jie Lin, Chuan-Sheng Foo

    Abstract: Knowledge distillation is a learning paradigm for boosting resource-efficient graph neural networks (GNNs) using more expressive yet cumbersome teacher models. Past work on distillation for GNNs proposed the Local Structure Preserving loss (LSP), which matches local structural relationships defined over edges across the student and teacher's node embeddings. This paper studies whether preserving t…

    Submitted 4 February, 2023; v1 submitted 9 November, 2021; originally announced November 2021.

    Comments: IEEE Transactions on Neural Networks and Learning Systems (TNNLS), Special Issue on Deep Neural Networks for Graphs: Theory, Models, Algorithms and Applications

  34. arXiv:2109.11428  [pdf, other]

    cs.LG cs.AI stat.ML

    An Evaluation of Anomaly Detection and Diagnosis in Multivariate Time Series

    Authors: Astha Garg, Wenyu Zhang, Jules Samaran, Savitha Ramasamy, Chuan-Sheng Foo

    Abstract: Several techniques for multivariate time series anomaly detection have been proposed recently, but a systematic comparison on a common set of datasets and metrics is lacking. This paper presents a systematic and comprehensive evaluation of unsupervised and semi-supervised deep-learning based methods for anomaly detection and diagnosis on multivariate time series data from cyberphysical systems. Un…

    Submitted 23 September, 2021; originally announced September 2021.

    Comments: IEEE Transactions on Neural Networks and Learning Systems

  35. Semi-supervised classification of radiology images with NoTeacher: A Teacher that is not Mean

    Authors: Balagopal Unnikrishnan, Cuong Nguyen, Shafa Balaram, Chao Li, Chuan Sheng Foo, Pavitra Krishnaswamy

    Abstract: Deep learning models achieve strong performance for radiology image classification, but their practical application is bottlenecked by the need for large labeled training datasets. Semi-supervised learning (SSL) approaches leverage small labeled datasets alongside larger unlabeled datasets and offer potential for reducing labeling cost. In this work, we introduce NoTeacher, a novel consistency-bas…

    Submitted 9 August, 2021; originally announced August 2021.

    Comments: Preprint submitted to Medical Image Analysis. Accepted in June 2021

    MSC Class: 41A05; 41A10; 65D05; 65D17

  36. arXiv:2108.02104  [pdf, other]

    cs.CV

    Point Discriminative Learning for Data-efficient 3D Point Cloud Analysis

    Authors: Fayao Liu, Guosheng Lin, Chuan-Sheng Foo, Chaitanya K. Joshi, Jie Lin

    Abstract: 3D point cloud analysis has drawn a lot of research attention due to its wide applications. However, collecting massive labelled 3D point cloud data is both time-consuming and labor-intensive. This calls for data-efficient learning methods. In this work we propose PointDisc, a point discriminative learning method to leverage self-supervisions for data-efficient 3D point cloud classification and se…

    Submitted 20 January, 2023; v1 submitted 4 August, 2021; originally announced August 2021.

    Comments: This work is published in 3DV 2022

  37. arXiv:2106.08587  [pdf, other]

    cond-mat.str-el cond-mat.dis-nn

    Evidence of many-body localization in 2D from quantum Monte Carlo simulation

    Authors: Ho-Kin Tang, N. Swain, D. C. W. Foo, B. J. J. Khor, G. Lemarié, F. F. Assaad, S. Adam, P. Sengupta

    Abstract: We use the stochastic series expansion quantum Monte Carlo method, together with the eigenstate-to-Hamiltonian construction, to map the localized Bose glass ground state of the disordered two-dimensional Heisenberg model to excited states of new target Hamiltonians. The localized nature of the ground state is established by studying the participation entropy, local entanglement entropy, and local…

    Submitted 3 May, 2022; v1 submitted 16 June, 2021; originally announced June 2021.

    Comments: (4+ε) pages with 4 figures, and the supplementary material

  38. arXiv:2101.06931  [pdf, other]

    cs.CV

    Label-Efficient Point Cloud Semantic Segmentation: An Active Learning Approach

    Authors: Xian Shi, Xun Xu, Ke Chen, Lile Cai, Chuan Sheng Foo, Kui Jia

    Abstract: Deep learning models are the state-of-the-art methods for semantic point cloud segmentation, the success of which relies on the availability of large-scale annotated datasets. However, it can be extremely time-consuming and prohibitively expensive to compile such datasets. In this work, we propose an active learning approach to maximize model performance given limited annotation budgets. We invest…

    Submitted 12 April, 2021; v1 submitted 18 January, 2021; originally announced January 2021.

  39. arXiv:2101.04859  [pdf]

    cs.LG eess.SP

    A*HAR: A New Benchmark towards Semi-supervised learning for Class-imbalanced Human Activity Recognition

    Authors: Govind Narasimman, Kangkang Lu, Arun Raja, Chuan Sheng Foo, Mohamed Sabry Aly, Jie Lin, Vijay Chandrasekhar

    Abstract: Despite the vast literature on Human Activity Recognition (HAR) with wearable inertial sensor data, it is perhaps surprising that there are few studies investigating semi-supervised learning for HAR, particularly in a challenging scenario with a class imbalance problem. In this work, we present a new benchmark, called A*HAR, towards semi-supervised learning for class-imbalanced HAR. We evaluate state-…

    Submitted 12 January, 2021; originally announced January 2021.

    Comments: 5 pages, 3 figures

  40. arXiv:2006.14265  [pdf, other]

    cs.LG cs.CV stat.ML

    Empirical Analysis of Overfitting and Mode Drop in GAN Training

    Authors: Yasin Yazici, Chuan-Sheng Foo, Stefan Winkler, Kim-Hui Yap, Vijay Chandrasekhar

    Abstract: We examine two key questions in GAN training, namely overfitting and mode drop, from an empirical perspective. We show that when stochasticity is removed from the training procedure, GANs can overfit and exhibit almost no mode drop. Our results shed light on important characteristics of the GAN training procedure. They also provide evidence against prevailing intuitions that GANs do not memorize t…

    Submitted 25 June, 2020; originally announced June 2020.

    Comments: To appear in ICIP2020

  41. Classify and Generate: Using Classification Latent Space Representations for Image Generations

    Authors: Saisubramaniam Gopalakrishnan, Pranshu Ranjan Singh, Yasin Yazici, Chuan-Sheng Foo, Vijay Chandrasekhar, ArulMurugan Ambikapathi

    Abstract: Utilization of classification latent space information for downstream reconstruction and generation is an intriguing and a relatively unexplored area. In general, discriminative representations are rich in class-specific features but are too sparse for reconstruction, whereas, in autoencoders the representations are dense but have limited indistinguishable class-specific features, making them less…

    Submitted 14 December, 2021; v1 submitted 16 April, 2020; originally announced April 2020.

    Journal ref: Saisubramaniam Gopalakrishnan, Pranshu Ranjan Singh et al., Classify and generate: Using classification latent space representations for image generations, Neurocomputing, Volume 471, 2022, Pages 296-334, ISSN 0925-2312

  42. arXiv:2002.06015  [pdf, other]

    cs.LG stat.ML

    Scalable and Practical Natural Gradient for Large-Scale Deep Learning

    Authors: Kazuki Osawa, Yohei Tsuji, Yuichiro Ueno, Akira Naruse, Chuan-Sheng Foo, Rio Yokota

    Abstract: Large-scale distributed training of deep neural networks results in models with worse generalization performance as a result of the increase in the effective mini-batch size. Previous approaches attempt to address this problem by varying the learning rate and batch size over epochs and layers, or ad hoc modifications of batch normalization. We propose Scalable and Practical Natural Gradient Descen…

    Submitted 13 February, 2020; originally announced February 2020.

    Comments: arXiv admin note: text overlap with arXiv:1811.12019

  43. arXiv:1912.10364  [pdf, other]

    cs.LG cs.CV

    Learning to Impute: A General Framework for Semi-supervised Learning

    Authors: Wei-Hong Li, Chuan-Sheng Foo, Hakan Bilen

    Abstract: Recent semi-supervised learning methods have been shown to achieve comparable results to their supervised counterparts while using only a small portion of labels in image classification tasks thanks to their regularization strategies. In this paper, we take a more direct approach for semi-supervised learning and propose learning to impute the labels of unlabeled samples such that a network achieves bet…

    Submitted 24 September, 2020; v1 submitted 21 December, 2019; originally announced December 2019.

    Comments: Semi-supervised Learning, Meta-Learning, Learning to impute

  44. arXiv:1910.13582  [pdf, other]

    cond-mat.str-el cond-mat.quant-gas cond-mat.supr-con

    Diffusion Monte Carlo study of a spin-imbalanced two-dimensional Fermi gas with attractive interactions

    Authors: D. C. W. Foo, G. J. Conduit

    Abstract: We probe the superconducting gap in the zero temperature ground state of an attractively interacting spin-imbalanced two-dimensional Fermi gas with Diffusion Monte Carlo. A condensate fraction at nonzero pair momentum evidences a spatially non-uniform superconducting order parameter. Comparison with exact diagonalisation studies confirms that the nonzero condensate fraction across a range of nonze…

    Submitted 29 October, 2019; originally announced October 2019.

    Journal ref: Phys. Rev. A 100, 063602 (2019)

  45. arXiv:1902.03444  [pdf, other]

    cs.LG stat.ML

    Venn GAN: Discovering Commonalities and Particularities of Multiple Distributions

    Authors: Yasin Yazıcı, Bruno Lecouat, Chuan-Sheng Foo, Stefan Winkler, Kim-Hui Yap, Georgios Piliouras, Vijay Chandrasekhar

    Abstract: We propose a GAN design which models multiple distributions effectively and discovers their commonalities and particularities. Each data distribution is modeled with a mixture of $K$ generator distributions. As the generators are partially shared between the modeling of different true data distributions, shared ones capture the commonality of the distributions, while non-shared ones capture uniqu…

    Submitted 9 February, 2019; originally announced February 2019.

  46. arXiv:1812.07832  [pdf, other]

    cs.CV

    Semi-Supervised Deep Learning for Abnormality Classification in Retinal Images

    Authors: Bruno Lecouat, Ken Chang, Chuan-Sheng Foo, Balagopal Unnikrishnan, James M. Brown, Houssam Zenati, Andrew Beers, Vijay Chandrasekhar, Jayashree Kalpathy-Cramer, Pavitra Krishnaswamy

    Abstract: Supervised deep learning algorithms have enabled significant performance gains in medical image classification tasks. But these methods rely on large labeled datasets that require resource-intensive expert annotation. Semi-supervised generative adversarial network (GAN) approaches offer a means to learn from limited labeled data alongside larger unlabeled datasets, but have not been applied to dis…

    Submitted 19 December, 2018; originally announced December 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

    Report number: ML4H/2018/227

  47. arXiv:1812.02288  [pdf, other]

    cs.LG stat.ML

    Adversarially Learned Anomaly Detection

    Authors: Houssam Zenati, Manon Romain, Chuan Sheng Foo, Bruno Lecouat, Vijay Ramaseshan Chandrasekhar

    Abstract: Anomaly detection is a significant and hence well-studied problem. However, developing effective anomaly detection methods for complex and high-dimensional data remains a challenge. As Generative Adversarial Networks (GANs) are able to model the complex high-dimensional distributions of real-world data, they offer a promising approach to address this challenge. In this work, we propose an anomaly…

    Submitted 5 December, 2018; originally announced December 2018.

    Comments: In the Proceedings of the 20th IEEE International Conference on Data Mining (ICDM), 2018

  48. arXiv:1811.12065  [pdf, other]

    cs.NE cs.LG

    TEA-DNN: the Quest for Time-Energy-Accuracy Co-optimized Deep Neural Networks

    Authors: Lile Cai, Anne-Maelle Barneche, Arthur Herbout, Chuan Sheng Foo, Jie Lin, Vijay Ramaseshan Chandrasekhar, Mohamed M. Sabry

    Abstract: Embedded deep learning platforms have witnessed two simultaneous improvements. First, the accuracy of convolutional neural networks (CNNs) has been significantly improved through the use of automated neural-architecture search (NAS) algorithms to determine CNN structure. Second, there has been increasing interest in developing hardware accelerators for CNNs that provide improved inference performa…

    Submitted 21 October, 2019; v1 submitted 29 November, 2018; originally announced November 2018.

    Comments: Accepted by ISLPED2019

  49. arXiv:1811.06219  [pdf, other]

    physics.comp-ph cond-mat.mtrl-sci cs.LG

    Predicting thermoelectric properties from crystal graphs and material descriptors - first application for functional materials

    Authors: Leo Laugier, Daniil Bash, Jose Recatala, Hong Kuan Ng, Savitha Ramasamy, Chuan-Sheng Foo, Vijay R Chandrasekhar, Kedar Hippalgaonkar

    Abstract: We introduce the use of Crystal Graph Convolutional Neural Networks (CGCNN), Fully Connected Neural Networks (FCNN) and XGBoost to predict thermoelectric properties. The dataset for the CGCNN is independent of Density Functional Theory (DFT) and only relies on the crystal and atomic information, while that for the FCNN is based on a rich attribute list mined from Materialsproject.org. The results…

    Submitted 15 November, 2018; originally announced November 2018.

  50. arXiv:1811.04595  [pdf, other]

    cs.CV

    Holistic Multi-modal Memory Network for Movie Question Answering

    Authors: Anran Wang, Anh Tuan Luu, Chuan-Sheng Foo, Hongyuan Zhu, Yi Tay, Vijay Chandrasekhar

    Abstract: Answering questions according to multi-modal context is a challenging problem as it requires a deep integration of different data sources. Existing approaches only employ partial interactions among data sources in one attention hop. In this paper, we present the Holistic Multi-modal Memory Network (HMMN) framework which fully considers the interactions between different input sources (multi-modal…

    Submitted 12 November, 2018; originally announced November 2018.