Skip to main content

Showing 1–45 of 45 results for author: Nguyen, A T

  1. arXiv:2407.09035  [pdf, other

    eess.IV cs.CV

    GPC: Generative and General Pathology Image Classifier

    Authors: Anh Tien Nguyen, Jin Tae Kwak

    Abstract: Deep learning has been increasingly incorporated into various computational pathology applications to improve its efficiency, accuracy, and robustness. Although successful, most previous approaches for image classification have crucial drawbacks. There exist numerous tasks in pathology, but one needs to build a model per task, i.e., a task-specific model, thereby increasing the number of models, t… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: MICCAI-MedAGI 2023 (Best Paper Honorable Mention)

  2. arXiv:2407.09030  [pdf, other

    eess.IV cs.CV

    CAMP: Continuous and Adaptive Learning Model in Pathology

    Authors: Anh Tien Nguyen, Keunho Byeon, Kyungeun Kim, Boram Song, Seoung Wan Chae, Jin Tae Kwak

    Abstract: There exist numerous diagnostic tasks in pathology. Conventional computational pathology formulates and tackles them as independent and individual image classification problems, thereby resulting in computational inefficiency and high costs. To address the challenges, we propose a generic, unified, and universal framework, called a continuous and adaptive learning model in pathology (CAMP), for pa… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: Under review

  3. arXiv:2407.07360  [pdf, other

    cs.CV cs.LG

    Towards a text-based quantitative and explainable histopathology image analysis

    Authors: Anh Tien Nguyen, Trinh Thi Le Vuong, Jin Tae Kwak

    Abstract: Recently, vision-language pre-trained models have emerged in computational pathology. Previous works generally focused on the alignment of image-text pairs via the contrastive pre-training paradigm. Such pre-trained models have been applied to pathology image classification in zero-shot learning or transfer learning fashion. Herein, we hypothesize that the pre-trained vision-language models can be… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: MICCAI 2024 - Early acceptance (Top 11%)

  4. arXiv:2407.06581  [pdf, other

    cs.AI cs.CV

    Vision language models are blind

    Authors: Pooyan Rahmanzadehgervi, Logan Bolton, Mohammad Reza Taesiri, Anh Totti Nguyen

    Abstract: Large language models with vision capabilities (VLMs), e.g., GPT-4o and Gemini 1.5 Pro are powering countless image-text applications and scoring high on many vision-understanding benchmarks. We propose BlindTest, a suite of 7 visual tasks absurdly easy to humans such as identifying (a) whether two circles overlap; (b) whether two lines intersect; (c) which letter is being circled in a word; and (… ▽ More

    Submitted 12 July, 2024; v1 submitted 9 July, 2024; originally announced July 2024.

  5. arXiv:2403.05297  [pdf, other

    cs.CV cs.AI cs.CL

    PEEB: Part-based Image Classifiers with an Explainable and Editable Language Bottleneck

    Authors: Thang M. Pham, Peijie Chen, Tin Nguyen, Seunghyun Yoon, Trung Bui, Anh Totti Nguyen

    Abstract: CLIP-based classifiers rely on the prompt containing a {class name} that is known to the text encoder. Therefore, they perform poorly on new classes or the classes whose names rarely appear on the Internet (e.g., scientific names of birds). For fine-grained classification, we propose PEEB - an explainable and editable classifier to (1) express the class name into a set of text descriptors that des… ▽ More

    Submitted 12 April, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Comments: Findings of NAACL 2024 (long paper)

  6. An Application of Vector Autoregressive Model for Analyzing the Impact of Weather And Nearby Traffic Flow On The Traffic Volume

    Authors: Anh Thi-Hoang Nguyen, Dung Ha Nguyen, Trong-Hop Do

    Abstract: This paper aims to predict the traffic flow at one road segment based on nearby traffic volume and weather conditions. Our team also discover the impact of weather conditions and nearby traffic volume on the traffic flow at a target point. The analysis results will help solve the problem of traffic flow prediction and develop an optimal transport network with efficient traffic movement and minimal… ▽ More

    Submitted 12 November, 2023; originally announced November 2023.

    Comments: International Conference on Computing and Communication Technologies (RIVF2022)

    Report number: D1-2022-48

  7. arXiv:2311.06851  [pdf, other

    cs.CL

    Automatic Textual Normalization for Hate Speech Detection

    Authors: Anh Thi-Hoang Nguyen, Dung Ha Nguyen, Nguyet Thi Nguyen, Khanh Thanh-Duy Ho, Kiet Van Nguyen

    Abstract: Social media data is a valuable resource for research, yet it contains a wide range of non-standard words (NSW). These irregularities hinder the effective operation of NLP tools. Current state-of-the-art methods for the Vietnamese language address this issue as a problem of lexical normalization, involving the creation of manual rules or the implementation of multi-staged deep learning frameworks,… ▽ More

    Submitted 4 December, 2023; v1 submitted 12 November, 2023; originally announced November 2023.

    Comments: Accepted to present at 2023 International Conference on Intelligent Systems Design and Applications (ISDA2023)

  8. arXiv:2311.02803  [pdf, other

    cs.CV

    Fast and Interpretable Face Identification for Out-Of-Distribution Data Using Vision Transformers

    Authors: Hai Phan, Cindy Le, Vu Le, Yihui He, Anh Totti Nguyen

    Abstract: Most face identification approaches employ a Siamese neural network to compare two images at the image embedding level. Yet, this technique can be subject to occlusion (e.g. faces with masks or sunglasses) and out-of-distribution data. DeepFace-EMD (Phan et al. 2022) reaches state-of-the-art accuracy on out-of-distribution data by first comparing two images at the image level, and then at the patc… ▽ More

    Submitted 5 November, 2023; originally announced November 2023.

    Comments: 20 pages, 15 Figures

  9. arXiv:2310.07984  [pdf

    cs.AI cs.CE

    Large Language Models for Scientific Synthesis, Inference and Explanation

    Authors: Yizhen Zheng, Huan Yee Koh, Jiaxin Ju, Anh T. N. Nguyen, Lauren T. May, Geoffrey I. Webb, Shirui Pan

    Abstract: Large language models are a form of artificial intelligence systems whose primary knowledge consists of the statistical patterns, semantic relationships, and syntactical structures of language1. Despite their limited forms of "knowledge", these systems are adept at numerous complex tasks including creative writing, storytelling, translation, question-answering, summarization, and computer code gen… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: Supplementary Information: https://drive.google.com/file/d/1KrpUpzuFTeMx6a6zl18lqdo8vV-UUa1Z/view?usp=sharing Github Repo: https://github.com/zyzisastudyreallyhardguy/LLM4SD

  10. arXiv:2308.13651  [pdf, other

    cs.CV cs.HC

    PCNN: Probable-Class Nearest-Neighbor Explanations Improve Fine-Grained Image Classification Accuracy for AIs and Humans

    Authors: Giang Nguyen, Valerie Chen, Mohammad Reza Taesiri, Anh Totti Nguyen

    Abstract: Nearest neighbors (NN) are traditionally used to compute final decisions, e.g., in Support Vector Machines or k-NN classifiers, and to provide users with explanations for the model's decision. In this paper, we show a novel utility of nearest neighbors: To improve predictions of a frozen, pretrained classifier C. We leverage an image comparator S that (1) compares the input image with NN images fr… ▽ More

    Submitted 23 April, 2024; v1 submitted 25 August, 2023; originally announced August 2023.

  11. arXiv:2305.18458  [pdf, other

    cs.LG

    Conditional Support Alignment for Domain Adaptation with Label Shift

    Authors: Anh T Nguyen, Lam Tran, Anh Tong, Tuan-Duy H. Nguyen, Toan Tran

    Abstract: Unsupervised domain adaptation (UDA) refers to a domain adaptation framework in which a learning model is trained based on the labeled samples on the source domain and unlabelled ones in the target domain. The dominant existing methods in the field that rely on the classical covariate shift assumption to learn domain-invariant feature representation have yielded suboptimal performance under the la… ▽ More

    Submitted 29 May, 2023; originally announced May 2023.

  12. arXiv:2301.11799  [pdf

    cs.HC

    Factors influencing to use of Bluezone

    Authors: Vinh T. Nguyen, Anh T. Nguyen, Tan H. Nguyen, Dinh K. Luong

    Abstract: This study aims to understand the main factors and their influence on the behavioral intention of users about using Bluezone. Surveys are sent to users through the Google Form tool. Experimental results through analysis of exploratory factors on 224 survey subjects show that there are 4 main factors affecting user behavior. Structural equation modeling indicates that trust, performance expectation… ▽ More

    Submitted 24 January, 2023; originally announced January 2023.

    Comments: in Vietnamese language

  13. arXiv:2209.03148  [pdf, other

    cs.LG

    Improving Out-of-Distribution Detection via Epistemic Uncertainty Adversarial Training

    Authors: Derek Everett, Andre T. Nguyen, Luke E. Richards, Edward Raff

    Abstract: The quantification of uncertainty is important for the adoption of machine learning, especially to reject out-of-distribution (OOD) data back to human experts for review. Yet progress has been slow, as a balance must be struck between computational efficiency and the quality of uncertainty estimates. For this reason many use deep ensembles of neural networks or Monte Carlo dropout for reasonable u… ▽ More

    Submitted 9 September, 2022; v1 submitted 5 September, 2022; originally announced September 2022.

    Comments: 8 pages, 5 figures

  14. Detecting COVID-19 from digitized ECG printouts using 1D convolutional neural networks

    Authors: Thao Nguyen, Hieu H. Pham, Huy Khiem Le, Anh Tu Nguyen, Ngoc Tien Thanh, Cuong Do

    Abstract: The COVID-19 pandemic has exposed the vulnerability of healthcare services worldwide, raising the need to develop novel tools to provide rapid and cost-effective screening and diagnosis. Clinical reports indicated that COVID-19 infection may cause cardiac injury, and electrocardiograms (ECG) may serve as a diagnostic biomarker for COVID-19. This study aims to utilize ECG signals to detect COVID-19… ▽ More

    Submitted 5 October, 2022; v1 submitted 10 August, 2022; originally announced August 2022.

    Comments: Accepted with minor revision by Plos One

  15. arXiv:2206.00524  [pdf, other

    cs.CL cs.AI cs.LG

    Vietnamese Hate and Offensive Detection using PhoBERT-CNN and Social Media Streaming Data

    Authors: Khanh Q. Tran, An T. Nguyen, Phu Gia Hoang, Canh Duc Luu, Trong-Hop Do, Kiet Van Nguyen

    Abstract: Society needs to develop a system to detect hate and offense to build a healthy and safe environment. However, current research in this field still faces four major shortcomings, including deficient pre-processing techniques, indifference to data imbalance issues, modest performance models, and lacking practical applications. This paper focused on developing an intelligent system capable of addres… ▽ More

    Submitted 1 June, 2022; originally announced June 2022.

  16. arXiv:2205.01039  [pdf

    cs.CY

    Big Tech Companies Impact on Research at the Faculty of Information Technology and Electrical Engineering

    Authors: Ahmad Hassanpour, An Thi Nguyen, Anshul Rani, Sarang Shaikh, Ying Xu, Haoyu Zhang

    Abstract: Artificial intelligence is gaining momentum, ongoing pandemic is fuel to that with more opportunities in every sector specially in health and education sector. But with the growth in technology, challenges associated with ethics also grow (Katharine Schwab, 2021). Whenever a new AI product is developed, companies publicize that their systems are transparent, fair, and are in accordance with the ex… ▽ More

    Submitted 10 April, 2022; originally announced May 2022.

  17. arXiv:2203.08648  [pdf, other

    cs.RO cs.AI cs.HC cs.LG q-bio.NC

    Artificial Intelligence Enables Real-Time and Intuitive Control of Prostheses via Nerve Interface

    Authors: Diu Khue Luu, Anh Tuan Nguyen, Ming Jiang, Markus W. Drealan, Jian Xu, Tong Wu, Wing-kin Tam, Wenfeng Zhao, Brian Z. H. Lim, Cynthia K. Overstreet, Qi Zhao, Jonathan Cheng, Edward W. Keefer, Zhi Yang

    Abstract: Objective: The next generation prosthetic hand that moves and feels like a real hand requires a robust neural interconnection between the human minds and machines. Methods: Here we present a neuroprosthetic system to demonstrate that principle by employing an artificial intelligence (AI) agent to translate the amputee's movement intent through a peripheral nerve interface. The AI agent is designed… ▽ More

    Submitted 16 March, 2022; originally announced March 2022.

  18. arXiv:2203.07596  [pdf, other

    cs.LG cs.CV

    Task-Agnostic Robust Representation Learning

    Authors: A. Tuan Nguyen, Ser Nam Lim, Philip Torr

    Abstract: It has been reported that deep learning models are extremely vulnerable to small but intentionally chosen perturbations of its input. In particular, a deep network, despite its near-optimal accuracy on the clean images, often mis-classifies an image with a worst-case but humanly imperceptible perturbation (so-called adversarial examples). To tackle this problem, a great amount of research has been… ▽ More

    Submitted 14 March, 2022; originally announced March 2022.

  19. arXiv:2202.08985  [pdf, ps, other

    cs.LG

    Out of Distribution Data Detection Using Dropout Bayesian Neural Networks

    Authors: Andre T. Nguyen, Fred Lu, Gary Lopez Munoz, Edward Raff, Charles Nicholas, James Holt

    Abstract: We explore the utility of information contained within a dropout based Bayesian neural network (BNN) for the task of detecting out of distribution (OOD) data. We first show how previous attempts to leverage the randomized embeddings induced by the intermediate layers of a dropout BNN can fail due to the distance metric used. We introduce an alternative approach to measuring embedding uncertainty,… ▽ More

    Submitted 17 February, 2022; originally announced February 2022.

  20. arXiv:2111.13807  [pdf, other

    cs.LG cs.AI stat.ML

    Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization

    Authors: Thanh Nguyen-Tang, Sunil Gupta, A. Tuan Nguyen, Svetha Venkatesh

    Abstract: Offline policy learning (OPL) leverages existing data collected a priori for policy optimization without any active exploration. Despite the prevalence and recent interest in this problem, its theoretical and algorithmic foundations in function approximation settings remain under-developed. In this paper, we consider this problem on the axes of distributional shift, optimization, and generalizatio… ▽ More

    Submitted 13 March, 2022; v1 submitted 26 November, 2021; originally announced November 2021.

    Comments: A full version at ICLR'22; a preliminary version at Offline RL Workshop at NeurIPS'21; code: https://github.com/thanhnguyentang/offline_neural_bandits

    Journal ref: ICLR 2022

  21. arXiv:2108.04081  [pdf, other

    cs.LG cs.CR

    Leveraging Uncertainty for Improved Static Malware Detection Under Extreme False Positive Constraints

    Authors: Andre T. Nguyen, Edward Raff, Charles Nicholas, James Holt

    Abstract: The detection of malware is a critical task for the protection of computing environments. This task often requires extremely low false positive rates (FPR) of 0.01% or even lower, for which modern machine learning has no readily available tools. We introduce the first broad investigation of the use of uncertainty for malware detection across multiple datasets, models, and feature types. We show ho… ▽ More

    Submitted 9 August, 2021; originally announced August 2021.

    Report number: IJCAI-ACD/2021/102

  22. arXiv:2106.07780  [pdf, other

    cs.LG

    KL Guided Domain Adaptation

    Authors: A. Tuan Nguyen, Toan Tran, Yarin Gal, Philip H. S. Torr, Atılım Güneş Baydin

    Abstract: Domain adaptation is an important problem and often needed for real-world applications. In this problem, instead of i.i.d. training and testing datapoints, we assume that the source (training) data and the target (testing) data have different distributions. With that setting, the empirical risk minimization training procedure often does not perform well, since it does not account for the change in… ▽ More

    Submitted 14 March, 2022; v1 submitted 14 June, 2021; originally announced June 2021.

    Comments: Accepted to ICLR2022

  23. arXiv:2103.13452  [pdf, other

    cs.RO cs.AI cs.HC

    A Portable, Self-Contained Neuroprosthetic Hand with Deep Learning-Based Finger Control

    Authors: Anh Tuan Nguyen, Markus W. Drealan, Diu Khue Luu, Ming Jiang, Jian Xu, Jonathan Cheng, Qi Zhao, Edward W. Keefer, Zhi Yang

    Abstract: Objective: Deep learning-based neural decoders have emerged as the prominent approach to enable dexterous and intuitive control of neuroprosthetic hands. Yet few studies have materialized the use of deep learning in clinical settings due to its high computational requirements. Methods: Recent advancements of edge computing devices bring the potential to alleviate this problem. Here we present the… ▽ More

    Submitted 24 March, 2021; originally announced March 2021.

    Journal ref: Journal of Neural Engineering 18 (2021) 056051

  24. arXiv:2102.05082  [pdf, other

    cs.LG

    Domain Invariant Representation Learning with Domain Density Transformations

    Authors: A. Tuan Nguyen, Toan Tran, Yarin Gal, Atılım Güneş Baydin

    Abstract: Domain generalization refers to the problem where we aim to train a model on data from a set of source domains so that the model can generalize to unseen target domains. Naively training a model on the aggregate set of data (pooled from all source domains) has been shown to perform suboptimally, since the information learned by that model might be domain-specific and generalize imperfectly to targ… ▽ More

    Submitted 15 February, 2022; v1 submitted 9 February, 2021; originally announced February 2021.

    Comments: NeurIPS 2021

  25. arXiv:2101.11294  [pdf, other

    cs.IT cs.DM

    Improved algorithms for non-adaptive group testing with consecutive positives

    Authors: Thach V. Bui, Mahdi Cheraghchi, An T. H. Nguyen, Thuc D. Nguyen

    Abstract: The goal of group testing is to efficiently identify a few specific items, called positives, in a large population of items via tests. A test is an action on a subset of items which returns positive if the subset contains at least one positive and negative otherwise. In non-adaptive group testing, all tests are fixed in advance and can be performed in parallel. In this work, we consider non-adapti… ▽ More

    Submitted 5 November, 2021; v1 submitted 27 January, 2021; originally announced January 2021.

  26. arXiv:2010.01891  [pdf, other

    cs.CL cs.AI

    A Pilot Study of Text-to-SQL Semantic Parsing for Vietnamese

    Authors: Anh Tuan Nguyen, Mai Hoang Dao, Dat Quoc Nguyen

    Abstract: Semantic parsing is an important NLP task. However, Vietnamese is a low-resource language in this research area. In this paper, we present the first public large-scale Text-to-SQL semantic parsing dataset for Vietnamese. We extend and evaluate two strong semantic parsing baselines EditSQL (Zhang et al., 2019) and IRNet (Guo et al., 2019) on our dataset. We compare the two baselines with key config… ▽ More

    Submitted 5 October, 2020; originally announced October 2020.

    Comments: EMNLP 2020 (Findings)

  27. arXiv:2009.05147  [pdf, other

    cs.CV cs.LG cs.RO stat.ML

    Practical Cross-modal Manifold Alignment for Grounded Language

    Authors: Andre T. Nguyen, Luke E. Richards, Gaoussou Youssouf Kebe, Edward Raff, Kasra Darvish, Frank Ferraro, Cynthia Matuszek

    Abstract: We propose a cross-modality manifold alignment procedure that leverages triplet loss to jointly learn consistent, multi-modal embeddings of language-based concepts of real-world items. Our approach learns these embeddings by sampling triples of anchor, positive, and negative data points from RGB-depth images and their natural language descriptions. We show that our approach can benefit from, but d… ▽ More

    Submitted 1 September, 2020; originally announced September 2020.

  28. arXiv:2008.12854  [pdf, other

    cs.CL cs.LG

    TATL at W-NUT 2020 Task 2: A Transformer-based Baseline System for Identification of Informative COVID-19 English Tweets

    Authors: Anh Tuan Nguyen

    Abstract: As the COVID-19 outbreak continues to spread throughout the world, more and more information about the pandemic has been shared publicly on social media. For example, there are a huge number of COVID-19 English Tweets daily on Twitter. However, the majority of those Tweets are uninformative, and hence it is important to be able to automatically select only the informative ones for downstream appli… ▽ More

    Submitted 28 August, 2020; originally announced August 2020.

  29. arXiv:2006.14222  [pdf, other

    cs.LG stat.ML

    Set Based Stochastic Subsampling

    Authors: Bruno Andreis, Seanie Lee, A. Tuan Nguyen, Juho Lee, Eunho Yang, Sung Ju Hwang

    Abstract: Deep models are designed to operate on huge volumes of high dimensional data such as images. In order to reduce the volume of data these models must process, we propose a set-based two-stage end-to-end neural subsampling model that is jointly optimized with an \textit{arbitrary} downstream task network (e.g. classifier). In the first stage, we efficiently subsample \textit{candidate elements} usin… ▽ More

    Submitted 30 May, 2022; v1 submitted 25 June, 2020; originally announced June 2020.

    Comments: 20 pages

  30. arXiv:2006.12777  [pdf, other

    cs.LG stat.ML

    Clinical Risk Prediction with Temporal Probabilistic Asymmetric Multi-Task Learning

    Authors: A. Tuan Nguyen, Hyewon Jeong, Eunho Yang, Sung Ju Hwang

    Abstract: Although recent multi-task learning methods have shown to be effective in improving the generalization of deep neural networks, they should be used with caution for safety-critical applications, such as clinical risk prediction. This is because even if they achieve improved task-average performance, they may still yield degraded performance on individual tasks, which may be critical (e.g., predict… ▽ More

    Submitted 18 February, 2021; v1 submitted 23 June, 2020; originally announced June 2020.

    Comments: AAAI 2021. The first two authors contributed equally to this work. 10 pages, 4 figures, 4 tables

  31. arXiv:2005.10200  [pdf, other

    cs.CL cs.LG

    BERTweet: A pre-trained language model for English Tweets

    Authors: Dat Quoc Nguyen, Thanh Vu, Anh Tuan Nguyen

    Abstract: We present BERTweet, the first public large-scale pre-trained language model for English Tweets. Our BERTweet, having the same architecture as BERT-base (Devlin et al., 2019), is trained using the RoBERTa pre-training procedure (Liu et al., 2019). Experiments show that BERTweet outperforms strong baselines RoBERTa-base and XLM-R-base (Conneau et al., 2020), producing better performance results tha… ▽ More

    Submitted 5 October, 2020; v1 submitted 20 May, 2020; originally announced May 2020.

    Comments: In Proceedings of EMNLP 2020: System Demonstrations

  32. arXiv:2003.00744  [pdf, other

    cs.CL cs.AI

    PhoBERT: Pre-trained language models for Vietnamese

    Authors: Dat Quoc Nguyen, Anh Tuan Nguyen

    Abstract: We present PhoBERT with two versions, PhoBERT-base and PhoBERT-large, the first public large-scale monolingual language models pre-trained for Vietnamese. Experimental results show that PhoBERT consistently outperforms the recent best pre-trained multilingual model XLM-R (Conneau et al., 2020) and improves the state-of-the-art in multiple Vietnamese-specific NLP tasks including Part-of-speech tagg… ▽ More

    Submitted 5 October, 2020; v1 submitted 2 March, 2020; originally announced March 2020.

    Comments: EMNLP 2020 (Findings)

  33. arXiv:1911.04636  [pdf, other

    cs.LG eess.SY stat.ML

    Robust Design of Deep Neural Networks against Adversarial Attacks based on Lyapunov Theory

    Authors: Arash Rahnama, Andre T. Nguyen, Edward Raff

    Abstract: Deep neural networks (DNNs) are vulnerable to subtle adversarial perturbations applied to the input. These adversarial perturbations, though imperceptible, can easily mislead the DNN. In this work, we take a control theoretic approach to the problem of robustness in DNNs. We treat each individual layer of the DNN as a nonlinear dynamical system and use Lyapunov theory to prove stability and robust… ▽ More

    Submitted 11 November, 2019; originally announced November 2019.

  34. arXiv:1911.02673  [pdf, other

    cs.LG cs.CY stat.AP stat.ML

    Towards the Use of Neural Networks for Influenza Prediction at Multiple Spatial Resolutions

    Authors: Emily L. Aiken, Andre T. Nguyen, Mauricio Santillana

    Abstract: We introduce the use of a Gated Recurrent Unit (GRU) for influenza prediction at the state- and city-level in the US, and experiment with the inclusion of real-time flu-related Internet search data. We find that a GRU has lower prediction error than current state-of-the-art methods for data-driven influenza prediction at time horizons of over two weeks. In contrast with other machine learning appr… ▽ More

    Submitted 13 November, 2019; v1 submitted 6 November, 2019; originally announced November 2019.

    Comments: Machine Learning for Health (ML4H) at NeurIPS 2019 - Extended Abstract; Added Footer

  35. arXiv:1910.04753  [pdf

    cs.CR cs.LG

    Would a File by Any Other Name Seem as Malicious?

    Authors: Andre T. Nguyen, Edward Raff, Aaron Sant-Miller

    Abstract: Successful malware attacks on information technology systems can cause millions of dollars in damage, the exposure of sensitive and private information, and the irreversible destruction of data. Anti-virus systems that analyze a file's contents use a combination of static and dynamic analysis to detect and remove/remediate such malware. However, examining a file's entire contents is not always pos… ▽ More

    Submitted 9 October, 2019; originally announced October 2019.

  36. arXiv:1908.09219  [pdf, other

    cs.LG stat.ML

    Heterogeneous Relational Kernel Learning

    Authors: Andre T. Nguyen, Edward Raff

    Abstract: Recent work has developed Bayesian methods for the automatic statistical analysis and description of single time series as well as of homogeneous sets of time series data. We extend prior work to create an interpretable kernel embedding for heterogeneous time series. Our method adds practically no computational cost compared to prior results by leveraging previously discarded intermediate results.… ▽ More

    Submitted 24 August, 2019; originally announced August 2019.

    Comments: MileTS '19: 5th KDD Workshop on Mining and Learning from Time Series

  37. arXiv:1907.07732  [pdf, other

    cs.CR cs.LG

    Connecting Lyapunov Control Theory to Adversarial Attacks

    Authors: Arash Rahnama, Andre T. Nguyen, Edward Raff

    Abstract: Significant work is being done to develop the math and tools necessary to build provable defenses, or at least bounds, against adversarial attacks of neural networks. In this work, we argue that tools from control theory could be leveraged to aid in defending against such attacks. We do this by example, building a provable defense against a weaker adversary. This is done so we can focus on the mec… ▽ More

    Submitted 17 July, 2019; originally announced July 2019.

    Comments: 8 pages, 3 figures, AdvML'19: Workshop on Adversarial Learning Methods for Machine Learning and Data Mining at KDD

  38. arXiv:1904.04710  [pdf

    cs.CR

    Secure Biometric-based Remote Authentication Protocol using Chebyshev Polynomials and Fuzzy Extractor

    Authors: Thi Ai Thao Nguyen, Tran Khanh Dang, Quynh Chi Truong, Dinh Thanh Nguyen

    Abstract: In this paper, we have proposed a multi factor biometric-based remote authentication protocol. Our proposal overcomes the vulnerabilities of some previous works. At the same time, the protocol also obtains a low false accept rate (FAR) and false reject rate (FRR).

    Submitted 9 April, 2019; originally announced April 2019.

    Comments: RCCIE17

  39. arXiv:1904.00264  [pdf

    cs.CR

    A New Biometric Template Protection using Random Orthonormal Projection and Fuzzy Commitment

    Authors: Thi Ai Thao Nguyen, Tran Khanh Dang, Dinh Thanh Nguyen

    Abstract: Biometric template protection is one of most essential parts in putting a biometric-based authentication system into practice. There have been many researches proposing different solutions to secure biometric templates of users. They can be categorized into two approaches: feature transformation and biometric cryptosystem. However, no one single template protection approach can satisfy all the req… ▽ More

    Submitted 30 March, 2019; originally announced April 2019.

    Comments: 11 pages, 6 figures, accepted for IMCOM 2019

  40. arXiv:1812.02885  [pdf

    cs.LG cs.CR stat.ML

    Adversarial Attacks, Regression, and Numerical Stability Regularization

    Authors: Andre T. Nguyen, Edward Raff

    Abstract: Adversarial attacks against neural networks in a regression setting are a critical yet understudied problem. In this work, we advance the state of the art by investigating adversarial attacks against regression networks and by formulating a more effective defense against these attacks. In particular, we take the perspective that adversarial attacks are likely caused by numerical instability in lea… ▽ More

    Submitted 6 December, 2018; originally announced December 2018.

    Comments: Presented at the AAAI 2019 Workshop on Engineering Dependable and Secure Machine Learning Systems

  41. arXiv:1810.07834  [pdf, other

    cs.IT

    Superimposed Frame Synchronization Optimization for Finite Blocklength Regime

    Authors: Alex The Phuong Nguyen, Raphaël Le Bidan, Frédéric Guilloud

    Abstract: Considering a short frame length, which is typical in Ultra-Reliable Low-Latency and massive Machine Type Communications, a trade-off exists between improving the performance of frame synchronization (FS) and improving the performance of information throughput. In this paper, we consider the case of continuous transmission over AWGN channels where the synchronization sequence is superimposed to th… ▽ More

    Submitted 9 March, 2019; v1 submitted 17 October, 2018; originally announced October 2018.

    Comments: to appear at 2019 WCNC Workshop on Mathematical Tools and technologies for IoT and mMTC Networks Modeling (MoTION)

  42. arXiv:1805.04168  [pdf, other

    cs.IT cs.ET eess.SP

    Achieving Super-Resolution with Redundant Sensing

    Authors: Diu Khue Luu, Anh Tuan Nguyen, Zhi Yang

    Abstract: Analog-to-digital (quantization) and digital-to-analog (de-quantization) conversion are fundamental operations of many information processing systems. In practice, the precision of these operations is always bounded, first by the random mismatch error (ME) occurred during system implementation, and subsequently by the intrinsic quantization error (QE) determined by the system architecture itself.… ▽ More

    Submitted 10 May, 2018; originally announced May 2018.

    Journal ref: IEEE Transactions on Biomedical Engineering (2018)

  43. arXiv:1802.05686  [pdf, other

    cs.NE cs.IT eess.SP

    A Bio-inspired Redundant Sensing Architecture

    Authors: Anh Tuan Nguyen, Jian Xu, Zhi Yang

    Abstract: Sensing is the process of deriving signals from the environment that allows artificial systems to interact with the physical world. The Shannon theorem specifies the maximum rate at which information can be acquired. However, this upper bound is hard to achieve in many man-made systems. The biological visual systems, on the other hand, have highly efficient signal representation and processing mec… ▽ More

    Submitted 15 February, 2018; originally announced February 2018.

    Journal ref: (2016) A Bio-inspired Redundant Sensing Architecture. Advances in Neural Information Processing Systems (NIPS), Dec. 2016

  44. Advancing System Performance with Redundancy: From Biological to Artificial Designs

    Authors: Anh Tuan Nguyen, Jian Xu, Diu Khue Luu, Qi Zhao, Zhi Yang

    Abstract: Redundancy is a fundamental characteristic of many biological processes such as those in the genetic, visual, muscular and nervous system; yet its function has not been fully understood. The conventional interpretation of redundancy is that it serves as a fault-tolerance mechanism, which leads to redundancy's de facto application in man-made systems for reliability enhancement. On the contrary, ou… ▽ More

    Submitted 14 February, 2018; originally announced February 2018.

    Journal ref: Neural Computation, MIT Press, 2019

  45. arXiv:1611.06792  [pdf, other

    cs.IR

    Neural Information Retrieval: A Literature Review

    Authors: Ye Zhang, Md Mustafizur Rahman, Alex Braylan, Brandon Dang, Heng-Lu Chang, Henna Kim, Quinten McNamara, Aaron Angert, Edward Banner, Vivek Khetan, Tyler McDonnell, An Thanh Nguyen, Dan Xu, Byron C. Wallace, Matthew Lease

    Abstract: A recent "third wave" of Neural Network (NN) approaches now delivers state-of-the-art performance in many machine learning tasks, spanning speech recognition, computer vision, and natural language processing. Because these modern NNs often comprise multiple interconnected layers, this new NN research is often referred to as deep learning. Stemming from this tide of NN work, a number of researchers… ▽ More

    Submitted 3 March, 2017; v1 submitted 17 November, 2016; originally announced November 2016.

    Comments: 44 pages