Skip to main content

Showing 1–46 of 46 results for author: Tran, S

  1. arXiv:2407.09073  [pdf, other

    cs.CV

    Open Vocabulary Multi-Label Video Classification

    Authors: Rohit Gupta, Mamshad Nayeem Rizve, Jayakrishnan Unnikrishnan, Ashish Tawari, Son Tran, Mubarak Shah, Benjamin Yao, Trishul Chilimbi

    Abstract: Pre-trained vision-language models (VLMs) have enabled significant progress in open vocabulary computer vision tasks such as image classification, object detection and image segmentation. Some recent works have focused on extending VLMs to open vocabulary single label action classification in videos. However, previous methods fall short in holistic video understanding which requires the ability to… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: Accepted at ECCV 2024

  2. arXiv:2403.15882  [pdf, other

    cs.CL

    VLUE: A New Benchmark and Multi-task Knowledge Transfer Learning for Vietnamese Natural Language Understanding

    Authors: Phong Nguyen-Thuan Do, Son Quoc Tran, Phu Gia Hoang, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen

    Abstract: The success of Natural Language Understanding (NLU) benchmarks in various languages, such as GLUE for English, CLUE for Chinese, KLUE for Korean, and IndoNLU for Indonesian, has facilitated the evaluation of new NLU models across a wide range of tasks. To establish a standardized set of benchmarks for Vietnamese NLU, we introduce the first Vietnamese Language Understanding Evaluation (VLUE) benchm… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

    Comments: Accepted at NAACL 2024 (Findings)

  3. arXiv:2403.14870  [pdf, other

    cs.CV cs.CL cs.LG

    VidLA: Video-Language Alignment at Scale

    Authors: Mamshad Nayeem Rizve, Fan Fei, Jayakrishnan Unnikrishnan, Son Tran, Benjamin Z. Yao, Belinda Zeng, Mubarak Shah, Trishul Chilimbi

    Abstract: In this paper, we propose VidLA, an approach for video-language alignment at scale. There are two major limitations of previous video-language alignment approaches. First, they do not capture both short-range and long-range temporal dependencies and typically employ complex hierarchical deep network architectures that are hard to integrate with existing pretrained image-text foundation models. To… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024

  4. arXiv:2310.16273  [pdf, other

    cs.CV

    Deep Learning for Plant Identification and Disease Classification from Leaf Images: Multi-prediction Approaches

    Authors: Jianping Yao, Son N. Tran, Saurabh Garg, Samantha Sawyer

    Abstract: Deep learning plays an important role in modern agriculture, especially in plant pathology using leaf images where convolutional neural networks (CNN) are attracting a lot of attention. While numerous reviews have explored the applications of deep learning within this research domain, there remains a notable absence of an empirical study to offer insightful comparisons due to the employment of var… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Jianping and Son are joint first authors (equal contribution)

  5. Machine Learning for Leaf Disease Classification: Data, Techniques and Applications

    Authors: Jianping Yao, Son N. Tran, Samantha Sawyer, Saurabh Garg

    Abstract: The growing demand for sustainable development brings a series of information technologies to help agriculture production. Especially, the emergence of machine learning applications, a branch of artificial intelligence, has shown multiple breakthroughs which can enhance and revolutionize plant pathology approaches. In recent years, machine learning has been adopted for leaf disease classification… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Journal ref: Artificial Intelligence Review 2023

  6. arXiv:2309.05103  [pdf, other

    cs.CL cs.AI

    AGent: A Novel Pipeline for Automatically Creating Unanswerable Questions

    Authors: Son Quoc Tran, Gia-Huy Do, Phong Nguyen-Thuan Do, Matt Kretchmar, Xinya Du

    Abstract: The development of large high-quality datasets and high-performing models have led to significant advancements in the domain of Extractive Question Answering (EQA). This progress has sparked considerable interest in exploring unanswerable questions within the EQA domain. Training EQA models with unanswerable questions helps them avoid extracting misleading or incorrect answers for queries that lac… ▽ More

    Submitted 10 September, 2023; originally announced September 2023.

    Comments: 16 pages, 10 tables, 3 figures

  7. arXiv:2309.01078  [pdf, other

    cs.CV cs.AI

    UnsMOT: Unified Framework for Unsupervised Multi-Object Tracking with Geometric Topology Guidance

    Authors: Son Tran, Cong Tran, Anh Tran, Cuong Pham

    Abstract: Object detection has long been a topic of high interest in computer vision literature. Motivated by the fact that annotating data for the multi-object tracking (MOT) problem is immensely expensive, recent studies have turned their attention to the unsupervised learning setting. In this paper, we push forward the state-of-the-art performance of unsupervised MOT methods by proposing UnsMOT, a novel… ▽ More

    Submitted 3 September, 2023; originally announced September 2023.

  8. arXiv:2308.16688  [pdf

    cs.CL cs.AI

    Using Large Language Models to Automate Category and Trend Analysis of Scientific Articles: An Application in Ophthalmology

    Authors: Hina Raja, Asim Munawar, Mohammad Delsoz, Mohammad Elahi, Yeganeh Madadi, Amr Hassan, Hashem Abu Serhan, Onur Inam, Luis Hermandez, Sang Tran, Wuqas Munir, Alaa Abd-Alrazaq, Hao Chen, SiamakYousefi

    Abstract: Purpose: In this paper, we present an automated method for article classification, leveraging the power of Large Language Models (LLM). The primary focus is on the field of ophthalmology, but the model is extendable to other fields. Methods: We have developed a model based on Natural Language Processing (NLP) techniques, including advanced LLMs, to process and analyze the textual content of scient… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

  9. arXiv:2308.04566  [pdf, other

    cs.CL

    Single-Sentence Reader: A Novel Approach for Addressing Answer Position Bias

    Authors: Son Quoc Tran, Matt Kretchmar

    Abstract: Machine Reading Comprehension (MRC) models tend to take advantage of spurious correlations (also known as dataset bias or annotation artifacts in the research community). Consequently, these models may perform the MRC task without fully comprehending the given context and question, which is undesirable since it may result in low robustness against distribution shift. The main focus of this paper i… ▽ More

    Submitted 6 September, 2023; v1 submitted 8 August, 2023; originally announced August 2023.

    Comments: 10 pages, 5 tables, 2 figures

  10. arXiv:2308.00521  [pdf, other

    cs.AI cs.SI econ.GN

    SurveyLM: A platform to explore emerging value perspectives in augmented language models' behaviors

    Authors: Steve J. Bickley, Ho Fai Chan, Bang Dao, Benno Torgler, Son Tran

    Abstract: This white paper presents our work on SurveyLM, a platform for analyzing augmented language models' (ALMs) emergent alignment behaviors through their dynamically evolving attitude and value perspectives in complex social contexts. Social Artificial Intelligence (AI) systems, like ALMs, often function within nuanced social scenarios where there is no singular correct response, or where an answer is… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: 8 pages, 1 figure

    Report number: Panalogy Lab Technical Report 2023-001

  11. arXiv:2303.13355  [pdf, other

    cs.CL cs.AI

    Revealing Weaknesses of Vietnamese Language Models Through Unanswerable Questions in Machine Reading Comprehension

    Authors: Son Quoc Tran, Phong Nguyen-Thuan Do, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen

    Abstract: Although the curse of multilinguality significantly restricts the language abilities of multilingual models in monolingual settings, researchers now still have to rely on multilingual models to develop state-of-the-art systems in Vietnamese Machine Reading Comprehension. This difficulty in researching is because of the limited number of high-quality works in developing Vietnamese language models.… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

    Comments: Accepted at The 2023 EACL Student Research Workshop

  12. arXiv:2303.05952  [pdf, other

    cs.LG cs.AI cs.CV

    Understanding and Constructing Latent Modality Structures in Multi-modal Representation Learning

    Authors: Qian Jiang, Changyou Chen, Han Zhao, Liqun Chen, Qing Ping, Son Dinh Tran, Yi Xu, Belinda Zeng, Trishul Chilimbi

    Abstract: Contrastive loss has been increasingly used in learning representations from multiple modalities. In the limit, the nature of the contrastive loss encourages modalities to exactly match each other in the latent space. Yet it remains an open question how the modality alignment affects the downstream task performance. In this paper, based on an information-theoretic argument, we first prove that exa… ▽ More

    Submitted 10 March, 2023; originally announced March 2023.

    Comments: 14 pages, 8 figure, CVPR 2023 accepted

  13. arXiv:2302.08505  [pdf, other

    cs.CV cs.AI

    Rapid-Motion-Track: Markerless Tracking of Fast Human Motion with Deeper Learning

    Authors: Renjie Li, Chun Yu Lao, Rebecca St. George, Katherine Lawler, Saurabh Garg, Son N. Tran, Quan Bai, Jane Alty

    Abstract: Objective The coordination of human movement directly reflects function of the central nervous system. Small deficits in movement are often the first sign of an underlying neurological problem. The objective of this research is to develop a new end-to-end, deep learning-based system, Rapid-Motion-Track (RMT) that can track the fastest human movement accurately when webcams or laptop cameras are us… ▽ More

    Submitted 18 January, 2023; originally announced February 2023.

  14. arXiv:2302.00094  [pdf, other

    cs.AI

    The Impacts of Unanswerable Questions on the Robustness of Machine Reading Comprehension Models

    Authors: Son Quoc Tran, Phong Nguyen-Thuan Do, Uyen Le, Matt Kretchmar

    Abstract: Pretrained language models have achieved super-human performances on many Machine Reading Comprehension (MRC) benchmarks. Nevertheless, their relative inability to defend against adversarial attacks has spurred skepticism about their natural language understanding. In this paper, we ask whether training with unanswerable questions in SQuAD 2.0 can help improve the robustness of MRC models against… ▽ More

    Submitted 31 January, 2023; originally announced February 2023.

    Comments: Accepted atThe 17th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2023)

  15. arXiv:2211.09888  [pdf, other

    eess.IV cs.CV math.OC

    Bayesian Optimization of 2D Echocardiography Segmentation

    Authors: Son-Tung Tran, Joshua V. Stough, Xiaoyan Zhang, Christopher M. Haggerty

    Abstract: Bayesian Optimization (BO) is a well-studied hyperparameter tuning technique that is more efficient than grid search for high-cost, high-parameter machine learning problems. Echocardiography is a ubiquitous modality for evaluating heart structure and function in cardiology. In this work, we use BO to optimize the architectural and training-related hyperparameters of a previously published deep ful… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

    Journal ref: 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI), 2021, pp. 1007-1011

  16. arXiv:2207.02376  [pdf, other

    cs.CV cs.AI

    A Comprehensive Review on Deep Supervision: Theories and Applications

    Authors: Renjie Li, Xinyi Wang, Guan Huang, Wenli Yang, Kaining Zhang, Xiaotong Gu, Son N. Tran, Saurabh Garg, Jane Alty, Quan Bai

    Abstract: Deep supervision, or known as 'intermediate supervision' or 'auxiliary supervision', is to add supervision at hidden layers of a neural network. This technique has been increasingly applied in deep neural network learning systems for various computer vision applications recently. There is a consensus that deep supervision helps improve neural network performance by alleviating the gradient vanishi… ▽ More

    Submitted 5 July, 2022; originally announced July 2022.

  17. VLSP 2021 - ViMRC Challenge: Vietnamese Machine Reading Comprehension

    Authors: Kiet Van Nguyen, Son Quoc Tran, Luan Thanh Nguyen, Tin Van Huynh, Son T. Luu, Ngan Luu-Thuy Nguyen

    Abstract: One of the emerging research trends in natural language understanding is machine reading comprehension (MRC) which is the task to find answers to human questions based on textual data. Existing Vietnamese datasets for MRC research concentrate solely on answerable questions. However, in reality, questions can be unanswerable for which the correct answer is not stated in the given textual data. To a… ▽ More

    Submitted 4 April, 2022; v1 submitted 21 March, 2022; originally announced March 2022.

    Comments: The 8th International Workshop on Vietnamese Language and Speech Processing (VLSP 2021)

  18. arXiv:2203.10609  [pdf, other

    cs.CV

    A Novel Transparency Strategy-based Data Augmentation Approach for BI-RADS Classification of Mammograms

    Authors: Sam B. Tran, Huyen T. X. Nguyen, Chi Phan, Hieu H. Pham, Ha Q. Nguyen

    Abstract: Image augmentation techniques have been widely investigated to improve the performance of deep learning (DL) algorithms on mammography classification tasks. Recent methods have proved the efficiency of image augmentation on data deficiency or data imbalance issues. In this paper, we propose a novel transparency strategy to boost the Breast Imaging Reporting and Data System (BI-RADS) scores of mamm… ▽ More

    Submitted 17 April, 2023; v1 submitted 20 March, 2022; originally announced March 2022.

    Comments: Accepted for presentation at the 22nd IEEE Statistical Signal Processing (SSP) workshop

  19. arXiv:2203.00048  [pdf, other

    cs.CV cs.AI

    Multi-modal Alignment using Representation Codebook

    Authors: Jiali Duan, Liqun Chen, Son Tran, Jinyu Yang, Yi Xu, Belinda Zeng, Trishul Chilimbi

    Abstract: Aligning signals from different modalities is an important step in vision-language representation learning as it affects the performance of later stages such as cross-modality fusion. Since image and text typically reside in different regions of the feature space, directly aligning them at instance level is challenging especially when features are still evolving during training. In this paper, we… ▽ More

    Submitted 27 March, 2022; v1 submitted 28 February, 2022; originally announced March 2022.

    Comments: Accepted by CVPR 2022

  20. arXiv:2202.10401  [pdf, other

    cs.CV

    Vision-Language Pre-Training with Triple Contrastive Learning

    Authors: Jinyu Yang, Jiali Duan, Son Tran, Yi Xu, Sampath Chanda, Liqun Chen, Belinda Zeng, Trishul Chilimbi, Junzhou Huang

    Abstract: Vision-language representation learning largely benefits from image-text alignment through contrastive losses (e.g., InfoNCE loss). The success of this alignment strategy is attributed to its capability in maximizing the mutual information (MI) between an image and its matched text. However, simply performing cross-modal alignment (CMA) ignores data potential within each modality, which may result… ▽ More

    Submitted 28 March, 2022; v1 submitted 21 February, 2022; originally announced February 2022.

    Comments: CVPR 2022; code: https://github.com/uta-smile/TCL

  21. A Neural Network Solves, Explains, and Generates University Math Problems by Program Synthesis and Few-Shot Learning at Human Level

    Authors: Iddo Drori, Sarah Zhang, Reece Shuttleworth, Leonard Tang, Albert Lu, Elizabeth Ke, Kevin Liu, Linda Chen, Sunny Tran, Newman Cheng, Roman Wang, Nikhil Singh, Taylor L. Patti, Jayson Lynch, Avi Shporer, Nakul Verma, Eugene Wu, Gilbert Strang

    Abstract: We demonstrate that a neural network pre-trained on text and fine-tuned on code solves mathematics course problems, explains solutions, and generates new questions at a human level. We automatically synthesize programs using few-shot learning and OpenAI's Codex transformer and execute them to solve course problems at 81% automatic accuracy. We curate a new dataset of questions from MIT's largest m… ▽ More

    Submitted 30 May, 2022; v1 submitted 31 December, 2021; originally announced December 2021.

    Comments: 181 pages, 8 figures, 280 tables

  22. arXiv:2112.10275  [pdf, other

    cs.CV cs.AI

    Parallel Multi-Scale Networks with Deep Supervision for Hand Keypoint Detection

    Authors: Renjie Li, Son Tran, Saurabh Garg, Katherine Lawler, Jane Alty, Quan Bai

    Abstract: Keypoint detection plays an important role in a wide range of applications. However, predicting keypoints of small objects such as human hands is a challenging problem. Recent works fuse feature maps of deep Convolutional Neural Networks (CNNs), either via multi-level feature integration or multi-resolution aggregation. Despite achieving some success, the feature fusion approaches increase the com… ▽ More

    Submitted 19 December, 2021; originally announced December 2021.

  23. arXiv:2112.05841  [pdf, other

    cs.AI cs.LG cs.LO

    Logical Boltzmann Machines

    Authors: Son N. Tran, Artur d'Avila Garcez

    Abstract: The idea of representing symbolic knowledge in connectionist systems has been a long-standing endeavour which has attracted much attention recently with the objective of combining machine learning and scalable sound reasoning. Early work has shown a correspondence between propositional logic and symmetrical neural networks which nevertheless did not scale well with the number of variables and whos… ▽ More

    Submitted 10 December, 2021; originally announced December 2021.

    Comments: 15 pages, 5 figures, 2 tables

    MSC Class: 68T01 ACM Class: I.2.4; I.2.6; I.2.11

  24. arXiv:2112.04490  [pdf, other

    eess.IV cs.CV

    A novel multi-view deep learning approach for BI-RADS and density assessment of mammograms

    Authors: Huyen T. X. Nguyen, Sam B. Tran, Dung B. Nguyen, Hieu H. Pham, Ha Q. Nguyen

    Abstract: Advanced deep learning (DL) algorithms may predict the patient's risk of developing breast cancer based on the Breast Imaging Reporting and Data System (BI-RADS) and density standards. Recent studies have suggested that the combination of multi-view analysis improved the overall breast exam classification. In this paper, we propose a novel multi-view DL approach for BI-RADS and density assessment… ▽ More

    Submitted 17 April, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

    Comments: This paper has been accepted by the 44th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (2022 IEEE EMBC)

  25. Hand gesture detection in tests performed by older adults

    Authors: Guan Huang, Son N. Tran, Quan Bai, Jane Alty

    Abstract: Our team are developing a new online test that analyses hand movement features associated with ageing that can be completed remotely from the research centre. To obtain hand movement features, participants will be asked to perform a variety of hand gestures using their own computer cameras. However, it is challenging to collect high quality hand movement video data, especially for older participan… ▽ More

    Submitted 28 October, 2021; v1 submitted 27 October, 2021; originally announced October 2021.

    Journal ref: Neural Comput & Applic (2022)

  26. arXiv:2110.07554  [pdf, other

    cs.LG cs.AI cs.SE

    Looper: An end-to-end ML platform for product decisions

    Authors: Igor L. Markov, Hanson Wang, Nitya Kasturi, Shaun Singh, Sze Wai Yuen, Mia Garrard, Sarah Tran, Yin Huang, Zehui Wang, Igor Glotov, Tanvi Gupta, Boshuang Huang, Peng Chen, Xiaowen Xie, Michael Belkin, Sal Uryasev, Sam Howie, Eytan Bakshy, Norm Zhou

    Abstract: Modern software systems and products increasingly rely on machine learning models to make data-driven decisions based on interactions with users, infrastructure and other systems. For broader adoption, this practice must (i) accommodate product engineers without ML backgrounds, (ii) support finegrain product-metric evaluation and (iii) optimize for product goals. To address shortcomings of prior p… ▽ More

    Submitted 21 June, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

    Comments: 11 pages + references, 7 figures; to appear in KDD 2022

  27. arXiv:2108.01229  [pdf, other

    q-bio.NC cs.AI math.CT physics.bio-ph

    Taking Cognition Seriously: A generalised physics of cognition

    Authors: Sophie Alyx Taylor, Son Cao Tran, Dan V. Nicolau Jr

    Abstract: The study of complex systems through the lens of category theory consistently proves to be a powerful approach. We propose that cognition deserves the same category-theoretic treatment. We show that by considering a highly-compact cognitive system, there are fundamental physical trade-offs resulting in a utility problem. We then examine how to do this systematically, and propose some requirements… ▽ More

    Submitted 2 August, 2021; originally announced August 2021.

  28. arXiv:2107.05684  [pdf, other

    cs.CL cs.IR

    Accenture at CheckThat! 2021: Interesting claim identification and ranking with contextually sensitive lexical training data augmentation

    Authors: Evan Williams, Paul Rodrigues, Sieu Tran

    Abstract: This paper discusses the approach used by the Accenture Team for CLEF2021 CheckThat! Lab, Task 1, to identify whether a claim made in social media would be interesting to a wide audience and should be fact-checked. Twitter training and test data were provided in English, Arabic, Spanish, Turkish, and Bulgarian. Claims were to be classified (check-worthy/not check-worthy) and ranked in priority ord… ▽ More

    Submitted 12 July, 2021; originally announced July 2021.

    Comments: To Appear As: Evan Williams, Paul Rodrigues, Sieu Tran. Accenture at CheckThat! 2021: Interesting claim identification and ranking with contextually sensitive lexical training data augmentation. In: Faggioli et al. Working Notes of CLEF 2021-Conference and Labs of the Evaluation Forum. Bucharest, Romania. 21-24 September 2021

  29. arXiv:2107.01238  [pdf, other

    cs.LG

    Solving Machine Learning Problems

    Authors: Sunny Tran, Pranav Krishna, Ishan Pakuwal, Prabhakar Kafle, Nikhil Singh, Jayson Lynch, Iddo Drori

    Abstract: Can a machine learn Machine Learning? This work trains a machine learning model to solve machine learning problems from a University undergraduate level course. We generate a new training set of questions and answers consisting of course exercises, homework, and quiz questions from MIT's 6.036 Introduction to Machine Learning course and train a machine learning model to answer these questions. Our… ▽ More

    Submitted 2 July, 2021; originally announced July 2021.

    Comments: 38 pages, 29 figures

  30. arXiv:2105.04356  [pdf, other

    eess.IV cs.CV cs.LG

    Coconut trees detection and segmentation in aerial imagery using mask region-based convolution neural network

    Authors: Muhammad Shakaib Iqbal, Hazrat Ali, Son N. Tran, Talha Iqbal

    Abstract: Food resources face severe damages under extraordinary situations of catastrophes such as earthquakes, cyclones, and tsunamis. Under such scenarios, speedy assessment of food resources from agricultural land is critical as it supports aid activity in the disaster hit areas. In this article, a deep learning approach is presented for the detection and segmentation of coconut tress in aerial imagery… ▽ More

    Submitted 10 May, 2021; originally announced May 2021.

    Comments: Published in IET Computer Vision, 09 April 2021

  31. arXiv:2011.10269  [pdf, other

    cs.CV

    SLADE: A Self-Training Framework For Distance Metric Learning

    Authors: Jiali Duan, Yen-Liang Lin, Son Tran, Larry S. Davis, C. -C. Jay Kuo

    Abstract: Most existing distance metric learning approaches use fully labeled data to learn the sample similarities in an embedding space. We present a self-training framework, SLADE, to improve retrieval performance by leveraging additional unlabeled data. We first train a teacher model on the labeled data and use it to generate pseudo labels for the unlabeled data. We then train a student model on both la… ▽ More

    Submitted 29 March, 2021; v1 submitted 20 November, 2020; originally announced November 2020.

    Comments: Accepted by CVPR 2021

  32. arXiv:2009.11485   

    cs.AI cs.CL

    CogniFNN: A Fuzzy Neural Network Framework for Cognitive Word Embedding Evaluation

    Authors: Xinping Liu, Zehong Cao, Son Tran

    Abstract: Word embeddings can reflect the semantic representations, and the embedding qualities can be comprehensively evaluated with human natural reading-related cognitive data sources. In this paper, we proposed the CogniFNN framework, which is the first attempt at using fuzzy neural networks to extract non-linear and non-stationary characteristics for evaluations of English word embeddings against the c… ▽ More

    Submitted 29 July, 2021; v1 submitted 24 September, 2020; originally announced September 2020.

    Comments: The method and results need to be further investigated

  33. arXiv:2004.13236  [pdf, other

    cs.CV

    Deep Auto-Encoders with Sequential Learning for Multimodal Dimensional Emotion Recognition

    Authors: Dung Nguyen, Duc Thanh Nguyen, Rui Zeng, Thanh Thi Nguyen, Son N. Tran, Thin Nguyen, Sridha Sridharan, Clinton Fookes

    Abstract: Multimodal dimensional emotion recognition has drawn a great attention from the affective computing community and numerous schemes have been extensively investigated, making a significant progress in this area. However, several questions still remain unanswered for most of existing approaches including: (i) how to simultaneously learn compact yet representative features from multimodal data, (ii)… ▽ More

    Submitted 27 April, 2020; originally announced April 2020.

    Comments: Under Review on Transaction on Multimedia

  34. arXiv:2003.11136  [pdf, other

    cs.CV

    Joint Deep Cross-Domain Transfer Learning for Emotion Recognition

    Authors: Dung Nguyen, Sridha Sridharan, Duc Thanh Nguyen, Simon Denman, Son N. Tran, Rui Zeng, Clinton Fookes

    Abstract: Deep learning has been applied to achieve significant progress in emotion recognition. Despite such substantial progress, existing approaches are still hindered by insufficient training data, and the resulting models do not generalize well under mismatched conditions. To address this challenge, we propose a learning strategy which jointly transfers the knowledge learned from rich datasets to sourc… ▽ More

    Submitted 24 March, 2020; originally announced March 2020.

  35. arXiv:1912.08967  [pdf, other

    cs.CV

    Fashion Outfit Complementary Item Retrieval

    Authors: Yen-Liang Lin, Son Tran, Larry S. Davis

    Abstract: Complementary fashion item recommendation is critical for fashion outfit completion. Existing methods mainly focus on outfit compatibility prediction but not in a retrieval setting. We propose a new framework for outfit complementary item retrieval. Specifically, a category-based subspace attention network is presented, which is a scalable approach for learning the subspace attentions. In addition… ▽ More

    Submitted 4 March, 2020; v1 submitted 18 December, 2019; originally announced December 2019.

    Comments: Accepted by CVPR 2020

  36. arXiv:1907.02244  [pdf, other

    cs.CV eess.IV

    Searching for Apparel Products from Images in the Wild

    Authors: Son Tran, Ming Du, Sampath Chanda, R. Manmatha, Cj Taylor

    Abstract: In this age of social media, people often look at what others are wearing. In particular, Instagram and Twitter influencers often provide images of themselves wearing different outfits and their followers are often inspired to buy similar clothes.We propose a system to automatically find the closest visually similar clothes in the online Catalog (street-to-shop searching). The problem is challengi… ▽ More

    Submitted 7 April, 2022; v1 submitted 4 July, 2019; originally announced July 2019.

    Comments: KDD2019, AI for Fashion Workshop

  37. arXiv:1905.06088  [pdf, other

    cs.AI

    Neural-Symbolic Computing: An Effective Methodology for Principled Integration of Machine Learning and Reasoning

    Authors: Artur d'Avila Garcez, Marco Gori, Luis C. Lamb, Luciano Serafini, Michael Spranger, Son N. Tran

    Abstract: Current advances in Artificial Intelligence and machine learning in general, and deep learning in particular have reached unprecedented impact not only across research communities, but also over popular media channels. However, concerns about interpretability and accountability of AI have been raised by influential thinkers. In spite of the recent impact of AI, several works have identified the ne… ▽ More

    Submitted 15 May, 2019; originally announced May 2019.

  38. arXiv:1903.10453  [pdf, other

    cs.CL cs.CR

    dpUGC: Learn Differentially Private Representation for User Generated Contents

    Authors: Xuan-Son Vu, Son N. Tran, Lili Jiang

    Abstract: This paper firstly proposes a simple yet efficient generalized approach to apply differential privacy to text representation (i.e., word embedding). Based on it, we propose a user-level approach to learn personalized differentially private word embedding model on user generated contents (UGC). To our best knowledge, this is the first work of learning user-level differentially private word embeddin… ▽ More

    Submitted 25 March, 2019; originally announced March 2019.

    Journal ref: Proceedings of the 20th International Conference on Computational Linguistics and Intelligent Text Processing, La Rochelle, France, 2019

  39. arXiv:1903.04433  [pdf, other

    cs.CL

    ETNLP: a visual-aided systematic approach to select pre-trained embeddings for a downstream task

    Authors: Xuan-Son Vu, Thanh Vu, Son N. Tran, Lili Jiang

    Abstract: Given many recent advanced embedding models, selecting pre-trained word embedding (a.k.a., word representation) models best fit for a specific downstream task is non-trivial. In this paper, we propose a systematic approach, called ETNLP, for extracting, evaluating, and visualizing multiple sets of pre-trained word embeddings to determine which embeddings should be used in a downstream task. For ex… ▽ More

    Submitted 3 August, 2019; v1 submitted 11 March, 2019; originally announced March 2019.

    Comments: 10 pages

    Journal ref: Proceedings of the International Conference Recent Advances in Natural Language Processing (RANLP), 2019

  40. arXiv:1806.06611  [pdf, other

    cs.CV

    On Multi-resident Activity Recognition in Ambient Smart-Homes

    Authors: Son N. Tran, Qing Zhang, Mohan Karunanithi

    Abstract: Increasing attention to the research on activity monitoring in smart homes has motivated the employment of ambient intelligence to reduce the deployment cost and solve the privacy issue. Several approaches have been proposed for multi-resident activity recognition, however, there still lacks a comprehensive benchmark for future research and practical selection of models. In this paper we study dif… ▽ More

    Submitted 18 June, 2018; originally announced June 2018.

  41. arXiv:1710.02245  [pdf, other

    cs.LG stat.ML

    Linear-Time Sequence Classification using Restricted Boltzmann Machines

    Authors: Son N. Tran, Srikanth Cherla, Artur Garcez, Tillman Weyde

    Abstract: Classification of sequence data is the topic of interest for dynamic Bayesian models and Recurrent Neural Networks (RNNs). While the former can explicitly model the temporal dependencies between class variables, the latter have a capability of learning representations. Several attempts have been made to improve performance by combining these two approaches or increasing the processing capability o… ▽ More

    Submitted 8 March, 2018; v1 submitted 5 October, 2017; originally announced October 2017.

  42. arXiv:1706.01991  [pdf, other

    cs.AI

    Unsupervised Neural-Symbolic Integration

    Authors: Son N. Tran

    Abstract: Symbolic has been long considered as a language of human intelligence while neural networks have advantages of robust computation and dealing with noisy data. The integration of neural-symbolic can offer better learning and reasoning while providing a means for interpretability through the representation of symbolic knowledge. Although previous works focus intensively on supervised feedforward neu… ▽ More

    Submitted 22 June, 2017; v1 submitted 6 June, 2017; originally announced June 2017.

  43. arXiv:1705.10899  [pdf, other

    cs.AI

    Propositional Knowledge Representation and Reasoning in Restricted Boltzmann Machines

    Authors: Son N. Tran

    Abstract: While knowledge representation and reasoning are considered the keys for human-level artificial intelligence, connectionist networks have been shown successful in a broad range of applications due to their capacity for robust learning and flexible inference under uncertainty. The idea of representing symbolic knowledge in connectionist networks has been well-received and attracted much attention f… ▽ More

    Submitted 29 May, 2018; v1 submitted 30 May, 2017; originally announced May 2017.

  44. arXiv:1604.01806  [pdf, ps, other

    cs.LG

    Generalising the Discriminative Restricted Boltzmann Machine

    Authors: Srikanth Cherla, Son N Tran, Tillman Weyde, Artur d'Avila Garcez

    Abstract: We present a novel theoretical result that generalises the Discriminative Restricted Boltzmann Machine (DRBM). While originally the DRBM was defined assuming the {0, 1}-Bernoulli distribution in each of its hidden units, this result makes it possible to derive cost functions for variants of the DRBM that utilise other distributions, including some that are often encountered in the literature. This… ▽ More

    Submitted 6 April, 2016; originally announced April 2016.

    Comments: Submitted to ECML 2016 conference track

  45. arXiv:1312.6190  [pdf, other

    cs.LG

    Adaptive Feature Ranking for Unsupervised Transfer Learning

    Authors: Son N. Tran, Artur d'Avila Garcez

    Abstract: Transfer Learning is concerned with the application of knowledge gained from solving a problem to a different but related problem domain. In this paper, we propose a method and efficient algorithm for ranking and selecting representations from a Restricted Boltzmann Machine trained on a source domain to be transferred onto a target domain. Experiments carried out using the MNIST, ICDAR and TiCC im… ▽ More

    Submitted 28 May, 2014; v1 submitted 20 December, 2013; originally announced December 2013.

    Comments: 9 pages 7 figures, new experimental results on ranking and transfer have been added, typo fixed

  46. arXiv:1302.4689  [pdf

    cs.OH

    An Approach to Select Cost-Effective Risk Countermeasures Exemplified in CORAS

    Authors: Le Minh Sang Tran, Bjørnar Solhaug, Ketil Stølen

    Abstract: Risk is unavoidable in business and risk management is needed amongst others to set up good security policies. Once the risks are evaluated, the next step is to decide how they should be treated. This involves managers making decisions on proper countermeasures to be implemented to mitigate the risks. The countermeasure expenditure, together with its ability to mitigate risks, is factors that affe… ▽ More

    Submitted 7 March, 2013; v1 submitted 19 February, 2013; originally announced February 2013.

    Comments: 33 pages