Skip to main content

Showing 1–43 of 43 results for author: Chong, Y

  1. arXiv:2406.07842  [pdf, other

    eess.AS cs.CL

    Dual-Pipeline with Low-Rank Adaptation for New Language Integration in Multilingual ASR

    Authors: Yerbolat Khassanov, Zhipeng Chen, Tianfeng Chen, Tze Yuang Chong, Wei Li, Jun Zhang, Lu Lu, Yuxuan Wang

    Abstract: This paper addresses challenges in integrating new languages into a pre-trained multilingual automatic speech recognition (mASR) system, particularly in scenarios where training data for existing languages is limited or unavailable. The proposed method employs a dual-pipeline with low-rank adaptation (LoRA). It maintains two data flow pipelines-one for existing languages and another for new langua… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 5 pages, 2 figures, 4 tables

  2. arXiv:2406.01191  [pdf, other

    eess.IV cs.CV cs.LG

    S-CycleGAN: Semantic Segmentation Enhanced CT-Ultrasound Image-to-Image Translation for Robotic Ultrasonography

    Authors: Yuhan Song, Nak Young Chong

    Abstract: Ultrasound imaging is pivotal in various medical diagnoses due to its non-invasive nature and safety. In clinical practice, the accuracy and precision of ultrasound image analysis are critical. Recent advancements in deep learning are showing great capacity of processing medical images. However, the data hungry nature of deep learning and the shortage of high-quality ultrasound image training data… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: This paper is submitted to 2024 IEEE International Conference on Cyborg and Bionic Systems, and still under review

  3. arXiv:2404.13097  [pdf, other

    eess.IV cs.CV cs.LG q-bio.QM

    DISC: Latent Diffusion Models with Self-Distillation from Separated Conditions for Prostate Cancer Grading

    Authors: Man M. Ho, Elham Ghelichkhan, Yosep Chong, Yufei Zhou, Beatrice Knudsen, Tolga Tasdizen

    Abstract: Latent Diffusion Models (LDMs) can generate high-fidelity images from noise, offering a promising approach for augmenting histopathology images for training cancer grading models. While previous works successfully generated high-fidelity histopathology images using LDMs, the generation of image tiles to improve prostate cancer grading has not yet been explored. Additionally, LDMs face challenges i… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: Abstract accepted for ISBI 2024. Extended version to be presented at SynData4CV @ CVPR 2024. See more at https://minhmanho.github.io/disc/

  4. arXiv:2404.12650  [pdf, other

    eess.IV cs.CV cs.LG

    F2FLDM: Latent Diffusion Models with Histopathology Pre-Trained Embeddings for Unpaired Frozen Section to FFPE Translation

    Authors: Man M. Ho, Shikha Dubey, Yosep Chong, Beatrice Knudsen, Tolga Tasdizen

    Abstract: The Frozen Section (FS) technique is a rapid and efficient method, taking only 15-30 minutes to prepare slides for pathologists' evaluation during surgery, enabling immediate decisions on further surgical interventions. However, FS process often introduces artifacts and distortions like folds and ice-crystal effects. In contrast, these artifacts and distortions are absent in the higher-quality for… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: Preprint. Our work is available at https://minhmanho.github.io/f2f_ldm/

  5. arXiv:2402.08772  [pdf, other

    cs.AI cs.MA

    Optimal Task Assignment and Path Planning using Conflict-Based Search with Precedence and Temporal Constraints

    Authors: Yu Quan Chong, Jiaoyang Li, Katia Sycara

    Abstract: The Multi-Agent Path Finding (MAPF) problem entails finding collision-free paths for a set of agents, guiding them from their start to goal locations. However, MAPF does not account for several practical task-related constraints. For example, agents may need to perform actions at goal locations with specific execution times, adhering to predetermined orders and timeframes. Moreover, goal assignmen… ▽ More

    Submitted 21 April, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    ACM Class: I.2.11

  6. arXiv:2401.08134  [pdf, other

    cs.RO

    S3M: Semantic Segmentation Sparse Mapping for UAVs with RGB-D Camera

    Authors: Thanh Nguyen Canh, Van-Truong Nguyen, Xiem HoangVan, Armagan Elibol, Nak Young Chong

    Abstract: Unmanned Aerial Vehicles (UAVs) hold immense potential for critical applications, such as search and rescue operations, where accurate perception of indoor environments is paramount. However, the concurrent amalgamation of localization, 3D reconstruction, and semantic segmentation presents a notable hurdle, especially in the context of UAVs equipped with constrained power and computational resourc… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: In The 2024 IEEE/SICE International Symposium on System Integration (SII2024), Ha Long, Vietnam

  7. arXiv:2401.08132  [pdf, other

    cs.RO

    Object-Oriented Semantic Mapping for Reliable UAVs Navigation

    Authors: Thanh Nguyen Canh, Armagan Elibol, Nak Young Chong, Xiem HoangVan

    Abstract: To autonomously navigate in real-world environments, special in search and rescue operations, Unmanned Aerial Vehicles (UAVs) necessitate comprehensive maps to ensure safety. However, the prevalent metric map often lacks semantic information crucial for holistic scene comprehension. In this paper, we proposed a system to construct a probabilistic metric map enriched with object information extract… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: In the 12th International Conference on Control, Automation and Information Sciences (ICCAIS 2023), Hanoi, Vietnam

  8. arXiv:2401.03676  [pdf, other

    cs.SE cs.AI

    Assessing AI Detectors in Identifying AI-Generated Code: Implications for Education

    Authors: Wei Hung Pan, Ming Jie Chok, Jonathan Leong Shan Wong, Yung Xin Shin, Yeong Shian Poon, Zhou Yang, Chun Yong Chong, David Lo, Mei Kuan Lim

    Abstract: Educators are increasingly concerned about the usage of Large Language Models (LLMs) such as ChatGPT in programming education, particularly regarding the potential exploitation of imperfections in Artificial Intelligence Generated Content (AIGC) Detectors for academic misconduct. In this paper, we present an empirical study where the LLM is examined for its attempts to bypass detection by AIGC Det… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: 11 pages, paper accepted at 46th International Conference on Software Engineering, Software Engineering Education and Training Track (ICSE-SEET 2024)

  9. arXiv:2312.08782  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Toward General-Purpose Robots via Foundation Models: A Survey and Meta-Analysis

    Authors: Yafei Hu, Quanting Xie, Vidhi Jain, Jonathan Francis, Jay Patrikar, Nikhil Keetha, Seungchan Kim, Yaqi Xie, Tianyi Zhang, Shibo Zhao, Yu Quan Chong, Chen Wang, Katia Sycara, Matthew Johnson-Roberson, Dhruv Batra, Xiaolong Wang, Sebastian Scherer, Zsolt Kira, Fei Xia, Yonatan Bisk

    Abstract: Building general-purpose robots that can operate seamlessly, in any environment, with any object, and utilizing various skills to complete diverse tasks has been a long-standing goal in Artificial Intelligence. Unfortunately, however, most existing robotic systems have been constrained - having been designed for specific tasks, trained on specific datasets, and deployed within specific environment… ▽ More

    Submitted 15 December, 2023; v1 submitted 14 December, 2023; originally announced December 2023.

  10. arXiv:2312.06978  [pdf, other

    cs.CV

    CLASS-M: Adaptive stain separation-based contrastive learning with pseudo-labeling for histopathological image classification

    Authors: Bodong Zhang, Hamid Manoochehri, Man Minh Ho, Fahimeh Fooladgar, Yosep Chong, Beatrice S. Knudsen, Deepika Sirohi, Tolga Tasdizen

    Abstract: Histopathological image classification is an important task in medical image analysis. Recent approaches generally rely on weakly supervised learning due to the ease of acquiring case-level labels from pathology reports. However, patch-level classification is preferable in applications where only a limited number of cases are available or when local prediction accuracy is critical. On the other ha… ▽ More

    Submitted 4 January, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

  11. arXiv:2311.10502  [pdf, other

    cs.NE

    Fast Estimations of Hitting Time of Elitist Evolutionary Algorithms from Fitness Levels

    Authors: Jun He, Siang Yew Chong, Xin Yao

    Abstract: The fitness level method is an easy-to-use tool for estimating the hitting time of elitist evolutionary algorithms. Recently, linear lower and upper bounds by fitness levels have been constructed. But these bounds require recursive computation, which makes them difficult to use in practice. We address this shortcoming with a new directed graph (digraph) method that does not require recursive compu… ▽ More

    Submitted 16 May, 2024; v1 submitted 17 November, 2023; originally announced November 2023.

  12. Theory of Mind for Multi-Agent Collaboration via Large Language Models

    Authors: Huao Li, Yu Quan Chong, Simon Stepputtis, Joseph Campbell, Dana Hughes, Michael Lewis, Katia Sycara

    Abstract: While Large Language Models (LLMs) have demonstrated impressive accomplishments in both reasoning and planning, their abilities in multi-agent collaborations remains largely unexplored. This study evaluates LLM-based agents in a multi-agent cooperative text game with Theory of Mind (ToM) inference tasks, comparing their performance with Multi-Agent Reinforcement Learning (MARL) and planning-based… ▽ More

    Submitted 26 June, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023 (Main Conference). Code available at https://github.com/romanlee6/multi_LLM_comm

    Journal ref: in Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Page 180-192, ACL

  13. arXiv:2309.00544  [pdf

    cs.RO

    Modular, Multi-Robot Integration of Laboratories: An Autonomous Solid-State Workflow for Powder X-Ray Diffraction

    Authors: Amy. M. Lunt, Hatem Fakhruldeen, Gabriella Pizzuto, Louis Longley, Alexander White, Nicola Rankin, Rob Clowes, Ben Alston, Lucia Gigli, Graeme M. Day, Andrew I. Cooper, Sam. Y. Chong

    Abstract: Automation can transform productivity in research activities that use liquid handling, such as organic synthesis, but it has made less impact in materials laboratories, which require sample preparation steps and a range of solid-state characterization techniques. For example, powder X-ray diffraction (PXRD) is a key method in materials and pharmaceutical chemistry, but its end-to-end automation is… ▽ More

    Submitted 23 November, 2023; v1 submitted 1 September, 2023; originally announced September 2023.

  14. arXiv:2308.15137  [pdf, other

    cs.CV cs.AI

    Abdominal Multi-Organ Segmentation Based on Feature Pyramid Network and Spatial Recurrent Neural Network

    Authors: Yuhan Song, Armagan Elibol, Nak Young Chong

    Abstract: As recent advances in AI are causing the decline of conventional diagnostic methods, the realization of end-to-end diagnosis is fast approaching. Ultrasound image segmentation is an important step in the diagnostic process. An accurate and robust segmentation model accelerates the process and reduces the burden of sonographers. In contrast to previous research, we take two inherent features of ult… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

    Comments: IFAC World Congress 2023 paper

  15. arXiv:2306.04557  [pdf, other

    cs.CV cs.RO

    PhenoBench -- A Large Dataset and Benchmarks for Semantic Image Interpretation in the Agricultural Domain

    Authors: Jan Weyler, Federico Magistri, Elias Marks, Yue Linn Chong, Matteo Sodano, Gianmarco Roggiolani, Nived Chebrolu, Cyrill Stachniss, Jens Behley

    Abstract: The production of food, feed, fiber, and fuel is a key task of agriculture. Especially crop production has to cope with a multitude of challenges in the upcoming decades caused by a growing world population, climate change, the need for sustainable production, lack of skilled workers, and generally the limited availability of arable land. Vision systems could help cope with these challenges by off… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

  16. arXiv:2305.17445  [pdf, other

    cs.SE

    Synthesizing Speech Test Cases with Text-to-Speech? An Empirical Study on the False Alarms in Automated Speech Recognition Testing

    Authors: Julia Kaiwen Lau, Kelvin Kai Wen Kong, Julian Hao Yong, Per Hoong Tan, Zhou Yang, Zi Qian Yong, Joshua Chern Wey Low, Chun Yong Chong, Mei Kuan Lim, David Lo

    Abstract: Recent studies have proposed the use of Text-To-Speech (TTS) systems to automatically synthesise speech test cases on a scale and uncover a large number of failures in ASR systems. However, the failures uncovered by synthetic test cases may not reflect the actual performance of an ASR system when it transcribes human audio, which we refer to as false alarms. Given a failed test case synthesised fr… ▽ More

    Submitted 18 July, 2023; v1 submitted 27 May, 2023; originally announced May 2023.

    Comments: 13 pages, Accepted at ISSTA2023

  17. arXiv:2304.08924  [pdf, other

    cs.CV cs.LG

    Quantum Annealing for Single Image Super-Resolution

    Authors: Han Yao Choong, Suryansh Kumar, Luc Van Gool

    Abstract: This paper proposes a quantum computing-based algorithm to solve the single image super-resolution (SISR) problem. One of the well-known classical approaches for SISR relies on the well-established patch-wise sparse modeling of the problem. Yet, this field's current state of affairs is that deep neural networks (DNNs) have demonstrated far superior results than traditional approaches. Nevertheless… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: Accepted to IEEE/CVF CVPR 2023, NTIRE Challenge and Workshop. Draft info: 10 pages, 6 Figures, 2 Tables

  18. arXiv:2303.06283  [pdf, other

    cs.SE

    Closing the Loop for Software Remodularisation -- REARRANGE: An Effort Estimation Approach for Software Clustering-based Remodularisation

    Authors: Alvin Jian Jia Tan, Chun Yong Chong, Aldeida Aleti

    Abstract: Software remodularization through clustering is a common practice to improve internal software quality. However, the true benefit of software clustering is only realized if developers follow through with the recommended refactoring suggestions, which can be complex and time-consuming. Simply producing clustering results is not enough to realize the benefits of remodularization. For the recommended… ▽ More

    Submitted 10 March, 2023; originally announced March 2023.

    Comments: Accepted for publication at ICSE23 Poster Track

  19. arXiv:2303.04566  [pdf, other

    cs.CV cs.SE

    Robustness Evaluation in Hand Pose Estimation Models using Metamorphic Testing

    Authors: Muxin Pu, Chun Yong Chong, Mei Kuan Lim

    Abstract: Hand pose estimation (HPE) is a task that predicts and describes the hand poses from images or video frames. When HPE models estimate hand poses captured in a laboratory or under controlled environments, they normally deliver good performance. However, the real-world environment is complex, and various uncertainties may happen, which could degrade the performance of HPE models. For example, the ha… ▽ More

    Submitted 8 March, 2023; originally announced March 2023.

    Comments: Accepted at 2023 8th International Workshop on Metamorphic Testing, 8 pages

  20. arXiv:2302.05582  [pdf, other

    eess.AS cs.CL cs.SD cs.SE

    ASDF: A Differential Testing Framework for Automatic Speech Recognition Systems

    Authors: Daniel Hao Xian Yuen, Andrew Yong Chen Pang, Zhou Yang, Chun Yong Chong, Mei Kuan Lim, David Lo

    Abstract: Recent years have witnessed wider adoption of Automated Speech Recognition (ASR) techniques in various domains. Consequently, evaluating and enhancing the quality of ASR systems is of great importance. This paper proposes ASDF, an Automated Speech Recognition Differential Testing Framework for testing ASR systems. ASDF extends an existing ASR testing tool, the CrossASR++, which synthesizes test ca… ▽ More

    Submitted 10 February, 2023; originally announced February 2023.

    Comments: Accpeted by ICST 2023 Tool Demo Track

  21. arXiv:2301.05069  [pdf, ps, other

    cs.SE

    Open Design Case Study -- A Crowdsourcing Effort to Curate Software Design Case Studies

    Authors: Chun Yong Chong, Eunsuk Kang, Mary Shaw

    Abstract: Case study-based learning has been successfully integrated into various courses, including software engineering education. In the context of software design courses, the use of case studies often entails sharing of real successful or failed software development. Using examples of real-world case studies allows educators to reinforce the applicability and usefulness of fundamental design concepts,… ▽ More

    Submitted 12 January, 2023; originally announced January 2023.

    Comments: 6 pages, accepted at ICSE-SEET2023

  22. arXiv:2211.06034  [pdf, other

    cs.LG cs.AI

    Does Deep Learning REALLY Outperform Non-deep Machine Learning for Clinical Prediction on Physiological Time Series?

    Authors: Ke Liao, Wei Wang, Armagan Elibol, Lingzhong Meng, Xu Zhao, Nak Young Chong

    Abstract: Machine learning has been widely used in healthcare applications to approximate complex models, for clinical diagnosis, prognosis, and treatment. As deep learning has the outstanding ability to extract information from time series, its true capabilities on sparse, irregularly sampled, multivariate, and imbalanced physiological data are not yet fully explored. In this paper, we systematically exami… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

  23. arXiv:2210.15876  [pdf, ps, other

    eess.AS cs.CL cs.SD

    Random Utterance Concatenation Based Data Augmentation for Improving Short-video Speech Recognition

    Authors: Yist Y. Lin, Tao Han, Haihua Xu, Van Tung Pham, Yerbolat Khassanov, Tze Yuang Chong, Yi He, Lu Lu, Zejun Ma

    Abstract: One of limitations in end-to-end automatic speech recognition (ASR) framework is its performance would be compromised if train-test utterance lengths are mismatched. In this paper, we propose an on-the-fly random utterance concatenation (RUC) based data augmentation method to alleviate train-test utterance length mismatch issue for short-video ASR task. Specifically, we are motivated by observatio… ▽ More

    Submitted 25 May, 2023; v1 submitted 27 October, 2022; originally announced October 2022.

    Comments: 5 pages, 3 figures, 4 tables

  24. SERCNN: Stacked Embedding Recurrent Convolutional Neural Network in Detecting Depression on Twitter

    Authors: Heng Ee Tay, Mei Kuan Lim, Chun Yong Chong

    Abstract: Conventional approaches to identify depression are not scalable, and the public has limited awareness of mental health, especially in developing countries. As evident by recent studies, social media has the potential to complement mental health screening on a greater scale. The vast amount of first-person narrative posts in chronological order can provide insights into one's thoughts, feelings, be… ▽ More

    Submitted 5 August, 2022; v1 submitted 29 July, 2022; originally announced July 2022.

    Comments: This paper has been accepted at the AIHA 2022 workshop of the ICPR 2022 conference

  25. arXiv:2204.08806  [pdf, other

    cs.CY cs.CL

    Understanding Toxicity Triggers on Reddit in the Context of Singapore

    Authors: Yun Yu Chong, Haewoon Kwak

    Abstract: While the contagious nature of online toxicity sparked increasing interest in its early detection and prevention, most of the literature focuses on the Western world. In this work, we demonstrate that 1) it is possible to detect toxicity triggers in an Asian online community, and 2) toxicity triggers can be strikingly different between Western and Eastern contexts.

    Submitted 19 April, 2022; originally announced April 2022.

    Comments: Accepted in AAAI ICWSM'22

    ACM Class: J.4; K.4

  26. arXiv:2204.08612  [pdf, other

    cs.CV

    Metamorphic Testing-based Adversarial Attack to Fool Deepfake Detectors

    Authors: Nyee Thoang Lim, Meng Yi Kuan, Muxin Pu, Mei Kuan Lim, Chun Yong Chong

    Abstract: Deepfakes utilise Artificial Intelligence (AI) techniques to create synthetic media where the likeness of one person is replaced with another. There are growing concerns that deepfakes can be maliciously used to create misleading and harmful digital contents. As deepfakes become more common, there is a dire need for deepfake detection technology to help spot deepfake media. Present deepfake detect… ▽ More

    Submitted 31 May, 2022; v1 submitted 18 April, 2022; originally announced April 2022.

    Comments: paper accepted at 26TH International Conference on Pattern Recognition (ICPR2022)

  27. arXiv:2203.06825  [pdf, other

    cs.CV cs.SE

    Fairness Evaluation in Deepfake Detection Models using Metamorphic Testing

    Authors: Muxin Pu, Meng Yi Kuan, Nyee Thoang Lim, Chun Yong Chong, Mei Kuan Lim

    Abstract: Fairness of deepfake detectors in the presence of anomalies are not well investigated, especially if those anomalies are more prominent in either male or female subjects. The primary motivation for this work is to evaluate how deepfake detection model behaves under such anomalies. However, due to the black-box nature of deep learning (DL) and artificial intelligence (AI) systems, it is hard to pre… ▽ More

    Submitted 13 March, 2022; originally announced March 2022.

    Comments: 8 pages, accepted at 7th International Workshop on Metamorphic Testing (MET22)

  28. arXiv:2107.01766  [pdf, other

    cs.SE

    E-SC4R: Explaining Software Clustering for Remodularisation

    Authors: Alvin Jian Jia Tan, Chun Yong Chong, Aldeida Aleti

    Abstract: Maintenance of existing software requires a large amount of time for comprehending the source code. The architecture of a software, however, may not be clear to maintainers if up to date documentations are not available. Software clustering is often used as a remodularisation and architecture recovery technique to help recover a semantic representation of the software design. Due to the diverse do… ▽ More

    Submitted 2 October, 2021; v1 submitted 4 July, 2021; originally announced July 2021.

    Comments: 31 pages

  29. arXiv:2106.07513  [pdf, other

    cs.SE

    CodeLabeller: A Web-based Code Annotation Tool for Java Design Patterns and Summaries

    Authors: Najam Nazar, Norman Chen, Chun Yong Chong

    Abstract: While constructing supervised learning models, we require labelled examples to build a corpus and train a machine learning model. However, most studies have built the labelled dataset manually, which in many occasions is a daunting task. To mitigate this problem, we have built an online tool called CodeLabeller. CodeLabeller is a web-based tool that aims to provide an efficient approach to handlin… ▽ More

    Submitted 13 March, 2023; v1 submitted 14 June, 2021; originally announced June 2021.

    Comments: 15 pages, 5 Figures, 6 Tables

  30. arXiv:2104.01572  [pdf, other

    cs.CL

    TransfoRNN: Capturing the Sequential Information in Self-Attention Representations for Language Modeling

    Authors: Tze Yuang Chong, Xuyang Wang, Lin Yang, Junjie Wang

    Abstract: In this paper, we describe the use of recurrent neural networks to capture sequential information from the self-attention representations to improve the Transformers. Although self-attention mechanism provides a means to exploit long context, the sequential information, i.e. the arrangement of tokens, is not explicitly captured. We propose to cascade the recurrent neural networks to the Transforme… ▽ More

    Submitted 4 April, 2021; originally announced April 2021.

    Comments: INTERSPEECH 2021 (under reviewed)

  31. Robustness Evaluation of Stacked Generative Adversarial Networks using Metamorphic Testing

    Authors: Hyejin Park, Taaha Waseem, Wen Qi Teo, Ying Hwei Low, Mei Kuan Lim, Chun Yong Chong

    Abstract: Synthesising photo-realistic images from natural language is one of the challenging problems in computer vision. Over the past decade, a number of approaches have been proposed, of which the improved Stacked Generative Adversarial Network (StackGAN-v2) has proven capable of generating high resolution images that reflect the details specified in the input text descriptions. In this paper, we aim to… ▽ More

    Submitted 2 October, 2021; v1 submitted 4 March, 2021; originally announced March 2021.

    Comments: 8 pages, accepted at the 6th International Workshop on Metamorphic Testing (MET'21)

  32. arXiv:2101.04837  [pdf, other

    cs.SE

    Assessing the Students' Understanding and their Mistakes in Code Review Checklists -- An Experience Report of 1,791 Code Review Checklist Questions from 394 Students

    Authors: Chun Yong Chong, Patanamon Thongtanunam, Chakkrit Tantithamthavorn

    Abstract: Code review is a widely-used practice in software development companies to identify defects. Hence, code review has been included in many software engineering curricula at universities worldwide. However, teaching code review is still a challenging task because the code review effectiveness depends on the code reading and analytical skills of a reviewer. While several studies have investigated the… ▽ More

    Submitted 12 January, 2021; originally announced January 2021.

    Comments: 10 pages, accepted at the International Conference on Software Engineering: Joint Track on Software Engineering Education and Training Track (ICSE'21-JSEET)

  33. arXiv:2010.04962  [pdf, other

    cs.CV

    HCNet: Hierarchical Context Network for Semantic Segmentation

    Authors: Yanwen Chong, Congchong Nie, Yulong Tao, Xiaoshu Chen, Shaoming Pan

    Abstract: Global context information is vital in visual understanding problems, especially in pixel-level semantic segmentation. The mainstream methods adopt the self-attention mechanism to model global context information. However, pixels belonging to different classes usually have weak feature correlation. Modeling the global pixel-level correlation matrix indiscriminately is extremely redundant in the se… ▽ More

    Submitted 19 October, 2020; v1 submitted 10 October, 2020; originally announced October 2020.

  34. arXiv:2008.12298  [pdf, other

    cs.CV cs.GR

    One Shot 3D Photography

    Authors: Johannes Kopf, Kevin Matzen, Suhib Alsisan, Ocean Quigley, Francis Ge, Yangming Chong, Josh Patterson, Jan-Michael Frahm, Shu Wu, Matthew Yu, Peizhao Zhang, Zijian He, Peter Vajda, Ayush Saraf, Michael Cohen

    Abstract: 3D photography is a new medium that allows viewers to more fully experience a captured moment. In this work, we refer to a 3D photo as one that displays parallax induced by moving the viewpoint (as opposed to a stereo pair with a fixed viewpoint). 3D photos are static in time, like traditional photos, but are displayed with interactive parallax on mobile or desktop screens, as well as on Virtual R… ▽ More

    Submitted 1 September, 2020; v1 submitted 27 August, 2020; originally announced August 2020.

    Comments: Project page: https://facebookresearch.github.io/one_shot_3d_photography/ Code: https://github.com/facebookresearch/one_shot_3d_photography

    Journal ref: ACM Transactions on Graphics (Proceedings of SIGGRAPH 2020), Volume 39, Number 4, 2020

  35. arXiv:1907.04003  [pdf, other

    cs.LG stat.ML

    Mean Spectral Normalization of Deep Neural Networks for Embedded Automation

    Authors: Anand Krishnamoorthy Subramanian, Nak Young Chong

    Abstract: Deep Neural Networks (DNNs) have begun to thrive in the field of automation systems, owing to the recent advancements in standardising various aspects such as architecture, optimization techniques, and regularization. In this paper, we take a step towards a better understanding of Spectral Normalization (SN) and its potential for standardizing regularization of a wider range of Deep Learning model… ▽ More

    Submitted 9 July, 2019; originally announced July 2019.

    Comments: 8 pagesm IEEE CASE 2019

  36. arXiv:1905.12559  [pdf, other

    cs.RO

    ORangE: Operational Range Estimation for Mobile Robot Exploration on a Single Discharge Cycle

    Authors: Kshitij Tiwari, Xuesu Xiao, Ville Kyrki, Nak Young Chong

    Abstract: This paper presents an approach for estimating the operational range for mobile robot exploration on a single battery discharge. Deploying robots in the wild usually requires uninterrupted energy sources to maintain the robot's mobility throughout the entire mission. However, for most endeavors into the unknown environments, recharging is usually not an option, due to the lack of pre-installed rec… ▽ More

    Submitted 19 June, 2019; v1 submitted 29 May, 2019; originally announced May 2019.

    Comments: Accepted by RSS 2019 Workshop Robots in the Wild: Challenges in Deploying Robust Autonomy for Robotic Exploration

  37. arXiv:1811.03159  [pdf, other

    cs.RO

    Estimating Achievable Range of Ground Robots Operating on Single Battery Discharge for Operational Efficacy Amelioration

    Authors: Kshitij Tiwari, Xuesu Xiao, Nak Young Chong

    Abstract: Mobile robots are increasingly being used to assist with active pursuit and law enforcement. One major limitation for such missions is the resource (battery) allocated to the robot. Factors like nature and agility of evader, terrain over which pursuit is being carried out, plausible traversal velocity and the amount of necessary data to be collected all influence how long the robot can last in the… ▽ More

    Submitted 7 November, 2018; originally announced November 2018.

    Comments: 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, Madrid, Spain

  38. Paving the Way for Culturally Competent Robots: a Position Paper

    Authors: Barbara Bruno, Nak Young Chong, Hiroko Kamide, Sanjeev Kanoria, Jaeryoung Lee, Yuto Lim, Amit Kumar Pandey, Chris Papadopoulos, Irena Papadopoulos, Federico Pecora, Alessandro Saffiotti, Antonio Sgorbissa

    Abstract: Cultural competence is a well known requirement for an effective healthcare, widely investigated in the nursing literature. We claim that personal assistive robots should likewise be culturally competent, aware of general cultural characteristics and of the different forms they take in different individuals, and sensitive to cultural differences while perceiving, reasoning, and acting. Drawing ins… ▽ More

    Submitted 22 March, 2018; originally announced March 2018.

    Comments: Presented at: 26th IEEE International Symposium onRobot and Human Interactive Communication (RO-MAN), 2017, Lisbon, Portugal. arXiv admin note: substantial text overlap with arXiv:1708.06276

    Journal ref: Proc. 26th IEEE International Symposium onRobot and Human Interactive Communication (RO-MAN), 2017, Lisbon, Portugal

  39. arXiv:1708.06276  [pdf, other

    cs.RO cs.AI cs.CY cs.HC

    The CARESSES EU-Japan project: making assistive robots culturally competent

    Authors: Barbara Bruno, Nak Young Chong, Hiroko Kamide, Sanjeev Kanoria, Jaeryoung Lee, Yuto Lim, Amit Kumar Pandey, Chris Papadopoulos, Irena Papadopoulos, Federico Pecora, Alessandro Saffiotti, Antonio Sgorbissa

    Abstract: The nursing literature shows that cultural competence is an important requirement for effective healthcare. We claim that personal assistive robots should likewise be culturally competent, that is, they should be aware of general cultural characteristics and of the different forms they take in different individuals, and take these into account while perceiving, reasoning, and acting. The CARESSES… ▽ More

    Submitted 21 August, 2017; originally announced August 2017.

    Comments: Paper presented at: Ambient Assisted Living, Italian Forum. Genova, Italy, June 12--15, 2017

    ACM Class: I.2.9

  40. arXiv:1701.06439  [pdf

    cs.CV

    Segmentation-free Vehicle License Plate Recognition using ConvNet-RNN

    Authors: Teik Koon Cheang, Yong Shean Chong, Yong Haur Tay

    Abstract: While vehicle license plate recognition (VLPR) is usually done with a sliding window approach, it can have limited performance on datasets with characters that are of variable width. This can be solved by hand-crafting algorithms to prescale the characters. While this approach can work fairly well, the recognizer is only aware of the pixels within each detector window, and fails to account for oth… ▽ More

    Submitted 23 January, 2017; originally announced January 2017.

    Comments: 5 pages, 3 figures, International Workshop on Advanced Image Technology, January, 8-10, 2017. Penang, Malaysia. Proceeding IWAIT2017

    ACM Class: I.4.0, I.5.1, I.5.4, I.7.5

  41. arXiv:1701.01546  [pdf, other

    cs.CV

    Abnormal Event Detection in Videos using Spatiotemporal Autoencoder

    Authors: Yong Shean Chong, Yong Haur Tay

    Abstract: We present an efficient method for detecting anomalies in videos. Recent applications of convolutional neural networks have shown promises of convolutional layers for object detection and recognition, especially in images. However, convolutional neural networks are supervised and require labels as learning signals. We propose a spatiotemporal architecture for anomaly detection in videos including… ▽ More

    Submitted 6 January, 2017; originally announced January 2017.

  42. arXiv:1505.00523  [pdf

    cs.CV

    Modeling Representation of Videos for Anomaly Detection using Deep Learning: A Review

    Authors: Yong Shean Chong, Yong Haur Tay

    Abstract: This review article surveys the current progresses made toward video-based anomaly detection. We address the most fundamental aspect for video anomaly detection, that is, video feature representation. Much research works have been done in finding the right representation to perform anomaly detection in video streams accurately with an acceptable false alarm rate. However, this is very challenging… ▽ More

    Submitted 4 May, 2015; originally announced May 2015.

    Comments: arXiv admin note: text overlap with arXiv:1411.4423 by other authors

  43. arXiv:1304.3427  [pdf

    cs.AI

    Metaprobability and Dempster-Shafer in Evidential Reasoning

    Authors: Robert Fung, Chee Yee Chong

    Abstract: Evidential reasoning in expert systems has often used ad-hoc uncertainty calculi. Although it is generally accepted that probability theory provides a firm theoretical foundation, researchers have found some problems with its use as a workable uncertainty calculus. Among these problems are representation of ignorance, consistency of probabilistic judgements, and adjustment of a priori judgements w… ▽ More

    Submitted 27 March, 2013; originally announced April 2013.

    Comments: Appears in Proceedings of the First Conference on Uncertainty in Artificial Intelligence (UAI1985)

    Report number: UAI-P-1985-PG-76-83