Skip to main content

Showing 1–44 of 44 results for author: Chan, C S

  1. arXiv:2405.17462  [pdf, other

    cs.LG

    Ferrari: Federated Feature Unlearning via Optimizing Feature Sensitivity

    Authors: Hanlin Gu, WinKent Ong, Chee Seng Chan, Lixin Fan

    Abstract: The advent of Federated Learning (FL) highlights the practical necessity for the 'right to be forgotten' for all clients, allowing them to request data deletion from the machine learning model's service provider. This necessity has spurred a growing demand for Federated Unlearning (FU). Feature unlearning has gained considerable attention due to its applications in unlearning sensitive features, b… ▽ More

    Submitted 29 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: TLDR: The need for a "right to be forgotten" in Federated Learning has led to the development of the Ferrari framework, which efficiently unlearns sensitive features using a Lipschitz continuity-based metric, proven effective in extensive testing

  2. arXiv:2404.14135  [pdf, other

    cs.CV

    Text in the Dark: Extremely Low-Light Text Image Enhancement

    Authors: Che-Tsung Lin, Chun Chet Ng, Zhi Qin Tan, Wan Jun Nah, Xinyu Wang, Jie Long Kew, Pohao Hsu, Shang Hong Lai, Chee Seng Chan, Christopher Zach

    Abstract: Extremely low-light text images are common in natural scenes, making scene text detection and recognition challenging. One solution is to enhance these images using low-light image enhancement methods before text extraction. However, previous methods often do not try to particularly address the significance of low-level features, which are crucial for optimal performance on downstream scene text t… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: The first two authors contributed equally to this work

  3. arXiv:2404.13944  [pdf, other

    cs.CV cs.MM

    Gorgeous: Create Your Desired Character Facial Makeup from Any Ideas

    Authors: Jia Wei Sii, Chee Seng Chan

    Abstract: Contemporary makeup transfer methods primarily focus on replicating makeup from one face to another, considerably limiting their use in creating diverse and creative character makeup essential for visual storytelling. Such methods typically fail to address the need for uniqueness and contextual relevance, specifically aligning with character and story settings as they depend heavily on existing fa… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: Project page: https://github.com/JiaWeiSii/gorgeous/

  4. arXiv:2401.09495   

    cs.CV

    IPR-NeRF: Ownership Verification meets Neural Radiance Field

    Authors: Win Kent Ong, Kam Woh Ng, Chee Seng Chan, Yi Zhe Song, Tao Xiang

    Abstract: Neural Radiance Field (NeRF) models have gained significant attention in the computer vision community in the recent past with state-of-the-art visual quality and produced impressive demonstrations. Since then, technopreneurs have sought to leverage NeRF models into a profitable business. Therefore, NeRF models make it worth the risk of plagiarizers illegally copying, re-distributing, or misusing… ▽ More

    Submitted 22 January, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: Error on result tabulation of state of the art method which might cause misleading to readers

  5. arXiv:2312.05849  [pdf, other

    cs.CV cs.GR cs.MM

    InteractDiffusion: Interaction Control in Text-to-Image Diffusion Models

    Authors: Jiun Tian Hoe, Xudong Jiang, Chee Seng Chan, Yap-Peng Tan, Weipeng Hu

    Abstract: Large-scale text-to-image (T2I) diffusion models have showcased incredible capabilities in generating coherent images based on textual descriptions, enabling vast applications in content generation. While recent advancements have introduced control over factors such as object localization, posture, and image contours, a crucial gap remains in our ability to control the interactions between objects… ▽ More

    Submitted 26 February, 2024; v1 submitted 10 December, 2023; originally announced December 2023.

    Comments: Website: https://jiuntian.github.io/interactdiffusion. Accepted at CVPR2024

  6. arXiv:2312.03419  [pdf, other

    cs.CR

    Synthesizing Physical Backdoor Datasets: An Automated Framework Leveraging Deep Generative Models

    Authors: Sze Jue Yang, Chinh D. La, Quang H. Nguyen, Kok-Seng Wong, Anh Tuan Tran, Chee Seng Chan, Khoa D. Doan

    Abstract: Backdoor attacks, representing an emerging threat to the integrity of deep neural networks, have garnered significant attention due to their ability to compromise deep learning systems clandestinely. While numerous backdoor attacks occur within the digital realm, their practical implementation in real-world prediction systems remains limited and vulnerable to disturbances in the physical world. Co… ▽ More

    Submitted 15 March, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

  7. arXiv:2308.16684  [pdf, other

    cs.CR cs.AI cs.CV cs.LG

    Everyone Can Attack: Repurpose Lossy Compression as a Natural Backdoor Attack

    Authors: Sze Jue Yang, Quang Nguyen, Chee Seng Chan, Khoa D. Doan

    Abstract: The vulnerabilities to backdoor attacks have recently threatened the trustworthiness of machine learning models in practical applications. Conventional wisdom suggests that not everyone can be an attacker since the process of designing the trigger generation algorithm often involves significant effort and extensive experimentation to ensure the attack's stealthiness and effectiveness. Alternativel… ▽ More

    Submitted 3 September, 2023; v1 submitted 31 August, 2023; originally announced August 2023.

    Comments: 14 pages. This paper shows everyone can mount a powerful and stealthy backdoor attack with the widely-used lossy image compression

  8. arXiv:2302.07669  [pdf, other

    cs.CV cs.IR

    Unsupervised Hashing with Similarity Distribution Calibration

    Authors: Kam Woh Ng, Xiatian Zhu, Jiun Tian Hoe, Chee Seng Chan, Tianyu Zhang, Yi-Zhe Song, Tao Xiang

    Abstract: Unsupervised hashing methods typically aim to preserve the similarity between data points in a feature space by mapping them to binary hash codes. However, these methods often overlook the fact that the similarity between data points in the continuous feature space may not be preserved in the discrete hash code space, due to the limited similarity range of hash codes. The similarity range is bound… ▽ More

    Submitted 31 August, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

    Comments: BMVC 2023

  9. arXiv:2210.00743  [pdf, other

    cs.CL cs.CR

    An Embarrassingly Simple Approach for Intellectual Property Rights Protection on Recurrent Neural Networks

    Authors: Zhi Qin Tan, Hao Shan Wong, Chee Seng Chan

    Abstract: Capitalise on deep learning models, offering Natural Language Processing (NLP) solutions as a part of the Machine Learning as a Service (MLaaS) has generated handsome revenues. At the same time, it is known that the creation of these lucrative deep models is non-trivial. Therefore, protecting these inventions intellectual property rights (IPR) from being abused, stolen and plagiarized is vital. Th… ▽ More

    Submitted 3 October, 2022; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: Accepted at AACL-IJCNLP 2022 (Fig. 1 updated)

  10. arXiv:2204.05514  [pdf, other

    cs.CL cs.LG

    A Comparative Study of Faithfulness Metrics for Model Interpretability Methods

    Authors: Chun Sik Chan, Huanqi Kong, Guanqing Liang

    Abstract: Interpretation methods to reveal the internal reasoning processes behind machine learning models have attracted increasing attention in recent years. To quantify the extent to which the identified interpretations truly reflect the intrinsic decision-making mechanisms, various faithfulness evaluation metrics have been proposed. However, we find that different faithfulness metrics show conflicting p… ▽ More

    Submitted 12 April, 2022; originally announced April 2022.

    Comments: Accepted as a long paper to ACL 2022 main conference

  11. arXiv:2204.00630  [pdf, other

    eess.IV cs.CV

    Extremely Low-light Image Enhancement with Scene Text Restoration

    Authors: Pohao Hsu, Che-Tsung Lin, Chun Chet Ng, Jie-Long Kew, Mei Yih Tan, Shang-Hong Lai, Chee Seng Chan, Christopher Zach

    Abstract: Deep learning-based methods have made impressive progress in enhancing extremely low-light images - the image quality of the reconstructed images has generally improved. However, we found out that most of these methods could not sufficiently recover the image details, for instance, the texts in the scene. In this paper, a novel image enhancement framework is proposed to precisely restore the scene… ▽ More

    Submitted 1 April, 2022; originally announced April 2022.

  12. arXiv:2202.05451  [pdf, other

    cs.CV cs.CL cs.LG

    ACORT: A Compact Object Relation Transformer for Parameter Efficient Image Captioning

    Authors: Jia Huei Tan, Ying Hua Tan, Chee Seng Chan, Joon Huang Chuah

    Abstract: Recent research that applies Transformer-based architectures to image captioning has resulted in state-of-the-art image captioning performance, capitalising on the success of Transformers on natural language tasks. Unfortunately, though these models work well, one major flaw is their large model sizes. To this end, we present three parameter reduction methods for image captioning Transformers: Rad… ▽ More

    Submitted 11 February, 2022; originally announced February 2022.

    Comments: Neurocomputing; In Press

  13. End-to-End Supermask Pruning: Learning to Prune Image Captioning Models

    Authors: Jia Huei Tan, Chee Seng Chan, Joon Huang Chuah

    Abstract: With the advancement of deep models, research work on image captioning has led to a remarkable gain in raw performance over the last decade, along with increasing model complexity and computational cost. However, surprisingly works on compression of deep networks for image captioning task has received little to no attention. For the first time in image captioning research, we provide an extensive… ▽ More

    Submitted 7 October, 2021; originally announced October 2021.

    Comments: Pattern Recognition; In Press

  14. arXiv:2109.14449  [pdf, other

    cs.CV cs.LG

    One Loss for All: Deep Hashing with a Single Cosine Similarity based Learning Objective

    Authors: Jiun Tian Hoe, Kam Woh Ng, Tianyu Zhang, Chee Seng Chan, Yi-Zhe Song, Tao Xiang

    Abstract: A deep hashing model typically has two main learning objectives: to make the learned binary hash codes discriminative and to minimize a quantization error. With further constraints such as bit balance and code orthogonality, it is not uncommon for existing models to employ a large number (>4) of losses. This leads to difficulties in model training and subsequently impedes their effectiveness. In t… ▽ More

    Submitted 29 September, 2021; originally announced September 2021.

    Comments: Accepted at NeurIPS 2021

  15. arXiv:2107.05279  [pdf, other

    cs.CV

    ICDAR 2021 Competition on Integrated Circuit Text Spotting and Aesthetic Assessment

    Authors: Chun Chet Ng, Akmalul Khairi Bin Nazaruddin, Yeong Khang Lee, Xinyu Wang, Yuliang Liu, Chee Seng Chan, Lianwen Jin, Yipeng Sun, Lixin Fan

    Abstract: With hundreds of thousands of electronic chip components are being manufactured every day, chip manufacturers have seen an increasing demand in seeking a more efficient and effective way of inspecting the quality of printed texts on chip components. The major problem that deters this area of research is the lacking of realistic text on chips datasets to act as a strong foundation. Hence, a text on… ▽ More

    Submitted 12 July, 2021; originally announced July 2021.

    Comments: Technical report of ICDAR 2021 Competition on Integrated Circuit Text Spotting and Aesthetic Assessment

    Journal ref: International Conference on Document Analysis and Recognition (ICDAR) 2021

  16. arXiv:2103.09173  [pdf, other

    cs.AI

    Ternary Hashing

    Authors: Chang Liu, Lixin Fan, Kam Woh Ng, Yilun Jin, Ce Ju, Tianyu Zhang, Chee Seng Chan, Qiang Yang

    Abstract: This paper proposes a novel ternary hash encoding for learning to hash methods, which provides a principled more efficient coding scheme with performances better than those of the state-of-the-art binary hashing counterparts. Two kinds of axiomatic ternary logic, Kleene logic and Łukasiewicz logic are adopted to calculate the Ternary Hamming Distance (THD) for both the learning/encoding and testin… ▽ More

    Submitted 19 March, 2021; v1 submitted 16 March, 2021; originally announced March 2021.

  17. arXiv:2102.04362  [pdf, other

    cs.CR cs.AI cs.CV

    Protecting Intellectual Property of Generative Adversarial Networks from Ambiguity Attack

    Authors: Ding Sheng Ong, Chee Seng Chan, Kam Woh Ng, Lixin Fan, Qiang Yang

    Abstract: Ever since Machine Learning as a Service (MLaaS) emerges as a viable business that utilizes deep learning models to generate lucrative revenue, Intellectual Property Right (IPR) has become a major concern because these deep learning models can easily be replicated, shared, and re-distributed by any unauthorized third parties. To the best of our knowledge, one of the prominent deep learning models… ▽ More

    Submitted 28 February, 2021; v1 submitted 8 February, 2021; originally announced February 2021.

    Comments: Accepted at CVPR2021

  18. arXiv:2008.11009  [pdf, other

    cs.CV cs.CR

    Protect, Show, Attend and Tell: Empowering Image Captioning Models with Ownership Protection

    Authors: Jian Han Lim, Chee Seng Chan, Kam Woh Ng, Lixin Fan, Qiang Yang

    Abstract: By and large, existing Intellectual Property (IP) protection on deep neural networks typically i) focus on image classification task only, and ii) follow a standard digital watermarking framework that was conventionally used to protect the ownership of multimedia and video content. This paper demonstrates that the current digital watermarking framework is insufficient to protect image captioning t… ▽ More

    Submitted 31 August, 2021; v1 submitted 25 August, 2020; originally announced August 2020.

    Comments: Accepted at Pattern Recognition, 17 pages

  19. arXiv:2006.11601  [pdf, other

    cs.LG cs.CR cs.DC stat.ML

    Rethinking Privacy Preserving Deep Learning: How to Evaluate and Thwart Privacy Attacks

    Authors: Lixin Fan, Kam Woh Ng, Ce Ju, Tianyu Zhang, Chang Liu, Chee Seng Chan, Qiang Yang

    Abstract: This paper investigates capabilities of Privacy-Preserving Deep Learning (PPDL) mechanisms against various forms of privacy attacks. First, we propose to quantitatively measure the trade-off between model accuracy and privacy losses incurred by reconstruction, tracing and membership attacks. Second, we formulate reconstruction attacks as solving a noisy system of linear equations, and prove that a… ▽ More

    Submitted 23 June, 2020; v1 submitted 20 June, 2020; originally announced June 2020.

    Comments: under review, 36 pages (updated Eq. 3 and Fig. 8)

  20. arXiv:2002.10215  [pdf, other

    cs.CV

    On the General Value of Evidence, and Bilingual Scene-Text Visual Question Answering

    Authors: Xinyu Wang, Yuliang Liu, Chunhua Shen, Chun Chet Ng, Canjie Luo, Lianwen Jin, Chee Seng Chan, Anton van den Hengel, Liangwei Wang

    Abstract: Visual Question Answering (VQA) methods have made incredible progress, but suffer from a failure to generalize. This is visible in the fact that they are vulnerable to learning coincidental correlations in the data rather than deeper relations between image content and ideas expressed in language. We present a dataset that takes a step towards addressing this problem in that it contains questions… ▽ More

    Submitted 25 February, 2020; v1 submitted 24 February, 2020; originally announced February 2020.

    Comments: Accepted to Proc. IEEE Conf. Computer Vision and Pattern Recognition 2020

  21. arXiv:1909.07830  [pdf, other

    cs.CR cs.CV cs.LG

    [Extended version] Rethinking Deep Neural Network Ownership Verification: Embedding Passports to Defeat Ambiguity Attacks

    Authors: Lixin Fan, Kam Woh Ng, Chee Seng Chan

    Abstract: With substantial amount of time, resources and human (team) efforts invested to explore and develop successful deep neural networks (DNN), there emerges an urgent need to protect these inventions from being illegally copied, redistributed, or abused without respecting the intellectual properties of legitimate owners. Following recent progresses along this line, we investigate a number of watermark… ▽ More

    Submitted 2 November, 2019; v1 submitted 16 September, 2019; originally announced September 2019.

    Comments: This paper is accepted by NeurIPS 2019; Our code is available at https://github.com/kamwoh/DeepIPR. This is the extended version

  22. arXiv:1909.07741  [pdf, other

    cs.CV cs.LG cs.MM

    ICDAR 2019 Competition on Large-scale Street View Text with Partial Labeling -- RRC-LSVT

    Authors: Yipeng Sun, Zihan Ni, Chee-Kheng Chng, Yuliang Liu, Canjie Luo, Chun Chet Ng, Junyu Han, Errui Ding, Jingtuo Liu, Dimosthenis Karatzas, Chee Seng Chan, Lianwen Jin

    Abstract: Robust text reading from street view images provides valuable information for various applications. Performance improvement of existing methods in such a challenging scenario heavily relies on the amount of fully annotated training data, which is costly and in-efficient to obtain. To scale up the amount of training data while keeping the labeling procedure cost-effective, this competition introduc… ▽ More

    Submitted 17 September, 2019; originally announced September 2019.

    Comments: ICDAR 2019 Robust Reading Challenge in IAPR International Conference on Document Analysis and Recognition (ICDAR)

  23. arXiv:1909.07145  [pdf, other

    cs.CV

    ICDAR2019 Robust Reading Challenge on Arbitrary-Shaped Text (RRC-ArT)

    Authors: Chee-Kheng Chng, Yuliang Liu, Yipeng Sun, Chun Chet Ng, Canjie Luo, Zihan Ni, ChuanMing Fang, Shuaitao Zhang, Junyu Han, Errui Ding, Jingtuo Liu, Dimosthenis Karatzas, Chee Seng Chan, Lianwen Jin

    Abstract: This paper reports the ICDAR2019 Robust Reading Challenge on Arbitrary-Shaped Text (RRC-ArT) that consists of three major challenges: i) scene text detection, ii) scene text recognition, and iii) scene text spotting. A total of 78 submissions from 46 unique teams/individuals were received for this competition. The top performing score of each challenge is as follows: i) T1 - 82.65%, ii) T2.1 - 74.… ▽ More

    Submitted 16 September, 2019; originally announced September 2019.

    Comments: Technical report of ICDAR2019 Robust Reading Challenge on Arbitrary-Shaped Text (RRC-ArT) Competition

  24. arXiv:1908.10797  [pdf, other

    cs.CV cs.CL cs.LG

    Image Captioning with Sparse Recurrent Neural Network

    Authors: Jia Huei Tan, Chee Seng Chan, Joon Huang Chuah

    Abstract: Recurrent Neural Network (RNN) has been widely used to tackle a wide variety of language generation problems and are capable of attaining state-of-the-art (SOTA) performance. However despite its impressive results, the large number of parameters in the RNN model makes deployment to mobile and embedded devices infeasible. Driven by this problem, many works have proposed a number of pruning methods… ▽ More

    Submitted 28 October, 2019; v1 submitted 28 August, 2019; originally announced August 2019.

    Comments: Corrected Eq 11, updated Table 5

  25. arXiv:1905.04368  [pdf, other

    cs.CR cs.CV cs.LG

    Digital Passport: A Novel Technological Strategy for Intellectual Property Protection of Convolutional Neural Networks

    Authors: Lixin Fan, KamWoh Ng, Chee Seng Chan

    Abstract: In order to prevent deep neural networks from being infringed by unauthorized parties, we propose a generic solution which embeds a designated digital passport into a network, and subsequently, either paralyzes the network functionalities for unauthorized usages or maintain its functionalities in the presence of a verified passport. Such a desired network behavior is successfully demonstrated in a… ▽ More

    Submitted 10 May, 2019; originally announced May 2019.

    Comments: This paper proposes a new timely IPR solution that embed digital passports into CNN models to prevent the unauthorized network usage (i.e. infringement) by paralyzing the networks while maintaining its functionality for verified users

  26. COMIC: Towards A Compact Image Captioning Model with Attention

    Authors: Jia Huei Tan, Chee Seng Chan, Joon Huang Chuah

    Abstract: Recent works in image captioning have shown very promising raw performance. However, we realize that most of these encoder-decoder style networks with attention do not scale naturally to large vocabulary size, making them difficult to be deployed on embedded system with limited hardware resources. This is because the size of word and output embedding matrices grow proportionally with the size of v… ▽ More

    Submitted 11 June, 2019; v1 submitted 4 March, 2019; originally announced March 2019.

    Comments: Added source code link and new results in Table 3

  27. arXiv:1901.08551  [pdf

    cs.LG stat.ML

    A Universal Logic Operator for Interpretable Deep Convolution Networks

    Authors: KamWoh Ng, Lixin Fan, Chee Seng Chan

    Abstract: Explaining neural network computation in terms of probabilistic/fuzzy logical operations has attracted much attention due to its simplicity and high interpretability. Different choices of logical operators such as AND, OR and XOR give rise to another dimension for network optimization, and in this paper, we study the open problem of learning a universal logical operator without prescribing to any… ▽ More

    Submitted 20 January, 2019; originally announced January 2019.

    Comments: In AAAI-19 Workshop on Network Interpretability for Deep Learning

  28. arXiv:1805.11227  [pdf, other

    cs.CV

    Getting to Know Low-light Images with The Exclusively Dark Dataset

    Authors: Yuen Peng Loh, Chee Seng Chan

    Abstract: Low-light is an inescapable element of our daily surroundings that greatly affects the efficiency of our vision. Research works on low-light has seen a steady growth, particularly in the field of image enhancement, but there is still a lack of a go-to database as benchmark. Besides, research fields that may assist us in low-light environments, such as object detection, has glossed over this aspect… ▽ More

    Submitted 28 May, 2018; originally announced May 2018.

    Comments: Exclusively Dark (ExDARK) dataset is a collection of 7,363 low-light images from very low-light environments to twilight (i.e 10 different conditions), and 12 object classes (as to PASCAL VOC) annotated on both image class level and local object bounding boxes. 16 pages, 13 figures, submitted to CVIU

  29. arXiv:1711.05557  [pdf, other

    cs.CV cs.AI cs.CL

    Phrase-based Image Captioning with Hierarchical LSTM Model

    Authors: Ying Hua Tan, Chee Seng Chan

    Abstract: Automatic generation of caption to describe the content of an image has been gaining a lot of research interests recently, where most of the existing works treat the image caption as pure sequential data. Natural language, however possess a temporal hierarchy structure, with complex dependencies between each subsequence. In this paper, we propose a phrase-based hierarchical Long Short-Term Memory… ▽ More

    Submitted 11 November, 2017; originally announced November 2017.

    Comments: 17 pages, 12 figures, ACCV2016 extension, phrase-based image captioning

  30. arXiv:1710.10400  [pdf, other

    cs.CV

    Total-Text: A Comprehensive Dataset for Scene Text Detection and Recognition

    Authors: Chee Kheng Chng, Chee Seng Chan

    Abstract: Text in curve orientation, despite being one of the common text orientations in real world environment, has close to zero existence in well received scene text datasets such as ICDAR2013 and MSRA-TD500. The main motivation of Total-Text is to fill this gap and facilitate a new research direction for the scene text community. On top of the conventional horizontal and multi-oriented texts, it featur… ▽ More

    Submitted 28 October, 2017; originally announced October 2017.

    Comments: Accepted as Oral presentation in ICDAR2017 (Extended version, 13 pages 17 figures). We introduce a new scene text dataset namely as Total-Text, which is more comprehensive than the existing scene text datasets as it consists of 1555 natural images with more than 3 different text orientations, one of a kind

  31. A Comprehensive Survey of Deep Learning in Remote Sensing: Theories, Tools and Challenges for the Community

    Authors: John E. Ball, Derek T. Anderson, Chee Seng Chan

    Abstract: In recent years, deep learning (DL), a re-branding of neural networks (NNs), has risen to the top in numerous areas, namely computer vision (CV), speech recognition, natural language processing, etc. Whereas remote sensing (RS) possesses a number of unique challenges, primarily related to sensors and applications, inevitably RS draws from many of the same theories as CV; e.g., statistics, fusion,… ▽ More

    Submitted 24 September, 2017; v1 submitted 1 September, 2017; originally announced September 2017.

    Comments: 64 pages, 411 references. To appear in Journal of Applied Remote Sensing

    Journal ref: J. Appl. Remote Sens. 11(4) (2017) 042609

  32. arXiv:1708.09533  [pdf, other

    cs.CV

    Improved ArtGAN for Conditional Synthesis of Natural Image and Artwork

    Authors: Wei Ren Tan, Chee Seng Chan, Hernan Aguirre, Kiyoshi Tanaka

    Abstract: This paper proposes a series of new approaches to improve Generative Adversarial Network (GAN) for conditional image synthesis and we name the proposed model as ArtGAN. One of the key innovation of ArtGAN is that, the gradient of the loss function w.r.t. the label (randomly assigned to each generated image) is back-propagated from the categorical discriminator to the generator. With the feedback f… ▽ More

    Submitted 23 August, 2018; v1 submitted 30 August, 2017; originally announced August 2017.

    Comments: 16 pages, 11 figures, accepted version at IEEE Transactions on Image Processing (T-IP)

  33. arXiv:1702.03410  [pdf, other

    cs.CV

    ArtGAN: Artwork Synthesis with Conditional Categorical GANs

    Authors: Wei Ren Tan, Chee Seng Chan, Hernan Aguirre, Kiyoshi Tanaka

    Abstract: This paper proposes an extension to the Generative Adversarial Networks (GANs), namely as ARTGAN to synthetically generate more challenging and complex images such as artwork that have abstract characteristics. This is in contrast to most of the current solutions that focused on generating natural images such as room interiors, birds, flowers and faces. The key innovation of our work is to allow b… ▽ More

    Submitted 19 April, 2017; v1 submitted 11 February, 2017; originally announced February 2017.

    Comments: 10 pages, 10 figures, submitted to ICIP2017 (extension version)

  34. arXiv:1608.05813  [pdf, other

    cs.CL cs.CV

    phi-LSTM: A Phrase-based Hierarchical LSTM Model for Image Captioning

    Authors: Ying Hua Tan, Chee Seng Chan

    Abstract: A picture is worth a thousand words. Not until recently, however, we noticed some success stories in understanding of visual scenes: a model that is able to detect/name objects, describe their attributes, and recognize their relationships/interactions. In this paper, we propose a phrase-based hierarchical Long Short-Term Memory (phi-LSTM) model to generate image description. The proposed model enc… ▽ More

    Submitted 26 October, 2017; v1 submitted 20 August, 2016; originally announced August 2016.

    Comments: This paper introduces phrase-based image captioning. Accepted in ACCV2016 (extended version, 21 pages, 12 figures)

  35. Crowd Behavior Analysis: A Review where Physics meets Biology

    Authors: Ven Jyn Kok, Mei Kuan Lim, Chee Seng Chan

    Abstract: Although the traits emerged in a mass gathering are often non-deliberative, the act of mass impulse may lead to irre- vocable crowd disasters. The two-fold increase of carnage in crowd since the past two decades has spurred significant advances in the field of computer vision, towards effective and proactive crowd surveillance. Computer vision stud- ies related to crowd are observed to resonate wi… ▽ More

    Submitted 20 November, 2015; originally announced November 2015.

    Comments: Accepted in Neurocomputing, 31 pages, 180 references

    Journal ref: Neurocomputing 177 (2016) 342-362

  36. arXiv:1506.08425  [pdf, other

    cs.CV cs.AI cs.NE

    Deep-Plant: Plant Identification with convolutional neural networks

    Authors: Sue Han Lee, Chee Seng Chan, Paul Wilkin, Paolo Remagnino

    Abstract: This paper studies convolutional neural networks (CNN) to learn unsupervised feature representations for 44 different plant species, collected at the Royal Botanic Gardens, Kew, England. To gain intuition on the chosen features from the CNN model (opposed to a 'black box' solution), a visualisation technique based on the deconvolutional networks (DN) is utilized. It is found that venations of diff… ▽ More

    Submitted 28 June, 2015; originally announced June 2015.

    Comments: 6 pages, 8 figures, accepted as oral presentation in ICIP2015, Québec City, Canada

  37. Fuzzy human motion analysis: A review

    Authors: Chern Hong Lim, Ekta Vats, Chee Seng Chan

    Abstract: Human Motion Analysis (HMA) is currently one of the most popularly active research domains as such significant research interests are motivated by a number of real world applications such as video surveillance, sports analysis, healthcare monitoring and so on. However, most of these real world applications face high levels of uncertainties that can affect the operations of such applications. Hence… ▽ More

    Submitted 2 December, 2014; v1 submitted 1 December, 2014; originally announced December 2014.

    Comments: Accepted in Pattern Recognition, first survey paper that discusses and reviews fuzzy approaches towards HMA

    Journal ref: Pattern Recognition 48(5) 2015 1773-1796

  38. arXiv:1410.3932  [pdf, other

    cs.CV

    Detection of Salient Regions in Crowded Scenes

    Authors: Mei Kuan Lim, Chee Seng Chan, Dorothy Monekosso, Paolo Remagnino

    Abstract: The increasing number of cameras and a handful of human operators to monitor the video inputs from hundreds of cameras leave the system ill equipped to fulfil the task of detecting anomalies. Thus, there is a dire need to automatically detect regions that require immediate attention for a more effective and proactive surveillance. We propose a framework that utilises the temporal variations in the… ▽ More

    Submitted 15 October, 2014; originally announced October 2014.

    Comments: Accepted in Electronics Letters Vol. 5, Issue 5

  39. arXiv:1410.3756  [pdf, other

    cs.CV stat.ML

    Crowd Saliency Detection via Global Similarity Structure

    Authors: Mei Kuan Lim, Ven Jyn Kok, Chen Change Loy, Chee Seng Chan

    Abstract: It is common for CCTV operators to overlook inter- esting events taking place within the crowd due to large number of people in the crowded scene (i.e. marathon, rally). Thus, there is a dire need to automate the detection of salient crowd regions acquiring immediate attention for a more effective and proactive surveillance. This paper proposes a novel framework to identify and localize salient re… ▽ More

    Submitted 14 October, 2014; originally announced October 2014.

    Comments: Accepted in ICPR 2014 (Oral). Mei Kuan Lim and Ven Jyn Kok share equal contributions

  40. arXiv:1410.3752  [pdf, ps, other

    cs.CV stat.ML

    Enhanced Random Forest with Image/Patch-Level Learning for Image Understanding

    Authors: Wai Lam Hoo, Tae-Kyun Kim, Yuru Pei, Chee Seng Chan

    Abstract: Image understanding is an important research domain in the computer vision due to its wide real-world applications. For an image understanding framework that uses the Bag-of-Words model representation, the visual codebook is an essential part. Random forest (RF) as a tree-structure discriminative codebook has been a popular choice. However, the performance of the RF can be degraded if the local pa… ▽ More

    Submitted 14 October, 2014; originally announced October 2014.

    Comments: Accepted in ICPR 2014 (Oral)

  41. A Fusion Approach for Efficient Human Skin Detection

    Authors: Wei Ren Tan, Chee Seng Chan, Pratheepan Yogarajah, Joan Condell

    Abstract: A reliable human skin detection method that is adaptable to different human skin colours and illu- mination conditions is essential for better human skin segmentation. Even though different human skin colour detection solutions have been successfully applied, they are prone to false skin detection and are not able to cope with the variety of human skin colours across different ethnic. Moreover, ex… ▽ More

    Submitted 14 October, 2014; originally announced October 2014.

    Comments: Accepted in IEEE Transactions on Industrial Informatics, vol. 8(1), pp. 138-147, new skin detection + ground truth (Pratheepan) dataset

    Journal ref: IEEE Transactions on Industrial Informatics, vol. 8(1), pp. 138-147, 2012

  42. Zero-Shot Object Recognition System based on Topic Model

    Authors: Wai Lam Hoo, Chee Seng Chan

    Abstract: Object recognition systems usually require fully complete manually labeled training data to train the classifier. In this paper, we study the problem of object recognition where the training samples are missing during the classifier learning stage, a task also known as zero-shot learning. We propose a novel zero-shot learning strategy that utilizes the topic model and hierarchical class concept. O… ▽ More

    Submitted 14 October, 2014; originally announced October 2014.

    Comments: To appear in IEEE Transactions on Human-Machine Systems

  43. Refined Particle Swarm Intelligence Method for Abrupt Motion Tracking

    Authors: Mei Kuan Lim, Chee Seng Chan, Dorothy Monekosso, Paolo Remagnino

    Abstract: Conventional tracking solutions are not feasible in handling abrupt motion as they are based on smooth motion assumption or an accurate motion model. Abrupt motion is not subject to motion continuity and smoothness. To assuage this, we deem tracking as an optimisation problem and propose a novel abrupt motion tracker that based on swarm intelligence - the SwaTrack. Unlike existing swarm-based filt… ▽ More

    Submitted 14 October, 2014; originally announced October 2014.

    Comments: Accepted in Information Sciences, new abrupt motion (MAMo) dataset is introduced

    Journal ref: Information Sciences, vol. 283, pp. 267-287, 2014

  44. Scene Image is Non-Mutually Exclusive - A Fuzzy Qualitative Scene Understanding

    Authors: Chern Hong Lim, Anhar Risnumawan, Chee Seng Chan

    Abstract: Ambiguity or uncertainty is a pervasive element of many real world decision making processes. Variation in decisions is a norm in this situation when the same problem is posed to different subjects. Psychological and metaphysical research had proven that decision making by human is subjective. It is influenced by many factors such as experience, age, background, etc. Scene understanding is one of… ▽ More

    Submitted 14 October, 2014; originally announced October 2014.

    Comments: Accepted in IEEE Transactions on Fuzzy Systems

    Journal ref: IEEE Transactions on Fuzzy Systems, vol. 22(6), pp. 1541 - 1556, 2014