Skip to main content

Showing 1–31 of 31 results for author: Yap, K

  1. arXiv:2406.11340  [pdf, other

    cs.CV cs.LG

    CM2-Net: Continual Cross-Modal Mapping Network for Driver Action Recognition

    Authors: Ruoyu Wang, Chen Cai, Wenqian Wang, Jianjun Gao, Dan Lin, Wenyang Liu, Kim-Hui Yap

    Abstract: Driver action recognition has significantly advanced in enhancing driver-vehicle interactions and ensuring driving safety by integrating multiple modalities, such as infrared and depth. Nevertheless, compared to RGB modality only, it is always laborious and costly to collect extensive data for all types of non-RGB modalities in car cabin environments. Therefore, previous works have suggested indep… ▽ More

    Submitted 18 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2404.13611  [pdf, other

    cs.CV cs.CL

    Video sentence grounding with temporally global textual knowledge

    Authors: Cai Chen, Runzhong Zhang, Jianjun Gao, Kejun Wu, Kim-Hui Yap, Yi Wang

    Abstract: Temporal sentence grounding involves the retrieval of a video moment with a natural language query. Many existing works directly incorporate the given video and temporally localized query for temporal grounding, overlooking the inherent domain gap between different modalities. In this paper, we utilize pseudo-query features containing extensive temporally global textual knowledge sourced from the… ▽ More

    Submitted 1 June, 2024; v1 submitted 21 April, 2024; originally announced April 2024.

  3. arXiv:2401.14838  [pdf, other

    cs.CV

    Multi-modality action recognition based on dual feature shift in vehicle cabin monitoring

    Authors: Dan Lin, Philip Hann Yung Lee, Yiming Li, Ruoyu Wang, Kim-Hui Yap, Bingbing Li, You Shing Ngim

    Abstract: Driver Action Recognition (DAR) is crucial in vehicle cabin monitoring systems. In real-world applications, it is common for vehicle cabins to be equipped with cameras featuring different modalities. However, multi-modality fusion strategies for the DAR task within car cabins have rarely been studied. In this paper, we propose a novel yet efficient multi-modality driver action recognition method b… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

  4. arXiv:2401.08126  [pdf, other

    cs.NI

    Octopus: A Fair Packet Delivery Service

    Authors: Junzhi Gong, Yuliang Li, Devdeep Ray, KK Yap, Nandita Dukkipati

    Abstract: The packet delivery fairness is critical in many applications in the cloud, such as exchange systems, consensus protocols, and online gaming applications. However, due to nonidentical and dynamic packet forwarding paths, as well as many in-network queuing delays, supporting packet delivery fairness is challenging in a shared compute environment. In this paper, we present Octopus, the first general… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

  5. arXiv:2311.06070  [pdf, other

    cs.CV

    Learning-Based Biharmonic Augmentation for Point Cloud Classification

    Authors: Jiacheng Wei, Guosheng Lin, Henghui Ding, Jie Hu, Kim-Hui Yap

    Abstract: Point cloud datasets often suffer from inadequate sample sizes in comparison to image datasets, making data augmentation challenging. While traditional methods, like rigid transformations and scaling, have limited potential in increasing dataset diversity due to their constraints on altering individual sample shapes, we introduce the Biharmonic Augmentation (BA) method. BA is a novel and efficient… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

  6. arXiv:2309.13890  [pdf, other

    cs.CV eess.IV

    Bitstream-Corrupted Video Recovery: A Novel Benchmark Dataset and Method

    Authors: Tianyi Liu, Kejun Wu, Yi Wang, Wenyang Liu, Kim-Hui Yap, Lap-Pui Chau

    Abstract: The past decade has witnessed great strides in video recovery by specialist technologies, like video inpainting, completion, and error concealment. However, they typically simulate the missing content by manual-designed error masks, thus failing to fill in the realistic video loss in video communication (e.g., telepresence, live streaming, and internet video) and multimedia forensics. To address t… ▽ More

    Submitted 26 September, 2023; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: Accepted by NeurIPS Dataset and Benchmark Track 2023

  7. arXiv:2309.10360  [pdf, other

    cs.CV cs.AI

    OccluTrack: Rethinking Awareness of Occlusion for Enhancing Multiple Pedestrian Tracking

    Authors: Jianjun Gao, Yi Wang, Kim-Hui Yap, Kratika Garg, Boon Siew Han

    Abstract: Multiple pedestrian tracking faces the challenge of tracking pedestrians in the presence of occlusion. Existing methods suffer from inaccurate motion estimation, appearance feature extraction, and association due to occlusion, leading to inadequate Identification F1-Score (IDF1), excessive ID switches (IDSw), and insufficient association accuracy and recall (AssA and AssR). We found that the main… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

  8. arXiv:2308.16763  [pdf, other

    cs.CL cs.AI

    Ladder-of-Thought: Using Knowledge as Steps to Elevate Stance Detection

    Authors: Kairui Hu, Ming Yan, Joey Tianyi Zhou, Ivor W. Tsang, Wen Haw Chong, Yong Keong Yap

    Abstract: Stance detection aims to identify the attitude expressed in a document towards a given target. Techniques such as Chain-of-Thought (CoT) prompting have advanced this task, enhancing a model's reasoning capabilities through the derivation of intermediate rationales. However, CoT relies primarily on a model's pre-trained internal knowledge during reasoning, thereby neglecting the valuable external i… ▽ More

    Submitted 7 September, 2023; v1 submitted 31 August, 2023; originally announced August 2023.

    Comments: 5 pages, 2 figures, 2 tables

  9. arXiv:2306.07490  [pdf, other

    cs.CV

    Top-Down Framework for Weakly-supervised Grounded Image Captioning

    Authors: Chen Cai, Suchen Wang, Kim-hui Yap, Yi Wang

    Abstract: Weakly-supervised grounded image captioning (WSGIC) aims to generate the caption and ground (localize) predicted object words in the input image without using bounding box supervision. Recent two-stage solutions mostly apply a bottom-up pipeline: (1) encode the input image into multiple region features using an object detector; (2) leverage region features for captioning and grounding. However, ut… ▽ More

    Submitted 2 March, 2024; v1 submitted 12 June, 2023; originally announced June 2023.

  10. arXiv:2305.19845  [pdf, other

    cs.CL

    Guiding Computational Stance Detection with Expanded Stance Triangle Framework

    Authors: Zhengyuan Liu, Yong Keong Yap, Hai Leong Chieu, Nancy F. Chen

    Abstract: Stance detection determines whether the author of a piece of text is in favor of, against, or neutral towards a specified target, and can be used to gain valuable insights into social media. The ubiquitous indirect referral of targets makes this task challenging, as it requires computational solutions to model semantic features and infer the corresponding implications from a literal statement. Mor… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: Main Conference in ACL 2023

  11. arXiv:2304.11404  [pdf, other

    cs.CV eess.IV eess.SP

    SSN: Stockwell Scattering Network for SAR Image Change Detection

    Authors: Gong Chen, Yanan Zhao, Yi Wang, Kim-Hui Yap

    Abstract: Recently, synthetic aperture radar (SAR) image change detection has become an interesting yet challenging direction due to the presence of speckle noise. Although both traditional and modern learning-driven methods attempted to overcome this challenge, deep convolutional neural networks (DCNNs)-based methods are still hindered by the lack of interpretability and the requirement of large computatio… ▽ More

    Submitted 22 April, 2023; originally announced April 2023.

    Comments: 5 pages, 6 figures

    MSC Class: 53-04 ACM Class: I.2.1

    Journal ref: IEEE Geoscience and Remote Sensing Letters, 2023

  12. arXiv:2304.06983  [pdf, other

    cs.CV eess.SP

    A Byte Sequence is Worth an Image: CNN for File Fragment Classification Using Bit Shift and n-Gram Embeddings

    Authors: Wenyang Liu, Yi Wang, Kejun Wu, Kim-Hui Yap, Lap-Pui Chau

    Abstract: File fragment classification (FFC) on small chunks of memory is essential in memory forensics and Internet security. Existing methods mainly treat file fragments as 1d byte signals and utilize the captured inter-byte features for classification, while the bit information within bytes, i.e., intra-byte information, is seldom considered. This is inherently inapt for classifying variable-length codin… ▽ More

    Submitted 14 April, 2023; originally announced April 2023.

    Comments: Accepted by AICAS 2023

  13. arXiv:2304.06976  [pdf, other

    eess.IV cs.CV

    Bitstream-Corrupted JPEG Images are Restorable: Two-stage Compensation and Alignment Framework for Image Restoration

    Authors: Wenyang Liu, Yi Wang, Kim-Hui Yap, Lap-Pui Chau

    Abstract: In this paper, we study a real-world JPEG image restoration problem with bit errors on the encrypted bitstream. The bit errors bring unpredictable color casts and block shifts on decoded image contents, which cannot be resolved by existing image restoration methods mainly relying on pre-defined degradation models in the pixel domain. To address these challenges, we propose a robust JPEG decoder, f… ▽ More

    Submitted 14 April, 2023; originally announced April 2023.

    Comments: Accepted by CVPR 2023

  14. arXiv:2303.13273  [pdf, other

    cs.CV

    TAPS3D: Text-Guided 3D Textured Shape Generation from Pseudo Supervision

    Authors: Jiacheng Wei, Hao Wang, Jiashi Feng, Guosheng Lin, Kim-Hui Yap

    Abstract: In this paper, we investigate an open research task of generating controllable 3D textured shapes from the given textual descriptions. Previous works either require ground truth caption labeling or extensive optimization time. To resolve these issues, we present a novel framework, TAPS3D, to train a text-guided 3D shape generator with pseudo captions. Specifically, based on rendered 2D images, we… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: Accepted to CVPR2023

  15. Dense Supervision Propagation for Weakly Supervised Semantic Segmentation on 3D Point Clouds

    Authors: Jiacheng Wei, Guosheng Lin, Kim-Hui Yap, Fayao Liu, Tzu-Yi Hung

    Abstract: Semantic segmentation on 3D point clouds is an important task for 3D scene understanding. While dense labeling on 3D data is expensive and time-consuming, only a few works address weakly supervised semantic point cloud segmentation methods to relieve the labeling cost by learning from simpler and cheaper labels. Meanwhile, there are still huge performance gaps between existing weakly supervised me… ▽ More

    Submitted 1 April, 2024; v1 submitted 23 July, 2021; originally announced July 2021.

  16. arXiv:2106.00256  [pdf, other

    cs.CV

    Reconciliation of Statistical and Spatial Sparsity For Robust Image and Image-Set Classification

    Authors: Hao Cheng, Kim-Hui Yap, Bihan Wen

    Abstract: Recent image classification algorithms, by learning deep features from large-scale datasets, have achieved significantly better results comparing to the classic feature-based approaches. However, there are still various challenges of image classifications in practice, such as classifying noisy image or image-set queries and training deep image classification models over the limited-scale dataset.… ▽ More

    Submitted 1 June, 2021; originally announced June 2021.

    Comments: Submitted to IEEE Transactions on Multimedia

  17. arXiv:2105.03856  [pdf, ps, other

    cs.SC

    The D-plus Discriminant and Complexity of Root Clustering

    Authors: Jing Yang, Chee K. Yap

    Abstract: Let $p(x)$ be an integer polynomial with $m\ge 2$ distinct roots $ρ_1,\ldots,ρ_m$ whose multiplicities are $\boldsymbolμ=(μ_1,\ldots,μ_m)$. We define the D-plus discriminant of $p(x)$ to be $D^+(p):= \prod_{1\le i<j\le m}(ρ_i-ρ_j)^{μ_i+μ_j}$. We first prove a conjecture that $D^+(p)$ is a $\boldsymbolμ$-symmetric function of its roots $ρ_1,\ldots,ρ_m$. Our main result gives an explicit formula for… ▽ More

    Submitted 19 May, 2021; v1 submitted 9 May, 2021; originally announced May 2021.

    MSC Class: 68W30; 11R29; 68Q25

  18. arXiv:2006.14265  [pdf, other

    cs.LG cs.CV stat.ML

    Empirical Analysis of Overfitting and Mode Drop in GAN Training

    Authors: Yasin Yazici, Chuan-Sheng Foo, Stefan Winkler, Kim-Hui Yap, Vijay Chandrasekhar

    Abstract: We examine two key questions in GAN training, namely overfitting and mode drop, from an empirical perspective. We show that when stochasticity is removed from the training procedure, GANs can overfit and exhibit almost no mode drop. Our results shed light on important characteristics of the GAN training procedure. They also provide evidence against prevailing intuitions that GANs do not memorize t… ▽ More

    Submitted 25 June, 2020; originally announced June 2020.

    Comments: To appear in ICIP2020

  19. arXiv:2006.14256  [pdf, other

    cs.AR

    Arnold: an eFPGA-Augmented RISC-V SoC for Flexible and Low-Power IoT End-Nodes

    Authors: Pasquale Davide Schiavone, Davide Rossi, Alfio Di Mauro, Frank Gurkaynak, Timothy Saxe, Mao Wang, Ket Chong Yap, Luca Benini

    Abstract: A wide range of Internet of Things (IoT) applications require powerful, energy-efficient and flexible end-nodes to acquire data from multiple sources, process and distill the sensed data through near-sensor data analytics algorithms, and transmit it wirelessly. This work presents Arnold: a 0.5 V to 0.8 V, 46.83 uW/MHz, 600 MOPS fully programmable RISC-V Microcontroller unit (MCU) fabricated in 22… ▽ More

    Submitted 25 June, 2020; originally announced June 2020.

  20. arXiv:2003.13035  [pdf, other

    cs.CV

    Multi-Path Region Mining For Weakly Supervised 3D Semantic Segmentation on Point Clouds

    Authors: Jiacheng Wei, Guosheng Lin, Kim-Hui Yap, Tzu-Yi Hung, Lihua Xie

    Abstract: Point clouds provide intrinsic geometric information and surface context for scene understanding. Existing methods for point cloud segmentation require a large amount of fully labeled data. Using advanced depth sensors, collection of large scale 3D dataset is no longer a cumbersome process. However, manually producing point-level label on the large scale dataset is time and labor-intensive. In thi… ▽ More

    Submitted 29 March, 2020; originally announced March 2020.

    Comments: Accepted by CVPR2020

  21. arXiv:2001.07403  [pdf, other

    cs.SC

    On mu-Symmetric Polynomials

    Authors: Jing Yang, Chee K. Yap

    Abstract: In this paper, we study functions of the roots of a univariate polynomial in which the roots have a given multiplicity structure $μ$. Traditionally, root functions are studied via the theory of symmetric polynomials; we extend this theory to $μ$-symmetric polynomials. We were motivated by a conjecture from Becker et al.~(ISSAC 2016) about the $μ$-symmetry of a particular root function $D^+(μ)$, ca… ▽ More

    Submitted 21 January, 2020; originally announced January 2020.

  22. arXiv:1912.09021  [pdf, other

    cs.CV

    AANet: Attribute Attention Network for Person Re-Identifications

    Authors: Chiat-Pin Tay, Sharmili Roy, Kim-Hui Yap

    Abstract: This paper proposes Attribute Attention Network (AANet), a new architecture that integrates person attributes and attribute attention maps into a classification framework to solve the person re-identification (re-ID) problem. Many person re-ID models typically employ semantic cues such as body parts or human pose to improve the re-ID performance. Attribute information, however, is often not utiliz… ▽ More

    Submitted 19 December, 2019; originally announced December 2019.

    Comments: CVPR 2019

  23. arXiv:1911.06047  [pdf, other

    cs.CV cs.LG

    Semantic Granularity Metric Learning for Visual Search

    Authors: Dipu Manandhar, Muhammet Bastan, Kim-Hui Yap

    Abstract: Deep metric learning applied to various applications has shown promising results in identification, retrieval and recognition. Existing methods often do not consider different granularity in visual similarity. However, in many domain applications, images exhibit similarity at multiple granularities with visual semantic concepts, e.g. fashion demonstrates similarity ranging from clothing of the exa… ▽ More

    Submitted 14 November, 2019; originally announced November 2019.

    Comments: 10 pages, 10 figures

  24. arXiv:1902.03444  [pdf, other

    cs.LG stat.ML

    Venn GAN: Discovering Commonalities and Particularities of Multiple Distributions

    Authors: Yasin Yazıcı, Bruno Lecouat, Chuan-Sheng Foo, Stefan Winkler, Kim-Hui Yap, Georgios Piliouras, Vijay Chandrasekhar

    Abstract: We propose a GAN design which models multiple distributions effectively and discovers their commonalities and particularities. Each data distribution is modeled with a mixture of $K$ generator distributions. As the generators are partially shared between the modeling of different true data distributions, shared ones captures the commonality of the distributions, while non-shared ones capture uniqu… ▽ More

    Submitted 9 February, 2019; originally announced February 2019.

  25. arXiv:1901.00031  [pdf, ps, other

    cs.CV

    Interest Point Detection based on Adaptive Ternary Coding

    Authors: Zhenwei Miao, Kim-Hui Yap, Xudong Jiang

    Abstract: In this paper, an adaptive pixel ternary coding mechanism is proposed and a contrast invariant and noise resistant interest point detector is developed on the basis of this mechanism. Every pixel in a local region is adaptively encoded into one of the three statuses: bright, uncertain and dark. The blob significance of the local region is measured by the spatial distribution of the bright and dark… ▽ More

    Submitted 31 December, 2018; originally announced January 2019.

  26. arXiv:1901.00027  [pdf, other

    cs.CV

    DCI: Discriminative and Contrast Invertible Descriptor

    Authors: Zhenwei Miao, Kim-Hui Yap, Xudong Jiang, Subbhuraam Sinduja, Zhenhua Wang

    Abstract: Local feature descriptors have been widely used in fine-grained visual object search thanks to their robustness in scale and rotation variation and cluttered background. However, the performance of such descriptors drops under severe illumination changes. In this paper, we proposed a Discriminative and Contrast Invertible (DCI) local feature descriptor. In order to increase the discriminative abil… ▽ More

    Submitted 31 December, 2018; originally announced January 2019.

  27. arXiv:1806.04498  [pdf, other

    stat.ML cs.CV cs.LG

    The Unusual Effectiveness of Averaging in GAN Training

    Authors: Yasin Yazıcı, Chuan-Sheng Foo, Stefan Winkler, Kim-Hui Yap, Georgios Piliouras, Vijay Chandrasekhar

    Abstract: We examine two different techniques for parameter averaging in GAN training. Moving Average (MA) computes the time-average of parameters, whereas Exponential Moving Average (EMA) computes an exponentially discounted sum. Whilst MA is known to lead to convergence in bilinear settings, we provide the -- to our knowledge -- first theoretical arguments in support of EMA. We show that EMA converges to… ▽ More

    Submitted 26 February, 2019; v1 submitted 12 June, 2018; originally announced June 2018.

    Comments: Published as a conference paper at ICLR 2019

  28. arXiv:1804.10805  [pdf, ps, other

    cs.CV

    Remote Detection of Idling Cars Using Infrared Imaging and Deep Networks

    Authors: Muhammet Bastan, Kim-Hui Yap, Lap-Pui Chau

    Abstract: Idling vehicles waste energy and pollute the environment through exhaust emission. In some countries, idling a vehicle for more than a predefined duration is prohibited and automatic idling vehicle detection is desirable for law enforcement. We propose the first automatic system to detect idling cars, using infrared (IR) imaging and deep networks. We rely on the differences in spatio-temporal he… ▽ More

    Submitted 28 April, 2018; originally announced April 2018.

    Comments: Neural Computing and Applications

  29. Handling state space explosion in verification of component-based systems: A review

    Authors: Faranak Nejati, Abdul Azim Abd. Ghani, Ng Keng Yap, Azmi Jaafar

    Abstract: Component-based software development (CBSD) is an alternative approach to constructing software systems that offers numerous benefits, particularly in decreasing the complexity of system design. However, deploying components into a system is a challenging and error-prone task. Model-checking is one of the reliable methods to systematically analyze the correctness of a system. It is a bruce-force c… ▽ More

    Submitted 26 May, 2021; v1 submitted 28 July, 2017; originally announced September 2017.

    Journal ref: IEEEAccess, 2021

  30. arXiv:1704.05123  [pdf, other

    cs.CG cs.RO

    Resolution-Exact Planner for Thick Non-Crossing 2-Link Robots

    Authors: Chee K. Yap, Zhongdi Luo, Ching-Hsiang Hsu

    Abstract: We consider the path planning problem for a 2-link robot amidst polygonal obstacles. Our robot is parametrizable by the lengths $\ell_1, \ell_2>0$ of its two links, the thickness $τ\ge 0$ of the links, and an angle $κ$ that constrains the angle between the 2 links to be strictly greater than $κ$. The case $τ>0$ and $κ\ge 0$ corresponds to "thick non-crossing" robots. This results in a novel 4DOF c… ▽ More

    Submitted 17 April, 2017; originally announced April 2017.

  31. arXiv:1506.06265  [pdf, ps, other

    cs.CG

    Certified Computation of planar Morse-Smale Complexes

    Authors: Amit Chattopadhyay, Gert Vegter, Chee K. Yap

    Abstract: The Morse-Smale complex is an important tool for global topological analysis in various problems of computational geometry and topology. Algorithms for Morse-Smale complexes have been presented in case of piecewise linear manifolds. However, previous research in this field is incomplete in the case of smooth functions. In the current paper we address the following question: Given an arbitrarily co… ▽ More

    Submitted 20 June, 2015; originally announced June 2015.

    Comments: Under Review in Journal