Skip to main content

Showing 1–10 of 10 results for author: Sim, T

  1. arXiv:2403.06381  [pdf, other

    cs.CV

    Enhancing Semantic Fidelity in Text-to-Image Synthesis: Attention Regulation in Diffusion Models

    Authors: Yang Zhang, Teoh Tze Tzun, Lim Wei Hern, Tiviatis Sim, Kenji Kawaguchi

    Abstract: Recent advancements in diffusion models have notably improved the perceptual quality of generated images in text-to-image synthesis tasks. However, diffusion models often struggle to produce images that accurately reflect the intended semantics of the associated text prompts. We examine cross-attention layers in diffusion models and observe a propensity for these layers to disproportionately focus… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

  2. arXiv:2401.16559  [pdf, other

    cs.CV

    IEEE BigData 2023 Keystroke Verification Challenge (KVC)

    Authors: Giuseppe Stragapede, Ruben Vera-Rodriguez, Ruben Tolosana, Aythami Morales, Ivan DeAndres-Tame, Naser Damer, Julian Fierrez, Javier-Ortega Garcia, Nahuel Gonzalez, Andrei Shadrikov, Dmitrii Gordin, Leon Schmitt, Daniel Wimmer, Christoph Grossmann, Joerdis Krieger, Florian Heinz, Ron Krestel, Christoffer Mayer, Simon Haberl, Helena Gschrey, Yosuke Yamagishi, Sanjay Saha, Sanka Rasnayaka, Sandareka Wickramanayake, Terence Sim , et al. (4 additional authors not shown)

    Abstract: This paper describes the results of the IEEE BigData 2023 Keystroke Verification Challenge (KVC), that considers the biometric verification performance of Keystroke Dynamics (KD), captured as tweet-long sequences of variable transcript text from over 185,000 subjects. The data are obtained from two of the largest public databases of KD up to date, the Aalto Desktop and Mobile Keystroke Databases,… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: 9 pages, 10 pages, 2 figures. arXiv admin note: text overlap with arXiv:2311.06000

  3. arXiv:2305.06564  [pdf, other

    cs.CV

    Undercover Deepfakes: Detecting Fake Segments in Videos

    Authors: Sanjay Saha, Rashindrie Perera, Sachith Seneviratne, Tamasha Malepathirana, Sanka Rasnayaka, Deshani Geethika, Terence Sim, Saman Halgamuge

    Abstract: The recent renaissance in generative models, driven primarily by the advent of diffusion models and iterative improvement in GAN methods, has enabled many creative applications. However, each advancement is also accompanied by a rise in the potential for misuse. In the arena of the deepfake generation, this is a key societal issue. In particular, the ability to modify segments of videos using such… ▽ More

    Submitted 24 August, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

    Comments: ICCV 2023 Workshop and Challenge on DeepFake Analysis and Detection

  4. Is Face Recognition Safe from Realizable Attacks?

    Authors: Sanjay Saha, Terence Sim

    Abstract: Face recognition is a popular form of biometric authentication and due to its widespread use, attacks have become more common as well. Recent studies show that Face Recognition Systems are vulnerable to attacks and can lead to erroneous identification of faces. Interestingly, most of these attacks are white-box, or they are manipulating facial images in ways that are not physically realizable. In… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: 2020 IEEE International Joint Conference on Biometrics (IJCB)

    Journal ref: 2020 IEEE International Joint Conference on Biometrics (IJCB), Houston, TX, USA, 2020

  5. arXiv:2202.03639  [pdf, ps, other

    cs.LG

    Contrastive predictive coding for Anomaly Detection in Multi-variate Time Series Data

    Authors: Theivendiram Pranavan, Terence Sim, Arulmurugan Ambikapathi, Savitha Ramasamy

    Abstract: Anomaly detection in multi-variate time series (MVTS) data is a huge challenge as it requires simultaneous representation of long term temporal dependencies and correlations across multiple variables. More often, this is solved by breaking the complexity through modeling one dependency at a time. In this paper, we propose a Time-series Representational Learning through Contrastive Predictive Codin… ▽ More

    Submitted 7 February, 2022; originally announced February 2022.

  6. Understanding Humans in Crowded Scenes: Deep Nested Adversarial Learning and A New Benchmark for Multi-Human Parsing

    Authors: Jian Zhao, Jianshu Li, Yu Cheng, Li Zhou, Terence Sim, Shuicheng Yan, Jiashi Feng

    Abstract: Despite the noticeable progress in perceptual tasks like detection, instance segmentation and human parsing, computers still perform unsatisfactorily on visually understanding humans in crowded scenes, such as group behavior analysis, person re-identification and autonomous driving, etc. To this end, models need to comprehensively perceive the semantic information and the differences between insta… ▽ More

    Submitted 6 July, 2018; v1 submitted 9 April, 2018; originally announced April 2018.

    Comments: The first three authors are with equal contributions

  7. arXiv:1711.06055  [pdf, other

    cs.CV

    Integrated Face Analytics Networks through Cross-Dataset Hybrid Training

    Authors: Jianshu Li, Shengtao Xiao, Fang Zhao, Jian Zhao, Jianan Li, Jiashi Feng, Shuicheng Yan, Terence Sim

    Abstract: Face analytics benefits many multimedia applications. It consists of a number of tasks, such as facial emotion recognition and face parsing, and most existing approaches generally treat these tasks independently, which limits their deployment in real scenarios. In this paper we propose an integrated Face Analytics Network (iFAN), which is able to perform multiple tasks jointly for face analytics w… ▽ More

    Submitted 16 November, 2017; originally announced November 2017.

    Comments: 10 pages

  8. arXiv:1705.07206  [pdf, other

    cs.CV

    Multiple-Human Parsing in the Wild

    Authors: Jianshu Li, Jian Zhao, Yunchao Wei, Congyan Lang, Yidong Li, Terence Sim, Shuicheng Yan, Jiashi Feng

    Abstract: Human parsing is attracting increasing research attention. In this work, we aim to push the frontier of human parsing by introducing the problem of multi-human parsing in the wild. Existing works on human parsing mainly tackle single-person scenarios, which deviates from real-world applications where multiple persons are present simultaneously with interaction and occlusion. To address the multi-h… ▽ More

    Submitted 14 March, 2018; v1 submitted 19 May, 2017; originally announced May 2017.

    Comments: The first two authors are with equal contribution

  9. arXiv:1507.04441   

    cs.HC

    Eye-2-I: Eye-tracking for just-in-time implicit user profiling

    Authors: Keng-Teck Ma, Qianli Xu, Liyuan Li, Terence Sim, Mohan Kankanhalli, Rosary Lim

    Abstract: For many applications, such as targeted advertising and content recommendation, knowing users' traits and interests is a prerequisite. User profiling is a helpful approach for this purpose. However, current methods, i.e. self-reporting, web-activity monitoring and social media mining are either intrusive or require data over long periods of time. Recently, there is growing evidence in cognitive sc… ▽ More

    Submitted 13 April, 2016; v1 submitted 15 July, 2015; originally announced July 2015.

    Comments: A bug was found in the codes which resulted in information leak. New experimental results will be updated at a later date. I assume all responsibility for this mistake. KT Ma

    ACM Class: H.3.4

  10. arXiv:1403.7876  [pdf, other

    cs.CV

    Correlation Filters with Limited Boundaries

    Authors: Hamed Kiani Galoogahi, Terence Sim, Simon Lucey

    Abstract: Correlation filters take advantage of specific properties in the Fourier domain allowing them to be estimated efficiently: O(NDlogD) in the frequency domain, versus O(D^3 + ND^2) spatially where D is signal length, and N is the number of signals. Recent extensions to correlation filters, such as MOSSE, have reignited interest of their use in the vision community due to their robustness and attract… ▽ More

    Submitted 31 March, 2014; originally announced March 2014.

    Comments: 8 pages, 6 figures, 2 tables