
Showing 1–6 of 6 results for author: Bhaumik, K K

  1. arXiv:2401.02113  [pdf, other]

    cs.CV

    Source-Free Online Domain Adaptive Semantic Segmentation of Satellite Images under Image Degradation

    Authors: Fahim Faisal Niloy, Kishor Kumar Bhaumik, Simon S. Woo

    Abstract: Online adaptation to distribution shifts in satellite image segmentation stands as a crucial yet underexplored problem. In this paper, we address source-free and online domain adaptation, i.e., test-time adaptation (TTA), for satellite images, with the focus on mitigating distribution shifts caused by various forms of image degradation. Towards achieving this goal, we propose a novel TTA approach…

    Submitted 4 January, 2024; originally announced January 2024.

    Comments: ICASSP 2024

  2. arXiv:2307.11052  [pdf, other]

    cs.CV

    HRFNet: High-Resolution Forgery Network for Localizing Satellite Image Manipulation

    Authors: Fahim Faisal Niloy, Kishor Kumar Bhaumik, Simon S. Woo

    Abstract: Existing high-resolution satellite image forgery localization methods rely on patch-based or downsampling-based training. Both of these training methods have major drawbacks, such as inaccurate boundaries between pristine and forged regions, the generation of unwanted artifacts, etc. To tackle the aforementioned challenges, inspired by the high-resolution image segmentation literature, we propose…

    Submitted 20 July, 2023; originally announced July 2023.

    Comments: ICIP 2023

  3. arXiv:2212.04548  [pdf, other]

    cs.LG

    STLGRU: Spatio-Temporal Lightweight Graph GRU for Traffic Flow Prediction

    Authors: Kishor Kumar Bhaumik, Fahim Faisal Niloy, Saif Mahmud, Simon Woo

    Abstract: Reliable forecasting of traffic flow requires efficient modeling of traffic data. Indeed, different correlations and influences arise in a dynamic traffic network, making modeling a complicated task. Existing literature has proposed many different methods to capture traffic networks' complex underlying spatial-temporal relations. However, given the heterogeneity of traffic data, consistently captu…

    Submitted 19 February, 2024; v1 submitted 8 December, 2022; originally announced December 2022.

    Comments: PAKDD 2024 (Oral)

  4. arXiv:2210.02182  [pdf, other]

    cs.CV

    CFL-Net: Image Forgery Localization Using Contrastive Learning

    Authors: Fahim Faisal Niloy, Kishor Kumar Bhaumik, Simon S. Woo

    Abstract: Conventional forgery localizing methods usually rely on different forgery footprints such as JPEG artifacts, edge inconsistency, camera noise, etc., with cross-entropy loss to locate manipulated regions. However, these methods have the disadvantage of over-fitting and focusing on only a few specific forgery footprints. On the other hand, real-life manipulated images are generated via a wide variet…

    Submitted 4 October, 2022; originally announced October 2022.

    Comments: WACV 2023

  5. arXiv:2201.00985  [pdf, other]

    cs.CV cs.CL

    Variational Stacked Local Attention Networks for Diverse Video Captioning

    Authors: Tonmoay Deb, Akib Sadmanee, Kishor Kumar Bhaumik, Amin Ahsan Ali, M Ashraful Amin, A K M Mahbubur Rahman

    Abstract: While describing Spatio-temporal events in natural language, video captioning models mostly rely on the encoder's latent visual representation. Recent progress on the encoder-decoder model attends encoder features mainly in linear interaction with the decoder. However, growing model complexity for visual data encourages more explicit feature interaction for fine-grained information, which is curre…

    Submitted 4 January, 2022; originally announced January 2022.

    Comments: To be published in Winter Conference on Applications of Computer Vision 2022

  6. arXiv:2003.09018  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Human Activity Recognition from Wearable Sensor Data Using Self-Attention

    Authors: Saif Mahmud, M Tanjid Hasan Tonmoy, Kishor Kumar Bhaumik, A K M Mahbubur Rahman, M Ashraful Amin, Mohammad Shoyaib, Muhammad Asif Hossain Khan, Amin Ahsan Ali

    Abstract: Human Activity Recognition from body-worn sensor data poses an inherent challenge in capturing spatial and temporal dependencies of time-series signals. In this regard, the existing recurrent or convolutional or their hybrid models for activity recognition struggle to capture spatio-temporal context from the feature space of sensor reading sequence. To address this complex problem, we propose a se…

    Submitted 17 March, 2020; originally announced March 2020.

    Comments: Accepted for publication at the 24th European Conference on Artificial Intelligence (ECAI-2020); 8 pages, 4 figures