Skip to main content

Showing 1–37 of 37 results for author: Saddik, A E

  1. arXiv:2407.00465  [pdf, other

    cs.SD cs.CV cs.LG eess.AS

    Characterizing Continual Learning Scenarios and Strategies for Audio Analysis

    Authors: Ruchi Bhatt, Pratibha Kumari, Dwarikanath Mahapatra, Abdulmotaleb El Saddik, Mukesh Saini

    Abstract: Audio analysis is useful in many application scenarios. The state-of-the-art audio analysis approaches assume that the data distribution at training and deployment time will be the same. However, due to various real-life environmental factors, the data may encounter drift in its distribution or can encounter new classes in the late future. Thus, a one-time trained model might not perform adequatel… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  2. arXiv:2405.07759  [pdf, other

    cs.MM cs.AI cs.NI eess.IV

    MADRL-Based Rate Adaptation for 360° Video Streaming with Multi-Viewpoint Prediction

    Authors: Haopeng Wang, Zijian Long, Haiwei Dong, Abdulmotaleb El Saddik

    Abstract: Over the last few years, 360° video traffic on the network has grown significantly. A key challenge of 360° video playback is ensuring a high quality of experience (QoE) with limited network bandwidth. Currently, most studies focus on tile-based adaptive bitrate (ABR) streaming based on single viewport prediction to reduce bandwidth consumption. However, the performance of models for single-viewpo… ▽ More

    Submitted 17 May, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

    Comments: Accepted by IEEE Internet of Things Journal

  3. Tile-Weighted Rate-Distortion Optimized Packet Scheduling for 360$^\circ$ VR Video Streaming

    Authors: Haopeng Wang, Haiwei Dong, Abdulmotaleb El Saddik

    Abstract: A key challenge of 360$^\circ$ VR video streaming is ensuring high quality with limited network bandwidth. Currently, most studies focus on tile-based adaptive bitrate streaming to reduce bandwidth consumption, where resources in network nodes are not fully utilized. This article proposes a tile-weighted rate-distortion (TWRD) packet scheduling optimization system to reduce data volume and improve… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: Accepted by IEEE Intelligent Systems

  4. How to Cache Important Contents for Multi-modal Service in Dynamic Networks: A DRL-based Caching Scheme

    Authors: Zhe Zhang, Marc St-Hilaire, Xin Wei, Haiwei Dong, Abdulmotaleb El Saddik

    Abstract: With the continuous evolution of networking technologies, multi-modal services that involve video, audio, and haptic contents are expected to become the dominant multimedia service in the near future. Edge caching is a key technology that can significantly reduce network load and content transmission latency, which is critical for the delivery of multi-modal contents. However, existing caching app… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Journal ref: IEEE Transactions on Multimedia (Early Access), 2024

  5. arXiv:2403.18293  [pdf, other

    cs.CV

    Efficient Test-Time Adaptation of Vision-Language Models

    Authors: Adilbek Karmanov, Dayan Guan, Shijian Lu, Abdulmotaleb El Saddik, Eric Xing

    Abstract: Test-time adaptation with pre-trained vision-language models has attracted increasing attention for tackling distribution shifts during the test time. Though prior studies have achieved very promising performance, they involve intensive computation which is severely unaligned with test-time adaptation. We design TDA, a training-free dynamic adapter that enables effective and efficient test-time ad… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024. The code has been released in \url{https://kdiaaa.github.io/tda/}

  6. Experimental Studies of Metaverse Streaming

    Authors: Haopeng Wang, Roberto Martinez-Velazquez, Haiwei Dong, Abdulmotaleb El Saddik

    Abstract: Metaverse aims to construct a large, unified, immersive, and shared digital realm by combining various technologies, namely XR (extended reality), blockchain, and digital twin, among others. This article explores the Metaverse from the perspective of multimedia communication by conducting and analyzing real-world experiments on four different Metaverse platforms: VR (virtual reality) Vircadia, VR… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: Accepted by IEEE Consumer Electronics Magazine

  7. Bringing Robots Home: The Rise of AI Robots in Consumer Electronics

    Authors: Haiwei Dong, Yang Liu, Ted Chu, Abdulmotaleb El Saddik

    Abstract: On March 18, 2024, NVIDIA unveiled Project GR00T, a general-purpose multimodal generative AI model designed specifically for training humanoid robots. Preceding this event, Tesla's unveiling of the Optimus Gen 2 humanoid robot on December 12, 2023, underscored the profound impact robotics is poised to have on reshaping various facets of our daily lives. While robots have long dominated industrial… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: Accepted by IEEE Consumer Electronics Magazine

  8. arXiv:2401.06957  [pdf, other

    cs.CV

    EVOKE: Emotion Enabled Virtual Avatar Mapping Using Optimized Knowledge Distillation

    Authors: Maryam Nadeem, Raza Imam, Rouqaiah Al-Refai, Meriem Chkir, Mohamad Hoda, Abdulmotaleb El Saddik

    Abstract: As virtual environments continue to advance, the demand for immersive and emotionally engaging experiences has grown. Addressing this demand, we introduce Emotion enabled Virtual avatar mapping using Optimized KnowledgE distillation (EVOKE), a lightweight emotion recognition framework designed for the seamless integration of emotion recognition into 3D avatars within virtual environments. Our appr… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

    Comments: Presented at IEEE 42nd International Conference on Consumer Electronics (ICCE) 2024

  9. arXiv:2401.00393  [pdf

    cs.CV cs.AI cs.LG cs.MM eess.IV

    Generative Model-Driven Synthetic Training Image Generation: An Approach to Cognition in Rail Defect Detection

    Authors: Rahatara Ferdousi, Chunsheng Yang, M. Anwar Hossain, Fedwa Laamarti, M. Shamim Hossain, Abdulmotaleb El Saddik

    Abstract: Recent advancements in cognitive computing, with the integration of deep learning techniques, have facilitated the development of intelligent cognitive systems (ICS). This is particularly beneficial in the context of rail defect detection, where the ICS would emulate human-like analysis of image data for defect patterns. Despite the success of Convolutional Neural Networks (CNN) in visual defect c… ▽ More

    Submitted 30 December, 2023; originally announced January 2024.

    Comments: 26 pages, 13 figures, Springer Journal

    MSC Class: 68T05; 94A08; 90B25 ACM Class: I.2.6; I.2.10; I.5.4; I.4.10

  10. Human-Centric Resource Allocation for the Metaverse With Multiaccess Edge Computing

    Authors: Zijian Long, Haiwei Dong, Abdulmotaleb El Saddik

    Abstract: Multi-access edge computing (MEC) is a promising solution to the computation-intensive, low-latency rendering tasks of the metaverse. However, how to optimally allocate limited communication and computation resources at the edge to a large number of users in the metaverse is quite challenging. In this paper, we propose an adaptive edge resource allocation method based on multi-agent soft actor-cri… ▽ More

    Submitted 23 December, 2023; originally announced December 2023.

    Journal ref: IEEE Internet of Things Journal, vol. 10, no. 22, pp. 19993-20005, 2023

  11. arXiv:2312.06926  [pdf, other

    cs.CL

    Content-Localization based Neural Machine Translation for Informal Dialectal Arabic: Spanish/French to Levantine/Gulf Arabic

    Authors: Fatimah Alzamzami, Abdulmotaleb El Saddik

    Abstract: Resources in high-resource languages have not been efficiently exploited in low-resource languages to solve language-dependent research problems. Spanish and French are considered high resource languages in which an adequate level of data resources for informal online social behavior modeling, is observed. However, a machine translation system to access those data resources and transfer their cont… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: arXiv admin note: text overlap with arXiv:2312.03727

  12. arXiv:2312.03727  [pdf, other

    cs.CL cs.AI

    Content-Localization based System for Analyzing Sentiment and Hate Behaviors in Low-Resource Dialectal Arabic: English to Levantine and Gulf

    Authors: Fatimah Alzamzami, Abdulmotaleb El Saddik

    Abstract: Even though online social movements can quickly become viral on social media, languages can be a barrier to timely monitoring and analyzing the underlying online social behaviors (OSB). This is especially true for under-resourced languages on social media like dialectal Arabic; the primary language used by Arabs on social media. Therefore, it is crucial to provide solutions to efficiently exploit… ▽ More

    Submitted 27 November, 2023; originally announced December 2023.

  13. arXiv:2311.17629  [pdf, other

    cs.CV

    Efficient Decoder for End-to-End Oriented Object Detection in Remote Sensing Images

    Authors: Jiaqi Zhao, Zeyu Ding, Yong Zhou, Hancheng Zhu, Wenliang Du, Rui Yao, Abdulmotaleb El Saddik

    Abstract: Object instances in remote sensing images often distribute with multi-orientations, varying scales, and dense distribution. These issues bring challenges to end-to-end oriented object detectors including multi-scale features alignment and a large number of queries. To address these limitations, we propose an end-to-end oriented detector equipped with an efficient decoder, which incorporates two te… ▽ More

    Submitted 1 December, 2023; v1 submitted 29 November, 2023; originally announced November 2023.

    Comments: 11 pages, 7 figures, 13 tables

  14. arXiv:2311.14824  [pdf

    cs.CV cs.AI cs.LG

    A Reusable AI-Enabled Defect Detection System for Railway Using Ensembled CNN

    Authors: Rahatara Ferdousi, Fedwa Laamarti, Chunsheng Yang, Abdulmotaleb El Saddik

    Abstract: Accurate Defect detection is crucial for ensuring the trustworthiness of intelligent railway systems. Current approaches rely on single deep-learning models, like CNNs, which employ a large amount of data to capture underlying patterns. Training a new defect classifier with limited samples often leads to overfitting and poor performance on unseen images. To address this, researchers have advocated… ▽ More

    Submitted 24 November, 2023; originally announced November 2023.

    Comments: 28 pages, 13 Figures, Applied Intelligence Journal, Springer Nature

    MSC Class: 68T45; 68T05 ACM Class: I.2.10; I.5.2

  15. arXiv:2311.10256  [pdf

    cs.HC cs.MM

    Exploring User Perceptions of Virtual Reality Scene Design in Metaverse Learning Environments

    Authors: Rahatara Ferdousi, Mohammed Faisal, Fedwa Laamarti, Chunsheng Yang, Abdulmotaleb El Saddik

    Abstract: Metaverse learning environments allow for a seamless and intuitive transition between activities compared to Virtual Reality (VR) learning environments, due to their interconnected design. The design of VR scenes is important for creating effective learning experiences in the Metaverse. However, there is limited research on the impact of different design elements on user's learning experiences in… ▽ More

    Submitted 21 November, 2023; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: 6 pages,3 figures, accepted to present at IEEE 42nd International Conference on Consumer Electronics

    ACM Class: K.3; J.7

  16. arXiv:2309.12137  [pdf, other

    cs.CL cs.AI

    OSN-MDAD: Machine Translation Dataset for Arabic Multi-Dialectal Conversations on Online Social Media

    Authors: Fatimah Alzamzami, Abdulmotaleb El Saddik

    Abstract: While resources for English language are fairly sufficient to understand content on social media, similar resources in Arabic are still immature. The main reason that the resources in Arabic are insufficient is that Arabic has many dialects in addition to the standard version (MSA). Arabs do not use MSA in their daily communications; rather, they use dialectal versions. Unfortunately, social users… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

  17. arXiv:2308.02039  [pdf, other

    cs.CY cs.SI

    Harnessing Web3 on Carbon Offset Market for Sustainability: Framework and A Case Study

    Authors: Chenyu Zhou, Hongzhou Chen, Shiman Wang, Xinyao Sun, Abdulmotaleb El Saddik, Wei Cai

    Abstract: Blockchain, pivotal in shaping the metaverse and Web3, often draws criticism for high energy consumption and carbon emission. The rise of sustainability-focused blockchains, especially when intersecting with innovative wireless technologies, revises this predicament. To understand blockchain's role in sustainability, we propose a three-layers structure encapsulating four green utilities: Recording… ▽ More

    Submitted 25 July, 2023; originally announced August 2023.

  18. arXiv:2305.14093  [pdf, other

    cs.CV

    Weakly Supervised 3D Open-vocabulary Segmentation

    Authors: Kunhao Liu, Fangneng Zhan, Jiahui Zhang, Muyu Xu, Yingchen Yu, Abdulmotaleb El Saddik, Christian Theobalt, Eric Xing, Shijian Lu

    Abstract: Open-vocabulary segmentation of 3D scenes is a fundamental function of human perception and thus a crucial objective in computer vision research. However, this task is heavily impeded by the lack of large-scale and diverse 3D open-vocabulary segmentation datasets for training robust and generalizable models. Distilling knowledge from pre-trained 2D open-vocabulary segmentation models helps but it… ▽ More

    Submitted 9 January, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted to NeurIPS 2023

  19. arXiv:2304.11445  [pdf, other

    eess.IV cs.CV

    Improving Stain Invariance of CNNs for Segmentation by Fusing Channel Attention and Domain-Adversarial Training

    Authors: Kudaibergen Abutalip, Numan Saeed, Mustaqeem Khan, Abdulmotaleb El Saddik

    Abstract: Variability in staining protocols, such as different slide preparation techniques, chemicals, and scanner configurations, can result in a diverse set of whole slide images (WSIs). This distribution shift can negatively impact the performance of deep learning models on unseen samples, presenting a significant challenge for developing new computational pathology applications. In this study, we propo… ▽ More

    Submitted 22 April, 2023; originally announced April 2023.

  20. arXiv:2304.00690  [pdf, other

    cs.CV

    3D Semantic Segmentation in the Wild: Learning Generalized Models for Adverse-Condition Point Clouds

    Authors: Aoran Xiao, Jiaxing Huang, Weihao Xuan, Ruijie Ren, Kangcheng Liu, Dayan Guan, Abdulmotaleb El Saddik, Shijian Lu, Eric Xing

    Abstract: Robust point cloud parsing under all-weather conditions is crucial to level-5 autonomy in autonomous driving. However, how to learn a universal 3D semantic segmentation (3DSS) model is largely neglected as most existing benchmarks are dominated by point clouds captured under normal weather. We introduce SemanticSTF, an adverse-weather point cloud dataset that provides dense point-level annotations… ▽ More

    Submitted 2 April, 2023; originally announced April 2023.

    Comments: CVPR2023

  21. arXiv:2303.10598  [pdf, other

    cs.CV

    StyleRF: Zero-shot 3D Style Transfer of Neural Radiance Fields

    Authors: Kunhao Liu, Fangneng Zhan, Yiwen Chen, Jiahui Zhang, Yingchen Yu, Abdulmotaleb El Saddik, Shijian Lu, Eric Xing

    Abstract: 3D style transfer aims to render stylized novel views of a 3D scene with multi-view consistency. However, most existing work suffers from a three-way dilemma over accurate geometry reconstruction, high-quality stylization, and being generalizable to arbitrary new styles. We propose StyleRF (Style Radiance Fields), an innovative 3D style transfer technique that resolves the three-way dilemma by per… ▽ More

    Submitted 24 March, 2023; v1 submitted 19 March, 2023; originally announced March 2023.

    Comments: Accepted to CVPR 2023. Project website: https://kunhao-liu.github.io/StyleRF/

  22. A Framework of Reconfigurable Transducer Nodes for Smart Home Environments

    Authors: Basim Hafidh, Hussein Al Osman, Haiwei Dong, Abdulmotaleb El Saddik

    Abstract: This letter presents a transducer network framework that supports the amalgamation of multiple transducers into single wireless nodes. This approach is aimed at decreasing energy consumption by reducing the number of wireless transceivers involved in such networks. To make wireless nodes easily reconfigurable, a plug and play mechanism is applied to enable the clustering of any number of transduce… ▽ More

    Submitted 25 December, 2022; originally announced January 2023.

    Journal ref: IEEE Embedded Systems Letters, vol. 7, no. 3, pp. 81-84, 2015

  23. 3-D Markerless Tracking of Human Gait by Geometric Trilateration of Multiple Kinects

    Authors: Lin Yang, Bowen Yang, Haiwei Dong, Abdulmotaleb El Saddik

    Abstract: In this paper, we develop an integrated markerless gait tracking system with three Kinect v2 sensors. A geometric principle-based trilateration method is proposed for optimizing the accuracy of the measured gait data. To tackle the data synchronization problem among the Kinect clients and the server, a synchronization mechanism based on NTP (Network Time Protocol) is designed for synchronizing the… ▽ More

    Submitted 25 December, 2022; originally announced January 2023.

    Journal ref: IEEE Systems Journal, vol. 12, no. 2, pp. 1393-1403, 2018

  24. Development of an automatic 3D human head scanning-printing system

    Authors: Longyu Zhang, Bote Han, Haiwei Dong, Abdulmotaleb El Saddik

    Abstract: Three-dimensional (3D) technologies have been developing rapidly recent years, and have influenced industrial, medical, cultural, and many other fields. In this paper, we introduce an automatic 3D human head scanning-printing system, which provides a complete pipeline to scan, reconstruct, select, and finally print out physical 3D human heads. To enhance the accuracy of our system, we developed a… ▽ More

    Submitted 25 December, 2022; originally announced December 2022.

    Journal ref: Multimedia Tools and Applications, vol. 76, no. 3, pp. 4381-4403, 2017

  25. arXiv:2212.14772  [pdf, other

    cs.CV cs.MM

    A Combined Approach Toward Consistent Reconstructions of Indoor Spaces Based on 6D RGB-D Odometry and KinectFusion

    Authors: Nadia Figueroa, Haiwei Dong, Abdulmotaleb El Saddik

    Abstract: We propose a 6D RGB-D odometry approach that finds the relative camera pose between consecutive RGB-D frames by keypoint extraction and feature matching both on the RGB and depth image planes. Furthermore, we feed the estimated pose to the highly accurate KinectFusion algorithm, which uses a fast ICP (Iterative Closest Point) to fine-tune the frame-to-frame relative pose and fuse the depth data in… ▽ More

    Submitted 25 December, 2022; originally announced December 2022.

    Journal ref: ACM Trans. Intell. Syst., vol. 6, no. 2, pp. 14:1-10, 2015

  26. Development of a Self-Calibrated Motion Capture System by Nonlinear Trilateration of Multiple Kinects v2

    Authors: Bowen Yang, Haiwei Dong, Abdulmotaleb El Saddik

    Abstract: In this paper, a Kinect-based distributed and real-time motion capture system is developed. A trigonometric method is applied to calculate the relative position of Kinect v2 sensors with a calibration wand and register the sensors' positions automatically. By combining results from multiple sensors with a nonlinear least square method, the accuracy of the motion capture is optimized. Moreover, to… ▽ More

    Submitted 25 December, 2022; originally announced December 2022.

    Journal ref: IEEE Sensors Journal, vol. 17, no. 8, pp. 2481-2491, 2017

  27. Evaluating and Improving the Depth Accuracy of Kinect for Windows v2

    Authors: Lin Yang, Longyu Zhang, Haiwei Dong, Abdulhameed Alelaiwi, Abdulmotaleb El Saddik

    Abstract: Microsoft Kinect sensor has been widely used in many applications since the launch of its first version. Recently, Microsoft released a new version of Kinect sensor with improved hardware. However, the accuracy assessment of the sensor remains to be answered. In this paper, we measure the depth accuracy of the newly released Kinect v2 depth sensor, and obtain a cone model to illustrate its accurac… ▽ More

    Submitted 25 December, 2022; originally announced December 2022.

    Journal ref: IEEE Sensors Journal, vol. 15, no. 8, pp. 4275-4285, 2015

  28. EVM-CNN: Real-Time Contactless Heart Rate Estimation from Facial Video

    Authors: Ying Qiu, Yang Liu, Juan Arteaga-Falconi, Haiwei Dong, Abdulmotaleb El Saddik

    Abstract: With the increase in health consciousness, noninvasive body monitoring has aroused interest among researchers. As one of the most important pieces of physiological information, researchers have remotely estimated the heart rate (HR) from facial videos in recent years. Although progress has been made over the past few years, there are still some limitations, like the processing time increasing with… ▽ More

    Submitted 25 December, 2022; originally announced December 2022.

    Journal ref: IEEE Transactions on Multimedia, vol. 21, no. 7, pp. 1778-1787, 2019

  29. Towards a QoE Model to Evaluate Holographic Augmented Reality Devices

    Authors: Longyu Zhang, Haiwei Dong, Abdulmotaleb El Saddik

    Abstract: Augmented reality (AR) technology is developing fast and provides users with new ways to interact with the real-world surrounding environment. Although the performance of holographic AR multimedia devices can be measured with traditional quality-of-service parameters, a quality-of-experience (QoE) model can better evaluate the device from the perspective of users. As there are currently no well-re… ▽ More

    Submitted 25 December, 2022; originally announced December 2022.

    Journal ref: IEEE Multimedia, vol. 26, no. 2, pp. 21-32, 2018

  30. Learning to Estimate 3D Human Pose from Point Cloud

    Authors: Yufan Zhou, Haiwei Dong, Abdulmotaleb El Saddik

    Abstract: 3D pose estimation is a challenging problem in computer vision. Most of the existing neural-network-based approaches address color or depth images through convolution networks (CNNs). In this paper, we study the task of 3D human pose estimation from depth images. Different from the existing CNN-based human pose estimation method, we propose a deep human pose network for 3D pose estimation by takin… ▽ More

    Submitted 25 December, 2022; originally announced December 2022.

    Journal ref: IEEE Sensors Journal, vol. 20, no. 20, pp. 12334-12342, 2020

  31. arXiv:2212.12908  [pdf, other

    eess.SP cs.LG cs.NE

    Sitting Posture Recognition Using a Spiking Neural Network

    Authors: Jianquan Wang, Basim Hafidh, Haiwei Dong, Abdulmotaleb El Saddik

    Abstract: To increase the quality of citizens' lives, we designed a personalized smart chair system to recognize sitting behaviors. The system can receive surface pressure data from the designed sensor and provide feedback for guiding the user towards proper sitting postures. We used a liquid state machine and a logistic regression classifier to construct a spiking neural network for classifying 15 sitting… ▽ More

    Submitted 25 December, 2022; originally announced December 2022.

    Journal ref: IEEE Sensors Journal, vol. 21, no. 2, pp. 1779-1786, 2021

  32. Technical Evaluation of HoloLens for Multimedia: A First Look

    Authors: Yang Liu, Haiwei Dong, Longyu Zhang, Abdulmotaleb El Saddik

    Abstract: A recently released cutting-edge AR device, Microsoft HoloLens, has attracted considerable attention with its advanced capabilities. In this article, we report the design and execution of a series of experiments to quantitatively evaluate HoloLens' performance in head localization, real environment reconstruction, spatial mapping, hologram visualization, and speech recognition.

    Submitted 25 December, 2022; originally announced December 2022.

    Journal ref: IEEE Multimedia, vol. 25, no. 4, pp. 8-18, 2018

  33. arXiv:2212.10295  [pdf, other

    cs.MM cs.HC cs.NI

    Interacting with New York City Data by HoloLens through Remote Rendering

    Authors: Zijian Long, Haiwei Dong, Abdulmotaleb El Saddik

    Abstract: In the digital era, Extended Reality (XR) is considered the next frontier. However, XR systems are computationally intensive, and they must be implemented within strict latency constraints. Thus, XR devices with finite computing resources are limited in terms of quality of experience (QoE) they can offer, particularly in cases of big 3D data. This problem can be effectively addressed by offloading… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

    Journal ref: IEEE Consumer Electronics Magazine, vol. 11, no. 5, pp. 64-72, 2022

  34. arXiv:2210.04606  [pdf, ps, other

    cs.HC cs.AI

    Integrating Digital Twin and Advanced Intelligent Technologies to Realize the Metaverse

    Authors: Moayad Aloqaily, Ouns Bouachir, Fakhri Karray, Ismaeel Al Ridhawi, Abdulmotaleb El Saddik

    Abstract: The advances in Artificial Intelligence (AI) have led to technological advancements in a plethora of domains. Healthcare, education, and smart city services are now enriched with AI capabilities. These technological advancements would not have been realized without the assistance of fast, secure, and fault-tolerant communication media. Traditional processing, communication and storage technologies… ▽ More

    Submitted 3 October, 2022; originally announced October 2022.

    Comments: 7 pages, 2 figures, Accepted for publication, IEEE Consumer Electronics Magazine

  35. arXiv:2207.12850  [pdf, other

    cs.CV cs.AI

    SSIVD-Net: A Novel Salient Super Image Classification & Detection Technique for Weaponized Violence

    Authors: Toluwani Aremu, Li Zhiyuan, Reem Alameeri, Mustaqeem Khan, Abdulmotaleb El Saddik

    Abstract: Detection of violence and weaponized violence in closed-circuit television (CCTV) footage requires a comprehensive approach. In this work, we introduce the \emph{Smart-City CCTV Violence Detection (SCVD)} dataset, specifically designed to facilitate the learning of weapon distribution in surveillance videos. To tackle the complexities of analyzing 3D surveillance video for violence recognition tas… ▽ More

    Submitted 7 November, 2023; v1 submitted 26 July, 2022; originally announced July 2022.

    Comments: Contains 5 tables and 3 figures. Accepted at the 2024 SAI Computing Conference

  36. arXiv:2207.07913  [pdf, other

    cs.CV

    Dual-branch Hybrid Learning Network for Unbiased Scene Graph Generation

    Authors: Chaofan Zheng, Lianli Gao, Xinyu Lyu, Pengpeng Zeng, Abdulmotaleb El Saddik, Heng Tao Shen

    Abstract: The current studies of Scene Graph Generation (SGG) focus on solving the long-tailed problem for generating unbiased scene graphs. However, most de-biasing methods overemphasize the tail predicates and underestimate head ones throughout training, thereby wrecking the representation ability of head predicate features. Furthermore, these impaired features from head predicates harm the learning of ta… ▽ More

    Submitted 16 July, 2022; originally announced July 2022.

  37. arXiv:1909.10164  [pdf, other

    cs.MM cs.AI

    sZoom: A Framework for Automatic Zoom into High Resolution Surveillance Videos

    Authors: Mukesh Saini, Benjamin Guthier, Hao Kuang, Dwarikanath Mahapatra, Abdulmotaleb El Saddik

    Abstract: Current cameras are capable of recording high resolution video. While viewing on a mobile device, a user can manually zoom into this high resolution video to get more detailed view of objects and activities. However, manual zooming is not suitable for surveillance and monitoring. It is tiring to continuously keep zooming into various regions of the video. Also, while viewing one region, the operat… ▽ More

    Submitted 23 September, 2019; originally announced September 2019.