Skip to main content

Showing 1–50 of 129 results for author: Khan, R

  1. arXiv:2407.06345  [pdf, other

    cs.HC cs.CE cs.CY cs.ET

    Multi-person eye tracking for real-world scene perception in social settings

    Authors: Shreshth Saxena, Areez Visram, Neil Lobo, Zahid Mirza, Mehak Rafi Khan, Biranugan Pirabaharan, Alexander Nguyen, Lauren K. Fink

    Abstract: Eye movements provide a window into human behaviour, attention, and interaction dynamics. Previous research suggests that eye movements are highly influenced by task, setting, and social others; however, most eye tracking research is conducted in single-person, in-lab settings and is yet to be validated in multi-person, naturalistic contexts. One such prevalent real-world context is the collective… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: Please refer to the supplementary video illustrating the proposed approach in this paper here: https://tinyurl.com/multipersonET

    ACM Class: I.4.8; J.4; J.5; C.4; D.2.10

  2. arXiv:2406.13439  [pdf, other

    cs.CL

    Finding Blind Spots in Evaluator LLMs with Interpretable Checklists

    Authors: Sumanth Doddapaneni, Mohammed Safi Ur Rahman Khan, Sshubam Verma, Mitesh M. Khapra

    Abstract: Large Language Models (LLMs) are increasingly relied upon to evaluate text outputs of other LLMs, thereby influencing leaderboards and development decisions. However, concerns persist over the accuracy of these assessments and the potential for misleading conclusions. In this work, we investigate the effectiveness of LLMs as evaluators for text generation tasks. We propose FBI, a novel framework d… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  3. arXiv:2406.06638  [pdf, other

    hep-ph cs.LG

    Particle Multi-Axis Transformer for Jet Tagging

    Authors: Muhammad Usman, M Husnain Shahid, Maheen Ejaz, Ummay Hani, Nayab Fatima, Abdul Rehman Khan, Asifullah Khan, Nasir Majid Mirza

    Abstract: Jet tagging is an essential categorization problem in high energy physics. In recent times, Deep Learning has not only risen to the challenge of jet tagging but also significantly improved its performance. In this article, we proposed an idea of a new architecture, Particle Multi-Axis transformer (ParMAT) which is a modified version of Particle transformer (ParT). ParMAT contains local and global… ▽ More

    Submitted 16 July, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

  4. arXiv:2406.00532  [pdf, other

    cs.AI cs.LG

    Breast Cancer Diagnosis: A Comprehensive Exploration of Explainable Artificial Intelligence (XAI) Techniques

    Authors: Samita Bai, Sidra Nasir, Rizwan Ahmed Khan, Sheeraz Arif, Alexandre Meyer, Hubert Konik

    Abstract: Breast cancer (BC) stands as one of the most common malignancies affecting women worldwide, necessitating advancements in diagnostic methodologies for better clinical outcomes. This article provides a comprehensive exploration of the application of Explainable Artificial Intelligence (XAI) techniques in the detection and diagnosis of breast cancer. As Artificial Intelligence (AI) technologies cont… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  5. arXiv:2405.20363  [pdf, other

    cs.CV

    LLMGeo: Benchmarking Large Language Models on Image Geolocation In-the-wild

    Authors: Zhiqiang Wang, Dejia Xu, Rana Muhammad Shahroz Khan, Yanbin Lin, Zhiwen Fan, Xingquan Zhu

    Abstract: Image geolocation is a critical task in various image-understanding applications. However, existing methods often fail when analyzing challenging, in-the-wild images. Inspired by the exceptional background knowledge of multimodal language models, we systematically evaluate their geolocation capabilities using a novel image dataset and a comprehensive evaluation framework. We first collect images f… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 7 pages, 3 figures, 5 tables, CVPR 2024 Workshop on Computer Vision in the Wild

  6. arXiv:2404.03892  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    Enhancing Breast Cancer Diagnosis in Mammography: Evaluation and Integration of Convolutional Neural Networks and Explainable AI

    Authors: Maryam Ahmed, Tooba Bibi, Rizwan Ahmed Khan, Sidra Nasir

    Abstract: The Deep learning (DL) models for diagnosing breast cancer from mammographic images often operate as "black boxes", making it difficult for healthcare professionals to trust and understand their decision-making processes. The study presents an integrated framework combining Convolutional Neural Networks (CNNs) and Explainable Artificial Intelligence (XAI) for the enhanced diagnosis of breast cance… ▽ More

    Submitted 27 April, 2024; v1 submitted 5 April, 2024; originally announced April 2024.

  7. arXiv:2404.01878  [pdf, other

    cs.CV cs.AI

    Real, fake and synthetic faces -- does the coin have three sides?

    Authors: Shahzeb Naeem, Ramzi Al-Sharawi, Muhammad Riyyan Khan, Usman Tariq, Abhinav Dhall, Hasan Al-Nashash

    Abstract: With the ever-growing power of generative artificial intelligence, deepfake and artificially generated (synthetic) media have continued to spread online, which creates various ethical and moral concerns regarding their usage. To tackle this, we thus present a novel exploration of the trends and patterns observed in real, deepfake and synthetic facial images. The proposed analysis is done in two pa… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

  8. arXiv:2404.01438  [pdf

    cs.CV cs.AI

    Generation and Detection of Sign Language Deepfakes -- A Linguistic and Visual Analysis

    Authors: Shahzeb Naeem, Muhammad Riyyan Khan, Usman Tariq, Abhinav Dhall, Carlos Ivan Colon, Hasan Al-Nashash

    Abstract: A question in the realm of deepfakes is slowly emerging pertaining to whether we can go beyond facial deepfakes and whether it would be beneficial to society. Therefore, this research presents a positive application of deepfake technology in upper body generation, while performing sign-language for the Deaf and Hard of Hearing (DHoH) community. The resulting videos are later vetted with a sign lan… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: 13 pages, 13 figures, Computer Vision and Image Understanding Journal

  9. arXiv:2403.06350  [pdf, other

    cs.CL

    IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian Languages

    Authors: Mohammed Safi Ur Rahman Khan, Priyam Mehta, Ananth Sankar, Umashankar Kumaravelan, Sumanth Doddapaneni, Suriyaprasaad G, Varun Balan G, Sparsh Jain, Anoop Kunchukuttan, Pratyush Kumar, Raj Dabre, Mitesh M. Khapra

    Abstract: Despite the considerable advancements in English LLMs, the progress in building comparable models for other languages has been hindered due to the scarcity of tailored resources. Our work aims to bridge this divide by introducing an expansive suite of resources specifically designed for the development of Indic LLMs, covering 22 languages, containing a total of 251B tokens and 74.8M instruction-re… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

  10. arXiv:2402.09573  [pdf, other

    cs.LG cs.CL

    Changes by Butterflies: Farsighted Forecasting with Group Reservoir Transformer

    Authors: Md Kowsher, Abdul Rafae Khan, Jia Xu

    Abstract: In Chaos, a minor divergence between two initial conditions exhibits exponential amplification over time, leading to far-away outcomes, known as the butterfly effect. Thus, the distant future is full of uncertainty and hard to forecast. We introduce Group Reservoir Transformer to predict long-term events more accurately and robustly by overcoming two challenges in Chaos: (1) the extensive historic… ▽ More

    Submitted 13 June, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  11. arXiv:2401.15006  [pdf, other

    cs.CL cs.AI

    Airavata: Introducing Hindi Instruction-tuned LLM

    Authors: Jay Gala, Thanmay Jayakumar, Jaavid Aktar Husain, Aswanth Kumar M, Mohammed Safi Ur Rahman Khan, Diptesh Kanojia, Ratish Puduppully, Mitesh M. Khapra, Raj Dabre, Rudra Murthy, Anoop Kunchukuttan

    Abstract: We announce the initial release of "Airavata," an instruction-tuned LLM for Hindi. Airavata was created by fine-tuning OpenHathi with diverse, instruction-tuning Hindi datasets to make it better suited for assistive tasks. Along with the model, we also share the IndicInstruct dataset, which is a collection of diverse instruction-tuning datasets to enable further research for Indic LLMs. Additional… ▽ More

    Submitted 26 February, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: Work in progress

  12. The State of Documentation Practices of Third-party Machine Learning Models and Datasets

    Authors: Ernesto Lang Oreamuno, Rohan Faiyaz Khan, Abdul Ali Bangash, Catherine Stinson, Bram Adams

    Abstract: Model stores offer third-party ML models and datasets for easy project integration, minimizing coding efforts. One might hope to find detailed specifications of these models and datasets in the documentation, leveraging documentation standards such as model and dataset cards. In this study, we use statistical analysis and hybrid card sorting to assess the state of the practice of documenting model… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

    Comments: 7 pages, 4 figures, IEEESoftware format

    Journal ref: IEEE Software 2024

  13. arXiv:2312.13041  [pdf, other

    cs.CR

    Advancing SQL Injection Detection for High-Speed Data Centers: A Novel Approach Using Cascaded NLP

    Authors: Kasim Tasdemir, Rafiullah Khan, Fahad Siddiqui, Sakir Sezer, Fatih Kurugollu, Sena Busra Yengec-Tasdemir, Alperen Bolat

    Abstract: Detecting SQL Injection (SQLi) attacks is crucial for web-based data center security, but it is challenging to balance accuracy and computational efficiency, especially in high-speed networks. Traditional methods struggle with this balance, while NLP-based approaches, although accurate, are computationally intensive. We introduce a novel cascade SQLi detection method, blending classical and tran… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: 11 pages, The code is available at https://github.com/gdrlab/cascaded-sqli-detection This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  14. arXiv:2312.00634  [pdf

    eess.IV cs.CV

    A Recent Survey of Vision Transformers for Medical Image Segmentation

    Authors: Asifullah Khan, Zunaira Rauf, Abdul Rehman Khan, Saima Rathore, Saddam Hussain Khan, Najmus Saher Shah, Umair Farooq, Hifsa Asif, Aqsa Asif, Umme Zahoora, Rafi Ullah Khalil, Suleman Qamar, Umme Hani Asif, Faiza Babar Khan, Abdul Majid, Jeonghwan Gwak

    Abstract: Medical image segmentation plays a crucial role in various healthcare applications, enabling accurate diagnosis, treatment planning, and disease monitoring. Traditionally, convolutional neural networks (CNNs) dominated this domain, excelling at local feature extraction. However, their limitations in capturing long-range dependencies across image regions pose challenges for segmenting complex, inte… ▽ More

    Submitted 18 December, 2023; v1 submitted 1 December, 2023; originally announced December 2023.

  15. arXiv:2310.17729  [pdf

    cs.LG cs.AI cs.CV

    Improving Traffic Density Forecasting in Intelligent Transportation Systems Using Gated Graph Neural Networks

    Authors: Razib Hayat Khan, Jonayet Miah, S M Yasir Arafat, M M Mahbubul Syeed, Duc M Ca

    Abstract: This study delves into the application of graph neural networks in the realm of traffic forecasting, a crucial facet of intelligent transportation systems. Accurate traffic predictions are vital for functions like trip planning, traffic control, and vehicle routing in such systems. Three prominent GNN architectures Graph Convolutional Networks (Graph Sample and Aggregation) and Gated Graph Neural… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

  16. arXiv:2310.07252  [pdf

    cs.CV cs.LG

    A Comparative Study of Pre-trained CNNs and GRU-Based Attention for Image Caption Generation

    Authors: Rashid Khan, Bingding Huang, Haseeb Hassan, Asim Zaman, Zhongfu Ye

    Abstract: Image captioning is a challenging task involving generating a textual description for an image using computer vision and natural language processing techniques. This paper proposes a deep neural framework for image caption generation using a GRU-based attention mechanism. Our approach employs multiple pre-trained convolutional neural networks as the encoder to extract features from the image and a… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: 15pages, 10 figures, 5 tables. 2023 the 5th International Conference on Robotics and Computer Vision (ICRCV 2023). arXiv admin note: substantial text overlap with arXiv:2203.01594

  17. Ethical Framework for Harnessing the Power of AI in Healthcare and Beyond

    Authors: Sidra Nasir, Rizwan Ahmed Khan, Samita Bai

    Abstract: In the past decade, the deployment of deep learning (Artificial Intelligence (AI)) methods has become pervasive across a spectrum of real-world applications, often in safety-critical contexts. This comprehensive research article rigorously investigates the ethical dimensions intricately linked to the rapid evolution of AI technologies, with a particular focus on the healthcare domain. Delving deep… ▽ More

    Submitted 31 August, 2023; originally announced September 2023.

    Journal ref: IEEE Access 2024

  18. arXiv:2308.16571  [pdf, ps, other

    cs.CV cs.LG

    Document Layout Analysis on BaDLAD Dataset: A Comprehensive MViTv2 Based Approach

    Authors: Ashrafur Rahman Khan, Asif Azad

    Abstract: In the rapidly evolving digital era, the analysis of document layouts plays a pivotal role in automated information extraction and interpretation. In our work, we have trained MViTv2 transformer model architecture with cascaded mask R-CNN on BaDLAD dataset to extract text box, paragraphs, images and tables from a document. After training on 20365 document images for 36 epochs in a 3 phase cycle, w… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

  19. arXiv:2308.01760  [pdf, other

    eess.IV cs.CV

    NuInsSeg: A Fully Annotated Dataset for Nuclei Instance Segmentation in H&E-Stained Histological Images

    Authors: Amirreza Mahbod, Christine Polak, Katharina Feldmann, Rumsha Khan, Katharina Gelles, Georg Dorffner, Ramona Woitek, Sepideh Hatamikia, Isabella Ellinger

    Abstract: In computational pathology, automatic nuclei instance segmentation plays an essential role in whole slide image analysis. While many computerized approaches have been proposed for this task, supervised deep learning (DL) methods have shown superior segmentation performances compared to classical machine learning and image processing techniques. However, these models need fully annotated datasets f… ▽ More

    Submitted 3 August, 2023; originally announced August 2023.

    Comments: 7 pages, 1 Figure

  20. arXiv:2307.06824  [pdf

    cs.AI cs.DB cs.LG

    CLAIMED -- the open source framework for building coarse-grained operators for accelerated discovery in science

    Authors: Romeo Kienzler, Rafflesia Khan, Jerome Nilmeier, Ivan Nesic, Ibrahim Haddad

    Abstract: In modern data-driven science, reproducibility and reusability are key challenges. Scientists are well skilled in the process from data to publication. Although some publication channels require source code and data to be made accessible, rerunning and verifying experiments is usually hard due to a lack of standards. Therefore, reusing existing scientific data processing code from state-of-the-art… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: Received IEEE OSS Award 2023 - https://conferences.computer.org/services/2023/symposia/oss.html

  21. arXiv:2307.04479  [pdf, other

    cs.DS cs.CE q-bio.GN

    A Linear Time Quantum Algorithm for Pairwise Sequence Alignment

    Authors: Md. Rabiul Islam Khan, Shadman Shahriar, Shaikh Farhan Rafid

    Abstract: Sequence Alignment is the process of aligning biological sequences in order to identify similarities between multiple sequences. In this paper, a Quantum Algorithm for finding the optimal alignment between DNA sequences has been demonstrated which works by mapping the sequence alignment problem into a path-searching problem through a 2D graph. The transition, which converges to a fixed path on the… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

  22. arXiv:2305.08396  [pdf, other

    eess.IV cs.CV cs.LG

    MaxViT-UNet: Multi-Axis Attention for Medical Image Segmentation

    Authors: Abdul Rehman Khan, Asifullah Khan

    Abstract: Since their emergence, Convolutional Neural Networks (CNNs) have made significant strides in medical image analysis. However, the local nature of the convolution operator may pose a limitation for capturing global and long-range interactions in CNNs. Recently, Transformers have gained popularity in the computer vision community and also in medical image segmentation due to their ability to process… ▽ More

    Submitted 29 March, 2024; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: 19 pages, 6 figures, 5 tables

  23. arXiv:2302.03232  [pdf, other

    cs.LG math.OC

    Linear Optimal Partial Transport Embedding

    Authors: Yikun Bai, Ivan Medri, Rocio Diaz Martin, Rana Muhammad Shahroz Khan, Soheil Kolouri

    Abstract: Optimal transport (OT) has gained popularity due to its various applications in fields such as machine learning, statistics, and signal processing. However, the balanced mass requirement limits its performance in practical problems. To address these limitations, variants of the OT problem, including unbalanced OT, Optimal partial transport (OPT), and Hellinger Kantorovich (HK), have been proposed.… ▽ More

    Submitted 23 April, 2024; v1 submitted 6 February, 2023; originally announced February 2023.

  24. arXiv:2301.08479  [pdf, other

    eess.IV cs.CV cs.LG

    Pneumonia Detection in Chest X-Ray Images : Handling Class Imbalance

    Authors: Wardah Ali, Eesha Qureshi, Omama Ahmed Farooqi, Rizwan Ahmed Khan

    Abstract: People all over the globe are affected by pneumonia but deaths due to it are highest in Sub-Saharan Asia and South Asia. In recent years, the overall incidence and mortality rate of pneumonia regardless of the utilization of effective vaccines and compelling antibiotics has escalated. Thus, pneumonia remains a disease that needs spry prevention and treatment. The widespread prevalence of pneumonia… ▽ More

    Submitted 20 January, 2023; originally announced January 2023.

  25. A Hybrid Evolutionary Approach to Solve University Course Allocation Problem

    Authors: Dibyo Fabian Dofadar, Riyo Hayat Khan, Shafqat Hasan, Towshik Anam Taj, Arif Shakil, Mahbub Majumdar

    Abstract: This paper discusses various types of constraints, difficulties and solutions to overcome the challenges regarding university course allocation problem. A hybrid evolutionary algorithm has been defined combining Local Repair Algorithm and Modified Genetic Algorithm to generate the best course assignment. After analyzing the collected dataset, all the necessary constraints were formulated. These co… ▽ More

    Submitted 24 July, 2023; v1 submitted 15 November, 2022; originally announced December 2022.

  26. arXiv:2211.06642  [pdf, other

    cs.CL

    ConceptX: A Framework for Latent Concept Analysis

    Authors: Firoj Alam, Fahim Dalvi, Nadir Durrani, Hassan Sajjad, Abdul Rafae Khan, Jia Xu

    Abstract: The opacity of deep neural networks remains a challenge in deploying solutions where explanation is as important as precision. We present ConceptX, a human-in-the-loop framework for interpreting and annotating latent representational space in pre-trained Language Models (pLMs). We use an unsupervised method to discover concepts learned in these models and enable a graphical interface for humans to… ▽ More

    Submitted 12 November, 2022; originally announced November 2022.

    Comments: AAAI 23

  27. arXiv:2210.11670  [pdf, ps, other

    cs.CL

    SIT at MixMT 2022: Fluent Translation Built on Giant Pre-trained Models

    Authors: Abdul Rafae Khan, Hrishikesh Kanade, Girish Amar Budhrani, Preet Jhanglani, Jia Xu

    Abstract: This paper describes the Stevens Institute of Technology's submission for the WMT 2022 Shared Task: Code-mixed Machine Translation (MixMT). The task consisted of two subtasks, subtask $1$ Hindi/English to Hinglish and subtask $2$ Hinglish to English translation. Our findings lie in the improvements made through the use of large pre-trained multilingual NMT models and in-domain datasets, as well as… ▽ More

    Submitted 16 November, 2022; v1 submitted 20 October, 2022; originally announced October 2022.

  28. A Comparative Study on COVID-19 Fake News Detection Using Different Transformer Based Models

    Authors: Sajib Kumar Saha Joy, Dibyo Fabian Dofadar, Riyo Hayat Khan, Md. Sabbir Ahmed, Rafeed Rahman

    Abstract: The rapid advancement of social networks and the convenience of internet availability have accelerated the rampant spread of false news and rumors on social media sites. Amid the COVID 19 epidemic, this misleading information has aggravated the situation by putting peoples mental and physical lives in danger. To limit the spread of such inaccuracies, identifying the fake news from online platforms… ▽ More

    Submitted 2 August, 2022; originally announced August 2022.

  29. arXiv:2206.13289  [pdf, other

    cs.CL cs.AI

    Analyzing Encoded Concepts in Transformer Language Models

    Authors: Hassan Sajjad, Nadir Durrani, Fahim Dalvi, Firoj Alam, Abdul Rafae Khan, Jia Xu

    Abstract: We propose a novel framework ConceptX, to analyze how latent concepts are encoded in representations learned within pre-trained language models. It uses clustering to discover the encoded concepts and explains them by aligning with a large set of human-defined concepts. Our analysis on seven transformer language models reveal interesting insights: i) the latent space within the learned representat… ▽ More

    Submitted 27 June, 2022; originally announced June 2022.

    Comments: 20 pages, 10 figures

    Journal ref: 2022 Annual Conference of the North American Chapter of the Association for Computational Linguistics

  30. arXiv:2206.12815  [pdf, other

    eess.IV cs.CV cs.LG

    Breast Cancer Classification using Deep Learned Features Boosted with Handcrafted Features

    Authors: Unaiza Sajid, Rizwan Ahmed Khan, Shahid Munir Shah, Sheeraz Arif

    Abstract: Breast cancer is one of the leading causes of death among women across the globe. It is difficult to treat if detected at advanced stages, however, early detection can significantly increase chances of survival and improves lives of millions of women. Given the widespread prevalence of breast cancer, it is of utmost importance for the research community to come up with the framework for early dete… ▽ More

    Submitted 16 January, 2023; v1 submitted 26 June, 2022; originally announced June 2022.

    Journal ref: Biomedical Signal Processing and Control 2023

  31. arXiv:2206.08464  [pdf, other

    cs.LG

    PRANC: Pseudo RAndom Networks for Compacting deep models

    Authors: Parsa Nooralinejad, Ali Abbasi, Soroush Abbasi Koohpayegani, Kossar Pourahmadi Meibodi, Rana Muhammad Shahroz Khan, Soheil Kolouri, Hamed Pirsiavash

    Abstract: We demonstrate that a deep model can be reparametrized as a linear combination of several randomly initialized and frozen deep models in the weight space. During training, we seek local minima that reside within the subspace spanned by these random models (i.e., `basis' networks). Our framework, PRANC, enables significant compaction of a deep model. The model can be reconstructed using a single sc… ▽ More

    Submitted 28 August, 2023; v1 submitted 16 June, 2022; originally announced June 2022.

  32. arXiv:2206.06925  [pdf

    cs.CR

    Towards a secured smart IoT using light weight blockchain: An aim to secure Pharmacy Products

    Authors: Md. Faruk Abdullah Al Sohan, Samiur Rahman Khan, Nusrat Jahan Anannya, Md Taimur Ahad

    Abstract: Blockchain has proven a very developed and secured technology. It ensures data integrity with authentic connected nodes. Now-a-days, blockchain with IoT is a great combination for secured and smart end to end product delivery. This observation has motivated the research to develop a conceptual model to provide a secure pharmaceutical product delivery by developing a IoT integrated with lightweight… ▽ More

    Submitted 7 June, 2022; originally announced June 2022.

    Comments: 9 pages 3 figures

  33. arXiv:2205.07237  [pdf, other

    cs.CL

    Discovering Latent Concepts Learned in BERT

    Authors: Fahim Dalvi, Abdul Rafae Khan, Firoj Alam, Nadir Durrani, Jia Xu, Hassan Sajjad

    Abstract: A large number of studies that analyze deep neural network models and their ability to encode various linguistic and non-linguistic concepts provide an interpretation of the inner mechanics of these models. The scope of the analyses is limited to pre-defined concepts that reinforce the traditional linguistic knowledge and do not reflect on how novel concepts are learned by the model. We address th… ▽ More

    Submitted 15 May, 2022; originally announced May 2022.

    Comments: ICLR 2022

  34. arXiv:2204.01205  [pdf, other

    cs.LG cs.DC math.NA

    Model-Parallel Fourier Neural Operators as Learned Surrogates for Large-Scale Parametric PDEs

    Authors: Thomas J. Grady II, Rishi Khan, Mathias Louboutin, Ziyi Yin, Philipp A. Witte, Ranveer Chandra, Russell J. Hewett, Felix J. Herrmann

    Abstract: Fourier neural operators (FNOs) are a recently introduced neural network architecture for learning solution operators of partial differential equations (PDEs), which have been shown to perform significantly better than comparable deep learning approaches. Once trained, FNOs can achieve speed-ups of multiple orders of magnitude over conventional numerical PDE solvers. However, due to the high dimen… ▽ More

    Submitted 1 February, 2023; v1 submitted 3 April, 2022; originally announced April 2022.

  35. arXiv:2203.06721  [pdf

    cs.CV

    Food Recipe Recommendation Based on Ingredients Detection Using Deep Learning

    Authors: Md. Shafaat Jamil Rokon, Md Kishor Morol, Ishra Binte Hasan, A. M. Saif, Rafid Hussain Khan

    Abstract: Food is essential for human survival, and people always try to taste different types of delicious recipes. Frequently, people choose food ingredients without even knowing their names or pick up some food ingredients that are not obvious to them from a grocery store. Knowing which ingredients can be mixed to make a delicious food recipe is essential. Selecting the right recipe by choosing a list of… ▽ More

    Submitted 13 March, 2022; originally announced March 2022.

    Comments: Accepted at ICCA 2022

  36. arXiv:2203.03022  [pdf, ps, other

    cs.SD cs.AI cs.LG eess.AS stat.ML

    HEAR: Holistic Evaluation of Audio Representations

    Authors: Joseph Turian, Jordie Shier, Humair Raj Khan, Bhiksha Raj, Björn W. Schuller, Christian J. Steinmetz, Colin Malloy, George Tzanetakis, Gissel Velarde, Kirk McNally, Max Henry, Nicolas Pinto, Camille Noufi, Christian Clough, Dorien Herremans, Eduardo Fonseca, Jesse Engel, Justin Salamon, Philippe Esling, Pranay Manocha, Shinji Watanabe, Zeyu Jin, Yonatan Bisk

    Abstract: What audio embedding approach generalizes best to a wide range of downstream tasks across a variety of everyday domains without fine-tuning? The aim of the HEAR benchmark is to develop a general-purpose audio representation that provides a strong basis for learning in a wide variety of tasks and scenarios. HEAR evaluates audio representations using a benchmark suite across a variety of domains, in… ▽ More

    Submitted 29 May, 2022; v1 submitted 6 March, 2022; originally announced March 2022.

    Comments: to appear in Proceedings of Machine Learning Research (PMLR): NeurIPS 2021 Competition Track

  37. arXiv:2203.02791  [pdf, ps, other

    cs.NI

    Deep Q-Learning Based Resource Allocation in Interference Systems With Outage Constraint

    Authors: Saniul Alam, Sadia Islam, Muhammad R. A. Khandaker, Risala T. Khan, Faisal Tariq, Apriana Toding

    Abstract: This correspondence considers the resource allocation problem in wireless interference channel (IC) under link outage constraints. Since the optimization problem is non-convex in nature, existing approaches to find the optimal power allocation are computationally intensive and thus practically infeasible. Recently, deep reinforcement learning has shown promising outcome in solving non-convex optim… ▽ More

    Submitted 5 March, 2022; originally announced March 2022.

    Comments: Submitted to IEEE TVT

  38. arXiv:2203.01594  [pdf

    cs.CL cs.CV

    A Deep Neural Framework for Image Caption Generation Using GRU-Based Attention Mechanism

    Authors: Rashid Khan, M Shujah Islam, Khadija Kanwal, Mansoor Iqbal, Md. Imran Hossain, Zhongfu Ye

    Abstract: Image captioning is a fast-growing research field of computer vision and natural language processing that involves creating text explanations for images. This study aims to develop a system that uses a pre-trained convolutional neural network (CNN) to extract features from an image, integrates the features with an attention mechanism, and creates captions using a recurrent neural network (RNN). To… ▽ More

    Submitted 3 March, 2022; originally announced March 2022.

    Comments: 16 PAGES, 8 figures, 1 TABLE

    Journal ref: Information Technology and Control 2022

  39. arXiv:2112.13170  [pdf, other

    eess.SP cs.NI

    On the Feasibility of 4.9 GHz Public Safety Band as Spectrum Option for Internet of Vehicles

    Authors: Muhammad Faizan Rizwan Khan, Seungmo Kim

    Abstract: There is an unprecedented impetus on the advancement of internet of vehicles (IoV). The vehicle-to-everything (V2X) communication is well acknowledged as the key technology in constitution of the IoV. Nevertheless, the spectrum for V2X communication is undergoing a massive change in the United States: a majority of the bandwidth has been reallocated to Wi-Fi leaving even less than a half of the ba… ▽ More

    Submitted 24 December, 2021; originally announced December 2021.

  40. Federated 3GPP Mobile Edge Computing Systems: A Transparent Proxy for Third Party Authentication with Application Mobility Support

    Authors: Asad Ali, Samin Rahman Khan, Sadman Sakib, Md. Shohrab Hossain, Ying-Dar Lin

    Abstract: Multi-Access or Mobile Edge Computing (MEC) is being deployed by 4G/5G operators to provide computational services at lower latencies. Federating MECs across operators expands capability, capacity, and coverage but gives rise to two issues - third-party authentication and application mobility - for continuous service during roaming without re-authentication. In this work, we propose a Federated St… ▽ More

    Submitted 15 December, 2021; originally announced December 2021.

    Comments: 14 pages. 8 figures. Submitted to IEEE Access

  41. arXiv:2110.15742  [pdf, other

    cs.LG

    Barlow Graph Auto-Encoder for Unsupervised Network Embedding

    Authors: Rayyan Ahmad Khan, Martin Kleinsteuber

    Abstract: Network embedding has emerged as a promising research field for network analysis. Recently, an approach, named Barlow Twins, has been proposed for self-supervised learning in computer vision by applying the redundancy-reduction principle to the embedding vectors corresponding to two distorted versions of the image samples. Motivated by this, we propose Barlow Graph Auto-Encoder, a simple yet effec… ▽ More

    Submitted 13 December, 2022; v1 submitted 29 October, 2021; originally announced October 2021.

  42. Artificial Intelligence For Breast Cancer Detection: Trends & Directions

    Authors: Shahid Munir Shah, Rizwan Ahmed Khan, Sheeraz Arif, Unaiza Sajid

    Abstract: In the last decade, researchers working in the domain of computer vision and Artificial Intelligence (AI) have beefed up their efforts to come up with the automated framework that not only detects but also identifies stage of breast cancer. The reason for this surge in research activities in this direction are mainly due to advent of robust AI algorithms (deep learning), availability of hardware t… ▽ More

    Submitted 3 October, 2021; originally announced October 2021.

    Journal ref: Computers in Biology and Medicine 2022

  43. arXiv:2109.14197  [pdf

    cs.CL

    Context based Roman-Urdu to Urdu Script Transliteration System

    Authors: H Muhammad Shakeel, Rashid Khan, Muhammad Waheed

    Abstract: Now a day computer is necessary for human being and it is very useful in many fields like search engine, text processing, short messaging services, voice chatting and text recognition. Since last many years there are many tools and techniques that have been developed to support the writing of language script. Most of the Asian languages like Arabic, Urdu, Persian, Chains and Korean are written in… ▽ More

    Submitted 29 September, 2021; originally announced September 2021.

  44. arXiv:2109.04653  [pdf, other

    cs.CL

    Towards Developing a Multilingual and Code-Mixed Visual Question Answering System by Knowledge Distillation

    Authors: Humair Raj Khan, Deepak Gupta, Asif Ekbal

    Abstract: Pre-trained language-vision models have shown remarkable performance on the visual question answering (VQA) task. However, most pre-trained models are trained by only considering monolingual learning, especially the resource-rich language like English. Training such models for multilingual setups demand high computing resources and multilingual language-vision dataset which hinders their applicati… ▽ More

    Submitted 9 September, 2021; originally announced September 2021.

    Comments: Accepted in EMNLP-Findings (2021)

  45. arXiv:2108.03953  [pdf, other

    cs.LG

    A Framework for Joint Unsupervised Learning of Cluster-Aware Embedding for Heterogeneous Networks

    Authors: Rayyan Ahmad Khan, Martin Kleinsteuber

    Abstract: Heterogeneous Information Network (HIN) embedding refers to the low-dimensional projections of the HIN nodes that preserve the HIN structure and semantics. HIN embedding has emerged as a promising research field for network analysis as it enables downstream tasks such as clustering and node classification. In this work, we propose \ours for joint learning of cluster embeddings as well as cluster-a… ▽ More

    Submitted 9 August, 2021; originally announced August 2021.

  46. arXiv:2108.02899  [pdf, other

    cs.CL cs.LG

    Lights, Camera, Action! A Framework to Improve NLP Accuracy over OCR documents

    Authors: Amit Gupte, Alexey Romanov, Sahitya Mantravadi, Dalitso Banda, Jianjie Liu, Raza Khan, Lakshmanan Ramu Meenal, Benjamin Han, Soundar Srinivasan

    Abstract: Document digitization is essential for the digital transformation of our societies, yet a crucial step in the process, Optical Character Recognition (OCR), is still not perfect. Even commercial OCR systems can produce questionable output depending on the fidelity of the scanned documents. In this paper, we demonstrate an effective framework for mitigating OCR errors for any downstream NLP task, us… ▽ More

    Submitted 5 August, 2021; originally announced August 2021.

    Comments: Accepted to the Document Intelligence Workshop at KDD 2021. The source code of Genalog is available at https://github.com/microsoft/genalog

  47. arXiv:2106.13456  [pdf, other

    cs.LG

    Interpreting Criminal Charge Prediction and Its Algorithmic Bias via Quantum-Inspired Complex Valued Networks

    Authors: Abdul Rafae Khan, Jia Xu, Peter Varsanyi, Rachit Pabreja

    Abstract: While predictive policing has become increasingly common in assisting with decisions in the criminal justice system, the use of these results is still controversial. Some software based on deep learning lacks accuracy (e.g., in F-1), and importantly many decision processes are not transparent, causing doubt about decision bias, such as perceived racial and age disparities. This paper addresses bia… ▽ More

    Submitted 13 July, 2021; v1 submitted 25 June, 2021; originally announced June 2021.

    Comments: First two authors alphabetically ordered

  48. arXiv:2105.01316  [pdf

    cs.CR cs.DC

    Technology Review of Blockchain Data Privacy Solutions

    Authors: Jack Tanner, Roshaan Khan

    Abstract: This objective of this report is to review existing enterprise blockchain technologies - EOSIO powered systems, Hyperledger Fabric and Besu, Consensus Quorum, R3 Corda and Ernst and Young's Nightfall - that provide data privacy while leveraging the data integrity benefits of blockchain. By reviewing and comparing how and how well these technologies achieve data privacy, a snapshot is captured of t… ▽ More

    Submitted 4 May, 2021; originally announced May 2021.

    ACM Class: C.2.2; E.3

  49. arXiv:2103.01322  [pdf, ps, other

    cs.CR

    Thinking Out of the Blocks: Holochain for Distributed Security in IoT Healthcare

    Authors: Shakila Zaman, Muhammad R. A. Khandaker, Risala T. Khan, Faisal Tariq, Kai-Kit Wong

    Abstract: The Internet-of-Things (IoT) is an emerging and cognitive technology which connects a massive number of smart physical devices with virtual objects operating in diverse platforms through the internet. IoT is increasingly being implemented in distributed settings, making footprints in almost every sector of our life. Unfortunately, for healthcare systems, the entities connected to the IoT networks… ▽ More

    Submitted 1 March, 2021; originally announced March 2021.

    Comments: Submitted to IEEE

  50. arXiv:2101.03885  [pdf, other

    cs.LG

    Variational Embeddings for Community Detection and Node Representation

    Authors: Rayyan Ahmad Khan, Muhammad Umer Anwaar, Omran Kaddah, Martin Kleinsteuber

    Abstract: In this paper, we study how to simultaneously learn two highly correlated tasks of graph analysis, i.e., community detection and node representation learning. We propose an efficient generative model called VECoDeR for jointly learning Variational Embeddings for Community Detection and node Representation. VECoDeR assumes that every node can be a member of one or more communities. The node embeddi… ▽ More

    Submitted 11 January, 2021; originally announced January 2021.