Skip to main content

Showing 1–50 of 81 results for author: Singh, B

  1. arXiv:2407.10958  [pdf, other

    cs.CV

    InVi: Object Insertion In Videos Using Off-the-Shelf Diffusion Models

    Authors: Nirat Saini, Navaneeth Bodla, Ashish Shrivastava, Avinash Ravichandran, Xiao Zhang, Abhinav Shrivastava, Bharat Singh

    Abstract: We introduce InVi, an approach for inserting or replacing objects within videos (referred to as inpainting) using off-the-shelf, text-to-image latent diffusion models. InVi targets controlled manipulation of objects and blending them seamlessly into a background video unlike existing video editing methods that focus on comprehensive re-styling or entire scene alterations. To achieve this goal, we… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

  2. arXiv:2406.10722  [pdf, other

    cs.CV cs.AI cs.LG

    GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR

    Authors: Bharat Singh, Viveka Kulharia, Luyu Yang, Avinash Ravichandran, Ambrish Tyagi, Ashish Shrivastava

    Abstract: Multimodal synthetic data generation is crucial in domains such as autonomous driving, robotics, augmented/virtual reality, and retail. We propose a novel approach, GenMM, for jointly editing RGB videos and LiDAR scans by inserting temporally and geometrically consistent 3D objects. Our method uses a reference image and 3D bounding boxes to seamlessly insert and blend new objects into target video… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

  3. arXiv:2406.00133  [pdf, other

    cs.LG cs.AI

    Streamflow Prediction with Uncertainty Quantification for Water Management: A Constrained Reasoning and Learning Approach

    Authors: Mohammed Amine Gharsallaoui, Bhupinderjeet Singh, Supriya Savalkar, Aryan Deshwal, Yan Yan, Ananth Kalyanaraman, Kirti Rajagopalan, Janardhan Rao Doppa

    Abstract: Predicting the spatiotemporal variation in streamflow along with uncertainty quantification enables decision-making for sustainable management of scarce water resources. Process-based hydrological models (aka physics-based models) are based on physical laws, but using simplifying assumptions which can lead to poor accuracy. Data-driven approaches offer a powerful alternative, but they require larg… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

  4. arXiv:2403.17223  [pdf

    cs.CV cs.AI cs.LG

    Co-Occurring of Object Detection and Identification towards unlabeled object discovery

    Authors: Binay Kumar Singh, Niels Da Vitoria Lobo

    Abstract: In this paper, we propose a novel deep learning based approach for identifying co-occurring objects in conjunction with base objects in multilabel object categories. Nowadays, with the advancement in computer vision based techniques we need to know about co-occurring objects with respect to base object for various purposes. The pipeline of the proposed work is composed of two stages: in the first… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 6 pages, 2 figures,

  5. arXiv:2312.05461  [pdf, other

    cs.LG cs.AI

    STREAMLINE: An Automated Machine Learning Pipeline for Biomedicine Applied to Examine the Utility of Photography-Based Phenotypes for OSA Prediction Across International Sleep Centers

    Authors: Ryan J. Urbanowicz, Harsh Bandhey, Brendan T. Keenan, Greg Maislin, Sy Hwang, Danielle L. Mowery, Shannon M. Lynch, Diego R. Mazzotti, Fang Han, Qing Yun Li, Thomas Penzel, Sergio Tufik, Lia Bittencourt, Thorarinn Gislason, Philip de Chazal, Bhajan Singh, Nigel McArdle, Ning-Hung Chen, Allan Pack, Richard J. Schwab, Peter A. Cistulli, Ulysses J. Magalang

    Abstract: While machine learning (ML) includes a valuable array of tools for analyzing biomedical data, significant time and expertise is required to assemble effective, rigorous, and unbiased pipelines. Automated ML (AutoML) tools seek to facilitate ML application by automating a subset of analysis pipeline elements. In this study we develop and validate a Simple, Transparent, End-to-end Automated Machine… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: 23 pages, 7 figures, 1 table, 1 supplemental information document (77 pages), and 7 ancillary files

  6. arXiv:2311.03388  [pdf, other

    cs.LG cs.AI physics.ao-ph

    Attention-based Models for Snow-Water Equivalent Prediction

    Authors: Krishu K. Thapa, Bhupinderjeet Singh, Supriya Savalkar, Alan Fern, Kirti Rajagopalan, Ananth Kalyanaraman

    Abstract: Snow Water-Equivalent (SWE) -- the amount of water available if snowpack is melted -- is a key decision variable used by water management agencies to make irrigation, flood control, power generation and drought management decisions. SWE values vary spatiotemporally -- affected by weather, topography and other environmental factors. While daily SWE can be measured by Snow Telemetry (SNOTEL) station… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: 7 pages, To be published in Proceedings of The Thirty-Sixth Annual Conference on Innovative Applications of Artificial Intelligence (IAAI-24)

    ACM Class: I.2

  7. arXiv:2311.00429  [pdf, other

    eess.IV cs.LG

    Crop Disease Classification using Support Vector Machines with Green Chromatic Coordinate (GCC) and Attention based feature extraction for IoT based Smart Agricultural Applications

    Authors: Shashwat Jha, Vishvaditya Luhach, Gauri Shanker Gupta, Beependra Singh

    Abstract: Crops hold paramount significance as they serve as the primary provider of energy, nutrition, and medicinal benefits for the human population. Plant diseases, however, can negatively affect leaves during agricultural cultivation, resulting in significant losses in crop output and economic value. Therefore, it is crucial for farmers to identify crop diseases. However, this method frequently necessi… ▽ More

    Submitted 6 November, 2023; v1 submitted 1 November, 2023; originally announced November 2023.

  8. arXiv:2306.05989  [pdf, other

    cs.LG stat.ML

    QBSD: Quartile-Based Seasonality Decomposition for Cost-Effective Time Series Forecasting

    Authors: Ebenezer RHP Isaac, Bulbul Singh

    Abstract: In the telecom domain, precise forecasting of time series patterns, such as cell key performance indicators (KPIs), plays a pivotal role in enhancing service quality and operational efficiency. State-of-the-art forecasting approaches prioritize forecasting accuracy at the expense of computational performance, rendering them less suitable for data-intensive applications encompassing systems with a… ▽ More

    Submitted 16 August, 2023; v1 submitted 9 June, 2023; originally announced June 2023.

  9. arXiv:2303.12343  [pdf, other

    cs.CV

    LD-ZNet: A Latent Diffusion Approach for Text-Based Image Segmentation

    Authors: Koutilya Pnvr, Bharat Singh, Pallabi Ghosh, Behjat Siddiquie, David Jacobs

    Abstract: Large-scale pre-training tasks like image classification, captioning, or self-supervised techniques do not incentivize learning the semantic boundaries of objects. However, recent generative foundation models built using text-based latent diffusion techniques may learn semantic boundaries. This is because they have to synthesize intricate details about all objects in an image based on a text descr… ▽ More

    Submitted 23 August, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

    Comments: Supplementary material is included in the paper following the references section

  10. arXiv:2302.04790  [pdf, other

    cs.CL cs.AI cs.IR

    Massively Multilingual Language Models for Cross Lingual Fact Extraction from Low Resource Indian Languages

    Authors: Bhavyajeet Singh, Pavan Kandru, Anubhav Sharma, Vasudeva Varma

    Abstract: Massive knowledge graphs like Wikidata attempt to capture world knowledge about multiple entities. Recent approaches concentrate on automatically enriching these KGs from text. However a lot of information present in the form of natural text in low resource languages is often missed out. Cross Lingual Information Extraction aims at extracting factual information in the form of English triples from… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

    Comments: 5 pages, 2 page Apendix, 3 figures, accepted at 19th International Conference on Natural Language Processing

  11. arXiv:2302.03845  [pdf, other

    cs.LG physics.ao-ph

    Two-step hyperparameter optimization method: Accelerating hyperparameter search by using a fraction of a training dataset

    Authors: Sungduk Yu, Mike Pritchard, Po-Lun Ma, Balwinder Singh, Sam Silva

    Abstract: Hyperparameter optimization (HPO) is an important step in machine learning (ML) model development, but common practices are archaic -- primarily relying on manual or grid searches. This is partly because adopting advanced HPO algorithms introduces added complexity to the workflow, leading to longer computation times. This poses a notable challenge to ML applications, as suboptimal hyperparameter s… ▽ More

    Submitted 7 September, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

    Journal ref: Artificial Intelligence for the Earth Systems, 3(1), 2024, e230013

  12. arXiv:2212.08299  [pdf, ps, other

    cs.LG math.OC

    Metaheuristic for Hub-Spoke Facility Location Problem: Application to Indian E-commerce Industry

    Authors: Aakash Sachdeva, Bhupinder Singh, Rahul Prasad, Nakshatra Goel, Ronit Mondal, Jatin Munjal, Abhishek Bhatnagar, Manjeet Dahiya

    Abstract: Indian e-commerce industry has evolved over the last decade and is expected to grow over the next few years. The focus has now shifted to turnaround time (TAT) due to the emergence of many third-party logistics providers and higher customer expectations. The key consideration for delivery providers is to balance their overall operating costs while meeting the promised TAT to their customers. E-com… ▽ More

    Submitted 16 December, 2022; originally announced December 2022.

  13. arXiv:2212.03474  [pdf, other

    cs.LG cs.AI

    Tree DNN: A Deep Container Network

    Authors: Brijraj Singh, Swati Gupta, Mayukh Das, Praveen Doreswamy Naidu, Sharan Kumar Allur

    Abstract: Multi-Task Learning (MTL) has shown its importance at user products for fast training, data efficiency, reduced overfitting etc. MTL achieves it by sharing the network parameters and training a network for multiple tasks simultaneously. However, MTL does not provide the solution, if each task needs training from a different dataset. In order to solve the stated problem, we have proposed an archite… ▽ More

    Submitted 7 December, 2022; originally announced December 2022.

  14. arXiv:2210.16093  [pdf, other

    cs.CV cs.AI cs.LG

    A CNN-LSTM Combination Network for Cataract Detection using Eye Fundus Images

    Authors: Dishant Padalia, Abhishek Mazumdar, Bharati Singh

    Abstract: According to multiple authoritative authorities, including the World Health Organization, vision-related impairments and disorders are becoming a significant issue. According to a recent report, one of the leading causes of irreversible blindness in persons over the age of 50 is delayed cataract treatment. A cataract is a cloudy spot in the eye's lens that causes visual loss. Cataracts often devel… ▽ More

    Submitted 28 October, 2022; originally announced October 2022.

    Comments: 8 pages, 3 figures

  15. arXiv:2210.08940  [pdf

    cs.NI

    Configured Grant for Ultra-Reliable and Low-Latency Communications: Standardization and Beyond

    Authors: Majid Gerami, Bikramjit Singh

    Abstract: Uplink configured Grant allocation has been introduced in 3rd Generation Partnership Project New Radio Release 15. This is beneficial in supporting Ultra-Reliable and Low Latency Communication for industrial communication, a key Fifth Generation mobile communication usage scenario. This scheduling mechanism enables a user with periodic traffic to transmits its data readily and bypasses the control… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: Accepted in IEEE Communications Standard Magazine 2021, 5 figures

  16. arXiv:2209.11252  [pdf, other

    cs.CL

    XF2T: Cross-lingual Fact-to-Text Generation for Low-Resource Languages

    Authors: Shivprasad Sagare, Tushar Abhishek, Bhavyajeet Singh, Anubhav Sharma, Manish Gupta, Vasudeva Varma

    Abstract: Multiple business scenarios require an automated generation of descriptive human-readable text from structured input data. Hence, fact-to-text generation systems have been developed for various downstream tasks like generating soccer reports, weather and financial reports, medical reports, person biographies, etc. Unfortunately, previous work on fact-to-text (F2T) generation has focused primarily… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.

  17. arXiv:2207.13021  [pdf

    eess.IV cs.CV

    Topological Optimized Convolutional Visual Recurrent Network for Brain Tumor Segmentation and Classification

    Authors: Dhananjay Joshi, Bhupesh Kumar Singh, Kapil Kumar Nagwanshi, Nitin S. Choubey

    Abstract: In today's world of health care, brain tumor detection has become common. However, the manual brain tumor classification approach is time-consuming. So Deep Convolutional Neural Network (DCNN) is used by many researchers in the medical field for making accurate diagnoses and aiding in the patient's treatment. The traditional techniques have problems such as overfitting and the inability to extract… ▽ More

    Submitted 14 July, 2024; v1 submitted 6 June, 2022; originally announced July 2022.

    MSC Class: 68U10 ACM Class: I.4

  18. arXiv:2207.11654  [pdf, other

    cs.NI cs.CR cs.LG cs.PF

    BPFISH: Blockchain and Privacy-preserving FL Inspired Smart Healthcare

    Authors: Moirangthem Biken Singh, Ajay Pratap

    Abstract: This paper proposes Federated Learning (FL) based smart healthcare system where Medical Centers (MCs) train the local model using the data collected from patients and send the model weights to the miners in a blockchain-based robust framework without sharing raw data, keeping privacy preservation into deliberation. We formulate an optimization problem by maximizing the utility and minimizing the l… ▽ More

    Submitted 27 July, 2022; v1 submitted 24 July, 2022; originally announced July 2022.

  19. arXiv:2207.00808  [pdf, other

    physics.ao-ph cs.LG

    On the modern deep learning approaches for precipitation downscaling

    Authors: Bipin Kumar, Kaustubh Atey, Bhupendra Bahadur Singh, Rajib Chattopadhyay, Nachiket Acharya, Manmeet Singh, Ravi S. Nanjundiah, Suryachandra A. Rao

    Abstract: Deep Learning (DL) based downscaling has become a popular tool in earth sciences recently. Increasingly, different DL approaches are being adopted to downscale coarser precipitation data and generate more accurate and reliable estimates at local (~few km or even smaller) scales. Despite several studies adopting dynamical or statistical downscaling of precipitation, the accuracy is limited by the a… ▽ More

    Submitted 2 July, 2022; originally announced July 2022.

    Report number: https://link.springer.com/epdf/10.1007/s12145-023-00970-4?sharing_token=M2LtWJ53pv-LFqv8L_7A9_e4RwlQNchNByi7wbcMAY6_OzYwFMXUC7cAoOcWE6w2ZfADFLxgA09ceiotTMNU3MFJkk4Uz7yDh2Sm_5GVwT31ims1NgmcJlE9PNP5VLG9KcfXtKgbCDXMyShcFb_r1fWMORXAH5iwFTYmyJReRXs%3D

    Journal ref: Earth Science Informatics, 2023

  20. arXiv:2203.15408  [pdf, other

    cs.LG cs.AI cs.CV

    AutoCoMet: Smart Neural Architecture Search via Co-Regulated Shaping Reinforcement

    Authors: Mayukh Das, Brijraj Singh, Harsh Kanti Chheda, Pawan Sharma, Pradeep NS

    Abstract: Designing suitable deep model architectures, for AI-driven on-device apps and features, at par with rapidly evolving mobile hardware and increasingly complex target scenarios is a difficult task. Though Neural Architecture Search (NAS/AutoML) has made this easier by shifting paradigm from extensive manual effort to automated architecture learning from data, yet it has major limitations, leading to… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: ICPR 2022

  21. arXiv:2202.00291  [pdf, other

    cs.CL

    XAlign: Cross-lingual Fact-to-Text Alignment and Generation for Low-Resource Languages

    Authors: Tushar Abhishek, Shivprasad Sagare, Bhavyajeet Singh, Anubhav Sharma, Manish Gupta, Vasudeva Varma

    Abstract: Multiple critical scenarios (like Wikipedia text generation given English Infoboxes) need automated generation of descriptive text in low resource (LR) languages from English fact triples. Previous work has focused on English fact-to-text (F2T) generation. To the best of our knowledge, there has been no previous attempt on cross-lingual alignment or generation for LR languages. Building an effecti… ▽ More

    Submitted 24 April, 2022; v1 submitted 1 February, 2022; originally announced February 2022.

    Comments: Update the code repository and acknowledgement

  22. arXiv:2202.00216  [pdf

    cs.IR cs.CL

    Semantic Annotation and Querying Framework based on Semi-structured Ayurvedic Text

    Authors: Hrishikesh Terdalkar, Arnab Bhattacharya, Madhulika Dubey, Ramamurthy S, Bhavna Naneria Singh

    Abstract: Knowledge bases (KB) are an important resource in a number of natural language processing (NLP) and information retrieval (IR) tasks, such as semantic search, automated question-answering etc. They are also useful for researchers trying to gain information from a text. Unfortunately, however, the state-of-the-art in Sanskrit NLP does not yet allow automated construction of knowledge bases due to u… ▽ More

    Submitted 31 January, 2022; originally announced February 2022.

    Comments: 19 pages including appendix

    Journal ref: n Proceedings of the Computational Sanskrit & Digital Humanities: Selected papers presented at the 18th World Sanskrit Conference, 2023, pages 155--173, Canberra, Australia (Online mode). Association for Computational Linguistics

  23. Sentiment Analysis of Microblogging dataset on Coronavirus Pandemic

    Authors: Nosin Ibna Mahbub, Md Rakibul Islam, Md Al Amin, Md Khairul Islam, Bikash Chandra Singh, Md Imran Hossain Showrov, Anirudda Sarkar

    Abstract: Sentiment analysis can largely influence the people to get the update of the current situation. Coronavirus (COVID-19) is a contagious illness caused by the coronavirus 2 that causes severe respiratory symptoms. The lives of millions have continued to be affected by this pandemic, several countries have resorted to a full lockdown. During this lockdown, people have taken social networks to express… ▽ More

    Submitted 17 November, 2021; originally announced November 2021.

    Comments: 7 pages, 5 figures, 5th IEEE International Conference on Electrical Information and Communication Technology (EICT)

    MSC Class: 68Uxx ACM Class: I.7

    Journal ref: 2021 5th International Conference on Electrical Information and Communication Technology (EICT)

  24. Improved Robustness and Safety for Pre-Adaptation of Meta Reinforcement Learning with Prior Regularization

    Authors: Lu Wen, Songan Zhang, H. Eric Tseng, Baljeet Singh, Dimitar Filev, Huei Peng

    Abstract: Meta Reinforcement Learning (Meta-RL) has seen substantial advancements recently. In particular, off-policy methods were developed to improve the data efficiency of Meta-RL techniques. \textit{Probabilistic embeddings for actor-critic RL} (PEARL) is a leading approach for multi-MDP adaptation problems. A major drawback of many existing Meta-RL methods, including PEARL, is that they do not explicit… ▽ More

    Submitted 9 February, 2023; v1 submitted 18 August, 2021; originally announced August 2021.

  25. arXiv:2106.10430  [pdf, other

    cs.MM cs.CV eess.IV

    Multi-Contextual Design of Convolutional Neural Network for Steganalysis

    Authors: Brijesh Singh, Arijit Sur, Pinaki Mitra

    Abstract: In recent times, deep learning-based steganalysis classifiers became popular due to their state-of-the-art performance. Most deep steganalysis classifiers usually extract noise residuals using high-pass filters as preprocessing steps and feed them to their deep model for classification. It is observed that recent steganographic embedding does not always restrict their embedding in the high-frequen… ▽ More

    Submitted 4 November, 2021; v1 submitted 19 June, 2021; originally announced June 2021.

    Comments: Under Review

  26. arXiv:2105.11097  [pdf, other

    cs.GT cs.CY eess.SP

    Criticality and Utility-aware Fog Computing System for Remote Health Monitoring

    Authors: Moirangthem Biken Singh, Navneet Taunk, Naveen Kumar Mall, Ajay Pratap

    Abstract: Growing remote health monitoring system allows constant monitoring of the patient's condition and performance of preventive and control check-ups outside medical facilities. However, the real-time smart-healthcare application poses a delay constraint that has to be solved efficiently. Fog computing is emerging as an efficient solution for such real-time applications. Moreover, different medical ce… ▽ More

    Submitted 2 April, 2022; v1 submitted 24 May, 2021; originally announced May 2021.

  27. arXiv:2102.05646  [pdf, other

    cs.CV cs.AI

    Scale Normalized Image Pyramids with AutoFocus for Object Detection

    Authors: Bharat Singh, Mahyar Najibi, Abhishek Sharma, Larry S. Davis

    Abstract: We present an efficient foveal framework to perform object detection. A scale normalized image pyramid (SNIP) is generated that, like human vision, only attends to objects within a fixed size range at different scales. Such a restriction of objects' size during training affords better learning of object-sensitive filters, and therefore, results in better accuracy. However, the use of an image pyra… ▽ More

    Submitted 10 February, 2021; originally announced February 2021.

    Comments: Accepted in T-PAMI 2021

  28. arXiv:2009.04635  [pdf

    cs.NI

    Configured Grant for Semi-Deterministic Traffic for Ultra-Reliable and Low Latency Communications

    Authors: Bikramjit Singh, Majid Gerami

    Abstract: Configured Grant-based allocation has been adopted in New Radio 3rd Generation Partnership Project Release 16. This scheme is beneficial in supporting Ultra-Reliable and Low Latency Communication for industrial communication, a key Fifth Generation mobile communication usage scenario. This scheduling mechanism enables a user with a periodic traffic to transmit its data readily and bypasses the sig… ▽ More

    Submitted 9 September, 2020; originally announced September 2020.

    Comments: 6G Wireless Summit 2020, poster paper, 2 pages

  29. arXiv:2008.09803  [pdf, other

    cs.CY

    COVID-19 Pandemic Outbreak in the Subcontinent: A data-driven analysis

    Authors: Bikash Chandra Singh, Zulfikar Alom, Mohammad Muntasir Rahman, Mrinal Kanti Baowaly, Mohammad Abdul Azim

    Abstract: Human civilization is experiencing a critical situation that presents itself for a new coronavirus disease 2019 (COVID-19). This virus emerged in late December 2019 in Wuhan city, Hubei, China. The grim fact of COVID-19 is, it is highly contagious in nature, therefore, spreads rapidly all over the world and causes severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). Responding to the seve… ▽ More

    Submitted 22 August, 2020; originally announced August 2020.

    Comments: 11 pages, 7 figures, Submitted to: Travel Medicine and Infectious Disease

  30. arXiv:2007.09785  [pdf, other

    cs.CV

    ASAP-NMS: Accelerating Non-Maximum Suppression Using Spatially Aware Priors

    Authors: Rohun Tripathi, Vasu Singla, Mahyar Najibi, Bharat Singh, Abhishek Sharma, Larry Davis

    Abstract: The widely adopted sequential variant of Non Maximum Suppression (or Greedy-NMS) is a crucial module for object-detection pipelines. Unfortunately, for the region proposal stage of two/multi-stage detectors, NMS is turning out to be a latency bottleneck due to its sequential nature. In this article, we carefully profile Greedy-NMS iterations to find that a major chunk of computation is wasted in c… ▽ More

    Submitted 21 August, 2020; v1 submitted 19 July, 2020; originally announced July 2020.

    Comments: Under Review at CVIU

  31. arXiv:2006.10547  [pdf

    cs.CV cs.LG eess.IV

    MOSQUITO-NET: A deep learning based CADx system for malaria diagnosis along with model interpretation using GradCam and class activation maps

    Authors: Aayush Kumar, Sanat B Singh, Suresh Chandra Satapathy, Minakhi Rout

    Abstract: Malaria is considered one of the deadliest diseases in today world which causes thousands of deaths per year. The parasites responsible for malaria are scientifically known as Plasmodium which infects the red blood cells in human beings. The parasites are transmitted by a female class of mosquitos known as Anopheles. The diagnosis of malaria requires identification and manual counting of parasitiz… ▽ More

    Submitted 19 June, 2020; v1 submitted 17 June, 2020; originally announced June 2020.

    Comments: arXiv admin note: text overlap with arXiv:2003.09871 by other authors

  32. arXiv:2005.05955  [pdf, other

    cs.LG stat.ML

    RSO: A Gradient Free Sampling Based Approach For Training Deep Neural Networks

    Authors: Rohun Tripathi, Bharat Singh

    Abstract: We propose RSO (random search optimization), a gradient free Markov Chain Monte Carlo search based approach for training deep neural networks. To this end, RSO adds a perturbation to a weight in a deep neural network and tests if it reduces the loss on a mini-batch. If this reduces the loss, the weight is updated, otherwise the existing weight is retained. Surprisingly, we find that repeating this… ▽ More

    Submitted 12 May, 2020; originally announced May 2020.

    Comments: Technical Report

  33. arXiv:2003.07507  [pdf

    cs.CL

    Multi-label natural language processing to identify diagnosis and procedure codes from MIMIC-III inpatient notes

    Authors: A. K. Bhavani Singh, Mounika Guntu, Ananth Reddy Bhimireddy, Judy W. Gichoya, Saptarshi Purkayastha

    Abstract: In the United States, 25% or greater than 200 billion dollars of hospital spending accounts for administrative costs that involve services for medical coding and billing. With the increasing number of patient records, manual assignment of the codes performed is overwhelming, time-consuming and error-prone, causing billing errors. Natural language processing can automate the extraction of codes/lab… ▽ More

    Submitted 16 March, 2020; originally announced March 2020.

    Comments: This is a shortened version of the Capstone Project that was accepted by the Faculty of Indiana University, in partial fulfillment of the requirements for the degree of Master of Science in Health Informatics

  34. arXiv:1912.13000  [pdf, other

    cs.CV cs.LG eess.IV

    Recognizing Instagram Filtered Images with Feature De-stylization

    Authors: Zhe Wu, Zuxuan Wu, Bharat Singh, Larry S. Davis

    Abstract: Deep neural networks have been shown to suffer from poor generalization when small perturbations are added (like Gaussian noise), yet little work has been done to evaluate their robustness to more natural image transformations like photo filters. This paper presents a study on how popular pretrained models are affected by commonly used Instagram filters. To this end, we introduce ImageNet-Instagra… ▽ More

    Submitted 30 December, 2019; originally announced December 2019.

    Comments: Accepted in AAAI 2020 as an oral presentation paper

  35. arXiv:1907.06327  [pdf, other

    cs.CV cs.HC cs.LG eess.IV

    FastV2C-HandNet: Fast Voxel to Coordinate Hand Pose Estimation with 3D Convolutional Neural Networks

    Authors: Rohan Lekhwani, Bhupendra Singh

    Abstract: Hand pose estimation from monocular depth images has been an important and challenging problem in the Computer Vision community. In this paper, we present a novel approach to estimate 3D hand joint locations from 2D depth images. Unlike most of the previous methods, our model captures the 3D spatial information from a depth image thereby giving it a greater understanding of the input. We voxelize… ▽ More

    Submitted 20 February, 2020; v1 submitted 15 July, 2019; originally announced July 2019.

    Comments: 13 pages, 5 figures, 2 tables

  36. arXiv:1906.08834  [pdf

    cs.LG cs.RO eess.SP stat.ML

    Deep Learning in the Automotive Industry: Recent Advances and Application Examples

    Authors: Kanwar Bharat Singh, Mustafa Ali Arat

    Abstract: One of the most exciting technology breakthroughs in the last few years has been the rise of deep learning. State-of-the-art deep learning models are being widely deployed in academia and industry, across a variety of areas, from image analysis to natural language processing. These models have grown from fledgling research subjects to mature techniques in real-world use. The increasing scale of da… ▽ More

    Submitted 24 June, 2019; v1 submitted 20 June, 2019; originally announced June 2019.

  37. arXiv:1905.08617  [pdf, other

    cs.CV cs.AI

    Automatic Long-Term Deception Detection in Group Interaction Videos

    Authors: Chongyang Bai, Maksim Bolonkin, Judee Burgoon, Chao Chen, Norah Dunbar, Bharat Singh, V. S. Subrahmanian, Zhe Wu

    Abstract: Most work on automated deception detection (ADD) in video has two restrictions: (i) it focuses on a video of one person, and (ii) it focuses on a single act of deception in a one or two minute video. In this paper, we propose a new ADD framework which captures long term deception in a group setting. We study deception in the well-known Resistance game (like Mafia and Werewolf) which consists of 5-… ▽ More

    Submitted 15 June, 2019; v1 submitted 15 May, 2019; originally announced May 2019.

    Comments: ICME 2019

  38. arXiv:1905.00125  [pdf, other

    cs.LG eess.SP stat.ML

    Multi-resolution Networks For Flexible Irregular Time Series Modeling (Multi-FIT)

    Authors: Bhanu Pratap Singh, Iman Deznabi, Bharath Narasimhan, Bryon Kucharski, Rheeya Uppaal, Akhila Josyula, Madalina Fiterau

    Abstract: Missing values, irregularly collected samples, and multi-resolution signals commonly occur in multivariate time series data, making predictive tasks difficult. These challenges are especially prevalent in the healthcare domain, where patients' vital signs and electronic records are collected at different frequencies and have occasionally missing information due to the imperfections in equipment or… ▽ More

    Submitted 30 April, 2019; originally announced May 2019.

  39. arXiv:1904.05871  [pdf, other

    cs.CV

    An Analysis of Pre-Training on Object Detection

    Authors: Hengduo Li, Bharat Singh, Mahyar Najibi, Zuxuan Wu, Larry S. Davis

    Abstract: We provide a detailed analysis of convolutional neural networks which are pre-trained on the task of object detection. To this end, we train detectors on large datasets like OpenImagesV4, ImageNet Localization and COCO. We analyze how well their features generalize to tasks like image classification, semantic segmentation and object detection on small datasets like PASCAL-VOC, Caltech-256, SUN-397… ▽ More

    Submitted 11 April, 2019; originally announced April 2019.

  40. arXiv:1902.03570  [pdf, other

    cs.AI cs.CL cs.CV cs.LG

    EvalAI: Towards Better Evaluation Systems for AI Agents

    Authors: Deshraj Yadav, Rishabh Jain, Harsh Agrawal, Prithvijit Chattopadhyay, Taranjeet Singh, Akash Jain, Shiv Baran Singh, Stefan Lee, Dhruv Batra

    Abstract: We introduce EvalAI, an open source platform for evaluating and comparing machine learning (ML) and artificial intelligence algorithms (AI) at scale. EvalAI is built to provide a scalable solution to the research community to fulfill the critical need of evaluating machine learning models and agents acting in an environment against annotations or with a human-in-the-loop. This will help researcher… ▽ More

    Submitted 10 February, 2019; originally announced February 2019.

  41. arXiv:1812.06203  [pdf, other

    cs.CV

    TAN: Temporal Aggregation Network for Dense Multi-label Action Recognition

    Authors: Xiyang Dai, Bharat Singh, Joe Yue-Hei Ng, Larry S. Davis

    Abstract: We present Temporal Aggregation Network (TAN) which decomposes 3D convolutions into spatial and temporal aggregation blocks. By stacking spatial and temporal convolutions repeatedly, TAN forms a deep hierarchical representation for capturing spatio-temporal information in videos. Since we do not apply 3D convolutions in each layer but only apply temporal aggregation blocks once after each spatial… ▽ More

    Submitted 14 December, 2018; originally announced December 2018.

    Comments: WACV 2019

  42. arXiv:1812.05586  [pdf, other

    cs.CV

    FA-RPN: Floating Region Proposals for Face Detection

    Authors: Mahyar Najibi, Bharat Singh, Larry S. Davis

    Abstract: We propose a novel approach for generating region proposals for performing face-detection. Instead of classifying anchor boxes using features from a pixel in the convolutional feature map, we adopt a pooling-based approach for generating region proposals. However, pooling hundreds of thousands of anchors which are evaluated for generating proposals becomes a computational bottleneck during inferen… ▽ More

    Submitted 13 December, 2018; originally announced December 2018.

  43. arXiv:1812.01600  [pdf, other

    cs.CV

    AutoFocus: Efficient Multi-Scale Inference

    Authors: Mahyar Najibi, Bharat Singh, Larry S. Davis

    Abstract: This paper describes AutoFocus, an efficient multi-scale inference algorithm for deep-learning based object detectors. Instead of processing an entire image pyramid, AutoFocus adopts a coarse to fine approach and only processes regions which are likely to contain small objects at finer scales. This is achieved by predicting category agnostic segmentation maps for small objects at coarser scales, c… ▽ More

    Submitted 1 August, 2019; v1 submitted 4 December, 2018; originally announced December 2018.

    Comments: To appear in Proceedings of International Conference on Computer Vision (ICCV), 2019

  44. arXiv:1810.08305  [pdf, other

    cs.LG stat.ML

    Open Vocabulary Learning on Source Code with a Graph-Structured Cache

    Authors: Milan Cvitkovic, Badal Singh, Anima Anandkumar

    Abstract: Machine learning models that take computer program source code as input typically use Natural Language Processing (NLP) techniques. However, a major challenge is that code is written using an open, rapidly changing vocabulary due to, e.g., the coinage of new variable and method names. Reasoning over such a vocabulary is not something for which most NLP methods are designed. We introduce a Graph-St… ▽ More

    Submitted 19 May, 2019; v1 submitted 18 October, 2018; originally announced October 2018.

    Comments: Published in the International Conference on Machine Learning (ICML 2019), 13 pages

  45. arXiv:1809.00310  [pdf, other

    cs.IR cs.SI

    A Datamining Approach for Emotions Extraction and Discovering Cricketers performance from Stadium to Sensex

    Authors: Amit Agarwal, Brijraj Singh, Jatin Bedi, Durga Toshniwal

    Abstract: Microblogging sites are the direct platform for the users to express their views. It has been observed from previous studies that people are viable to flaunt their emotions for events (eg. natural catastrophes, sports, academics etc.), for persons (actor/actress, sports person, scientist) and for the places they visit. In this study we focused on a sport event, particularly the cricket tournament… ▽ More

    Submitted 2 September, 2018; originally announced September 2018.

    Comments: Accepted as a Workshop paper at WSDM 2018

  46. Reduction of Redundant Rules in Association Rule Mining-Based Bug Assignment

    Authors: Meera Sharma, Abhishek Tandon, Madhu Kumari, V B Singh

    Abstract: Bug triaging is a process to decide what to do with newly coming bug reports. In this paper, we have mined association rules for the prediction of bug assignee of a newly reported bug using different bug attributes, namely, severity, priority, component and operating system. To deal with the problem of large data sets, we have taken subsets of data set by dividing the large data set using K-means… ▽ More

    Submitted 23 July, 2018; originally announced July 2018.

    Comments: 14 pages

    Journal ref: International Journal of Reliability, Quality and Safety Engineering Vol. 24, No. 6 (2017) 1740005 (14 pages) World Scientific Publishing Company

  47. arXiv:1806.06986  [pdf, other

    cs.CV

    Soft Sampling for Robust Object Detection

    Authors: Zhe Wu, Navaneeth Bodla, Bharat Singh, Mahyar Najibi, Rama Chellappa, Larry S. Davis

    Abstract: We study the robustness of object detection under the presence of missing annotations. In this setting, the unlabeled object instances will be treated as background, which will generate an incorrect training signal for the detector. Interestingly, we observe that after dropping 30% of the annotations (and labeling them as background), the performance of CNN-based object detectors like Faster-RCNN… ▽ More

    Submitted 21 July, 2019; v1 submitted 18 June, 2018; originally announced June 2018.

    Comments: Accepted in BMVC 2019

  48. arXiv:1805.09300  [pdf, other

    cs.CV

    SNIPER: Efficient Multi-Scale Training

    Authors: Bharat Singh, Mahyar Najibi, Larry S. Davis

    Abstract: We present SNIPER, an algorithm for performing efficient multi-scale training in instance level visual recognition tasks. Instead of processing every pixel in an image pyramid, SNIPER processes context regions around ground-truth instances (referred to as chips) at the appropriate scale. For background sampling, these context-regions are generated using proposals extracted from a region proposal n… ▽ More

    Submitted 13 December, 2018; v1 submitted 23 May, 2018; originally announced May 2018.

    Comments: Presented at the NIPS 2018 conference

  49. arXiv:1712.04415  [pdf, other

    cs.AI cs.CV

    Deception Detection in Videos

    Authors: Zhe Wu, Bharat Singh, Larry S. Davis, V. S. Subrahmanian

    Abstract: We present a system for covert automated deception detection in real-life courtroom trial videos. We study the importance of different modalities like vision, audio and text for this task. On the vision side, our system uses classifiers trained on low level video features which predict human micro-expressions. We show that predictions of high-level micro-expressions can be used as features for dec… ▽ More

    Submitted 12 December, 2017; originally announced December 2017.

    Comments: AAAI 2018, project page: https://doubaibai.github.io/DARE/

  50. arXiv:1712.01802  [pdf, other

    cs.CV

    R-FCN-3000 at 30fps: Decoupling Detection and Classification

    Authors: Bharat Singh, Hengduo Li, Abhishek Sharma, Larry S. Davis

    Abstract: We present R-FCN-3000, a large-scale real-time object detector in which objectness detection and classification are decoupled. To obtain the detection score for an RoI, we multiply the objectness score with the fine-grained classification score. Our approach is a modification of the R-FCN architecture in which position-sensitive filters are shared across different object classes for performing loc… ▽ More

    Submitted 5 December, 2017; originally announced December 2017.

    Comments: CVPR 2018 submission