Skip to main content

Showing 1–50 of 107 results for author: Tiwari, A

  1. arXiv:2407.11661  [pdf, other

    cs.PL cs.SE

    Challenges of Multilingual Program Specification and Analysis

    Authors: Carlo A. Furia, Abhishek Tiwari

    Abstract: Multilingual programs, whose implementations are made of different languages, are gaining traction especially in domains, such as web programming, that particularly benefit from the additional flexibility brought by using multiple languages. In this paper, we discuss the impact that the features commonly used in multilingual programming have on our capability of specifying and analyzing them. To t… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

  2. arXiv:2407.09294  [pdf, other

    cs.CV

    SS-SfP:Neural Inverse Rendering for Self Supervised Shape from (Mixed) Polarization

    Authors: Ashish Tiwari, Shanmuganathan Raman

    Abstract: We present a novel inverse rendering-based framework to estimate the 3D shape (per-pixel surface normals and depth) of objects and scenes from single-view polarization images, the problem popularly known as Shape from Polarization (SfP). The existing physics-based and learning-based methods for SfP perform under certain restrictions, i.e., (a) purely diffuse or purely specular reflections, which a… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: Published in Pacific Graphics 2023

  3. arXiv:2407.05255  [pdf, other

    cs.CV

    Estimation of the Area and Precipitation Associated with a Tropical Cyclone Biparjoy by using Image Processing

    Authors: Shikha Verma, Kuldeep Srivastava, Akhilesh Tiwari, Shekhar Verma

    Abstract: The rainfall associated with Topical Cyclone(TC) contributes a major amount to the annual rainfall in India. Due to the limited research on the quantitative precipitation associated with Tropical Cyclones (TC), the prediction of the amount of precipitation and area that it may cover remains a challenge. This paper proposes an approach to estimate the accumulated precipitation and impact on affecte… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

  4. arXiv:2405.11181  [pdf, other

    cs.AI cs.CL

    Towards Knowledge-Infused Automated Disease Diagnosis Assistant

    Authors: Mohit Tomar, Abhisek Tiwari, Sriparna Saha

    Abstract: With the advancement of internet communication and telemedicine, people are increasingly turning to the web for various healthcare activities. With an ever-increasing number of diseases and symptoms, diagnosing patients becomes challenging. In this work, we build a diagnosis assistant to assist doctors, which identifies diseases based on patient-doctor interaction. During diagnosis, doctors utiliz… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

  5. arXiv:2405.03099  [pdf, other

    cs.CV

    SketchGPT: Autoregressive Modeling for Sketch Generation and Recognition

    Authors: Adarsh Tiwari, Sanket Biswas, Josep Lladós

    Abstract: We present SketchGPT, a flexible framework that employs a sequence-to-sequence autoregressive model for sketch generation, and completion, and an interpretation case study for sketch recognition. By mapping complex sketches into simplified sequences of abstract primitives, our approach significantly streamlines the input for autoregressive modeling. SketchGPT leverages the next token prediction ob… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

    Comments: Accepted in ICDAR 2024

  6. arXiv:2403.03276  [pdf, other

    eess.SP cs.AI cs.LG

    ARNN: Attentive Recurrent Neural Network for Multi-channel EEG Signals to Identify Epileptic Seizures

    Authors: Salim Rukhsar, Anil Kumar Tiwari

    Abstract: We proposed an Attentive Recurrent Neural Network (ARNN), which recurrently applies attention layers along a sequence and has linear complexity with respect to the sequence length. The proposed model operates on multi-channel EEG signals rather than single channel signals and leverages parallel computation. In this cell, the attention layer is a computational unit that efficiently applies self-att… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 9 pages, 7 figures, Journal Paper

  7. arXiv:2402.01758  [pdf, other

    cs.CY cs.AI cs.CL

    Aalap: AI Assistant for Legal & Paralegal Functions in India

    Authors: Aman Tiwari, Prathamesh Kalamkar, Atreyo Banerjee, Saurabh Karn, Varun Hemachandran, Smita Gupta

    Abstract: Using proprietary Large Language Models on legal tasks poses challenges due to data privacy issues, domain data heterogeneity, domain knowledge sophistication, and domain objectives uniqueness. We created Aalalp, a fine-tuned Mistral 7B model on instructions data related to specific Indian legal tasks. The performance of Aalap is better than gpt-3.5-turbo in 31\% of our test data and obtains an eq… ▽ More

    Submitted 30 January, 2024; originally announced February 2024.

  8. arXiv:2401.06807  [pdf, other

    cs.CL cs.AI

    An EcoSage Assistant: Towards Building A Multimodal Plant Care Dialogue Assistant

    Authors: Mohit Tomar, Abhisek Tiwari, Tulika Saha, Prince Jha, Sriparna Saha

    Abstract: In recent times, there has been an increasing awareness about imminent environmental challenges, resulting in people showing a stronger dedication to taking care of the environment and nurturing green life. The current $19.6 billion indoor gardening industry, reflective of this growing sentiment, not only signifies a monetary value but also speaks of a profound human desire to reconnect with the n… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

  9. arXiv:2401.05134  [pdf, other

    cs.AI cs.CL

    Yes, this is what I was looking for! Towards Multi-modal Medical Consultation Concern Summary Generation

    Authors: Abhisek Tiwari, Shreyangshu Bera, Sriparna Saha, Pushpak Bhattacharyya, Samrat Ghosh

    Abstract: Over the past few years, the use of the Internet for healthcare-related tasks has grown by leaps and bounds, posing a challenge in effectively managing and processing information to ensure its efficient utilization. During moments of emotional turmoil and psychological challenges, we frequently turn to the internet as our initial source of support, choosing this over discussing our feelings with o… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

  10. arXiv:2312.10553  [pdf, other

    cs.LG

    Machine Learning-Enhanced Prediction of Surface Smoothness for Inertial Confinement Fusion Target Polishing Using Limited Data

    Authors: Antonios Alexos, Junze Liu, Akash Tiwari, Kshitij Bhardwaj, Sean Hayes, Pierre Baldi, Satish Bukkapatnam, Suhas Bhandarkar

    Abstract: In Inertial Confinement Fusion (ICF) process, roughly a 2mm spherical shell made of high density carbon is used as target for laser beams, which compress and heat it to energy levels needed for high fusion yield. These shells are polished meticulously to meet the standards for a fusion shot. However, the polishing of these shells involves multiple stages, with each stage taking several hours. To m… ▽ More

    Submitted 16 December, 2023; originally announced December 2023.

    Comments: Accepted as Extended Abstract in AIM 2024

  11. arXiv:2311.11662  [pdf, other

    cs.CV

    Enhanced Spatio-Temporal Context for Temporally Consistent Robust 3D Human Motion Recovery from Monocular Videos

    Authors: Sushovan Chanda, Amogh Tiwari, Lokender Tiwari, Brojeshwar Bhowmick, Avinash Sharma, Hrishav Barua

    Abstract: Recovering temporally consistent 3D human body pose, shape and motion from a monocular video is a challenging task due to (self-)occlusions, poor lighting conditions, complex articulated body poses, depth ambiguity, and limited availability of annotated data. Further, doing a simple perframe estimation is insufficient as it leads to jittery and implausible results. In this paper, we propose a nove… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  12. arXiv:2310.16164  [pdf, other

    cs.HC

    Conversational Challenges in AI-Powered Data Science: Obstacles, Needs, and Design Opportunities

    Authors: Bhavya Chopra, Ananya Singha, Anna Fariha, Sumit Gulwani, Chris Parnin, Ashish Tiwari, Austin Z. Henley

    Abstract: Large Language Models (LLMs) are being increasingly employed in data science for tasks like data preprocessing and analytics. However, data scientists encounter substantial obstacles when conversing with LLM-powered chatbots and acting on their suggestions and answers. We conducted a mixed-methods study, including contextual observations, semi-structured interviews (n=14), and a survey (n=114), to… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: 24 pages, 8 figures

  13. arXiv:2310.05380  [pdf, other

    cs.IR cs.LG

    Augmented Embeddings for Custom Retrievals

    Authors: Anirudh Khatry, Yasharth Bajpai, Priyanshu Gupta, Sumit Gulwani, Ashish Tiwari

    Abstract: Information retrieval involves selecting artifacts from a corpus that are most relevant to a given search query. The flavor of retrieval typically used in classical applications can be termed as homogeneous and relaxed, where queries and corpus elements are both natural language (NL) utterances (homogeneous) and the goal is to pick most relevant elements from the corpus in the Top-K, where K is la… ▽ More

    Submitted 8 October, 2023; originally announced October 2023.

    Comments: 14 pages

    ACM Class: I.2.6

  14. Experience and Evidence are the eyes of an excellent summarizer! Towards Knowledge Infused Multi-modal Clinical Conversation Summarization

    Authors: Abhisek Tiwari, Anisha Saha, Sriparna Saha, Pushpak Bhattacharyya, Minakshi Dhar

    Abstract: With the advancement of telemedicine, both researchers and medical practitioners are working hand-in-hand to develop various techniques to automate various medical operations, such as diagnosis report generation. In this paper, we first present a multi-modal clinical conversation summary generation task that takes a clinician-patient interaction (both textual and visual information) and generates… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

  15. arXiv:2309.12436  [pdf, other

    cs.DB

    Rapidash: Efficient Constraint Discovery via Rapid Verification

    Authors: Zifan Liu, Shaleen Deep, Anna Fariha, Fotis Psallidas, Ashish Tiwari, Avrilia Floratou

    Abstract: Denial Constraint (DC) is a well-established formalism that captures a wide range of integrity constraints commonly encountered, including candidate keys, functional dependencies, and ordering constraints, among others. Given their significance, there has been considerable research interest in achieving fast verification and discovery of exact DCs within the database community. Despite the signifi… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: comments and suggestions are welcome!

  16. arXiv:2309.05804  [pdf, other

    cs.CL

    Hi Model, generating 'nice' instead of 'good' is not as bad as generating 'rice'! Towards Context and Semantic Infused Dialogue Generation Loss Function and Evaluation Metric

    Authors: Abhisek Tiwari, Muhammed Sinan, Kaushik Roy, Amit Sheth, Sriparna Saha, Pushpak Bhattacharyya

    Abstract: Over the past two decades, dialogue modeling has made significant strides, moving from simple rule-based responses to personalized and persuasive response generation. However, despite these advancements, the objective functions and evaluation metrics for dialogue generation have remained stagnant. These lexical-based metrics, e.g., cross-entropy and BLEU, have two key limitations: (a) word-to-word… ▽ More

    Submitted 29 May, 2024; v1 submitted 11 September, 2023; originally announced September 2023.

  17. arXiv:2308.10995  [pdf, ps, other

    physics.ao-ph cs.LG

    Deep Learning Techniques in Extreme Weather Events: A Review

    Authors: Shikha Verma, Kuldeep Srivastava, Akhilesh Tiwari, Shekhar Verma

    Abstract: Extreme weather events pose significant challenges, thereby demanding techniques for accurate analysis and precise forecasting to mitigate its impact. In recent years, deep learning techniques have emerged as a promising approach for weather forecasting and understanding the dynamics of extreme weather events. This review aims to provide a comprehensive overview of the state-of-the-art deep learni… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

  18. arXiv:2308.02225  [pdf, other

    cs.CV

    Deep Semantic Model Fusion for Ancient Agricultural Terrace Detection

    Authors: Yi Wang, Chenying Liu, Arti Tiwari, Micha Silver, Arnon Karnieli, Xiao Xiang Zhu, Conrad M Albrecht

    Abstract: Discovering ancient agricultural terraces in desert regions is important for the monitoring of long-term climate changes on the Earth's surface. However, traditional ground surveys are both costly and limited in scale. With the increasing accessibility of aerial and satellite data, machine learning techniques bear large potential for the automatic detection and recognition of archaeological landsc… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

    Comments: IEEE Big Data 2022 workshop on Digital Twins for Accelerated Discovery of Climate & Sustainability Solutions (ADoCS)

  19. arXiv:2308.00705  [pdf

    cs.DL cs.AI cs.CY

    A Bibliographic Study on Artificial Intelligence Research: Global Panorama and Indian Appearance

    Authors: Amit Tiwari, Susmita Bardhan, Vikas Kumar

    Abstract: The present study identifies and assesses the bibliographic trend in Artificial Intelligence (AI) research for the years 2015-2020 using the science mapping method of bibliometric study. The required data has been collected from the Scopus database. To make the collected data analysis-ready, essential data transformation was performed manually and with the help of a tool viz. OpenRefine. For deter… ▽ More

    Submitted 4 July, 2023; originally announced August 2023.

    Comments: 21 pages, 9 figures, 6 tables

  20. Ensemble Framework for Cardiovascular Disease Prediction

    Authors: Achyut Tiwari, Aryan Chugh, Aman Sharma

    Abstract: Heart disease is the major cause of non-communicable and silent death worldwide. Heart diseases or cardiovascular diseases are classified into four types: coronary heart disease, heart failure, congenital heart disease, and cardiomyopathy. It is vital to diagnose heart disease early and accurately in order to avoid further injury and save patients' lives. As a result, we need a system that can pre… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

    Journal ref: Computers in Biology and Medicine Volume 146, July 2022, 105624

  21. arXiv:2305.14129  [pdf, other

    cs.SE cs.LG

    GrACE: Generation using Associated Code Edits

    Authors: Priyanshu Gupta, Avishree Khare, Yasharth Bajpai, Saikat Chakraborty, Sumit Gulwani, Aditya Kanade, Arjun Radhakrishna, Gustavo Soares, Ashish Tiwari

    Abstract: Developers expend a significant amount of time in editing code for a variety of reasons such as bug fixing or adding new features. Designing effective methods to predict code edits has been an active yet challenging area of research due to the diversity of code edits and the difficulty of capturing the developer intent. In this work, we address these challenges by endowing pre-trained large langua… ▽ More

    Submitted 20 September, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

  22. arXiv:2305.11581  [pdf

    cs.AI econ.GN

    Trustworthy, responsible, ethical AI in manufacturing and supply chains: synthesis and emerging research questions

    Authors: Alexandra Brintrup, George Baryannis, Ashutosh Tiwari, Svetan Ratchev, Giovanna Martinez-Arellano, Jatinder Singh

    Abstract: While the increased use of AI in the manufacturing sector has been widely noted, there is little understanding on the risks that it may raise in a manufacturing organisation. Although various high level frameworks and definitions have been proposed to consolidate potential risks, practitioners struggle with understanding and implementing them. This lack of understanding exposes manufacturing to… ▽ More

    Submitted 19 May, 2023; originally announced May 2023.

    Comments: Pre-print under peer-review

  23. arXiv:2305.07552  [pdf, other

    cs.CV cs.AI cs.CY

    Dish detection in food platters: A framework for automated diet logging and nutrition management

    Authors: Mansi Goel, Shashank Dargar, Shounak Ghatak, Nidhi Verma, Pratik Chauhan, Anushka Gupta, Nikhila Vishnumolakala, Hareesh Amuru, Ekta Gambhir, Ronak Chhajed, Meenal Jain, Astha Jain, Samiksha Garg, Nitesh Narwade, Nikhilesh Verhwani, Abhuday Tiwari, Kirti Vashishtha, Ganesh Bagler

    Abstract: Diet is central to the epidemic of lifestyle disorders. Accurate and effortless diet logging is one of the significant bottlenecks for effective diet management and calorie restriction. Dish detection from food platters is a challenging problem due to a visually complex food layout. We present an end-to-end computational framework for diet management, from data compilation, annotation, and state-o… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

    Comments: 11 pages, 5 figures, 5 tables. Submitted to the 8th International Conference on Computer Vision & Image Processing (CVIP-2023)

    ACM Class: I.4.9; I.5.4; J.3

  24. arXiv:2305.04325  [pdf, other

    eess.SP cs.LG

    Lightweight Convolution Transformer for Cross-patient Seizure Detection in Multi-channel EEG Signals

    Authors: Salim Rukhsar, Anil K. Tiwari

    Abstract: Background: Epilepsy is a neurological illness affecting the brain that makes people more likely to experience frequent, spontaneous seizures. There has to be an accurate automated method for measuring seizure frequency and severity in order to assess the efficacy of pharmacological therapy for epilepsy. The drug quantities are often derived from patient reports which may cause significant issues… ▽ More

    Submitted 7 May, 2023; originally announced May 2023.

    Comments: The paper is under review in Neural Network, Elsevier

  25. arXiv:2305.03916  [pdf, other

    cs.SE cs.PL

    Unifying Pointer Analyses for Polyglot Inter-operations through Summary Specialization

    Authors: Jyoti Prakash, Abhishek Tiwari, Christian Hammer

    Abstract: Modular analysis of polyglot applications is challenging because heap object flows across language boundaries must be resolved. The state-of-the-art analyses for polyglot applications have two fundamental limitations. First, they assume explicit boundaries between the host and the guest language to determine inter-language dataflows. Second, they rely on specific analyses of the host and guest lan… ▽ More

    Submitted 5 May, 2023; originally announced May 2023.

  26. arXiv:2305.01598  [pdf, other

    cs.DB cs.AI cs.HC

    From Words to Code: Harnessing Data for Program Synthesis from Natural Language

    Authors: Anirudh Khatry, Joyce Cahoon, Jordan Henkel, Shaleen Deep, Venkatesh Emani, Avrilia Floratou, Sumit Gulwani, Vu Le, Mohammad Raza, Sherry Shi, Mukul Singh, Ashish Tiwari

    Abstract: Creating programs to correctly manipulate data is a difficult task, as the underlying programming languages and APIs can be challenging to learn for many users who are not skilled programmers. Large language models (LLMs) demonstrate remarkable potential for generating code from natural language, but in the data manipulation domain, apart from the natural language (NL) description of the intended… ▽ More

    Submitted 3 May, 2023; v1 submitted 2 May, 2023; originally announced May 2023.

    Comments: 14 pages

  27. arXiv:2304.09548  [pdf, other

    cs.CL cs.AI cs.LG

    SemEval 2023 Task 6: LegalEval - Understanding Legal Texts

    Authors: Ashutosh Modi, Prathamesh Kalamkar, Saurabh Karn, Aman Tiwari, Abhinav Joshi, Sai Kiran Tanikella, Shouvik Kumar Guha, Sachin Malhan, Vivek Raghavan

    Abstract: In populous countries, pending legal cases have been growing exponentially. There is a need for developing NLP-based techniques for processing and automatically understanding legal documents. To promote research in the area of Legal NLP we organized the shared task LegalEval - Understanding Legal Texts at SemEval 2023. LegalEval task has three sub-tasks: Task-A (Rhetorical Roles Labeling) is about… ▽ More

    Submitted 1 May, 2023; v1 submitted 19 April, 2023; originally announced April 2023.

    Comments: 13 Pages (9 Pages + References), Accepted at SemEval 2023 at ACL 2023

  28. arXiv:2304.01908  [pdf

    cs.LG cs.CR

    Leveraging Deep Learning Approaches for Deepfake Detection: A Review

    Authors: Aniruddha Tiwari, Rushit Dave, Mounika Vanamala

    Abstract: Conspicuous progression in the field of machine learning and deep learning have led the jump of highly realistic fake media, these media oftentimes referred as deepfakes. Deepfakes are fabricated media which are generated by sophisticated AI that are at times very difficult to set apart from the real media. So far, this media can be uploaded to the various social media platforms, hence advertising… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

  29. arXiv:2212.03800  [pdf, other

    cs.LG stat.ML

    Unsupervised spectral-band feature identification for optimal process discrimination

    Authors: Akash Tiwari, Satish Bukkapatnam

    Abstract: Changes in real-world dynamic processes are often described in terms of differences in energies $\textbf{E}(\underlineα)$ of a set of spectral-bands $\underlineα$. Given continuous spectra of two classes $A$ and $B$, or in general, two stochastic processes $S^{(A)}(f)$ and $S^{(B)}(f)$, $f \in \mathbb{R}^+$, we address the ubiquitous problem of identifying a subset of intervals of $f$ called spect… ▽ More

    Submitted 7 December, 2022; originally announced December 2022.

  30. arXiv:2211.03442  [pdf, other

    cs.CL cs.AI

    Named Entity Recognition in Indian court judgments

    Authors: Prathamesh Kalamkar, Astha Agarwal, Aman Tiwari, Smita Gupta, Saurabh Karn, Vivek Raghavan

    Abstract: Identification of named entities from legal texts is an essential building block for developing other legal Artificial Intelligence applications. Named Entities in legal texts are slightly different and more fine-grained than commonly used named entities like Person, Organization, Location etc. In this paper, we introduce a new corpus of 46545 annotated legal named entities mapped to 14 legal enti… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: to be published in NLLP 2022 Workshop at EMNLP

  31. arXiv:2211.00565  [pdf, other

    cs.LG

    Revisiting Heterophily in Graph Convolution Networks by Learning Representations Across Topological and Feature Spaces

    Authors: Ashish Tiwari, Sresth Tosniwal, Shanmuganathan Raman

    Abstract: Graph convolution networks (GCNs) have been enormously successful in learning representations over several graph-based machine learning tasks. Specific to learning rich node representations, most of the methods have solely relied on the homophily assumption and have shown limited performance on the heterophilous graphs. While several methods have been developed with new architectures to address he… ▽ More

    Submitted 2 November, 2022; v1 submitted 1 November, 2022; originally announced November 2022.

    Comments: Under Review Project Page: https://sites.google.com/iitgn.ac.in/hetgcn/home

  32. arXiv:2210.15965  [pdf

    cs.AI cs.SE

    System Network Analytics: Evolution and Stable Rules of a State Series

    Authors: Animesh Chaturvedi, Aruna Tiwari, Nicolas Spyratos

    Abstract: System Evolution Analytics on a system that evolves is a challenge because it makes a State Series SS = {S1, S2... SN} (i.e., a set of states ordered by time) with several inter-connected entities changing over time. We present stability characteristics of interesting evolution rules occurring in multiple states. We defined an evolution rule with its stability as the fraction of states in which th… ▽ More

    Submitted 28 October, 2022; originally announced October 2022.

    Comments: Accepted on IEEE DSAA and Video Presentation https://www.youtube.com/watch?v=ohOeTXoI-IY&list=PLtvWi5o3JBnF3yxcjGdT4KCDLxRBIpsyR

    Journal ref: IEEE 9th International Conference on Data Science and Advanced Analytics (DSAA), October 13-16, 2022, Shenzhen, China. IEEE, 2022. (Core A)

  33. arXiv:2209.11972  [pdf, other

    cs.CV

    Ground then Navigate: Language-guided Navigation in Dynamic Scenes

    Authors: Kanishk Jain, Varun Chhangani, Amogh Tiwari, K. Madhava Krishna, Vineet Gandhi

    Abstract: We investigate the Vision-and-Language Navigation (VLN) problem in the context of autonomous driving in outdoor settings. We solve the problem by explicitly grounding the navigable regions corresponding to the textual command. At each timestamp, the model predicts a segmentation mask corresponding to the intermediate or the final navigable region. Our work contrasts with existing efforts in VLN, w… ▽ More

    Submitted 24 September, 2022; originally announced September 2022.

  34. arXiv:2208.11066  [pdf, other

    cs.NE

    Enhanced Opposition Differential Evolution Algorithm for Multimodal Optimization

    Authors: Shatendra Singh, Aruna Tiwari

    Abstract: Most of the real-world problems are multimodal in nature that consists of multiple optimum values. Multimodal optimization is defined as the process of finding multiple global and local optima (as opposed to a single solution) of a function. It enables a user to switch between different solutions as per the need while still maintaining the optimal system performance. Classical gradient-based metho… ▽ More

    Submitted 23 August, 2022; originally announced August 2022.

  35. arXiv:2208.01968  [pdf, other

    cs.CR cs.SE

    Our fingerprints don't fade from the Apps we touch: Fingerprinting the Android WebView

    Authors: Abhishek Tiwari, Jyoti Prakash, Alimerdan Rahimov, Christian Hammer

    Abstract: Numerous studies demonstrated that browser fingerprinting is detrimental to users' security and privacy. However, little is known about the effects of browser fingerprinting on Android hybrid apps -- where a stripped-down Chromium browser is integrated into an app. These apps expand the attack surface by employing two-way communication between native apps and the web. This paper studies the impact… ▽ More

    Submitted 3 August, 2022; originally announced August 2022.

  36. arXiv:2207.12456  [pdf, other

    cs.PL cs.AI cs.SE

    Overwatch: Learning Patterns in Code Edit Sequences

    Authors: Yuhao Zhang, Yasharth Bajpai, Priyanshu Gupta, Ameya Ketkar, Miltiadis Allamanis, Titus Barik, Sumit Gulwani, Arjun Radhakrishna, Mohammad Raza, Gustavo Soares, Ashish Tiwari

    Abstract: Integrated Development Environments (IDEs) provide tool support to automate many source code editing tasks. Traditionally, IDEs use only the spatial context, i.e., the location where the developer is editing, to generate candidate edit recommendations. However, spatial context alone is often not sufficient to confidently predict the developer's next edit, and thus IDEs generate many suggestions at… ▽ More

    Submitted 25 July, 2022; originally announced July 2022.

    Comments: 25 pages, 7 Figures, 4 Algorithms, 3 Tables

  37. arXiv:2207.11765  [pdf, other

    cs.SE cs.AI

    Neurosymbolic Repair for Low-Code Formula Languages

    Authors: Rohan Bavishi, Harshit Joshi, José Pablo Cambronero Sánchez, Anna Fariha, Sumit Gulwani, Vu Le, Ivan Radicek, Ashish Tiwari

    Abstract: Most users of low-code platforms, such as Excel and PowerApps, write programs in domain-specific formula languages to carry out nontrivial tasks. Often users can write most of the program they want, but introduce small mistakes that yield broken formulas. These mistakes, which can be both syntactic and semantic, are hard for low-code users to identify and fix, even though they can be resolved with… ▽ More

    Submitted 24 July, 2022; originally announced July 2022.

  38. arXiv:2207.02025  [pdf, other

    cs.CV

    DeepPS2: Revisiting Photometric Stereo Using Two Differently Illuminated Images

    Authors: Ashish Tiwari, Shanmuganathan Raman

    Abstract: Photometric stereo, a problem of recovering 3D surface normals using images of an object captured under different lightings, has been of great interest and importance in computer vision research. Despite the success of existing traditional and deep learning-based methods, it is still challenging due to: (i) the requirement of three or more differently illuminated images, (ii) the inability to mode… ▽ More

    Submitted 30 August, 2022; v1 submitted 5 July, 2022; originally announced July 2022.

    Comments: Accepted in ECCV 2022 Project Page: https://sites.google.com/iitgn.ac.in/deepps2/home

  39. arXiv:2204.11835  [pdf, other

    q-bio.QM cs.AI cs.LG

    A Novel Scalable Apache Spark Based Feature Extraction Approaches for Huge Protein Sequence and their Clustering Performance Analysis

    Authors: Preeti Jha, Aruna Tiwari, Neha Bharill, Milind Ratnaparkhe, Om Prakash Patel, Nilagiri Harshith, Mukkamalla Mounika, Neha Nagendra

    Abstract: Genome sequencing projects are rapidly increasing the number of high-dimensional protein sequence datasets. Clustering a high-dimensional protein sequence dataset using traditional machine learning approaches poses many challenges. Many different feature extraction methods exist and are widely used. However, extracting features from millions of protein sequences becomes impractical because they ar… ▽ More

    Submitted 21 April, 2022; originally announced April 2022.

  40. arXiv:2203.16960  [pdf, other

    cs.MA cs.RO

    Multi-Agent Spatial Predictive Control with Application to Drone Flocking (Extended Version)

    Authors: Andreas Brandstätter, Scott A. Smolka, Scott D. Stoller, Ashish Tiwari, Radu Grosu

    Abstract: We introduce the novel concept of Spatial Predictive Control (SPC) to solve the following problem: given a collection of agents (e.g., drones) with positional low-level controllers (LLCs) and a mission-specific distributed cost function, how can a distributed controller achieve and maintain cost-function minimization without a plant model and only positional observations of the environment? Our fu… ▽ More

    Submitted 31 March, 2022; originally announced March 2022.

  41. arXiv:2203.05367  [pdf, other

    cs.CR

    TIDF-DLPM: Term and Inverse Document Frequency based Data Leakage Prevention Model

    Authors: Ishu Gupta, Sloni Mittal, Ankit Tiwari, Priya Agarwal, Ashutosh Kumar Singh

    Abstract: Confidentiality of the data is being endangered as it has been categorized into false categories which might get leaked to an unauthorized party. For this reason, various organizations are mainly implementing data leakage prevention systems (DLPs). Firewalls and intrusion detection systems are being outdated versions of security mechanisms. The data which are being used, in sending state or are re… ▽ More

    Submitted 10 March, 2022; originally announced March 2022.

  42. GENOME: A GENeric methodology for Ontological Modelling of Epics

    Authors: Udaya Varadarajan, Mayukh Bagchi, Amit Tiwari, M. P. Satija

    Abstract: Ontological knowledge modelling of epics, though being an established research arena backed by concrete multilingual and multicultural works, still suffer from two key shortcomings. Firstly, all epic ontological models developed till date have been designed following ad-hoc methodologies, most often, combining existing general purpose ontology development methodologies. Secondly, none of the ad-ho… ▽ More

    Submitted 13 February, 2022; originally announced February 2022.

  43. arXiv:2201.13125  [pdf, other

    cs.CL cs.AI cs.LG

    Corpus for Automatic Structuring of Legal Documents

    Authors: Prathamesh Kalamkar, Aman Tiwari, Astha Agarwal, Saurabh Karn, Smita Gupta, Vivek Raghavan, Ashutosh Modi

    Abstract: In populous countries, pending legal cases have been growing exponentially. There is a need for developing techniques for processing and organizing legal documents. In this paper, we introduce a new corpus for structuring legal documents. In particular, we introduce a corpus of legal judgment documents in English that are segmented into topical and coherent parts. Each of these parts is annotated… ▽ More

    Submitted 19 September, 2022; v1 submitted 31 January, 2022; originally announced January 2022.

    Comments: Accepted at LREC 2022, 10 Pages (8 page main paper + 2 page references)

  44. arXiv:2201.11227  [pdf, other

    cs.LG cs.PL

    Synchromesh: Reliable code generation from pre-trained language models

    Authors: Gabriel Poesia, Oleksandr Polozov, Vu Le, Ashish Tiwari, Gustavo Soares, Christopher Meek, Sumit Gulwani

    Abstract: Large pre-trained language models have been used to generate code,providing a flexible interface for synthesizing programs from natural language specifications. However, they often violate syntactic and semantic rules of their output language, limiting their practical usability. In this paper, we propose Synchromesh: a framework for substantially improving the reliability of pre-trained models for… ▽ More

    Submitted 26 January, 2022; originally announced January 2022.

    Comments: 10 pages, 9 additional pages of Appendix

  45. arXiv:2111.06054  [pdf, other

    cs.CV

    Indian Licence Plate Dataset in the wild

    Authors: Sanchit Tanwar, Ayush Tiwari, Ritesh Chowdhry

    Abstract: Indian Licence Plate Detection is a problem that has not been explored much at an open-source level.There are proprietary solutions available for it, but there is no big open-source dataset that can be used to perform experiments and test different approaches.Most of the large datasets available are for countries like China, Brazil, but the model trained on these datasets does not perform well on… ▽ More

    Submitted 11 November, 2021; originally announced November 2021.

    Comments: 5 pages, 4 figures, 3 tables

  46. arXiv:2109.02445  [pdf, other

    cs.AI cs.PL

    Multi-modal Program Inference: a Marriage of Pre-trainedLanguage Models and Component-based Synthesis

    Authors: Kia Rahmani, Mohammad Raza, Sumit Gulwani, Vu Le, Daniel Morris, Arjun Radhakrishna, Gustavo Soares, Ashish Tiwari

    Abstract: Multi-modal program synthesis refers to the task of synthesizing programs (code) from their specification given in different forms, such as a combination of natural language and examples. Examples provide a precise but incomplete specification, and natural language provides an ambiguous but more "complete" task description. Machine-learned pre-trained models (PTMs) are adept at handling ambiguous… ▽ More

    Submitted 3 September, 2021; originally announced September 2021.

    Comments: To be published in OOPSLA 2021

  47. arXiv:2108.06922  [pdf, ps, other

    cs.LO

    A Conjecture Equivalent to the Collatz Conjecture

    Authors: Ashish Tiwari

    Abstract: We present a formulation of the Collatz conjecture that is potentially more amenable to modeling and analysis by automated termination checking tools.

    Submitted 16 August, 2021; originally announced August 2021.

  48. arXiv:2108.06670  [pdf, other

    cs.LG cs.AI

    Deep Geospatial Interpolation Networks

    Authors: Sumit Kumar Varshney, Jeetu Kumar, Aditya Tiwari, Rishabh Singh, Venkata M. V. Gunturi, Narayanan C. Krishnan

    Abstract: Interpolation in Spatio-temporal data has applications in various domains such as climate, transportation, and mining. Spatio-Temporal interpolation is highly challenging due to the complex spatial and temporal relationships. However, traditional techniques such as Kriging suffer from high running time and poor performance on data that exhibit high variance across space and time dimensions. To thi… ▽ More

    Submitted 15 August, 2021; originally announced August 2021.

  49. arXiv:2108.06394  [pdf, other

    astro-ph.SR astro-ph.IM cs.LG physics.space-ph stat.ML

    A Machine-Learning-Ready Dataset Prepared from the Solar and Heliospheric Observatory Mission

    Authors: Carl Shneider, Andong Hu, Ajay K. Tiwari, Monica G. Bobra, Karl Battams, Jannis Teunissen, Enrico Camporeale

    Abstract: We present a Python tool to generate a standard dataset from solar images that allows for user-defined selection criteria and a range of pre-processing steps. Our Python tool works with all image products from both the Solar and Heliospheric Observatory (SoHO) and Solar Dynamics Observatory (SDO) missions. We discuss a dataset produced from the SoHO mission's multi-spectral images which is free of… ▽ More

    Submitted 4 August, 2021; originally announced August 2021.

    Comments: under review

  50. arXiv:2108.02490  [pdf, other

    cs.SE cs.PL

    HIPPODROME: Data Race Repair using Static Analysis Summaries

    Authors: Andreea Costea, Abhishek Tiwari, Sigmund Chianasta, Kishore R, Abhik Roychoudhury, Ilya Sergey

    Abstract: Implementing bug-free concurrent programs is a challenging task in modern software development. State-of-the-art static analyses find hundreds of concurrency bugs in production code, scaling to large codebases. Yet, fixing these bugs in constantly changing codebases represents a daunting effort for programmers, particularly because a fix in the concurrent code can introduce other bugs in a subtle… ▽ More

    Submitted 6 August, 2021; v1 submitted 5 August, 2021; originally announced August 2021.