Skip to main content

Showing 1–50 of 70 results for author: Patel, K

  1. arXiv:2407.06237  [pdf, ps, other

    cs.AI cs.LG math.OC

    Discounted Pseudocosts in MILP

    Authors: Krunal Kishor Patel

    Abstract: In this article, we introduce the concept of discounted pseudocosts, inspired by discounted total reward in reinforcement learning, and explore their application in mixed-integer linear programming (MILP). Traditional pseudocosts estimate changes in the objective function due to variable bound changes during the branch-and-bound process. By integrating reinforcement learning concepts, we propose a… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    MSC Class: 90C11 (Primary); 90C10; 90-08 (Secondary)

  2. arXiv:2407.00101  [pdf, other

    cs.LG cs.AI cs.CC cs.DC cs.NE

    Hybrid Approach to Parallel Stochastic Gradient Descent

    Authors: Aakash Sudhirbhai Vora, Dhrumil Chetankumar Joshi, Aksh Kantibhai Patel

    Abstract: Stochastic Gradient Descent is used for large datasets to train models to reduce the training time. On top of that data parallelism is widely used as a method to efficiently train neural networks using multiple worker nodes in parallel. Synchronous and asynchronous approach to data parallelism is used by most systems to train the model in parallel. However, both of them have their drawbacks. We pr… ▽ More

    Submitted 27 June, 2024; originally announced July 2024.

  3. arXiv:2406.07693  [pdf

    cs.CY cs.AI cs.CL cs.LG cs.SI

    A Labelled Dataset for Sentiment Analysis of Videos on YouTube, TikTok, and Other Sources about the 2024 Outbreak of Measles

    Authors: Nirmalya Thakur, Vanessa Su, Mingchen Shao, Kesha A. Patel, Hongseok Jeong, Victoria Knieling, Andrew Bian

    Abstract: The work of this paper presents a dataset that contains the data of 4011 videos about the ongoing outbreak of measles published on 264 websites on the internet between January 1, 2024, and May 31, 2024. The dataset is available at https://dx.doi.org/10.21227/40s8-xf63. These websites primarily include YouTube and TikTok, which account for 48.6% and 15.2% of the videos, respectively. The remainder… ▽ More

    Submitted 16 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: 19 pages

    ACM Class: I.2.7; I.2.8; I.5.4; K.4.2; H.2.8; I.2.6

  4. arXiv:2405.11667  [pdf, other

    cs.LG cs.DC math.OC stat.ML

    The Limits and Potentials of Local SGD for Distributed Heterogeneous Learning with Intermittent Communication

    Authors: Kumar Kshitij Patel, Margalit Glasgow, Ali Zindari, Lingxiao Wang, Sebastian U. Stich, Ziheng Cheng, Nirmit Joshi, Nathan Srebro

    Abstract: Local SGD is a popular optimization method in distributed learning, often outperforming other algorithms in practice, including mini-batch SGD. Despite this success, theoretically proving the dominance of local SGD in settings with reasonable data heterogeneity has been difficult, creating a significant gap between theory and practice. In this paper, we provide new lower bounds for local SGD under… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

  5. arXiv:2405.02425  [pdf, other

    cs.RO cs.AI

    Learning Robot Soccer from Egocentric Vision with Deep Reinforcement Learning

    Authors: Dhruva Tirumala, Markus Wulfmeier, Ben Moran, Sandy Huang, Jan Humplik, Guy Lever, Tuomas Haarnoja, Leonard Hasenclever, Arunkumar Byravan, Nathan Batchelor, Neil Sreendra, Kushal Patel, Marlon Gwira, Francesco Nori, Martin Riedmiller, Nicolas Heess

    Abstract: We apply multi-agent deep reinforcement learning (RL) to train end-to-end robot soccer policies with fully onboard computation and sensing via egocentric RGB vision. This setting reflects many challenges of real-world robotics, including active perception, agile full-body control, and long-horizon planning in a dynamic, partially-observable, multi-agent domain. We rely on large-scale, simulation-b… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  6. arXiv:2403.19770  [pdf, other

    cs.RO cs.AI cs.LG

    Hierarchical Deep Learning for Intention Estimation of Teleoperation Manipulation in Assembly Tasks

    Authors: Mingyu Cai, Karankumar Patel, Soshi Iba, Songpo Li

    Abstract: In human-robot collaboration, shared control presents an opportunity to teleoperate robotic manipulation to improve the efficiency of manufacturing and assembly processes. Robots are expected to assist in executing the user's intentions. To this end, robust and prompt intention estimation is needed, relying on behavioral observations. The framework presents an intention estimation technique at hie… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: ICRA 2024

  7. arXiv:2403.18180  [pdf, other

    cs.CV

    Multi-Layer Dense Attention Decoder for Polyp Segmentation

    Authors: Krushi Patel, Fengjun Li, Guanghui Wang

    Abstract: Detecting and segmenting polyps is crucial for expediting the diagnosis of colon cancer. This is a challenging task due to the large variations of polyps in color, texture, and lighting conditions, along with subtle differences between the polyp and its surrounding area. Recently, vision Transformers have shown robust abilities in modeling global context for polyp segmentation. However, they face… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  8. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  9. arXiv:2402.10797  [pdf, other

    cs.MS cs.LG stat.CO stat.ML

    BlackJAX: Composable Bayesian inference in JAX

    Authors: Alberto Cabezas, Adrien Corenflos, Junpeng Lao, Rémi Louf, Antoine Carnec, Kaustubh Chaudhari, Reuben Cohn-Gordon, Jeremie Coullon, Wei Deng, Sam Duffield, Gerardo Durán-Martín, Marcin Elantkowski, Dan Foreman-Mackey, Michele Gregori, Carlos Iguaran, Ravin Kumar, Martin Lysy, Kevin Murphy, Juan Camilo Orduz, Karm Patel, Xi Wang, Rob Zinkov

    Abstract: BlackJAX is a library implementing sampling and variational inference algorithms commonly used in Bayesian computation. It is designed for ease of use, speed, and modularity by taking a functional approach to the algorithms' implementation. BlackJAX is written in Python, using JAX to compile and run NumpPy-like samplers and variational methods on CPUs, GPUs, and TPUs. The library integrates well w… ▽ More

    Submitted 22 February, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: Companion paper for the library https://github.com/blackjax-devs/blackjax Update: minor changes and updated the list of authors to include technical contributors

  10. arXiv:2312.17643  [pdf, other

    cs.RO

    b-it-bots RoboCup@Work Team Description Paper 2023

    Authors: Kevin Patel, Vamsi Kalagaturu, Vivek Mannava, Ravisankar Selvaraju, Shubham Shinde, Dharmin Bakaraniya, Deebul Nair, Mohammad Wasil, Santosh Thoduka, Iman Awaad, Sven Schneider, Nico Hochgeschwender, Paul G. Plöger

    Abstract: This paper presents the b-it-bots RoboCup@Work team and its current hardware and functional architecture for the KUKA youBot robot. We describe the underlying software framework and the developed capabilities required for operating in industrial environments including features such as reliable and precise navigation, flexible manipulation, robust object recognition and task planning. New developme… ▽ More

    Submitted 29 December, 2023; originally announced December 2023.

  11. arXiv:2312.11885  [pdf

    cs.SI cs.CY physics.soc-ph

    A Large-Scale Dataset of Search Interests Related to Disease X Originating from Different Geographic Regions

    Authors: Nirmalya Thakur, Shuqi Cui, Kesha A. Patel, Isabella Hall, Yuvraj Nihal Duggal

    Abstract: The World Health Organization added Disease X to their shortlist of blueprint priority diseases to represent a hypothetical, unknown pathogen that could cause a future epidemic. During different virus outbreaks of the past, such as COVID-19, Influenza, Lyme Disease, and Zika virus, researchers from various disciplines utilized Google Trends to mine multimodal components of web behavior to study, i… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

  12. arXiv:2311.17586  [pdf, other

    cs.LG math.OC stat.ML

    Federated Online and Bandit Convex Optimization

    Authors: Kumar Kshitij Patel, Lingxiao Wang, Aadirupa Saha, Nati Sebro

    Abstract: We study the problems of distributed online and bandit convex optimization against an adaptive adversary. We aim to minimize the average regret on $M$ machines working in parallel over $T$ rounds with $R$ intermittent communications. Assuming the underlying cost functions are convex and can be generated adaptively, our results show that collaboration is not beneficial when the machines have access… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

  13. arXiv:2311.16766  [pdf, other

    cs.CV cs.LG

    Rescuing referral failures during automated diagnosis of domain-shifted medical images

    Authors: Anuj Srivastava, Karm Patel, Pradeep Shenoy, Devarajan Sridharan

    Abstract: The success of deep learning models deployed in the real world depends critically on their ability to generalize well across diverse data domains. Here, we address a fundamental challenge with selective classification during automated diagnosis with domain-shifted medical images. In this scenario, models must learn to avoid making predictions when label confidence is low, especially when tested wi… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

  14. arXiv:2311.16459  [pdf, other

    cs.LG cs.DC cs.GT

    On the Effect of Defections in Federated Learning and How to Prevent Them

    Authors: Minbiao Han, Kumar Kshitij Patel, Han Shao, Lingxiao Wang

    Abstract: Federated learning is a machine learning protocol that enables a large population of agents to collaborate over multiple rounds to produce a single consensus model. There are several federated learning applications where agents may choose to defect permanently$-$essentially withdrawing from the collaboration$-$if they are content with their instantaneous model in that round. This work demonstrates… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

  15. arXiv:2310.10026  [pdf, other

    eess.AS cs.SD

    Real-time Speech Enhancement and Separation with a Unified Deep Neural Network for Single/Dual Talker Scenarios

    Authors: Kashyap Patel, Anton Kovalyov, Issa Panahi

    Abstract: This paper introduces a practical approach for leveraging a real-time deep learning model to alternate between speech enhancement and joint speech enhancement and separation depending on whether the input mixture contains one or two active speakers. Scale-invariant signal-to-distortion ratio (SI-SDR) has shown to be a highly effective training measure in time-domain speech separation. However, the… ▽ More

    Submitted 15 October, 2023; originally announced October 2023.

    Comments: 6 Pages, Accepted at IEEE Asilomar

  16. arXiv:2310.04546  [pdf, other

    cs.CR

    Privacy-Preserving Financial Anomaly Detection via Federated Learning & Multi-Party Computation

    Authors: Sunpreet Arora, Andrew Beams, Panagiotis Chatzigiannis, Sebastian Meiser, Karan Patel, Srinivasan Raghuraman, Peter Rindal, Harshal Shah, Yizhen Wang, Yuhang Wu, Hao Yang, Mahdi Zamani

    Abstract: One of the main goals of financial institutions (FIs) today is combating fraud and financial crime. To this end, FIs use sophisticated machine-learning models trained using data collected from their customers. The output of machine learning models may be manually reviewed for critical use cases, e.g., determining the likelihood of a transaction being anomalous and the subsequent course of action.… ▽ More

    Submitted 6 October, 2023; originally announced October 2023.

    Comments: 12 pages

  17. arXiv:2308.11477  [pdf, ps, other

    cs.LG cs.AI math.OC

    An improved column-generation-based matheuristic for learning classification trees

    Authors: Krunal Kishor Patel, Guy Desaulniers, Andrea Lodi

    Abstract: Decision trees are highly interpretable models for solving classification problems in machine learning (ML). The standard ML algorithms for training decision trees are fast but generate suboptimal trees in terms of accuracy. Other discrete optimization models in the literature address the optimality problem but only work well on relatively small datasets. \cite{firat2020column} proposed a column-g… ▽ More

    Submitted 22 January, 2024; v1 submitted 22 August, 2023; originally announced August 2023.

    Comments: Submitted to Computers and Operations Research journal

  18. arXiv:2306.09109  [pdf, other

    cs.CV

    NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations

    Authors: Varun Jampani, Kevis-Kokitsi Maninis, Andreas Engelhardt, Arjun Karpur, Karen Truong, Kyle Sargent, Stefan Popov, André Araujo, Ricardo Martin-Brualla, Kaushal Patel, Daniel Vlasic, Vittorio Ferrari, Ameesh Makadia, Ce Liu, Yuanzhen Li, Howard Zhou

    Abstract: Recent advances in neural reconstruction enable high-quality 3D object reconstruction from casually captured image collections. Current techniques mostly analyze their progress on relatively simple image collections where Structure-from-Motion (SfM) techniques can provide ground-truth (GT) camera poses. We note that SfM techniques tend to fail on in-the-wild image collections such as image search… ▽ More

    Submitted 13 October, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023 camera ready. Project page: https://navidataset.github.io

  19. arXiv:2305.05630  [pdf, other

    eess.AS cs.SD

    Accurate Real-Time Estimation of 2-Dimensional Direction of Arrival using a 3-Microphone Array

    Authors: Anton Kovalyov, Kashyap Patel, Issa Panahi

    Abstract: This paper presents a method for real-time estimation of 2-dimensional direction of arrival (2D-DOA) of one or more sound sources using a nonlinear array of three microphones. 2D-DOA is estimated employing frame-level time difference of arrival (TDOA) measurements. Unlike conventional methods, which infer location parameters from TDOAs using a theoretical model, we propose a more practical approac… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: 5 pages, 6 figures

  20. Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning

    Authors: Tuomas Haarnoja, Ben Moran, Guy Lever, Sandy H. Huang, Dhruva Tirumala, Jan Humplik, Markus Wulfmeier, Saran Tunyasuvunakool, Noah Y. Siegel, Roland Hafner, Michael Bloesch, Kristian Hartikainen, Arunkumar Byravan, Leonard Hasenclever, Yuval Tassa, Fereshteh Sadeghi, Nathan Batchelor, Federico Casarini, Stefano Saliceti, Charles Game, Neil Sreendra, Kushal Patel, Marlon Gwira, Andrea Huber, Nicole Hurley , et al. (3 additional authors not shown)

    Abstract: We investigate whether Deep Reinforcement Learning (Deep RL) is able to synthesize sophisticated and safe movement skills for a low-cost, miniature humanoid robot that can be composed into complex behavioral strategies in dynamic environments. We used Deep RL to train a humanoid robot with 20 actuated joints to play a simplified one-versus-one (1v1) soccer game. The resulting agent exhibits robust… ▽ More

    Submitted 11 April, 2024; v1 submitted 26 April, 2023; originally announced April 2023.

    Comments: Project website: https://sites.google.com/view/op3-soccer

  21. arXiv:2302.13407  [pdf, other

    eess.AS cs.SD

    DFSNet: A Steerable Neural Beamformer Invariant to Microphone Array Configuration for Real-Time, Low-Latency Speech Enhancement

    Authors: Anton Kovalyov, Kashyap Patel, Issa Panahi

    Abstract: Invariance to microphone array configuration is a rare attribute in neural beamformers. Filter-and-sum (FS) methods in this class define the target signal with respect to a reference channel. However, this not only complicates formulation in reverberant conditions but also the network, which must have a mechanism to infer what the reference channel is. To address these issues, this study presents… ▽ More

    Submitted 26 February, 2023; originally announced February 2023.

    Comments: 5 pages, 1 figure, 2 tables

  22. arXiv:2302.03222  [pdf, other

    cs.CL

    Bringing the State-of-the-Art to Customers: A Neural Agent Assistant Framework for Customer Service Support

    Authors: Stephen Obadinma, Faiza Khan Khattak, Shirley Wang, Tania Sidhom, Elaine Lau, Sean Robertson, Jingcheng Niu, Winnie Au, Alif Munim, Karthik Raja K. Bhaskar, Bencheng Wei, Iris Ren, Waqar Muhammad, Erin Li, Bukola Ishola, Michael Wang, Griffin Tanner, Yu-Jia Shiah, Sean X. Zhang, Kwesi P. Apponsah, Kanishk Patel, Jaswinder Narain, Deval Pandya, Xiaodan Zhu, Frank Rudzicz , et al. (1 additional authors not shown)

    Abstract: Building Agent Assistants that can help improve customer service support requires inputs from industry users and their customers, as well as knowledge about state-of-the-art Natural Language Processing (NLP) technology. We combine expertise from academia and industry to bridge the gap and build task/domain-specific Neural Agent Assistants (NAA) with three high-level components for: (1) Intent Iden… ▽ More

    Submitted 6 February, 2023; originally announced February 2023.

    Comments: Camera Ready Version of Paper Published in EMNLP 2022 Industry Track

  23. arXiv:2212.02346  [pdf, other

    cs.LG

    Accu-Help: A Machine Learning based Smart Healthcare Framework for Accurate Detection of Obsessive Compulsive Disorder

    Authors: Kabita Patel, Ajaya Kumar Tripathy, Laxmi Narayan Padhy, Sujita Kumar Kar, Susanta Kumar Padhy, Saraju Prasad Mohanty

    Abstract: In recent years the importance of Smart Healthcare cannot be overstated. The current work proposed to expand the state-of-art of smart healthcare in integrating solutions for Obsessive Compulsive Disorder (OCD). Identification of OCD from oxidative stress biomarkers (OSBs) using machine learning is an important development in the study of OCD. However, this process involves the collection of OCD c… ▽ More

    Submitted 5 December, 2022; originally announced December 2022.

  24. arXiv:2212.00625  [pdf, other

    cs.ET

    Probabilistic Neural Circuits leveraging AI-Enhanced Codesign for Random Number Generation

    Authors: Suma G. Cardwell, Catherine D. Schuman, J. Darby Smith, Karan Patel, Jaesuk Kwon, Samuel Liu, Christopher Allemang, Shashank Misra, Jean Anne Incorvia, James B. Aimone

    Abstract: Stochasticity is ubiquitous in the world around us. However, our predominant computing paradigm is deterministic. Random number generation (RNG) can be a computationally inefficient operation in this system especially for larger workloads. Our work leverages the underlying physics of emerging devices to develop probabilistic neural circuits for RNGs from a given distribution. However, codesign for… ▽ More

    Submitted 1 December, 2022; originally announced December 2022.

    Report number: SAND2022-16607 C

  25. arXiv:2210.15822  [pdf, other

    eess.AS cs.SD

    UX-NET: Filter-and-Process-based Improved U-Net for Real-time Time-domain Audio Separation

    Authors: Kashyap Patel, Anton Kovalyov, Issa Panahi

    Abstract: This study presents UX-Net, a time-domain audio separation network (TasNet) based on a modified U-Net architecture. The proposed UX-Net works in real-time and handles either single or multi-microphone input. Inspired by the filter-and-process-based human auditory behavior, the proposed system introduces novel mixer and separation modules, which result in cost and memory efficient modeling of speec… ▽ More

    Submitted 27 October, 2022; originally announced October 2022.

    Comments: Submitted to ICASSP 2023

  26. arXiv:2210.12061  [pdf, other

    cs.LG stat.ML

    Validation of Composite Systems by Discrepancy Propagation

    Authors: David Reeb, Kanil Patel, Karim Barsim, Martin Schiegg, Sebastian Gerwinn

    Abstract: Assessing the validity of a real-world system with respect to given quality criteria is a common yet costly task in industrial applications due to the vast number of required real-world tests. Validating such systems by means of simulation offers a promising and less expensive alternative, but requires an assessment of the simulation accuracy and therefore end-to-end measurements. Additionally, co… ▽ More

    Submitted 3 January, 2024; v1 submitted 21 October, 2022; originally announced October 2022.

    Comments: 21 pages incl. 11 pages appendix; camera-ready version at UAI 2023

    Journal ref: Proceedings of the Thirty-Ninth Conference on Uncertainty in Artificial Intelligence (UAI 2023), PMLR 216:1730-1740, 2023

  27. arXiv:2208.12466  [pdf, other

    cs.LG

    An approach to implement Reinforcement Learning for Heterogeneous Vehicular Networks

    Authors: Bhavya Peshavaria, Sagar Kavaiya, Dhaval K. Patel

    Abstract: This paper presents the extension of the idea of spectrum sharing in the vehicular networks towards the Heterogeneous Vehicular Network(HetVNET) based on multi-agent reinforcement learning. Here, the multiple vehicle-to-vehicle(V2V) links reuse the spectrum of other vehicle-to-interface(V2I) and also those of other networks. The fast-changing environment in vehicular networks limits the idea of ce… ▽ More

    Submitted 26 August, 2022; originally announced August 2022.

  28. arXiv:2208.06640  [pdf, other

    cs.SE

    The Sense of Logging in the Linux Kernel

    Authors: Keyur Patel, Joao Faccin, Abdelwahab Hamou-Lhadj, Ingrid Nunes

    Abstract: Logging plays a crucial role in software engineering because it is key to perform various tasks including debugging, performance analysis, and detection of anomalies. Despite the importance of log data, the practice of logging still suffers from the lack of common guidelines and best practices. Recent studies investigated logging in C/C++ and Java open-source systems. In this paper, we complement… ▽ More

    Submitted 13 August, 2022; originally announced August 2022.

    Comments: Accepted for publication in the Empirical Software Engineering journal

  29. arXiv:2208.04955  [pdf, other

    cs.LG cs.AI math.OC

    Explainable prediction of Qcodes for NOTAMs using column generation

    Authors: Krunal Kishor Patel, Guy Desaulniers, Andrea Lodi, Freddy Lecue

    Abstract: A NOtice To AirMen (NOTAM) contains important flight route related information. To search and filter them, NOTAMs are grouped into categories called QCodes. In this paper, we develop a tool to predict, with some explanations, a Qcode for a NOTAM. We present a way to extend the interpretable binary classification using column generation proposed in Dash, Gunluk, and Wei (2018) to a multiclass text… ▽ More

    Submitted 20 January, 2023; v1 submitted 9 August, 2022; originally announced August 2022.

  30. arXiv:2202.09664  [pdf, other

    cs.LG stat.ML

    Accurate Prediction and Uncertainty Estimation using Decoupled Prediction Interval Networks

    Authors: Kinjal Patel, Steven Waslander

    Abstract: We propose a network architecture capable of reliably estimating uncertainty of regression based predictions without sacrificing accuracy. The current state-of-the-art uncertainty algorithms either fall short of achieving prediction accuracy comparable to the mean square error optimization or underestimate the variance of network predictions. We propose a decoupled network architecture that is cap… ▽ More

    Submitted 19 February, 2022; originally announced February 2022.

  31. arXiv:2202.02950  [pdf, other

    cs.HC cs.AI cs.LG

    Jury Learning: Integrating Dissenting Voices into Machine Learning Models

    Authors: Mitchell L. Gordon, Michelle S. Lam, Joon Sung Park, Kayur Patel, Jeffrey T. Hancock, Tatsunori Hashimoto, Michael S. Bernstein

    Abstract: Whose labels should a machine learning (ML) algorithm learn to emulate? For ML tasks ranging from online comment toxicity to misinformation detection to medical diagnosis, different groups in society may have irreconcilable disagreements about ground truth labels. Supervised ML today resolves these label disagreements implicitly using majority vote, which overrides minority groups' labels. We intr… ▽ More

    Submitted 7 February, 2022; originally announced February 2022.

    Comments: To appear at CHI 2022

  32. arXiv:2201.12903  [pdf, other

    cs.CV

    Aggregating Global Features into Local Vision Transformer

    Authors: Krushi Patel, Andres M. Bur, Fengjun Li, Guanghui Wang

    Abstract: Local Transformer-based classification models have recently achieved promising results with relatively low computational costs. However, the effect of aggregating spatial global information of local Transformer-based architecture is not clear. This work investigates the outcome of applying a global attention-based module named multi-resolution overlapped attention (MOA) in the local window-based t… ▽ More

    Submitted 30 January, 2022; originally announced January 2022.

  33. Task Scheduling in Cloud Computing Using Hybrid Meta-heuristic: A Review

    Authors: Sandeep Kumar Patel, Avtar Singh

    Abstract: In recent years with the advent of high bandwidth internet access availability, the cloud computing applications have boomed. With more and more applications being run over the cloud and an increase in the overall user base of the different cloud platforms, the need for highly efficient job scheduling techniques has also increased. The task of a conventional job scheduling algorithm is to determin… ▽ More

    Submitted 23 January, 2022; originally announced January 2022.

  34. arXiv:2201.02977  [pdf, other

    cs.CL

    Indian Language Wordnets and their Linkages with Princeton WordNet

    Authors: Diptesh Kanojia, Kevin Patel, Pushpak Bhattacharyya

    Abstract: Wordnets are rich lexico-semantic resources. Linked wordnets are extensions of wordnets, which link similar concepts in wordnets of different languages. Such resources are extremely useful in many Natural Language Processing (NLP) applications, primarily those based on knowledge-based approaches. In such approaches, these resources are considered as gold standard/oracle. Thus, it is crucial that t… ▽ More

    Submitted 9 January, 2022; originally announced January 2022.

    Comments: Published at LREC 2018

  35. arXiv:2201.01747  [pdf, other

    cs.CL

    Semi-automatic WordNet Linking using Word Embeddings

    Authors: Kevin Patel, Diptesh Kanojia, Pushpak Bhattacharyya

    Abstract: Wordnets are rich lexico-semantic resources. Linked wordnets are extensions of wordnets, which link similar concepts in wordnets of different languages. Such resources are extremely useful in many Natural Language Processing (NLP) applications, primarily those based on knowledge-based approaches. In such approaches, these resources are considered as gold standard/oracle. Thus, it is crucial that t… ▽ More

    Submitted 5 January, 2022; originally announced January 2022.

    Comments: Published at GWC 2018

  36. arXiv:2112.15124  [pdf, other

    cs.CL

    Utilizing Wordnets for Cognate Detection among Indian Languages

    Authors: Diptesh Kanojia, Kevin Patel, Pushpak Bhattacharyya, Malhar Kulkarni, Gholamreza Haffari

    Abstract: Automatic Cognate Detection (ACD) is a challenging task which has been utilized to help NLP applications like Machine Translation, Information Retrieval and Computational Phylogenetics. Unidentified cognate pairs can pose a challenge to these applications and result in a degradation of performance. In this paper, we detect cognate word pairs among ten Indian languages with Hindi and use deep learn… ▽ More

    Submitted 30 December, 2021; originally announced December 2021.

    Comments: Published at GWC 2019

  37. arXiv:2112.07219  [pdf, other

    cs.CV cs.AI

    A real-time spatiotemporal AI model analyzes skill in open surgical videos

    Authors: Emmett D. Goodman, Krishna K. Patel, Yilun Zhang, William Locke, Chris J. Kennedy, Rohan Mehrotra, Stephen Ren, Melody Y. Guan, Maren Downing, Hao Wei Chen, Jevin Z. Clark, Gabriel A. Brat, Serena Yeung

    Abstract: Open procedures represent the dominant form of surgery worldwide. Artificial intelligence (AI) has the potential to optimize surgical practice and improve patient outcomes, but efforts have focused primarily on minimally invasive techniques. Our work overcomes existing data limitations for training AI models by curating, from YouTube, the largest dataset of open surgical videos to date: 1997 video… ▽ More

    Submitted 14 December, 2021; originally announced December 2021.

    Comments: 22 pages, 4 main text figures, 7 extended data figures, 4 extended data tables

  38. arXiv:2112.05861  [pdf, other

    cs.CV

    A Discriminative Channel Diversification Network for Image Classification

    Authors: Krushi Patel, Guanghui Wang

    Abstract: Channel attention mechanisms in convolutional neural networks have been proven to be effective in various computer vision tasks. However, the performance improvement comes with additional model complexity and computation cost. In this paper, we propose a light-weight and effective attention module, called channel diversification block, to enhance the global context by establishing the channel rela… ▽ More

    Submitted 10 December, 2021; originally announced December 2021.

  39. arXiv:2110.12536  [pdf, other

    cs.HC cs.AI cs.LG

    Neo: Generalizing Confusion Matrix Visualization to Hierarchical and Multi-Output Labels

    Authors: Jochen Görtler, Fred Hohman, Dominik Moritz, Kanit Wongsuphasawat, Donghao Ren, Rahul Nair, Marc Kirchner, Kayur Patel

    Abstract: The confusion matrix, a ubiquitous visualization for helping people evaluate machine learning models, is a tabular layout that compares predicted class labels against actual class labels over all data instances. We conduct formative research with machine learning practitioners at Apple and find that conventional confusion matrices do not support more complex data-structures found in modern-day app… ▽ More

    Submitted 17 February, 2022; v1 submitted 24 October, 2021; originally announced October 2021.

    Comments: Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems

    ACM Class: H.2.m; I.7.m

  40. Deep Tactile Experience: Estimating Tactile Sensor Output from Depth Sensor Data

    Authors: Karankumar Patel, Soshi Iba, Nawid Jamali

    Abstract: Tactile sensing is inherently contact based. To use tactile data, robots need to make contact with the surface of an object. This is inefficient in applications where an agent needs to make a decision between multiple alternatives that depend the physical properties of the contact location. We propose a method to get tactile data in a non-invasive manner. The proposed method estimates the output o… ▽ More

    Submitted 17 October, 2021; originally announced October 2021.

    Comments: Accepted for publication in the 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2020)

  41. arXiv:2110.02954  [pdf, other

    math.OC cs.LG stat.ML

    A Stochastic Newton Algorithm for Distributed Convex Optimization

    Authors: Brian Bullins, Kumar Kshitij Patel, Ohad Shamir, Nathan Srebro, Blake Woodworth

    Abstract: We propose and analyze a stochastic Newton algorithm for homogeneous distributed stochastic convex optimization, where each machine can calculate stochastic gradients of the same population objective, as well as stochastic Hessian-vector products (products of an independent unbiased estimator of the Hessian of the population objective with arbitrary vectors), with many such stochastic computations… ▽ More

    Submitted 7 October, 2021; originally announced October 2021.

  42. arXiv:2109.12851  [pdf, other

    cs.LG eess.SP

    Improving Uncertainty of Deep Learning-based Object Classification on Radar Spectra using Label Smoothing

    Authors: Kanil Patel, William Beluch, Kilian Rambach, Michael Pfeiffer, Bin Yang

    Abstract: Object type classification for automotive radar has greatly improved with recent deep learning (DL) solutions, however these developments have mostly focused on the classification accuracy. Before employing DL solutions in safety-critical applications, such as automated driving, an indispensable prerequisite is the accurate quantification of the classifiers' reliability. Unfortunately, DL classifi… ▽ More

    Submitted 27 September, 2021; originally announced September 2021.

    Comments: Submitted to IEEE Radar Conference 2022

  43. arXiv:2107.02839  [pdf, other

    cs.RO

    Toward Robotically Automated Femoral Vascular Access

    Authors: Nico Zevallos, Evan Harber, Abhimanyu, Kirtan Patel, Yizhu Gu, Kenny Sladick, Francis Guyette, Leonard Weiss, Michael R. Pinsky, Hernando Gomez, John Galeotti, Howie Choset

    Abstract: Advanced resuscitative technologies, such as Extra Corporeal Membrane Oxygenation (ECMO) cannulation or Resuscitative Endovascular Balloon Occlusion of the Aorta (REBOA), are technically difficult even for skilled medical personnel. This paper describes the core technologies that comprise a teleoperated system capable of granting femoral vascular access, which is an important step in both of these… ▽ More

    Submitted 6 July, 2021; originally announced July 2021.

    Comments: 6 pages, 5 figures, 1 table, submitted (but not accepted yet) to ISMR

  44. arXiv:2106.08921  [pdf, other

    cs.NE cs.CV cs.LG

    A Spiking Neural Network for Image Segmentation

    Authors: Kinjal Patel, Eric Hunsberger, Sean Batir, Chris Eliasmith

    Abstract: We seek to investigate the scalability of neuromorphic computing for computer vision, with the objective of replicating non-neuromorphic performance on computer vision tasks while reducing power consumption. We convert the deep Artificial Neural Network (ANN) architecture U-Net to a Spiking Neural Network (SNN) architecture using the Nengo framework. Both rate-based and spike-based models are trai… ▽ More

    Submitted 16 June, 2021; originally announced June 2021.

  45. arXiv:2106.05870  [pdf, other

    cs.LG cs.AI

    Investigation of Uncertainty of Deep Learning-based Object Classification on Radar Spectra

    Authors: Kanil Patel, William Beluch, Kilian Rambach, Adriana-Eliza Cozma, Michael Pfeiffer, Bin Yang

    Abstract: Deep learning (DL) has recently attracted increasing interest to improve object type classification for automotive radar.In addition to high accuracy, it is crucial for decision making in autonomous vehicles to evaluate the reliability of the predictions; however, decisions of DL networks are non-transparent. Current DL research has investigated how uncertainties of predictions can be quantified,… ▽ More

    Submitted 1 June, 2021; originally announced June 2021.

    Comments: 6 pages

    Journal ref: IEEE Radar Conference 2021

  46. arXiv:2105.00999  [pdf, other

    eess.IV cs.CV

    Enhanced U-Net: A Feature Enhancement Network for Polyp Segmentation

    Authors: Krushi Patel, Andres M. Bur, Guanghui Wang

    Abstract: Colonoscopy is a procedure to detect colorectal polyps which are the primary cause for developing colorectal cancer. However, polyp segmentation is a challenging task due to the diverse shape, size, color, and texture of polyps, shuttle difference between polyp and its background, as well as low contrast of the colonoscopic images. To address these challenges, we propose a feature enhancement netw… ▽ More

    Submitted 3 May, 2021; originally announced May 2021.

  47. arXiv:2104.12835  [pdf, other

    cs.CV cs.AI cs.LG

    Less is more: Selecting informative and diverse subsets with balancing constraints

    Authors: Srikumar Ramalingam, Daniel Glasner, Kaushal Patel, Raviteja Vemulapalli, Sadeep Jayasumana, Sanjiv Kumar

    Abstract: Deep learning has yielded extraordinary results in vision and natural language processing, but this achievement comes at a cost. Most models require enormous resources during training, both in terms of computation and in human labeling effort. We show that we can identify informative and diverse subsets of data that lead to deep learning models with similar performance as the ones trained with the… ▽ More

    Submitted 8 October, 2021; v1 submitted 26 April, 2021; originally announced April 2021.

    Comments: Added error bars to the experiments

  48. Colonoscopy Polyp Detection and Classification: Dataset Creation and Comparative Evaluations

    Authors: Kaidong Li, Mohammad I. Fathan, Krushi Patel, Tianxiao Zhang, Cuncong Zhong, Ajay Bansal, Amit Rastogi, Jean S. Wang, Guanghui Wang

    Abstract: Colorectal cancer (CRC) is one of the most common types of cancer with a high mortality rate. Colonoscopy is the preferred procedure for CRC screening and has proven to be effective in reducing CRC mortality. Thus, a reliable computer-aided polyp detection and classification system can significantly increase the effectiveness of colonoscopy. In this paper, we create an endoscopic dataset collected… ▽ More

    Submitted 5 August, 2021; v1 submitted 21 April, 2021; originally announced April 2021.

  49. mage: Fluid Moves Between Code and Graphical Work in Computational Notebooks

    Authors: Mary Beth Kery, Donghao Ren, Fred Hohman, Dominik Moritz, Kanit Wongsuphasawat, Kayur Patel

    Abstract: We aim to increase the flexibility at which a data worker can choose the right tool for the job, regardless of whether the tool is a code library or an interactive graphical user interface (GUI). To achieve this flexibility, we extend computational notebooks with a new API mage, which supports tools that can represent themselves as both code and GUI as needed. We discuss the design of mage as well… ▽ More

    Submitted 22 September, 2020; originally announced September 2020.

  50. arXiv:2009.08492  [pdf, other

    cs.NI

    Designing knowledge plane to optimize leaf and spine data center

    Authors: Mujahid Sultan, Dodi Imbuido, Kam Patel, James MacDonald, Kumar Ratnam

    Abstract: In the last few decades, data center architecture evolved from the traditional client-server to access-aggregation-core architectures. Recently there is a new shift in the data center architecture due to the increasing need for low latency and high throughput between server-to-server communications, load balancing and, loop-free environment. This new architecture, known as leaf and spine architect… ▽ More

    Submitted 17 September, 2020; originally announced September 2020.

    Comments: 3 pages