Skip to main content

Showing 1–17 of 17 results for author: Radhakrishnan, S

  1. arXiv:2404.01990  [pdf, other

    cs.CV

    What is Point Supervision Worth in Video Instance Segmentation?

    Authors: Shuaiyi Huang, De-An Huang, Zhiding Yu, Shiyi Lan, Subhashree Radhakrishnan, Jose M. Alvarez, Abhinav Shrivastava, Anima Anandkumar

    Abstract: Video instance segmentation (VIS) is a challenging vision task that aims to detect, segment, and track objects in videos. Conventional VIS methods rely on densely-annotated object masks which are expensive. We reduce the human annotations to only one point for each object in a video frame during training, and obtain high-quality mask predictions close to fully supervised models. Our proposed train… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  2. arXiv:2403.19046  [pdf, other

    cs.CV cs.AI

    LITA: Language Instructed Temporal-Localization Assistant

    Authors: De-An Huang, Shijia Liao, Subhashree Radhakrishnan, Hongxu Yin, Pavlo Molchanov, Zhiding Yu, Jan Kautz

    Abstract: There has been tremendous progress in multimodal Large Language Models (LLMs). Recent works have extended these models to video input with promising instruction following capabilities. However, an important missing piece is temporal localization. These models cannot accurately answer the "When?" questions. We identify three key aspects that limit their temporal localization capabilities: (i) time… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  3. arXiv:2403.16157  [pdf, other

    cs.DL

    pyKCN: A Python Tool for Bridging Scientific Knowledge

    Authors: Zhenyuan Lu, Wei Li, Burcu Ozek, Haozhou Zhou, Srinivasan Radhakrishnan, Sagar Kamarthi

    Abstract: The study of research trends is pivotal for understanding scientific development on specific topics. Traditionally, this involves keyword analysis within scholarly literature, yet comprehensive tools for such analysis are scarce, especially those capable of parsing large datasets with precision. pyKCN, a Python toolkit, addresses this gap by automating keyword cleaning, extraction and trend analys… ▽ More

    Submitted 26 March, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

  4. arXiv:2311.08569  [pdf

    cs.LG

    Uncertainty Quantification in Neural-Network Based Pain Intensity Estimation

    Authors: Burcu Ozek, Zhenyuan Lu, Srinivasan Radhakrishnan, Sagar Kamarthi

    Abstract: Improper pain management can lead to severe physical or mental consequences, including suffering, and an increased risk of opioid dependency. Assessing the presence and severity of pain is imperative to prevent such outcomes and determine the appropriate intervention. However, the evaluation of pain intensity is challenging because different individuals experience pain differently. To overcome thi… ▽ More

    Submitted 29 November, 2023; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: 26 pages, 5 figures, 9 tables

  5. arXiv:2310.06434  [pdf, other

    cs.CL cs.AI cs.MM cs.SD eess.AS

    Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition

    Authors: Srijith Radhakrishnan, Chao-Han Huck Yang, Sumeer Ahmad Khan, Rohit Kumar, Narsis A. Kiani, David Gomez-Cabrero, Jesper N. Tegner

    Abstract: We introduce a new cross-modal fusion technique designed for generative error correction in automatic speech recognition (ASR). Our methodology leverages both acoustic information and external linguistic representations to generate accurate speech transcription contexts. This marks a step towards a fresh paradigm in generative error correction within the realm of n-best hypotheses. Unlike the exis… ▽ More

    Submitted 16 October, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023 as main paper. 10 pages. Revised math notations. GitHub: https://github.com/Srijith-rkr/Whispering-LLaMA

  6. arXiv:2306.12377  [pdf, ps, other

    cs.LG cs.CG cs.CR

    Geometric Algorithms for $k$-NN Poisoning

    Authors: Diego Ihara Centurion, Karine Chubarian, Bohan Fan, Francesco Sgherzi, Thiruvenkadam S Radhakrishnan, Anastasios Sidiropoulos, Angelo Straight

    Abstract: We propose a label poisoning attack on geometric data sets against $k$-nearest neighbor classification. We provide an algorithm that can compute an $\varepsilon n$-additive approximation of the optimal poisoning in $n\cdot 2^{2^{O(d+k/\varepsilon)}}$ time for a given data set $X \in \mathbb{R}^d$, where $|X| = n$. Our algorithm achieves its objectives through the application of multi-scale random… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: 14 pages, 1 figure

  7. arXiv:2305.11244  [pdf, other

    cs.CL cs.AI cs.LG cs.NE eess.AS

    A Parameter-Efficient Learning Approach to Arabic Dialect Identification with Pre-Trained General-Purpose Speech Model

    Authors: Srijith Radhakrishnan, Chao-Han Huck Yang, Sumeer Ahmad Khan, Narsis A. Kiani, David Gomez-Cabrero, Jesper N. Tegner

    Abstract: In this work, we explore Parameter-Efficient-Learning (PEL) techniques to repurpose a General-Purpose-Speech (GSM) model for Arabic dialect identification (ADI). Specifically, we investigate different setups to incorporate trainable features into a multi-layer encoder-decoder GSM formulation under frozen pre-trained settings. Our architecture includes residual adapter and model reprogramming (inpu… ▽ More

    Submitted 3 October, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: Accepted to Interspeech 2023, 5 pages. Code is available at: https://github.com/Srijith-rkr/KAUST-Whisper-Adapter under MIT license

  8. arXiv:2105.06464  [pdf, other

    cs.CV cs.LG

    DiscoBox: Weakly Supervised Instance Segmentation and Semantic Correspondence from Box Supervision

    Authors: Shiyi Lan, Zhiding Yu, Christopher Choy, Subhashree Radhakrishnan, Guilin Liu, Yuke Zhu, Larry S. Davis, Anima Anandkumar

    Abstract: We introduce DiscoBox, a novel framework that jointly learns instance segmentation and semantic correspondence using bounding box supervision. Specifically, we propose a self-ensembling framework where instance segmentation and semantic correspondence are jointly guided by a structured teacher in addition to the bounding box supervision. The teacher is a structured energy model incorporating a pai… ▽ More

    Submitted 5 June, 2021; v1 submitted 13 May, 2021; originally announced May 2021.

    Comments: Tech Report

  9. arXiv:2003.04150  [pdf, other

    cs.DC

    Lightweight Inter-transaction Caching with Precise Clocks and Dynamic Self-invalidation

    Authors: Pulkit A. Misra, Srihari Radhakrishnan, Jeffrey S. Chase, Johannes Gehrke, Alvin R. Lebeck

    Abstract: Distributed, transactional storage systems scale by sharding data across servers. However, workload-induced hotspots result in contention, leading to higher abort rates and performance degradation. We present KAIROS, a transactional key-value storage system that leverages client-side inter-transaction caching and sharded transaction validation to balance the dynamic load and alleviate workload-i… ▽ More

    Submitted 9 March, 2020; originally announced March 2020.

  10. arXiv:2001.08805  [pdf

    cs.RO q-bio.NC

    Inexpensive and Portable System for Dexterous High-Density Myoelectric Control of Multiarticulate Prostheses

    Authors: Jacob A. George, Sridharan Radhakrishnan, Mark R. Brinton, Gregory A. Clark

    Abstract: Multiarticulate bionic arms are now capable of mimicking the endogenous movements of the human hand. 3D-printing has reduced the cost of prosthetic hands themselves, but there is currently no low-cost alternative to dexterous electromyographic (EMG) control systems. To address this need, we developed an inexpensive (~$675) and portable EMG control system by integrating low-cost microcontrollers wi… ▽ More

    Submitted 23 January, 2020; originally announced January 2020.

    Comments: IEEE EMBC 2020

  11. arXiv:1509.04394  [pdf

    cs.DC

    Efficient Kernel Fusion Techniques for Massive Video Data Analysis on GPGPUs

    Authors: Asif M Adnan, Sridhar Radhakrishnan, Suleyman Karabuk

    Abstract: Kernels are executable code segments and kernel fusion is a technique for combing the segments in a coherent manner to improve execution time. For the first time, we have developed a technique to fuse image processing kernels to be executed on GPGPUs for improving execution time and total throughput (amount of data processed in unit time). We have applied our techniques for feature tracking on vid… ▽ More

    Submitted 15 September, 2015; originally announced September 2015.

  12. Heterogeneous processor pipeline for a product cipher application

    Authors: I. B. Nawinne, M. S. Wickramasinghe, R. G. Ragel, S. Radhakrishnan

    Abstract: Processing data received as a stream is a task commonly performed by modern embedded devices, in a wide range of applications such as multimedia (encoding/decoding/ playing media), networking (switching and routing), digital security, scientific data processing, etc. Such processing normally tends to be calculation intensive and therefore requiring significant processing power. Therefore, hardware… ▽ More

    Submitted 28 March, 2014; originally announced March 2014.

    Journal ref: Industrial and Information Systems (ICIIS), 2011 6th IEEE International Conference on, 16-19 Aug 2011, pp. 32 - 37, Kandy

  13. Instruction-set Selection for Multi-application based ASIP Design: An Instruction-level Study

    Authors: R. G. Ragel, Swarnalatha Radhakrishnan, Angelo Ambrose

    Abstract: Efficiency in embedded systems is paramount to achieve high performance while consuming less area and power. Processors in embedded systems have to be designed carefully to achieve such design constraints. Application Specific Instruction set Processors (ASIPs) exploit the nature of applications to design an optimal instruction set. Despite being not general to execute any application, ASIPs are h… ▽ More

    Submitted 28 March, 2014; originally announced March 2014.

    Journal ref: Information and Automation for Sustainability (ICIAfS), 2012 IEEE 6th International Conference on, 27-29 Sept 2012, pp 141-146, Beijing

  14. Loop Unrolling in Multi-pipeline ASIP Design

    Authors: Rajitha Navarathna, Swarnalatha Radhakrishnan, Roshan Ragel

    Abstract: Application Specific Instruction-set Processor (ASIP) is one of the popular processor design techniques for embedded systems which allows customizability in processor design without overly hindering design flexibility. Multi-pipeline ASIPs were proposed to improve the performance of such systems by compromising between speed and processor area. One of the problems in the multi-pipeline design is t… ▽ More

    Submitted 4 February, 2014; originally announced February 2014.

    Comments: 6 pages

    Journal ref: Navarathna, H. M R D B; Radhakrishnan, S.; Ragel, R.G., "Loop unrolling in multi-pipeline ASIP design," Industrial and Information Systems (ICIIS), 2009 International Conference on , pp.306-311, 28-31 Dec. 2009

  15. Axis2UNO: Web Services Enabled Openoffice.org

    Authors: B. A. N. M. Bambarasinghe, H. M. S. Huruggamuwa, R. G. Ragel, S. Radhakrishnan

    Abstract: Openoffice.org is a popular, free and open source office product. This product is used by millions of people and developed, maintained and extended by thousands of developers worldwide. Playing a dominant role in the web, web services technology is serving millions of people every day. Axis2 is one of the most popular, free and open source web service engines. The framework presented in this paper… ▽ More

    Submitted 4 February, 2014; originally announced February 2014.

    Comments: 6 pages, 4th International Conference on Information and Automation for Sustainability, 2008. ICIAFS 2008

    Journal ref: ICIAFS 2008. 437-442, 12-14 Dec. 2008

  16. arXiv:1204.2041  [pdf

    cs.NI

    Improving Route Discovery Using Stable Connected Dominating Set in MANETs

    Authors: R. Ramalakshmi, S. Radhakrishnan

    Abstract: A Connected Dominating Set (CDS) based virtual backbone plays an important role in wireless ad hoc networks for efficient routing and broadcasting. Each node in the network can select some of its 1-hop neighbors as Multi Point Relay (MPR) to cover all its 2-hop neighbors. A MPR based CDS is a promising approach for broadcasting. A node in the CDS consumes more energy and the energy depletes quickl… ▽ More

    Submitted 24 April, 2012; v1 submitted 10 April, 2012; originally announced April 2012.

    Comments: International Conference on NetCom-3.0

    Journal ref: International journal on applications of graph theory in wireless ad hoc networks and sensor networks(GRAPH-HOC), Vol 4, No.1, March 2012

  17. arXiv:1004.1757  [pdf

    cs.NI cs.MM

    Processor Based Active Queue Management for providing QoS in Multimedia Application

    Authors: N. Saravana Selvam, S. Radhakrishnan

    Abstract: The objective of this paper is to implement the Active Network based Active Queue Management Technique for providing Quality of Service (QoS) using Network Processor(NP) based router to enhance multimedia applications. The performance is evaluated using Intel IXP2400 NP Simulator. The results demonstrate that, Active Network based Active Queue Management has better performance than RED algorithm i… ▽ More

    Submitted 10 April, 2010; originally announced April 2010.

    Comments: IEEE Publication format, International Journal of Computer Science and Information Security, IJCSIS, Vol. 7 No. 3, March 2010, USA. ISSN 1947 5500, http://sites.google.com/site/ijcsis/