Skip to main content

Showing 1–50 of 137 results for author: Joshi, R

  1. arXiv:2407.12869  [pdf, ps, other

    cs.CL cs.AI

    Bilingual Adaptation of Monolingual Foundation Models

    Authors: Gurpreet Gosal, Yishi Xu, Gokul Ramakrishnan, Rituraj Joshi, Avraham Sheinin, Zhiming, Chen, Biswajit Mishra, Natalia Vassilieva, Joel Hestness, Neha Sengupta, Sunil Kumar Sahu, Bokang Jia, Satheesh Katipomu, Onkar Pandit, Samta Kamboj, Rahul Pal, Parvez Mullah, Soundar Doraiswamy, Mohamed El Karim Chami

    Abstract: We present an efficient method for adapting a monolingual Large Language Model (LLM) to another language, addressing challenges of catastrophic forgetting and tokenizer limitations. We focus this study on adapting Llama 2 to Arabic. Our two-stage approach begins with expanding the vocabulary and training only the embeddings matrix, followed by full model continual pretraining on a bilingual corpus… ▽ More

    Submitted 13 July, 2024; originally announced July 2024.

  2. arXiv:2407.05264  [pdf, ps, other

    math.CO cs.DM

    $θ$-free matching covered graphs

    Authors: Rohinee Joshi, Nishad Kothari

    Abstract: A nontrivial connected graph is matching covered if each edge belongs to some perfect matching. For most problems pertaining to perfect matchings, one may restrict attention to matching covered graphs; thus, there is extensive literature on them. A cornerstone of this theory is an ear decomposition result due to Lovász and Plummer. Their theorem is a fundamental problem-solving tool, and also yiel… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: Submitted to a journal

  3. Curating Stopwords in Marathi: A TF-IDF Approach for Improved Text Analysis and Information Retrieval

    Authors: Rohan Chavan, Gaurav Patil, Vishal Madle, Raviraj Joshi

    Abstract: Stopwords are commonly used words in a language that are often considered to be of little value in determining the meaning or significance of a document. These words occur frequently in most texts and don't provide much useful information for tasks like sentiment analysis and text classification. English, which is a high-resource language, takes advantage of the availability of stopwords, whereas… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: Accepted at I2CT 2024

  4. Universal Cross-Lingual Text Classification

    Authors: Riya Savant, Anushka Shelke, Sakshi Todmal, Sanskruti Kanphade, Ananya Joshi, Raviraj Joshi

    Abstract: Text classification, an integral task in natural language processing, involves the automatic categorization of text into predefined classes. Creating supervised labeled datasets for low-resource languages poses a considerable challenge. Unlocking the language potential of low-resource languages requires robust datasets with supervised labels. However, such datasets are scarce, and the label space… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: Accepted at I2CT 2024

  5. arXiv:2405.19107  [pdf, ps, other

    cs.LG cs.AI

    Offline Regularised Reinforcement Learning for Large Language Models Alignment

    Authors: Pierre Harvey Richemond, Yunhao Tang, Daniel Guo, Daniele Calandriello, Mohammad Gheshlaghi Azar, Rafael Rafailov, Bernardo Avila Pires, Eugene Tarassov, Lucas Spangher, Will Ellsworth, Aliaksei Severyn, Jonathan Mallinson, Lior Shani, Gil Shamir, Rishabh Joshi, Tianqi Liu, Remi Munos, Bilal Piot

    Abstract: The dominant framework for alignment of large language models (LLM), whether through reinforcement learning from human feedback or direct preference optimisation, is to learn from preference data. This involves building datasets where each element is a quadruplet composed of a prompt, two independent responses (completions of the prompt) and a human preference between the two independent responses… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  6. arXiv:2405.07933  [pdf, other

    cs.CV

    Authentic Hand Avatar from a Phone Scan via Universal Hand Model

    Authors: Gyeongsik Moon, Weipeng Xu, Rohan Joshi, Chenglei Wu, Takaaki Shiratori

    Abstract: The authentic 3D hand avatar with every identifiable information, such as hand shapes and textures, is necessary for immersive experiences in AR/VR. In this paper, we present a universal hand model (UHM), which 1) can universally represent high-fidelity 3D hand meshes of arbitrary identities (IDs) and 2) can be adapted to each person with a short phone scan for the authentic hand avatar. For effec… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: Accepted to CVPR 2024

  7. TextGram: Towards a better domain-adaptive pretraining

    Authors: Sharayu Hiwarkhedkar, Saloni Mittal, Vidula Magdum, Omkar Dhekane, Raviraj Joshi, Geetanjali Kale, Arnav Ladkat

    Abstract: For green AI, it is crucial to measure and reduce the carbon footprint emitted during the training of large language models. In NLP, performing pre-training on Transformer models requires significant computational resources. This pre-training involves using a large amount of text data to gain prior knowledge for performing downstream tasks. Thus, it is important that we select the correct data in… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: Accepted at SPELLL 2023

  8. L3Cube-MahaNews: News-based Short Text and Long Document Classification Datasets in Marathi

    Authors: Saloni Mittal, Vidula Magdum, Omkar Dhekane, Sharayu Hiwarkhedkar, Raviraj Joshi

    Abstract: The availability of text or topic classification datasets in the low-resource Marathi language is limited, typically consisting of fewer than 4 target labels, with some achieving nearly perfect accuracy. In this work, we introduce L3Cube-MahaNews, a Marathi text classification corpus that focuses on News headlines and articles. This corpus stands out as the largest supervised Marathi Corpus, conta… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: Accepted at SPELLL 2023

  9. arXiv:2404.13364  [pdf, other

    cs.CL cs.LG

    MahaSQuAD: Bridging Linguistic Divides in Marathi Question-Answering

    Authors: Ruturaj Ghatage, Aditya Kulkarni, Rajlaxmi Patil, Sharvi Endait, Raviraj Joshi

    Abstract: Question-answering systems have revolutionized information retrieval, but linguistic and cultural boundaries limit their widespread accessibility. This research endeavors to bridge the gap of the absence of efficient QnA datasets in low-resource languages by translating the English Question Answering Dataset (SQuAD) using a robust data curation approach. We introduce MahaSQuAD, the first-ever full… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: Accepted at the International Conference on Natural Language Processing (ICON 2023)

  10. arXiv:2403.08635  [pdf, other

    cs.LG cs.AI stat.ML

    Human Alignment of Large Language Models through Online Preference Optimisation

    Authors: Daniele Calandriello, Daniel Guo, Remi Munos, Mark Rowland, Yunhao Tang, Bernardo Avila Pires, Pierre Harvey Richemond, Charline Le Lan, Michal Valko, Tianqi Liu, Rishabh Joshi, Zeyu Zheng, Bilal Piot

    Abstract: Ensuring alignment of language models' outputs with human preferences is critical to guarantee a useful, safe, and pleasant user experience. Thus, human alignment has been extensively studied recently and several methods such as Reinforcement Learning from Human Feedback (RLHF), Direct Policy Optimisation (DPO) and Sequence Likelihood Calibration (SLiC) have emerged. In this paper, our contributio… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  11. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  12. arXiv:2403.04981  [pdf, other

    cs.ET

    Paving the Way for Pass Disturb Free Vertical NAND Storage via A Dedicated and String-Compatible Pass Gate

    Authors: Zijian Zhao, Sola Woo, Khandker Akif Aabrar, Sharadindu Gopal Kirtania, Zhouhang Jiang, Shan Deng, Yi Xiao, Halid Mulaosmanovic, Stefan Duenkel, Dominik Kleimaier, Steven Soss, Sven Beyer, Rajiv Joshi, Scott Meninger, Mohamed Mohamed, Kijoon Kim, Jongho Woo, Suhwan Lim, Kwangsoo Kim, Wanki Kim, Daewon Ha, Vijaykrishnan Narayanan, Suman Datta, Shimeng Yu, Kai Ni

    Abstract: In this work, we propose a dual-port cell design to address the pass disturb in vertical NAND storage, which can pass signals through a dedicated and string-compatible pass gate. We demonstrate that: i) the pass disturb-free feature originates from weakening of the depolarization field by the pass bias at the high-${V}_{TH}$ (HVT) state and the screening of the applied field by channel at the low-… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 29 pages, 7 figures

  13. arXiv:2402.09545  [pdf

    cs.CR cs.ET

    A 3D Memristor Architecture for In-Memory Computing Demonstrated with SHA3

    Authors: Muayad J. Aljafar, Rasika Joshi, John M. Acken

    Abstract: Security is a growing problem that needs hardware support. Memristors provide an alternative technology for hardware-supported security implementation. This paper presents a specific technique that utilizes the benefits of hybrid CMOS-memristors technology demonstrated with SHA3 over implementations that use only memristor technology. In the proposed technique, SHA3 is implemented in a set of perp… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: 14 pages, 4 tables, 12 figures

  14. arXiv:2402.06185  [pdf, other

    cs.CV cs.AI cs.LG

    Development and validation of an artificial intelligence model to accurately predict spinopelvic parameters

    Authors: Edward S. Harake, Joseph R. Linzey, Cheng Jiang, Rushikesh S. Joshi, Mark M. Zaki, Jaes C. Jones, Siri S. Khalsa, John H. Lee, Zachary Wilseck, Jacob R. Joseph, Todd C. Hollon, Paul Park

    Abstract: Objective. Achieving appropriate spinopelvic alignment has been shown to be associated with improved clinical symptoms. However, measurement of spinopelvic radiographic parameters is time-intensive and interobserver reliability is a concern. Automated measurement tools have the promise of rapid and consistent measurements, but existing tools are still limited by some degree of manual user-entry re… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: 10 pages, 5 figures, to appear in Journal of Neurosurgery: Spine

  15. arXiv:2402.01878  [pdf, other

    cs.CL cs.LG

    LiPO: Listwise Preference Optimization through Learning-to-Rank

    Authors: Tianqi Liu, Zhen Qin, Junru Wu, Jiaming Shen, Misha Khalman, Rishabh Joshi, Yao Zhao, Mohammad Saleh, Simon Baumgartner, Jialu Liu, Peter J. Liu, Xuanhui Wang

    Abstract: Aligning language models (LMs) with curated human feedback is critical to control their behaviors in real-world applications. Several recent policy optimization methods, such as DPO and SLiC, serve as promising alternatives to the traditional Reinforcement Learning from Human Feedback (RLHF) approach. In practice, human feedback often comes in a format of a ranked list over multiple responses to a… ▽ More

    Submitted 22 May, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  16. arXiv:2401.05334  [pdf, other

    cs.CV cs.GR

    URHand: Universal Relightable Hands

    Authors: Zhaoxi Chen, Gyeongsik Moon, Kaiwen Guo, Chen Cao, Stanislav Pidhorskyi, Tomas Simon, Rohan Joshi, Yuan Dong, Yichen Xu, Bernardo Pires, He Wen, Lucas Evans, Bo Peng, Julia Buffalini, Autumn Trimble, Kevyn McPhail, Melissa Schoeller, Shoou-I Yu, Javier Romero, Michael Zollhöfer, Yaser Sheikh, Ziwei Liu, Shunsuke Saito

    Abstract: Existing photorealistic relightable hand models require extensive identity-specific observations in different views, poses, and illuminations, and face challenges in generalizing to natural illuminations and novel identities. To bridge this gap, we present URHand, the first universal relightable hand model that generalizes across viewpoints, poses, illuminations, and identities. Our model allows f… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: Project Page https://frozenburning.github.io/projects/urhand/

  17. arXiv:2401.02254  [pdf, other

    cs.CL cs.LG

    L3Cube-IndicNews: News-based Short Text and Long Document Classification Datasets in Indic Languages

    Authors: Aishwarya Mirashi, Srushti Sonavane, Purva Lingayat, Tejas Padhiyar, Raviraj Joshi

    Abstract: In this work, we introduce L3Cube-IndicNews, a multilingual text classification corpus aimed at curating a high-quality dataset for Indian regional languages, with a specific focus on news headlines and articles. We have centered our work on 10 prominent Indic languages, including Hindi, Bengali, Marathi, Telugu, Tamil, Gujarati, Kannada, Odia, Malayalam, and Punjabi. Each of these news datasets c… ▽ More

    Submitted 26 April, 2024; v1 submitted 4 January, 2024; originally announced January 2024.

    Comments: Accepted at the International Conference on Natural Language Processing (ICON 2023)

  18. L3Cube-MahaSocialNER: A Social Media based Marathi NER Dataset and BERT models

    Authors: Harsh Chaudhari, Anuja Patil, Dhanashree Lavekar, Pranav Khairnar, Raviraj Joshi

    Abstract: This work introduces the L3Cube-MahaSocialNER dataset, the first and largest social media dataset specifically designed for Named Entity Recognition (NER) in the Marathi language. The dataset comprises 18,000 manually labeled sentences covering eight entity classes, addressing challenges posed by social media data, including non-standard language and informal idioms. Deep learning models, includin… ▽ More

    Submitted 30 December, 2023; originally announced January 2024.

    Comments: Accepted at Forum for Information Retrieval Evaluation (FIRE 2023)

  19. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  20. IndoorGNN: A Graph Neural Network based approach for Indoor Localization using WiFi RSSI

    Authors: Rahul Vishwakarma, Rucha Bhalchandra Joshi, Subhankar Mishra

    Abstract: Indoor localization is the process of determining the location of a person or object inside a building. Potential usage of indoor localization includes navigation, personalization, safety and security, and asset tracking. Commonly used technologies for indoor localization include WiFi, Bluetooth, RFID, and Ultra-wideband. Among these, WiFi's Received Signal Strength Indicator (RSSI)-based localiza… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Journal ref: Lecture Notes in Computer Science, vol 14418, year 2023. Springer, Cham

  21. arXiv:2312.07418  [pdf

    cs.CV

    Attention Based Encoder Decoder Model for Video Captioning in Nepali (2023)

    Authors: Kabita Parajuli, Shashidhar Ram Joshi

    Abstract: Video captioning in Nepali, a language written in the Devanagari script, presents a unique challenge due to the lack of existing academic work in this domain. This work develops a novel encoder-decoder paradigm for Nepali video captioning to tackle this difficulty. LSTM and GRU sequence-to-sequence models are used in the model to produce related textual descriptions based on features retrieved fro… ▽ More

    Submitted 19 May, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

  22. On Significance of Subword tokenization for Low Resource and Efficient Named Entity Recognition: A case study in Marathi

    Authors: Harsh Chaudhari, Anuja Patil, Dhanashree Lavekar, Pranav Khairnar, Raviraj Joshi, Sachin Pande

    Abstract: Named Entity Recognition (NER) systems play a vital role in NLP applications such as machine translation, summarization, and question-answering. These systems identify named entities, which encompass real-world concepts like locations, persons, and organizations. Despite extensive research on NER systems for the English language, they have not received adequate attention in the context of low reso… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

    Comments: Accepted at ICDAM 2023

  23. arXiv:2312.01107  [pdf, other

    cs.LG

    Rapid Speaker Adaptation in Low Resource Text to Speech Systems using Synthetic Data and Transfer learning

    Authors: Raviraj Joshi, Nikesh Garera

    Abstract: Text-to-speech (TTS) systems are being built using end-to-end deep learning approaches. However, these systems require huge amounts of training data. We present our approach to built production quality TTS and perform speaker adaptation in extremely low resource settings. We propose a transfer learning approach using high-resource language data and synthetically generated data. We transfer the lea… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

    Comments: Accepted at PACLIC 2023

  24. Code-Mixed Text to Speech Synthesis under Low-Resource Constraints

    Authors: Raviraj Joshi, Nikesh Garera

    Abstract: Text-to-speech (TTS) systems are an important component in voice-based e-commerce applications. These applications include end-to-end voice assistant and customer experience (CX) voice bot. Code-mixed TTS is also relevant in these applications since the product names are commonly described in English while the surrounding text is in a regional language. In this work, we describe our approaches for… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

    Comments: Accepted at SPECOM 2023

  25. arXiv:2311.17722  [pdf, other

    cs.CL cs.LG

    SenTest: Evaluating Robustness of Sentence Encoders

    Authors: Tanmay Chavan, Shantanu Patankar, Aditya Kane, Omkar Gokhale, Geetanjali Kale, Raviraj Joshi

    Abstract: Contrastive learning has proven to be an effective method for pre-training models using weakly labeled data in the vision domain. Sentence transformers are the NLP counterparts to this architecture, and have been growing in popularity due to their rich and effective sentence representations. Having effective sentence representations is paramount in multiple tasks, such as information retrieval, re… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

  26. arXiv:2311.14335  [pdf, other

    cs.LG cs.AI

    Comparative Analysis of Transformers for Modeling Tabular Data: A Casestudy using Industry Scale Dataset

    Authors: Usneek Singh, Piyush Arora, Shamika Ganesan, Mohit Kumar, Siddhant Kulkarni, Salil R. Joshi

    Abstract: We perform a comparative analysis of transformer-based models designed for modeling tabular data, specifically on an industry-scale dataset. While earlier studies demonstrated promising outcomes on smaller public or synthetic datasets, the effectiveness did not extend to larger industry-scale datasets. The challenges identified include handling high-dimensional data, the necessity for efficient pr… ▽ More

    Submitted 24 November, 2023; originally announced November 2023.

    Comments: Accepted at 7th Joint International Conference on Data Science & Management of Data (11th ACMIKDD CODS and 29th COMAD)

  27. arXiv:2311.02579  [pdf, other

    cs.CL cs.LG

    mahaNLP: A Marathi Natural Language Processing Library

    Authors: Vidula Magdum, Omkar Dhekane, Sharayu Hiwarkhedkar, Saloni Mittal, Raviraj Joshi

    Abstract: We present mahaNLP, an open-source natural language processing (NLP) library specifically built for the Marathi language. It aims to enhance the support for the low-resource Indian language Marathi in the field of NLP. It is an easy-to-use, extensible, and modular toolkit for Marathi text analysis built on state-of-the-art MahaBERT-based transformer models. Our work holds significant importance as… ▽ More

    Submitted 5 November, 2023; originally announced November 2023.

    Comments: Accepted at IJCNLP-AACL 2023

  28. arXiv:2310.17768  [pdf, other

    cs.CV

    A Dataset of Relighted 3D Interacting Hands

    Authors: Gyeongsik Moon, Shunsuke Saito, Weipeng Xu, Rohan Joshi, Julia Buffalini, Harley Bellan, Nicholas Rosen, Jesse Richardson, Mallorie Mize, Philippe de Bree, Tomas Simon, Bo Peng, Shubham Garg, Kevyn McPhail, Takaaki Shiratori

    Abstract: The two-hand interaction is one of the most challenging signals to analyze due to the self-similarity, complicated articulations, and occlusions of hands. Although several datasets have been proposed for the two-hand interaction analysis, all of them do not achieve 1) diverse and realistic image appearances and 2) diverse and large-scale groundtruth (GT) 3D poses at the same time. In this work, we… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: Accepted by NeurIPS 2023 (Datasets and Benchmarks Track)

  29. arXiv:2310.08764  [pdf, other

    cs.CL cs.LG

    Calibrating Likelihoods towards Consistency in Summarization Models

    Authors: Polina Zablotskaia, Misha Khalman, Rishabh Joshi, Livio Baldini Soares, Shoshana Jakobovits, Joshua Maynez, Shashi Narayan

    Abstract: Despite the recent advances in abstractive text summarization, current summarization models still suffer from generating factually inconsistent summaries, reducing their utility for real-world application. We argue that the main reason for such behavior is that the summarization models trained with maximum likelihood objective assign high probability to plausible sequences given the context, but t… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

  30. arXiv:2310.08743  [pdf

    cs.CV cs.AI cs.LG

    Development and Validation of a Deep Learning-Based Microsatellite Instability Predictor from Prostate Cancer Whole-Slide Images

    Authors: Qiyuan Hu, Abbas A. Rizvi, Geoffery Schau, Kshitij Ingale, Yoni Muller, Rachel Baits, Sebastian Pretzer, Aïcha BenTaieb, Abigail Gordhamer, Roberto Nussenzveig, Adam Cole, Matthew O. Leavitt, Rohan P. Joshi, Nike Beaubier, Martin C. Stumpe, Kunal Nagpal

    Abstract: Microsatellite instability-high (MSI-H) is a tumor agnostic biomarker for immune checkpoint inhibitor therapy. However, MSI status is not routinely tested in prostate cancer, in part due to low prevalence and assay cost. As such, prediction of MSI status from hematoxylin and eosin (H&E) stained whole-slide images (WSIs) could identify prostate cancer patients most likely to benefit from confirmato… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

  31. arXiv:2310.07682  [pdf

    cs.CV

    Prediction of MET Overexpression in Non-Small Cell Lung Adenocarcinomas from Hematoxylin and Eosin Images

    Authors: Kshitij Ingale, Sun Hae Hong, Josh S. K. Bell, Abbas Rizvi, Amy Welch, Lingdao Sha, Irvin Ho, Kunal Nagpal, Aicha BenTaieb, Rohan P Joshi, Martin C Stumpe

    Abstract: MET protein overexpression is a targetable event in non-small cell lung cancer (NSCLC) and is the subject of active drug development. Challenges in identifying patients for these therapies include lack of access to validated testing, such as standardized immunohistochemistry (IHC) assessment, and consumption of valuable tissue for a single gene/protein assay. Development of pre-screening algorithm… ▽ More

    Submitted 12 October, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

  32. arXiv:2310.02249  [pdf, other

    cs.CL cs.LG

    Harnessing Pre-Trained Sentence Transformers for Offensive Language Detection in Indian Languages

    Authors: Ananya Joshi, Raviraj Joshi

    Abstract: In our increasingly interconnected digital world, social media platforms have emerged as powerful channels for the dissemination of hate speech and offensive content. This work delves into the domain of hate speech detection, placing specific emphasis on three low-resource Indian languages: Bengali, Assamese, and Gujarati. The challenge is framed as a text classification task, aimed at discerning… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: HASOC at FIRE 2023

  33. arXiv:2310.00734  [pdf, other

    cs.CL cs.LG

    Robust Sentiment Analysis for Low Resource languages Using Data Augmentation Approaches: A Case Study in Marathi

    Authors: Aabha Pingle, Aditya Vyawahare, Isha Joshi, Rahul Tangsali, Geetanjali Kale, Raviraj Joshi

    Abstract: Sentiment analysis plays a crucial role in understanding the sentiment expressed in text data. While sentiment analysis research has been extensively conducted in English and other Western languages, there exists a significant gap in research efforts for sentiment analysis in low-resource languages. Limited resources, including datasets and NLP research, hinder the progress in this area. In this w… ▽ More

    Submitted 1 October, 2023; originally announced October 2023.

  34. arXiv:2309.06657  [pdf, other

    cs.CL

    Statistical Rejection Sampling Improves Preference Optimization

    Authors: Tianqi Liu, Yao Zhao, Rishabh Joshi, Misha Khalman, Mohammad Saleh, Peter J. Liu, Jialu Liu

    Abstract: Improving the alignment of language models with human preferences remains an active research challenge. Previous approaches have primarily utilized Reinforcement Learning from Human Feedback (RLHF) via online RL methods such as Proximal Policy Optimization (PPO). Recently, offline methods such as Sequence Likelihood Calibration (SLiC) and Direct Preference Optimization (DPO) have emerged as attrac… ▽ More

    Submitted 23 January, 2024; v1 submitted 12 September, 2023; originally announced September 2023.

    Comments: Accepted in ICLR 2024

  35. arXiv:2308.08941  [pdf, other

    cs.CV

    Automatic Signboard Recognition in Low Quality Night Images

    Authors: Manas Kagde, Priyanka Choudhary, Rishi Joshi, Somnath Dey

    Abstract: An essential requirement for driver assistance systems and autonomous driving technology is implementing a robust system for detecting and recognizing traffic signs. This system enables the vehicle to autonomously analyze the environment and make appropriate decisions regarding its movement, even when operating at higher frame rates. However, traffic sign images captured in inadequate lighting and… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: 13 pages, CVIP 2023

  36. arXiv:2306.14030  [pdf, other

    cs.CL cs.LG

    My Boli: Code-mixed Marathi-English Corpora, Pretrained Language Models and Evaluation Benchmarks

    Authors: Tanmay Chavan, Omkar Gokhale, Aditya Kane, Shantanu Patankar, Raviraj Joshi

    Abstract: The research on code-mixed data is limited due to the unavailability of dedicated code-mixed datasets and pre-trained language models. In this work, we focus on the low-resource Indian language Marathi which lacks any prior work in code-mixing. We present L3Cube-MeCorpus, a large code-mixed Marathi-English (Mr-En) corpus with 10 million social media sentences for pretraining. We also release L3Cub… ▽ More

    Submitted 20 July, 2023; v1 submitted 24 June, 2023; originally announced June 2023.

  37. arXiv:2306.13888  [pdf, other

    cs.CL cs.LG

    L3Cube-MahaSent-MD: A Multi-domain Marathi Sentiment Analysis Dataset and Transformer Models

    Authors: Aabha Pingle, Aditya Vyawahare, Isha Joshi, Rahul Tangsali, Raviraj Joshi

    Abstract: The exploration of sentiment analysis in low-resource languages, such as Marathi, has been limited due to the availability of suitable datasets. In this work, we present L3Cube-MahaSent-MD, a multi-domain Marathi sentiment analysis dataset, with four different domains - movie reviews, general tweets, TV show subtitles, and political tweets. The dataset consists of around 60,000 manually tagged sam… ▽ More

    Submitted 24 June, 2023; originally announced June 2023.

    Comments: Accepted at DMLR Workshop @ ICML 2023

  38. Enhancing Low Resource NER Using Assisting Language And Transfer Learning

    Authors: Maithili Sabane, Aparna Ranade, Onkar Litake, Parth Patil, Raviraj Joshi, Dipali Kadam

    Abstract: Named Entity Recognition (NER) is a fundamental task in NLP that is used to locate the key information in text and is primarily applied in conversational and search systems. In commercial applications, NER or comparable slot-filling methods have been widely deployed for popular languages. NER is used in applications such as human resources, customer service, search engines, content classification,… ▽ More

    Submitted 10 June, 2023; originally announced June 2023.

    Comments: Accepted at International Conference on Applied Artificial Intelligence and Computing (ICAAIC) 2023

  39. arXiv:2306.04964  [pdf, other

    cs.CL cs.LG

    Leveraging Language Identification to Enhance Code-Mixed Text Classification

    Authors: Gauri Takawane, Abhishek Phaltankar, Varad Patwardhan, Aryan Patil, Raviraj Joshi, Mukta S. Takalikar

    Abstract: The usage of more than one language in the same text is referred to as Code Mixed. It is evident that there is a growing degree of adaption of the use of code-mixed data, especially English with a regional language, on social media platforms. Existing deep-learning models do not take advantage of the implicit language information in the code-mixed text. Our study aims to improve BERT-based models… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

  40. arXiv:2305.17824  [pdf, other

    cs.FL

    A Unified Model for Real-Time Systems: Symbolic Techniques and Implementation

    Authors: S Akshay, Paul Gastin, R Govind, Aniruddha R Joshi, B Srivathsan

    Abstract: In this paper, we consider a model of generalized timed automata (GTA) with two kinds of clocks, history and future, that can express many timed features succinctly, including timed automata, event-clock automata with and without diagonal constraints, and automata with timers. Our main contribution is a new simulation-based zone algorithm for checking reachability in this unified model. While such… ▽ More

    Submitted 28 May, 2023; originally announced May 2023.

    Comments: arXiv admin note: text overlap with arXiv:2207.02633

  41. Comparative Study of Pre-Trained BERT Models for Code-Mixed Hindi-English Data

    Authors: Aryan Patil, Varad Patwardhan, Abhishek Phaltankar, Gauri Takawane, Raviraj Joshi

    Abstract: The term "Code Mixed" refers to the use of more than one language in the same text. This phenomenon is predominantly observed on social media platforms, with an increasing amount of adaptation as time goes on. It is critical to detect foreign elements in a language and process them correctly, as a considerable number of individuals are using code-mixed languages that could not be comprehended by u… ▽ More

    Submitted 26 May, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: Accepted at IEEE 8th International Conference for Convergence in Technology

  42. arXiv:2305.15365  [pdf, other

    cs.CV

    Boundary Attention Mapping (BAM): Fine-grained saliency maps for segmentation of Burn Injuries

    Authors: Mahla Abdolahnejad, Justin Lee, Hannah Chan, Alex Morzycki, Olivier Ethier, Anthea Mo, Peter X. Liu, Joshua N. Wong, Colin Hong, Rakesh Joshi

    Abstract: Burn injuries can result from mechanisms such as thermal, chemical, and electrical insults. A prompt and accurate assessment of burns is essential for deciding definitive clinical treatments. Currently, the primary approach for burn assessments, via visual and tactile observations, is approximately 60%-80% accurate. The gold standard is biopsy and a close second would be non-invasive methods like… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

  43. arXiv:2305.10425  [pdf, other

    cs.CL cs.AI

    SLiC-HF: Sequence Likelihood Calibration with Human Feedback

    Authors: Yao Zhao, Rishabh Joshi, Tianqi Liu, Misha Khalman, Mohammad Saleh, Peter J. Liu

    Abstract: Learning from human feedback has been shown to be effective at aligning language models with human preferences. Past work has often relied on Reinforcement Learning from Human Feedback (RLHF), which optimizes the language model using reward scores assigned from a reward model trained on human preference data. In this work we show how the recently introduced Sequence Likelihood Calibration (SLiC),… ▽ More

    Submitted 17 May, 2023; originally announced May 2023.

  44. arXiv:2305.01484  [pdf, other

    cs.ET

    Powering Disturb-Free Reconfigurable Computing and Tunable Analog Electronics with Dual-Port Ferroelectric FET

    Authors: Zijian Zhao, Shan Deng, Swetaki Chatterjee, Zhouhang Jiang, Muhammad Shaffatul Islam, Yi Xiao, Yixin Xu, Scott Meninger, Mohamed Mohamed, Rajiv Joshi, Yogesh Singh Chauhan, Halid Mulaosmanovic, Stefan Duenkel, Dominik Kleimaier, Sven Beyer, Hussam Amrouch, Vijaykrishnan Narayanan, Kai Ni

    Abstract: Single-port ferroelectric FET (FeFET) that performs write and read operations on the same electrical gate prevents its wide application in tunable analog electronics and suffers from read disturb, especially to the high-threshold voltage (VTH) state as the retention energy barrier is reduced by the applied read bias. To address both issues, we propose to adopt a read disturb-free dual-port FeFET w… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

    Comments: 32 pages

  45. arXiv:2304.11434  [pdf, other

    cs.CL cs.LG

    L3Cube-IndicSBERT: A simple approach for learning cross-lingual sentence representations using multilingual BERT

    Authors: Samruddhi Deode, Janhavi Gadre, Aditi Kajale, Ananya Joshi, Raviraj Joshi

    Abstract: The multilingual Sentence-BERT (SBERT) models map different languages to common representation space and are useful for cross-language similarity and mining tasks. We propose a simple yet effective approach to convert vanilla multilingual BERT models into multilingual sentence BERT models using synthetic corpus. We simply aggregate translated NLI or STS datasets of the low-resource target language… ▽ More

    Submitted 22 April, 2023; originally announced April 2023.

  46. arXiv:2303.14401  [pdf, other

    cs.LG

    Deep Linear Discriminant Analysis with Variation for Polycystic Ovary Syndrome Classification

    Authors: Raunak Joshi, Abhishek Gupta, Himanshu Soni, Ronald Laban

    Abstract: The polycystic ovary syndrome diagnosis is a problem that can be leveraged using prognostication based learning procedures. Many implementations of PCOS can be seen with Machine Learning but the algorithms have certain limitations in utilizing the processing power graphical processing units. The simple machine learning algorithms can be improved with advanced frameworks using Deep Learning. The Li… ▽ More

    Submitted 25 March, 2023; originally announced March 2023.

    Comments: 7 pages, 5 figures. To appear in proceedings of Intelligent Computing and Networking (IC-ICN 2022)

  47. arXiv:2303.07524  [pdf, ps, other

    astro-ph.IM cs.DC

    Integration of storage endpoints into a Rucio data lake, as an activity to prototype a SKA Regional Centres Network

    Authors: Manuel Parra-Royón, Jesús Sánchez-Castañeda, Julián Garrido, Susana Sánchez-Expósito, Rohini Joshi, James Collinson, Rob Barnsley, Jesús Salgado, Lourdes Verdes-Montenegro

    Abstract: The Square Kilometre Array (SKA) infrastructure will consist of two radio telescopes that will be the most sensitive telescopes on Earth. The SKA community will have to process and manage near exascale data, which will be a technical challenge for the coming years. In this respect, the SKA Global Network of Regional Centres plays a key role in data distribution and management. The SRCNet will prov… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

  48. arXiv:2302.04866  [pdf, other

    cs.CV cs.GR

    RelightableHands: Efficient Neural Relighting of Articulated Hand Models

    Authors: Shun Iwase, Shunsuke Saito, Tomas Simon, Stephen Lombardi, Timur Bagautdinov, Rohan Joshi, Fabian Prada, Takaaki Shiratori, Yaser Sheikh, Jason Saragih

    Abstract: We present the first neural relighting approach for rendering high-fidelity personalized hands that can be animated in real-time under novel illumination. Our approach adopts a teacher-student framework, where the teacher learns appearance under a single point light from images captured in a light-stage, allowing us to synthesize hands in arbitrary illuminations but with heavy compute. Using image… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

    Comments: 8 pages, 16 figures, Website: https://sh8.io/#/relightable_hands

  49. arXiv:2212.13170  [pdf, other

    cs.CV

    Weakly-Supervised Semantic Segmentation of Ships Using Thermal Imagery

    Authors: Rushil Joshi, Ethan Adams, Matthew Ziemann, Christopher A. Metzler

    Abstract: The United States coastline spans 95,471 miles; a distance that cannot be effectively patrolled or secured by manual human effort alone. Unmanned Aerial Vehicles (UAVs) equipped with infrared cameras and deep-learning based algorithms represent a more efficient alternative for identifying and segmenting objects of interest - namely, ships. However, standard approaches to training these algorithms… ▽ More

    Submitted 26 December, 2022; originally announced December 2022.

    MSC Class: I.4

  50. arXiv:2212.10039  [pdf, other

    cs.CL

    A Twitter BERT Approach for Offensive Language Detection in Marathi

    Authors: Tanmay Chavan, Shantanu Patankar, Aditya Kane, Omkar Gokhale, Raviraj Joshi

    Abstract: Automated offensive language detection is essential in combating the spread of hate speech, particularly in social media. This paper describes our work on Offensive Language Identification in low resource Indic language Marathi. The problem is formulated as a text classification task to identify a tweet as offensive or non-offensive. We evaluate different mono-lingual and multi-lingual BERT models… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.