Showing 1–50 of 186 results for author: Agrawal, S

  1. arXiv:2407.10582  [pdf, other]

    cs.CL cs.AI

    Boosting Zero-Shot Crosslingual Performance using LLM-Based Augmentations with Effective Data Selection

    Authors: Barah Fazili, Ashish Sunil Agrawal, Preethi Jyothi

    Abstract: Large language models (LLMs) are very proficient text generators. We leverage this capability of LLMs to generate task-specific data via zero-shot prompting and promote cross-lingual transfer for low-resource target languages. Given task-specific data in a source language and a teacher model trained on this data, we propose using this teacher to label LLM generations and employ a set of simple dat…

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: Accepted in Findings of ACL 2024

  2. arXiv:2407.07186  [pdf, other]

    cs.CV cs.RO

    Barely-Visible Surface Crack Detection for Wind Turbine Sustainability

    Authors: Sourav Agrawal, Isaac Corley, Conor Wallace, Clovis Vaughn, Jonathan Lwowski

    Abstract: The production of wind energy is a crucial part of sustainable development and reducing the reliance on fossil fuels. Maintaining the integrity of wind turbines to produce this energy is a costly and time-consuming task requiring repeated inspection and maintenance. While autonomous drones have proven to make this process more efficient, the algorithms for detecting anomalies to prevent catastroph…

    Submitted 9 July, 2024; originally announced July 2024.

  3. arXiv:2406.19482  [pdf, other]

    cs.CL

    xTower: A Multilingual LLM for Explaining and Correcting Translation Errors

    Authors: Marcos Treviso, Nuno M. Guerreiro, Sweta Agrawal, Ricardo Rei, José Pombal, Tania Vaz, Helena Wu, Beatriz Silva, Daan van Stigt, André F. T. Martins

    Abstract: While machine translation (MT) systems are achieving increasingly strong performance on benchmarks, they often produce translations with errors and anomalies. Understanding these errors can potentially help improve the translation quality and user experience. This paper introduces xTower, an open large language model (LLM) built on top of TowerBase designed to provide free-text explanations for tr…

    Submitted 27 June, 2024; originally announced June 2024.

  4. arXiv:2406.11409  [pdf, other]

    cs.CL cs.AI

    CodeGemma: Open Code Models Based on Gemma

    Authors: CodeGemma Team, Heri Zhao, Jeffrey Hui, Joshua Howland, Nam Nguyen, Siqi Zuo, Andrea Hu, Christopher A. Choquette-Choo, Jingyue Shen, Joe Kelley, Kshitij Bansal, Luke Vilnis, Mateo Wirth, Paul Michel, Peter Choy, Pratik Joshi, Ravin Kumar, Sarmad Hashmi, Shubham Agrawal, Zhitao Gong, Jane Fine, Tris Warkentin, Ale Jakse Hartman, Bin Ni, Kathy Korevec, et al. (2 additional authors not shown)

    Abstract: This paper introduces CodeGemma, a collection of specialized open code models built on top of Gemma, capable of a variety of code and natural language generation tasks. We release three model variants. CodeGemma 7B pretrained (PT) and instruction-tuned (IT) variants have remarkably resilient natural language understanding, excel in mathematical reasoning, and match code capabilities of other open…

    Submitted 18 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: v1: 11 pages, 4 figures, 5 tables. v2: Update metadata

  5. arXiv:2406.06608  [pdf, other]

    cs.CL cs.AI

    The Prompt Report: A Systematic Survey of Prompting Techniques

    Authors: Sander Schulhoff, Michael Ilie, Nishant Balepur, Konstantine Kahadze, Amanda Liu, Chenglei Si, Yinheng Li, Aayush Gupta, HyoJung Han, Sevien Schulhoff, Pranav Sandeep Dulepet, Saurav Vidyadhara, Dayeon Ki, Sweta Agrawal, Chau Pham, Gerson Kroiz, Feileen Li, Hudson Tao, Ashay Srivastava, Hevander Da Costa, Saloni Gupta, Megan L. Rogers, Inna Goncearenco, Giuseppe Sarli, Igor Galynker, et al. (6 additional authors not shown)

    Abstract: Generative Artificial Intelligence (GenAI) systems are being increasingly deployed across all parts of industry and research settings. Developers and end users interact with these systems through the use of prompting or prompt engineering. While prompting is a widespread and highly researched concept, there exists conflicting terminology and a poor ontological understanding of what constitutes a p…

    Submitted 14 July, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

  6. arXiv:2406.00049  [pdf, other]

    cs.CL cs.LG

    QUEST: Quality-Aware Metropolis-Hastings Sampling for Machine Translation

    Authors: Gonçalo R. A. Faria, Sweta Agrawal, António Farinhas, Ricardo Rei, José G. C. de Souza, André F. T. Martins

    Abstract: An important challenge in machine translation (MT) is to generate high-quality and diverse translations. Prior work has shown that the estimated likelihood from the MT model correlates poorly with translation quality. In contrast, quality evaluation metrics (such as COMET or BLEURT) exhibit high correlations with human judgments, which has motivated their use as rerankers (such as quality-aware an…

    Submitted 28 May, 2024; originally announced June 2024.

  7. arXiv:2405.20501  [pdf, other]

    cs.RO cs.AI cs.CV cs.HC cs.LG

    ShelfHelp: Empowering Humans to Perform Vision-Independent Manipulation Tasks with a Socially Assistive Robotic Cane

    Authors: Shivendra Agrawal, Suresh Nayak, Ashutosh Naik, Bradley Hayes

    Abstract: The ability to shop independently, especially in grocery stores, is important for maintaining a high quality of life. This can be particularly challenging for people with visual impairments (PVI). Stores carry thousands of products, with approximately 30,000 new products introduced each year in the US market alone, presenting a challenge even for modern computer vision solutions. Through this work…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 8 pages, 14 figures and charts

    Journal ref: In AAMAS (pp. 1514-1523) 2023

  8. arXiv:2405.18348  [pdf, other]

    cs.CL

    Can Automatic Metrics Assess High-Quality Translations?

    Authors: Sweta Agrawal, António Farinhas, Ricardo Rei, André F. T. Martins

    Abstract: Automatic metrics for evaluating translation quality are typically validated by measuring how well they correlate with human assessments. However, correlation methods tend to capture only the ability of metrics to differentiate between good and bad source-translation pairs, overlooking their reliability in distinguishing alternative translations for the same source. In this paper, we confirm that…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: work in progress

  9. arXiv:2405.13692  [pdf, other]

    cs.LG

    Challenging Gradient Boosted Decision Trees with Tabular Transformers for Fraud Detection at Booking.com

    Authors: Sergei Krutikov, Bulat Khaertdinov, Rodion Kiriukhin, Shubham Agrawal, Kees Jan De Vries

    Abstract: Transformer-based neural networks, empowered by Self-Supervised Learning (SSL), have demonstrated unprecedented performance across various domains. However, related literature suggests that tabular Transformers may struggle to outperform classical Machine Learning algorithms, such as Gradient Boosted Decision Trees (GBDT). In this paper, we aim to challenge GBDTs with tabular Transformers on a typ…

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: Submitted to CIKM'24, Applied Research track

  10. arXiv:2405.10949  [pdf, other]

    cs.CV

    Global License Plate Dataset

    Authors: Siddharth Agrawal

    Abstract: In the pursuit of advancing the state-of-the-art (SOTA) in road safety, traffic monitoring, surveillance, and logistics automation, we introduce the Global License Plate Dataset (GLPD). The dataset consists of over 5 million images, including diverse samples captured from 74 countries with meticulous annotations, including license plate characters, license plate segmentation masks, license plate c…

    Submitted 22 March, 2024; originally announced May 2024.

  11. arXiv:2404.17922  [pdf, other]

    cs.CV cs.RO

    Open-Set 3D Semantic Instance Maps for Vision Language Navigation -- O3D-SIM

    Authors: Laksh Nanwani, Kumaraditya Gupta, Aditya Mathur, Swayam Agrawal, A. H. Abdul Hafez, K. Madhava Krishna

    Abstract: Humans excel at forming mental maps of their surroundings, equipping them to understand object relationships and navigate based on language queries. Our previous work SI Maps [1] showed that having instance-level information and the semantic understanding of an environment helps significantly improve performance for language-guided tasks. We extend this instance-level approach to 3D while increasi…

    Submitted 27 April, 2024; originally announced April 2024.

  12. arXiv:2404.12541  [pdf, other]

    cs.CV

    GenVideo: One-shot Target-image and Shape Aware Video Editing using T2I Diffusion Models

    Authors: Sai Sree Harsha, Ambareesh Revanur, Dhwanit Agarwal, Shradha Agrawal

    Abstract: Video editing methods based on diffusion models that rely solely on a text prompt for the edit are hindered by the limited expressive power of text prompts. Thus, incorporating a reference target image as a visual guide becomes desirable for precise control over edit. Also, most existing methods struggle to accurately edit a video when the shape and size of the object in the target image differ fr…

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: CVPRw 2024

  13. arXiv:2404.08716  [pdf, other]

    cs.CR cs.OS

    Securing Monolithic Kernels using Compartmentalization

    Authors: Soo Yee Lim, Sidhartha Agrawal, Xueyuan Han, David Eyers, Dan O'Keeffe, Thomas Pasquier

    Abstract: Monolithic operating systems, where all kernel functionality resides in a single, shared address space, are the foundation of most mainstream computer systems. However, a single flaw, even in a non-essential part of the kernel (e.g., device drivers), can cause the entire operating system to fall under an attacker's control. Kernel hardening techniques might prevent certain types of vulnerabilities…

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: 24 pages, 7 figures

  14. arXiv:2404.06977  [pdf]

    cs.CV

    Accurate Tennis Court Line Detection on Amateur Recorded Matches

    Authors: Sameer Agrawal, Ragoth Sundararajan, Vishak Sagar

    Abstract: Typically, tennis court line detection is done by running Hough-Line-Detection to find straight lines in the image, and then computing a transformation matrix from the detected lines to create the final court structure. We propose numerous improvements and enhancements to this algorithm, including using pretrained State-of-the-Art shadow-removal and object-detection ML models to make our line-dete…

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: Accepted to 5th International conference on Image, Video Processing and Artificial Intelligence

    ACM Class: I.4.6

  15. arXiv:2404.06807  [pdf, other]

    cs.RO

    Sound Matters: Auditory Detectability of Mobile Robots

    Authors: Subham Agrawal, Marlene Wessels, Jorge de Heuvel, Johannes Kraus, Maren Bennewitz

    Abstract: Mobile robots are increasingly being used in noisy environments for social purposes, e.g. to provide support in healthcare or public spaces. Since these robots also operate beyond human sight, the question arises as to how different robot types, ambient noise or cognitive engagement impacts the detection of the robots by their sound. To address this research gap, we conducted a user study measurin…

    Submitted 25 June, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

  16. arXiv:2403.09123  [pdf, other]

    cs.LG cs.IT stat.ML

    Optimal Top-Two Method for Best Arm Identification and Fluid Analysis

    Authors: Agniv Bandyopadhyay, Sandeep Juneja, Shubhada Agrawal

    Abstract: Top-$2$ methods have become popular in solving the best arm identification (BAI) problem. The best arm, or the arm with the largest mean amongst finitely many, is identified through an algorithm that at any sequential step independently pulls the empirical best arm, with a fixed probability $β$, and pulls the best challenger arm otherwise. The probability of incorrect selection is guaranteed to li…

    Submitted 14 March, 2024; originally announced March 2024.

  17. arXiv:2403.08314  [pdf, other]

    cs.CL

    Is Context Helpful for Chat Translation Evaluation?

    Authors: Sweta Agrawal, Amin Farajian, Patrick Fernandes, Ricardo Rei, André F. T. Martins

    Abstract: Despite the recent success of automatic metrics for assessing translation quality, their application in evaluating the quality of machine-translated chats has been limited. Unlike more structured texts like news, chat conversations are often unstructured, short, and heavily reliant on contextual information. This poses questions about the reliability of existing sentence-level metrics in this doma…

    Submitted 13 March, 2024; originally announced March 2024.

  18. arXiv:2403.05530  [pdf, other]

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love, et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February…

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  19. arXiv:2403.01410  [pdf, other]

    cs.RO

    Barrier Functions Inspired Reward Shaping for Reinforcement Learning

    Authors: Nilaksh Nilaksh, Abhishek Ranjan, Shreenabh Agrawal, Aayush Jain, Pushpak Jagtap, Shishir Kolathaya

    Abstract: Reinforcement Learning (RL) has progressed from simple control tasks to complex real-world challenges with large state spaces. While RL excels in these tasks, training time remains a limitation. Reward shaping is a popular solution, but existing methods often rely on value functions, which face scalability issues. This paper presents a novel safety-oriented reward-shaping framework inspired by bar…

    Submitted 1 April, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

    Comments: 7 pages, 10 figures, Accepted as contributed paper at ICRA 2024

    ACM Class: I.2.9

  20. arXiv:2402.17733  [pdf, other]

    cs.CL

    Tower: An Open Multilingual Large Language Model for Translation-Related Tasks

    Authors: Duarte M. Alves, José Pombal, Nuno M. Guerreiro, Pedro H. Martins, João Alves, Amin Farajian, Ben Peters, Ricardo Rei, Patrick Fernandes, Sweta Agrawal, Pierre Colombo, José G. C. de Souza, André F. T. Martins

    Abstract: While general-purpose large language models (LLMs) demonstrate proficiency on multiple tasks within the domain of translation, approaches based on open LLMs are competitive only when specializing on a single task. In this paper, we propose a recipe for tailoring LLMs to multiple tasks present in translation workflows. We perform continued pretraining on a multilingual mixture of monolingual and pa…

    Submitted 27 February, 2024; originally announced February 2024.

  21. arXiv:2402.17447  [pdf, other]

    cs.CL cs.AI cs.IR

    Deep Learning Based Named Entity Recognition Models for Recipes

    Authors: Mansi Goel, Ayush Agarwal, Shubham Agrawal, Janak Kapuriya, Akhil Vamshi Konam, Rishabh Gupta, Shrey Rastogi, Niharika, Ganesh Bagler

    Abstract: Food touches our lives through various endeavors, including flavor, nourishment, health, and sustainability. Recipes are cultural capsules transmitted across generations via unstructured text. Automated protocols for recognizing named entities, the building blocks of recipe text, are of immense value for various applications ranging from information extraction to novel recipe generation. Named ent…

    Submitted 6 June, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: 13 pages, 6 main figures and 2 in appendices, and 3 main tables; Accepted for publication in LREC-COLING 2024

  22. arXiv:2402.12562  [pdf, ps, other]

    cs.LG cs.GT

    Dynamic Pricing and Learning with Long-term Reference Effects

    Authors: Shipra Agrawal, Wei Tang

    Abstract: We consider a dynamic pricing problem where customer response to the current price is impacted by the customer price expectation, aka reference price. We study a simple and novel reference price mechanism where reference price is the average of the past prices offered by the seller. As opposed to the more commonly studied exponential smoothing mechanism, in our reference price mechanism the prices…

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: 48 pages, two figures

  23. arXiv:2402.04744  [pdf, other]

    cs.LG cs.AR

    Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers

    Authors: Abhimanyu Rajeshkumar Bambhaniya, Amir Yazdanbakhsh, Suvinay Subramanian, Sheng-Chun Kao, Shivani Agrawal, Utku Evci, Tushar Krishna

    Abstract: N:M Structured sparsity has garnered significant interest as a result of relatively modest overhead and improved efficiency. Additionally, this form of sparsity holds considerable appeal for reducing the memory footprint owing to their modest representation overhead. There have been efforts to develop training recipes for N:M structured sparsity, but they primarily focus on low-sparsity regions (…

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: 18 pages, 8 figures, 17 tables. Code is available at https://github.com/abhibambhaniya/progressive_gradient_flow_nm_sparsity

  24. arXiv:2402.02080  [pdf, other]

    cs.CL

    Translation Errors Significantly Impact Low-Resource Languages in Cross-Lingual Learning

    Authors: Ashish Sunil Agrawal, Barah Fazili, Preethi Jyothi

    Abstract: Popular benchmarks (e.g., XNLI) used to evaluate cross-lingual language understanding consist of parallel versions of English evaluation sets in multiple target languages created with the help of professional translators. When creating such parallel data, it is critical to ensure high-quality translations for all target languages for an accurate characterization of cross-lingual transfer. In this…

    Submitted 3 February, 2024; originally announced February 2024.

    Comments: Accepted to main proceedings of "The 18th Conference of the European Chapter of the Association for Computational Linguistics"

  25. arXiv:2401.09126  [pdf, other]

    cs.CV cs.GR

    Objects With Lighting: A Real-World Dataset for Evaluating Reconstruction and Rendering for Object Relighting

    Authors: Benjamin Ummenhofer, Sanskar Agrawal, Rene Sepulveda, Yixing Lao, Kai Zhang, Tianhang Cheng, Stephan Richter, Shenlong Wang, German Ros

    Abstract: Reconstructing an object from photos and placing it virtually in a new environment goes beyond the standard novel view synthesis task as the appearance of the object has to not only adapt to the novel viewpoint but also to the new lighting conditions and yet evaluations of inverse rendering methods rely on novel view synthesis data or simplistic synthetic datasets for quantitative analysis. This w…

    Submitted 13 April, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

    Comments: Accepted at 3DV 2024, Oral presentation. For the project page see https://github.com/isl-org/objects-with-lighting

  26. arXiv:2312.11805  [pdf, other]

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee, et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr…

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  27. arXiv:2312.10126  [pdf, other]

    cs.CL

    Do Text Simplification Systems Preserve Meaning? A Human Evaluation via Reading Comprehension

    Authors: Sweta Agrawal, Marine Carpuat

    Abstract: Automatic text simplification (TS) aims to automate the process of rewriting text to make it easier for people to read. A pre-requisite for TS to be useful is that it should convey information that is consistent with the meaning of the original text. However, current TS evaluation protocols assess system outputs for simplicity and meaning preservation without regard for the document context in whi…

    Submitted 28 February, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: Accepted at TACL (a pre-MIT Press publication version)

  28. arXiv:2312.08553  [pdf, other]

    eess.AS cs.SD

    USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models

    Authors: Shaojin Ding, David Qiu, David Rim, Yanzhang He, Oleg Rybakov, Bo Li, Rohit Prabhavalkar, Weiran Wang, Tara N. Sainath, Zhonglin Han, Jian Li, Amir Yazdanbakhsh, Shivani Agrawal

    Abstract: End-to-end automatic speech recognition (ASR) models have seen revolutionary quality gains with the recent development of large-scale universal speech models (USM). However, deploying these massive USMs is extremely expensive due to the enormous memory usage and computational cost. Therefore, model compression is an important research topic to fit USM-based ASR under budget in real-world scenarios…

    Submitted 16 January, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: Accepted by ICASSP 2024. Preprint

  29. arXiv:2311.11795  [pdf, ps, other]

    cs.PL

    Effects and Coeffects in Call-By-Push-Value (Extended Version)

    Authors: Cassia Torczon, Emmanuel Suárez Acevedo, Shubh Agrawal, Joey Velez-Ginorio, Stephanie Weirich

    Abstract: Effect and coeffect tracking are a useful way to integrate many types of compile-time analysis, such as cost, liveness or dataflow, into a language's type system. However, their interactions with call-by-push-value (CBPV), a computational model useful in compilation for its isolation of effects and its ability to encompass both call-by-name and call-by-value computations, are still poorly understo…

    Submitted 20 November, 2023; originally announced November 2023.

  30. arXiv:2311.09828  [pdf, other]

    cs.CL

    AfriMTE and AfriCOMET: Enhancing COMET to Embrace Under-resourced African Languages

    Authors: Jiayi Wang, David Ifeoluwa Adelani, Sweta Agrawal, Marek Masiak, Ricardo Rei, Eleftheria Briakou, Marine Carpuat, Xuanli He, Sofia Bourhim, Andiswa Bukula, Muhidin Mohamed, Temitayo Olatoye, Tosin Adewumi, Hamam Mokayed, Christine Mwase, Wangui Kimotho, Foutse Yuehgoh, Anuoluwapo Aremu, Jessica Ojo, Shamsuddeen Hassan Muhammad, Salomey Osei, Abdul-Hakeem Omotayo, Chiamaka Chukwuneke, Perez Ogayo, Oumaima Hourrane, et al. (33 additional authors not shown)

    Abstract: Despite the recent progress on scaling multilingual machine translation (MT) to several under-resourced African languages, accurately measuring this progress remains challenging, since evaluation is often performed on n-gram matching metrics such as BLEU, which typically show a weaker correlation with human judgments. Learned metrics such as COMET have higher correlation; however, the lack of eval…

    Submitted 23 April, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: Accepted by NAACL 2024

  31. arXiv:2310.20274  [pdf, other]

    cs.IR cs.CL cs.LG

    Extracting Entities of Interest from Comparative Product Reviews

    Authors: Jatin Arora, Sumit Agrawal, Pawan Goyal, Sayan Pathak

    Abstract: This paper presents a deep learning based approach to extract product comparison information out of user reviews on various e-commerce websites. Any comparative product review has three major entities of information: the names of the products being compared, the user opinion (predicate) and the feature or aspect under comparison. All these informing entities are dependent on each other and bound b…

    Submitted 31 October, 2023; originally announced October 2023.

    Comments: Source Code: https://github.com/jatinarora2702/Review-Information-Extraction

    ACM Class: I.2.7; H.3.3

    Journal ref: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, Pages 1975 - 1978

  32. arXiv:2310.19961  [pdf, other]

    cs.LG cs.AI

    ExPT: Synthetic Pretraining for Few-Shot Experimental Design

    Authors: Tung Nguyen, Sudhanshu Agrawal, Aditya Grover

    Abstract: Experimental design is a fundamental problem in many science and engineering fields. In this problem, sample efficiency is crucial due to the time, money, and safety costs of real-world design evaluations. Existing approaches either rely on active data collection or access to large, labeled datasets of past experiments, making them impractical in many real-world scenarios. In this work, we address…

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: 2023 Conference on Neural Information Processing Systems (NeurIPS)

  33. arXiv:2310.16924  [pdf, other]

    cs.CL cs.HC

    Physician Detection of Clinical Harm in Machine Translation: Quality Estimation Aids in Reliance and Backtranslation Identifies Critical Errors

    Authors: Nikita Mehandru, Sweta Agrawal, Yimin Xiao, Elaine C Khoong, Ge Gao, Marine Carpuat, Niloufar Salehi

    Abstract: A major challenge in the practical use of Machine Translation (MT) is that users lack guidance to make informed decisions about when to rely on outputs. Progress in quality estimation research provides techniques to automatically assess MT quality, but these techniques have primarily been evaluated in vitro by comparison against human judgments outside of a specific context of use. This paper eval…

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023

  34. arXiv:2310.15773  [pdf, other]

    cs.CL

    BLESS: Benchmarking Large Language Models on Sentence Simplification

    Authors: Tannon Kew, Alison Chi, Laura Vásquez-Rodríguez, Sweta Agrawal, Dennis Aumiller, Fernando Alva-Manchego, Matthew Shardlow

    Abstract: We present BLESS, a comprehensive performance benchmark of the most recent state-of-the-art large language models (LLMs) on the task of text simplification (TS). We examine how well off-the-shelf LLMs can solve this challenging task, assessing a total of 44 models, differing in size, architecture, pre-training methods, and accessibility, on three test sets from different domains (Wikipedia, news,…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: This paper has been accepted to EMNLP 2023 as a main long paper. 9 pages, 7 figures

  35. arXiv:2309.16916  [pdf, other]

    cs.LG cs.AI cs.CV

    ONNXExplainer: an ONNX Based Generic Framework to Explain Neural Networks Using Shapley Values

    Authors: Yong Zhao, Runxin He, Nicholas Kersting, Can Liu, Shubham Agrawal, Chiranjeet Chetia, Yu Gu

    Abstract: Understanding why a neural network model makes certain decisions can be as important as the inference performance. Various methods have been proposed to help practitioners explain the prediction of a neural network model, of which Shapley values are most popular. SHAP package is a leading implementation of Shapley values to explain neural networks implemented in TensorFlow or PyTorch but lacks cro…

    Submitted 3 October, 2023; v1 submitted 28 September, 2023; originally announced September 2023.

    Comments: 11 pages, 11 figures

  36. arXiv:2309.16563  [pdf, other]

    stat.ML cs.LG

    CRIMED: Lower and Upper Bounds on Regret for Bandits with Unbounded Stochastic Corruption

    Authors: Shubhada Agrawal, Timothée Mathieu, Debabrota Basu, Odalric-Ambrym Maillard

    Abstract: We investigate the regret-minimisation problem in a multi-armed bandit setting with arbitrary corruptions. Similar to the classical setup, the agent receives rewards generated independently from the distribution of the arm chosen at each time. However, these rewards are not directly observed. Instead, with a fixed $\varepsilon\in (0,\frac{1}{2})$, the agent observes a sample from the chosen arm's…

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: 50 pages; 4 figures

  37. arXiv:2309.15616  [pdf, other]

    cs.RO cs.AI

    Perception for Humanoid Robots

    Authors: Arindam Roychoudhury, Shahram Khorshidi, Subham Agrawal, Maren Bennewitz

    Abstract: Purpose of Review: In the field of humanoid robotics, perception plays a fundamental role in enabling robots to interact seamlessly with humans and their surroundings, leading to improved safety, efficiency, and user experience. This scientific study investigates various perception modalities and techniques employed in humanoid robots, including visual, auditory, and tactile sensing by exploring rece…

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: 20 pages, 4 figures. To be published in Current Robotics Reports (Springer Nature)

  38. arXiv:2309.09291  [pdf, other]

    cs.CR cs.OS

    OSmosis: No more Déjà vu in OS isolation

    Authors: Sidhartha Agrawal, Reto Achermann, Margo Seltzer

    Abstract: Operating systems provide an abstraction layer between the hardware and higher-level software. Many abstractions, such as threads, processes, containers, and virtual machines, are mechanisms to provide isolation. New application scenarios frequently introduce new isolation mechanisms. Implementing each isolation mechanism as an independent abstraction makes it difficult to reason about the state a…

    Submitted 17 September, 2023; originally announced September 2023.

    Comments: 6 pages, 1 figure

    ACM Class: D.4.6; D.4.7

  39. Beyond Labels: Leveraging Deep Learning and LLMs for Content Metadata

    Authors: Saurabh Agrawal, John Trenkle, Jaya Kawale

    Abstract: Content metadata plays a very important role in movie recommender systems as it provides valuable information about various aspects of a movie such as genre, cast, plot synopsis, box office summary, etc. Analyzing the metadata can help understand the user preferences to generate personalized recommendations and item cold starting. In this talk, we will focus on one particular type of metadata - \t…

    Submitted 15 September, 2023; originally announced September 2023.

  40. arXiv:2308.15560  [pdf, other]

    physics.ao-ph cs.AI

    WeatherBench 2: A benchmark for the next generation of data-driven global weather models

    Authors: Stephan Rasp, Stephan Hoyer, Alexander Merose, Ian Langmore, Peter Battaglia, Tyler Russel, Alvaro Sanchez-Gonzalez, Vivian Yang, Rob Carver, Shreya Agrawal, Matthew Chantry, Zied Ben Bouallegue, Peter Dueben, Carla Bromberg, Jared Sisk, Luke Barrington, Aaron Bell, Fei Sha

    Abstract: WeatherBench 2 is an update to the global, medium-range (1-14 day) weather forecasting benchmark proposed by Rasp et al. (2020), designed with the aim to accelerate progress in data-driven weather modeling. WeatherBench 2 consists of an open-source evaluation framework, publicly available training, ground truth and baseline data as well as a continuously updated website with the latest metrics and…

    Submitted 26 January, 2024; v1 submitted 29 August, 2023; originally announced August 2023.

  41. arXiv:2307.11932  [pdf, other]

    cs.CV

    RIC: Rotate-Inpaint-Complete for Generalizable Scene Reconstruction

    Authors: Isaac Kasahara, Shubham Agrawal, Selim Engin, Nikhil Chavan-Dafle, Shuran Song, Volkan Isler

    Abstract: General scene reconstruction refers to the task of estimating the full 3D geometry and texture of a scene containing previously unseen objects. In many practical applications such as AR/VR, autonomous navigation, and robotics, only a single view of the scene may be available, making the scene reconstruction task challenging. In this paper, we present a method for scene reconstruction by structural…

    Submitted 4 October, 2023; v1 submitted 21 July, 2023; originally announced July 2023.

  42. arXiv:2307.05440  [pdf, other]

    cs.CL cs.AI cs.LG

    ISLTranslate: Dataset for Translating Indian Sign Language

    Authors: Abhinav Joshi, Susmit Agrawal, Ashutosh Modi

    Abstract: Sign languages are the primary means of communication for many hard-of-hearing people worldwide. Recently, to bridge the communication gap between the hard-of-hearing community and the rest of the population, several sign language translation datasets have been proposed to enable the development of statistical sign language translation systems. However, there is a dearth of sign language resources…

    Submitted 11 July, 2023; originally announced July 2023.

    Comments: Accepted at ACL 2023 Findings, 8 Pages

  43. arXiv:2306.09048  [pdf, other]

    cs.LG stat.ML

    Optimal Best-Arm Identification in Bandits with Access to Offline Data

    Authors: Shubhada Agrawal, Sandeep Juneja, Karthikeyan Shanmugam, Arun Sai Suggala

    Abstract: Learning paradigms based purely on offline data as well as those based solely on sequential online learning have been well-studied in the literature. In this paper, we consider combining offline data with online learning, an area less studied but of obvious practical importance. We consider the stochastic $K$-armed bandit problem, where our goal is to identify the arm with the highest mean in the…

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: 45 pages, 5 figures

  44. arXiv:2306.06079  [pdf, other]

    physics.ao-ph cs.LG

    Deep Learning for Day Forecasts from Sparse Observations

    Authors: Marcin Andrychowicz, Lasse Espeholt, Di Li, Samier Merchant, Alexander Merose, Fred Zyda, Shreya Agrawal, Nal Kalchbrenner

    Abstract: Deep neural networks offer an alternative paradigm for modeling weather conditions. The ability of neural models to make a prediction in less than a second once the data is available and to do so with very high temporal and spatial resolution, and the ability to learn directly from atmospheric observations, are just some of these models' unique advantages. Neural models trained using atmospheric o…

    Submitted 6 July, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

  45. arXiv:2305.14993  [pdf, other]

    cs.CL

    Controlling Pre-trained Language Models for Grade-Specific Text Simplification

    Authors: Sweta Agrawal, Marine Carpuat

    Abstract: Text simplification (TS) systems rewrite text to make it more readable while preserving its content. However, what makes a text easy to read depends on the intended readers. Recent work has shown that pre-trained language models can simplify text using a wealth of techniques to control output simplicity, ranging from specifying only the desired reading grade level, to directly specifying low-level…

    Submitted 30 November, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023

  46. arXiv:2305.09510  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Real-time Simultaneous Multi-Object 3D Shape Reconstruction, 6DoF Pose Estimation and Dense Grasp Prediction

    Authors: Shubham Agrawal, Nikhil Chavan-Dafle, Isaac Kasahara, Selim Engin, Jinwook Huh, Volkan Isler

    Abstract: Robotic manipulation systems operating in complex environments rely on perception systems that provide information about the geometry (pose and 3D shape) of the objects in the scene along with other semantic information such as object labels. This information is then used for choosing the feasible grasps on relevant objects. In this paper, we present a novel method to provide this geometric and se…

    Submitted 16 May, 2023; originally announced May 2023.

    ACM Class: I.4.5; I.4.8; I.4.10; I.2.9; I.2.10; I.6.3

  47. arXiv:2305.08358  [pdf, other]

    cs.CR cs.DC cs.LG

    Quadratic Functional Encryption for Secure Training in Vertical Federated Learning

    Authors: Shuangyi Chen, Anuja Modi, Shweta Agrawal, Ashish Khisti

    Abstract: Vertical federated learning (VFL) enables the collaborative training of machine learning (ML) models in settings where the data is distributed amongst multiple parties who wish to protect the privacy of their individual data. Notably, in VFL, the labels are available to a single party and the complete feature set is formed only when data from all parties is combined. Recently, Xu et al. proposed a…

    Submitted 19 June, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: Accepted to ISIT 2023

  48. arXiv:2304.14385  [pdf, ps, other]

    cs.GT cs.LG

    Dynamic Pricing and Learning with Bayesian Persuasion

    Authors: Shipra Agrawal, Yiding Feng, Wei Tang

    Abstract: We consider a novel dynamic pricing and learning setting where in addition to setting prices of products in sequential rounds, the seller also ex-ante commits to 'advertising schemes'. That is, in the beginning of each round the seller can decide what kind of signal they will provide to the buyer about the product's quality upon realization. Using the popular Bayesian persuasion framework to model…

    Submitted 10 December, 2023; v1 submitted 27 April, 2023; originally announced April 2023.

    Comments: Conference version appeared in NeurIPS'23

  49. arXiv:2304.14082  [pdf, other]

    cs.LG cs.SE

    JaxPruner: A concise library for sparsity research

    Authors: Joo Hyung Lee, Wonpyo Park, Nicole Mitchell, Jonathan Pilault, Johan Obando-Ceron, Han-Byul Kim, Namhoon Lee, Elias Frantar, Yun Long, Amir Yazdanbakhsh, Shivani Agrawal, Suvinay Subramanian, Xin Wang, Sheng-Chun Kao, Xingyao Zhang, Trevor Gale, Aart Bik, Woohyun Han, Milen Ferev, Zhonglin Han, Hong-Seok Kim, Yann Dauphin, Gintare Karolina Dziugaite, Pablo Samuel Castro, Utku Evci

    Abstract: This paper introduces JaxPruner, an open-source JAX-based pruning and sparse training library for machine learning research. JaxPruner aims to accelerate research on sparse neural networks by providing concise implementations of popular pruning and sparse training algorithms with minimal memory and latency overhead. Algorithms implemented in JaxPruner use a common API and work seamlessly with the…

    Submitted 18 December, 2023; v1 submitted 27 April, 2023; originally announced April 2023.

    Comments: JaxPruner is hosted at http://github.com/google-research/jaxpruner

  50. arXiv:2304.01787  [pdf, other]

    cs.CC

    $k$-SUM in the Sparse Regime

    Authors: Shweta Agrawal, Sagnik Saha, Nikolaj I. Schwartzbach, Akhil Vanukuri, Prashant Nalini Vasudevan

    Abstract: In the average-case $k$-SUM problem, given $r$ integers chosen uniformly at random from $\{0,\dots,M-1\}$, the objective is to find a ``solution'' set of $k$ numbers that sum to $0$ modulo $M$. In the dense regime of $M \leq r^k$, where solutions exist with high probability, the complexity of these problems is well understood. Much less is known in the sparse regime of $M\gg r^k$, where solutions…

    Submitted 21 November, 2023; v1 submitted 4 April, 2023; originally announced April 2023.