
Showing 1–50 of 77 results for author: Iyer, S

  1. arXiv:2407.04708  [pdf, other]

    cs.CV cs.LG

    QMViT: A Mushroom is worth 16x16 Words

    Authors: Siddhant Dutta, Hemant Singh, Kalpita Shankhdhar, Sridhar Iyer

    Abstract: Consuming poisonous mushrooms can have severe health consequences, even resulting in fatality, and accurately distinguishing edible from toxic mushroom varieties remains a significant challenge in ensuring food safety. So, it's crucial to distinguish between edible and poisonous mushrooms within the existing species. This is essential due to the significant demand for mushrooms in people's daily me…

    Submitted 10 May, 2024; originally announced July 2024.

  2. arXiv:2407.01802  [pdf, ps, other]

    cs.CC

    An XOR Lemma for Deterministic Communication Complexity

    Authors: Siddharth Iyer, Anup Rao

    Abstract: We prove a lower bound on the communication complexity of computing the $n$-fold xor of an arbitrary function $f$, in terms of the communication complexity and rank of $f$. We prove that $D(f^{\oplus n}) \geq n \cdot \Big(\frac{\Omega(D(f))}{\log \mathsf{rk}(f)} -\log \mathsf{rk}(f)\Big )$, where $D(f), D(f^{\oplus n})$ represent the deterministic communication complexity, and $\mathsf{rk}(f)$ is…

    Submitted 1 July, 2024; originally announced July 2024.

  3. arXiv:2403.15076  [pdf]

    q-bio.QM cs.AI q-bio.BM q-bio.SC

    Comprehensive Lipidomic Automation Workflow using Large Language Models

    Authors: Connor Beveridge, Sanjay Iyer, Caitlin E. Randolph, Matthew Muhoberac, Palak Manchanda, Amy C. Clingenpeel, Shane Tichy, Gaurav Chopra

    Abstract: Lipidomics generates large data that makes manual annotation and interpretation challenging. Lipid chemical and structural diversity with structural isomers further complicates annotation. Although several commercial and open-source software packages for targeted lipid identification exist, they lack automated method generation workflows and integration with statistical and bioinformatics tools. We have d…

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 53 pages, 4 main figures, 23 Supporting figures, 10 Supporting Tables

  4. arXiv:2403.06734  [pdf, other]

    cs.AI cs.CL cs.CV

    Real-Time Multimodal Cognitive Assistant for Emergency Medical Services

    Authors: Keshara Weerasinghe, Saahith Janapati, Xueren Ge, Sion Kim, Sneha Iyer, John A. Stankovic, Homa Alemzadeh

    Abstract: Emergency Medical Services (EMS) responders often operate under time-sensitive conditions, facing cognitive overload and inherent risks, requiring essential skills in critical thinking and rapid decision-making. This paper presents CognitiveEMS, an end-to-end wearable cognitive assistant system that can act as a collaborative virtual partner engaging in the real-time acquisition and analysis of mu…

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  5. arXiv:2402.12847  [pdf, other]

    cs.CL cs.AI cs.LG

    Instruction-tuned Language Models are Better Knowledge Learners

    Authors: Zhengbao Jiang, Zhiqing Sun, Weijia Shi, Pedro Rodriguez, Chunting Zhou, Graham Neubig, Xi Victoria Lin, Wen-tau Yih, Srinivasan Iyer

    Abstract: In order for large language model (LLM)-based assistants to effectively adapt to evolving information needs, it must be possible to update their factual knowledge through continued training on new data. The standard recipe for doing so involves continued pre-training on new documents followed by instruction-tuning on question-answer (QA) pairs. However, we find that LLMs trained with this recipe s…

    Submitted 25 May, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: ACL 2024. The reproduced data for this paper is available at https://github.com/Edward-Sun/PIT

  6. arXiv:2312.10048  [pdf]

    cs.CL

    Knowledge Graph Enhanced Aspect-Level Sentiment Analysis

    Authors: Kavita Sharma, Ritu Patel, Sunita Iyer

    Abstract: In this paper, we propose a novel method to enhance sentiment analysis by addressing the challenge of context-specific word meanings. It combines the advantages of a BERT model with a knowledge graph based synonym data. This synergy leverages a dynamic attention mechanism to develop a knowledge-driven state vector. For classifying sentiments linked to specific aspects, the approach constructs a me…

    Submitted 26 January, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

  7. arXiv:2312.06129  [pdf, other]

    cs.RO

    Household navigation and manipulation for everyday object rearrangement tasks

    Authors: Shrutheesh R. Iyer, Anwesan Pal, Jiaming Hu, Akanimoh Adeleye, Aditya Aggarwal, Henrik I. Christensen

    Abstract: We consider the problem of building an assistive robotic system that can help humans in daily household cleanup tasks. Creating such an autonomous system in real-world environments is inherently quite challenging, as a general solution may not suit the preferences of a particular customer. Moreover, such a system consists of multi-objective tasks comprising -- (i) Detection of misplaced objects an…

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: Paper accepted at IEEE IRC-2023

  8. arXiv:2312.03076  [pdf, ps, other]

    cs.CC

    XOR Lemmas for Communication via Marginal Information

    Authors: Siddharth Iyer, Anup Rao

    Abstract: We define the $\textit{marginal information}$ of a communication protocol, and use it to prove XOR lemmas for communication complexity. We show that if every $C$-bit protocol has bounded advantage for computing a Boolean function $f$, then every $\tilde{\Omega}(C \sqrt{n})$-bit protocol has advantage $\exp(-\Omega(n))$ for computing the $n$-fold xor $f^{\oplus n}$. We prove exponentially small bounds in the…

    Submitted 2 July, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: Fixed typos

  9. arXiv:2311.10812  [pdf, other]

    cs.CV cs.GR cs.LG

    SplatArmor: Articulated Gaussian splatting for animatable humans from monocular RGB videos

    Authors: Rohit Jena, Ganesh Subramanian Iyer, Siddharth Choudhary, Brandon Smith, Pratik Chaudhari, James Gee

    Abstract: We propose SplatArmor, a novel approach for recovering detailed and animatable human models by 'armoring' a parameterized body model with 3D Gaussians. Our approach represents the human as a set of 3D Gaussians within a canonical space, whose articulation is defined by extending the skinning of the underlying SMPL geometry to arbitrary locations in the canonical space. To account for pose-dependen…

    Submitted 17 November, 2023; originally announced November 2023.

  10. arXiv:2310.08494  [pdf, other]

    cs.RO

    An Experience-based TAMP Framework for Foliated Manifolds

    Authors: Jiaming Hu, Shrutheesh R. Iyer, Henrik I. Christensen

    Abstract: Due to their complexity, foliated structure problems often pose intricate challenges to task and motion planning in robotics manipulation. To counter this, our study presents the "Foliated Repetition Roadmap." This roadmap assists task and motion planners by transforming the complex foliated structure problem into a more accessible graph format. By leveraging query experiences from different fol…

    Submitted 12 October, 2023; originally announced October 2023.

  11. arXiv:2309.13872  [pdf, other]

    eess.IV cs.CV cs.LG

    Attention and Pooling based Sigmoid Colon Segmentation in 3D CT images

    Authors: Md Akizur Rahman, Sonit Singh, Kuruparan Shanmugalingam, Sankaran Iyer, Alan Blair, Praveen Ravindran, Arcot Sowmya

    Abstract: Segmentation of the sigmoid colon is a crucial aspect of treating diverticulitis. It enables accurate identification and localisation of inflammation, which in turn helps healthcare professionals make informed decisions about the most appropriate treatment options. This research presents a novel deep learning architecture for segmenting the sigmoid colon from Computed Tomography (CT) images using…

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: 8 Pages, 6 figures, Accepted at IEEE DICTA 2023

  12. arXiv:2306.02444  [pdf, other]

    cs.NI eess.SP

    Energy-Sustainable IoT Connectivity: Vision, Technological Enablers, Challenges, and Future Directions

    Authors: Onel A. López, Osmel M. Rosabal, David Ruiz-Guirola, Prasoon Raghuwanshi, Konstantin Mikhaylov, Lauri Lovén, Sridhar Iyer

    Abstract: Technology solutions must effectively balance economic growth, social equity, and environmental integrity to achieve a sustainable society. Notably, although the Internet of Things (IoT) paradigm constitutes a key sustainability enabler, critical issues such as the increasing maintenance operations, energy consumption, and manufacturing/disposal of IoT devices have long-term negative economic, soc…

    Submitted 27 October, 2023; v1 submitted 4 June, 2023; originally announced June 2023.

    Comments: 25 figures, 12 tables, submitted to IEEE Open Journal of the Communications Society

    MSC Class: 94-02; 68-02

  13. arXiv:2306.01999  [pdf, other]

    cs.LG cs.AI

    GAT-GAN : A Graph-Attention-based Time-Series Generative Adversarial Network

    Authors: Srikrishna Iyer, Teng Teck Hou

    Abstract: Generative Adversarial Networks (GANs) have proven to be a powerful tool for generating realistic synthetic data. However, traditional GANs often struggle to capture complex relationships between features which results in generation of unrealistic multivariate time-series data. In this paper, we propose a Graph-Attention-based Generative Adversarial Network (GAT-GAN) that explicitly includes two g…

    Submitted 3 June, 2023; originally announced June 2023.

    Comments: 9 pages, 1 figure, 3 tables, preprint under review

  14. arXiv:2305.11206  [pdf, other]

    cs.CL cs.AI cs.LG

    LIMA: Less Is More for Alignment

    Authors: Chunting Zhou, Pengfei Liu, Puxin Xu, Srini Iyer, Jiao Sun, Yuning Mao, Xuezhe Ma, Avia Efrat, Ping Yu, Lili Yu, Susan Zhang, Gargi Ghosh, Mike Lewis, Luke Zettlemoyer, Omer Levy

    Abstract: Large language models are trained in two stages: (1) unsupervised pretraining from raw text, to learn general-purpose representations, and (2) large scale instruction tuning and reinforcement learning, to better align to end tasks and user preferences. We measure the relative importance of these two stages by training LIMA, a 65B parameter LLaMa language model fine-tuned with the standard supervis…

    Submitted 18 May, 2023; originally announced May 2023.

  15. TinyML: Tools, Applications, Challenges, and Future Research Directions

    Authors: Rakhee Kallimani, Krishna Pai, Prasoon Raghuwanshi, Sridhar Iyer, Onel L. A. López

    Abstract: In recent years, Artificial Intelligence (AI) and Machine learning (ML) have gained significant interest from both industry and academia. Notably, conventional ML techniques require enormous amounts of power to meet the desired accuracy, which has limited their use mainly to high-capability devices such as network nodes. However, with many advancements in technologies such as the Internet of Thin…

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: 12 pages, 3 tables, 4 figures

    Journal ref: Multimedia Tools and Applications, 2023

  16. arXiv:2302.08468  [pdf, other]

    cs.LG cs.CL cs.PL cs.SE

    LEVER: Learning to Verify Language-to-Code Generation with Execution

    Authors: Ansong Ni, Srini Iyer, Dragomir Radev, Ves Stoyanov, Wen-tau Yih, Sida I. Wang, Xi Victoria Lin

    Abstract: The advent of large language models trained on code (code LLMs) has led to significant progress in language-to-code generation. State-of-the-art approaches in this area combine LLM decoding with sample pruning and reranking using test cases or heuristics based on the execution results. However, it is challenging to obtain test cases for many real-world language-to-code applications, and heuristics…

    Submitted 1 September, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

    Comments: ICML'23; code available at https://github.com/niansong1996/lever

  17. arXiv:2212.12017  [pdf, other]

    cs.CL

    OPT-IML: Scaling Language Model Instruction Meta Learning through the Lens of Generalization

    Authors: Srinivasan Iyer, Xi Victoria Lin, Ramakanth Pasunuru, Todor Mihaylov, Daniel Simig, Ping Yu, Kurt Shuster, Tianlu Wang, Qing Liu, Punit Singh Koura, Xian Li, Brian O'Horo, Gabriel Pereyra, Jeff Wang, Christopher Dewan, Asli Celikyilmaz, Luke Zettlemoyer, Ves Stoyanov

    Abstract: Recent work has shown that fine-tuning large pre-trained language models on a collection of tasks described via instructions, a.k.a. instruction-tuning, improves their zero and few-shot generalization to unseen tasks. However, there is a limited understanding of the performance trade-offs of different decisions made during the instruction-tuning process. These decisions include the scale and diver…

    Submitted 30 January, 2023; v1 submitted 22 December, 2022; originally announced December 2022.

    Comments: 56 pages. v2->v3: fix OPT-30B evaluation results across benchmarks (previously we reported lower performance of this model due to an evaluation pipeline bug)

  18. arXiv:2212.04037  [pdf, other]

    cs.CL

    Demystifying Prompts in Language Models via Perplexity Estimation

    Authors: Hila Gonen, Srini Iyer, Terra Blevins, Noah A. Smith, Luke Zettlemoyer

    Abstract: Language models can be prompted to perform a wide variety of zero- and few-shot learning problems. However, performance varies significantly with the choice of prompt, and we do not yet understand why this happens or how to pick the best prompts. In this work, we analyze the factors that contribute to this variance and establish a new empirical hypothesis: the performance of a prompt is coupled wi…

    Submitted 7 December, 2022; originally announced December 2022.

  19. arXiv:2211.13892  [pdf, other]

    cs.CL

    Complementary Explanations for Effective In-Context Learning

    Authors: Xi Ye, Srinivasan Iyer, Asli Celikyilmaz, Ves Stoyanov, Greg Durrett, Ramakanth Pasunuru

    Abstract: Large language models (LLMs) have exhibited remarkable capabilities in learning from explanations in prompts, but there has been limited understanding of exactly how these explanations function or why they are effective. This work aims to better understand the mechanisms by which explanations are used for in-context learning. We first study the impact of two different factors on the performance of…

    Submitted 12 June, 2023; v1 submitted 24 November, 2022; originally announced November 2022.

    Comments: ACL Findings 2023 Camera-Ready

  20. arXiv:2211.08956  [pdf]

    cs.NI

    A Comprehensive Survey on Spectrum Sharing Techniques for 5G/B5G Intelligent Wireless Networks: Opportunities, Challenges and Future Research Directions

    Authors: Anita Patil, Sridhar Iyer, Onel L. A. Lopez, Rahul J Pandya, Krishna Pai, Anshuman Kalla, Rakhee Kallimani

    Abstract: The increasing popularity of Internet of Everything and small-cell devices has enormously accelerated traffic loads. Consequently, increased bandwidth and high data rate requirements stimulate the operation at the millimeter wave and the Tera-Hertz spectrum bands in the fifth generation (5G) and beyond 5G (B5G) wireless networks. Furthermore, efficient spectrum allocation, maximizing the spectrum…

    Submitted 17 November, 2022; v1 submitted 16 November, 2022; originally announced November 2022.

  21. arXiv:2211.05288  [pdf, other]

    cs.SI

    The Friendship Paradox and Social Network Participation

    Authors: Ahmed Medhat, Shankar Iyer

    Abstract: The friendship paradox implies that a person will, on average, have fewer friends than their friends do. Prior work has shown how the friendship paradox can lead to perception biases regarding behaviors that correlate with the number of friends: for example, people tend to perceive their friends as being more socially engaged than they are. Here, we investigate the consequences of this type of soc…

    Submitted 9 November, 2022; originally announced November 2022.

    Comments: 10 pages, 15 figures

  22. arXiv:2208.13985  [pdf]

    cs.NI cs.PF

    ZEUS: An Experimental Toolkit for Evaluating Congestion Control Algorithms in 5G Environments

    Authors: Rohail Asim, Muhammad Khan, Luis Diez, Shiva Iyer, Ramon Aguero, Lakshmi Subramanian, Yasir Zaki

    Abstract: As global cellular networks converge to 5G, one question lingers: Are we ready for the 5G challenge? A growing concern surrounds how well existing congestion control algorithms perform in diverse 5G networks. Given that 5G networks are not yet widely deployed, assessing the performance of existing congestion control algorithms in realistic 5G settings presents several challenges. Moreover, exis…

    Submitted 30 August, 2022; originally announced August 2022.

    Comments: 21 pages, 18 figures

  23. arXiv:2207.13312  [pdf, ps, other]

    cs.CC math.CO

    Searching for Regularity in Bounded Functions

    Authors: Siddharth Iyer, Michael Whitmeyer

    Abstract: Given a function $f$ on $\mathbb{F}_2^n$, we study the following problem. What is the largest affine subspace $\mathcal{U}$ such that when restricted to $\mathcal{U}$, all the non-trivial Fourier coefficients of $f$ are very small? For the natural class of bounded Fourier degree $d$ functions $f:\mathbb{F}_2^n \to [-1,1]$, we show that there exists an affine subspace of dimension at least…

    Submitted 3 May, 2023; v1 submitted 27 July, 2022; originally announced July 2022.

    Comments: 27 pages

  24. Survey on Wireless Information Energy Transfer (WIET) and Related Applications in 6G Internet of NanoThings (IoNT)

    Authors: Pragati Sharma, Rahul Jashvantbhai Pandya, Sridhar Iyer, Anubhav Sharma

    Abstract: This article contains an overview of WIET and the related applications in 6G IoNT. Specifically, to explore the following, we: (i) introduce the 6G network along with the implementation challenges, possible techniques, THz communication and related research challenges, (ii) focus on the WIET architecture, and different energy carrying code words for efficient charging through WIET, (iii) discuss I…

    Submitted 1 July, 2022; originally announced July 2022.

    Journal ref: Proceedings of the Indian National Science Academy 2023

  25. arXiv:2206.04615  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur…

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  26. arXiv:2206.00807  [pdf]

    cs.LG

    Applied Federated Learning: Architectural Design for Robust and Efficient Learning in Privacy Aware Settings

    Authors: Branislav Stojkovic, Jonathan Woodbridge, Zhihan Fang, Jerry Cai, Andrey Petrov, Sathya Iyer, Daoyu Huang, Patrick Yau, Arvind Sastha Kumar, Hitesh Jawa, Anamita Guha

    Abstract: The classical machine learning paradigm requires the aggregation of user data in a central location where machine learning practitioners can preprocess data, calculate features, tune models and evaluate performance. The advantage of this approach includes leveraging high performance hardware (such as GPUs) and the ability of machine learning practitioners to do in depth data analysis to improve mo…

    Submitted 7 June, 2022; v1 submitted 1 June, 2022; originally announced June 2022.

  27. arXiv:2205.12495  [pdf, other]

    cs.CL

    ToKen: Task Decomposition and Knowledge Infusion for Few-Shot Hate Speech Detection

    Authors: Badr AlKhamissi, Faisal Ladhak, Srini Iyer, Ves Stoyanov, Zornitsa Kozareva, Xian Li, Pascale Fung, Lambert Mathias, Asli Celikyilmaz, Mona Diab

    Abstract: Hate speech detection is complex; it relies on commonsense reasoning, knowledge of stereotypes, and an understanding of social nuance that differs from one culture to the next. It is also difficult to collect a large-scale hate speech annotated dataset. In this work, we frame this problem as a few-shot learning task, and show significant gains with decomposing the task into its "constituent" parts…

    Submitted 20 May, 2023; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: Accepted at EMNLP 2022

    Journal ref: In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pages 2109-2120, Abu Dhabi, United Arab Emirates. Association for Computational Linguistics

  28. arXiv:2205.01703  [pdf, other]

    cs.CL

    Improving In-Context Few-Shot Learning via Self-Supervised Training

    Authors: Mingda Chen, Jingfei Du, Ramakanth Pasunuru, Todor Mihaylov, Srini Iyer, Veselin Stoyanov, Zornitsa Kozareva

    Abstract: Self-supervised pretraining has made few-shot learning possible for many NLP tasks. But the pretraining objectives are not typically adapted specifically for in-context few-shot learning. In this paper, we propose to use self-supervision in an intermediate training stage between pretraining and downstream few-shot usage with the goal to teach the model to perform in-context few shot learning. We p…

    Submitted 6 June, 2022; v1 submitted 3 May, 2022; originally announced May 2022.

    Comments: NAACL 2022

  29. A Survey on Brain-Computer Interface and Related Applications

    Authors: Krishna Pai, Rakhee Kallimani, Sridhar Iyer, B. Uma Maheswari, Rajashri Khanai, Dattaprasad Torse

    Abstract: BCI systems are able to communicate directly between the brain and computer using neural activity measurements without the involvement of muscle movements. For BCI systems to be widely used by people with severe disabilities, long-term studies of their real-world use are needed, along with effective and feasible dissemination models. In addition, the robustness of the BCI systems' performance shou…

    Submitted 17 March, 2022; originally announced March 2022.

    Journal ref: Machine Intelligence for Internet of Medical Things: Applications and Future Trends, Computational Intelligence for Data Analysis (2023) 2:210-228 (19)

  30. arXiv:2203.08429  [pdf]

    cs.NI

    A Survey of Machine Learning Algorithms for 6G Wireless Networks

    Authors: Anita Patil, Sridhar Iyer, Rahul Jashvantbhai Pandya

    Abstract: The primary focus of Artificial Intelligence/Machine Learning (AI/ML) integration within the wireless technology is to reduce capital expenditures, optimize network performance, and build new revenue streams. Replacing traditional algorithms with deep learning AI techniques has dramatically reduced the power consumption and improved the system performance. Further, implementation of ML algorithms…

    Submitted 16 March, 2022; originally announced March 2022.

  31. arXiv:2203.08426  [pdf]

    cs.NI eess.SP

    Survey on Internet of Things enabled by 6G Wireless Networks

    Authors: Sridhar Iyer, Rahul Jashvantbhai Pandya, Rakhee Kallimani, Krishna Pai, Rajashri Khanai, Dattaprasad Torse, Swati Mavinkattimath

    Abstract: The 6G wireless technology is visualized to revolutionize multiple customer services with the Internet of Things (IoT), thereby contributing to a ubiquitous intelligent society comprising autonomous systems. In this chapter, we conduct a detailed survey on the IoT networks with 6G wireless networks and investigate the trending possibilities provided by the 6G technology within the IoT networks and…

    Submitted 16 March, 2022; originally announced March 2022.

  32. A Survey on Technological Trends to Enhance Spectrum Efficiency in 6G Communications

    Authors: Sridhar Iyer, Anita Patil, Shilpa Bhairanatti, Soumya Halagatti, Rahul Jashvantbhai Pandya

    Abstract: The research community has already identified that, by 2030, 5G networks will reach the capacity limits, and hence, will be inadequate to support next generation bandwidth-hungry, ubiquitous, intelligent services, and applications. Therefore, in view of sustaining the competitive edge of wireless technology and stratifying the next decade's communication requirements, both industry and research co…

    Submitted 23 February, 2022; originally announced February 2022.

    Journal ref: 2022

  33. A Survey on Semantic Communications for Intelligent Wireless Networks

    Authors: Sridhar Iyer, Rajashri Khanai, Dattaprasad Torse, Rahul Jashvantbhai Pandya, Khaled Rabie, Krishna Pai, Wali Ullah Khan, Zubair Fadlullah

    Abstract: With deployment of 6G technology, it is envisioned that competitive edge of wireless networks will be sustained and next decade's communication requirements will be stratified. Also 6G will aim to aid development of a human society which is ubiquitous and mobile, simultaneously providing solutions to key challenges such as coverage, capacity, etc. In addition, 6G will focus on providing intellige…

    Submitted 10 August, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

    Journal ref: Wireless Personal Communications 129, 569-611 (2023)

  34. arXiv:2112.10684  [pdf, other]

    cs.CL cs.AI cs.LG

    Efficient Large Scale Language Modeling with Mixtures of Experts

    Authors: Mikel Artetxe, Shruti Bhosale, Naman Goyal, Todor Mihaylov, Myle Ott, Sam Shleifer, Xi Victoria Lin, Jingfei Du, Srinivasan Iyer, Ramakanth Pasunuru, Giri Anantharaman, Xian Li, Shuohui Chen, Halil Akin, Mandeep Baines, Louis Martin, Xing Zhou, Punit Singh Koura, Brian O'Horo, Jeff Wang, Luke Zettlemoyer, Mona Diab, Zornitsa Kozareva, Ves Stoyanov

    Abstract: Mixture of Experts layers (MoEs) enable efficient scaling of language models through conditional computation. This paper presents a detailed empirical study of how autoregressive MoE language models scale in comparison with dense models in a wide range of settings: in- and out-of-domain language modeling, zero- and few-shot priming, and full-shot fine-tuning. With the exception of fine-tuning, we…

    Submitted 26 October, 2022; v1 submitted 20 December, 2021; originally announced December 2021.

    Comments: EMNLP 2022

  35. arXiv:2112.04552  [pdf, other]

    cs.CE cs.AI cs.LG

    PATO: Producibility-Aware Topology Optimization using Deep Learning for Metal Additive Manufacturing

    Authors: Naresh S. Iyer, Amir M. Mirzendehdel, Sathyanarayanan Raghavan, Yang Jiao, Erva Ulu, Morad Behandish, Saigopal Nelaturi, Dean M. Robinson

    Abstract: In this paper, we propose PATO, a producibility-aware topology optimization (TO) framework to help efficiently explore the design space of components fabricated using metal additive manufacturing (AM), while ensuring manufacturability with respect to cracking. Specifically, parts fabricated through Laser Powder Bed Fusion are prone to defects such as warpage or cracking due to high residual stress…

    Submitted 8 December, 2021; originally announced December 2021.

  36. arXiv:2112.03276  [pdf, other]

    eess.IV cs.CV cs.LG

    Organ localisation using supervised and semi supervised approaches combining reinforcement learning with imitation learning

    Authors: Sankaran Iyer, Alan Blair, Laughlin Dawes, Daniel Moses, Christopher White, Arcot Sowmya

    Abstract: Computer aided diagnostics often requires analysis of a region of interest (ROI) within a radiology scan, and the ROI may be an organ or a suborgan. Although deep learning algorithms have the ability to outperform other methods, they rely on the availability of a large amount of annotated data. Motivated by the need to address this limitation, an approach to localisation and detection of multiple…

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: 16 pages, 12 figures

  37. arXiv:2111.13654  [pdf, other]

    cs.CL cs.AI cs.LG

    Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs

    Authors: Peter Hase, Mona Diab, Asli Celikyilmaz, Xian Li, Zornitsa Kozareva, Veselin Stoyanov, Mohit Bansal, Srinivasan Iyer

    Abstract: Do language models have beliefs about the world? Dennett (1995) famously argues that even thermostats have beliefs, on the view that a belief is simply an informational state decoupled from any motivational state. In this paper, we discuss approaches to detecting when models have beliefs about the world, and we improve on methods for updating model beliefs to be more truthful, with a focus on meth…

    Submitted 26 November, 2021; originally announced November 2021.

    Comments: 19 pages

  38. arXiv:2111.06474  [pdf, other]

    cs.CL

    AnswerSumm: A Manually-Curated Dataset and Pipeline for Answer Summarization

    Authors: Alexander R. Fabbri, Xiaojian Wu, Srini Iyer, Haoran Li, Mona Diab

    Abstract: Community Question Answering (CQA) fora such as Stack Overflow and Yahoo! Answers contain a rich resource of answers to a wide range of community-based questions. Each question thread can receive a large number of answers with different perspectives. One goal of answer summarization is to produce a summary that reflects the range of answer perspectives. A major obstacle for this task is the absenc…

    Submitted 29 April, 2022; v1 submitted 11 November, 2021; originally announced November 2021.

    Comments: NAACL 2022; arXiv admin note: substantial text overlap with arXiv:2104.08536

  39. arXiv:2109.00435   

    cs.CY cs.AI econ.GN

    Proceedings of KDD 2020 Workshop on Data-driven Humanitarian Mapping: Harnessing Human-Machine Intelligence for High-Stake Public Policy and Resilience Planning

    Authors: Snehalkumar, S. Gaikwad, Shankar Iyer, Dalton Lunga, Yu-Ru Lin

    Abstract: Humanitarian challenges, including natural disasters, food insecurity, climate change, racial and gender violence, environmental crises, the COVID-19 coronavirus pandemic, human rights violations, and forced displacements, disproportionately impact vulnerable communities worldwide. According to UN OCHA, 235 million people will require humanitarian assistance in 2021. Despite these growing perils,…

    Submitted 7 September, 2021; v1 submitted 1 September, 2021; originally announced September 2021.

    Comments: The proceedings of the 1st Data-driven Humanitarian Mapping workshop at the 26th ACM SIGKDD Conference on Knowledge Discovery & Data Mining

  40. arXiv:2109.00100   

    cs.CY cs.AI econ.GN

    Proceedings of KDD 2021 Workshop on Data-driven Humanitarian Mapping: Harnessing Human-Machine Intelligence for High-Stake Public Policy and Resilience Planning

    Authors: Snehalkumar, S. Gaikwad, Shankar Iyer, Dalton Lunga, Elizabeth Bondi

    Abstract: Humanitarian challenges, including natural disasters, food insecurity, climate change, racial and gender violence, environmental crises, the COVID-19 coronavirus pandemic, human rights violations, and forced displacements, disproportionately impact vulnerable communities worldwide. According to UN OCHA, 235 million people will require humanitarian assistance in 2021. Despite these growing perils,…

    Submitted 7 September, 2021; v1 submitted 31 August, 2021; originally announced September 2021.

    Comments: The proceedings of the 2nd Data-driven Humanitarian Mapping workshop at the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. August 15th, 2021

  41. arXiv:2107.06309  [pdf, ps, other]

    cs.CC math.FA

    Tight bounds on the Fourier growth of bounded functions on the hypercube

    Authors: Siddharth Iyer, Anup Rao, Victor Reis, Thomas Rothvoss, Amir Yehudayoff

    Abstract: We give tight bounds on the degree $\ell$ homogeneous parts $f_\ell$ of a bounded function $f$ on the cube. We show that if $f: \{\pm 1\}^n \rightarrow [-1,1]$ has degree $d$, then $\| f_\ell \|_\infty$ is bounded by $d^\ell/\ell!$, and $\| \hat{f}_\ell \|_1$ is bounded by $d^\ell e^{\binom{\ell+1}{2}} n^{\frac{\ell-1}{2}}$. We describe applications to pseudorandomness and learning theory. We use s…

    Submitted 19 July, 2021; v1 submitted 13 July, 2021; originally announced July 2021.

  42. arXiv:2105.06982  [pdf, other]

    cs.CL

    EASE: Extractive-Abstractive Summarization with Explanations

    Authors: Haoran Li, Arash Einolghozati, Srinivasan Iyer, Bhargavi Paranjape, Yashar Mehdad, Sonal Gupta, Marjan Ghazvininejad

    Abstract: Current abstractive summarization systems outperform their extractive counterparts, but their widespread adoption is inhibited by the inherent lack of interpretability. To achieve the best of both worlds, we propose EASE, an extractive-abstractive framework for evidence-based text generation and apply it to document summarization. We present an explainable summarization system based on the Informa…

    Submitted 14 May, 2021; originally announced May 2021.

  43. arXiv:2104.08536  [pdf, other]

    cs.CL

    Multi-Perspective Abstractive Answer Summarization

    Authors: Alexander R. Fabbri, Xiaojian Wu, Srini Iyer, Mona Diab

    Abstract: Community Question Answering (CQA) forums such as Stack Overflow and Yahoo! Answers contain a rich resource of answers to a wide range of questions. Each question thread can receive a large number of answers with different perspectives. The goal of multi-perspective answer summarization is to produce a summary that includes all perspectives of the answer. A major obstacle for multi-perspective, ab…

    Submitted 17 April, 2021; originally announced April 2021.

  44. arXiv:2102.04911  [pdf, other]

    cs.NI eess.SY

    The case for model-driven interpretability of delay-based congestion control protocols

    Authors: Muhammad Khan, Yasir Zaki, Shiva Iyer, Talal Ahamd, Thomas Pötsch, Jay Chen, Anirudh Sivaraman, Lakshmi Subramanian

    Abstract: Analyzing and interpreting the exact behavior of new delay-based congestion control protocols with complex non-linear control loops is exceptionally difficult in highly variable networks such as cellular networks. This paper proposes a Model-Driven Interpretability (MDI) congestion control framework, which derives a model version of a delay-based protocol by simplifying a congestion control protoc…

    Submitted 9 February, 2021; originally announced February 2021.

  45. arXiv:2012.15482  [pdf, other]

    cs.CL

    FiD-Ex: Improving Sequence-to-Sequence Models for Extractive Rationale Generation

    Authors: Kushal Lakhotia, Bhargavi Paranjape, Asish Ghoshal, Wen-tau Yih, Yashar Mehdad, Srinivasan Iyer

    Abstract: Natural language (NL) explanations of model predictions are gaining popularity as a means to understand and verify decisions made by large black-box pre-trained models, for NLP tasks such as Question Answering (QA) and Fact Verification. Recently, pre-trained sequence to sequence (seq2seq) models have proven to be very effective in jointly making predictions, as well as generating NL explanations.…

    Submitted 31 December, 2020; originally announced December 2020.

  46. arXiv:2012.15075  [pdf, other]

    cs.CL

    Human Evaluation of Spoken vs. Visual Explanations for Open-Domain QA

    Authors: Ana Valeria Gonzalez, Gagan Bansal, Angela Fan, Robin Jia, Yashar Mehdad, Srinivasan Iyer

    Abstract: While research on explaining predictions of open-domain QA systems (ODQA) to users is gaining momentum, most works have failed to evaluate the extent to which explanations improve user trust. While few works evaluate explanations using user studies, they employ settings that may deviate from the end-user's usage in-the-wild: ODQA is most ubiquitous in voice-assistants, yet current research only ev…

    Submitted 30 December, 2020; originally announced December 2020.

    Comments: pre-print

  47. arXiv:2010.10757  [pdf, other]

    cs.CL

    RECONSIDER: Re-Ranking using Span-Focused Cross-Attention for Open Domain Question Answering

    Authors: Srinivasan Iyer, Sewon Min, Yashar Mehdad, Wen-tau Yih

    Abstract: State-of-the-art Machine Reading Comprehension (MRC) models for Open-domain Question Answering (QA) are typically trained for span selection using distantly supervised positive examples and heuristically retrieved negative examples. This training scheme possibly explains empirical observations that these models achieve a high recall amongst their top few predictions, but a low overall accuracy, mo…

    Submitted 21 October, 2020; originally announced October 2020.

  48. arXiv:2010.09648  [pdf]

    cs.MA cs.CV eess.IV physics.soc-ph

    Agent-based Simulation Model and Deep Learning Techniques to Evaluate and Predict Transportation Trends around COVID-19

    Authors: Ding Wang, Fan Zuo, Jingqin Gao, Yueshuai He, Zilin Bian, Suzana Duran Bernardes, Chaekuk Na, Jingxing Wang, John Petinos, Kaan Ozbay, Joseph Y. J. Chow, Shri Iyer, Hani Nassif, Xuegang Jeff Ban

    Abstract: The COVID-19 pandemic has affected travel behaviors and transportation system operations, and cities are grappling with what policies can be effective for a phased reopening shaped by social distancing. This edition of the white paper updates travel trends and highlights an agent-based simulation model's results to predict the impact of proposed phased reopening strategies. It also introduces a re…

    Submitted 23 September, 2020; originally announced October 2020.

  49. arXiv:2010.02413  [pdf, other]

    cs.CL cs.AI

    Efficient One-Pass End-to-End Entity Linking for Questions

    Authors: Belinda Z. Li, Sewon Min, Srinivasan Iyer, Yashar Mehdad, Wen-tau Yih

    Abstract: We present ELQ, a fast end-to-end entity linking model for questions, which uses a biencoder to jointly perform mention detection and linking in one pass. Evaluated on WebQSP and GraphQuestions with extended annotations that cover multiple entities per question, ELQ outperforms the previous state of the art by a large margin of +12.7% and +19.6% F1, respectively. With a very fast inference time (1…

    Submitted 5 October, 2020; originally announced October 2020.

    Comments: 9 pages, EMNLP 2020

  50. arXiv:2009.14018  [pdf]

    physics.soc-ph cs.SI

    Toward the "New Normal": A Surge in Speeding, New Volume Patterns, and Recent Trends in Taxis/For-Hire Vehicles

    Authors: Jingqin Gao, Abhinav Bhattacharyya, Ding Wang, Nick Hudanich, Siva Sooryaa, Muruga Thambiran, Suzana Duran Bernardes, Chaekuk Na, Fan Zuo, Zilin Bian, Kaan Ozbay, Shri Iyer, Hani Nassif, Joseph Y. J. Chow

    Abstract: Six months into the pandemic and one month after the phase four reopening in New York City (NYC), restrictions are lifting, businesses and schools are reopening, but global infections are still rising. This white paper updates travel trends observed in the aftermath of the COVID-19 outbreak in NYC and highlights some findings toward the "new normal."

    Submitted 23 September, 2020; originally announced September 2020.