Showing 1–50 of 480 results for author: Arvind

  1. arXiv:2407.01502  [pdf, other]

    cs.LG cs.AI

    AI Agents That Matter

    Authors: Sayash Kapoor, Benedikt Stroebl, Zachary S. Siegel, Nitya Nadgir, Arvind Narayanan

    Abstract: AI agents are an exciting new research direction, and agent development is driven by benchmarks. Our analysis of current agent benchmarks and evaluation practices reveals several shortcomings that hinder their usefulness in real-world applications. First, there is a narrow focus on accuracy without attention to other metrics. As a result, SOTA agents are needlessly complex and costly, and the comm…

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2406.19670  [pdf, other]

    cs.SE cs.AI cs.LG

    Function+Data Flow: A Framework to Specify Machine Learning Pipelines for Digital Twinning

    Authors: Eduardo de Conto, Blaise Genest, Arvind Easwaran

    Abstract: The development of digital twins (DTs) for physical systems increasingly leverages artificial intelligence (AI), particularly for combining data from different sources or for creating computationally efficient, reduced-dimension models. Indeed, even in very different application domains, twinning employs common techniques such as model order reduction and modelization with hybrid data (that is, da…

    Submitted 8 July, 2024; v1 submitted 28 June, 2024; originally announced June 2024.

    Comments: 9 pages, 10 figures, to be published in AIware'24

  3. arXiv:2406.16746  [pdf, other]

    cs.LG cs.AI cs.CL

    The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources

    Authors: Shayne Longpre, Stella Biderman, Alon Albalak, Hailey Schoelkopf, Daniel McDuff, Sayash Kapoor, Kevin Klyman, Kyle Lo, Gabriel Ilharco, Nay San, Maribeth Rauh, Aviya Skowron, Bertie Vidgen, Laura Weidinger, Arvind Narayanan, Victor Sanh, David Adelani, Percy Liang, Rishi Bommasani, Peter Henderson, Sasha Luccioni, Yacine Jernite, Luca Soldaini

    Abstract: Foundation model development attracts a rapidly expanding body of contributors, scientists, and applications. To help shape responsible development practices, we introduce the Foundation Model Development Cheatsheet: a growing collection of 250+ tools and resources spanning text, vision, and speech modalities. We draw on a large body of prior work to survey resources (e.g. software, documentation,…

    Submitted 25 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

  4. arXiv:2406.14657  [pdf, other]

    cs.CL cs.AI cs.LG

    OpenDebateEvidence: A Massive-Scale Argument Mining and Summarization Dataset

    Authors: Allen Roush, Yusuf Shabazz, Arvind Balaji, Peter Zhang, Stefano Mezza, Markus Zhang, Sanjay Basu, Sriram Vishwanath, Mehdi Fatemi, Ravid Shwartz-Ziv

    Abstract: We introduce OpenDebateEvidence, a comprehensive dataset for argument mining and summarization sourced from the American Competitive Debate community. This dataset includes over 3.5 million documents with rich metadata, making it one of the most extensive collections of debate evidence. OpenDebateEvidence captures the complexity of arguments in high school and college debates, providing valuable r…

    Submitted 5 July, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: Accepted for Publication to ARGMIN 2024 at ACL2024

  5. arXiv:2406.09351  [pdf, ps, other]

    cs.CC cs.DM cs.LG math.CO

    On the Expressibility of the Reconstructional Color Refinement

    Authors: V. Arvind, Johannes Köbler, Oleg Verbitsky

    Abstract: One of the most basic facts related to the famous Ulam reconstruction conjecture is that the connectedness of a graph can be determined by the deck of its vertex-deleted subgraphs, which are considered up to isomorphism. We strengthen this result by proving that connectedness can still be determined when the subgraphs in the deck are given up to equivalence under the color refinement isomorphism t…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 9 pages

  6. arXiv:2405.19524  [pdf, other]

    cs.CR cs.AI

    AI Risk Management Should Incorporate Both Safety and Security

    Authors: Xiangyu Qi, Yangsibo Huang, Yi Zeng, Edoardo Debenedetti, Jonas Geiping, Luxi He, Kaixuan Huang, Udari Madhushani, Vikash Sehwag, Weijia Shi, Boyi Wei, Tinghao Xie, Danqi Chen, Pin-Yu Chen, Jeffrey Ding, Ruoxi Jia, Jiaqi Ma, Arvind Narayanan, Weijie J Su, Mengdi Wang, Chaowei Xiao, Bo Li, Dawn Song, Peter Henderson, Prateek Mittal

    Abstract: The exposure of security vulnerabilities in safety-aligned language models, e.g., susceptibility to adversarial attacks, has shed light on the intricate interplay between AI safety and AI security. Although the two disciplines now come together under the overarching goal of AI risk management, they have historically evolved separately, giving rise to differing perspectives. Therefore, in this pape…

    Submitted 29 May, 2024; originally announced May 2024.

  7. arXiv:2405.07264  [pdf, other]

    cs.IT

    Information Rates Over Multi-View Channels

    Authors: V. Arvind Rameshwar, Nir Weinberger

    Abstract: We investigate the fundamental limits of reliable communication over multi-view channels, in which the channel output is comprised of a large number of independent noisy views of a transmitted symbol. We consider first the setting of multi-view discrete memoryless channels and then extend our results to general multi-view channels (using multi-letter formulas). We argue that the channel capacity a…

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: 33 pages, 1 figure, submitted to the IEEE

  8. arXiv:2405.06261  [pdf, other]

    cs.CR cs.IT

    Improving the Privacy Loss Under User-Level DP Composition for Fixed Estimation Error

    Authors: V. Arvind Rameshwar, Anshoo Tandon

    Abstract: This paper considers the private release of statistics of several disjoint subsets of a dataset, under user-level $ε$-differential privacy (DP). In particular, we consider the user-level differentially private release of sample means and variances of speed values in several grids in a city, in a potentially sequential manner. Traditional analysis of the privacy loss due to the sequential composit…

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: 15 pages, 6 figures, to be submitted to the ACM

  9. arXiv:2405.05530  [pdf, other]

    cs.CV

    NurtureNet: A Multi-task Video-based Approach for Newborn Anthropometry

    Authors: Yash Khandelwal, Mayur Arvind, Sriram Kumar, Ashish Gupta, Sachin Kumar Danisetty, Piyush Bagad, Anish Madan, Mayank Lunayach, Aditya Annavajjala, Abhishek Maiti, Sansiddh Jain, Aman Dalmia, Namrata Deka, Jerome White, Jigar Doshi, Angjoo Kanazawa, Rahul Panicker, Alpan Raval, Srinivas Rana, Makarand Tapaswi

    Abstract: Malnutrition among newborns is a top public health concern in developing countries. Identification and subsequent growth monitoring are key to successful interventions. However, this is challenging in rural communities where health systems tend to be inaccessible and under-equipped, with poor adherence to protocol. Our goal is to equip health workers and public health systems with a solution for c…

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: Accepted at CVPM Workshop at CVPR 2024

  10. arXiv:2405.05506  [pdf, other]

    cs.CL

    Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias

    Authors: Shan Chen, Jack Gallifant, Mingye Gao, Pedro Moreira, Nikolaj Munch, Ajay Muthukkumar, Arvind Rajan, Jaya Kolluri, Amelia Fiske, Janna Hastings, Hugo Aerts, Brian Anthony, Leo Anthony Celi, William G. La Cava, Danielle S. Bitterman

    Abstract: Large language models (LLMs) are increasingly essential in processing natural languages, yet their application is frequently compromised by biases and inaccuracies originating in their training data. In this study, we introduce Cross-Care, the first benchmark framework dedicated to assessing biases and real world knowledge in LLMs, specifically focusing on the representation of disease prevalence…

    Submitted 24 June, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: Submitted for review, data visualization tool available at: www.crosscare.net

  11. arXiv:2404.19109  [pdf, other]

    cs.LG q-fin.GN

    The Shape of Money Laundering: Subgraph Representation Learning on the Blockchain with the Elliptic2 Dataset

    Authors: Claudio Bellei, Muhua Xu, Ross Phillips, Tom Robinson, Mark Weber, Tim Kaler, Charles E. Leiserson, Arvind, Jie Chen

    Abstract: Subgraph representation learning is a technique for analyzing local structures (or shapes) within complex networks. Enabled by recent developments in scalable Graph Neural Networks (GNNs), this approach encodes relational information at a subgroup level (multiple connected nodes) rather than at a node level of abstraction. We posit that certain domain applications, such as anti-money laundering (A…

    Submitted 1 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

  12. arXiv:2404.16382  [pdf, ps, other]

    cs.CC

    A Multivariate to Bivariate Reduction for Noncommutative Rank and Related Results

    Authors: Vikraman Arvind, Pushkar S Joglekar

    Abstract: We study the noncommutative rank problem, ncRANK, of computing the rank of matrices with linear entries in $n$ noncommuting variables and the problem of noncommutative Rational Identity Testing, RIT, which is to decide if a given rational formula in $n$ noncommuting variables is zero on its domain of definition. Motivated by the question whether these problems have deterministic NC algorithms, we…

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 31 pages

  13. arXiv:2404.10732  [pdf, other]

    cs.HC

    Attention-Aware Visualization: Tracking and Responding to User Perception Over Time

    Authors: Arvind Srinivasan, Johannes Ellemose, Peter W. S. Butcher, Panagiotis D. Ritsos, Niklas Elmqvist

    Abstract: We propose the notion of Attention-Aware Visualizations (AAVs) that track the user's perception of a visual representation over time and feed this information back to the visualization. Such context awareness is particularly useful for ubiquitous and immersive analytics where knowing which embedded visualizations the user is looking at can be used to make visualizations react appropriately to the…

    Submitted 16 April, 2024; originally announced April 2024.

  14. arXiv:2404.10085   

    cs.IT

    The Average Spectrum Norm and Near-Optimal Tensor Completion

    Authors: Oscar López, Richard Lehoucq, Carlos Llosa-Vite, Arvind Prasadan, Daniel M. Dunlavy

    Abstract: We introduce a new tensor norm, the average spectrum norm, to study sample complexity of tensor completion problems based on the canonical polyadic decomposition (CPD). Properties of the average spectrum norm and its dual norm are investigated, demonstrating their utility for low-rank tensor recovery analysis. Our novel approach significantly reduces the provable sample rate for CPD-based noisy te…

    Submitted 17 June, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

    Comments: Error, in Section 2.1.2

  15. arXiv:2404.09091  [pdf, other]

    cs.IR cs.AI cs.CL cs.LG

    Semantic In-Domain Product Identification for Search Queries

    Authors: Sanat Sharma, Jayant Kumar, Twisha Naik, Zhaoyu Lu, Arvind Srikantan, Tracy Holloway King

    Abstract: Accurate explicit and implicit product identification in search queries is critical for enhancing user experiences, especially at a company like Adobe which has over 50 products and covers queries across hundreds of tools. In this work, we present a novel approach to training a product classifier from user behavioral data. Our semantic model led to >25% relative improvement in CTR (click through r…

    Submitted 29 May, 2024; v1 submitted 13 April, 2024; originally announced April 2024.

  16. arXiv:2404.07986  [pdf, ps, other]

    cs.CC cs.FL

    Trading Determinism for Noncommutativity in Edmonds' Problem

    Authors: V. Arvind, Abhranil Chatterjee, Partha Mukhopadhyay

    Abstract: Let $X=X_1\sqcup X_2\sqcup\ldots\sqcup X_k$ be a partitioned set of variables such that the variables in each part $X_i$ are noncommuting but for any $i\neq j$, the variables $x\in X_i$ commute with the variables $x'\in X_j$. Given as input a square matrix $T$ whose entries are linear forms over $\mathbb{Q}\langle{X}\rangle$, we consider the problem of checking if $T$ is invertible or not over the…

    Submitted 11 April, 2024; originally announced April 2024.

  17. Contextual AI Journaling: Integrating LLM and Time Series Behavioral Sensing Technology to Promote Self-Reflection and Well-being using the MindScape App

    Authors: Subigya Nepal, Arvind Pillai, William Campbell, Talie Massachi, Eunsol Soul Choi, Orson Xu, Joanna Kuc, Jeremy Huckins, Jason Holden, Colin Depp, Nicholas Jacobson, Mary Czerwinski, Eric Granholm, Andrew T. Campbell

    Abstract: MindScape aims to study the benefits of integrating time series behavioral patterns (e.g., conversational engagement, sleep, location) with Large Language Models (LLMs) to create a new form of contextual AI journaling, promoting self-reflection and well-being. We argue that integrating behavioral sensing in LLMs will likely lead to a new frontier in AI. In this Late-Breaking Work paper, we discuss…

    Submitted 30 March, 2024; originally announced April 2024.

    ACM Class: H.5.0; H.5.3; H.5.m; J.0

  18. arXiv:2403.17277  [pdf, other]

    cs.NI

    Relational Network Verification

    Authors: Xieyang Xu, Yifei Yuan, Zachary Kincaid, Arvind Krishnamurthy, Ratul Mahajan, David Walker, Ennan Zhai

    Abstract: Relational network verification is a new approach to validating network changes. In contrast to traditional network verification, which analyzes specifications for a single network snapshot, relational network verification analyzes specifications concerning two network snapshots (e.g., pre- and post-change snapshots) and captures their similarities and differences. Relational change specifications…

    Submitted 25 March, 2024; originally announced March 2024.

  19. arXiv:2403.13411  [pdf, other]

    cs.DC

    Optimal Fixed Priority Scheduling in Multi-Stage Multi-Resource Distributed Real-Time Systems

    Authors: Niraj Kumar, Chuanchao Gao, Arvind Easwaran

    Abstract: This work studies fixed priority (FP) scheduling of real-time jobs with end-to-end deadlines in a distributed system. Specifically, given a multi-stage pipeline with multiple heterogeneous resources of the same type at each stage, the problem is to assign priorities to a set of real-time jobs with different release times to access a resource at each stage of the pipeline subject to the end-to-end…

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: Accepted in DATE (Design, Automation and Test in Europe Conference) 2024

  20. arXiv:2403.11411  [pdf, other]

    cs.NI

    Laconic: Streamlined Load Balancers for SmartNICs

    Authors: Tianyi Cui, Chenxingyu Zhao, Wei Zhang, Kaiyuan Zhang, Arvind Krishnamurthy

    Abstract: Load balancers are pervasively used inside today's clouds to scalably distribute network requests across data center servers. Given the extensive use of load balancers and their associated operating costs, several efforts have focused on improving their efficiency by implementing Layer-4 load-balancing logic within the kernel or using hardware acceleration. This work explores whether the more comp…

    Submitted 17 March, 2024; originally announced March 2024.

  21. arXiv:2403.10401  [pdf, other]

    cs.RO cs.AI

    SculptDiff: Learning Robotic Clay Sculpting from Humans with Goal Conditioned Diffusion Policy

    Authors: Alison Bartsch, Arvind Car, Charlotte Avra, Amir Barati Farimani

    Abstract: Manipulating deformable objects remains a challenge within robotics due to the difficulties of state estimation, long-horizon planning, and predicting how the object will deform given an interaction. These challenges are the most pronounced with 3D deformable objects. We propose SculptDiff, a goal-conditioned diffusion-based imitation learning framework that works with point cloud state observatio…

    Submitted 15 March, 2024; originally announced March 2024.

  22. arXiv:2403.07918  [pdf, other]

    cs.CY cs.AI cs.LG

    On the Societal Impact of Open Foundation Models

    Authors: Sayash Kapoor, Rishi Bommasani, Kevin Klyman, Shayne Longpre, Ashwin Ramaswami, Peter Cihon, Aspen Hopkins, Kevin Bankston, Stella Biderman, Miranda Bogen, Rumman Chowdhury, Alex Engler, Peter Henderson, Yacine Jernite, Seth Lazar, Stefano Maffulli, Alondra Nelson, Joelle Pineau, Aviya Skowron, Dawn Song, Victor Storchan, Daniel Zhang, Daniel E. Ho, Percy Liang, Arvind Narayanan

    Abstract: Foundation models are powerful technologies: how they are released publicly directly shapes their societal impact. In this position paper, we focus on open foundation models, defined here as those with broadly available model weights (e.g. Llama 2, Stable Diffusion XL). We identify five distinctive properties (e.g. greater customizability, poor monitoring) of open foundation models that lead to bo…

    Submitted 27 February, 2024; originally announced March 2024.

  23. arXiv:2403.05893  [pdf, other]

    cs.IT

    Estimating the Weight Enumerators of Reed-Muller Codes via Sampling

    Authors: Shreyas Jain, V. Arvind Rameshwar, Navin Kashyap

    Abstract: This paper develops an algorithmic approach for obtaining estimates of the weight enumerators of Reed-Muller (RM) codes. Our algorithm is based on a technique for estimating the partition functions of spin systems, which in turn employs a sampler that produces codewords according to a suitably defined Gibbs distribution. We apply our method to moderate-blocklength RM codes and derive approximate v…

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: 8 pages, 1 figure, 4 tables; submitted to the IEEE for possible publication. arXiv admin note: substantial text overlap with arXiv:2309.08907

  24. arXiv:2403.04893  [pdf, other]

    cs.AI

    A Safe Harbor for AI Evaluation and Red Teaming

    Authors: Shayne Longpre, Sayash Kapoor, Kevin Klyman, Ashwin Ramaswami, Rishi Bommasani, Borhane Blili-Hamelin, Yangsibo Huang, Aviya Skowron, Zheng-Xin Yong, Suhas Kotha, Yi Zeng, Weiyan Shi, Xianjun Yang, Reid Southen, Alexander Robey, Patrick Chao, Diyi Yang, Ruoxi Jia, Daniel Kang, Sandy Pentland, Arvind Narayanan, Percy Liang, Peter Henderson

    Abstract: Independent evaluation and red teaming are critical for identifying the risks posed by generative AI systems. However, the terms of service and enforcement strategies used by prominent AI companies to deter model misuse have disincentives on good faith safety evaluations. This causes some researchers to fear that conducting such research or releasing their findings will result in account suspensio…

    Submitted 7 March, 2024; originally announced March 2024.

  25. Umwelt: Accessible Structured Editing of Multimodal Data Representations

    Authors: Jonathan Zong, Isabella Pedraza Pineros, Mengzhu Katie Chen, Daniel Hajas, Arvind Satyanarayan

    Abstract: We present Umwelt, an authoring environment for interactive multimodal data representations. In contrast to prior approaches, which center the visual modality, Umwelt treats visualization, sonification, and textual description as coequal representations: they are all derived from a shared abstract data model, such that no modality is prioritized over the others. To simplify specification, Umwelt e…

    Submitted 3 March, 2024; v1 submitted 29 February, 2024; originally announced March 2024.

    Comments: ACM CHI 2024

  26. arXiv:2402.16268  [pdf, other]

    cs.LG cs.AI cs.CY

    Foundation Model Transparency Reports

    Authors: Rishi Bommasani, Kevin Klyman, Shayne Longpre, Betty Xiong, Sayash Kapoor, Nestor Maslej, Arvind Narayanan, Percy Liang

    Abstract: Foundation models are critical digital technologies with sweeping societal impact that necessitates transparency. To codify how foundation model developers should provide transparency about the development and deployment of their models, we propose Foundation Model Transparency Reports, drawing upon the transparency reporting practices in social media. While external documentation of societal harm…

    Submitted 25 February, 2024; originally announced February 2024.

  27. MoodCapture: Depression Detection Using In-the-Wild Smartphone Images

    Authors: Subigya Nepal, Arvind Pillai, Weichen Wang, Tess Griffin, Amanda C. Collins, Michael Heinz, Damien Lekkas, Shayan Mirjafari, Matthew Nemesure, George Price, Nicholas C. Jacobson, Andrew T. Campbell

    Abstract: MoodCapture presents a novel approach that assesses depression based on images automatically captured from the front-facing camera of smartphones as people go about their daily lives. We collect over 125,000 photos in the wild from N=177 participants diagnosed with major depressive disorder for 90 days. Images are captured naturalistically while participants respond to the PHQ-8 depression survey…

    Submitted 25 February, 2024; originally announced February 2024.

    ACM Class: H.5.0; H.5.3; H.5.m; J.0

  28. arXiv:2402.06787  [pdf, other]

    cs.NI cs.DC cs.LG

    ForestColl: Efficient Collective Communications on Heterogeneous Network Fabrics

    Authors: Liangyu Zhao, Saeed Maleki, Ziyue Yang, Hossein Pourreza, Aashaka Shah, Changho Hwang, Arvind Krishnamurthy

    Abstract: As modern DNN models grow ever larger, collective communications between the accelerators (allreduce, etc.) emerge as a significant performance bottleneck. Designing efficient communication schedules is challenging given today's highly diverse and heterogeneous network fabrics. In this paper, we present ForestColl, a tool that generates efficient schedules for any network topology. ForestColl cons…

    Submitted 9 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: text overlap with arXiv:2305.18461

  29. arXiv:2402.01656  [pdf, other]

    cs.CY cs.AI

    Promises and pitfalls of artificial intelligence for legal applications

    Authors: Sayash Kapoor, Peter Henderson, Arvind Narayanan

    Abstract: Is AI set to redefine the legal profession? We argue that this claim is not supported by the current evidence. We dive into AI's increasingly prevalent roles in three types of legal tasks: information processing; tasks involving creativity, reasoning, or judgment; and predictions about the future. We find that the ease of evaluating legal applications varies greatly across legal tasks, based on th…

    Submitted 10 January, 2024; originally announced February 2024.

  30. arXiv:2401.15906  [pdf, other]

    cs.CR cs.IT stat.AP

    Mean Estimation with User-Level Privacy for Spatio-Temporal IoT Datasets

    Authors: V. Arvind Rameshwar, Anshoo Tandon, Prajjwal Gupta, Aditya Vikram Singh, Novoneel Chakraborty, Abhay Sharma

    Abstract: This paper considers the problem of the private release of sample means of speed values from traffic datasets. Our key contribution is the development of user-level differentially private algorithms that incorporate carefully chosen parameter values to ensure low estimation errors on real-world datasets, while ensuring privacy. We test our algorithms on ITMS (Intelligent Traffic Management System)…

    Submitted 25 April, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: 14 pages, 5 figures, submitted to the ACM for possible publication

  31. arXiv:2401.11037  [pdf, other]

    cs.LG math.NA q-bio.QM

    Equivariant Graph Neural Operator for Modeling 3D Dynamics

    Authors: Minkai Xu, Jiaqi Han, Aaron Lou, Jean Kossaifi, Arvind Ramanathan, Kamyar Azizzadenesheli, Jure Leskovec, Stefano Ermon, Anima Anandkumar

    Abstract: Modeling the complex three-dimensional (3D) dynamics of relational systems is an important problem in the natural sciences, with applications ranging from molecular simulations to particle mechanics. Machine learning methods have achieved good success by learning graph neural networks to model spatial interactions. However, these approaches do not faithfully capture temporal correlations since the…

    Submitted 2 June, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: Proceedings of the 41st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024. Copyright 2024 by the author(s)

  32. arXiv:2311.10302  [pdf, other]

    cs.HC cs.CY

    Social Isolation and Serious Mental Illness: The Role of Context-Aware Mobile Interventions

    Authors: Subigya Nepal, Arvind Pillai, Emma M. Parrish, Jason Holden, Colin Depp, Andrew T. Campbell, Eric Granholm

    Abstract: Social isolation is a common problem faced by individuals with serious mental illness (SMI), and current intervention approaches have limited effectiveness. This paper presents a blended intervention approach, called mobile Social Interaction Therapy by Exposure (mSITE), to address social isolation in individuals with serious mental illness. The approach combines brief in-person cognitive-behavior…

    Submitted 16 November, 2023; originally announced November 2023.

    ACM Class: H.5.0; H.5.m; J.0; J.3; J.4; J.m

  33. arXiv:2311.07753  [pdf, other]

    cs.NI

    Bringing Reconfigurability to the Network Stack

    Authors: Akshay Narayan, Aurojit Panda, Mohammad Alizadeh, Hari Balakrishnan, Arvind Krishnamurthy, Scott Shenker

    Abstract: Reconfiguring the network stack allows applications to specialize the implementations of communication libraries depending on where they run, the requests they serve, and the performance they need to provide. Specializing applications in this way is challenging because developers need to choose the libraries they use when writing a program and cannot easily change them at runtime. This paper intro…

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: 12 pages, 10 figures

  34. arXiv:2311.02332  [pdf, other]

    cs.LG cs.CV

    Multimodal Machine Learning in Image-Based and Clinical Biomedicine: Survey and Prospects

    Authors: Elisa Warner, Joonsang Lee, William Hsu, Tanveer Syeda-Mahmood, Charles Kahn, Olivier Gevaert, Arvind Rao

    Abstract: Machine learning (ML) applications in medical artificial intelligence (AI) systems have shifted from traditional and statistical methods to increasing application of deep learning models. This survey navigates the current landscape of multimodal ML, focusing on its profound impact on medical image analysis and clinical decision support systems. Emphasizing challenges and innovations in addressing…

    Submitted 19 January, 2024; v1 submitted 4 November, 2023; originally announced November 2023.

  35. arXiv:2310.19102  [pdf, other]

    cs.LG

    Atom: Low-bit Quantization for Efficient and Accurate LLM Serving

    Authors: Yilong Zhao, Chien-Yu Lin, Kan Zhu, Zihao Ye, Lequn Chen, Size Zheng, Luis Ceze, Arvind Krishnamurthy, Tianqi Chen, Baris Kasikci

    Abstract: The growing demand for Large Language Models (LLMs) in applications such as content generation, intelligent chatbots, and sentiment analysis poses considerable challenges for LLM service providers. To efficiently use GPU resources and boost throughput, batching multiple requests has emerged as a popular paradigm; to further speed up batching, LLM quantization techniques reduce memory consumption a…

    Submitted 16 April, 2024; v1 submitted 29 October, 2023; originally announced October 2023.

  36. arXiv:2310.18547  [pdf, other]

    cs.DC cs.LG

    Punica: Multi-Tenant LoRA Serving

    Authors: Lequn Chen, Zihao Ye, Yongji Wu, Danyang Zhuo, Luis Ceze, Arvind Krishnamurthy

    Abstract: Low-rank adaptation (LoRA) has become an important and popular method to adapt pre-trained models to specific domains. We present Punica, a system to serve multiple LoRA models in a shared GPU cluster. Punica contains a new CUDA kernel design that allows batching of GPU operations for different LoRA models. This allows a GPU to hold only a single copy of the underlying pre-trained model when servi…

    Submitted 27 October, 2023; originally announced October 2023.

  37. arXiv:2310.04727  [pdf, other]

    cs.LG cs.AI

    Task Aware Modulation using Representation Learning: An Approach for Few Shot Learning in Heterogeneous Systems

    Authors: Arvind Renganathan, Rahul Ghosh, Ankush Khandelwal, Vipin Kumar

    Abstract: We present a Task-aware modulation using Representation Learning (TAM-RL) framework that enhances personalized predictions in few-shot settings for heterogeneous systems when individual task characteristics are not known. TAM-RL extracts embeddings representing the actual inherent characteristics of these entities and uses these characteristics to personalize the predictions for each entity/task.…

    Submitted 7 October, 2023; originally announced October 2023.

  38. arXiv:2310.04610  [pdf, other]

    cs.AI cs.LG

    DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies

    Authors: Shuaiwen Leon Song, Bonnie Kruft, Minjia Zhang, Conglong Li, Shiyang Chen, Chengming Zhang, Masahiro Tanaka, Xiaoxia Wu, Jeff Rasley, Ammar Ahmad Awan, Connor Holmes, Martin Cai, Adam Ghanem, Zhongzhu Zhou, Yuxiong He, Pete Luferenko, Divya Kumar, Jonathan Weyn, Ruixiong Zhang, Sylwester Klocek, Volodymyr Vragov, Mohammed AlQuraishi, Gustaf Ahdritz, Christina Floristean, Cristina Negri, et al. (67 additional authors not shown)

    Abstract: In the upcoming decade, deep learning may revolutionize the natural sciences, enhancing our capacity to model and predict natural occurrences. This could herald a new era of scientific exploration, bringing significant advancements across sectors from drug development to renewable energy. To answer this call, we present DeepSpeed4Science initiative (deepspeed4science.ai) which aims to build unique…

    Submitted 11 October, 2023; v1 submitted 6 October, 2023; originally announced October 2023.

  39. arXiv:2310.04391  [pdf, ps, other]

    cs.CC math.CO

    On a Hierarchy of Spectral Invariants for Graphs

    Authors: V. Arvind, Frank Fuhlbrück, Johannes Köbler, Oleg Verbitsky

    Abstract: We consider a hierarchy of graph invariants that naturally extends the spectral invariants defined by Fürer (Lin. Alg. Appl. 2010) based on the angles formed by the set of standard basis vectors and their projections onto eigenspaces of the adjacency matrix. We provide a purely combinatorial characterization of this hierarchy in terms of the walk counts. This allows us to give a complete answer to…

    Submitted 30 May, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: 32 pages, 1 diagram, 1 figure. A preliminary version of this paper appeared in the Proceedings of the 41st International Symposium on Theoretical Aspects of Computer Science (STACS'24), published in LIPIcs Vol. 289, Schloss Dagstuhl, Leibniz-Zentrum für Informatik, 2024

  40. arXiv:2310.02193  [pdf, other]

    cs.LG cs.AI stat.AP

    Uncertainty Quantification in Inverse Models in Hydrology

    Authors: Somya Sharma Chatterjee, Rahul Ghosh, Arvind Renganathan, Xiang Li, Snigdhansu Chatterjee, John Nieber, Christopher Duffy, Vipin Kumar

    Abstract: In hydrology, modeling streamflow remains a challenging task due to the limited availability of basin characteristics information such as soil geology and geomorphology. These characteristics may be noisy due to measurement errors or may be missing altogether. To overcome this challenge, we propose a knowledge-guided, probabilistic inverse modeling method for recovering physical characteristics fr…

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2210.06213

  41. arXiv:2309.16882  [pdf, other]

    cs.LG

    Message Propagation Through Time: An Algorithm for Sequence Dependency Retention in Time Series Modeling

    Authors: Shaoming Xu, Ankush Khandelwal, Arvind Renganathan, Vipin Kumar

    Abstract: Time series modeling, a crucial area in science, often encounters challenges when training Machine Learning (ML) models like Recurrent Neural Networks (RNNs) using the conventional mini-batch training strategy that assumes independent and identically distributed (IID) samples and initializes RNNs with zero hidden states. The IID assumption ignores temporal dependencies among samples, resulting in…

    Submitted 28 September, 2023; originally announced September 2023.

  42. arXiv:2309.16654  [pdf, other]

    cs.CV

    Novel Deep Learning Pipeline for Automatic Weapon Detection

    Authors: Haribharathi Sivakumar, Vijay Arvind. R, Pawan Ragavendhar V, G. Balamurugan

    Abstract: Weapon and gun violence have recently become a pressing issue today. The degree of these crimes and activities has risen to the point of being termed as an epidemic. This prevalent misuse of weapons calls for an automatic system that detects weapons in real-time. Real-time surveillance video is captured and recorded in almost all public forums and places. These videos contain abundant raw data whi…

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: Accepted for presentation at the IEEE 2nd International Conference on Automation, Robotics and Computer Engineering

  43. arXiv:2309.15647  [pdf, ps, other]

    cs.CC cs.DS

    Black-Box Identity Testing of Noncommutative Rational Formulas in Deterministic Quasipolynomial Time

    Authors: V. Arvind, Abhranil Chatterjee, Partha Mukhopadhyay

    Abstract: Rational Identity Testing (RIT) is the decision problem of determining whether or not a noncommutative rational formula computes zero in the free skew field. It admits a deterministic polynomial-time white-box algorithm [Garg, Gurvits, Oliveira, and Wigderson (2016); Ivanyos, Qiao, Subrahmanyam (2018); Hamada and Hirai (2021)], and a randomized polynomial-time algorithm [Derksen and Makam (2017)]…

    Submitted 6 April, 2024; v1 submitted 27 September, 2023; originally announced September 2023.

  44. arXiv:2309.13541  [pdf, other]

    cs.DC cs.NI

    Efficient All-to-All Collective Communication Schedules for Direct-Connect Topologies

    Authors: Prithwish Basu, Liangyu Zhao, Jason Fantl, Siddharth Pal, Arvind Krishnamurthy, Joud Khoury

    Abstract: The all-to-all collective communications primitive is widely used in machine learning (ML) and high performance computing (HPC) workloads, and optimizing its performance is of interest to both ML and HPC communities. All-to-all is a particularly challenging workload that can severely strain the underlying interconnect bandwidth at scale. This paper takes a holistic approach to optimize the perform…

    Submitted 25 April, 2024; v1 submitted 23 September, 2023; originally announced September 2023.

    Comments: HPDC '24

  45. arXiv:2309.12624  [pdf, other]

    cs.NI

    Quark: A High-Performance Secure Container Runtime for Serverless Computing

    Authors: Chenxingyu Zhao, Yulin Sun, Ying Xiong, Arvind Krishnamurthy

    Abstract: Secure container runtimes serve as the foundational layer for creating and running containers, which is the bedrock of emerging computing paradigms like microservices and serverless computing. Although existing secure container runtimes indeed enhance security via running containers over a guest kernel and a Virtual Machine Monitor (VMM or Hypervisor), they incur performance penalties in critical…

    Submitted 6 October, 2023; v1 submitted 22 September, 2023; originally announced September 2023.

    Comments: arXiv admin note: text overlap with arXiv:2305.10621. The paper on arXiv:2305.10621 presents a detailed version of the TSoR module in Quark

  46. arXiv:2309.10291  [pdf, other]

    cs.LG cs.AI eess.SP

    Koopman Invertible Autoencoder: Leveraging Forward and Backward Dynamics for Temporal Modeling

    Authors: Kshitij Tayal, Arvind Renganathan, Rahul Ghosh, Xiaowei Jia, Vipin Kumar

    Abstract: Accurate long-term predictions are the foundations for many machine learning applications and decision-making processes. However, building accurate long-term prediction models remains challenging due to the limitations of existing temporal models like recurrent neural networks (RNNs), as they capture only the statistical connections in the training data and may fail to learn the underlying dynamic…

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: Accepted at IEEE International Conference on Data Mining (ICDM) 2023

  47. arXiv:2309.09944  [pdf, other]

    cs.LG cs.AI cs.CV cs.CY

    DiffusionWorldViewer: Exposing and Broadening the Worldview Reflected by Generative Text-to-Image Models

    Authors: Zoe De Simone, Angie Boggust, Arvind Satyanarayan, Ashia Wilson

    Abstract: Generative text-to-image (TTI) models produce high-quality images from short textual descriptions and are widely used in academic and creative domains. Like humans, TTI models have a worldview, a conception of the world learned from their training data and task that influences the images they generate for a given prompt. However, the worldviews of TTI models are often hidden from users, making it…

    Submitted 5 February, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: 20 pages, 8 figures

  48. arXiv:2309.09191  [pdf, other]

    cs.LG q-bio.BM

    End-to-End Optimized Pipeline for Prediction of Protein Folding Kinetics

    Authors: Vijay Arvind. R, Haribharathi Sivakumar, Brindha. R

    Abstract: Protein folding is the intricate process by which a linear sequence of amino acids self-assembles into a unique three-dimensional structure. Protein folding kinetics is the study of pathways and time-dependent mechanisms a protein undergoes when it folds. Understanding protein kinetics is essential as a protein needs to fold correctly for it to perform its biological functions optimally, and a mis…

    Submitted 17 September, 2023; originally announced September 2023.

    Comments: Accepted for presentation at the 22nd International Conference on Machine Learning and Applications

  49. arXiv:2309.09175   

    cs.LG cs.AI

    Imbalanced Data Stream Classification using Dynamic Ensemble Selection

    Authors: Priya. S, Haribharathi Sivakumar, Vijay Arvind. R

    Abstract: Modern streaming data categorization faces significant challenges from concept drift and class imbalanced data. This negatively impacts the output of the classifier, leading to improper classification. Furthermore, other factors such as the overlapping of multiple classes limit the extent of the correctness of the output. This work proposes a novel framework for integrating data pre-processing and…

    Submitted 28 September, 2023; v1 submitted 17 September, 2023; originally announced September 2023.

    Comments: Made an error in the research and need to rectify it

  50. arXiv:2309.08907  [pdf, other]

    cs.IT

    Sampling-Based Estimates of the Sizes of Constrained Subcodes of Reed-Muller Codes

    Authors: V. Arvind Rameshwar, Shreyas Jain, Navin Kashyap

    Abstract: This paper develops an algorithmic approach for obtaining approximate, numerical estimates of the sizes of subcodes of Reed-Muller (RM) codes, all of the codewords in which satisfy a given constraint. Our algorithm is based on a statistical physics technique for estimating the partition functions of spin systems, which in turn makes use of a sampler that produces RM codewords according to a Gibbs…

    Submitted 19 September, 2023; v1 submitted 16 September, 2023; originally announced September 2023.

    Comments: 20 pages, 3 figures, 4 tables, to be submitted to the IEEE for possible publication