Showing 1–50 of 199 results for author: Chaudhuri, S

  1. arXiv:2407.11274  [pdf, other]

    cs.LG cs.CR stat.ML

    Empirical Mean and Frequency Estimation Under Heterogeneous Privacy: A Worst-Case Analysis

    Authors: Syomantak Chaudhuri, Thomas A. Courtade

    Abstract: Differential Privacy (DP) is the current gold standard for measuring privacy. Estimation problems under DP constraints appearing in the literature have largely focused on providing equal privacy to all users. We consider the problems of empirical mean estimation for univariate data and frequency estimation for categorical data, two pillars of data analysis in the industry, subject to heterogeneous…

    Submitted 15 July, 2024; originally announced July 2024.

  2. arXiv:2407.11214  [pdf, ps, other]

    cs.AI cs.CL cs.LG cs.LO cs.PL

    PutnamBench: Evaluating Neural Theorem-Provers on the Putnam Mathematical Competition

    Authors: George Tsoukalas, Jasper Lee, John Jennings, Jimmy Xin, Michelle Ding, Michael Jennings, Amitayush Thakur, Swarat Chaudhuri

    Abstract: We present PutnamBench, a new multilingual benchmark for evaluating the ability of neural theorem-provers to solve competition mathematics problems. PutnamBench consists of 1697 hand-constructed formalizations of 640 theorems sourced from the William Lowell Putnam Mathematical Competition, the premier undergraduate-level mathematics competition in North America. All the theorems have formalization…

    Submitted 15 July, 2024; originally announced July 2024.

  3. arXiv:2405.15282  [pdf, other]

    cs.LG cs.AI

    Prompt Tuning Strikes Back: Customizing Foundation Models with Low-Rank Prompt Adaptation

    Authors: Abhinav Jain, Swarat Chaudhuri, Thomas Reps, Chris Jermaine

    Abstract: Parameter-Efficient Fine-Tuning (PEFT) has become the standard for customising Foundation Models (FMs) to user-specific downstream tasks. However, typical PEFT methods require storing multiple task-specific adapters, creating scalability issues as these adapters must be housed and run at the FM server. Traditional prompt tuning offers a potential solution by customising them through task-specific…

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 11 pages, 4 figures, 3 tables

  4. arXiv:2404.12608  [pdf, other]

    cs.DB cs.CL cs.PL

    Auto-Formula: Recommend Formulas in Spreadsheets using Contrastive Learning for Table Representations

    Authors: Sibei Chen, Yeye He, Weiwei Cui, Ju Fan, Song Ge, Haidong Zhang, Dongmei Zhang, Surajit Chaudhuri

    Abstract: Spreadsheets are widely recognized as the most popular end-user programming tools, which blend the power of formula-based computation with an intuitive table-based interface. Today, spreadsheets are used by billions of users to manipulate tables, most of whom are neither database experts nor professional programmers. Despite the success of spreadsheets, authoring complex formulas remains challe…

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: full version of a paper to appear in SIGMOD 2024

  5. arXiv:2404.11578  [pdf, other]

    cs.LG cs.AI cs.FL

    LTL-Constrained Policy Optimization with Cycle Experience Replay

    Authors: Ameesh Shah, Cameron Voloshin, Chenxi Yang, Abhinav Verma, Swarat Chaudhuri, Sanjit A. Seshia

    Abstract: Linear Temporal Logic (LTL) offers a precise means for constraining the behavior of reinforcement learning agents. However, in many tasks, LTL is insufficient for task specification; LTL-constrained policy optimization, where the goal is to optimize a scalar reward under LTL constraints, is needed. Prior methods for this constrained problem are restricted to finite state spaces. In this work, we p…

    Submitted 24 May, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

    Comments: preprint, 9 pages in main text

  6. arXiv:2404.07645  [pdf, other]

    cs.CV

    Simba: Mamba augmented U-ShiftGCN for Skeletal Action Recognition in Videos

    Authors: Soumyabrata Chaudhuri, Saumik Bhattacharya

    Abstract: Skeleton Action Recognition (SAR) involves identifying human actions using skeletal joint coordinates and their interconnections. While plain Transformers have been attempted for this task, they still fall short compared to the current leading methods, which are rooted in Graph Convolutional Networks (GCNs) due to the absence of structural priors. Recently, a novel selective state space model, Mam…

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: 20 pages, 6 tables, 1 figure

  7. arXiv:2403.15476  [pdf, other]

    cs.CV cs.AI cs.GR cs.LG

    Learning to Infer Generative Template Programs for Visual Concepts

    Authors: R. Kenny Jones, Siddhartha Chaudhuri, Daniel Ritchie

    Abstract: People grasp flexible visual concepts from a few examples. We explore a neurosymbolic system that learns how to infer programs that capture visual concepts in a domain-general fashion. We introduce Template Programs: programmatic expressions from a domain-specific language that specify structural and parametric patterns common to an input concept. Our framework supports multiple concept-related ta…

    Submitted 9 June, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

    Comments: ICML 2024; Project page: https://rkjones4.github.io/template.html

  8. arXiv:2402.16994  [pdf, other]

    cs.CV cs.AI cs.GR cs.LG

    GEM3D: GEnerative Medial Abstractions for 3D Shape Synthesis

    Authors: Dmitry Petrov, Pradyumn Goyal, Vikas Thamizharasan, Vladimir G. Kim, Matheus Gadelha, Melinos Averkiou, Siddhartha Chaudhuri, Evangelos Kalogerakis

    Abstract: We introduce GEM3D -- a new deep, topology-aware generative model of 3D shapes. The key ingredient of our method is a neural skeleton-based representation encoding information on both shape topology and geometry. Through a denoising diffusion probabilistic model, our method first generates skeleton-based representations following the Medial Axis Transform (MAT), then generates surfaces through a s…

    Submitted 10 April, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: Webpage: https://lodurality.github.io/GEM3D/ -- Cond. accept. to SIGGRAPH 2024 (conf. track) -- Changes (based on reviews): changed style to sigconf; rearranged figures for readability; added missing citations; fixed misaligned centers in Fig. 3; added failure cases (Fig. 10); rewrote discussion; added categories averages to Tab. 8; added Tab. 10 with model capacities

  9. arXiv:2402.08073  [pdf, other]

    cs.LG cs.PL cs.SE

    Grounding Data Science Code Generation with Input-Output Specifications

    Authors: Yeming Wen, Pengcheng Yin, Kensen Shi, Henryk Michalewski, Swarat Chaudhuri, Alex Polozov

    Abstract: Large language models (LLMs) have recently demonstrated a remarkable ability to generate code from natural language (NL) prompts. However, in the real world, NL is often too ambiguous to capture the true intent behind programming problems, requiring additional input-output (I/O) specifications. Unfortunately, LLMs can have difficulty aligning their outputs with both the NL prompt and the I/O speci…

    Submitted 14 March, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

  10. arXiv:2402.04513  [pdf, other]

    cs.LG cs.CL

    Online Cascade Learning for Efficient Inference over Streams

    Authors: Lunyiu Nie, Zhimin Ding, Erdong Hu, Christopher Jermaine, Swarat Chaudhuri

    Abstract: Large Language Models (LLMs) have a natural role in answering complex queries about data streams, but the high computational cost of LLM inference makes them infeasible in many such tasks. We propose online cascade learning, the first approach to address this challenge. The objective here is to learn a "cascade" of models, starting with lower-capacity models (such as logistic regression) and endin…

    Submitted 17 June, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: ICML 2024 Main Conference Paper

  11. arXiv:2312.14182  [pdf, other]

    cs.LG cs.AI cs.CR

    Find the Lady: Permutation and Re-Synchronization of Deep Neural Networks

    Authors: Carl De Sousa Trias, Mihai Petru Mitrea, Attilio Fiandrotti, Marco Cagnazzo, Sumanta Chaudhuri, Enzo Tartaglione

    Abstract: Deep neural networks are characterized by multiple symmetrical, equi-loss solutions that are redundant. Thus, the order of neurons in a layer and feature maps can be given arbitrary permutations, without affecting (or minimally affecting) their output. If we shuffle these neurons, or if we apply to them some perturbations (like fine-tuning), can we put them back in the original order, i.e. re-synchr…

    Submitted 19 December, 2023; originally announced December 2023.

  12. arXiv:2312.07813  [pdf, other]

    cs.OS cs.LG

    On a Foundation Model for Operating Systems

    Authors: Divyanshu Saxena, Nihal Sharma, Donghyun Kim, Rohit Dwivedula, Jiayi Chen, Chenxi Yang, Sriram Ravula, Zichao Hu, Aditya Akella, Sebastian Angel, Joydeep Biswas, Swarat Chaudhuri, Isil Dillig, Alex Dimakis, P. Brighten Godfrey, Daehyeok Kim, Chris Rossbach, Gang Wang

    Abstract: This paper lays down the research agenda for a domain-specific foundation model for operating systems (OSes). Our case for a foundation model revolves around the observations that several OS components such as CPU, memory, and network subsystems are interrelated and that OS traces offer the ideal dataset for a foundation model to grasp the intricacies of diverse OS components and their behavior in…

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: Machine Learning for Systems Workshop at 37th NeurIPS Conference, 2023, New Orleans, LA, USA

  13. arXiv:2312.05677  [pdf, other]

    cs.LG cs.AI cs.CL

    Batched Low-Rank Adaptation of Foundation Models

    Authors: Yeming Wen, Swarat Chaudhuri

    Abstract: Low-Rank Adaptation (LoRA) has recently gained attention for fine-tuning foundation models by incorporating trainable low-rank matrices, thereby reducing the number of trainable parameters. While LoRA offers numerous advantages, its applicability for real-time serving to a diverse and global user base is constrained by its incapability to handle multiple task-specific adapters efficiently. This im…

    Submitted 25 April, 2024; v1 submitted 9 December, 2023; originally announced December 2023.

    Comments: 16 pages, 3 figures

  14. arXiv:2311.00682  [pdf, other]

    physics.ins-det cs.LG

    Deep Learning-Based Classification of Gamma Photon Interactions in Room-Temperature Semiconductor Radiation Detectors

    Authors: Sandeep K. Chaudhuri, Qinyang Li, Krishna C. Mandal, Jianjun Hu

    Abstract: Photon counting radiation detectors have become an integral part of medical imaging modalities such as Positron Emission Tomography or Computed Tomography. Among the most promising detectors are wide-bandgap room-temperature semiconductor detectors, which depend on the interaction of gamma/x-ray photons with the detector material. This interaction involves Compton scattering, which leads to multiple interaction ph…

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: 17 pages

  15. arXiv:2310.16049  [pdf, other]

    cs.CL

    MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning

    Authors: Zayne Sprague, Xi Ye, Kaj Bostrom, Swarat Chaudhuri, Greg Durrett

    Abstract: While large language models (LLMs) equipped with techniques like chain-of-thought prompting have demonstrated impressive capabilities, they still fall short in their ability to reason robustly in complex settings. However, evaluating LLM reasoning is challenging because system capabilities continue to grow while benchmark datasets for tasks like logical deduction have remained static. We introduce…

    Submitted 23 March, 2024; v1 submitted 24 October, 2023; originally announced October 2023.

    Journal ref: ICLR 2024 (Spotlight)

  16. arXiv:2310.13137  [pdf, other]

    cs.CR cs.DS cs.LG stat.ML

    Mean Estimation Under Heterogeneous Privacy Demands

    Authors: Syomantak Chaudhuri, Konstantin Miagkov, Thomas A. Courtade

    Abstract: Differential Privacy (DP) is a well-established framework to quantify privacy loss incurred by any algorithm. Traditional formulations impose a uniform privacy requirement for all users, which is often inconsistent with real-world scenarios in which users dictate their privacy preferences individually. This work considers the problem of mean estimation, where each user can impose their own distinc…

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: A preliminary conference version was published at ISIT 2023 and uploaded to arxiv (arXiv:2305.09668). This version significantly expands on the previous article and is being submitted to a journal

  17. arXiv:2310.12690  [pdf, other]

    cs.LG cs.AI stat.ML

    Neurosymbolic Grounding for Compositional World Models

    Authors: Atharva Sehgal, Arya Grayeli, Jennifer J. Sun, Swarat Chaudhuri

    Abstract: We introduce Cosmos, a framework for object-centric world modeling that is designed for compositional generalization (CompGen), i.e., high performance on unseen input scenes obtained through the composition of known visual "atoms." The central insight behind Cosmos is the use of a novel form of neurosymbolic grounding. Specifically, the framework introduces two new tools: (i) neurosymbolic scene e…

    Submitted 10 May, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: ICLR 2024 camera-ready version

  18. arXiv:2310.09263  [pdf, other]

    cs.CL cs.AI cs.DB

    Table-GPT: Table-tuned GPT for Diverse Table Tasks

    Authors: Peng Li, Yeye He, Dror Yashar, Weiwei Cui, Song Ge, Haidong Zhang, Danielle Rifinski Fainman, Dongmei Zhang, Surajit Chaudhuri

    Abstract: Language models, such as GPT-3.5 and ChatGPT, demonstrate remarkable abilities to follow diverse human instructions and perform a wide range of tasks. However, when probing language models using a range of basic table-understanding tasks, we observe that today's language models are still sub-optimal in many table-related tasks, likely because they are pre-trained predominantly on one-dimensi…

    Submitted 13 October, 2023; originally announced October 2023.

  19. arXiv:2310.07814  [pdf, other]

    cs.GR cs.CV cs.LG

    Explorable Mesh Deformation Subspaces from Unstructured Generative Models

    Authors: Arman Maesumi, Paul Guerrero, Vladimir G. Kim, Matthew Fisher, Siddhartha Chaudhuri, Noam Aigerman, Daniel Ritchie

    Abstract: Exploring variations of 3D shapes is a time-consuming process in traditional 3D modeling tools. Deep generative models of 3D shapes often feature continuous latent spaces that can, in principle, be used to explore potential variations starting from a set of input shapes. In practice, doing so can be problematic: latent spaces are high dimensional and hard to visualize, contain shapes that are not…

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: SIGGRAPH Asia 2023, 15 pages

  20. arXiv:2310.04353  [pdf, other]

    cs.LG cs.AI cs.LO cs.PL

    An In-Context Learning Agent for Formal Theorem-Proving

    Authors: Amitayush Thakur, George Tsoukalas, Yeming Wen, Jimmy Xin, Swarat Chaudhuri

    Abstract: We present an in-context learning agent for formal theorem-proving in environments like Lean and Coq. Current state-of-the-art models for the problem are finetuned on environment-specific proof data. By contrast, our approach, called COPRA, repeatedly asks a high-capacity, general-purpose large language model (GPT-4) to propose tactic applications from within a stateful backtracking search. Propos…

    Submitted 7 February, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

  21. arXiv:2309.01050  [pdf, other]

    cs.CV

    Efficient Curriculum based Continual Learning with Informative Subset Selection for Remote Sensing Scene Classification

    Authors: S Divakar Bhat, Biplab Banerjee, Subhasis Chaudhuri, Avik Bhattacharya

    Abstract: We tackle the problem of class incremental learning (CIL) in the realm of landcover classification from optical remote sensing (RS) images in this paper. The paradigm of CIL has recently gained much prominence given the fact that data are generally obtained in a sequential manner for real-world phenomena. However, CIL has not been extensively considered yet in the domain of RS irrespective of the…

    Submitted 2 September, 2023; originally announced September 2023.

  22. arXiv:2308.09301  [pdf, other]

    cs.LG

    Learning Reward Machines through Preference Queries over Sequences

    Authors: Eric Hsiung, Joydeep Biswas, Swarat Chaudhuri

    Abstract: Reward machines have shown great promise at capturing non-Markovian reward functions for learning tasks that involve complex action sequencing. However, no algorithm currently exists for learning reward machines with realistic weak feedback in the form of preferences. We contribute REMAP, a novel algorithm for learning reward machines from preferences, with correctness and termination guarantees.…

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: 24 pages, 10 figures

  23. arXiv:2308.03908  [pdf, other]

    cs.CV cs.AI cs.LG

    ViLP: Knowledge Exploration using Vision, Language, and Pose Embeddings for Video Action Recognition

    Authors: Soumyabrata Chaudhuri, Saumik Bhattacharya

    Abstract: Video Action Recognition (VAR) is a challenging task due to its inherent complexities. Though different approaches have been explored in the literature, designing a unified framework to recognize a large number of human actions is still a challenging problem. Recently, Multi-Modal Learning (MML) has demonstrated promising results in this domain. In the literature, 2D skeleton or pose modality has ofte…

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: 7 pages, 3 figures, 2 Tables

  24. arXiv:2307.14570  [pdf, other]

    cs.CV cs.RO

    Physically Plausible 3D Human-Scene Reconstruction from Monocular RGB Image using an Adversarial Learning Approach

    Authors: Sandika Biswas, Kejie Li, Biplab Banerjee, Subhasis Chaudhuri, Hamid Rezatofighi

    Abstract: Holistic 3D human-scene reconstruction is a crucial and emerging research area in robot perception. A key challenge in holistic 3D human-scene reconstruction is to generate a physically plausible 3D scene from a single monocular RGB image. The existing research mainly proposes optimization-based approaches for reconstructing the scene from a sequence of RGB frames with explicitly defined physical…

    Submitted 26 July, 2023; originally announced July 2023.

    Comments: Accepted in RAL 2023

  25. arXiv:2307.14565  [pdf, other]

    cs.DB cs.LG

    Auto-Tables: Synthesizing Multi-Step Transformations to Relationalize Tables without Using Examples

    Authors: Peng Li, Yeye He, Cong Yan, Yue Wang, Surajit Chaudhuri

    Abstract: Relational tables, where each row corresponds to an entity and each column corresponds to an attribute, have been the standard for tables in relational databases. However, such a standard cannot be taken for granted when dealing with tables "in the wild". Our survey of real spreadsheet-tables and web-tables shows that over 30% of such tables do not conform to the relational standard, for which com…

    Submitted 9 August, 2023; v1 submitted 26 July, 2023; originally announced July 2023.

    Comments: full version of a paper accepted to VLDB 2023

  26. arXiv:2307.02472  [pdf, other]

    cs.CL cs.AI

    Deductive Additivity for Planning of Natural Language Proofs

    Authors: Zayne Sprague, Kaj Bostrom, Swarat Chaudhuri, Greg Durrett

    Abstract: Current natural language systems designed for multi-step claim validation typically operate in two phases: retrieve a set of relevant premise statements using heuristics (planning), then generate novel conclusions from those statements using a large language model (deduction). The planning step often requires expensive Transformer operations and does not scale to arbitrary numbers of premise state…

    Submitted 5 July, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

  27. arXiv:2306.12515  [pdf, other]

    cs.DB

    Auto-BI: Automatically Build BI-Models Leveraging Local Join Prediction and Global Schema Graph

    Authors: Yiming Lin, Yeye He, Surajit Chaudhuri

    Abstract: Business Intelligence (BI) is crucial in modern enterprises and is a billion-dollar business. Traditionally, technical experts like database administrators would manually prepare BI-models (e.g., in star or snowflake schemas) that join tables in data warehouses, before less-technical business users can run analytics using end-user dashboarding tools. However, the popularity of self-service BI (e.g., Ta…

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: full version of a paper accepted to VLDB 2023

  28. arXiv:2306.08695  [pdf, other]

    cond-mat.mtrl-sci cs.AI

    A generative artificial intelligence framework based on a molecular diffusion model for the design of metal-organic frameworks for carbon capture

    Authors: Hyun Park, Xiaoli Yan, Ruijie Zhu, E. A. Huerta, Santanu Chaudhuri, Donny Cooper, Ian Foster, Emad Tajkhorshid

    Abstract: Metal-organic frameworks (MOFs) exhibit great promise for CO2 capture. However, finding the best performing materials poses computational and experimental grand challenges in view of the vast chemical space of potential building blocks. Here, we introduce GHP-MOFassemble, a generative artificial intelligence (AI), high performance framework for the rational and accelerated design of MOFs with high…

    Submitted 12 March, 2024; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: 25 pages, 17 figures, 6 tables, accepted to Nature Communications Chemistry. This work was awarded the HPCwire 2023 Editors' Choice Awards for Best Use of High Performance Data Analytics & Artificial Intelligence; see https://www.hpcwire.com/2023-readers-editors-choice-data-analytics-ai/

    ACM Class: I.2

    Journal ref: Commun Chem 7, 21 (2024)

  29. arXiv:2306.06545  [pdf, other]

    cs.LG stat.ML

    A Probabilistic Framework for Modular Continual Learning

    Authors: Lazar Valkov, Akash Srivastava, Swarat Chaudhuri, Charles Sutton

    Abstract: Modular approaches that use a different composition of modules for each problem are a promising direction in continual learning (CL). However, searching through the large, discrete space of module compositions is challenging, especially because evaluating a composition's performance requires a round of neural network training. We address this challenge through a modular CL framework, PICLE, that u…

    Submitted 2 May, 2024; v1 submitted 10 June, 2023; originally announced June 2023.

  30. arXiv:2306.02421  [pdf, other]

    cs.DB cs.LG

    Auto-Validate by-History: Auto-Program Data Quality Constraints to Validate Recurring Data Pipelines

    Authors: Dezhan Tu, Yeye He, Weiwei Cui, Song Ge, Haidong Zhang, Han Shi, Dongmei Zhang, Surajit Chaudhuri

    Abstract: Data pipelines are widely employed in modern enterprises to power a variety of Machine-Learning (ML) and Business-Intelligence (BI) applications. Crucially, these pipelines are recurring (e.g., daily or hourly) in production settings to keep data updated so that ML models can be re-trained regularly, and BI dashboards refreshed frequently. However, data quality (DQ) issues can often creep i…

    Submitted 4 June, 2023; originally announced June 2023.

    Comments: full version of a paper accepted to KDD 2023

  31. arXiv:2305.18341  [pdf, other]

    cs.PL cs.AI cs.LG

    Coarse-Tuning Models of Code with Reinforcement Learning Feedback

    Authors: Abhinav Jain, Chima Adiole, Swarat Chaudhuri, Thomas Reps, Chris Jermaine

    Abstract: Large Language Models (LLMs) pre-trained on code have recently emerged as the dominant approach to program synthesis. However, these models are trained using next-token prediction, which ignores the syntax and semantics of code. We propose RLCF, which further trains a pre-trained LLM via reinforcement learning, using feedback from a grounding function that scores the quality of the code. The ground…

    Submitted 23 December, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: 23 pages

  32. arXiv:2305.13610  [pdf, other]

    cs.FL quant-ph

    Weighted Context-Free-Language Ordered Binary Decision Diagrams

    Authors: Meghana Sistla, Swarat Chaudhuri, Thomas Reps

    Abstract: Over the years, many variants of Binary Decision Diagrams (BDDs) have been developed to address the deficiencies of vanilla BDDs. A recent innovation is the Context-Free-Language Ordered BDD (CFLOBDD), a hierarchically structured decision diagram, akin to BDDs enhanced with a procedure-call mechanism, which allows substructures to be shared in ways not possible with BDDs. For some functions, CFLOB…

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: 21 pages

  33. arXiv:2305.09668  [pdf, other]

    cs.CR cs.DS cs.LG stat.ML

    Mean Estimation Under Heterogeneous Privacy: Some Privacy Can Be Free

    Authors: Syomantak Chaudhuri, Thomas A. Courtade

    Abstract: Differential Privacy (DP) is a well-established framework to quantify privacy loss incurred by any algorithm. Traditional DP formulations impose a uniform privacy requirement for all users, which is often inconsistent with real-world scenarios in which users dictate their privacy preferences individually. This work considers the problem of mean estimation under heterogeneous DP constraints, where…

    Submitted 27 April, 2023; originally announced May 2023.

    Comments: To appear at ISIT 2023

  34. arXiv:2304.07445  [pdf, other]

    cs.LG

    A framework for fully autonomous design of materials via multiobjective optimization and active learning: challenges and next steps

    Authors: Tyler H. Chang, Jakob R. Elias, Stefan M. Wild, Santanu Chaudhuri, Joseph A. Libera

    Abstract: In order to deploy machine learning in a real-world self-driving laboratory where data acquisition is costly and there are multiple competing design criteria, systems need to be able to intelligently sample while balancing performance trade-offs and constraints. For these reasons, we present an active learning process based on multiobjective black-box optimization with continuously updated machine…

    Submitted 14 April, 2023; originally announced April 2023.

  35. arXiv:2303.11886  [pdf, other]

    cs.GR

    Fast Complementary Dynamics via Skinning Eigenmodes

    Authors: Otman Benchekroun, Jiayi Eris Zhang, Siddhartha Chaudhuri, Eitan Grinspun, Yi Zhou, Alec Jacobson

    Abstract: We propose a reduced-space elasto-dynamic solver that is well suited for augmenting rigged character animations with secondary motion. At the core of our method is a novel deformation subspace based on Linear Blend Skinning that overcomes many of the shortcomings prior subspace methods face. Our skinning subspace is parameterized entirely by a set of scalar weights, which we can obtain through a s…

    Submitted 19 June, 2023; v1 submitted 21 March, 2023; originally announced March 2023.

    Comments: 20 pages, 24 figures

    ACM Class: I.3.2; I.3.5; I.3.6; I.3.7; I.3.8

  36. arXiv:2303.01999  [pdf, other]

    cs.CV cs.GR

    Unsupervised 3D Shape Reconstruction by Part Retrieval and Assembly

    Authors: Xianghao Xu, Paul Guerrero, Matthew Fisher, Siddhartha Chaudhuri, Daniel Ritchie

    Abstract: Representing a 3D shape with a set of primitives can aid perception of structure, improve robotic object manipulation, and enable editing, stylization, and compression of 3D shapes. Existing methods either use simple parametric primitives or learn a generative shape space of parts. Both have limitations: parametric primitives lead to coarse approximations, while learned parts offer too little cont…

    Submitted 3 March, 2023; originally announced March 2023.

    Comments: CVPR 2023

  37. MultiScale Probability Map guided Index Pooling with Attention-based learning for Road and Building Segmentation

    Authors: Shirsha Bose, Ritesh Sur Chowdhury, Debabrata Pal, Shivashish Bose, Biplab Banerjee, Subhasis Chaudhuri

    Abstract: Efficient road and building footprint extraction from satellite images is predominant in many remote sensing applications. However, precise segmentation map extraction is quite challenging due to the diverse building structures camouflaged by trees, similar spectral responses between the roads and buildings, and occlusions by heterogeneous traffic over the roads. Existing convolutional neural net…

    Submitted 18 February, 2023; originally announced February 2023.

    Journal ref: ISPRS Journal of Photogrammetry and Remote Sensing 2023

  38. arXiv:2302.04349  [pdf, other]

    cs.FL cs.SC quant-ph

    Symbolic Quantum Simulation with Quasimodo

    Authors: Meghana Sistla, Swarat Chaudhuri, Thomas Reps

    Abstract: The simulation of quantum circuits on classical computers is an important problem in quantum computing. Such simulation requires representations of distributions over very large sets of basis vectors, and recent work has used symbolic data-structures such as Binary Decision Diagrams (BDDs) for this purpose. In this tool paper, we present Quasimodo, an extensible, open-source Python library for sym…

    Submitted 29 May, 2023; v1 submitted 8 February, 2023; originally announced February 2023.

    Comments: 15 pages; 35th International Conference on Computer Aided Verification (CAV 2023)

  39. arXiv:2301.11374  [pdf, other]

    cs.LG cs.AI

    Certifiably Robust Reinforcement Learning through Model-Based Abstract Interpretation

    Authors: Chenxi Yang, Greg Anderson, Swarat Chaudhuri

    Abstract: We present a reinforcement learning (RL) framework in which the learned policy comes with a machine-checkable certificate of provable adversarial robustness. Our approach, called CAROL, learns a model of the environment. In each learning iteration, it uses the current version of this model and an external abstract interpreter to construct a differentiable signal for provable robustness. This signa…

    Submitted 26 May, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

  40. arXiv:2212.11317  [pdf, other]

    cond-mat.mtrl-sci cs.AI cs.LG

    End-to-end AI framework for interpretable prediction of molecular and crystal properties

    Authors: Hyun Park, Ruijie Zhu, E. A. Huerta, Santanu Chaudhuri, Emad Tajkhorshid, Donny Cooper

    Abstract: We introduce an end-to-end computational framework that allows for hyperparameter optimization using the DeepHyper library, accelerated model training, and interpretable AI inference. The framework is based on state-of-the-art AI models including CGCNN, PhysNet, SchNet, MPNN, MPNN-transformer, and TorchMD-NET. We employ these AI models along with the benchmark QM9, hMOF, and MD17 datasets to showc…

    Submitted 14 August, 2023; v1 submitted 21 December, 2022; originally announced December 2022.

    Comments: 20 pages, 10 images, 6 tables; v2: accepted to Machine Learning: Science and Technology

    ACM Class: I.2

    Journal ref: Mach. Learn.: Sci. Technol. 4 (2023) 025036

  41. CFLOBDDs: Context-Free-Language Ordered Binary Decision Diagrams

    Authors: Meghana Sistla, Swarat Chaudhuri, Thomas Reps

    Abstract: This paper presents a new compressed representation of Boolean functions, called CFLOBDDs (for Context-Free-Language Ordered Binary Decision Diagrams). They are essentially a plug-compatible alternative to BDDs (Binary Decision Diagrams), and hence useful for representing certain classes of functions, matrices, graphs, relations, etc. in a highly compressed fashion. CFLOBDDs share many of the good…

    Submitted 6 May, 2024; v1 submitted 12 November, 2022; originally announced November 2022.

    Comments: 144 pages

    Journal ref: TOPLAS 2024

  42. Prototypical quadruplet for few-shot class incremental learning

    Authors: Sanchar Palit, Biplab Banerjee, Subhasis Chaudhuri

    Abstract: Scarcity of data and incremental learning of new tasks pose two major bottlenecks for many modern computer vision algorithms. The phenomenon of catastrophic forgetting, i.e., the model's inability to classify previously learned data after training with new batches of data, is a major challenge. Conventional methods address catastrophic forgetting while compromising the current session's training.…

    Submitted 8 April, 2023; v1 submitted 5 November, 2022; originally announced November 2022.

    Comments: Submitted to IJCNN 2023

    Journal ref: Proc.Comp.Sc. 222 (2023) 25-34

  43. arXiv:2211.00614  [pdf, other]

    cs.CL

    Natural Language Deduction with Incomplete Information

    Authors: Zayne Sprague, Kaj Bostrom, Swarat Chaudhuri, Greg Durrett

    Abstract: A growing body of work studies how to answer a question or verify a claim by generating a natural language "proof": a chain of deductive inferences yielding the answer based on a set of premises. However, these methods can only make sound deductions when they follow from evidence that is given. We propose a new system that can handle the underspecified setting where not all premises are stated at…

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: Conference of EMNLP 2022

  44. arXiv:2210.05050  [pdf, other]

    cs.AI

    Neurosymbolic Programming for Science

    Authors: Jennifer J. Sun, Megan Tjandrasuwita, Atharva Sehgal, Armando Solar-Lezama, Swarat Chaudhuri, Yisong Yue, Omar Costilla-Reyes

    Abstract: Neurosymbolic Programming (NP) techniques have the potential to accelerate scientific discovery. These models combine neural and symbolic components to learn complex patterns and representations from data, using high-level concepts or known constraints. NP techniques can interface with symbolic domain knowledge from scientists, such as prior knowledge and experimental context, to produce interpret…

    Submitted 7 November, 2022; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: Neural Information Processing Systems 2022 - AI for science workshop

  45. arXiv:2209.14148  [pdf, other]

    cs.LG

    Guiding Safe Exploration with Weakest Preconditions

    Authors: Greg Anderson, Swarat Chaudhuri, Isil Dillig

    Abstract: In reinforcement learning for safety-critical settings, it is often desirable for the agent to obey safety constraints at all points in time, including during training. We present a novel neurosymbolic approach called SPICE to solve this safe exploration problem. SPICE uses an online shielding layer based on symbolic weakest preconditions to achieve a more precise safety analysis than existing too…

    Submitted 27 February, 2023; v1 submitted 28 September, 2022; originally announced September 2022.

  46. arXiv:2207.11790  [pdf, other]

    cs.CV

    PatchRD: Detail-Preserving Shape Completion by Learning Patch Retrieval and Deformation

    Authors: Bo Sun, Vladimir G. Kim, Noam Aigerman, Qixing Huang, Siddhartha Chaudhuri

    Abstract: This paper introduces a data-driven shape completion approach that focuses on completing geometric details of missing regions of 3D shapes. We observe that existing generative methods lack the training data and representation capacity to synthesize plausible, fine-grained details with complex geometry and topology. Our key insight is to copy and deform patches from the partial input to complete mi…

    Submitted 24 July, 2022; originally announced July 2022.

  47. arXiv:2206.09546  [pdf, other]

    cs.LG cs.AI cs.LO

    Policy Optimization with Linear Temporal Logic Constraints

    Authors: Cameron Voloshin, Hoang M. Le, Swarat Chaudhuri, Yisong Yue

    Abstract: We study the problem of policy optimization (PO) with linear temporal logic (LTL) constraints. The language of LTL allows flexible description of tasks that may be unnatural to encode as a scalar cost function. We consider LTL-constrained PO as a systematic framework, decoupling task specification from policy selection, and as an alternative to the standard of cost shaping. With access to a genera…

    Submitted 19 October, 2022; v1 submitted 19 June, 2022; originally announced June 2022.

  48. arXiv:2205.02904  [pdf, other]

    cs.GR cs.CV cs.LG

    Neural Jacobian Fields: Learning Intrinsic Mappings of Arbitrary Meshes

    Authors: Noam Aigerman, Kunal Gupta, Vladimir G. Kim, Siddhartha Chaudhuri, Jun Saito, Thibault Groueix

    Abstract: This paper introduces a framework designed to accurately predict piecewise linear mappings of arbitrary meshes via a neural network, enabling training and evaluating over heterogeneous collections of meshes that do not share a triangulation, as well as producing highly detail-preserving maps whose accuracy exceeds current state of the art. The framework is based on reducing the neural aspect to a…

    Submitted 5 May, 2022; originally announced May 2022.

  49. arXiv:2205.01476  [pdf, other]

    cs.DC eess.SY

    Real-Time Streaming and Event-driven Control of Scientific Experiments

    Authors: Jakob R. Elias, Ryan Chard, Maksim Levental, Zhengchun Liu, Ian Foster, Santanu Chaudhuri

    Abstract: Advancements in scientific instrument sensors and connected devices provide unprecedented insight into ongoing experiments and present new opportunities for control, optimization, and steering. However, the diversity of sensors and heterogeneity of their data make it challenging to fully realize these new opportunities. Organizing and synthesizing diverse data streams in near-real-time r…

    Submitted 3 May, 2022; originally announced May 2022.

  50. arXiv:2204.09513  [pdf]

    cs.LG physics.flu-dyn

    Physics-Informed Bayesian Learning of Electrohydrodynamic Polymer Jet Printing Dynamics

    Authors: Athanasios Oikonomou, Theodoros Loutas, Dixia Fan, Alysia Garmulewicz, George Nounesis, Santanu Chaudhuri, Filippos Tourlomousis

    Abstract: Calibration of highly dynamic multi-physics manufacturing processes such as electro-hydrodynamics-based additive manufacturing (AM) technologies (E-jet printing) is still performed by labor-intensive trial-and-error practices. These practices have hindered the broad adoption of these technologies, demanding a new paradigm of self-calibrating E-jet printing machines. To address this need, we develo…

    Submitted 15 April, 2022; originally announced April 2022.

    Comments: 36 pages, 9 figures, 4 supporting figures, research article