Skip to main content

Showing 1–21 of 21 results for author: Ghoshal, A

  1. arXiv:2402.08055  [pdf, other

    quant-ph cs.DC cs.ET

    A Quantum Algorithm Based Heuristic to Hide Sensitive Itemsets

    Authors: Abhijeet Ghoshal, Yan Li, Syam Menon, Sumit Sarkar

    Abstract: Quantum devices use qubits to represent information, which allows them to exploit important properties from quantum physics, specifically superposition and entanglement. As a result, quantum computers have the potential to outperform the most advanced classical computers. In recent years, quantum algorithms have shown hints of this promise, and many algorithms have been proposed for the quantum do… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

    Journal ref: Workshop on Information Technologies and Systems WITS 2023

  2. arXiv:2307.09142  [pdf, other

    physics.flu-dyn cs.LG

    Characterization of partial wetting by CMAS droplets using multiphase many-body dissipative particle dynamics and data-driven discovery based on PINNs

    Authors: Elham Kiyani, Mahdi Kooshkbaghi, Khemraj Shukla, Rahul Babu Koneru, Zhen Li, Luis Bravo, Anindya Ghoshal, George Em Karniadakis, Mikko Karttunen

    Abstract: The molten sand, a mixture of calcia, magnesia, alumina, and silicate, known as CMAS, is characterized by its high viscosity, density, and surface tension. The unique properties of CMAS make it a challenging material to deal with in high-temperature applications, requiring innovative solutions and materials to prevent its buildup and damage to critical equipment. Here, we use multiphase many-body… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

  3. arXiv:2302.00807  [pdf, other

    physics.flu-dyn cs.AI math.OC

    Deep neural operators can serve as accurate surrogates for shape optimization: A case study for airfoils

    Authors: Khemraj Shukla, Vivek Oommen, Ahmad Peyvan, Michael Penwarden, Luis Bravo, Anindya Ghoshal, Robert M. Kirby, George Em Karniadakis

    Abstract: Deep neural operators, such as DeepONets, have changed the paradigm in high-dimensional nonlinear regression from function regression to (differential) operator regression, paving the way for significant changes in computational engineering applications. Here, we investigate the use of DeepONets to infer flow fields around unseen airfoils with the aim of shape optimization, an important design pro… ▽ More

    Submitted 1 February, 2023; originally announced February 2023.

    Comments: 21 pages, 14 Figures

  4. arXiv:2212.09726  [pdf, other

    cs.CL cs.AI cs.LG

    Improving Faithfulness of Abstractive Summarization by Controlling Confounding Effect of Irrelevant Sentences

    Authors: Asish Ghoshal, Arash Einolghozati, Ankit Arun, Haoran Li, Lili Yu, Vera Gor, Yashar Mehdad, Scott Wen-tau Yih, Asli Celikyilmaz

    Abstract: Lack of factual correctness is an issue that still plagues state-of-the-art summarization systems despite their impressive progress on generating seemingly fluent summaries. In this paper, we show that factual inconsistency can be caused by irrelevant parts of the input text, which act as confounders. To that end, we leverage information-theoretic measures of causal effects to quantify the amount… ▽ More

    Submitted 18 January, 2024; v1 submitted 19 December, 2022; originally announced December 2022.

  5. arXiv:2211.10411  [pdf, other

    cs.IR cs.CL

    CITADEL: Conditional Token Interaction via Dynamic Lexical Routing for Efficient and Effective Multi-Vector Retrieval

    Authors: Minghan Li, Sheng-Chieh Lin, Barlas Oguz, Asish Ghoshal, Jimmy Lin, Yashar Mehdad, Wen-tau Yih, Xilun Chen

    Abstract: Multi-vector retrieval methods combine the merits of sparse (e.g. BM25) and dense (e.g. DPR) retrievers and have achieved state-of-the-art performance on various retrieval tasks. These methods, however, are orders of magnitude slower and need much more space to store their indices compared to their single-vector counterparts. In this paper, we unify different multi-vector retrieval models from a t… ▽ More

    Submitted 18 November, 2022; originally announced November 2022.

  6. arXiv:2211.01438  [pdf, other

    eess.AS cs.CL cs.SD

    Variable Attention Masking for Configurable Transformer Transducer Speech Recognition

    Authors: Pawel Swietojanski, Stefan Braun, Dogan Can, Thiago Fraga da Silva, Arnab Ghoshal, Takaaki Hori, Roger Hsiao, Henry Mason, Erik McDermott, Honza Silovsky, Ruchir Travadi, Xiaodan Zhuang

    Abstract: This work studies the use of attention masking in transformer transducer based speech recognition for building a single configurable model for different deployment scenarios. We present a comprehensive set of experiments comparing fixed masking, where the same attention mask is applied at every frame, with chunked masking, where the attention mask for each frame is determined by chunk boundaries,… ▽ More

    Submitted 18 April, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: To appear in ICASSP 2023

    Journal ref: International Conference on Acoustics, Speech, and Signal Processing, 2023 International Conference on Acoustics, Speech, and Signal Processing International Conference on Acoustics, Speech, and Signal Processing

  7. arXiv:2210.12214  [pdf, ps, other

    cs.SD cs.CL eess.AS

    Optimizing Bilingual Neural Transducer with Synthetic Code-switching Text Generation

    Authors: Thien Nguyen, Nathalie Tran, Liuhui Deng, Thiago Fraga da Silva, Matthew Radzihovsky, Roger Hsiao, Henry Mason, Stefan Braun, Erik McDermott, Dogan Can, Pawel Swietojanski, Lyan Verwimp, Sibel Oyman, Tresi Arvizo, Honza Silovsky, Arnab Ghoshal, Mathieu Martel, Bharat Ram Ambati, Mohamed Ali

    Abstract: Code-switching describes the practice of using more than one language in the same sentence. In this study, we investigate how to optimize a neural transducer based bilingual automatic speech recognition (ASR) model for code-switching speech. Focusing on the scenario where the ASR model is trained without supervised code-switching data, we found that semi-supervised training and synthetic code-swit… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: 5 pages, 1 figure, submitted to ICASSP 2023, *: equal contributions

  8. arXiv:2205.00485  [pdf, ps, other

    cs.CL cs.SD eess.AS

    Bilingual End-to-End ASR with Byte-Level Subwords

    Authors: Liuhui Deng, Roger Hsiao, Arnab Ghoshal

    Abstract: In this paper, we investigate how the output representation of an end-to-end neural network affects multilingual automatic speech recognition (ASR). We study different representations including character-level, byte-level, byte pair encoding (BPE), and byte-level byte pair encoding (BBPE) representations, and analyze their strengths and weaknesses. We focus on developing a single end-to-end model… ▽ More

    Submitted 1 May, 2022; originally announced May 2022.

    Comments: 5 pages, to be published in IEEE ICASSP 2022

  9. arXiv:2101.00977  [pdf, other

    cs.LG

    Towards Understanding the Behaviors of Optimal Deep Active Learning Algorithms

    Authors: Yilun Zhou, Adithya Renduchintala, Xian Li, Sida Wang, Yashar Mehdad, Asish Ghoshal

    Abstract: Active learning (AL) algorithms may achieve better performance with fewer data because the model guides the data selection process. While many algorithms have been proposed, there is little study on what the optimal AL algorithm looks like, which would help researchers understand where their models fall short and iterate on the design. In this paper, we present a simulated annealing algorithm to s… ▽ More

    Submitted 20 February, 2021; v1 submitted 29 December, 2020; originally announced January 2021.

    Comments: AISTATS 2021

  10. arXiv:2012.15482  [pdf, other

    cs.CL

    FiD-Ex: Improving Sequence-to-Sequence Models for Extractive Rationale Generation

    Authors: Kushal Lakhotia, Bhargavi Paranjape, Asish Ghoshal, Wen-tau Yih, Yashar Mehdad, Srinivasan Iyer

    Abstract: Natural language (NL) explanations of model predictions are gaining popularity as a means to understand and verify decisions made by large black-box pre-trained models, for NLP tasks such as Question Answering (QA) and Fact Verification. Recently, pre-trained sequence to sequence (seq2seq) models have proven to be very effective in jointly making predictions, as well as generating NL explanations.… ▽ More

    Submitted 31 December, 2020; originally announced December 2020.

  11. arXiv:2010.03546  [pdf, other

    cs.CL

    Low-Resource Domain Adaptation for Compositional Task-Oriented Semantic Parsing

    Authors: Xilun Chen, Asish Ghoshal, Yashar Mehdad, Luke Zettlemoyer, Sonal Gupta

    Abstract: Task-oriented semantic parsing is a critical component of virtual assistants, which is responsible for understanding the user's intents (set reminder, play music, etc.). Recent advances in deep learning have enabled several approaches to successfully parse more complex queries (Gupta et al., 2018; Rongali et al.,2020), but these models require a large amount of annotated training data to parse que… ▽ More

    Submitted 7 October, 2020; originally announced October 2020.

    Comments: EMNLP 2020

  12. arXiv:2008.05514  [pdf, other

    eess.AS cs.CL cs.SD

    Online Automatic Speech Recognition with Listen, Attend and Spell Model

    Authors: Roger Hsiao, Dogan Can, Tim Ng, Ruchir Travadi, Arnab Ghoshal

    Abstract: The Listen, Attend and Spell (LAS) model and other attention-based automatic speech recognition (ASR) models have known limitations when operated in a fully online mode. In this paper, we analyze the online operation of LAS models to demonstrate that these limitations stem from the handling of silence regions and the reliability of online attention mechanism at the edge of input buffers. We propos… ▽ More

    Submitted 13 October, 2020; v1 submitted 12 August, 2020; originally announced August 2020.

    Comments: 5 pages, 4 figures, this version is submitted to IEEE Signal Processing Letters

  13. arXiv:2001.11019  [pdf, other

    eess.AS cs.LG cs.SD stat.ML

    Improving Language Identification for Multilingual Speakers

    Authors: Andrew Titus, Jan Silovsky, Nanxin Chen, Roger Hsiao, Mary Young, Arnab Ghoshal

    Abstract: Spoken language identification (LID) technologies have improved in recent years from discriminating largely distinct languages to discriminating highly similar languages or even dialects of the same language. One aspect that has been mostly neglected, however, is discrimination of languages for multilingual speakers, despite being a primary target audience of many systems that utilize LID technolo… ▽ More

    Submitted 29 January, 2020; originally announced January 2020.

    Comments: 5 pages, 2 figures. Submitted to ICASSP 2020

  14. arXiv:1906.00449  [pdf, ps, other

    cs.LG stat.ML

    Minimax bounds for structured prediction

    Authors: Kevin Bello, Asish Ghoshal, Jean Honorio

    Abstract: Structured prediction can be considered as a generalization of many standard supervised learning tasks, and is usually thought as a simultaneous prediction of multiple labels. One standard approach is to maximize a score function on the space of labels, which decomposes as a sum of unary and pairwise potentials, each depending on one or two specific labels, respectively. For this approach, several… ▽ More

    Submitted 2 June, 2019; originally announced June 2019.

    Journal ref: Artificial Intelligence and Statistics (AISTATS), 2020

  15. arXiv:1805.08196  [pdf, other

    cs.LG stat.ML

    Learning Maximum-A-Posteriori Perturbation Models for Structured Prediction in Polynomial Time

    Authors: Asish Ghoshal, Jean Honorio

    Abstract: MAP perturbation models have emerged as a powerful framework for inference in structured prediction. Such models provide a way to efficiently sample from the Gibbs distribution and facilitate predictions that are robust to random noise. In this paper, we propose a provably polynomial time randomized algorithm for learning the parameters of perturbed MAP predictors. Our approach is based on minimiz… ▽ More

    Submitted 21 May, 2018; originally announced May 2018.

    Comments: Accepted to ICML 2018

    Journal ref: International Conference on Machine Learning (ICML), 2018

  16. arXiv:1707.04673  [pdf, other

    cs.LG stat.ML

    Learning linear structural equation models in polynomial time and sample complexity

    Authors: Asish Ghoshal, Jean Honorio

    Abstract: The problem of learning structural equation models (SEMs) from data is a fundamental problem in causal inference. We develop a new algorithm --- which is computationally and statistically efficient and works in the high-dimensional regime --- for learning linear SEMs from purely observational data with arbitrary noise distribution. We consider three aspects of the problem: identifiability, computa… ▽ More

    Submitted 14 July, 2017; originally announced July 2017.

    Journal ref: International Conference on Artificial Intelligence and Statistics (AISTATS) 2018

  17. arXiv:1706.05648  [pdf, other

    cs.LG

    Learning Sparse Polymatrix Games in Polynomial Time and Sample Complexity

    Authors: Asish Ghoshal, Jean Honorio

    Abstract: We consider the problem of learning sparse polymatrix games from observations of strategic interactions. We show that a polynomial time method based on $\ell_{1,2}$-group regularized logistic regression recovers a game, whose Nash equilibria are the $ε$-Nash equilibria of the game from which the data was generated (true game), in $\mathcal{O}(m^4 d^4 \log (pd))$ samples of strategy profiles --- wh… ▽ More

    Submitted 20 November, 2017; v1 submitted 18 June, 2017; originally announced June 2017.

    Journal ref: International Conference on Artificial Intelligence and Statistics (AISTATS) 2018

  18. arXiv:1703.01218  [pdf, other

    cs.LG

    Learning Graphical Games from Behavioral Data: Sufficient and Necessary Conditions

    Authors: Asish Ghoshal, Jean Honorio

    Abstract: In this paper we obtain sufficient and necessary conditions on the number of samples required for exact recovery of the pure-strategy Nash equilibria (PSNE) set of a graphical game from noisy observations of joint actions. We consider sparse linear influence games --- a parametric class of graphical games with linear payoffs, and represented by directed graphs of n nodes (players) and in-degree of… ▽ More

    Submitted 3 March, 2017; originally announced March 2017.

    Comments: Accepted to AISTATS 2017, Florida. arXiv admin note: substantial text overlap with arXiv:1607.02959

  19. arXiv:1703.01196  [pdf, other

    cs.LG stat.ML

    Learning Identifiable Gaussian Bayesian Networks in Polynomial Time and Sample Complexity

    Authors: Asish Ghoshal, Jean Honorio

    Abstract: Learning the directed acyclic graph (DAG) structure of a Bayesian network from observational data is a notoriously difficult problem for which many hardness results are known. In this paper we propose a provably polynomial-time algorithm for learning sparse Gaussian Bayesian networks with equal noise variance --- a class of Bayesian networks for which the DAG structure can be uniquely identified f… ▽ More

    Submitted 3 March, 2017; originally announced March 2017.

    Journal ref: Neural Information Processing Systems (NIPS) 2017

  20. arXiv:1607.02959  [pdf, other

    cs.GT cs.LG stat.ML

    From Behavior to Sparse Graphical Games: Efficient Recovery of Equilibria

    Authors: Asish Ghoshal, Jean Honorio

    Abstract: In this paper we study the problem of exact recovery of the pure-strategy Nash equilibria (PSNE) set of a graphical game from noisy observations of joint actions of the players alone. We consider sparse linear influence games --- a parametric class of graphical games with linear payoffs, and represented by directed graphs of n nodes (players) and in-degree of at most k. We present an $\ell_1$-regu… ▽ More

    Submitted 19 October, 2016; v1 submitted 11 July, 2016; originally announced July 2016.

    Comments: Accepted at 54th Annual Allerton Conference on Communication, Control, and Computing (2016)

    Journal ref: Allerton Conference on Communication, Control, and Computing (2016)

  21. arXiv:1601.07460  [pdf, other

    cs.LG cs.IT stat.ML

    Information-theoretic limits of Bayesian network structure learning

    Authors: Asish Ghoshal, Jean Honorio

    Abstract: In this paper, we study the information-theoretic limits of learning the structure of Bayesian networks (BNs), on discrete as well as continuous random variables, from a finite number of samples. We show that the minimum number of samples required by any procedure to recover the correct structure grows as $Ω(m)$ and $Ω(k \log m + (k^2/m))$ for non-sparse and sparse BNs respectively, where $m$ is t… ▽ More

    Submitted 3 March, 2017; v1 submitted 27 January, 2016; originally announced January 2016.

    Comments: Accepted to AISTATS 2017, Florida

    Journal ref: International Conference on Artificial Intelligence and Statistics (AISTATS), 2017