Skip to main content

Showing 1–16 of 16 results for author: Navratil, J

  1. arXiv:2406.05882  [pdf, other

    cs.LG stat.ML

    Distributional Preference Alignment of LLMs via Optimal Transport

    Authors: Igor Melnyk, Youssef Mroueh, Brian Belgodere, Mattia Rigotti, Apoorva Nitsure, Mikhail Yurochkin, Kristjan Greenewald, Jiri Navratil, Jerret Ross

    Abstract: Current LLM alignment techniques use pairwise human preferences at a sample level, and as such, they do not imply an alignment on the distributional level. We propose in this paper Alignment via Optimal Transport (AOT), a novel method for distributional preference alignment of LLMs. AOT aligns LLMs on unpaired preference data by making the reward distribution of the positive samples stochastically… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  2. arXiv:2403.10638  [pdf, other

    cs.LG cs.CY stat.ML

    A resource-constrained stochastic scheduling algorithm for homeless street outreach and gleaning edible food

    Authors: Conor M. Artman, Aditya Mate, Ezinne Nwankwo, Aliza Heching, Tsuyoshi Idé, Jiří Navrátil, Karthikeyan Shanmugam, Wei Sun, Kush R. Varshney, Lauri Goldkind, Gidi Kroch, Jaclyn Sawyer, Ian Watson

    Abstract: We developed a common algorithmic solution addressing the problem of resource-constrained outreach encountered by social change organizations with different missions and operations: Breaking Ground -- an organization that helps individuals experiencing homelessness in New York transition to permanent housing and Leket -- the national food bank of Israel that rescues food from farms and elsewhere t… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  3. arXiv:2402.03726  [pdf, other

    cs.LG stat.ML

    Learning Granger Causality from Instance-wise Self-attentive Hawkes Processes

    Authors: Dongxia Wu, Tsuyoshi Idé, Aurélie Lozano, Georgios Kollias, Jiří Navrátil, Naoki Abe, Yi-An Ma, Rose Yu

    Abstract: We address the problem of learning Granger causality from asynchronous, interdependent, multi-type event sequences. In particular, we are interested in discovering instance-level causal structures in an unsupervised manner. Instance-level causality identifies causal relationships among individual events, providing more fine-grained information for decision-making. Existing work in the literature e… ▽ More

    Submitted 29 February, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  4. arXiv:2310.07132  [pdf, other

    cs.LG math.ST q-fin.RM stat.ML

    Risk Aware Benchmarking of Large Language Models

    Authors: Apoorva Nitsure, Youssef Mroueh, Mattia Rigotti, Kristjan Greenewald, Brian Belgodere, Mikhail Yurochkin, Jiri Navratil, Igor Melnyk, Jerret Ross

    Abstract: We propose a distributional framework for benchmarking socio-technical risks of foundation models with quantified statistical significance. Our approach hinges on a new statistical relative testing based on first and second order stochastic dominance of real random variables. We show that the second order statistics in this test are linked to mean-risk models commonly used in econometrics and math… ▽ More

    Submitted 9 June, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: ICML 2024

  5. arXiv:2310.03158  [pdf, other

    cs.LG cs.AI

    Assessment of Prediction Intervals Using Uncertainty Characteristics Curves

    Authors: Jiri Navratil, Benjamin Elder, Matthew Arnold, Soumya Ghosh, Prasanna Sattigeri

    Abstract: Accurate quantification of model uncertainty has long been recognized as a fundamental requirement for trusted AI. In regression tasks, uncertainty is typically quantified using prediction intervals calibrated to an ad-hoc operating point, making evaluation and comparison across different studies relatively difficult. Our work leverages: (1) the concept of operating characteristics curves and (2)… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: Published at Workshop on Distribution-Free Uncertainty Quantification, International Conference on Machine Learning (ICML), July 2022. arXiv admin note: substantial text overlap with arXiv:2106.00858

  6. arXiv:2304.10819  [pdf, other

    cs.LG cs.AI stat.ML

    Auditing and Generating Synthetic Data with Controllable Trust Trade-offs

    Authors: Brian Belgodere, Pierre Dognin, Adam Ivankay, Igor Melnyk, Youssef Mroueh, Aleksandra Mojsilovic, Jiri Navratil, Apoorva Nitsure, Inkit Padhi, Mattia Rigotti, Jerret Ross, Yair Schiff, Radhika Vedpathak, Richard A. Young

    Abstract: Real-world data often exhibits bias, imbalance, and privacy risks. Synthetic datasets have emerged to address these issues. This paradigm relies on generative AI models to generate unbiased, privacy-preserving data while maintaining fidelity to the original data. However, assessing the trustworthiness of synthetic datasets and models is a critical challenge. We introduce a holistic auditing framew… ▽ More

    Submitted 9 June, 2024; v1 submitted 21 April, 2023; originally announced April 2023.

    Comments: submitted

  7. Anomaly Attribution with Likelihood Compensation

    Authors: Tsuyoshi Idé, Amit Dhurandhar, Jiří Navrátil, Moninder Singh, Naoki Abe

    Abstract: This paper addresses the task of explaining anomalous predictions of a black-box regression model. When using a black-box model, such as one to predict building energy consumption from many sensor measurements, we often have a situation where some observed samples may significantly deviate from their prediction. It may be due to a sub-optimal black-box model, or simply because those samples are ou… ▽ More

    Submitted 22 August, 2022; originally announced August 2022.

    Comments: 8 pages, 7 figures

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence, 35(5), 4131-4138, 2021

  8. arXiv:2106.01410  [pdf, other

    cs.AI

    Uncertainty Quantification 360: A Holistic Toolkit for Quantifying and Communicating the Uncertainty of AI

    Authors: Soumya Ghosh, Q. Vera Liao, Karthikeyan Natesan Ramamurthy, Jiri Navratil, Prasanna Sattigeri, Kush R. Varshney, Yunfeng Zhang

    Abstract: In this paper, we describe an open source Python toolkit named Uncertainty Quantification 360 (UQ360) for the uncertainty quantification of AI models. The goal of this toolkit is twofold: first, to provide a broad range of capabilities to streamline as well as foster the common practices of quantifying, evaluating, improving, and communicating uncertainty in the AI application development lifecycl… ▽ More

    Submitted 3 June, 2021; v1 submitted 2 June, 2021; originally announced June 2021.

    Comments: Added references

  9. arXiv:2106.00858  [pdf, other

    cs.LG cs.AI stat.ML

    Uncertainty Characteristics Curves: A Systematic Assessment of Prediction Intervals

    Authors: Jiri Navratil, Benjamin Elder, Matthew Arnold, Soumya Ghosh, Prasanna Sattigeri

    Abstract: Accurate quantification of model uncertainty has long been recognized as a fundamental requirement for trusted AI. In regression tasks, uncertainty is typically quantified using prediction intervals calibrated to a specific operating point, making evaluation and comparison across different studies difficult. Our work leverages: (1) the concept of operating characteristics curves and (2) the notion… ▽ More

    Submitted 1 June, 2021; originally announced June 2021.

    Comments: 10 pages main paper, 9 pages appendix

  10. arXiv:2012.08625  [pdf, other

    cs.LG

    Learning Prediction Intervals for Model Performance

    Authors: Benjamin Elder, Matthew Arnold, Anupama Murthi, Jiri Navratil

    Abstract: Understanding model performance on unlabeled data is a fundamental challenge of developing, deploying, and maintaining AI systems. Model performance is typically evaluated using test sets or periodic manual quality assessments, both of which require laborious manual data labeling. Automated performance prediction techniques aim to mitigate this burden, but potential inaccuracy and a lack of trust… ▽ More

    Submitted 15 December, 2020; originally announced December 2020.

    Comments: 7+6 pages, 5 figures, AAAI 2021

  11. arXiv:2007.05499  [pdf, other

    cs.LG stat.ML

    Not Your Grandfathers Test Set: Reducing Labeling Effort for Testing

    Authors: Begum Taskazan, Jiri Navratil, Matthew Arnold, Anupama Murthi, Ganesh Venkataraman, Benjamin Elder

    Abstract: Building and maintaining high-quality test sets remains a laborious and expensive task. As a result, test sets in the real world are often not properly kept up to date and drift from the production traffic they are supposed to represent. The frequency and severity of this drift raises serious concerns over the value of manually labeled test sets in the QA process. This paper proposes a simple but… ▽ More

    Submitted 10 July, 2020; originally announced July 2020.

    Comments: International Workshop on Challenges in Deploying and Monitoring Machine Learning Systems in Conjunction with ICML 2020

  12. arXiv:2007.01350  [pdf, other

    cs.LG stat.ML

    Uncertainty Prediction for Deep Sequential Regression Using Meta Models

    Authors: Jiri Navratil, Matthew Arnold, Benjamin Elder

    Abstract: Generating high quality uncertainty estimates for sequential regression, particularly deep recurrent networks, remains a challenging and open problem. Existing approaches often make restrictive assumptions (such as stationarity) yet still perform poorly in practice, particularly in presence of real world non-stationary signals and drift. This paper describes a flexible method that can generate sym… ▽ More

    Submitted 22 July, 2021; v1 submitted 2 July, 2020; originally announced July 2020.

  13. arXiv:2003.12808  [pdf, other

    cs.LG cs.SE

    Towards Automating the AI Operations Lifecycle

    Authors: Matthew Arnold, Jeffrey Boston, Michael Desmond, Evelyn Duesterwald, Benjamin Elder, Anupama Murthi, Jiri Navratil, Darrell Reimer

    Abstract: Today's AI deployments often require significant human involvement and skill in the operational stages of the model lifecycle, including pre-release testing, monitoring, problem diagnosis and model improvements. We present a set of enabling technologies that can be used to increase the level of automation in AI operations, thus lowering the human effort required. Since a common source of human inv… ▽ More

    Submitted 28 March, 2020; originally announced March 2020.

    ACM Class: I.2

  14. Accelerating Physics-Based Simulations Using Neural Network Proxies: An Application in Oil Reservoir Modeling

    Authors: Jiri Navratil, Alan King, Jesus Rios, Georgios Kollias, Ruben Torrado, Andres Codas

    Abstract: We develop a proxy model based on deep learning methods to accelerate the simulations of oil reservoirs--by three orders of magnitude--compared to industry-strength physics-based PDE solvers. This paper describes a new architectural approach to this task, accompanied by a thorough experimental evaluation on a publicly available reservoir model. We demonstrate that in a practical setting a speedup… ▽ More

    Submitted 23 May, 2019; originally announced June 2019.

    Comments: 9 pages, submitted to FEED-2019 KDD Workshop & Frontiers in Big Data

    Journal ref: Front. Big Data, 20 September 2019

  15. arXiv:1805.05396  [pdf, other

    cs.LG stat.ML

    Confidence Scoring Using Whitebox Meta-models with Linear Classifier Probes

    Authors: Tongfei Chen, Jiří Navrátil, Vijay Iyengar, Karthikeyan Shanmugam

    Abstract: We propose a novel confidence scoring mechanism for deep neural networks based on a two-model paradigm involving a base model and a meta-model. The confidence score is learned by the meta-model observing the base model succeeding/failing at its task. As features to the meta-model, we investigate linear classifier probes inserted between the various layers of the base model. Our experiments demonst… ▽ More

    Submitted 13 March, 2019; v1 submitted 14 May, 2018; originally announced May 2018.

    Comments: Accepted at AISTATS 2019

    Journal ref: Proceedings of Machine Learning Research, PMLR 89:1467-1475, 2019

  16. arXiv:1708.04326  [pdf, ps, other

    cs.IR

    Improved Answer Selection with Pre-Trained Word Embeddings

    Authors: Rishav Chakravarti, Jiri Navratil, Cicero Nogueira dos Santos

    Abstract: This paper evaluates existing and newly proposed answer selection methods based on pre-trained word embeddings. Word embeddings are highly effective in various natural language processing tasks and their integration into traditional information retrieval (IR) systems allows for the capture of semantic relatedness between questions and answers. Empirical results on three publicly available data set… ▽ More

    Submitted 14 August, 2017; originally announced August 2017.