Skip to main content

Showing 1–31 of 31 results for author: Kilbertus, N

  1. arXiv:2406.03920  [pdf, other

    cs.LG physics.ao-ph

    Towards Physically Consistent Deep Learning For Climate Model Parameterizations

    Authors: Birgit Kühbacher, Fernando Iglesias-Suarez, Niki Kilbertus, Veronika Eyring

    Abstract: Climate models play a critical role in understanding and projecting climate change. Due to their complexity, their horizontal resolution of ~40-100 km remains too coarse to resolve processes such as clouds and convection, which need to be approximated via parameterizations. These parameterizations are a major source of systematic errors and large uncertainties in climate projections. Deep learning… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  2. arXiv:2405.19985  [pdf, other

    stat.ME cs.LG

    Targeted Sequential Indirect Experiment Design

    Authors: Elisabeth Ailer, Niclas Dern, Jason Hartford, Niki Kilbertus

    Abstract: Scientific hypotheses typically concern specific aspects of complex, imperfectly understood or entirely unknown mechanisms, such as the effect of gene expression levels on phenotypes or how microbial communities influence environmental health. Such queries are inherently causal (rather than purely associational), but in many settings, experiments can not be conducted directly on the target variabl… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  3. arXiv:2405.05998  [pdf, other

    q-bio.GN cs.LG

    Whole Genome Transformer for Gene Interaction Effects in Microbiome Habitat Specificity

    Authors: Zhufeng Li, Sandeep S Cranganore, Nicholas Youngblut, Niki Kilbertus

    Abstract: Leveraging the vast genetic diversity within microbiomes offers unparalleled insights into complex phenotypes, yet the task of accurately predicting and understanding such traits from genomic data remains challenging. We propose a framework taking advantage of existing large models for gene vectorization to predict habitat specificity from entire microbial genome sequences. Based on our model, we… ▽ More

    Submitted 28 May, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

  4. arXiv:2402.18477  [pdf, other

    cs.LG cs.AI stat.ML

    Signature Kernel Conditional Independence Tests in Causal Discovery for Stochastic Processes

    Authors: Georg Manten, Cecilia Casolo, Emilio Ferrucci, Søren Wengel Mogensen, Cristopher Salvi, Niki Kilbertus

    Abstract: Inferring the causal structure underlying stochastic dynamical systems from observational data holds great promise in domains ranging from science and health to finance. Such processes can often be accurately modeled via stochastic differential equations (SDEs), which naturally imply causal relationships via "which variables enter the differential of which other variables". In this paper, we devel… ▽ More

    Submitted 11 June, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  5. arXiv:2311.15100  [pdf, other

    cs.CV cs.AI cs.LG

    Unbalancedness in Neural Monge Maps Improves Unpaired Domain Translation

    Authors: Luca Eyring, Dominik Klein, Théo Uscidda, Giovanni Palla, Niki Kilbertus, Zeynep Akata, Fabian Theis

    Abstract: In optimal transport (OT), a Monge map is known as a mapping that transports a source distribution to a target distribution in the most cost-efficient way. Recently, multiple neural estimators for Monge maps have been developed and applied in diverse unpaired domain translation tasks, e.g. in single-cell biology and computer vision. However, the classic OT framework enforces mass conservation, whi… ▽ More

    Submitted 11 March, 2024; v1 submitted 25 November, 2023; originally announced November 2023.

    Comments: ICLR 2024

  6. arXiv:2310.05573  [pdf, other

    cs.LG

    ODEFormer: Symbolic Regression of Dynamical Systems with Transformers

    Authors: Stéphane d'Ascoli, Sören Becker, Alexander Mathis, Philippe Schwaller, Niki Kilbertus

    Abstract: We introduce ODEFormer, the first transformer able to infer multidimensional ordinary differential equation (ODE) systems in symbolic form from the observation of a single solution trajectory. We perform extensive evaluations on two datasets: (i) the existing "Strogatz" dataset featuring two-dimensional systems; (ii) ODEBench, a collection of one- to four-dimensional systems that we carefully cura… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

  7. arXiv:2307.12617  [pdf, other

    cs.LG

    Predicting Ordinary Differential Equations with Transformers

    Authors: Sören Becker, Michal Klein, Alexander Neitz, Giambattista Parascandolo, Niki Kilbertus

    Abstract: We develop a transformer-based sequence-to-sequence model that recovers scalar ordinary differential equations (ODEs) in symbolic form from irregularly sampled and noisy observations of a single solution trajectory. We demonstrate in extensive empirical evaluations that our model performs better or on par with existing methods in terms of accurate recovery across various settings. Moreover, our me… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: Published at ICML 2023

  8. arXiv:2306.09739  [pdf, other

    cs.LG physics.comp-ph stat.ML

    Stabilized Neural Differential Equations for Learning Dynamics with Explicit Constraints

    Authors: Alistair White, Niki Kilbertus, Maximilian Gelbrecht, Niklas Boers

    Abstract: Many successful methods to learn dynamical systems from data have recently been introduced. However, ensuring that the inferred dynamics preserve known constraints, such as conservation laws or restrictions on the allowed system states, remains challenging. We propose stabilized neural differential equations (SNDEs), a method to enforce arbitrary manifold constraints for neural differential equati… ▽ More

    Submitted 15 February, 2024; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: 22 pages, 8 figures. Accepted at NeurIPS 2023

  9. arXiv:2302.05684  [pdf, other

    stat.ME cs.LG stat.ML

    Sequential Underspecified Instrument Selection for Cause-Effect Estimation

    Authors: Elisabeth Ailer, Jason Hartford, Niki Kilbertus

    Abstract: Instrumental variable (IV) methods are used to estimate causal effects in settings with unobserved confounding, where we cannot directly experiment on the treatment variable. Instruments are variables which only affect the outcome indirectly via the treatment variable(s). Most IV applications focus on low-dimensional treatments and crucially require at least as many instruments as treatments. This… ▽ More

    Submitted 25 May, 2023; v1 submitted 11 February, 2023; originally announced February 2023.

    Comments: Code for this paper is available at https://github.com/EAiler/underspecified-iv

  10. arXiv:2211.02830  [pdf, other

    cs.LG

    Discovering ordinary differential equations that govern time-series

    Authors: Sören Becker, Michal Klein, Alexander Neitz, Giambattista Parascandolo, Niki Kilbertus

    Abstract: Natural laws are often described through differential equations yet finding a differential equation that describes the governing law underlying observed data is a challenging and still mostly manual task. In this paper we make a step towards the automation of this process: we propose a transformer-based sequence-to-sequence model that recovers scalar autonomous ordinary differential equations (ODE… ▽ More

    Submitted 5 November, 2022; originally announced November 2022.

    Comments: Workshop paper at NeurIPS 2022 workshop "AI for Science"

  11. arXiv:2210.14672  [pdf, other

    cs.LG

    Sparsity in Continuous-Depth Neural Networks

    Authors: Hananeh Aliee, Till Richter, Mikhail Solonin, Ignacio Ibarra, Fabian Theis, Niki Kilbertus

    Abstract: Neural Ordinary Differential Equations (NODEs) have proven successful in learning dynamical systems in terms of accurately recovering the observed trajectories. While different types of sparsity have been proposed to improve robustness, the generalization properties of NODEs for dynamical systems beyond the observed data are underexplored. We systematically study the influence of weight and featur… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

    Comments: Neurips 2022

  12. arXiv:2207.09768  [pdf, other

    cs.LG stat.ML

    Learning Counterfactually Invariant Predictors

    Authors: Francesco Quinzan, Cecilia Casolo, Krikamol Muandet, Yucen Luo, Niki Kilbertus

    Abstract: Notions of counterfactual invariance (CI) have proven essential for predictors that are fair, robust, and generalizable in the real world. We propose graphical criteria that yield a sufficient condition for a predictor to be counterfactually invariant in terms of a conditional independence in the observational distribution. In order to learn such predictors, we propose a model-agnostic framework,… ▽ More

    Submitted 13 October, 2023; v1 submitted 20 July, 2022; originally announced July 2022.

  13. arXiv:2206.13102  [pdf, other

    cs.GT cs.CY cs.IR cs.LG stat.ML

    Modeling Content Creator Incentives on Algorithm-Curated Platforms

    Authors: Jiri Hron, Karl Krauth, Michael I. Jordan, Niki Kilbertus, Sarah Dean

    Abstract: Content creators compete for user attention. Their reach crucially depends on algorithmic choices made by developers on online platforms. To maximize exposure, many creators adapt strategically, as evidenced by examples like the sprawling search engine optimization industry. This begets competition for the finite user attention pool. We formalize these dynamics in what we call an exposure game, a… ▽ More

    Submitted 6 July, 2023; v1 submitted 27 June, 2022; originally announced June 2022.

    Comments: presented at ICLR 2023 (top 5%)

  14. arXiv:2205.08875  [pdf, other

    cs.LG cs.CY

    Multi-disciplinary fairness considerations in machine learning for clinical trials

    Authors: Isabel Chien, Nina Deliu, Richard E. Turner, Adrian Weller, Sofia S. Villar, Niki Kilbertus

    Abstract: While interest in the application of machine learning to improve healthcare has grown tremendously in recent years, a number of barriers prevent deployment in medical practice. A notable concern is the potential to exacerbate entrenched biases and existing health disparities in society. The area of fairness in machine learning seeks to address these issues of equity; however, appropriate approache… ▽ More

    Submitted 18 May, 2022; originally announced May 2022.

    Comments: Appeared at ACM FAccT 2022

  15. arXiv:2205.07271  [pdf, other

    stat.ML cs.LG stat.AP

    Supervised Learning and Model Analysis with Compositional Data

    Authors: Shimeng Huang, Elisabeth Ailer, Niki Kilbertus, Niklas Pfister

    Abstract: The compositionality and sparsity of high-throughput sequencing data poses a challenge for regression and classification. However, in microbiome research in particular, conditional modeling is an essential tool to investigate relationships between phenotypes and the microbiome. Existing techniques are often inadequate: they either rely on extensions of the linear log-contrast model (which adjusts… ▽ More

    Submitted 11 November, 2022; v1 submitted 15 May, 2022; originally announced May 2022.

  16. arXiv:2204.13545  [pdf, other

    cs.LG q-bio.GN stat.AP stat.ML

    Predicting Cellular Responses to Novel Drug Perturbations at a Single-Cell Resolution

    Authors: Leon Hetzel, Simon Böhm, Niki Kilbertus, Stephan Günnemann, Mohammad Lotfollahi, Fabian Theis

    Abstract: Single-cell transcriptomics enabled the study of cellular heterogeneity in response to perturbations at the resolution of individual cells. However, scaling high-throughput screens (HTSs) to measure cellular responses for many drugs remains a challenge due to technical limitations and, more importantly, the cost of such multiplexed experiments. Thus, transferring information from routinely perform… ▽ More

    Submitted 30 December, 2022; v1 submitted 28 April, 2022; originally announced April 2022.

    Comments: 10 pages. NeurIPS 2022 conference paper

  17. arXiv:2202.10806  [pdf, other

    stat.ML cs.LG

    Stochastic Causal Programming for Bounding Treatment Effects

    Authors: Kirtan Padh, Jakob Zeitler, David Watson, Matt Kusner, Ricardo Silva, Niki Kilbertus

    Abstract: Causal effect estimation is important for many tasks in the natural and social sciences. We design algorithms for the continuous partial identification problem: bounding the effects of multivariate, continuous treatments when unmeasured confounding makes identification impossible. Specifically, we cast causal effects as objective functions within a constrained optimization problem, and minimize/ma… ▽ More

    Submitted 17 May, 2023; v1 submitted 22 February, 2022; originally announced February 2022.

    Journal ref: Proceedings of Machine Learning Research vol 213:1-35, 2023

  18. arXiv:2106.14979  [pdf, other

    cs.IR cs.LG stat.ML

    On component interactions in two-stage recommender systems

    Authors: Jiri Hron, Karl Krauth, Michael I. Jordan, Niki Kilbertus

    Abstract: Thanks to their scalability, two-stage recommenders are used by many of today's largest online platforms, including YouTube, LinkedIn, and Pinterest. These systems produce recommendations in two steps: (i) multiple nominators, tuned for low prediction latency, preselect a small subset of candidates from the whole item pool; (ii) a slower but more accurate ranker further narrows down the nominated… ▽ More

    Submitted 12 January, 2022; v1 submitted 28 June, 2021; originally announced June 2021.

    Comments: Appears in the proceedings of the NeurIPS 2021 conference

  19. arXiv:2106.12430  [pdf, other

    cs.LG cs.AI

    Beyond Predictions in Neural ODEs: Identification and Interventions

    Authors: Hananeh Aliee, Fabian J. Theis, Niki Kilbertus

    Abstract: Spurred by tremendous success in pattern matching and prediction tasks, researchers increasingly resort to machine learning to aid original scientific discovery. Given large amounts of observational data about a system, can we uncover the rules that govern its evolution? Solving this task holds the great promise of fully understanding the causal interactions and being able to make reliable predict… ▽ More

    Submitted 23 June, 2021; originally announced June 2021.

  20. arXiv:2106.11234  [pdf, other

    cs.LG q-bio.QM stat.AP stat.ML

    Instrumental Variable Estimation for Compositional Treatments

    Authors: Elisabeth Ailer, Christian L. Müller, Niki Kilbertus

    Abstract: Many scientific datasets are compositional in nature. Important biological examples include species abundances in ecology, cell-type compositions derived from single-cell sequencing data, and amplicon abundance data in microbiome research. Here, we provide a causal view on compositional data in an instrumental variable setting where the composition acts as the cause. First, we crisply articulate p… ▽ More

    Submitted 28 May, 2024; v1 submitted 21 June, 2021; originally announced June 2021.

    Comments: Code available on https://github.com/EAiler/causal-compositions

  21. arXiv:2101.12476  [pdf, other

    cs.LG cs.AI cs.CY

    Beyond traditional assumptions in fair machine learning

    Authors: Niki Kilbertus

    Abstract: This thesis scrutinizes common assumptions underlying traditional machine learning approaches to fairness in consequential decision making. After challenging the validity of these assumptions in real-world applications, we propose ways to move forward when they are violated. First, we show that group fairness criteria purely based on statistical properties of observed data are fundamentally limite… ▽ More

    Submitted 29 January, 2021; originally announced January 2021.

    Comments: PhD Thesis submitted at the University of Cambridge, October 2020. The thesis is based on a number of previous works also available on arxiv (see Chapter 1)

  22. arXiv:2009.08956  [pdf, other

    cs.IR cs.LG stat.ML

    Exploration in two-stage recommender systems

    Authors: Jiri Hron, Karl Krauth, Michael I. Jordan, Niki Kilbertus

    Abstract: Two-stage recommender systems are widely adopted in industry due to their scalability and maintainability. These systems produce recommendations in two steps: (i) multiple nominators preselect a small number of items from a large pool using cheap-to-compute item embeddings; (ii) with a richer set of features, a ranker rearranges the nominated items and serves them to the user. A key challenge of t… ▽ More

    Submitted 1 September, 2020; originally announced September 2020.

    Comments: Published at the REVEAL 2020 workshop (RecSys 2020)

  23. arXiv:2006.07886  [pdf, other

    cs.LG stat.ML

    On Disentangled Representations Learned From Correlated Data

    Authors: Frederik Träuble, Elliot Creager, Niki Kilbertus, Francesco Locatello, Andrea Dittadi, Anirudh Goyal, Bernhard Schölkopf, Stefan Bauer

    Abstract: The focus of disentanglement approaches has been on identifying independent factors of variation in data. However, the causal variables underlying real-world observations are often not statistically independent. In this work, we bridge the gap to real-world scenarios by analyzing the behavior of the most prominent disentanglement approaches on correlated data in a large-scale empirical study (incl… ▽ More

    Submitted 16 July, 2021; v1 submitted 14 June, 2020; originally announced June 2020.

    Comments: Published at the 38th International Conference on Machine Learning (ICML 2021)

  24. arXiv:2006.06366  [pdf, other

    cs.LG stat.ML

    A Class of Algorithms for General Instrumental Variable Models

    Authors: Niki Kilbertus, Matt J. Kusner, Ricardo Silva

    Abstract: Causal treatment effect estimation is a key problem that arises in a variety of real-world settings, from personalized medicine to governmental policy making. There has been a flurry of recent work in machine learning on estimating causal effects when one has access to an instrument. However, to achieve identifiability, they in general require one-size-fits-all assumptions such as an additive erro… ▽ More

    Submitted 21 October, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: Appeared at Neural Information Processing Systems (NeurIPS) 2020; Code at https://github.com/nikikilbertus/general-iv-models

  25. arXiv:1907.01040  [pdf, other

    cs.LG cs.CY stat.ML

    The Sensitivity of Counterfactual Fairness to Unmeasured Confounding

    Authors: Niki Kilbertus, Philip J. Ball, Matt J. Kusner, Adrian Weller, Ricardo Silva

    Abstract: Causal approaches to fairness have seen substantial recent interest, both from the machine learning community and from wider parties interested in ethical prediction algorithms. In no small part, this has been due to the fact that causal models allow one to simultaneously leverage data and expert knowledge to remove discriminatory effects from predictions. However, one of the primary assumptions i… ▽ More

    Submitted 1 July, 2019; originally announced July 2019.

    Comments: published at UAI 2019

  26. arXiv:1904.08693  [pdf, other

    astro-ph.IM astro-ph.HE cs.LG stat.ML

    Convolutional neural networks: a magic bullet for gravitational-wave detection?

    Authors: Timothy D. Gebhard, Niki Kilbertus, Ian Harry, Bernhard Schölkopf

    Abstract: In the last few years, machine learning techniques, in particular convolutional neural networks, have been investigated as a method to replace or complement traditional matched filtering techniques that are used to detect the gravitational-wave signature of merging black holes. However, to date, these methods have not yet been successfully applied to the analysis of long stretches of data recorded… ▽ More

    Submitted 6 September, 2019; v1 submitted 18 April, 2019; originally announced April 2019.

    Comments: First two authors contributed equally; appeared at Phys. Rev. D

    Journal ref: Phys. Rev. D 100, 063015 (2019)

  27. arXiv:1902.02979  [pdf, other

    cs.LG cs.CY stat.ML

    Fair Decisions Despite Imperfect Predictions

    Authors: Niki Kilbertus, Manuel Gomez-Rodriguez, Bernhard Schölkopf, Krikamol Muandet, Isabel Valera

    Abstract: Consequential decisions are increasingly informed by sophisticated data-driven predictive models. However, to consistently learn accurate predictive models, one needs access to ground truth labels. Unfortunately, in practice, labels may only exist conditional on certain decisions---if a loan is denied, there is not even an option for the individual to pay back the loan. Hence, the observed data di… ▽ More

    Submitted 16 October, 2020; v1 submitted 8 February, 2019; originally announced February 2019.

    Comments: earlier version appeared at AISTATS 2020 http://proceedings.mlr.press/v108/

  28. arXiv:1812.00524  [pdf, other

    cs.LG stat.ML

    Generalization in anti-causal learning

    Authors: Niki Kilbertus, Giambattista Parascandolo, Bernhard Schölkopf

    Abstract: The ability to learn and act in novel situations is still a prerogative of animate intelligence, as current machine learning methods mostly fail when moving beyond the standard i.i.d. setting. What is the reason for this discrepancy? Most machine learning tasks are anti-causal, i.e., we infer causes (labels) from effects (observations). Typically, in supervised learning we build systems that try t… ▽ More

    Submitted 2 December, 2018; originally announced December 2018.

    Comments: A shorter version of this paper appeared at the workshop on `Critiquing and correcting trends in machine learning` at NeurIPS 2018

  29. arXiv:1806.03281  [pdf, other

    stat.ML cs.CR cs.CY cs.LG

    Blind Justice: Fairness with Encrypted Sensitive Attributes

    Authors: Niki Kilbertus, Adrià Gascón, Matt J. Kusner, Michael Veale, Krishna P. Gummadi, Adrian Weller

    Abstract: Recent work has explored how to train machine learning models which do not discriminate against any subgroup of the population as determined by sensitive attributes such as gender or race. To avoid disparate treatment, sensitive attributes should not be considered. On the other hand, in order to avoid disparate impact, sensitive attributes must be examined, e.g., in order to learn a fair model, or… ▽ More

    Submitted 8 June, 2018; originally announced June 2018.

    Comments: published at ICML 2018

    Journal ref: Proceedings of the 35th International Conference on Machine Learning, PMLR 80:2630-2639, 2018

  30. arXiv:1712.00961  [pdf, other

    cs.LG stat.ML

    Learning Independent Causal Mechanisms

    Authors: Giambattista Parascandolo, Niki Kilbertus, Mateo Rojas-Carulla, Bernhard Schölkopf

    Abstract: Statistical learning relies upon data sampled from a distribution, and we usually do not care what actually generated it in the first place. From the point of view of causal modeling, the structure of each distribution is induced by physical mechanisms that give rise to dependences between observables. Mechanisms, however, can be meaningful autonomous modules of generative models that make sense b… ▽ More

    Submitted 8 September, 2018; v1 submitted 4 December, 2017; originally announced December 2017.

    Comments: ICML 2018

    Journal ref: Proceedings of the 35th International Conference on Machine Learning, PMLR 80:4036-4044, 2018

  31. arXiv:1706.02744  [pdf, ps, other

    stat.ML cs.CY cs.LG

    Avoiding Discrimination through Causal Reasoning

    Authors: Niki Kilbertus, Mateo Rojas-Carulla, Giambattista Parascandolo, Moritz Hardt, Dominik Janzing, Bernhard Schölkopf

    Abstract: Recent work on fairness in machine learning has focused on various statistical discrimination criteria and how they trade off. Most of these criteria are observational: They depend only on the joint distribution of predictor, protected attribute, features, and outcome. While convenient to work with, observational criteria have severe inherent limitations that prevent them from resolving matters of… ▽ More

    Submitted 21 January, 2018; v1 submitted 8 June, 2017; originally announced June 2017.

    Comments: Advances in Neural Information Processing Systems 30, 2017 http://papers.nips.cc/paper/6668-avoiding-discrimination-through-causal-reasoning

    Journal ref: Advances in Neural Information Processing Systems 30, 2017, p. 656--666