Skip to main content

Showing 1–5 of 5 results for author: Sander, T

  1. arXiv:2403.02506  [pdf, other

    cs.CV cs.LG

    Differentially Private Representation Learning via Image Captioning

    Authors: Tom Sander, Yaodong Yu, Maziar Sanjabi, Alain Durmus, Yi Ma, Kamalika Chaudhuri, Chuan Guo

    Abstract: Differentially private (DP) machine learning is considered the gold-standard solution for training a model from sensitive data while still preserving privacy. However, a major barrier to achieving this ideal is its sub-optimal privacy-accuracy trade-off, which is particularly visible in DP representation learning. Specifically, it has been shown that under modest privacy budgets, most models learn… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  2. arXiv:2402.14904  [pdf, other

    cs.CR cs.AI cs.CL cs.LG

    Watermarking Makes Language Models Radioactive

    Authors: Tom Sander, Pierre Fernandez, Alain Durmus, Matthijs Douze, Teddy Furon

    Abstract: This paper investigates the radioactivity of LLM-generated texts, i.e. whether it is possible to detect that such input was used as training data. Conventional methods like membership inference can carry out this detection with some level of accuracy. We show that watermarked training data leaves traces easier to detect and much more reliable than membership inference. We link the contamination le… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  3. arXiv:2402.08344  [pdf, other

    stat.ML cs.LG

    Implicit Bias in Noisy-SGD: With Applications to Differentially Private Training

    Authors: Tom Sander, Maxime Sylvestre, Alain Durmus

    Abstract: Training Deep Neural Networks (DNNs) with small batches using Stochastic Gradient Descent (SGD) yields superior test performance compared to larger batches. The specific noise structure inherent to SGD is known to be responsible for this implicit bias. DP-SGD, used to ensure differential privacy (DP) in DNNs' training, adds Gaussian noise to the clipped gradients. Surprisingly, large-batch trainin… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

  4. arXiv:2308.00420  [pdf, other

    cs.CC

    The complexity of the Timetable-Based Railway Network Design Problem

    Authors: Nadine Friesen, Tim Sander, Karl Nachtigall, Nils Nießen

    Abstract: Because of the long planning periods and their long life cycle, railway infrastructure has to be outlined long ahead. At the present, the infrastructure is designed while only little about the intended operation is known. Hence, the timetable and the operation are adjusted to the infrastructure. Since space, time and money for extension measures of railway infrastructure are limited, each modifica… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

  5. arXiv:2210.03403  [pdf, other

    cs.LG cs.CR stat.ML

    TAN Without a Burn: Scaling Laws of DP-SGD

    Authors: Tom Sander, Pierre Stock, Alexandre Sablayrolles

    Abstract: Differentially Private methods for training Deep Neural Networks (DNNs) have progressed recently, in particular with the use of massive batches and aggregated data augmentations for a large number of training steps. These techniques require much more computing resources than their non-private counterparts, shifting the traditional privacy-accuracy trade-off to a privacy-accuracy-compute trade-off… ▽ More

    Submitted 24 May, 2023; v1 submitted 7 October, 2022; originally announced October 2022.