Skip to main content

Showing 1–50 of 109 results for author: Feizi, S

  1. arXiv:2406.13683  [pdf, other

    cs.CV cs.AI

    IntCoOp: Interpretability-Aware Vision-Language Prompt Tuning

    Authors: Soumya Suvra Ghosal, Samyadeep Basu, Soheil Feizi, Dinesh Manocha

    Abstract: Image-text contrastive models such as CLIP learn transferable and robust representations for zero-shot transfer to a variety of downstream tasks. However, to obtain strong downstream performances, prompts need to be carefully curated, which can be a tedious engineering task. To address the issue of manual prompt engineering, prompt-tuning is used where a set of contextual vectors are learned by le… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  2. arXiv:2406.11674  [pdf, other

    cs.CL

    Endor: Hardware-Friendly Sparse Format for Offloaded LLM Inference

    Authors: Donghyeon Joo, Ramyad Hadidi, Soheil Feizi, Bahar Asgari

    Abstract: The increasing size of large language models (LLMs) challenges their usage on resource-constrained platforms. For example, memory on modern GPUs is insufficient to hold LLMs that are hundreds of Gigabytes in size. Offloading is a popular method to escape this constraint by storing weights of an LLM model to host CPU memory and SSD, then loading each weight to GPU before every use. In our case stud… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 14 pages, 16 figures

  3. arXiv:2406.07844  [pdf, other

    cs.CV

    Understanding and Mitigating Compositional Issues in Text-to-Image Generative Models

    Authors: Arman Zarei, Keivan Rezaei, Samyadeep Basu, Mehrdad Saberi, Mazda Moayeri, Priyatham Kattakinda, Soheil Feizi

    Abstract: Recent text-to-image diffusion-based generative models have the stunning ability to generate highly detailed and photo-realistic images and achieve state-of-the-art low FID scores on challenging image generation benchmarks. However, one of the primary failure modes of these text-to-image generative models is in composing attributes, objects, and their associated relationships accurately into an im… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  4. arXiv:2406.04236  [pdf, other

    cs.CV

    Understanding Information Storage and Transfer in Multi-modal Large Language Models

    Authors: Samyadeep Basu, Martin Grayson, Cecily Morrison, Besmira Nushi, Soheil Feizi, Daniela Massiceti

    Abstract: Understanding the mechanisms of information storage and transfer in Transformer-based models is important for driving model understanding progress. Recent work has studied these mechanisms for Large Language Models (LLMs), revealing insights on how information is stored in a model's parameters and how information flows to and from these parameters in response to specific prompts. However, these st… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 20 pages

  5. arXiv:2406.02836  [pdf, other

    cs.CR cs.CV

    DREW : Towards Robust Data Provenance by Leveraging Error-Controlled Watermarking

    Authors: Mehrdad Saberi, Vinu Sankar Sadasivan, Arman Zarei, Hessam Mahdavifar, Soheil Feizi

    Abstract: Identifying the origin of data is crucial for data provenance, with applications including data ownership protection, media forensics, and detecting AI-generated content. A standard approach involves embedding-based retrieval techniques that match query data with entries in a reference dataset. However, this method is not robust against benign and malicious edits. To address this, we propose Data… ▽ More

    Submitted 20 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  6. arXiv:2406.02542  [pdf, other

    cs.LG

    Loki: Low-Rank Keys for Efficient Sparse Attention

    Authors: Prajwal Singhania, Siddharth Singh, Shwai He, Soheil Feizi, Abhinav Bhatele

    Abstract: Inference on large language models can be expensive in terms of the compute and memory costs involved, especially when long sequence lengths are used. In particular, the self-attention mechanism used in such models contributes significantly to these costs, which has resulted in several recent works that propose sparse attention approximations for inference. In this work, we propose to approximate… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  7. arXiv:2406.01583  [pdf, other

    cs.CV cs.LG

    Decomposing and Interpreting Image Representations via Text in ViTs Beyond CLIP

    Authors: Sriram Balasubramanian, Samyadeep Basu, Soheil Feizi

    Abstract: Recent works have explored how individual components of the CLIP-ViT model contribute to the final representation by leveraging the shared image-text representation space of CLIP. These components, such as attention heads and MLPs, have been shown to capture distinct image features like shape, color or texture. However, understanding the role of these components in arbitrary vision transformers (V… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 22 pages, 15 figures

    ACM Class: I.5.1

  8. arXiv:2405.16401  [pdf, other

    cs.CV cs.LG

    Understanding the Effect of using Semantically Meaningful Tokens for Visual Representation Learning

    Authors: Neha Kalibhat, Priyatham Kattakinda, Arman Zarei, Nikita Seleznev, Samuel Sharpe, Senthil Kumar, Soheil Feizi

    Abstract: Vision transformers have established a precedent of patchifying images into uniformly-sized chunks before processing. We hypothesize that this design choice may limit models in learning comprehensive and compositional representations from visual data. This paper explores the notion of providing semantically-meaningful visual tokens to transformer encoders within a vision-language pre-training fram… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  9. arXiv:2405.01008  [pdf, other

    cs.CV

    On Mechanistic Knowledge Localization in Text-to-Image Generative Models

    Authors: Samyadeep Basu, Keivan Rezaei, Priyatham Kattakinda, Ryan Rossi, Cherry Zhao, Vlad Morariu, Varun Manjunatha, Soheil Feizi

    Abstract: Identifying layers within text-to-image models which control visual attributes can facilitate efficient model editing through closed-form updates. Recent work, leveraging causal tracing show that early Stable-Diffusion variants confine knowledge primarily to the first layer of the CLIP text-encoder, while it diffuses throughout the UNet.Extending this framework, we observe that for recent models (… ▽ More

    Submitted 7 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: Appearing in ICML 2024

  10. arXiv:2404.08030  [pdf, other

    cs.CV cs.AI

    Rethinking Artistic Copyright Infringements in the Era of Text-to-Image Generative Models

    Authors: Mazda Moayeri, Samyadeep Basu, Sriram Balasubramanian, Priyatham Kattakinda, Atoosa Chengini, Robert Brauneis, Soheil Feizi

    Abstract: Recent text-to-image generative models such as Stable Diffusion are extremely adept at mimicking and generating copyrighted content, raising concerns amongst artists that their unique styles may be improperly copied. Understanding how generative models copy "artistic style" is more complex than duplicating a single image, as style is comprised by a set of elements (or signature) that frequently co… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  11. arXiv:2403.02580  [pdf, other

    cs.CV cs.LG

    What do we learn from inverting CLIP models?

    Authors: Hamid Kazemi, Atoosa Chegini, Jonas Geiping, Soheil Feizi, Tom Goldstein

    Abstract: We employ an inversion-based approach to examine CLIP models. Our examination reveals that inverting CLIP models results in the generation of images that exhibit semantic alignment with the specified target prompts. We leverage these inverted images to gain insights into various aspects of CLIP models, such as their ability to blend concepts and inclusion of gender biases. We notably observe insta… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: Warning: This paper contains sexually explicit images and language, offensive visuals and terminology, discussions on pornography, gender bias, and other potentially unsettling, distressing, and/or offensive content for certain readers

  12. arXiv:2402.15570  [pdf, other

    cs.CR cs.AI cs.CL

    Fast Adversarial Attacks on Language Models In One GPU Minute

    Authors: Vinu Sankar Sadasivan, Shoumik Saha, Gaurang Sriramanan, Priyatham Kattakinda, Atoosa Chegini, Soheil Feizi

    Abstract: In this paper, we introduce a novel class of fast, beam search-based adversarial attack (BEAST) for Language Models (LMs). BEAST employs interpretable parameters, enabling attackers to balance between attack speed, success rate, and the readability of adversarial prompts. The computational efficiency of BEAST facilitates us to investigate its applications on LMs for jailbreaking, eliciting halluci… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

  13. arXiv:2312.05464  [pdf, other

    cs.CV cs.LG

    Identifying and Mitigating Model Failures through Few-shot CLIP-aided Diffusion Generation

    Authors: Atoosa Chegini, Soheil Feizi

    Abstract: Deep learning models can encounter unexpected failures, especially when dealing with challenging sub-populations. One common reason for these failures is the occurrence of objects in backgrounds that are rarely seen during training. To gain a better understanding of these failure modes, human-interpretable descriptions are crucial for further analysis and improvement which is expensive. In this st… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

  14. arXiv:2311.15551  [pdf, other

    cs.CV cs.AI cs.CR cs.LG eess.IV

    Instruct2Attack: Language-Guided Semantic Adversarial Attacks

    Authors: Jiang Liu, Chen Wei, Yuxiang Guo, Heng Yu, Alan Yuille, Soheil Feizi, Chun Pong Lau, Rama Chellappa

    Abstract: We propose Instruct2Attack (I2A), a language-guided semantic attack that generates semantically meaningful perturbations according to free-form language instructions. We make use of state-of-the-art latent diffusion models, where we adversarially guide the reverse diffusion process to search for an adversarial latent code conditioned on the input image and text instruction. Compared to existing no… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: under submission, code coming soon

  15. arXiv:2311.07601  [pdf, other

    cs.CY cs.AI

    Online Advertisements with LLMs: Opportunities and Challenges

    Authors: Soheil Feizi, MohammadTaghi Hajiaghayi, Keivan Rezaei, Suho Shin

    Abstract: This paper explores the potential for leveraging Large Language Models (LLM) in the realm of online advertising systems. We delve into essential requirements including privacy, latency, reliability as well as the satisfaction of users and advertisers that such a system must fulfill. We further introduce a general framework for LLM advertisement, consisting of modification, bidding, prediction, and… ▽ More

    Submitted 18 April, 2024; v1 submitted 10 November, 2023; originally announced November 2023.

  16. arXiv:2310.19889  [pdf, other

    cs.CV cs.AI cs.LG

    Exploring Geometry of Blind Spots in Vision Models

    Authors: Sriram Balasubramanian, Gaurang Sriramanan, Vinu Sankar Sadasivan, Soheil Feizi

    Abstract: Despite the remarkable success of deep neural networks in a myriad of settings, several works have demonstrated their overwhelming sensitivity to near-imperceptible perturbations, known as adversarial attacks. On the other hand, prior works have also observed that deep networks can be under-sensitive, wherein large-magnitude perturbations in input space do not induce appreciable changes to network… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: 25 pages, 20 figures, Accepted at NeurIPS 2023 (spotlight)

    ACM Class: I.2.6; I.2.10

  17. arXiv:2310.13730  [pdf, other

    cs.CV

    Localizing and Editing Knowledge in Text-to-Image Generative Models

    Authors: Samyadeep Basu, Nanxuan Zhao, Vlad Morariu, Soheil Feizi, Varun Manjunatha

    Abstract: Text-to-Image Diffusion Models such as Stable-Diffusion and Imagen have achieved unprecedented quality of photorealism with state-of-the-art FID scores on MS-COCO and other generation benchmarks. Given a caption, image generation requires fine-grained knowledge about attributes such as object structure, style, and viewpoint amongst others. Where does this information reside in text-to-image genera… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: 61 pages

  18. arXiv:2310.02426  [pdf, other

    cs.CV

    EditVal: Benchmarking Diffusion Based Text-Guided Image Editing Methods

    Authors: Samyadeep Basu, Mehrdad Saberi, Shweta Bhardwaj, Atoosa Malemir Chegini, Daniela Massiceti, Maziar Sanjabi, Shell Xu Hu, Soheil Feizi

    Abstract: A plethora of text-guided image editing methods have recently been developed by leveraging the impressive capabilities of large-scale diffusion-based generative models such as Imagen and Stable Diffusion. A standardized evaluation protocol, however, does not exist to compare methods across different types of fine-grained edits. To address this gap, we introduce EditVal, a standardized benchmark fo… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

  19. arXiv:2310.00164  [pdf, other

    cs.CV

    PRIME: Prioritizing Interpretability in Failure Mode Extraction

    Authors: Keivan Rezaei, Mehrdad Saberi, Mazda Moayeri, Soheil Feizi

    Abstract: In this work, we study the challenge of providing human-understandable descriptions for failure modes in trained image classification models. Existing works address this problem by first identifying clusters (or directions) of incorrectly classified samples in a latent space and then aiming to provide human-understandable text descriptions for them. We observe that in some cases, describing text d… ▽ More

    Submitted 14 March, 2024; v1 submitted 29 September, 2023; originally announced October 2023.

    Comments: Accepted to ICLR 2024

  20. arXiv:2310.00076  [pdf, other

    cs.CV

    Robustness of AI-Image Detectors: Fundamental Limits and Practical Attacks

    Authors: Mehrdad Saberi, Vinu Sankar Sadasivan, Keivan Rezaei, Aounon Kumar, Atoosa Chegini, Wenxiao Wang, Soheil Feizi

    Abstract: In light of recent advancements in generative AI models, it has become essential to distinguish genuine content from AI-generated one to prevent the malicious usage of fake materials as authentic ones and vice versa. Various techniques have been introduced for identifying AI-generated images, with watermarking emerging as a promising approach. In this paper, we analyze the robustness of various AI… ▽ More

    Submitted 14 February, 2024; v1 submitted 29 September, 2023; originally announced October 2023.

  21. arXiv:2309.03999  [pdf, other

    cs.CV cs.LG

    Adapting Self-Supervised Representations to Multi-Domain Setups

    Authors: Neha Kalibhat, Sam Sharpe, Jeremy Goodsitt, Bayan Bruss, Soheil Feizi

    Abstract: Current state-of-the-art self-supervised approaches, are effective when trained on individual domains but show limited generalization on unseen domains. We observe that these models poorly generalize even when trained on a mixture of domains, making them unsuitable to be deployed under diverse real-world setups. We therefore propose a general-purpose, lightweight Domain Disentanglement Module (DDM… ▽ More

    Submitted 12 December, 2023; v1 submitted 7 September, 2023; originally announced September 2023.

    Comments: Published at BMVC 2023

  22. arXiv:2309.02705  [pdf, other

    cs.CL cs.AI cs.CR cs.LG

    Certifying LLM Safety against Adversarial Prompting

    Authors: Aounon Kumar, Chirag Agarwal, Suraj Srinivas, Aaron Jiaxun Li, Soheil Feizi, Himabindu Lakkaraju

    Abstract: Large language models (LLMs) are vulnerable to adversarial attacks that add malicious tokens to an input prompt to bypass the safety guardrails of an LLM and cause it to produce harmful content. In this work, we introduce erase-and-check, the first framework for defending against adversarial prompts with certifiable safety guarantees. Given a prompt, our procedure erases tokens individually and in… ▽ More

    Submitted 12 February, 2024; v1 submitted 6 September, 2023; originally announced September 2023.

  23. Identifying and Mitigating the Security Risks of Generative AI

    Authors: Clark Barrett, Brad Boyd, Elie Burzstein, Nicholas Carlini, Brad Chen, Jihye Choi, Amrita Roy Chowdhury, Mihai Christodorescu, Anupam Datta, Soheil Feizi, Kathleen Fisher, Tatsunori Hashimoto, Dan Hendrycks, Somesh Jha, Daniel Kang, Florian Kerschbaum, Eric Mitchell, John Mitchell, Zulfikar Ramzan, Khawaja Shams, Dawn Song, Ankur Taly, Diyi Yang

    Abstract: Every major technical invention resurfaces the dual-use dilemma -- the new technology has the potential to be used for good as well as for harm. Generative AI (GenAI) techniques, such as large language models (LLMs) and diffusion models, have shown remarkable capabilities (e.g., in-context learning, code-completion, and text-to-image generation and editing). However, GenAI can be used just as well… ▽ More

    Submitted 28 December, 2023; v1 submitted 28 August, 2023; originally announced August 2023.

    Journal ref: Foundations and Trends in Privacy and Security 6 (2023) 1-52

  24. arXiv:2307.10504  [pdf, other

    cs.CV cs.LG

    Identifying Interpretable Subspaces in Image Representations

    Authors: Neha Kalibhat, Shweta Bhardwaj, Bayan Bruss, Hamed Firooz, Maziar Sanjabi, Soheil Feizi

    Abstract: We propose Automatic Feature Explanation using Contrasting Concepts (FALCON), an interpretability framework to explain features of image representations. For a target feature, FALCON captions its highly activating cropped images using a large captioning dataset (like LAION-400m) and a pre-trained vision-language model like CLIP. Each word among the captions is scored and ranked leading to a small… ▽ More

    Submitted 7 September, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

    Comments: Published at ICML 2023 Code: https://github.com/NehaKalibhat/falcon-explain

  25. arXiv:2307.09233  [pdf, other

    cs.CV

    Distilling Knowledge from Text-to-Image Generative Models Improves Visio-Linguistic Reasoning in CLIP

    Authors: Samyadeep Basu, Shell Xu Hu, Maziar Sanjabi, Daniela Massiceti, Soheil Feizi

    Abstract: Image-text contrastive models like CLIP have wide applications in zero-shot classification, image-text retrieval, and transfer learning. However, they often struggle on compositional visio-linguistic tasks (e.g., attribute-binding or object-relationships) where their performance is no better than random chance. To address this, we introduce SDS-CLIP, a lightweight and sample-efficient distillation… ▽ More

    Submitted 1 July, 2024; v1 submitted 18 July, 2023; originally announced July 2023.

    Comments: Short paper

  26. arXiv:2306.16415  [pdf, other

    cs.LG cs.AI cs.CR cs.CV stat.ML

    On Practical Aspects of Aggregation Defenses against Data Poisoning Attacks

    Authors: Wenxiao Wang, Soheil Feizi

    Abstract: The increasing access to data poses both opportunities and risks in deep learning, as one can manipulate the behaviors of deep learning models with malicious training samples. Such attacks are known as data poisoning. Recent advances in defense strategies against data poisoning have highlighted the effectiveness of aggregation schemes in achieving state-of-the-art results in certified poisoning ro… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: 15 pages

  27. arXiv:2306.00183  [pdf, other

    cs.LG cs.AI

    Diffused Redundancy in Pre-trained Representations

    Authors: Vedant Nanda, Till Speicher, John P. Dickerson, Soheil Feizi, Krishna P. Gummadi, Adrian Weller

    Abstract: Representations learned by pre-training a neural network on a large dataset are increasingly used successfully to perform a variety of downstream tasks. In this work, we take a closer look at how features are encoded in such pre-trained representations. We find that learned representations in a given layer exhibit a degree of diffuse redundancy, ie, any randomly chosen subset of neurons in the lay… ▽ More

    Submitted 14 November, 2023; v1 submitted 31 May, 2023; originally announced June 2023.

    Comments: NeurIPS 2023

  28. arXiv:2305.06386  [pdf, other

    cs.CV cs.AI cs.HC cs.LG

    Text-To-Concept (and Back) via Cross-Model Alignment

    Authors: Mazda Moayeri, Keivan Rezaei, Maziar Sanjabi, Soheil Feizi

    Abstract: We observe that the mapping between an image's representation in one model to its representation in another can be learned surprisingly well with just a linear layer, even across diverse models. Building on this observation, we propose $\textit{text-to-concept}$, where features from a fixed pretrained model are aligned linearly to the CLIP space, so that text embeddings from CLIP's text encoder be… ▽ More

    Submitted 10 May, 2023; originally announced May 2023.

    Comments: Accepted to ICML 2023 and CVPR4XAI workshop 2023

  29. arXiv:2304.01917  [pdf, other

    cs.CV

    Strong Baselines for Parameter Efficient Few-Shot Fine-tuning

    Authors: Samyadeep Basu, Daniela Massiceti, Shell Xu Hu, Soheil Feizi

    Abstract: Few-shot classification (FSC) entails learning novel classes given only a few examples per class after a pre-training (or meta-training) phase on a set of base classes. Recent works have shown that simply fine-tuning a pre-trained Vision Transformer (ViT) on new test classes is a strong approach for FSC. Fine-tuning ViTs, however, is expensive in time, compute and storage. This has motivated the d… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

  30. arXiv:2303.16308  [pdf, other

    cs.LG stat.ML

    Provable Robustness for Streaming Models with a Sliding Window

    Authors: Aounon Kumar, Vinu Sankar Sadasivan, Soheil Feizi

    Abstract: The literature on provable robustness in machine learning has primarily focused on static prediction problems, such as image classification, in which input samples are assumed to be independent and model performance is measured as an expectation over the input distribution. Robustness certificates are derived for individual input instances with the assumption that the model is evaluated on each in… ▽ More

    Submitted 28 March, 2023; originally announced March 2023.

  31. arXiv:2303.13372  [pdf, other

    cs.CR cs.LG

    DRSM: De-Randomized Smoothing on Malware Classifier Providing Certified Robustness

    Authors: Shoumik Saha, Wenxiao Wang, Yigitcan Kaya, Soheil Feizi, Tudor Dumitras

    Abstract: Machine Learning (ML) models have been utilized for malware detection for over two decades. Consequently, this ignited an ongoing arms race between malware authors and antivirus systems, compelling researchers to propose defenses for malware-detection models against evasion attacks. However, most if not all existing defenses against evasion attacks suffer from sizable performance degradation and/o… ▽ More

    Submitted 2 October, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

  32. arXiv:2303.11156  [pdf, other

    cs.CL cs.AI cs.LG

    Can AI-Generated Text be Reliably Detected?

    Authors: Vinu Sankar Sadasivan, Aounon Kumar, Sriram Balasubramanian, Wenxiao Wang, Soheil Feizi

    Abstract: The unregulated use of LLMs can potentially lead to malicious consequences such as plagiarism, generating fake news, spamming, etc. Therefore, reliable detection of AI-generated text can be critical to ensure the responsible use of LLMs. Recent works attempt to tackle this problem either using certain model signatures present in the generated text outputs or by applying watermarking techniques tha… ▽ More

    Submitted 19 February, 2024; v1 submitted 17 March, 2023; originally announced March 2023.

  33. arXiv:2303.04278  [pdf, other

    cs.LG cs.CR cs.CV

    CUDA: Convolution-based Unlearnable Datasets

    Authors: Vinu Sankar Sadasivan, Mahdi Soltanolkotabi, Soheil Feizi

    Abstract: Large-scale training of modern deep learning models heavily relies on publicly available data on the web. This potentially unauthorized usage of online data leads to concerns regarding data privacy. Recent works aim to make unlearnable data for deep learning models by adding small, specially designed noises to tackle this issue. However, these methods are vulnerable to adversarial training (AT) an… ▽ More

    Submitted 7 March, 2023; originally announced March 2023.

    Comments: CVPR 2023

  34. arXiv:2302.03684  [pdf, other

    cs.LG cs.AI cs.CR stat.ML

    Temporal Robustness against Data Poisoning

    Authors: Wenxiao Wang, Soheil Feizi

    Abstract: Data poisoning considers cases when an adversary manipulates the behavior of machine learning algorithms through malicious training data. Existing threat models of data poisoning center around a single metric, the number of poisoned samples. In consequence, if attackers can poison more samples than expected with affordable overhead, as in many practical scenarios, they may be able to render existi… ▽ More

    Submitted 6 December, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  35. arXiv:2302.02300  [pdf, other

    cs.LG

    Run-Off Election: Improved Provable Defense against Data Poisoning Attacks

    Authors: Keivan Rezaei, Kiarash Banihashem, Atoosa Chegini, Soheil Feizi

    Abstract: In data poisoning attacks, an adversary tries to change a model's prediction by adding, modifying, or removing samples in the training data. Recently, ensemble-based approaches for obtaining provable defenses against data poisoning have been proposed where predictions are done by taking a majority vote across multiple base models. In this work, we show that merely considering the majority vote in… ▽ More

    Submitted 16 May, 2023; v1 submitted 4 February, 2023; originally announced February 2023.

    Comments: Accepted to ICML 2023

  36. arXiv:2212.02648  [pdf, other

    cs.CV cs.AI cs.HC cs.LG

    Spuriosity Rankings: Sorting Data to Measure and Mitigate Biases

    Authors: Mazda Moayeri, Wenxiao Wang, Sahil Singla, Soheil Feizi

    Abstract: We present a simple but effective method to measure and mitigate model biases caused by reliance on spurious cues. Instead of requiring costly changes to one's data or model training, our method better utilizes the data one already has by sorting them. Specifically, we rank images within their classes based on spuriosity (the degree to which common spurious cues are present), proxied via deep neur… ▽ More

    Submitted 30 October, 2023; v1 submitted 5 December, 2022; originally announced December 2022.

    Comments: Accepted to NeurIPS '23 (Spotlight). Camera ready version

  37. arXiv:2211.14646  [pdf, other

    cs.CV cs.AI

    Towards Improved Input Masking for Convolutional Neural Networks

    Authors: Sriram Balasubramanian, Soheil Feizi

    Abstract: The ability to remove features from the input of machine learning models is very important to understand and interpret model predictions. However, this is non-trivial for vision models since masking out parts of the input image typically causes large distribution shifts. This is because the baseline color used for masking (typically grey or black) is out of distribution. Furthermore, the shape of… ▽ More

    Submitted 29 October, 2023; v1 submitted 26 November, 2022; originally announced November 2022.

    Comments: 29 pages, 19 figures. Accepted at ICCV 2023

    ACM Class: I.2.10; I.5.1

  38. arXiv:2211.10370  [pdf, other

    cs.CV cs.LG

    Invariant Learning via Diffusion Dreamed Distribution Shifts

    Authors: Priyatham Kattakinda, Alexander Levine, Soheil Feizi

    Abstract: Though the background is an important signal for image classification, over reliance on it can lead to incorrect predictions when spurious correlations between foreground and background are broken at test time. Training on a dataset where these correlations are unbiased would lead to more robust models. In this paper, we propose such a dataset called Diffusion Dreamed Distribution Shifts (D3S). D3… ▽ More

    Submitted 18 November, 2022; originally announced November 2022.

    Comments: 18 pages, 13 figures, 5 tables

  39. arXiv:2211.08453  [pdf, other

    cs.LG

    Improved techniques for deterministic l2 robustness

    Authors: Sahil Singla, Soheil Feizi

    Abstract: Training convolutional neural networks (CNNs) with a strict 1-Lipschitz constraint under the $l_{2}$ norm is useful for adversarial robustness, interpretable gradients and stable training. 1-Lipschitz CNNs are usually designed by enforcing each layer to have an orthogonal Jacobian matrix (for all inputs) to prevent the gradients from vanishing during backpropagation. However, their performance oft… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: NeurIPS 2022. arXiv admin note: text overlap with arXiv:2108.04062

  40. arXiv:2209.07592  [pdf, other

    cs.LG cs.CV

    Explicit Tradeoffs between Adversarial and Natural Distributional Robustness

    Authors: Mazda Moayeri, Kiarash Banihashem, Soheil Feizi

    Abstract: Several existing works study either adversarial or natural distributional robustness of deep neural networks separately. In practice, however, models need to enjoy both types of robustness to ensure reliability. In this work, we bridge this gap and show that in fact, explicit tradeoffs exist between adversarial and natural distributional robustness. We first consider a simple linear regression set… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: Accepted to NeurIPS 2022

  41. arXiv:2208.13298  [pdf, other

    cs.LG

    Goal-Conditioned Q-Learning as Knowledge Distillation

    Authors: Alexander Levine, Soheil Feizi

    Abstract: Many applications of reinforcement learning can be formalized as goal-conditioned environments, where, in each episode, there is a "goal" that affects the rewards obtained during that episode but does not affect the dynamics. Various techniques have been proposed to improve performance in goal-conditioned environments, such as automatic curriculum generation and goal relabeling. In this work, we e… ▽ More

    Submitted 8 March, 2023; v1 submitted 28 August, 2022; originally announced August 2022.

    Comments: AAAI 2023 Accepted paper

  42. arXiv:2208.03309  [pdf, other

    cs.LG cs.AI cs.CR cs.CV stat.ML

    Lethal Dose Conjecture on Data Poisoning

    Authors: Wenxiao Wang, Alexander Levine, Soheil Feizi

    Abstract: Data poisoning considers an adversary that distorts the training set of machine learning algorithms for malicious purposes. In this work, we bring to light one conjecture regarding the fundamentals of data poisoning, which we call the Lethal Dose Conjecture. The conjecture states: If $n$ clean training samples are needed for accurate predictions, then in a size-$N$ training set, only $Θ(N/n)$ pois… ▽ More

    Submitted 18 October, 2022; v1 submitted 5 August, 2022; originally announced August 2022.

    Comments: 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

  43. arXiv:2206.10158  [pdf, other

    cs.LG cs.MA

    Certifiably Robust Policy Learning against Adversarial Communication in Multi-agent Systems

    Authors: Yanchao Sun, Ruijie Zheng, Parisa Hassanzadeh, Yongyuan Liang, Soheil Feizi, Sumitra Ganesh, Furong Huang

    Abstract: Communication is important in many multi-agent reinforcement learning (MARL) problems for agents to share information and make good decisions. However, when deploying trained communicative agents in a real-world application where noise and potential attackers exist, the safety of communication-based policies becomes a severe issue that is underexplored. Specifically, if communication messages are… ▽ More

    Submitted 2 July, 2022; v1 submitted 21 June, 2022; originally announced June 2022.

  44. arXiv:2206.02107  [pdf, other

    cs.LG

    Interpretable Mixture of Experts

    Authors: Aya Abdelsalam Ismail, Sercan Ö. Arik, Jinsung Yoon, Ankur Taly, Soheil Feizi, Tomas Pfister

    Abstract: The need for reliable model explanations is prominent for many machine learning applications, particularly for tabular and time-series data as their use cases often involve high-stakes decision making. Towards this goal, we introduce a novel interpretable modeling framework, Interpretable Mixture of Experts (IME), that yields high accuracy, comparable to `black-box' Deep Neural Networks (DNNs) in… ▽ More

    Submitted 25 May, 2023; v1 submitted 5 June, 2022; originally announced June 2022.

  45. arXiv:2203.15566  [pdf, other

    cs.CV cs.AI

    Core Risk Minimization using Salient ImageNet

    Authors: Sahil Singla, Mazda Moayeri, Soheil Feizi

    Abstract: Deep neural networks can be unreliable in the real world especially when they heavily use spurious features for their predictions. Recently, Singla & Feizi (2022) introduced the Salient Imagenet dataset by annotating and localizing core and spurious features of ~52k samples from 232 classes of Imagenet. While this dataset is useful for evaluating the reliance of pretrained models on spurious featu… ▽ More

    Submitted 27 March, 2022; originally announced March 2022.

  46. arXiv:2203.08945  [pdf, other

    cs.LG cs.CR

    Provable Adversarial Robustness for Fractional Lp Threat Models

    Authors: Alexander Levine, Soheil Feizi

    Abstract: In recent years, researchers have extensively studied adversarial robustness in a variety of threat models, including L_0, L_1, L_2, and L_infinity-norm bounded adversarial attacks. However, attacks bounded by fractional L_p "norms" (quasi-norms defined by the L_p distance with 0<p<1) have yet to be thoroughly considered. We proactively propose a defense with several desirable properties: it provi… ▽ More

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: AISTATS 2022 accepted paper

  47. arXiv:2203.01881  [pdf, other

    cs.LG cs.AI cs.CV

    Measuring Self-Supervised Representation Quality for Downstream Classification using Discriminative Features

    Authors: Neha Kalibhat, Kanika Narang, Hamed Firooz, Maziar Sanjabi, Soheil Feizi

    Abstract: Self-supervised learning (SSL) has shown impressive results in downstream classification tasks. However, there is limited work in understanding their failure modes and interpreting their learned representations. In this paper, we study the representation space of state-of-the-art self-supervised models including SimCLR, SwaV, MoCo, BYOL, DINO, SimSiam, VICReg and Barlow Twins. Without the use of c… ▽ More

    Submitted 12 December, 2023; v1 submitted 3 March, 2022; originally announced March 2022.

    Comments: Published at AAAI 2024

  48. arXiv:2202.02628  [pdf, other

    cs.LG cs.CR stat.ML

    Improved Certified Defenses against Data Poisoning with (Deterministic) Finite Aggregation

    Authors: Wenxiao Wang, Alexander Levine, Soheil Feizi

    Abstract: Data poisoning attacks aim at manipulating model behaviors through distorting training data. Previously, an aggregation-based certified defense, Deep Partition Aggregation (DPA), was proposed to mitigate this threat. DPA predicts through an aggregation of base classifiers trained on disjoint subsets of data, thus restricting its sensitivity to dataset distortions. In this work, we propose an impro… ▽ More

    Submitted 14 July, 2022; v1 submitted 5 February, 2022; originally announced February 2022.

    Comments: International Conference on Machine Learning (ICML), 2022

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, PMLR 162:22769-22783, 2022

  49. arXiv:2201.12440  [pdf, other

    cs.LG

    Certifying Model Accuracy under Distribution Shifts

    Authors: Aounon Kumar, Alexander Levine, Tom Goldstein, Soheil Feizi

    Abstract: Certified robustness in machine learning has primarily focused on adversarial perturbations of the input with a fixed attack budget for each point in the data distribution. In this work, we present provable robustness guarantees on the accuracy of a model under bounded Wasserstein shifts of the data distribution. We show that a simple procedure that randomizes the input of the model within a trans… ▽ More

    Submitted 16 July, 2023; v1 submitted 28 January, 2022; originally announced January 2022.

  50. arXiv:2201.10766  [pdf, other

    cs.CV

    A Comprehensive Study of Image Classification Model Sensitivity to Foregrounds, Backgrounds, and Visual Attributes

    Authors: Mazda Moayeri, Phillip Pope, Yogesh Balaji, Soheil Feizi

    Abstract: While datasets with single-label supervision have propelled rapid advances in image classification, additional annotations are necessary in order to quantitatively assess how models make predictions. To this end, for a subset of ImageNet samples, we collect segmentation masks for the entire object and $18$ informative attributes. We call this dataset RIVAL10 (RIch Visual Attributes with Localizati… ▽ More

    Submitted 26 January, 2022; originally announced January 2022.