Skip to main content

Showing 1–50 of 57 results for author: Bibi, A

  1. arXiv:2406.14563  [pdf, other

    cs.CL cs.AI cs.LG

    Model Merging and Safety Alignment: One Bad Model Spoils the Bunch

    Authors: Hasan Abed Al Kader Hammoud, Umberto Michieli, Fabio Pizzati, Philip Torr, Adel Bibi, Bernard Ghanem, Mete Ozay

    Abstract: Merging Large Language Models (LLMs) is a cost-effective technique for combining multiple expert LLMs into a single versatile model, retaining the expertise of the original ones. However, current approaches often overlook the importance of safety alignment during merging, leading to highly misaligned models. This work investigates the effects of model merging on alignment. We evaluate several popu… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Under review

  2. arXiv:2406.10288  [pdf, other

    cs.CL cs.LG

    Mimicking User Data: On Mitigating Fine-Tuning Risks in Closed Large Language Models

    Authors: Francisco Eiras, Aleksandar Petrov, Phillip H. S. Torr, M. Pawan Kumar, Adel Bibi

    Abstract: Fine-tuning large language models on small, high-quality datasets can enhance their performance on specific downstream tasks. Recent research shows that fine-tuning on benign, instruction-following data can inadvertently undo the safety alignment process and increase a model's propensity to comply with harmful queries. Although critical, understanding and mitigating safety risks in well-defined ta… ▽ More

    Submitted 1 July, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

  3. arXiv:2406.05222  [pdf, other

    cs.LG cs.NE

    Towards Interpretable Deep Local Learning with Successive Gradient Reconciliation

    Authors: Yibo Yang, Xiaojie Li, Motasem Alfarra, Hasan Hammoud, Adel Bibi, Philip Torr, Bernard Ghanem

    Abstract: Relieving the reliance of neural network training on a global back-propagation (BP) has emerged as a notable research topic due to the biological implausibility and huge memory consumption caused by BP. Among the existing solutions, local learning optimizes gradient-isolated modules of a neural network with local errors and has been proved to be effective even on large-scale datasets. However, the… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: ICML 2024

  4. arXiv:2406.01424  [pdf, other

    cs.LG cs.AI cs.CL

    Universal In-Context Approximation By Prompting Fully Recurrent Models

    Authors: Aleksandar Petrov, Tom A. Lamb, Alasdair Paren, Philip H. S. Torr, Adel Bibi

    Abstract: Zero-shot and in-context learning enable solving tasks without model fine-tuning, making them essential for developing generative model solutions. Therefore, it is crucial to understand whether a pretrained model can be prompted to approximate any function, i.e., whether it is a universal in-context approximator. While it was recently shown that transformer models do possess this property, these r… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  5. arXiv:2405.13922  [pdf, other

    cs.LG stat.ML

    Towards Certification of Uncertainty Calibration under Adversarial Attacks

    Authors: Cornelius Emde, Francesco Pinto, Thomas Lukasiewicz, Philip H. S. Torr, Adel Bibi

    Abstract: Since neural classifiers are known to be sensitive to adversarial perturbations that alter their accuracy, \textit{certification methods} have been developed to provide provable guarantees on the insensitivity of their predictions to such perturbations. Furthermore, in safety-critical applications, the frequentist interpretation of the confidence of a classifier (also known as model calibration) c… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 11 pages main paper, appendix included

  6. arXiv:2405.08597  [pdf, other

    cs.LG

    Risks and Opportunities of Open-Source Generative AI

    Authors: Francisco Eiras, Aleksandar Petrov, Bertie Vidgen, Christian Schroeder, Fabio Pizzati, Katherine Elkins, Supratik Mukhopadhyay, Adel Bibi, Aaron Purewal, Csaba Botos, Fabro Steibel, Fazel Keshtkar, Fazl Barez, Genevieve Smith, Gianluca Guadagni, Jon Chun, Jordi Cabot, Joseph Imperial, Juan Arturo Nolazco, Lori Landay, Matthew Jackson, Phillip H. S. Torr, Trevor Darrell, Yong Lee, Jakob Foerster

    Abstract: Applications of Generative AI (Gen AI) are expected to revolutionize a number of different areas, ranging from science & medicine to education. The potential for these seismic changes has triggered a lively debate about the potential risks of the technology, and resulted in calls for tighter regulation, in particular from some of the major tech companies who are leading in AI development. This reg… ▽ More

    Submitted 29 May, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

    Comments: Extension of arXiv:2404.17047

  7. arXiv:2404.17047  [pdf, other

    cs.LG

    Near to Mid-term Risks and Opportunities of Open-Source Generative AI

    Authors: Francisco Eiras, Aleksandar Petrov, Bertie Vidgen, Christian Schroeder de Witt, Fabio Pizzati, Katherine Elkins, Supratik Mukhopadhyay, Adel Bibi, Botos Csaba, Fabro Steibel, Fazl Barez, Genevieve Smith, Gianluca Guadagni, Jon Chun, Jordi Cabot, Joseph Marvin Imperial, Juan A. Nolazco-Flores, Lori Landay, Matthew Jackson, Paul Röttger, Philip H. S. Torr, Trevor Darrell, Yong Suk Lee, Jakob Foerster

    Abstract: In the next few years, applications of Generative AI are expected to revolutionize a number of different areas, ranging from science & medicine to education. The potential for these seismic changes has triggered a lively debate about potential risks and resulted in calls for tighter regulation, in particular from some of the major tech companies who are leading in AI development. This regulation i… ▽ More

    Submitted 24 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: Accepted to ICML'24 as a position paper

  8. arXiv:2404.12766  [pdf, other

    cs.LG cs.CV

    Continual Learning on a Diet: Learning from Sparsely Labeled Streams Under Constrained Computation

    Authors: Wenxuan Zhang, Youssef Mohamed, Bernard Ghanem, Philip H. S. Torr, Adel Bibi, Mohamed Elhoseiny

    Abstract: We propose and study a realistic Continual Learning (CL) setting where learning algorithms are granted a restricted computational budget per time step while training. We apply this setting to large-scale semi-supervised Continual Learning scenarios with sparse label rates. Previous proficient CL methods perform very poorly in this challenging setting. Overfitting to the sparse labeled data and ins… ▽ More

    Submitted 8 June, 2024; v1 submitted 19 April, 2024; originally announced April 2024.

  9. arXiv:2404.04125  [pdf, other

    cs.CV cs.CL cs.LG

    No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance

    Authors: Vishaal Udandarao, Ameya Prabhu, Adhiraj Ghosh, Yash Sharma, Philip H. S. Torr, Adel Bibi, Samuel Albanie, Matthias Bethge

    Abstract: Web-crawled pretraining datasets underlie the impressive "zero-shot" evaluation performance of multimodal models, such as CLIP for classification/retrieval and Stable-Diffusion for image generation. However, it is unclear how meaningful the notion of "zero-shot" generalization is for such multimodal models, as it is not known to what extent their pretraining datasets encompass the downstream conce… ▽ More

    Submitted 8 April, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

    Comments: Extended version of the short paper accepted at DPFM, ICLR'24

  10. arXiv:2403.13808  [pdf, other

    cs.CV cs.AI cs.LG

    On Pretraining Data Diversity for Self-Supervised Learning

    Authors: Hasan Abed Al Kader Hammoud, Tuhin Das, Fabio Pizzati, Philip Torr, Adel Bibi, Bernard Ghanem

    Abstract: We explore the impact of training with more diverse datasets, characterized by the number of unique samples, on the performance of self-supervised learning (SSL) under a fixed computational budget. Our findings consistently demonstrate that increasing pretraining data diversity enhances SSL performance, albeit only when the distribution distance to the downstream data is minimal. Notably, even wit… ▽ More

    Submitted 5 April, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

    Comments: Under review

  11. arXiv:2402.19472  [pdf, other

    cs.LG cs.CV

    Lifelong Benchmarks: Efficient Model Evaluation in an Era of Rapid Progress

    Authors: Ameya Prabhu, Vishaal Udandarao, Philip Torr, Matthias Bethge, Adel Bibi, Samuel Albanie

    Abstract: Standardized benchmarks drive progress in machine learning. However, with repeated testing, the risk of overfitting grows as algorithms over-exploit benchmark idiosyncrasies. In our work, we seek to mitigate this challenge by compiling ever-expanding large-scale benchmarks called Lifelong Benchmarks. As exemplars of our approach, we create Lifelong-CIFAR10 and Lifelong-ImageNet, containing (for no… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

  12. arXiv:2402.14753  [pdf, other

    cs.LG cs.AI math.FA

    Prompting a Pretrained Transformer Can Be a Universal Approximator

    Authors: Aleksandar Petrov, Philip H. S. Torr, Adel Bibi

    Abstract: Despite the widespread adoption of prompting, prompt tuning and prefix-tuning of transformer models, our theoretical understanding of these fine-tuning methods remains limited. A key question is whether one can arbitrarily modify the behavior of pretrained model by prompting or prefix-tuning it. Formally, whether prompting and prefix-tuning a pretrained model can universally approximate sequence-t… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  13. arXiv:2402.04559  [pdf, other

    cs.AI cs.CL cs.HC

    Can Large Language Model Agents Simulate Human Trust Behaviors?

    Authors: Chengxing Xie, Canyu Chen, Feiran Jia, Ziyu Ye, Kai Shu, Adel Bibi, Ziniu Hu, Philip Torr, Bernard Ghanem, Guohao Li

    Abstract: Large Language Model (LLM) agents have been increasingly adopted as simulation tools to model humans in applications such as social science. However, one fundamental question remains: can LLM agents really simulate human behaviors? In this paper, we focus on one of the most critical behaviors in human interactions, trust, and aim to investigate whether or not LLM agents can simulate human trust be… ▽ More

    Submitted 10 March, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: The first two authors contributed equally. Project website: https://www.camel-ai.org/research/agent-trust

  14. arXiv:2402.01832  [pdf, other

    cs.CV cs.AI cs.LG

    SynthCLIP: Are We Ready for a Fully Synthetic CLIP Training?

    Authors: Hasan Abed Al Kader Hammoud, Hani Itani, Fabio Pizzati, Philip Torr, Adel Bibi, Bernard Ghanem

    Abstract: We present SynthCLIP, a novel framework for training CLIP models with entirely synthetic text-image pairs, significantly departing from previous methods relying on real data. Leveraging recent text-to-image (TTI) generative networks and large language models (LLM), we are able to generate synthetic datasets of images and corresponding captions at any scale, with no human intervention. With trainin… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: Under review

  15. arXiv:2312.00923  [pdf, other

    cs.LG cs.CV

    Label Delay in Online Continual Learning

    Authors: Botos Csaba, Wenxuan Zhang, Matthias Müller, Ser-Nam Lim, Mohamed Elhoseiny, Philip Torr, Adel Bibi

    Abstract: Online continual learning, the process of training models on streaming data, has gained increasing attention in recent years. However, a critical aspect often overlooked is the label delay, where new data may not be labeled due to slow and costly annotation processes. We introduce a new continual learning framework with explicit modeling of the label delay between data and label streams over time… ▽ More

    Submitted 25 April, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: 17 pages, 12 figures

    ACM Class: I.4.0; I.4.10

  16. arXiv:2311.11293  [pdf, other

    cs.LG

    From Categories to Classifier: Name-Only Continual Learning by Exploring the Web

    Authors: Ameya Prabhu, Hasan Abed Al Kader Hammoud, Ser-Nam Lim, Bernard Ghanem, Philip H. S. Torr, Adel Bibi

    Abstract: Continual Learning (CL) often relies on the availability of extensive annotated datasets, an assumption that is unrealistically time-consuming and costly in practice. We explore a novel paradigm termed name-only continual learning where time and cost constraints prohibit manual annotation. In this scenario, learners adapt to new category shifts using only category names without the luxury of annot… ▽ More

    Submitted 19 November, 2023; originally announced November 2023.

  17. arXiv:2310.19698  [pdf, other

    cs.LG cs.CL

    When Do Prompting and Prefix-Tuning Work? A Theory of Capabilities and Limitations

    Authors: Aleksandar Petrov, Philip H. S. Torr, Adel Bibi

    Abstract: Context-based fine-tuning methods, including prompting, in-context learning, soft prompting (also known as prompt tuning), and prefix-tuning, have gained popularity due to their ability to often match the performance of full fine-tuning with a fraction of the parameters. Despite their empirical successes, there is little theoretical understanding of how these techniques influence the internal comp… ▽ More

    Submitted 9 April, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: Accepted at ICLR 2024

  18. arXiv:2310.13479  [pdf, other

    cs.CV cs.LG

    Segment, Select, Correct: A Framework for Weakly-Supervised Referring Segmentation

    Authors: Francisco Eiras, Kemal Oksuz, Adel Bibi, Philip H. S. Torr, Puneet K. Dokania

    Abstract: Referring Image Segmentation (RIS) - the problem of identifying objects in images through natural language sentences - is a challenging task currently mostly solved through supervised learning. However, while collecting referred annotation masks is a time-consuming process, the few existing weakly-supervised and zero-shot approaches fall significantly short in performance compared to fully-supervi… ▽ More

    Submitted 23 October, 2023; v1 submitted 20 October, 2023; originally announced October 2023.

  19. arXiv:2305.15425  [pdf

    cs.CL cs.LG

    Language Model Tokenizers Introduce Unfairness Between Languages

    Authors: Aleksandar Petrov, Emanuele La Malfa, Philip H. S. Torr, Adel Bibi

    Abstract: Recent language models have shown impressive multilingual performance, even when not explicitly trained for it. Despite this, there are concerns about the quality of their outputs across different languages. In this paper, we show how disparity in the treatment of different languages arises at the tokenization stage, well before a model is even invoked. The same text translated into different lang… ▽ More

    Submitted 20 October, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: Published at NeurIPS 2023, Project webpage: https://aleksandarpetrov.github.io/tokenization-fairness, Code: https://github.com/AleksandarPetrov/tokenization-fairness

  20. arXiv:2305.10157  [pdf, other

    cs.LG math-ph

    Efficient Error Certification for Physics-Informed Neural Networks

    Authors: Francisco Eiras, Adel Bibi, Rudy Bunel, Krishnamurthy Dj Dvijotham, Philip Torr, M. Pawan Kumar

    Abstract: Recent work provides promising evidence that Physics-Informed Neural Networks (PINN) can efficiently solve partial differential equations (PDE). However, previous works have failed to provide guarantees on the worst-case residual error of a PINN across the spatio-temporal domain - a measure akin to the tolerance of numerical solvers - focusing instead on point-wise comparisons between their soluti… ▽ More

    Submitted 29 May, 2024; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: Accepted to ICML'24

  21. arXiv:2305.09275  [pdf, other

    cs.LG cs.AI cs.CV

    Rapid Adaptation in Online Continual Learning: Are We Evaluating It Right?

    Authors: Hasan Abed Al Kader Hammoud, Ameya Prabhu, Ser-Nam Lim, Philip H. S. Torr, Adel Bibi, Bernard Ghanem

    Abstract: We revisit the common practice of evaluating adaptation of Online Continual Learning (OCL) algorithms through the metric of online accuracy, which measures the accuracy of the model on the immediate next few samples. However, we show that this metric is unreliable, as even vacuous blind classifiers, which do not use input images for prediction, can achieve unrealistically high online accuracy by e… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

  22. arXiv:2304.13019  [pdf, other

    cs.LG

    Certifying Ensembles: A General Certification Theory with S-Lipschitzness

    Authors: Aleksandar Petrov, Francisco Eiras, Amartya Sanyal, Philip H. S. Torr, Adel Bibi

    Abstract: Improving and guaranteeing the robustness of deep learning models has been a topic of intense research. Ensembling, which combines several classifiers to provide a better model, has shown to be beneficial for generalisation, uncertainty estimation, calibration, and mitigating the effects of concept drift. However, the impact of ensembling on certified robustness is less well understood. In this wo… ▽ More

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: Accepted to ICML 2023

  23. arXiv:2303.13211  [pdf, other

    cs.CR cs.CV cs.LG

    Don't FREAK Out: A Frequency-Inspired Approach to Detecting Backdoor Poisoned Samples in DNNs

    Authors: Hasan Abed Al Kader Hammoud, Adel Bibi, Philip H. S. Torr, Bernard Ghanem

    Abstract: In this paper we investigate the frequency sensitivity of Deep Neural Networks (DNNs) when presented with clean samples versus poisoned samples. Our analysis shows significant disparities in frequency sensitivity between these two types of samples. Building on these findings, we propose FREAK, a frequency-based poisoned sample detection algorithm that is simple yet effective. Our experimental resu… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: Accepted at CVPRW (The Art of Robustness)

  24. arXiv:2303.11165  [pdf, other

    cs.LG cs.CV

    Computationally Budgeted Continual Learning: What Does Matter?

    Authors: Ameya Prabhu, Hasan Abed Al Kader Hammoud, Puneet Dokania, Philip H. S. Torr, Ser-Nam Lim, Bernard Ghanem, Adel Bibi

    Abstract: Continual Learning (CL) aims to sequentially train models on streams of incoming data that vary in distribution by preserving previous knowledge while adapting to new data. Current CL literature focuses on restricted access to previously seen data, while imposing no constraints on the computational budget for training. This is unreasonable for applications in-the-wild, where systems are primarily… ▽ More

    Submitted 14 July, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

    Comments: CVPR 2023

  25. arXiv:2302.01047  [pdf, other

    cs.LG cs.AI cs.CV

    Real-Time Evaluation in Online Continual Learning: A New Hope

    Authors: Yasir Ghunaim, Adel Bibi, Kumail Alhamoud, Motasem Alfarra, Hasan Abed Al Kader Hammoud, Ameya Prabhu, Philip H. S. Torr, Bernard Ghanem

    Abstract: Current evaluations of Continual Learning (CL) methods typically assume that there is no constraint on training time and computation. This is an unrealistic assumption for any real-world setting, which motivates us to propose: a practical real-time evaluation of continual learning, in which the stream does not wait for the model to complete training before revealing the next data for predictions.… ▽ More

    Submitted 24 March, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

    Comments: Accepted at CVPR'23 as Highlight (Top 2.5%)

  26. arXiv:2211.16234  [pdf, other

    cs.CV cs.LG

    SimCS: Simulation for Domain Incremental Online Continual Segmentation

    Authors: Motasem Alfarra, Zhipeng Cai, Adel Bibi, Bernard Ghanem, Matthias Müller

    Abstract: Continual Learning is a step towards lifelong intelligence where models continuously learn from recently collected data without forgetting previous knowledge. Existing continual learning approaches mostly focus on image classification in the class-incremental setup with clear task boundaries and unlimited computational budget. This work explores the problem of Online Domain-Incremental Continual S… ▽ More

    Submitted 15 February, 2024; v1 submitted 29 November, 2022; originally announced November 2022.

    Comments: Accepted to AAAI Conference on Artificial Intelligence (AAAI'24)

  27. arXiv:2209.13071  [pdf, other

    cs.CV

    Diversified Dynamic Routing for Vision Tasks

    Authors: Botos Csaba, Adel Bibi, Yanwei Li, Philip Torr, Ser-Nam Lim

    Abstract: Deep learning models for vision tasks are trained on large datasets under the assumption that there exists a universal representation that can be used to make predictions for all samples. Whereas high complexity models are proven to be capable of learning such representations, a mixture of experts trained on specific subsets of the data can infer the labels more efficiently. However using mixture… ▽ More

    Submitted 26 September, 2022; originally announced September 2022.

    Comments: 18 pages, 9 figures, ECCV, VIPriors

  28. arXiv:2207.10170  [pdf, other

    cs.AI

    Illusory Attacks: Information-Theoretic Detectability Matters in Adversarial Attacks

    Authors: Tim Franzmeyer, Stephen McAleer, João F. Henriques, Jakob N. Foerster, Philip H. S. Torr, Adel Bibi, Christian Schroeder de Witt

    Abstract: Autonomous agents deployed in the real world need to be robust against adversarial attacks on sensory inputs. Robustifying agent policies requires anticipating the strongest attacks possible. We demonstrate that existing observation-space attacks on reinforcement learning agents have a common weakness: while effective, their lack of information-theoretic detectability constraints makes them detect… ▽ More

    Submitted 6 May, 2024; v1 submitted 20 July, 2022; originally announced July 2022.

    Comments: ICLR 2024 Spotlight (top 5%)

  29. arXiv:2206.08242  [pdf, other

    cs.LG cs.AI cs.CV

    Catastrophic overfitting can be induced with discriminative non-robust features

    Authors: Guillermo Ortiz-Jiménez, Pau de Jorge, Amartya Sanyal, Adel Bibi, Puneet K. Dokania, Pascal Frossard, Gregory Rogéz, Philip H. S. Torr

    Abstract: Adversarial training (AT) is the de facto method for building robust neural networks, but it can be computationally expensive. To mitigate this, fast single-step attacks can be used, but this may lead to catastrophic overfitting (CO). This phenomenon appears when networks gain non-trivial robustness during the first stages of AT, but then reach a breaking point where they become vulnerable in just… ▽ More

    Submitted 15 August, 2023; v1 submitted 16 June, 2022; originally announced June 2022.

    Comments: Published in Transactions on Machine Learning Research (TMLR)

  30. arXiv:2202.01181  [pdf, other

    cs.LG cs.CV

    Make Some Noise: Reliable and Efficient Single-Step Adversarial Training

    Authors: Pau de Jorge, Adel Bibi, Riccardo Volpi, Amartya Sanyal, Philip H. S. Torr, Grégory Rogez, Puneet K. Dokania

    Abstract: Recently, Wong et al. showed that adversarial training with single-step FGSM leads to a characteristic failure mode named Catastrophic Overfitting (CO), in which a model becomes suddenly vulnerable to multi-step attacks. Experimentally they showed that simply adding a random perturbation prior to FGSM (RS-FGSM) could prevent CO. However, Andriushchenko and Flammarion observed that RS-FGSM still le… ▽ More

    Submitted 17 October, 2022; v1 submitted 2 February, 2022; originally announced February 2022.

    Comments: Published in NeurIPS 2022

  31. arXiv:2107.04570  [pdf, other

    cs.LG cs.CV

    ANCER: Anisotropic Certification via Sample-wise Volume Maximization

    Authors: Francisco Eiras, Motasem Alfarra, M. Pawan Kumar, Philip H. S. Torr, Puneet K. Dokania, Bernard Ghanem, Adel Bibi

    Abstract: Randomized smoothing has recently emerged as an effective tool that enables certification of deep neural network classifiers at scale. All prior art on randomized smoothing has focused on isotropic $\ell_p$ certification, which has the advantage of yielding certificates that can be easily compared among isotropic methods via $\ell_p$-norm radius. However, isotropic certification limits the region… ▽ More

    Submitted 31 August, 2022; v1 submitted 9 July, 2021; originally announced July 2021.

    Comments: First two authors and the last one contributed equally to this work

  32. arXiv:2107.00996  [pdf, other

    cs.LG stat.ML

    DeformRS: Certifying Input Deformations with Randomized Smoothing

    Authors: Motasem Alfarra, Adel Bibi, Naeemullah Khan, Philip H. S. Torr, Bernard Ghanem

    Abstract: Deep neural networks are vulnerable to input deformations in the form of vector fields of pixel displacements and to other parameterized geometric deformations e.g. translations, rotations, etc. Current input deformation certification methods either 1. do not scale to deep networks on large input datasets, or 2. can only certify a specific class of deformations, e.g. only rotations. We reformulate… ▽ More

    Submitted 19 December, 2021; v1 submitted 2 July, 2021; originally announced July 2021.

    Comments: Accepted to AAAI Conference on Artificial Intelligence (AAAI'22)

  33. arXiv:2103.14347  [pdf, other

    cs.LG cs.CV

    Combating Adversaries with Anti-Adversaries

    Authors: Motasem Alfarra, Juan C. Pérez, Ali Thabet, Adel Bibi, Philip H. S. Torr, Bernard Ghanem

    Abstract: Deep neural networks are vulnerable to small input perturbations known as adversarial attacks. Inspired by the fact that these adversaries are constructed by iteratively minimizing the confidence of a network for the true class label, we propose the anti-adversary layer, aimed at countering this effect. In particular, our layer generates an input perturbation in the opposite direction of the adver… ▽ More

    Submitted 16 December, 2021; v1 submitted 26 March, 2021; originally announced March 2021.

    Comments: Accepted to AAAI Conference on Artificial Intelligence (AAAI'22)

  34. arXiv:2012.04351  [pdf, other

    cs.LG

    Data-Dependent Randomized Smoothing

    Authors: Motasem Alfarra, Adel Bibi, Philip H. S. Torr, Bernard Ghanem

    Abstract: Randomized smoothing is a recent technique that achieves state-of-art performance in training certifiably robust deep neural networks. While the smoothing family of distributions is often connected to the choice of the norm used for certification, the parameters of these distributions are always set as global hyper parameters independent from the input data on which a network is certified. In this… ▽ More

    Submitted 5 July, 2022; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: Accepted in Uncertainty in Artificial Intelligence Conference (UAI 2022). First two authors contributed equally to this work

  35. arXiv:2006.11776  [pdf, other

    cs.LG cs.CR stat.ML

    Network Moments: Extensions and Sparse-Smooth Attacks

    Authors: Modar Alfadly, Adel Bibi, Emilio Botero, Salman Alsubaihi, Bernard Ghanem

    Abstract: The impressive performance of deep neural networks (DNNs) has immensely strengthened the line of research that aims at theoretically analyzing their effectiveness. This has incited research on the reaction of DNNs to noisy input, namely developing adversarial input attacks and strategies that lead to robust DNNs to these attacks. To that end, in this paper, we derive exact analytic expressions for… ▽ More

    Submitted 21 June, 2020; originally announced June 2020.

  36. arXiv:2006.07682  [pdf, other

    cs.LG stat.ML

    Rethinking Clustering for Robustness

    Authors: Motasem Alfarra, Juan C. Pérez, Adel Bibi, Ali Thabet, Pablo Arbeláez, Bernard Ghanem

    Abstract: This paper studies how encouraging semantically-aligned features during deep neural network training can increase network robustness. Recent works observed that Adversarial Training leads to robust models, whose learnt features appear to correlate with human perception. Inspired by this connection from robustness to semantics, we study the complementary connection: from semantics to robustness. To… ▽ More

    Submitted 19 November, 2021; v1 submitted 13 June, 2020; originally announced June 2020.

    Comments: Accepted to the 32nd British Machine Vision Conference (BMVC'21)

  37. arXiv:2002.08838  [pdf, other

    cs.LG stat.ML

    On the Decision Boundaries of Neural Networks: A Tropical Geometry Perspective

    Authors: Motasem Alfarra, Adel Bibi, Hasan Hammoud, Mohamed Gaafar, Bernard Ghanem

    Abstract: This work tackles the problem of characterizing and understanding the decision boundaries of neural networks with piecewise linear non-linearity activations. We use tropical geometry, a new development in the area of algebraic geometry, to characterize the decision boundaries of a simple network of the form (Affine, ReLU, Affine). Our main finding is that the decision boundaries are a subset of a… ▽ More

    Submitted 22 August, 2022; v1 submitted 20 February, 2020; originally announced February 2020.

    Comments: First two authors contributed equally to this work

  38. arXiv:1912.05661  [pdf, other

    cs.CV

    Gabor Layers Enhance Network Robustness

    Authors: Juan C. Pérez, Motasem Alfarra, Guillaume Jeanneret, Adel Bibi, Ali Thabet, Bernard Ghanem, Pablo Arbeláez

    Abstract: We revisit the benefits of merging classical vision concepts with deep learning models. In particular, we explore the effect on robustness against adversarial attacks of replacing the first layers of various deep architectures with Gabor layers, i.e. convolutional layers with filters that are based on learnable Gabor parameters. We observe that architectures enhanced with Gabor layers gain a consi… ▽ More

    Submitted 27 March, 2020; v1 submitted 11 December, 2019; originally announced December 2019.

    Comments: 32 pages, 23 figures, 14 tables

  39. arXiv:1907.10410  [pdf, other

    cs.LG stat.ML

    Constrained Clustering: General Pairwise and Cardinality Constraints

    Authors: Adel Bibi, Ali Alqahtani, Bernard Ghanem

    Abstract: We study constrained clustering, where constraints guide the clustering process. In existing works, two categories of constraints have been widely explored, namely pairwise and cardinality constraints. Pairwise constraints enforce the cluster labels of two instances to be the same (must-link constraints) or different (cannot-link constraints). Cardinality constraints encourage cluster sizes to sat… ▽ More

    Submitted 27 January, 2023; v1 submitted 24 July, 2019; originally announced July 2019.

  40. arXiv:1905.12418  [pdf, other

    cs.LG cs.CR stat.ML

    Expected Tight Bounds for Robust Training

    Authors: Salman Alsubaihi, Adel Bibi, Modar Alfadly, Abdullah Hamdi, Bernard Ghanem

    Abstract: Training Deep Neural Networks that are robust to norm bounded adversarial attacks remains an elusive problem. While exact and inexact verification-based methods are generally too expensive to train large networks, it was demonstrated that bounded input intervals can be inexpensively propagated from a layer to another through deep networks. This interval bound propagation approach (IBP) not only ha… ▽ More

    Submitted 12 June, 2021; v1 submitted 28 May, 2019; originally announced May 2019.

    Comments: Presented as a RobustML workshop paper at ICLR 2021

  41. arXiv:1904.11005  [pdf, other

    cs.CV cs.LG

    Analytical Moment Regularizer for Gaussian Robust Networks

    Authors: Modar Alfadly, Adel Bibi, Bernard Ghanem

    Abstract: Despite the impressive performance of deep neural networks (DNNs) on numerous vision tasks, they still exhibit yet-to-understand uncouth behaviours. One puzzling behaviour is the subtle sensitive reaction of DNNs to various noise attacks. Such a nuisance has strengthened the line of research around developing and training noise-robust networks. In this work, we propose a new training regularizer t… ▽ More

    Submitted 24 April, 2019; originally announced April 2019.

  42. arXiv:1803.10794  [pdf, other

    cs.CV cs.RO

    TrackingNet: A Large-Scale Dataset and Benchmark for Object Tracking in the Wild

    Authors: Matthias Müller, Adel Bibi, Silvio Giancola, Salman Al-Subaihi, Bernard Ghanem

    Abstract: Despite the numerous developments in object tracking, further development of current tracking algorithms is limited by small and mostly saturated datasets. As a matter of fact, data-hungry trackers based on deep-learning currently rely on object detection datasets due to the scarcity of dedicated large-scale tracking datasets. In this work, we present TrackingNet, the first large-scale dataset and… ▽ More

    Submitted 28 March, 2018; originally announced March 2018.

    Comments: preprint

  43. arXiv:1311.3746  [pdf, ps, other

    cs.NI

    Investigating Quality Routing Link Metrics in Wireless Multi-hop Networks

    Authors: N. Javaid, A. BiBi, A. Javaid, Z. A. Khan, K. Latif, M. Ishfaq

    Abstract: In this paper, we propose a new Quality Link Metric (QLM), ``Inverse Expected Transmission Count (InvETX)'' in Optimized Link State Routing (OLSR) protocol. Then we compare performance of three existing QLMs which are based on loss probability measurements; Expected Transmission Count (ETX), Minimum Delay (MD), Minimum Loss (ML) in Static Wireless Multi-hop Networks (SWMhNs). A novel contribution… ▽ More

    Submitted 15 November, 2013; originally announced November 2013.

    Comments: Journal of Annales of Telecommunications, 2013. arXiv admin note: substantial text overlap with arXiv:1108.3706

  44. Modeling Enhancements in DSR, FSR, OLSR under Mobility and Scalability Constraints in VANETs

    Authors: N. Javaid, A. Bibi, S. H. Bouk, A. Javaid, I. Sasase

    Abstract: Frequent topological changes due to high mobility is one of the main issues in Vehicular Ad-hoc NETworks (VANETs). In this paper, we model transmission probabilities of 802.11p for VANETs and effect of these probabilities on average transmission time. To evaluate the effect of these probabilities of VANETs in routing protocols, we select Dynamic Source Routing (DSR), Fish-eye State Routing (FSR) a… ▽ More

    Submitted 29 July, 2012; originally announced July 2012.

    Journal ref: 3rd International Workshop on Towards Samart Communications and Networks Technologies (SaCoNet2012) in conjunction with 48th IEEE International Conference on Communications (ICC2012), Ottawa, Canada, 2012

  45. arXiv:1207.2609  [pdf, ps, other

    cs.NI

    Survey of Extended LEACH-Based Clustering Routing Protocols for Wireless Sensor Networks

    Authors: M. Aslam, N. Javaid, A. Rahim, U. Nazir, A. Bibi, Z. A. Khan

    Abstract: An energy efficient routing protocol is the major concern in Wireless Sensor Networks (WSNs). In this survey paper, we present energy efficient hierarchical routing protocols, developed from conventional LEACH routing protocol. Main focus of our study is how these extended protocols work in order to increase the life time and how quality routing protocol are improved for WSNs. Furthermore, this pa… ▽ More

    Submitted 11 July, 2012; originally announced July 2012.

    Journal ref: 5th AHPCN in conjunction with 14th HPCC-2012, Liverpool, UK

  46. arXiv:1207.2577  [pdf, ps, other

    cs.NI

    Noise Filtering, Channel Modeling and Energy Utilization in Wireless Body Area Networks

    Authors: B. Manzoor, N. Javaid, A. Bibi, Z. A. Khan, M. Tahir

    Abstract: Constant monitoring of patients without disturbing their daily activities can be achieved through mobile networks. Sensor nodes distributed in a home environment to provide home assistance gives concept of Wireless Wearable Body Area Networks. Gathering useful information and its transmission to the required destination may face several problems. In this paper we figure out different issues and di… ▽ More

    Submitted 11 July, 2012; originally announced July 2012.

    Journal ref: 3rd ESA in conjunction with 9th ICESS-2012, Liverpool, UK

  47. arXiv:1207.2240  [pdf, ps, other

    cs.NI

    Ubiquitous HealthCare in Wireless Body Area Networks

    Authors: N. A. Khan, N. Javaid, Z. A. Khan, M. Jaffar, U. Rafiq, A. Bibi

    Abstract: Recent advances in wireless communications, system on chip and low power sensor nodes allow realization of Wireless Body Area Networks (WBANs).WBANs comprise of tiny sensors, which collect information of a patient's vital signs and provide a real time feedback. In addition,WBANs also support many applications including ubiquitous healthcare, entertainment, gaming, military, etc. Ubiquitous healthc… ▽ More

    Submitted 10 July, 2012; originally announced July 2012.

  48. DSDV, DYMO, OLSR: Link Duration and Path Stability

    Authors: S. Kumar, N. Javaid, Z. Yousuf, H. Kumar, Z. A. Khan, A. Bibi

    Abstract: In this paper, we evaluate and compare the impact of link duration and path stability of routing protocols; Destination Sequence Distance vector (DSDV), Dynamic MANET On- Demand (DYMO) and Optimized Link State Routing (OLSR) at different number of connections and node density. In order to improve the efficiency of selected protocols; we enhance DYMO and OLSR. Simulation and comparison of both defa… ▽ More

    Submitted 7 July, 2012; originally announced July 2012.

    Journal ref: Multicom2012 held in conjunction with the 11th IEEE International Conference on Ubiquitous Computing and Communications (IUCC-2012) (25 - 27 June 2012, Liverpool, UK)

  49. Analysis and Modeling Experiment Performance Parameters of Routing Protocols in MANETs and VANETs

    Authors: S. Sagar, N. Javaid, Z. A. Khan, J. Saqib, A. Bibi, S. H. Bouk

    Abstract: In this paper, a framework for experimental parameters in which Packet Delivery Ratio (PDR), effect of link duration over End-to-End Delay (E2ED) and Normalized Routing Overhead (NRO) in terms of control packets is analyzed and modeled for Mobile Ad-Hoc NETworks (MANETs) and Vehicular Ad-Hoc NETworks (VANETs) with the assumption that nodes (vehicles) are sparsely moving in two different road. More… ▽ More

    Submitted 7 July, 2012; originally announced July 2012.

    Journal ref: Multicom2012 held in conjunction with 11th IEEE International Conference on Ubiquitous Computing and Communications (IUCC-2012) (25 - 27 June 2012, Liverpool, UK)

  50. arXiv:1207.1702  [pdf, ps, other

    cs.NI

    Performance Study of Localization Techniques in Wireless Body Area Sensor Networks

    Authors: Obaid ur Rehman, Nadeem Javaid, Ayesha Bibi, Zahoor Ali Khan

    Abstract: One of the major issues in Wireless Body Area Sensor Networks (WBASNs) is efficient localization. There are various techniques for indoor and outdoor environments to locate a person. This study evaluating and compares performance of optimization schemes in indoor environments for optimal placement of wireless sensors, where patients can perform their daily activities. In indoor environments, the p… ▽ More

    Submitted 6 July, 2012; originally announced July 2012.

    Comments: AUCN in conjunction with 11th IUCC-2012, Liverpool, UK