Skip to main content

Showing 1–46 of 46 results for author: Ebrahimi, S

  1. arXiv:2407.11774  [pdf, other

    cs.CL cs.AI

    Sharif-MGTD at SemEval-2024 Task 8: A Transformer-Based Approach to Detect Machine Generated Text

    Authors: Seyedeh Fatemeh Ebrahimi, Karim Akhavan Azari, Amirmasoud Iravani, Arian Qazvini, Pouya Sadeghi, Zeinab Sadat Taghavi, Hossein Sameti

    Abstract: Detecting Machine-Generated Text (MGT) has emerged as a significant area of study within Natural Language Processing. While language models generate text, they often leave discernible traces, which can be scrutinized using either traditional feature-based methods or more advanced neural language models. In this research, we explore the effectiveness of fine-tuning a RoBERTa-base transformer, a pow… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 8 pages, 3 figures, 2 tables. Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024)

  2. arXiv:2405.18654  [pdf, other

    cs.CV

    Mitigating Object Hallucination via Data Augmented Contrastive Tuning

    Authors: Pritam Sarkar, Sayna Ebrahimi, Ali Etemad, Ahmad Beirami, Sercan Ö. Arık, Tomas Pfister

    Abstract: Despite their remarkable progress, Multimodal Large Language Models (MLLMs) tend to hallucinate factually inaccurate information. In this work, we address object hallucinations in MLLMs, where information is offered about an object that is not present in the model input. We introduce a contrastive tuning method that can be applied to a pretrained off-the-shelf MLLM for mitigating hallucinations wh… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  3. arXiv:2404.16789  [pdf, other

    cs.LG cs.AI cs.CL

    Continual Learning of Large Language Models: A Comprehensive Survey

    Authors: Haizhou Shi, Zihao Xu, Hengyi Wang, Weiyi Qin, Wenyuan Wang, Yibin Wang, Zifeng Wang, Sayna Ebrahimi, Hao Wang

    Abstract: The recent success of large language models (LLMs) trained on static, pre-collected, general datasets has sparked numerous research directions and applications. One such direction addresses the non-trivial challenge of integrating pre-trained LLMs into dynamic data distributions, task structures, and user preferences. Pre-trained LLMs, when tailored for specific needs, often experience significant… ▽ More

    Submitted 29 June, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: 47 pages, 2 figures, 4 tables. Work in progress

  4. arXiv:2404.11782  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    REQUAL-LM: Reliability and Equity through Aggregation in Large Language Models

    Authors: Sana Ebrahimi, Nima Shahbazi, Abolfazl Asudeh

    Abstract: The extensive scope of large language models (LLMs) across various domains underscores the critical importance of responsibility in their application, beyond natural language processing. In particular, the randomized nature of LLMs, coupled with inherent biases and historical stereotypes in data, raises critical concerns regarding reliability and equity. Addressing these challenges are necessary b… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  5. arXiv:2403.00198  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    AXOLOTL: Fairness through Assisted Self-Debiasing of Large Language Model Outputs

    Authors: Sana Ebrahimi, Kaiwen Chen, Abolfazl Asudeh, Gautam Das, Nick Koudas

    Abstract: Pre-trained Large Language Models (LLMs) have significantly advanced natural language processing capabilities but are susceptible to biases present in their training data, leading to unfair outcomes in various applications. While numerous strategies have been proposed to mitigate bias, they often require extensive computational resources and may compromise model performance. In this work, we intro… ▽ More

    Submitted 29 February, 2024; originally announced March 2024.

  6. arXiv:2402.11363  [pdf, other

    q-bio.QM cs.AI

    Transformer-based de novo peptide sequencing for data-independent acquisition mass spectrometry

    Authors: Shiva Ebrahimi, Xuan Guo

    Abstract: Tandem mass spectrometry (MS/MS) stands as the predominant high-throughput technique for comprehensively analyzing protein content within biological samples. This methodology is a cornerstone driving the advancement of proteomics. In recent years, substantial strides have been made in Data-Independent Acquisition (DIA) strategies, facilitating impartial and non-targeted fragmentation of precursor… ▽ More

    Submitted 26 June, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

    Comments: Ebrahimi S., Guo X. Transformer-based de novo peptide sequencing for data-independent acquisition mass spectrometry. In 2023 IEEE 23rd International Conference on Bioinformatics and Bioengineering (BIBE) 2022 Dec 6 (pp. 17-22). IEEE

  7. arXiv:2402.04879  [pdf, other

    cs.SI

    Comparing Methods for Creating a National Random Sample of Twitter Users

    Authors: Meysam Alizadeh, Darya Zare, Zeynab Samei, Mohammadamin Alizadeh, Mael Kubli, Mohammadhadi Aliahmadi, Sarvenaz Ebrahimi, Fabrizio Gilardi

    Abstract: Twitter data has been widely used by researchers across various social and computer science disciplines. A common aim when working with Twitter data is the construction of a random sample of users from a given country. However, while several methods have been proposed in the literature, their comparative performance is mostly unexplored. In this paper, we implement four common methods to collect a… ▽ More

    Submitted 11 March, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

  8. arXiv:2312.01279  [pdf, other

    cs.CL cs.AI cs.LG

    TextGenSHAP: Scalable Post-hoc Explanations in Text Generation with Long Documents

    Authors: James Enouen, Hootan Nakhost, Sayna Ebrahimi, Sercan O Arik, Yan Liu, Tomas Pfister

    Abstract: Large language models (LLMs) have attracted huge interest in practical applications given their increasingly accurate responses and coherent reasoning abilities. Given their nature as black-boxes using complex reasoning processes on their inputs, it is inevitable that the demand for scalable and faithful explanations for LLMs' generated content will continue to grow. There have been major developm… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

  9. arXiv:2310.11689  [pdf, other

    cs.CL cs.LG

    Adaptation with Self-Evaluation to Improve Selective Prediction in LLMs

    Authors: Jiefeng Chen, Jinsung Yoon, Sayna Ebrahimi, Sercan O Arik, Tomas Pfister, Somesh Jha

    Abstract: Large language models (LLMs) have recently shown great advances in a variety of tasks, including natural language understanding and generation. However, their use in high-stakes decision-making scenarios is still limited due to the potential for errors. Selective prediction is a technique that can be used to improve the reliability of the LLMs by allowing them to abstain from making predictions wh… ▽ More

    Submitted 11 November, 2023; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: Paper published at Findings of the Association for Computational Linguistics: EMNLP, 2023

  10. arXiv:2310.05269  [pdf, other

    cs.LG cs.AI cs.CR cs.DC

    Federated Learning: A Cutting-Edge Survey of the Latest Advancements and Applications

    Authors: Azim Akhtarshenas, Mohammad Ali Vahedifar, Navid Ayoobi, Behrouz Maham, Tohid Alizadeh, Sina Ebrahimi, David López-Pérez

    Abstract: Robust machine learning (ML) models can be developed by leveraging large volumes of data and distributing the computational tasks across numerous devices or servers. Federated learning (FL) is a technique in the realm of ML that facilitates this goal by utilizing cloud infrastructure to enable collaborative model training among a network of decentralized devices. Beyond distributing the computatio… ▽ More

    Submitted 25 May, 2024; v1 submitted 8 October, 2023; originally announced October 2023.

  11. arXiv:2308.13703  [pdf, other

    cs.LG

    PAITS: Pretraining and Augmentation for Irregularly-Sampled Time Series

    Authors: Nicasia Beebe-Wang, Sayna Ebrahimi, Jinsung Yoon, Sercan O. Arik, Tomas Pfister

    Abstract: Real-world time series data that commonly reflect sequential human behavior are often uniquely irregularly sampled and sparse, with highly nonuniform sampling over time and entities. Yet, commonly-used pretraining and augmentation methods for time series are not specifically designed for such scenarios. In this paper, we present PAITS (Pretraining and Augmentation for Irregularly-sampled Time Seri… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

    Comments: Code: \url{https://github.com/google-research/google-research/tree/master/irregular_timeseries_pretraining}

  12. arXiv:2306.09293  [pdf, other

    cs.LG cs.AI

    [Experiments & Analysis] Evaluating the Feasibility of Sampling-Based Techniques for Training Multilayer Perceptrons

    Authors: Sana Ebrahimi, Rishi Advani, Abolfazl Asudeh

    Abstract: The training process of neural networks is known to be time-consuming, and having a deep architecture only aggravates the issue. This process consists mostly of matrix operations, among which matrix multiplication is the bottleneck. Several sampling-based techniques have been proposed for speeding up the training time of deep neural networks by approximating the matrix products. These techniques f… ▽ More

    Submitted 20 June, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

  13. arXiv:2305.19157  [pdf, other

    cs.RO

    Sensor Fault Detection and Compensation with Performance Prescription for Robotic Manipulators

    Authors: S. Mohammadreza Ebrahimi, Farid Norouzi, Hossein Dastres, Reza Faieghi, Mehdi Naderi, Milad Malekzadeh

    Abstract: This paper focuses on sensor fault detection and compensation for robotic manipulators. The proposed method features a new adaptive observer and a new terminal sliding mode control law established on a second-order integral sliding surface. The method enables sensor fault detection without the need to know the bounds on fault value and/or its derivative. It also enables fast and fixed-time fault-t… ▽ More

    Submitted 18 March, 2024; v1 submitted 30 May, 2023; originally announced May 2023.

  14. arXiv:2305.16556  [pdf, other

    cs.LG cs.AI

    LANISTR: Multimodal Learning from Structured and Unstructured Data

    Authors: Sayna Ebrahimi, Sercan O. Arik, Yihe Dong, Tomas Pfister

    Abstract: Multimodal large-scale pretraining has shown impressive performance for unstructured data such as language and image. However, a prevalent real-world scenario involves structured data types, tabular and time-series, along with unstructured data. Such scenarios have been understudied. To bridge this gap, we propose LANISTR, an attention-based framework to learn from LANguage, Image, and STRuctured… ▽ More

    Submitted 24 April, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

  15. arXiv:2304.03870  [pdf, other

    cs.LG

    ASPEST: Bridging the Gap Between Active Learning and Selective Prediction

    Authors: Jiefeng Chen, Jinsung Yoon, Sayna Ebrahimi, Sercan Arik, Somesh Jha, Tomas Pfister

    Abstract: Selective prediction aims to learn a reliable model that abstains from making predictions when uncertain. These predictions can then be deferred to humans for further evaluation. As an everlasting challenge for machine learning, in many real-world scenarios, the distribution of test data is different from the training data. This results in more inaccurate predictions, and often increased dependenc… ▽ More

    Submitted 29 February, 2024; v1 submitted 7 April, 2023; originally announced April 2023.

  16. arXiv:2211.15646  [pdf, other

    stat.ML cs.CV cs.LG

    Beyond Invariance: Test-Time Label-Shift Adaptation for Distributions with "Spurious" Correlations

    Authors: Qingyao Sun, Kevin Murphy, Sayna Ebrahimi, Alexander D'Amour

    Abstract: Changes in the data distribution at test time can have deleterious effects on the performance of predictive models $p(y|x)$. We consider situations where there are additional meta-data labels (such as group labels), denoted by $z$, that can account for such changes in the distribution. In particular, we assume that the prior distribution $p(y, z)$, which models the dependence between the class lab… ▽ More

    Submitted 28 November, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

    Comments: 24 pages, 7 figures

  17. arXiv:2207.07704  [pdf, other

    cs.SI

    Maximizing Fair Content Spread via Edge Suggestion in Social Networks

    Authors: Ian P. Swift, Sana Ebrahimi, Azade Nova, Abolfazl Asudeh

    Abstract: Content spread inequity is a potential unfairness issue in online social networks, disparately impacting minority groups. In this paper, we view friendship suggestion, a common feature in social network platforms, as an opportunity to achieve an equitable spread of content. In particular, we propose to suggest a subset of potential edges (currently not existing in the network but likely to be acce… ▽ More

    Submitted 20 December, 2022; v1 submitted 15 July, 2022; originally announced July 2022.

    Comments: 16 pages, 17 figures, 8 tables. VLDB '22. Technical Report

  18. arXiv:2206.07240  [pdf, other

    cs.CV cs.AI cs.LG

    Test-Time Adaptation for Visual Document Understanding

    Authors: Sayna Ebrahimi, Sercan O. Arik, Tomas Pfister

    Abstract: For visual document understanding (VDU), self-supervised pretraining has been shown to successfully generate transferable representations, yet, effective adaptation of such representations to distribution shifts at test-time remains to be an unexplored area. We propose DocTTA, a novel test-time adaptation method for documents, that does source-free domain adaptation using unlabeled target document… ▽ More

    Submitted 23 August, 2023; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: Accepted at TMLR 2023

  19. arXiv:2204.10377  [pdf, other

    cs.CV

    Contrastive Test-Time Adaptation

    Authors: Dian Chen, Dequan Wang, Trevor Darrell, Sayna Ebrahimi

    Abstract: Test-time adaptation is a special setting of unsupervised domain adaptation where a trained model on the source domain has to adapt to the target domain without accessing source data. We propose a novel way to leverage self-supervised contrastive learning to facilitate target feature learning, along with an online pseudo labeling scheme with refinement that significantly denoises pseudo labels. Th… ▽ More

    Submitted 21 April, 2022; originally announced April 2022.

    Comments: CVPR 2022 camera-ready version

  20. arXiv:2204.04799  [pdf, other

    cs.LG cs.CV

    DualPrompt: Complementary Prompting for Rehearsal-free Continual Learning

    Authors: Zifeng Wang, Zizhao Zhang, Sayna Ebrahimi, Ruoxi Sun, Han Zhang, Chen-Yu Lee, Xiaoqi Ren, Guolong Su, Vincent Perot, Jennifer Dy, Tomas Pfister

    Abstract: Continual learning aims to enable a single model to learn a sequence of tasks without catastrophic forgetting. Top-performing methods usually require a rehearsal buffer to store past pristine examples for experience replay, which, however, limits their practical value due to privacy and memory constraints. In this work, we present a simple yet effective framework, DualPrompt, which learns a tiny s… ▽ More

    Submitted 5 August, 2022; v1 submitted 10 April, 2022; originally announced April 2022.

    Comments: Published at ECCV 2022 as a conference paper

  21. RC-RNN: Reconfigurable Cache Architecture for Storage Systems Using Recurrent Neural Networks

    Authors: Shahriar Ebrahimi, Reza Salkhordeh, Seyed Ali Osia, Ali Taheri, Hamid Reza Rabiee, Hossein Asadi

    Abstract: Solid-State Drives (SSDs) have significant performance advantages over traditional Hard Disk Drives (HDDs) such as lower latency and higher throughput. Significantly higher price per capacity and limited lifetime, however, prevents designers to completely substitute HDDs by SSDs in enterprise storage systems. SSD-based caching has recently been suggested for storage systems to benefit from higher… ▽ More

    Submitted 5 November, 2021; originally announced November 2021.

    Comments: Date of Publication: 09 August 2021

    Journal ref: IEEE Transactions on Emerging Topics in Computing (2021)

  22. arXiv:2110.00274  [pdf, other

    cs.CR

    Enhancing Cold Wallet Security with Native Multi-Signature schemes in Centralized Exchanges

    Authors: Shahriar Ebrahimi, Parisa Hasanizadeh, Seyed Mohammad Aghamirmohammadali, Amirali Akbari

    Abstract: Currently, one of the most widely used protocols to secure cryptocurrency assets in centralized exchanges is categorizing wallets into cold and hot. While cold wallets hold user deposits, hot} wallets are responsible for addressing withdrawal requests. However, this method has some shortcomings such as: 1) availability of private keys in at least one cold device, and~2) exposure of all private key… ▽ More

    Submitted 1 October, 2021; originally announced October 2021.

    Comments: Nobitex Crypto-Exchange: https://www.nobitex.net, Available Online at: https://cdn.nobitex.net/security/nobitex-security-whitepaper.pdf

  23. arXiv:2109.01087  [pdf, other

    cs.CV cs.AI cs.LG

    On-target Adaptation

    Authors: Dequan Wang, Shaoteng Liu, Sayna Ebrahimi, Evan Shelhamer, Trevor Darrell

    Abstract: Domain adaptation seeks to mitigate the shift between training on the \emph{source} domain and testing on the \emph{target} domain. Most adaptation methods rely on the source data by joint optimization over source data and target data. Source-free methods replace the source data with a source model by fine-tuning it on target. Either way, the majority of the parameter updates for the model represe… ▽ More

    Submitted 2 September, 2021; originally announced September 2021.

  24. arXiv:2108.09186  [pdf, other

    cs.CV

    Region-level Active Detector Learning

    Authors: Michael Laielli, Giscard Biamby, Dian Chen, Ritwik Gupta, Adam Loeffler, Phat Dat Nguyen, Ross Luo, Trevor Darrell, Sayna Ebrahimi

    Abstract: Active learning for object detection is conventionally achieved by applying techniques developed for classification in a way that aggregates individual detections into image-level selection criteria. This is typically coupled with the costly assumption that every image selected for labelling must be exhaustively annotated. This yields incremental improvements on well-curated vision datasets and st… ▽ More

    Submitted 17 January, 2022; v1 submitted 20 August, 2021; originally announced August 2021.

  25. arXiv:2107.03315  [pdf, other

    cs.LG cs.CV stat.ML

    Predicting with Confidence on Unseen Distributions

    Authors: Devin Guillory, Vaishaal Shankar, Sayna Ebrahimi, Trevor Darrell, Ludwig Schmidt

    Abstract: Recent work has shown that the performance of machine learning models can vary substantially when models are evaluated on data drawn from a distribution that is close to but different from the training distribution. As a result, predicting model performance on unseen distributions is an important challenge. Our work connects techniques from domain adaptation and predictive uncertainty literature,… ▽ More

    Submitted 19 August, 2021; v1 submitted 7 July, 2021; originally announced July 2021.

    Comments: ICCV Camera ready; new scatter plots in supplementary material

    ACM Class: I.2.10

  26. arXiv:2103.12718  [pdf, other

    cs.CV

    Self-Supervised Pretraining Improves Self-Supervised Pretraining

    Authors: Colorado J. Reed, Xiangyu Yue, Ani Nrusimha, Sayna Ebrahimi, Vivek Vijaykumar, Richard Mao, Bo Li, Shanghang Zhang, Devin Guillory, Sean Metzger, Kurt Keutzer, Trevor Darrell

    Abstract: While self-supervised pretraining has proven beneficial for many computer vision tasks, it requires expensive and lengthy computation, large amounts of data, and is sensitive to data augmentation. Prior work demonstrates that models pretrained on datasets dissimilar to their target data, such as chest X-ray models trained on ImageNet, underperform models trained from scratch. Users that lack the r… ▽ More

    Submitted 24 March, 2021; v1 submitted 23 March, 2021; originally announced March 2021.

  27. arXiv:2012.10467  [pdf, other

    cs.CV cs.AI cs.LG

    Minimax Active Learning

    Authors: Sayna Ebrahimi, William Gan, Dian Chen, Giscard Biamby, Kamyar Salahi, Michael Laielli, Shizhan Zhu, Trevor Darrell

    Abstract: Active learning aims to develop label-efficient algorithms by querying the most representative samples to be labeled by a human annotator. Current active learning techniques either rely on model uncertainty to select the most uncertain samples or use clustering or reconstruction to choose the most diverse set of unlabeled examples. While uncertainty-based strategies are susceptible to outliers, so… ▽ More

    Submitted 30 March, 2021; v1 submitted 18 December, 2020; originally announced December 2020.

    Comments: Project page is available at https://people.eecs.berkeley.edu/~sayna/mal.html

  28. arXiv:2010.01528  [pdf, other

    cs.CV cs.AI cs.LG

    Remembering for the Right Reasons: Explanations Reduce Catastrophic Forgetting

    Authors: Sayna Ebrahimi, Suzanne Petryk, Akash Gokul, William Gan, Joseph E. Gonzalez, Marcus Rohrbach, Trevor Darrell

    Abstract: The goal of continual learning (CL) is to learn a sequence of tasks without suffering from the phenomenon of catastrophic forgetting. Previous work has shown that leveraging memory in the form of a replay buffer can reduce performance degradation on prior tasks. We hypothesize that forgetting can be further reduced when the model is encouraged to remember the \textit{evidence} for previously made… ▽ More

    Submitted 2 May, 2021; v1 submitted 4 October, 2020; originally announced October 2020.

    Comments: Accepted at ICLR 2021

  29. arXiv:2003.09553  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Adversarial Continual Learning

    Authors: Sayna Ebrahimi, Franziska Meier, Roberto Calandra, Trevor Darrell, Marcus Rohrbach

    Abstract: Continual learning aims to learn new tasks without forgetting previously learned ones. We hypothesize that representations learned to solve each task in a sequence have a shared structure while containing some task-specific properties. We show that shared features are significantly less prone to forgetting and propose a novel hybrid continual learning framework that learns a disjoint representatio… ▽ More

    Submitted 21 July, 2020; v1 submitted 20 March, 2020; originally announced March 2020.

    Comments: Accepted at ECCV 2020

  30. arXiv:2002.08984  [pdf, other

    eess.SP cs.NI

    E2E Migration Strategies Towards 5G: Long-term Migration Plan and Evolution Roadmap

    Authors: Abulfazl Zakeri, Narges Gholipoor, Mohsen Tajallifar, Sina Ebrahimi, Mohammad Reza Javan, Nader Mokari, Ahmad Reza Sharafat

    Abstract: After freezing the first phase of the fifth generation of wireless networks (5G) standardization, it finally goes live now and the rollout of the commercial launch (most in fixed 5G broadband services) and migration has been started. However, some challenges are arising in the deployment, integration of each technology, and the interoperability in the network of the communication service providers… ▽ More

    Submitted 20 February, 2020; originally announced February 2020.

    Comments: Migration, 5G, evolution, roadmap, option, path, 3 Figure, 4Table

  31. arXiv:1912.00192  [pdf, other

    cs.NI eess.SY math.OC

    Joint Resource and Admission Management for Slice-enabled Networks

    Authors: Sina Ebrahimi, Abulfazl Zakeri, Behzad Akbari, Nader Mokari

    Abstract: Network slicing is a crucial part of the 5G networks that communication service providers (CSPs) seek to deploy. By exploiting three main enabling technologies, namely, software-defined networking (SDN), network function virtualization (NFV), and network slicing, communication services can be served to the end-users in an efficient, scalable, and flexible manner. To adopt these technologies, what… ▽ More

    Submitted 7 December, 2019; v1 submitted 30 November, 2019; originally announced December 2019.

    Comments: 8 pages, double column, 8 figures, accepted to be presented in IEEE/IFIP Network Operations and Management Symposium (NOMS) 2020

  32. arXiv:1912.00187  [pdf, other

    cs.NI math.OC

    Energy-Efficient Task Offloading Under E2E Latency Constraints

    Authors: Mohsen Tajallifar, Sina Ebrahimi, Mohammad Reza Javan, Nader Mokari, Luca Chiaraviglio

    Abstract: In this paper, we propose a novel resource management scheme that jointly allocates the transmit power and computational resources in a centralized radio access network architecture. The network comprises a set of computing nodes to which the requested tasks of different users are offloaded. The optimization problem minimizes the energy consumption of task offloading while takes the end-to-end lat… ▽ More

    Submitted 23 June, 2021; v1 submitted 30 November, 2019; originally announced December 2019.

    Comments: 32 pages, 10 figures

  33. arXiv:1909.10225  [pdf, other

    cs.CV

    WiCV 2019: The Sixth Women In Computer Vision Workshop

    Authors: Irene Amerini, Elena Balashova, Sayna Ebrahimi, Kathryn Leonard, Arsha Nagrani, Amaia Salvador

    Abstract: In this paper we present the Women in Computer Vision Workshop - WiCV 2019, organized in conjunction with CVPR 2019. This event is meant for increasing the visibility and inclusion of women researchers in the computer vision field. Computer vision and machine learning have made incredible progress over the past years, but the number of female researchers is still low both in academia and in indust… ▽ More

    Submitted 23 September, 2019; originally announced September 2019.

    Comments: Report of the Sixth Women In Computer Vision Workshop

    Journal ref: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2019, pp. 0-0

  34. arXiv:1906.02425  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Uncertainty-guided Continual Learning with Bayesian Neural Networks

    Authors: Sayna Ebrahimi, Mohamed Elhoseiny, Trevor Darrell, Marcus Rohrbach

    Abstract: Continual learning aims to learn new tasks without forgetting previously learned ones. This is especially challenging when one cannot access data from previous tasks and when the model has a fixed capacity. Current regularization-based continual learning algorithms need an external representation and extra computation to measure the parameters' \textit{importance}. In contrast, we propose Uncertai… ▽ More

    Submitted 19 February, 2020; v1 submitted 6 June, 2019; originally announced June 2019.

    Comments: Accepted at ICLR 2020

  35. arXiv:1904.00370  [pdf, other

    cs.LG cs.CV stat.ML

    Variational Adversarial Active Learning

    Authors: Samarth Sinha, Sayna Ebrahimi, Trevor Darrell

    Abstract: Active learning aims to develop label-efficient algorithms by sampling the most representative queries to be labeled by an oracle. We describe a pool-based semi-supervised active learning algorithm that implicitly learns this sampling mechanism in an adversarial manner. Unlike conventional active learning algorithms, our approach is task agnostic, i.e., it does not depend on the performance of the… ▽ More

    Submitted 28 October, 2019; v1 submitted 31 March, 2019; originally announced April 2019.

    Comments: First two authors contributed equally, listed alphabetically. Accepted as Oral at ICCV 2019

  36. arXiv:1812.10430  [pdf, other

    stat.ML cs.LG

    Large Multistream Data Analytics for Monitoring and Diagnostics in Manufacturing Systems

    Authors: Samaneh Ebrahimi, Chitta Ranjan, Kamran Paynabar

    Abstract: The high-dimensionality and volume of large scale multistream data has inhibited significant research progress in developing an integrated monitoring and diagnostics (M&D) approach. This data, also categorized as big data, is becoming common in manufacturing plants. In this paper, we propose an integrated M\&D approach for large scale streaming data. We developed a novel monitoring method named Ad… ▽ More

    Submitted 26 December, 2018; originally announced December 2018.

  37. arXiv:1812.01784  [pdf, other

    cs.CV cs.AI cs.LG

    Generalized Zero- and Few-Shot Learning via Aligned Variational Autoencoders

    Authors: Edgar Schönfeld, Sayna Ebrahimi, Samarth Sinha, Trevor Darrell, Zeynep Akata

    Abstract: Many approaches in generalized zero-shot learning rely on cross-modal mapping between the image feature space and the class embedding space. As labeled images are expensive, one direction is to augment the dataset by generating either images or image features. However, the former misses fine-grained details and the latter requires learning a mapping associated with class embeddings. In this work,… ▽ More

    Submitted 5 April, 2019; v1 submitted 4 December, 2018; originally announced December 2018.

    Comments: Accepted at CVPR 2019

  38. arXiv:1807.07560  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Compositional GAN: Learning Image-Conditional Binary Composition

    Authors: Samaneh Azadi, Deepak Pathak, Sayna Ebrahimi, Trevor Darrell

    Abstract: Generative Adversarial Networks (GANs) can produce images of remarkable complexity and realism but are generally structured to sample from a single latent source ignoring the explicit spatial interaction between multiple entities that could be present in a scene. Capturing such complex interactions between different objects in the world, including their relative scaling, spatial layout, occlusion,… ▽ More

    Submitted 28 March, 2019; v1 submitted 19 July, 2018; originally announced July 2018.

  39. arXiv:1806.07912  [pdf, other

    cs.NE cs.AI

    Resource-Efficient Neural Architect

    Authors: Yanqi Zhou, Siavash Ebrahimi, Sercan Ö. Arık, Haonan Yu, Hairong Liu, Greg Diamos

    Abstract: Neural Architecture Search (NAS) is a laborious process. Prior work on automated NAS targets mainly on improving accuracy, but lacks consideration of computational resource use. We propose the Resource-Efficient Neural Architect (RENA), an efficient resource-constrained NAS using reinforcement learning with network embedding. RENA uses a policy network to process the network embeddings to generate… ▽ More

    Submitted 12 June, 2018; originally announced June 2018.

  40. ReCA: an Efficient Reconfigurable Cache Architecture for Storage Systems with Online Workload Characterization

    Authors: Reza Salkhordeh, Shahriar Ebrahimi, Hossein Asadi

    Abstract: In recent years, SSDs have gained tremendous attention in computing and storage systems due to significant performance improvement over HDDs. The cost per capacity of SSDs, however, prevents them from entirely replacing HDDs in such systems. One approach to effectively take advantage of SSDs is to use them as a caching layer to store performance critical data blocks to reduce the number of accesse… ▽ More

    Submitted 3 May, 2018; originally announced May 2018.

    Journal ref: IEEE TPDS 2018

  41. arXiv:1805.00325  [pdf, other

    cs.CV

    Study of Residual Networks for Image Recognition

    Authors: Mohammad Sadegh Ebrahimi, Hossein Karkeh Abadi

    Abstract: Deep neural networks demonstrate to have a high performance on image classification tasks while being more difficult to train. Due to the complexity and vanishing gradient problem, it normally takes a lot of time and more computational power to train deeper neural networks. Deep residual networks (ResNets) can make the training process faster and attain more accuracy compared to their equivalent n… ▽ More

    Submitted 21 April, 2018; originally announced May 2018.

    Comments: 6 pages, 9 figures

  42. arXiv:1804.08044  [pdf

    cs.SI

    Predicting User Performance and Bitcoin Price Using Block Chain Transaction Network

    Authors: Mohammad Sadegh Ebrahimi, Afshin Babveyh

    Abstract: This work is organized as follows. In the first section we review the prior work and we have obtained our data. Next, we will look at address reuse in the Bitcoin network. We show that a great portion of users reuse their addresses which could enable us to cluster the addresses and attribute them to single users. Next, we will categorize the nodes based on their role in the network as a customer o… ▽ More

    Submitted 21 April, 2018; originally announced April 2018.

    Comments: 8 pages, 7 figures

  43. arXiv:1802.03319  [pdf, other

    stat.ML cs.SD eess.AS

    Predicting Audio Advertisement Quality

    Authors: Samaneh Ebrahimi, Hossein Vahabi, Matthew Prockup, Oriol Nieto

    Abstract: Online audio advertising is a particular form of advertising used abundantly in online music streaming services. In these platforms, which tend to host tens of thousands of unique audio advertisements (ads), providing high quality ads ensures a better user experience and results in longer user engagement. Therefore, the automatic assessment of these ads is an important step toward audio ads rankin… ▽ More

    Submitted 9 February, 2018; originally announced February 2018.

    Comments: WSDM '18 Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining, 9 pages

    Journal ref: 2018. In Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining (WSDM '18)

  44. arXiv:1710.05958  [pdf, other

    cs.LG cs.AI cs.CV

    Gradient-free Policy Architecture Search and Adaptation

    Authors: Sayna Ebrahimi, Anna Rohrbach, Trevor Darrell

    Abstract: We develop a method for policy architecture search and adaptation via gradient-free optimization which can learn to perform autonomous driving tasks. By learning from both demonstration and environmental reward we develop a model that can learn with relatively few early catastrophic failures. We first learn an architecture of appropriate complexity to perceive aspects of world state relevant to th… ▽ More

    Submitted 16 October, 2017; originally announced October 2017.

    Comments: Accepted in Conference on Robot Learning, 2017

  45. arXiv:1704.03396  [pdf, ps, other

    cs.AI cs.LO

    Source-Sensitive Belief Change

    Authors: Shahab Ebrahimi

    Abstract: The AGM model is the most remarkable framework for modeling belief revision. However, it is not perfect in all aspects. Paraconsistent belief revision, multi-agent belief revision and non-prioritized belief revision are three different extensions to AGM to address three important criticisms applied to it. In this article, we propose a framework based on AGM that takes a position in each of these c… ▽ More

    Submitted 5 May, 2017; v1 submitted 11 April, 2017; originally announced April 2017.

    Comments: 13 pages

    Journal ref: International Journal of Artificial Intelligence and Applications (IJAIA), Vol.8, No.2, March 2017

  46. arXiv:1608.03533  [pdf, other

    stat.ML cs.LG

    Sequence Graph Transform (SGT): A Feature Embedding Function for Sequence Data Mining

    Authors: Chitta Ranjan, Samaneh Ebrahimi, Kamran Paynabar

    Abstract: Sequence feature embedding is a challenging task due to the unstructuredness of sequence, i.e., arbitrary strings of arbitrary length. Existing methods are efficient in extracting short-term dependencies but typically suffer from computation issues for the long-term. Sequence Graph Transform (SGT), a feature embedding function, that can extract a varying amount of short- to long-term dependencies… ▽ More

    Submitted 4 October, 2021; v1 submitted 11 August, 2016; originally announced August 2016.