Showing 1–17 of 17 results for author: Bello, I

  1. arXiv:2407.07603  [pdf]

    cs.CV

    iiANET: Inception Inspired Attention Hybrid Network for efficient Long-Range Dependency

    Authors: Haruna Yunusa, Qin Shiyin, Abdulrahman Hamman Adama Chukkol, Isah Bello, Adamu Lawan

    Abstract: The recent emergence of hybrid models has introduced another transformative approach to solving computer vision tasks, slowly shifting away from conventional CNN (Convolutional Neural Network) and ViT (Vision Transformer). However, not enough effort has been made to efficiently combine these two approaches to improve capturing long-range dependencies prevalent in complex images. In this paper, we…

    Submitted 10 July, 2024; originally announced July 2024.

  2. arXiv:2402.16291  [pdf]

    cs.CV

    mAPm: multi-scale Attention Pyramid module for Enhanced scale-variation in RLD detection

    Authors: Yunusa Haruna, Shiyin Qin, Abdulrahman Hamman Adama Chukkol, Isah Bello, Adamu Lawan

    Abstract: Detecting objects across various scales remains a significant challenge in computer vision, particularly in tasks such as Rice Leaf Disease (RLD) detection, where objects exhibit considerable scale variations. Traditional object detection methods often struggle to address these variations, resulting in missed detections or reduced accuracy. In this study, we propose the multi-scale Attention Pyram…

    Submitted 25 February, 2024; originally announced February 2024.

  3. arXiv:2402.09939  [pdf]

    cs.AI cs.CL cs.HC cs.IR cs.LG

    Generative AI in the Construction Industry: A State-of-the-art Analysis

    Authors: Ridwan Taiwo, Idris Temitope Bello, Sulemana Fatoama Abdulai, Abdul-Mugis Yussif, Babatunde Abiodun Salami, Abdullahi Saka, Tarek Zayed

    Abstract: The construction industry is a vital sector of the global economy, but it faces many productivity challenges in various processes, such as design, planning, procurement, inspection, and maintenance. Generative artificial intelligence (AI), which can create novel and realistic data or content, such as text, image, video, or code, based on some input or prior knowledge, offers innovative and disrupt…

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: 74 pages, 11 figures, 20 tables

  4. arXiv:2402.02941  [pdf]

    cs.CV cs.LG

    Exploring the Synergies of Hybrid CNNs and ViTs Architectures for Computer Vision: A survey

    Authors: Haruna Yunusa, Shiyin Qin, Abdulrahman Hamman Adama Chukkol, Abdulganiyu Abdu Yusuf, Isah Bello, Adamu Lawan

    Abstract: The hybrid of Convolutional Neural Network (CNN) and Vision Transformers (ViT) architectures has emerged as a groundbreaking approach, pushing the boundaries of computer vision (CV). This comprehensive review provides a thorough examination of the literature on state-of-the-art hybrid CNN-ViT architectures, exploring the synergies between these two approaches. The main content of this survey inclu…

    Submitted 5 February, 2024; originally announced February 2024.

  5. arXiv:2303.08774  [pdf, other]

    cs.CL cs.AI

    GPT-4 Technical Report

    Authors: OpenAI, Josh Achiam, Steven Adler, Sandhini Agarwal, Lama Ahmad, Ilge Akkaya, Florencia Leoni Aleman, Diogo Almeida, Janko Altenschmidt, Sam Altman, Shyamal Anadkat, Red Avila, Igor Babuschkin, Suchir Balaji, Valerie Balcom, Paul Baltescu, Haiming Bao, Mohammad Bavarian, Jeff Belgum, Irwan Bello, Jake Berdine, Gabriel Bernadett-Shapiro, Christopher Berner, Lenny Bogdonoff, Oleg Boiko, et al. (256 additional authors not shown)

    Abstract: We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers. GPT-4 is a Transformer-based mo…

    Submitted 4 March, 2024; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: 100 pages; updated authors list; fixed author names and added citation

  6. arXiv:2211.05100  [pdf, other]

    cs.CL

    BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

    Authors: BigScience Workshop: Teven Le Scao, Angela Fan, Christopher Akiki, Ellie Pavlick, Suzana Ilić, Daniel Hesslow, Roman Castagné, Alexandra Sasha Luccioni, François Yvon, Matthias Gallé, Jonathan Tow, Alexander M. Rush, Stella Biderman, Albert Webson, Pawan Sasanka Ammanamanchi, Thomas Wang, Benoît Sagot, Niklas Muennighoff, Albert Villanova del Moral, Olatunji Ruwase, Rachel Bawden, Stas Bekman, Angelina McMillan-Major, et al. (369 additional authors not shown)

    Abstract: Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access…

    Submitted 27 June, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

  7. arXiv:2202.08906  [pdf, other]

    cs.CL cs.LG

    ST-MoE: Designing Stable and Transferable Sparse Expert Models

    Authors: Barret Zoph, Irwan Bello, Sameer Kumar, Nan Du, Yanping Huang, Jeff Dean, Noam Shazeer, William Fedus

    Abstract: Scale has opened new frontiers in natural language processing -- but at a high cost. In response, Mixture-of-Experts (MoE) and Switch Transformers have been proposed as an energy efficient path to even larger and more capable language models. But advancing the state-of-the-art across a broad set of natural language tasks has been hindered by training instabilities and uncertain quality during fine…

    Submitted 29 April, 2022; v1 submitted 17 February, 2022; originally announced February 2022.

    Comments: 25 pages main text, 39 pages overall

  8. arXiv:2109.01696  [pdf, other]

    cs.CV cs.LG eess.IV

    Revisiting 3D ResNets for Video Recognition

    Authors: Xianzhi Du, Yeqing Li, Yin Cui, Rui Qian, Jing Li, Irwan Bello

    Abstract: A recent work from Bello shows that training and scaling strategies may be more significant than model architectures for visual recognition. This short note studies effective training and scaling strategies for video recognition models. We propose a simple scaling strategy for 3D ResNets, in combination with improved training strategies and minor architectural changes. The resulting models, termed…

    Submitted 3 September, 2021; originally announced September 2021.

    Comments: 6 pages

  9. arXiv:2103.07579  [pdf, other]

    cs.CV

    Revisiting ResNets: Improved Training and Scaling Strategies

    Authors: Irwan Bello, William Fedus, Xianzhi Du, Ekin D. Cubuk, Aravind Srinivas, Tsung-Yi Lin, Jonathon Shlens, Barret Zoph

    Abstract: Novel computer vision architectures monopolize the spotlight, but the impact of the model architecture is often conflated with simultaneous changes to training methodology and scaling strategies. Our work revisits the canonical ResNet (He et al., 2015) and studies these three aspects in an effort to disentangle them. Perhaps surprisingly, we find that training and scaling strategies may matter mor…

    Submitted 12 March, 2021; originally announced March 2021.

  10. arXiv:2102.08602  [pdf, other]

    cs.CV cs.LG

    LambdaNetworks: Modeling Long-Range Interactions Without Attention

    Authors: Irwan Bello

    Abstract: We present lambda layers -- an alternative framework to self-attention -- for capturing long-range interactions between an input and structured contextual information (e.g. a pixel surrounded by other pixels). Lambda layers capture such interactions by transforming available contexts into linear functions, termed lambdas, and applying these linear functions to each input separately. Similar to lin…

    Submitted 17 February, 2021; originally announced February 2021.

    Comments: Accepted for publication at the International Conference on Learning Representations 2021 (Spotlight)

  11. arXiv:2010.03019  [pdf, other]

    cs.CV cs.LG

    Global Self-Attention Networks for Image Recognition

    Authors: Zhuoran Shen, Irwan Bello, Raviteja Vemulapalli, Xuhui Jia, Ching-Hui Chen

    Abstract: Recently, a series of works in computer vision have shown promising results on various image and video understanding tasks using self-attention. However, due to the quadratic computational and memory complexities of self-attention, these works either apply attention only to low-resolution feature maps in later stages of a deep network or restrict the receptive field of attention in each layer to a…

    Submitted 14 October, 2020; v1 submitted 6 October, 2020; originally announced October 2020.

  12. arXiv:1906.05909  [pdf, other]

    cs.CV

    Stand-Alone Self-Attention in Vision Models

    Authors: Prajit Ramachandran, Niki Parmar, Ashish Vaswani, Irwan Bello, Anselm Levskaya, Jonathon Shlens

    Abstract: Convolutions are a fundamental building block of modern computer vision systems. Recent approaches have argued for going beyond convolutions in order to capture long-range dependencies. These efforts focus on augmenting convolutional models with content-based interactions, such as self-attention and non-local means, to achieve gains on a number of vision tasks. The natural question that arises is…

    Submitted 13 June, 2019; originally announced June 2019.

  13. arXiv:1904.09925  [pdf, other]

    cs.CV

    Attention Augmented Convolutional Networks

    Authors: Irwan Bello, Barret Zoph, Ashish Vaswani, Jonathon Shlens, Quoc V. Le

    Abstract: Convolutional networks have been the paradigm of choice in many computer vision applications. The convolution operation however has a significant weakness in that it only operates on a local neighborhood, thus missing global information. Self-attention, on the other hand, has emerged as a recent advance to capture long range interactions, but has mostly been applied to sequence modeling and genera…

    Submitted 9 September, 2020; v1 submitted 22 April, 2019; originally announced April 2019.

    Comments: ICCV 2019

  14. arXiv:1810.02019  [pdf, other]

    cs.IR cs.LG stat.ML

    Seq2Slate: Re-ranking and Slate Optimization with RNNs

    Authors: Irwan Bello, Sayali Kulkarni, Sagar Jain, Craig Boutilier, Ed Chi, Elad Eban, Xiyang Luo, Alan Mackey, Ofer Meshi

    Abstract: Ranking is a central task in machine learning and information retrieval. In this task, it is especially important to present the user with a slate of items that is appealing as a whole. This in turn requires taking into account interactions between items, since intuitively, placing an item on the slate affects the decision of which other items should be placed alongside it. In this work, we propos…

    Submitted 19 March, 2019; v1 submitted 3 October, 2018; originally announced October 2018.

  15. arXiv:1808.02822  [pdf, other]

    cs.NE cs.LG stat.ML

    Backprop Evolution

    Authors: Maximilian Alber, Irwan Bello, Barret Zoph, Pieter-Jan Kindermans, Prajit Ramachandran, Quoc Le

    Abstract: The back-propagation algorithm is the cornerstone of deep learning. Despite its importance, few variations of the algorithm have been attempted. This work presents an approach to discover new variations of the back-propagation equation. We use a domain specific language to describe update equations as a list of primitive functions. An evolution-based method is used to discover new propagation ru…

    Submitted 8 August, 2018; originally announced August 2018.

  16. arXiv:1709.07417  [pdf, other]

    cs.AI cs.LG stat.ML

    Neural Optimizer Search with Reinforcement Learning

    Authors: Irwan Bello, Barret Zoph, Vijay Vasudevan, Quoc V. Le

    Abstract: We present an approach to automate the process of discovering optimization methods, with a focus on deep learning architectures. We train a Recurrent Neural Network controller to generate a string in a domain specific language that describes a mathematical update equation based on a list of primitive functions, such as the gradient, running average of the gradient, etc. The controller is trained w…

    Submitted 22 September, 2017; v1 submitted 21 September, 2017; originally announced September 2017.

    Comments: ICML 2017 Conference paper

  17. arXiv:1611.09940  [pdf, ps, other]

    cs.AI cs.LG stat.ML

    Neural Combinatorial Optimization with Reinforcement Learning

    Authors: Irwan Bello, Hieu Pham, Quoc V. Le, Mohammad Norouzi, Samy Bengio

    Abstract: This paper presents a framework to tackle combinatorial optimization problems using neural networks and reinforcement learning. We focus on the traveling salesman problem (TSP) and train a recurrent network that, given a set of city coordinates, predicts a distribution over different city permutations. Using negative tour length as the reward signal, we optimize the parameters of the recurrent net…

    Submitted 12 January, 2017; v1 submitted 29 November, 2016; originally announced November 2016.

    Comments: Under review as a conference paper at ICLR 2017