Skip to main content

Showing 1–12 of 12 results for author: Hayes, M

  1. arXiv:2406.09203  [pdf, other

    cs.CV

    Optimizing Visual Question Answering Models for Driving: Bridging the Gap Between Human and Machine Attention Patterns

    Authors: Kaavya Rekanar, Martin Hayes, Ganesh Sistu, Ciaran Eising

    Abstract: Visual Question Answering (VQA) models play a critical role in enhancing the perception capabilities of autonomous driving systems by allowing vehicles to analyze visual inputs alongside textual queries, fostering natural interaction and trust between the vehicle and its occupants or other road users. This study investigates the attention patterns of humans compared to a VQA model when answering d… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  2. arXiv:2405.11422  [pdf, other

    cs.CL cs.AI cs.LG

    Large Language Models are Biased Reinforcement Learners

    Authors: William M. Hayes, Nicolas Yax, Stefano Palminteri

    Abstract: In-context learning enables large language models (LLMs) to perform a variety of tasks, including learning to make reward-maximizing choices in simple bandit tasks. Given their potential use as (autonomous) decision-making agents, it is important to understand how these models perform such reinforcement learning (RL) tasks and the extent to which they are susceptible to biases. Motivated by the fa… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

  3. arXiv:2401.14530  [pdf

    cs.CL cs.AI cs.LG

    Relative Value Biases in Large Language Models

    Authors: William M. Hayes, Nicolas Yax, Stefano Palminteri

    Abstract: Studies of reinforcement learning in humans and animals have demonstrated a preference for options that yielded relatively better outcomes in the past, even when those options are associated with lower absolute reward. The present study tested whether large language models would exhibit a similar bias. We had gpt-4-1106-preview (GPT-4 Turbo) and Llama-2-70B make repeated choices between pairs of o… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

  4. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  5. arXiv:2307.09329  [pdf, other

    cs.CV

    Towards a performance analysis on pre-trained Visual Question Answering models for autonomous driving

    Authors: Kaavya Rekanar, Ciarán Eising, Ganesh Sistu, Martin Hayes

    Abstract: This short paper presents a preliminary analysis of three popular Visual Question Answering (VQA) models, namely ViLBERT, ViLT, and LXMERT, in the context of answering questions relating to driving scenarios. The performance of these models is evaluated by comparing the similarity of responses to reference answers provided by computer vision experts. Model selection is predicated on the analysis o… ▽ More

    Submitted 28 July, 2023; v1 submitted 18 July, 2023; originally announced July 2023.

    Journal ref: Proceedings of the Irish Machine Vision and Image Processing Conference 2023

  6. arXiv:2303.16867  [pdf, other

    cs.CV

    A Video-based End-to-end Pipeline for Non-nutritive Sucking Action Recognition and Segmentation in Young Infants

    Authors: Shaotong Zhu, Michael Wan, Elaheh Hatamimajoumerd, Kashish Jain, Samuel Zlota, Cholpady Vikram Kamath, Cassandra B. Rowan, Emma C. Grace, Matthew S. Goodwin, Marie J. Hayes, Rebecca A. Schwartz-Mette, Emily Zimmerman, Sarah Ostadabbas

    Abstract: We present an end-to-end computer vision pipeline to detect non-nutritive sucking (NNS) -- an infant sucking pattern with no nutrition delivered -- as a potential biomarker for developmental delays, using off-the-shelf baby monitor video footage. One barrier to clinical (or algorithmic) assessment of NNS stems from its sparsity, requiring experts to wade through hours of footage to find minutes of… ▽ More

    Submitted 29 March, 2023; originally announced March 2023.

  7. arXiv:2212.04001  [pdf, other

    cs.CL cs.LG

    TweetDrought: A Deep-Learning Drought Impacts Recognizer based on Twitter Data

    Authors: Beichen Zhang, Frank Schilder, Kelly Helm Smith, Michael J. Hayes, Sherri Harms, Tsegaye Tadesse

    Abstract: Acquiring a better understanding of drought impacts becomes increasingly vital under a warming climate. Traditional drought indices describe mainly biophysical variables and not impacts on social, economic, and environmental systems. We utilized natural language processing and bidirectional encoder representation from Transformers (BERT) based transfer learning to fine-tune the model on the data f… ▽ More

    Submitted 7 December, 2022; originally announced December 2022.

    Comments: 5 pages (+3 in appendix), 5 figures in appendix, 2 tables (+1 in appendix), ICML Workshop on Tackling Climate Change with Machine Learning Workshop, 2021

  8. arXiv:2211.02768  [pdf, other

    cs.LG stat.AP

    Quantitative Assessment of Drought Impacts Using XGBoost based on the Drought Impact Reporter

    Authors: Beichen Zhang, Fatima K. Abu Salem, Michael J. Hayes, Tsegaye Tadesse

    Abstract: Under climate change, the increasing frequency, intensity, and spatial extent of drought events lead to higher socio-economic costs. However, the relationships between the hydro-meteorological indicators and drought impacts are not identified well yet because of the complexity and data scarcity. In this paper, we proposed a framework based on the extreme gradient model (XGBoost) for Texas to predi… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

    Comments: 4 pages with 2 figures and 1 table. NeurIPS workshop on Tackling Climate Change with Machine Learning, 2020

  9. Uncovering Visually Impaired Gamers' Preferences for Spatial Awareness Tools Within Video Games

    Authors: Vishnu Nair, Shao-en Ma, Ricardo E. Gonzalez Penuela, Yicheng He, Karen Lin, Mason Hayes, Hannah Huddleston, Matthew Donnelly, Brian A. Smith

    Abstract: Sighted players gain spatial awareness within video games through sight and spatial awareness tools (SATs) such as minimaps. Visually impaired players (VIPs), however, must often rely heavily on SATs to gain spatial awareness, especially in complex environments where using rich ambient sound design alone may be insufficient. Researchers have developed many SATs for facilitating spatial awareness w… ▽ More

    Submitted 30 August, 2022; originally announced August 2022.

    Journal ref: Proceedings of the 24th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS '22), October 2022

  10. arXiv:2208.06697  [pdf, other

    cs.NI

    Wireless Communications for Smart Manufacturing and Industrial IoT: Existing Technologies, 5G, and Beyond

    Authors: Md. Noor-A-Rahim, Jobish John, Fadhil Firyaguna, Dimitrios Zorbas, Hafiz Husnain Raza Sherazi, Sergii Kushch, Eoin O Connell, Dirk Pesch, Brendan O Flynn, Martin Hayes, Eddie Armstrong

    Abstract: Smart manufacturing is a vision and major driver for change in industrial environments. The goal of smart manufacturing is to optimize manufacturing processes through constantly monitoring and adapting processes towards more efficient and personalised manufacturing. This requires and relies on technologies for connected machines incorporating a variety of computation, sensing, actuation, and machi… ▽ More

    Submitted 13 August, 2022; originally announced August 2022.

    Comments: The manuscript has been submitted to IEEE for possible publication

  11. arXiv:2110.08935  [pdf, other

    cs.CV

    InfAnFace: Bridging the infant-adult domain gap in facial landmark estimation in the wild

    Authors: Michael Wan, Shaotong Zhu, Lingfei Luan, Gulati Prateek, Xiaofei Huang, Rebecca Schwartz-Mette, Marie Hayes, Emily Zimmerman, Sarah Ostadabbas

    Abstract: We lay the groundwork for research in the algorithmic comprehension of infant faces, in anticipation of applications from healthcare to psychology, especially in the early prediction of developmental disorders. Specifically, we introduce the first-ever dataset of infant faces annotated with facial landmark coordinates and pose attributes, demonstrate the inadequacies of existing facial landmark es… ▽ More

    Submitted 26 May, 2022; v1 submitted 17 October, 2021; originally announced October 2021.

  12. arXiv:1807.06036  [pdf, other

    cs.IR cs.LG stat.ML

    Pangloss: Fast Entity Linking in Noisy Text Environments

    Authors: Michael Conover, Matthew Hayes, Scott Blackburn, Pete Skomoroch, Sam Shah

    Abstract: Entity linking is the task of mapping potentially ambiguous terms in text to their constituent entities in a knowledge base like Wikipedia. This is useful for organizing content, extracting structured data from textual documents, and in machine learning relevance applications like semantic search, knowledge graph construction, and question answering. Traditionally, this work has focused on text th… ▽ More

    Submitted 16 July, 2018; originally announced July 2018.

    Comments: KDD 2018