Skip to main content

Showing 1–4 of 4 results for author: Ozyildirim, M

  1. arXiv:2403.20329  [pdf, other

    cs.CL cs.AI cs.LG

    ReALM: Reference Resolution As Language Modeling

    Authors: Joel Ruben Antony Moniz, Soundarya Krishnan, Melis Ozyildirim, Prathamesh Saraf, Halim Cagri Ates, Yuan Zhang, Hong Yu, Nidhi Rajshree

    Abstract: Reference resolution is an important problem, one that is essential to understand and successfully handle context of different kinds. This context includes both previous turns and context that pertains to non-conversational entities, such as entities on the user's screen or those running in the background. While LLMs have been shown to be extremely powerful for a variety of tasks, their use in ref… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

  2. MARRS: Multimodal Reference Resolution System

    Authors: Halim Cagri Ates, Shruti Bhargava, Site Li, Jiarui Lu, Siddhardha Maddula, Joel Ruben Antony Moniz, Anil Kumar Nalamalapu, Roman Hoang Nguyen, Melis Ozyildirim, Alkesh Patel, Dhivya Piraviperumal, Vincent Renkens, Ankit Samal, Thy Tran, Bo-Hsiang Tseng, Hong Yu, Yuan Zhang, Rong Zou

    Abstract: Successfully handling context is essential for any dialog understanding task. This context maybe be conversational (relying on previous user queries or system responses), visual (relying on what the user sees, for example, on their screen), or background (based on signals such as a ringing alarm or playing music). In this work, we present an overview of MARRS, or Multimodal Reference Resolution Sy… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: Sixth Workshop on Computational Models of Reference, Anaphora and Coreference (CRAC 2023)

  3. arXiv:2201.11182  [pdf, other

    cs.LG cs.NE

    Hyperparameter Tuning for Deep Reinforcement Learning Applications

    Authors: Mariam Kiran, Melis Ozyildirim

    Abstract: Reinforcement learning (RL) applications, where an agent can simply learn optimal behaviors by interacting with the environment, are quickly gaining tremendous success in a wide variety of applications from controlling simple pendulums to complex data centers. However, setting the right hyperparameters can have a huge impact on the deployed solution performance and reliability in the inference mod… ▽ More

    Submitted 26 January, 2022; originally announced January 2022.

    Comments: 11 pages, 6 figures

    ACM Class: I.2.6; I.2.11

  4. arXiv:2002.12642  [pdf, other

    cs.LG stat.ML

    Do optimization methods in deep learning applications matter?

    Authors: Buse Melis Ozyildirim, Mariam Kiran

    Abstract: With advances in deep learning, exponential data growth and increasing model complexity, developing efficient optimization methods are attracting much research attention. Several implementations favor the use of Conjugate Gradient (CG) and Stochastic Gradient Descent (SGD) as being practical and elegant solutions to achieve quick convergence, however, these optimization processes also present many… ▽ More

    Submitted 28 February, 2020; originally announced February 2020.