Skip to main content

Showing 1–6 of 6 results for author: Batra, A

  1. Reviewing FID and SID Metrics on Generative Adversarial Networks

    Authors: Ricardo de Deijn, Aishwarya Batra, Brandon Koch, Naseef Mansoor, Hema Makkena

    Abstract: The growth of generative adversarial network (GAN) models has increased the ability of image processing and provides numerous industries with the technology to produce realistic image transformations. However, with the field being recently established there are new evaluation metrics that can further this research. Previous research has shown the Fréchet Inception Distance (FID) to be an effective… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 14 pages 9 figures 1 table Included in IOTBS, NLTM, AIMLA, DBDM - 2024 Conference Proceedings Editor: David C. Wyld et al

    Journal ref: CS & IT - CSCP (2024) 111-124

  2. arXiv:2312.12620  [pdf, ps, other

    cs.CY

    "It Can Relate to Real Lives": Attitudes and Expectations in Justice-Centered Data Structures & Algorithms for Non-Majors

    Authors: Anna Batra, Iris Zhou, Suh Young Choi, Chongjiu Gao, Yanbing Xiao, Sonia Fereidooni, Kevin Lin

    Abstract: Prior work has argued for a more justice-centered approach to postsecondary computing education by emphasizing ethics, identity, and political vision. In this experience report, we examine how postsecondary students of diverse gender and racial identities experience a justice-centered Data Structures and Algorithms designed for undergraduate non-computer science majors. Through a quantitative and… ▽ More

    Submitted 15 March, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: Experience Reports and Tools paper in the Proceedings of the 55th ACM Technical Symposium on Computer Science Education V. 1 (SIGCSE 2024); 7 pages

    ACM Class: K.3.2

  3. arXiv:2311.15964  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Efficient Pre-training for Localized Instruction Generation of Videos

    Authors: Anil Batra, Davide Moltisanti, Laura Sevilla-Lara, Marcus Rohrbach, Frank Keller

    Abstract: Procedural videos, exemplified by recipe demonstrations, are instrumental in conveying step-by-step instructions. However, understanding such videos is challenging as it involves the precise localization of steps and the generation of textual instructions. Manually annotating steps and writing instructions is costly, which limits the size of current datasets and hinders effective learning. Leverag… ▽ More

    Submitted 23 May, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: updated version

  4. arXiv:2306.00501  [pdf, other

    cs.CV cs.AI cs.LG

    Image generation with shortest path diffusion

    Authors: Ayan Das, Stathi Fotiadis, Anil Batra, Farhang Nabiei, FengTing Liao, Sattar Vakili, Da-shan Shiu, Alberto Bernacchia

    Abstract: The field of image generation has made significant progress thanks to the introduction of Diffusion Models, which learn to progressively reverse a given image corruption. Recently, a few studies introduced alternative ways of corrupting images in Diffusion Models, with an emphasis on blurring. However, these studies are purely empirical and it remains unclear what is the optimal procedure for corr… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: AD and SF contributed equally

  5. arXiv:2209.15501  [pdf, other

    cs.CV

    A Closer Look at Temporal Ordering in the Segmentation of Instructional Videos

    Authors: Anil Batra, Shreyank N Gowda, Frank Keller, Laura Sevilla-Lara

    Abstract: Understanding the steps required to perform a task is an important skill for AI systems. Learning these steps from instructional videos involves two subproblems: (i) identifying the temporal boundary of sequentially occurring segments and (ii) summarizing these steps in natural language. We refer to this task as Procedure Segmentation and Summarization (PSS). In this paper, we take a closer look a… ▽ More

    Submitted 7 October, 2022; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: Accepted at BMVC 2022

  6. arXiv:2207.11033  [pdf

    cs.HC cs.CV

    GesSure- A Robust Face-Authentication enabled Dynamic Gesture Recognition GUI Application

    Authors: Ankit Jha, Ishita, Pratham G. Shenwai, Ayush Batra, Siddharth Kotian, Piyush Modi

    Abstract: Using physical interactive devices like mouse and keyboards hinders naturalistic human-machine interaction and increases the probability of surface contact during a pandemic. Existing gesture-recognition systems do not possess user authentication, making them unreliable. Static gestures in current gesture-recognition technology introduce long adaptation periods and reduce user compatibility. Our t… ▽ More

    Submitted 7 September, 2022; v1 submitted 22 July, 2022; originally announced July 2022.

    Comments: Accepted at International Conference on Artificial Intelligence Advances (AIAD 2022)

    Journal ref: IJCI Conference Proceedings, International Conference on Artificial Intelligence Advances (AIAD 2022)