Showing 1–10 of 10 results for author: Jangra, A

  1. arXiv:2404.04728  [pdf, other]

    cs.CL cs.HC

    Navigating the Landscape of Hint Generation Research: From the Past to the Future

    Authors: Anubhav Jangra, Jamshid Mozafari, Adam Jatowt, Smaranda Muresan

    Abstract: Digital education has gained popularity in the last decade, especially after the COVID-19 pandemic. With the improving capabilities of large language models to reason and communicate with users, envisioning intelligent tutoring systems (ITSs) that can facilitate self-learning is not very far-fetched. One integral component to fulfill this vision is the ability to give accurate and effective feedba…

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: Submitted to TACL'24

  2. TriviaHG: A Dataset for Automatic Hint Generation from Factoid Questions

    Authors: Jamshid Mozafari, Anubhav Jangra, Adam Jatowt

    Abstract: Nowadays, individuals tend to engage in dialogues with Large Language Models, seeking answers to their questions. In times when such answers are readily accessible to anyone, the stimulation and preservation of human's cognitive abilities, as well as the assurance of maintaining good reasoning skills by humans becomes crucial. This study addresses such needs by proposing hints (instead of final an…

    Submitted 10 May, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: Accepted at SIGIR 2024

    Journal ref: Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2024)

  3. arXiv:2302.06560  [pdf, other]

    cs.CL cs.MM

    Large Scale Multi-Lingual Multi-Modal Summarization Dataset

    Authors: Yash Verma, Anubhav Jangra, Raghvendra Kumar, Sriparna Saha

    Abstract: Significant developments in techniques such as encoder-decoder models have enabled us to represent information comprising multiple modalities. This information can further enhance many downstream tasks in the field of information retrieval and natural language processing; however, improvements in multi-modal techniques and their performance evaluation require large-scale multi-modal data which off…

    Submitted 13 February, 2023; originally announced February 2023.

  4. arXiv:2212.01669  [pdf, other]

    cs.CL

    A Survey on Medical Document Summarization

    Authors: Raghav Jain, Anubhav Jangra, Sriparna Saha, Adam Jatowt

    Abstract: The internet has had a dramatic effect on the healthcare industry, allowing documents to be saved, shared, and managed digitally. This has made it easier to locate and share important data, improving patient care and providing more opportunities for medical studies. As there is so much data accessible to doctors and patients alike, summarizing it has become increasingly necessary - this has been s…

    Submitted 3 December, 2022; originally announced December 2022.

  5. arXiv:2212.01667  [pdf, other]

    cs.CL

    T-STAR: Truthful Style Transfer using AMR Graph as Intermediate Representation

    Authors: Anubhav Jangra, Preksha Nema, Aravindan Raghuveer

    Abstract: Unavailability of parallel corpora for training text style transfer (TST) models is a very challenging yet common scenario. Also, TST models implicitly need to preserve the content while transforming a source sentence into the target style. To tackle these problems, an intermediate representation is often constructed that is devoid of style while still preserving the meaning of the source sentence…

    Submitted 3 December, 2022; originally announced December 2022.

    Comments: Accepted in EMNLP 2022

  6. arXiv:2204.09140  [pdf, other]

    cs.CL cs.AI cs.IR

    Multi-hop Question Answering

    Authors: Vaibhav Mavi, Anubhav Jangra, Adam Jatowt

    Abstract: The task of Question Answering (QA) has attracted significant research interest for long. Its relevance to language understanding and knowledge retrieval tasks, along with the simple setting makes the task of QA crucial for strong AI systems. Recent success on simple QA tasks has shifted the focus to more complex settings. Among these, Multi-Hop QA (MHQA) is one of the most researched tasks over t…

    Submitted 31 May, 2024; v1 submitted 19 April, 2022; originally announced April 2022.

    Comments: Published at Foundations and Trends in Information Retrieval

  7. arXiv:2201.09282  [pdf, other]

    cs.CL

    WIDAR -- Weighted Input Document Augmented ROUGE

    Authors: Raghav Jain, Vaibhav Mavi, Anubhav Jangra, Sriparna Saha

    Abstract: The task of automatic text summarization has gained a lot of traction due to the recent advancements in machine learning techniques. However, evaluating the quality of a generated summary remains to be an open problem. The literature has widely adopted Recall-Oriented Understudy for Gisting Evaluation (ROUGE) as the standard evaluation metric for summarization. However, ROUGE has some long-establi…

    Submitted 23 January, 2022; originally announced January 2022.

    Comments: Manuscript accepted as a full paper at ECIR 2022

  8. arXiv:2109.05199  [pdf, other]

    cs.CL cs.MM cs.NE

    A Survey on Multi-modal Summarization

    Authors: Anubhav Jangra, Sourajit Mukherjee, Adam Jatowt, Sriparna Saha, Mohammad Hasanuzzaman

    Abstract: The new era of technology has brought us to the point where it is convenient for people to share their opinions over an abundance of platforms. These platforms have a provision for the users to express themselves in multiple forms of representations, including text, images, videos, and audio. This, however, makes it difficult for users to obtain all the key information about a topic, making the ta…

    Submitted 13 February, 2023; v1 submitted 11 September, 2021; originally announced September 2021.

    Comments: Accepted in ACM CSUR 2023

  9. arXiv:2105.01296  [pdf, other]

    cs.CL cs.LG

    Semantic Extractor-Paraphraser based Abstractive Summarization

    Authors: Anubhav Jangra, Raghav Jain, Vaibhav Mavi, Sriparna Saha, Pushpak Bhattacharyya

    Abstract: The anthology of spoken languages today is inundated with textual information, necessitating the development of automatic summarization models. In this manuscript, we propose an extractor-paraphraser based abstractive summarization system that exploits semantic overlap as opposed to its predecessors that focus more on syntactic information overlap. Our model outperforms the state-of-the-art baseli…

    Submitted 4 May, 2021; originally announced May 2021.

  10. arXiv:2005.09252  [pdf, other]

    cs.IR

    Multi-Modal Summary Generation using Multi-Objective Optimization

    Authors: Anubhav Jangra, Sriparna Saha, Adam Jatowt, Mohammad Hasanuzzaman

    Abstract: Significant development of communication technology over the past few years has motivated research in multi-modal summarization techniques. A majority of the previous works on multi-modal summarization focus on text and images. In this paper, we propose a novel extractive multi-objective optimization based model to produce a multi-modal summary containing text, images, and videos. Important object…

    Submitted 19 May, 2020; originally announced May 2020.

    Comments: 5 pages, 2 figures