Skip to main content

Showing 1–4 of 4 results for author: Eremeev, M

  1. arXiv:2406.17987  [pdf, other

    cs.CL cs.AI

    Multi-step Inference over Unstructured Data

    Authors: Aditya Kalyanpur, Kailash Saravanakumar, Victor Barres, CJ McFate, Lori Moon, Nati Seifu, Maksim Eremeev, Jose Barrera, Eric Brown, David Ferrucci

    Abstract: The advent of Large Language Models (LLMs) and Generative AI has revolutionized natural language applications across various domains. However, high-stakes decision-making tasks in fields such as medical, legal and finance require a level of precision, comprehensiveness, and logical consistency that pure LLM or Retrieval-Augmented-Generation (RAG) approaches often fail to deliver. At Elemental Cogn… ▽ More

    Submitted 11 July, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

  2. arXiv:2306.03652  [pdf, other

    cs.CL

    Injecting knowledge into language generation: a case study in auto-charting after-visit care instructions from medical dialogue

    Authors: Maksim Eremeev, Ilya Valmianski, Xavier Amatriain, Anitha Kannan

    Abstract: Factual correctness is often the limiting factor in practical applications of natural language generation in high-stakes domains such as healthcare. An essential requirement for maintaining factuality is the ability to deal with rare tokens. This paper focuses on rare tokens that appear in both the source and the reference sequences, and which, when missed during generation, decrease the factual c… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: ACL 2023 (main conference)

  3. arXiv:2112.08914  [pdf, other

    cs.LG cs.CL

    Characterizing and addressing the issue of oversmoothing in neural autoregressive sequence modeling

    Authors: Ilia Kulikov, Maksim Eremeev, Kyunghyun Cho

    Abstract: Neural autoregressive sequence models smear the probability among many possible sequences including degenerate ones, such as empty or repetitive sequences. In this work, we tackle one specific case where the model assigns a high probability to unreasonably short sequences. We define the oversmoothing rate to quantify this issue. After confirming the high degree of oversmoothing in neural machine t… ▽ More

    Submitted 22 December, 2021; v1 submitted 16 December, 2021; originally announced December 2021.

    Comments: Ilia Kulikov and Maksim Eremeev contributed equally

  4. arXiv:1809.02471  [pdf, ps, other

    cs.CR

    Protection of Information from Imitation on the Basis of Crypt-Code Structures

    Authors: Dmitry Samoylenko, Mikhail Eremeev, Oleg Finko, Sergey Dichenko

    Abstract: A system is offered for imitation resistant transmitting of encrypted information in wireless communication networks on the basis of redundant residue polynomial codes. The particular feature of this solution is complexing of methods for cryptographic protection of information and multi-character codes that correct errors, and the resulting structures (crypt-code structures) ensure stable function… ▽ More

    Submitted 7 September, 2018; originally announced September 2018.

    Comments: 21st International Multi-conference On Advanced Computer Systems Acs 2018 (Mi\K{E}Dzyzdroje, Poland, September 24-26, 2018)

    MSC Class: 11A07; 94A60; 94A62; 94B40; 94B15