Skip to main content

Showing 1–5 of 5 results for author: Nikolich, A

  1. arXiv:2405.13929  [pdf, other

    cs.CL cs.AI

    Vikhr: The Family of Open-Source Instruction-Tuned Large Language Models for Russian

    Authors: Aleksandr Nikolich, Konstantin Korolev, Artem Shelmanov, Igor Kiselev

    Abstract: There has been a surge in the development of various Large Language Models (LLMs). However, text generation for languages other than English often faces significant challenges, including poor generation quality and the reduced computational performance due to the disproportionate representation of tokens in model's vocabulary. In this work, we address these issues and introduce Vikhr, a new state-… ▽ More

    Submitted 19 June, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

  2. arXiv:2404.13146  [pdf, other

    cs.CR cs.CV

    DeepFake-O-Meter v2.0: An Open Platform for DeepFake Detection

    Authors: Yan Ju, Chengzhe Sun, Shan Jia, Shuwei Hou, Zhaofeng Si, Soumyya Kanti Datta, Lipeng Ke, Riky Zhou, Anita Nikolich, Siwei Lyu

    Abstract: Deepfakes, as AI-generated media, have increasingly threatened media integrity and personal privacy with realistic yet fake digital content. In this work, we introduce an open-source and user-friendly online platform, DeepFake-O-Meter v2.0, that integrates state-of-the-art methods for detecting Deepfake images, videos, and audio. Built upon DeepFake-O-Meter v1.0, we have made significant upgrades… ▽ More

    Submitted 27 June, 2024; v1 submitted 19 April, 2024; originally announced April 2024.

  3. arXiv:2204.01790  [pdf, other

    cs.SI cs.IR

    Leaders or Followers? A Temporal Analysis of Tweets from IRA Trolls

    Authors: Siva K. Balasubramanian, Mustafa Bilgic, Aron Culotta, Libby Hemphill, Anita Nikolich, Matthew A. Shapiro

    Abstract: The Internet Research Agency (IRA) influences online political conversations in the United States, exacerbating existing partisan divides and sowing discord. In this paper we investigate the IRA's communication strategies by analyzing trending terms on Twitter to identify cases in which the IRA leads or follows other users. Our analysis focuses on over 38M tweets posted between 2016 and 2017 from… ▽ More

    Submitted 4 April, 2022; originally announced April 2022.

    Comments: ICWSM 2022

  4. arXiv:2112.02448  [pdf, other

    cs.CL cs.AI cs.LG

    Emojich -- zero-shot emoji generation using Russian language: a technical report

    Authors: Alex Shonenkov, Daria Bakshandaeva, Denis Dimitrov, Aleksandr Nikolich

    Abstract: This technical report presents a text-to-image neural network "Emojich" that generates emojis using captions in Russian language as a condition. We aim to keep the generalization ability of a pretrained big model ruDALL-E Malevich (XL) 1.3B parameters at the fine-tuning stage, while giving special style to the images generated. Here are presented some engineering methods, code realization, all hyp… ▽ More

    Submitted 12 January, 2022; v1 submitted 4 December, 2021; originally announced December 2021.

    Comments: 5 pages, 4 figures and big figure at appendix, technical report

  5. arXiv:2108.03502  [pdf, other

    cs.CL

    Fine-tuning GPT-3 for Russian Text Summarization

    Authors: Alexandr Nikolich, Arina Puchkova

    Abstract: Automatic summarization techniques aim to shorten and generalize information given in the text while preserving its core message and the most relevant ideas. This task can be approached and treated with a variety of methods, however, not many attempts have been made to produce solutions specifically for the Russian language despite existing localizations of the state-of-the-art models. In this pap… ▽ More

    Submitted 7 August, 2021; originally announced August 2021.