Skip to main content

Showing 1–3 of 3 results for author: Bespalov, D

  1. arXiv:2404.08690  [pdf, other

    cs.CL cs.AI cs.CR cs.LG

    Towards Building a Robust Toxicity Predictor

    Authors: Dmitriy Bespalov, Sourav Bhabesh, Yi Xiang, Liutong Zhou, Yanjun Qi

    Abstract: Recent NLP literature pays little attention to the robustness of toxicity language predictors, while these systems are most likely to be used in adversarial contexts. This paper presents a novel adversarial attack, \texttt{ToxicTrap}, introducing small word-level perturbations to fool SOTA text classifiers to predict toxic text samples as benign. ToxicTrap exploits greedy based search strategies t… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: ACL 2023 /

  2. arXiv:2312.04684  [pdf, other

    cs.CL cs.AI

    Latent Skill Discovery for Chain-of-Thought Reasoning

    Authors: Zifan Xu, Haozhu Wang, Dmitriy Bespalov, Peter Stone, Yanjun Qi

    Abstract: Recent advances in Large Language Models (LLMs) have led to an emergent ability of chain-of-thought (CoT) prompting, a prompt reasoning strategy that adds intermediate rationale steps between questions and answers to construct prompts. Conditioned on these prompts, LLMs can effectively learn in context to generate rationales that lead to more accurate answers than when answering the same question… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  3. arXiv:1102.3563  [pdf, ps, other

    cs.DC

    Parallel algorithms for SAT in application to inversion problems of some discrete functions

    Authors: Alexander Semenov, Oleg Zaikin, Dmitry Bespalov, Mikhail Posypkin

    Abstract: In this article we consider the inversion problem for polynomially computable discrete functions. These functions describe behavior of many discrete systems and are used in model checking, hardware verification, cryptanalysis, computer biology and other domains. Quite often it is necessary to invert these functions, i.e. to find an unknown preimage if an image and algorithm of function computation… ▽ More

    Submitted 17 February, 2011; originally announced February 2011.

    Comments: 16 pages, 8 figures