Skip to main content

Showing 1–38 of 38 results for author: Panov, A

  1. arXiv:2407.09287  [pdf, other

    cs.AI

    Instruction Following with Goal-Conditioned Reinforcement Learning in Virtual Environments

    Authors: Zoya Volovikova, Alexey Skrynnik, Petr Kuderov, Aleksandr I. Panov

    Abstract: In this study, we address the issue of enabling an artificial intelligence agent to execute complex language instructions within virtual environments. In our framework, we assume that these instructions involve intricate linguistic structures and multiple interdependent tasks that must be navigated successfully to achieve the desired outcomes. To effectively manage these complexities, we propose a… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

  2. arXiv:2312.15908  [pdf, other

    cs.AI cs.LG cs.MA

    Decentralized Monte Carlo Tree Search for Partially Observable Multi-agent Pathfinding

    Authors: Alexey Skrynnik, Anton Andreychuk, Konstantin Yakovlev, Aleksandr Panov

    Abstract: The Multi-Agent Pathfinding (MAPF) problem involves finding a set of conflict-free paths for a group of agents confined to a graph. In typical MAPF scenarios, the graph and the agents' starting and ending vertices are known beforehand, allowing the use of centralized planning algorithms. However, in this study, we focus on the decentralized MAPF setting, where the agents may observe the other agen… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: The paper is accepted to AAAI-2024 conference

  3. arXiv:2311.06295  [pdf, other

    physics.chem-ph cs.LG

    Gradual Optimization Learning for Conformational Energy Minimization

    Authors: Artem Tsypin, Leonid Ugadiarov, Kuzma Khrabrov, Alexander Telepov, Egor Rumiantsev, Alexey Skrynnik, Aleksandr I. Panov, Dmitry Vetrov, Elena Tutubalina, Artur Kadurin

    Abstract: Molecular conformation optimization is crucial to computer-aided drug discovery and materials design. Traditional energy minimization techniques rely on iterative optimization methods that use molecular forces calculated by a physical simulator (oracle) as anti-gradients. However, this is a computationally expensive approach that requires many interactions with a physical simulator. One way to acc… ▽ More

    Submitted 12 March, 2024; v1 submitted 5 November, 2023; originally announced November 2023.

    Comments: Published as a conference paper at ICLR2024 (Poster)

  4. arXiv:2311.04640  [pdf, other

    cs.LG cs.AI cs.CV

    Object-Centric Learning with Slot Mixture Module

    Authors: Daniil Kirilenko, Vitaliy Vorobyov, Alexey K. Kovalev, Aleksandr I. Panov

    Abstract: Object-centric architectures usually apply a differentiable module to the entire feature map to decompose it into sets of entity representations called slots. Some of these methods structurally resemble clustering algorithms, where the cluster's center in latent space serves as a slot representation. Slot Attention is an example of such a method, acting as a learnable analog of the soft k-means al… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 17 pages, 6 figures

  5. arXiv:2311.04107  [pdf, other

    cs.RO cs.CV

    Interactive Semantic Map Representation for Skill-based Visual Object Navigation

    Authors: Tatiana Zemskova, Aleksei Staroverov, Kirill Muravyev, Dmitry Yudin, Aleksandr Panov

    Abstract: Visual object navigation using learning methods is one of the key tasks in mobile robotics. This paper introduces a new representation of a scene semantic map formed during the embodied agent interaction with the indoor environment. It is based on a neural network method that adjusts the weights of the segmentation model with backpropagation of the predicted fusion loss values during inference on… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

  6. arXiv:2310.17178  [pdf, other

    cs.AI cs.LG cs.RO

    Graphical Object-Centric Actor-Critic

    Authors: Leonid Ugadiarov, Aleksandr I. Panov

    Abstract: There have recently been significant advances in the problem of unsupervised object-centric representation learning and its application to downstream tasks. The latest works support the argument that employing disentangled object representations in image-based object-centric reinforcement learning tasks facilitates policy learning. We propose a novel object-centric reinforcement learning algorithm… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

  7. arXiv:2310.16362  [pdf, other

    cs.RO cs.LG

    Neural Potential Field for Obstacle-Aware Local Motion Planning

    Authors: Muhammad Alhaddad, Konstantin Mironov, Aleksey Staroverov, Aleksandr Panov

    Abstract: Model predictive control (MPC) may provide local motion planning for mobile robotic platforms. The challenging aspect is the analytic representation of collision cost for the case when both the obstacle map and robot footprint are arbitrary. We propose a Neural Potential Field: a neural network model that returns a differentiable collision cost based on robot pose, obstacle map, and robot footprin… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

  8. arXiv:2310.13391  [pdf, other

    cs.LG cs.AI cs.NE

    Learning Successor Features with Distributed Hebbian Temporal Memory

    Authors: Evgenii Dzhivelikian, Petr Kuderov, Aleksandr I. Panov

    Abstract: This paper presents a novel approach to address the challenge of online temporal memory learning for decision-making under uncertainty in non-stationary, partially observable environments. The proposed algorithm, Distributed Hebbian Temporal Memory (DHTM), is based on factor graph formalism and a multicomponent neuron model. DHTM aims to capture sequential data relationships and make cumulative pr… ▽ More

    Submitted 19 March, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

    Comments: 20 pages, 9 figures

  9. arXiv:2310.12031  [pdf, other

    cs.CV cs.AI cs.LG

    SegmATRon: Embodied Adaptive Semantic Segmentation for Indoor Environment

    Authors: Tatiana Zemskova, Margarita Kichik, Dmitry Yudin, Aleksei Staroverov, Aleksandr Panov

    Abstract: This paper presents an adaptive transformer model named SegmATRon for embodied image semantic segmentation. Its distinctive feature is the adaptation of model weights during inference on several images using a hybrid multicomponent loss function. We studied this model on datasets collected in the photorealistic Habitat and the synthetic AI2-THOR Simulators. We showed that obtaining additional imag… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: 14 pages, 6 figures

  10. arXiv:2310.01207  [pdf, other

    cs.AI cs.MA

    Learn to Follow: Decentralized Lifelong Multi-agent Pathfinding via Planning and Learning

    Authors: Alexey Skrynnik, Anton Andreychuk, Maria Nesterova, Konstantin Yakovlev, Aleksandr Panov

    Abstract: Multi-agent Pathfinding (MAPF) problem generally asks to find a set of conflict-free paths for a set of agents confined to a graph and is typically solved in a centralized fashion. Conversely, in this work, we investigate the decentralized MAPF setting, when the central controller that posses all the information on the agents' locations and goals is absent and the agents have to sequientially de… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: 12 pages, 11 figures

  11. arXiv:2307.14568  [pdf, other

    cs.RO cs.AI cs.LG

    Evaluation of Safety Constraints in Autonomous Navigation with Deep Reinforcement Learning

    Authors: Brian Angulo, Gregory Gorbov, Aleksandr Panov, Konstantin Yakovlev

    Abstract: While reinforcement learning algorithms have had great success in the field of autonomous navigation, they cannot be straightforwardly applied to the real autonomous systems without considering the safety constraints. The later are crucial to avoid unsafe behaviors of the autonomous vehicle on the road. To highlight the importance of these constraints, in this study, we compare two learnable navig… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

    Comments: 4 pages, 5 figures

  12. arXiv:2307.13453  [pdf, other

    cs.AI

    Monte-Carlo Tree Search for Multi-Agent Pathfinding: Preliminary Results

    Authors: Yelisey Pitanov, Alexey Skrynnik, Anton Andreychuk, Konstantin Yakovlev, Aleksandr Panov

    Abstract: In this work we study a well-known and challenging problem of Multi-agent Pathfinding, when a set of agents is confined to a graph, each agent is assigned a unique start and goal vertices and the task is to find a set of collision-free paths (one for each agent) such that each agent reaches its respective goal. We investigate how to utilize Monte-Carlo Tree Search (MCTS) to solve the problem. Alth… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

    Comments: The paper is accepted to HAIS 2023

  13. arXiv:2306.09459  [pdf, other

    cs.LG cs.AI

    Recurrent Action Transformer with Memory

    Authors: Alexey Staroverov, Egor Cherepanov, Dmitry Yudin, Alexey K. Kovalev, Aleksandr I. Panov

    Abstract: Recently, the use of transformers in offline reinforcement learning has become a rapidly developing area. This is due to their ability to treat the agent's trajectory in the environment as a sequence, thereby reducing the policy learning problem to sequence modeling. In environments where the agent's decisions depend on past events, it is essential to capture both the event itself and the decision… ▽ More

    Submitted 27 March, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: 15 pages, 11 figures

  14. arXiv:2301.10067  [pdf, other

    cs.LG cs.AI

    Intrinsic Motivation in Model-based Reinforcement Learning: A Brief Review

    Authors: Artem Latyshev, Aleksandr I. Panov

    Abstract: The reinforcement learning research area contains a wide range of methods for solving the problems of intelligent agent control. Despite the progress that has been made, the task of creating a highly autonomous agent is still a significant challenge. One potential solution to this problem is intrinsic motivation, a concept derived from developmental psychology. This review considers the existing m… ▽ More

    Submitted 24 January, 2023; originally announced January 2023.

    Comments: 13 pages, 7 figures

  15. Reinforcement Learning with Success Induced Task Prioritization

    Authors: Maria Nesterova, Alexey Skrynnik, Aleksandr Panov

    Abstract: Many challenging reinforcement learning (RL) problems require designing a distribution of tasks that can be applied to train effective policies. This distribution of tasks can be specified by the curriculum. A curriculum is meant to improve the results of learning and accelerate it. We introduce Success Induced Task Prioritization (SITP), a framework for automatic curriculum learning, where a task… ▽ More

    Submitted 30 December, 2022; originally announced January 2023.

    Journal ref: MICAI 2022. Lecture Notes in Computer Science, vol 13612

  16. arXiv:2212.14649  [pdf, other

    cs.CV cs.AI

    HPointLoc: Point-based Indoor Place Recognition using Synthetic RGB-D Images

    Authors: Dmitry Yudin, Yaroslav Solomentsev, Ruslan Musaev, Aleksei Staroverov, Aleksandr I. Panov

    Abstract: We present a novel dataset named as HPointLoc, specially designed for exploring capabilities of visual place recognition in indoor environment and loop detection in simultaneous localization and mapping. The loop detection sub-task is especially relevant when a robot with an on-board RGB-D camera can drive past the same place (``Point") at different angles. The dataset is based on the popular Habi… ▽ More

    Submitted 30 December, 2022; originally announced December 2022.

    Comments: Accepted for publishing in proceedings of the 29th International Conference on Neural Information Processing (ICONIP 2022)

  17. arXiv:2212.14307  [pdf, other

    cs.RO

    Policy Optimization to Learn Adaptive Motion Primitives in Path Planning with Dynamic Obstacles

    Authors: Brian Angulo, Aleksandr Panov, Konstantin Yakovlev

    Abstract: This paper addresses the kinodynamic motion planning for non-holonomic robots in dynamic environments with both static and dynamic obstacles -- a challenging problem that lacks a universal solution yet. One of the promising approaches to solve it is decomposing the problem into the smaller sub problems and combining the local solutions into the global one. The crux of any planning method for non-h… ▽ More

    Submitted 29 December, 2022; originally announced December 2022.

    Comments: 8 pages, 10 figures

  18. arXiv:2212.11730  [pdf, other

    cs.AI cs.LG

    TransPath: Learning Heuristics For Grid-Based Pathfinding via Transformers

    Authors: Daniil Kirilenko, Anton Andreychuk, Aleksandr Panov, Konstantin Yakovlev

    Abstract: Heuristic search algorithms, e.g. A*, are the commonly used tools for pathfinding on grids, i.e. graphs of regular structure that are widely employed to represent environments in robotics, video games etc. Instance-independent heuristics for grid graphs, e.g. Manhattan distance, do not take the obstacles into account and, thus, the search led by such heuristics performs poorly in the obstacle-rich… ▽ More

    Submitted 22 December, 2022; originally announced December 2022.

    Comments: Pre-print of the paper accepted to AAAI'23

  19. arXiv:2211.06552  [pdf, other

    cs.CL cs.AI

    Collecting Interactive Multi-modal Datasets for Grounded Language Understanding

    Authors: Shrestha Mohanty, Negar Arabzadeh, Milagro Teruel, Yuxuan Sun, Artem Zholus, Alexey Skrynnik, Mikhail Burtsev, Kavya Srinet, Aleksandr Panov, Arthur Szlam, Marc-Alexandre Côté, Julia Kiseleva

    Abstract: Human intelligence can remarkably adapt quickly to new tasks and environments. Starting from a very young age, humans acquire new skills and learn how to solve new tasks either by imitating the behavior of others or by following provided natural language instructions. To facilitate research which can enable similar capabilities in machines, we made the following contributions (1) formalized the co… ▽ More

    Submitted 21 March, 2023; v1 submitted 11 November, 2022; originally announced November 2022.

    Journal ref: Interactive Learning for Natural Language Processing NeurIPS 2022 Workshop

  20. arXiv:2211.00688  [pdf, other

    cs.AI cs.CL

    Learning to Solve Voxel Building Embodied Tasks from Pixels and Natural Language Instructions

    Authors: Alexey Skrynnik, Zoya Volovikova, Marc-Alexandre Côté, Anton Voronov, Artem Zholus, Negar Arabzadeh, Shrestha Mohanty, Milagro Teruel, Ahmed Awadallah, Aleksandr Panov, Mikhail Burtsev, Julia Kiseleva

    Abstract: The adoption of pre-trained language models to generate action plans for embodied agents is a promising research strategy. However, execution of instructions in real or simulated environments requires verification of the feasibility of actions as well as their relevance to the completion of a goal. We propose a new method that combines a language model and reinforcement learning for the task of bu… ▽ More

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: 6 pages, 3 figures

  21. arXiv:2206.10944  [pdf, other

    cs.LG cs.AI cs.MA

    POGEMA: Partially Observable Grid Environment for Multiple Agents

    Authors: Alexey Skrynnik, Anton Andreychuk, Konstantin Yakovlev, Aleksandr I. Panov

    Abstract: We introduce POGEMA (https://github.com/AIRI-Institute/pogema) a sandbox for challenging partially observable multi-agent pathfinding (PO-MAPF) problems . This is a grid-based environment that was specifically designed to be a flexible, tunable and scalable benchmark. It can be tailored to a variety of PO-MAPF, which can serve as an excellent testing ground for planning and learning methods, and t… ▽ More

    Submitted 22 June, 2022; originally announced June 2022.

    Comments: 7 pages, 7 figures

  22. arXiv:2206.00142  [pdf, other

    cs.LG cs.AI cs.CL

    IGLU Gridworld: Simple and Fast Environment for Embodied Dialog Agents

    Authors: Artem Zholus, Alexey Skrynnik, Shrestha Mohanty, Zoya Volovikova, Julia Kiseleva, Artur Szlam, Marc-Alexandre Coté, Aleksandr I. Panov

    Abstract: We present the IGLU Gridworld: a reinforcement learning environment for building and evaluating language conditioned embodied agents in a scalable way. The environment features visual agent embodiment, interactive learning through collaboration, language conditioned RL, and combinatorically hard task (3d blocks building) space.

    Submitted 31 May, 2022; originally announced June 2022.

  23. arXiv:2205.13771  [pdf, other

    cs.CL

    IGLU 2022: Interactive Grounded Language Understanding in a Collaborative Environment at NeurIPS 2022

    Authors: Julia Kiseleva, Alexey Skrynnik, Artem Zholus, Shrestha Mohanty, Negar Arabzadeh, Marc-Alexandre Côté, Mohammad Aliannejadi, Milagro Teruel, Ziming Li, Mikhail Burtsev, Maartje ter Hoeve, Zoya Volovikova, Aleksandr Panov, Yuxuan Sun, Kavya Srinet, Arthur Szlam, Ahmed Awadallah

    Abstract: Human intelligence has the remarkable ability to adapt to new tasks and environments quickly. Starting from a very young age, humans acquire new skills and learn how to solve new tasks either by imitating the behavior of others or by following provided natural language instructions. To facilitate research in this direction, we propose IGLU: Interactive Grounded Language Understanding in a Collabor… ▽ More

    Submitted 27 May, 2022; originally announced May 2022.

    Comments: arXiv admin note: text overlap with arXiv:2110.06536

  24. arXiv:2205.02388  [pdf, other

    cs.CL cs.AI

    Interactive Grounded Language Understanding in a Collaborative Environment: IGLU 2021

    Authors: Julia Kiseleva, Ziming Li, Mohammad Aliannejadi, Shrestha Mohanty, Maartje ter Hoeve, Mikhail Burtsev, Alexey Skrynnik, Artem Zholus, Aleksandr Panov, Kavya Srinet, Arthur Szlam, Yuxuan Sun, Marc-Alexandre Côté, Katja Hofmann, Ahmed Awadallah, Linar Abdrazakov, Igor Churin, Putra Manggala, Kata Naszadi, Michiel van der Meer, Taewoon Kim

    Abstract: Human intelligence has the remarkable ability to quickly adapt to new tasks and environments. Starting from a very young age, humans acquire new skills and learn how to solve new tasks either by imitating the behavior of others or by following provided natural language instructions. To facilitate research in this direction, we propose \emph{IGLU: Interactive Grounded Language Understanding in a Co… ▽ More

    Submitted 27 May, 2022; v1 submitted 4 May, 2022; originally announced May 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2110.06536

    Journal ref: Proceedings of Machine Learning Research NeurIPS 2021 Competition and Demonstration Track

  25. arXiv:2110.13241  [pdf, other

    cs.LG

    Multitask Adaptation by Retrospective Exploration with Learned World Models

    Authors: Artem Zholus, Aleksandr I. Panov

    Abstract: Model-based reinforcement learning (MBRL) allows solving complex tasks in a sample-efficient manner. However, no information is reused between the tasks. In this work, we propose a meta-learned addressing model called RAMa that provides training samples for the MBRL agent taken from continuously growing task-agnostic storage. The model is trained to maximize the expected agent's performance by sel… ▽ More

    Submitted 25 October, 2021; originally announced October 2021.

  26. arXiv:2110.06536  [pdf, other

    cs.AI

    NeurIPS 2021 Competition IGLU: Interactive Grounded Language Understanding in a Collaborative Environment

    Authors: Julia Kiseleva, Ziming Li, Mohammad Aliannejadi, Shrestha Mohanty, Maartje ter Hoeve, Mikhail Burtsev, Alexey Skrynnik, Artem Zholus, Aleksandr Panov, Kavya Srinet, Arthur Szlam, Yuxuan Sun, Katja Hofmann, Michel Galley, Ahmed Awadallah

    Abstract: Human intelligence has the remarkable ability to adapt to new tasks and environments quickly. Starting from a very young age, humans acquire new skills and learn how to solve new tasks either by imitating the behavior of others or by following provided natural language instructions. To facilitate research in this direction, we propose IGLU: Interactive Grounded Language Understanding in a Collabor… ▽ More

    Submitted 14 October, 2021; v1 submitted 13 October, 2021; originally announced October 2021.

  27. arXiv:2109.10173  [pdf, other

    cs.LG cs.AI

    Long-Term Exploration in Persistent MDPs

    Authors: Leonid Ugadiarov, Alexey Skrynnik, Aleksandr I. Panov

    Abstract: Exploration is an essential part of reinforcement learning, which restricts the quality of learned policy. Hard-exploration environments are defined by huge state space and sparse rewards. In such conditions, an exhaustive exploration of the environment is often impossible, and the successful training of an agent requires a lot of interaction steps. In this paper, we propose an exploration method… ▽ More

    Submitted 21 September, 2021; originally announced September 2021.

    Comments: This is a preprint of the paper accepted to MICAI 2021. It contains 13 pages and 6 figures

  28. arXiv:2109.09512  [pdf, other

    cs.AI cs.RO

    Landmark Policy Optimization for Object Navigation Task

    Authors: Aleksey Staroverov, Aleksandr I. Panov

    Abstract: This work studies object goal navigation task, which involves navigating to the closest object related to the given semantic category in unseen environments. Recent works have shown significant achievements both in the end-to-end Reinforcement Learning approach and modular systems, but need a big step forward to be robust and optimal. We propose a hierarchical method that incorporates standard tas… ▽ More

    Submitted 17 September, 2021; originally announced September 2021.

  29. arXiv:2108.06148  [pdf, other

    cs.LG cs.AI

    Q-Mixing Network for Multi-Agent Pathfinding in Partially Observable Grid Environments

    Authors: Vasilii Davydov, Alexey Skrynnik, Konstantin Yakovlev, Aleksandr I. Panov

    Abstract: In this paper, we consider the problem of multi-agent navigation in partially observable grid environments. This problem is challenging for centralized planning approaches as they, typically, rely on the full knowledge of the environment. We suggest utilizing the reinforcement learning approach when the agents, first, learn the policies that map observations to actions and then follow these polici… ▽ More

    Submitted 13 August, 2021; originally announced August 2021.

    Comments: This is a preprint of the paper accepted to RCAI 2021. It contains 11 pages and 5 figures

  30. arXiv:2006.09950  [pdf, other

    cs.LG cs.AI

    Delta Schema Network in Model-based Reinforcement Learning

    Authors: Andrey Gorodetskiy, Alexandra Shlychkova, Aleksandr I. Panov

    Abstract: This work is devoted to unresolved problems of Artificial General Intelligence - the inefficiency of transfer learning. One of the mechanisms that are used to solve this problem in the area of reinforcement learning is a model-based approach. In the paper we are expanding the schema networks method which allows to extract the logical relationships between objects and actions from the environment d… ▽ More

    Submitted 8 July, 2020; v1 submitted 17 June, 2020; originally announced June 2020.

    Comments: Published at the AGI 2020 conference

  31. arXiv:2006.09939  [pdf, other

    cs.LG cs.AI

    Forgetful Experience Replay in Hierarchical Reinforcement Learning from Demonstrations

    Authors: Alexey Skrynnik, Aleksey Staroverov, Ermek Aitygulov, Kirill Aksenov, Vasilii Davydov, Aleksandr I. Panov

    Abstract: Currently, deep reinforcement learning (RL) shows impressive results in complex gaming and robotic environments. Often these results are achieved at the expense of huge computational costs and require an incredible number of episodes of interaction between the agent and the environment. There are two main approaches to improving the sample efficiency of reinforcement learning methods - using hiera… ▽ More

    Submitted 17 June, 2020; originally announced June 2020.

  32. arXiv:1912.08664  [pdf, other

    cs.AI

    Hierarchical Deep Q-Network from Imperfect Demonstrations in Minecraft

    Authors: Alexey Skrynnik, Aleksey Staroverov, Ermek Aitygulov, Kirill Aksenov, Vasilii Davydov, Aleksandr I. Panov

    Abstract: We present Hierarchical Deep Q-Network (HDQfD) that took first place in the MineRL competition. HDQfD works on imperfect demonstrations and utilizes the hierarchical structure of expert trajectories. We introduce the procedure of extracting an effective sequence of meta-actions and subgoals from demonstration data. We present a structured task-dependent replay buffer and adaptive prioritizing tech… ▽ More

    Submitted 13 July, 2020; v1 submitted 18 December, 2019; originally announced December 2019.

  33. arXiv:1905.08217  [pdf

    physics.comp-ph cs.DC cs.PF

    Exploiting Parallelism on Shared Memory in the QED Particle-in-Cell Code PICADOR with Greedy Load Balancing

    Authors: Iosif Meyerov, Sergei Bastrakov, Aleksei Bashinov, Evgeny Efimenko, Alexander Panov, Elena Panova, Igor Surmin, Valentin Volokitin, Arkady Gonoskov

    Abstract: State-of-the-art numerical simulations of laser plasma by means of the Particle-in-Cell method are often extremely computationally intensive. Therefore there is a growing need for development of approaches for efficient utilization of resources of modern supercomputers. In this paper, we address the problem of a substantially non-uniform and dynamically varying distribution of macroparticles in a… ▽ More

    Submitted 20 May, 2019; originally announced May 2019.

    Comments: 11 pages, 5 figures. Submitted to PPAM-2019

  34. arXiv:1806.05292  [pdf, other

    cs.AI

    Automatic formation of the structure of abstract machines in hierarchical reinforcement learning with state clustering

    Authors: Aleksandr I. Panov, Aleksey Skrynnik

    Abstract: We introduce a new approach to hierarchy formation and task decomposition in hierarchical reinforcement learning. Our method is based on the Hierarchy Of Abstract Machines (HAM) framework because HAM approach is able to design efficient controllers that will realize specific behaviors in real robots. The key to our algorithm is the introduction of the internal or "mental" environment in which the… ▽ More

    Submitted 13 June, 2018; originally announced June 2018.

  35. arXiv:1607.08181  [pdf, other

    cs.AI

    Psychologically inspired planning method for smart relocation task

    Authors: Aleksandr I. Panov, Konstantin Yakovlev

    Abstract: Behavior planning is known to be one of the basic cognitive functions, which is essential for any cognitive architecture of any control system used in robotics. At the same time most of the widespread planning algorithms employed in those systems are developed using only approaches and models of Artificial Intelligence and don't take into account numerous results of cognitive experiments. As a res… ▽ More

    Submitted 27 July, 2016; originally announced July 2016.

    Comments: As submitted to the 7th International Conference on Biologically Inspired Cognitive Architectures (BICA 2016), New-York, USA, July 16-19 2016

  36. Behavior and path planning for the coalition of cognitive robots in smart relocation tasks

    Authors: Aleksandr I. Panov, Konstantin Yakovlev

    Abstract: In this paper we outline the approach of solving special type of navigation tasks for robotic systems, when a coalition of robots (agents) acts in the 2D environment, which can be modified by the actions, and share the same goal location. The latter is originally unreachable for some members of the coalition, but the common task still can be accomplished as the agents can assist each other (e.g. b… ▽ More

    Submitted 27 July, 2016; originally announced July 2016.

    Comments: As submitted to the 4th International Conference on Robot Intelligence Technology and Applications (RiTA-2015), Bucheon, Korea, December 14-16, 2015

  37. arXiv:1311.2056  [pdf

    cs.OH

    The structure and functions of an automated project management system for the centers of scientific and technical creativity of students

    Authors: V. M. Dmitriev, T. V. Gandzha, V. V. Gandzha, S. A. Panov

    Abstract: This article discusses the possibility of automating of the student's projecting through the use of automated project management system. There are described the purpose, structure and formalism of automated workplace of student-designer (AWSD), and shown its structural-functional diagram.

    Submitted 7 November, 2013; originally announced November 2013.

    Comments: in Russian

  38. arXiv:1311.1587  [pdf

    cs.OH

    Tasks and architecture of documentation subsystem in multi-level modeling environment MARS

    Authors: T. V. Gandzha, S. A Panov

    Abstract: The article describes the automated documentation system designed to generate reports on research conducted by computer complex technical objects and systems in multi-level modeling environment «MARS». We defined the purposes, tasks and abilities of documentation system and examined the types and structure of documents, and gave an example of its practical use

    Submitted 7 November, 2013; originally announced November 2013.

    Comments: in Russian