Skip to main content

Showing 1–30 of 30 results for author: Di Castro, D

  1. arXiv:2407.01302  [pdf, other

    cs.CV cs.AI cs.RO

    Robot Instance Segmentation with Few Annotations for Grasping

    Authors: Moshe Kimhi, David Vainshtein, Chaim Baskin, Dotan Di Castro

    Abstract: The ability of robots to manipulate objects relies heavily on their aptitude for visual perception. In domains characterized by cluttered scenes and high object variability, most methods call for vast labeled datasets, laboriously hand-annotated, with the aim of training capable models. Once deployed, the challenge of generalizing to unfamiliar objects implies that the model must evolve alongside… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2406.16093  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Towards Natural Language-Driven Assembly Using Foundation Models

    Authors: Omkar Joglekar, Tal Lancewicki, Shir Kozlovsky, Vladimir Tchuiev, Zohar Feldman, Dotan Di Castro

    Abstract: Large Language Models (LLMs) and strong vision models have enabled rapid research and development in the field of Vision-Language-Action models that enable robotic control. The main objective of these methods is to develop a generalist policy that can control robots with various embodiments. However, in industrial robotic applications such as automated assembly and disassembly, some tasks, such as… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

  3. arXiv:2406.02158  [pdf, other

    cs.CV cs.LG

    Radar Spectra-Language Model for Automotive Scene Parsing

    Authors: Mariia Pushkareva, Yuri Feldman, Csaba Domokos, Kilian Rambach, Dotan Di Castro

    Abstract: Radar sensors are low cost, long-range, and weather-resilient. Therefore, they are widely used for driver assistance functions, and are expected to be crucial for the success of autonomous driving in the future. In many perception tasks only pre-processed radar point clouds are considered. In contrast, radar spectra are a raw form of radar measurements and contain more information than radar point… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  4. arXiv:2402.11996  [pdf, other

    cs.CV cs.LG

    ISCUTE: Instance Segmentation of Cables Using Text Embedding

    Authors: Shir Kozlovsky, Omkar Joglekar, Dotan Di Castro

    Abstract: In the field of robotics and automation, conventional object recognition and instance segmentation methods face a formidable challenge when it comes to perceiving Deformable Linear Objects (DLOs) like wires, cables, and flexible tubes. This challenge arises primarily from the lack of distinct attributes such as shape, color, and texture, which calls for tailored solutions to achieve precise identi… ▽ More

    Submitted 27 February, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

  5. arXiv:2402.04046  [pdf, other

    cs.SI cs.AI cs.LG

    Generative Modeling of Graphs via Joint Diffusion of Node and Edge Attributes

    Authors: Nimrod Berman, Eitan Kosman, Dotan Di Castro, Omri Azencot

    Abstract: Graph generation is integral to various engineering and scientific disciplines. Nevertheless, existing methodologies tend to overlook the generation of edge attributes. However, we identify critical applications where edge attributes are essential, making prior methods potentially unsuitable in such contexts. Moreover, while trivial adaptations are available, empirical investigations reveal their… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  6. arXiv:2401.06890  [pdf, other

    cs.LG

    An Axiomatic Approach to Model-Agnostic Concept Explanations

    Authors: Zhili Feng, Michal Moshkovitz, Dotan Di Castro, J. Zico Kolter

    Abstract: Concept explanation is a popular approach for examining how human-interpretable concepts impact the predictions of a model. However, most existing methods for concept explanations are tailored to specific models. To address this issue, this paper focuses on model-agnostic measures. Specifically, we propose an approach to concept explanations that satisfy three natural axioms: linearity, recursivit… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

  7. arXiv:2306.13630  [pdf, other

    cs.RO cs.AI cs.LG

    Offline Skill Graph (OSG): A Framework for Learning and Planning using Offline Reinforcement Learning Skills

    Authors: Ben-ya Halevy, Yehudit Aperstein, Dotan Di Castro

    Abstract: Reinforcement Learning has received wide interest due to its success in competitive games. Yet, its adoption in everyday applications is limited (e.g. industrial, home, healthcare, etc.). In this paper, we address this limitation by presenting a framework for planning over offline skills and solving complex tasks in real-world environments. Our framework is comprised of three modules that together… ▽ More

    Submitted 23 June, 2023; originally announced June 2023.

  8. arXiv:2303.15827  [pdf, other

    cs.LG math.NA stat.ML

    CONFIDE: Contextual Finite Differences Modelling of PDEs

    Authors: Ori Linial, Orly Avner, Dotan Di Castro

    Abstract: We introduce a method for inferring an explicit PDE from a data sample generated by previously unseen dynamics, based on a learned context. The training phase integrates knowledge of the form of the equation with a differential scheme, while the inference phase yields a PDE that fits the data sample and enables both signal prediction and data explanation. We include results of extensive experiment… ▽ More

    Submitted 7 June, 2024; v1 submitted 28 March, 2023; originally announced March 2023.

  9. AG2U -- Autonomous Grading Under Uncertainties

    Authors: Yakov Miron, Yuval Goldfracht, Chana Ross, Dotan Di Castro, Itzik Klein

    Abstract: Surface grading, the process of leveling an uneven area containing pre-dumped sand piles, is an important task in the construction site pipeline. This labour-intensive process is often carried out by a dozer, a key machinery tool at any construction site. Current attempts to automate surface grading assume perfect localization. However, in real-world scenarios, this assumption fails, as agents are… ▽ More

    Submitted 4 August, 2022; originally announced August 2022.

    Comments: 8 Pages

    Report number: ras.ral.22-2218.3966ab9e

    Journal ref: in IEEE Robotics and Automation Letters, vol. 8, no. 1, pp. 65-72, Jan. 2023

  10. arXiv:2207.01375  [pdf, other

    cs.CV cs.AI

    GraphVid: It Only Takes a Few Nodes to Understand a Video

    Authors: Eitan Kosman, Dotan Di Castro

    Abstract: We propose a concise representation of videos that encode perceptually meaningful features into graphs. With this representation, we aim to leverage the large amount of redundancies in videos and save computations. First, we construct superpixel-based graph representations of videos by considering superpixels as graph nodes and create spatial and temporal connections between adjacent superpixels.… ▽ More

    Submitted 20 July, 2022; v1 submitted 4 July, 2022; originally announced July 2022.

    Comments: Accepted to ECCV2022 (Oral)

  11. arXiv:2206.06091  [pdf, other

    cs.RO cs.AI cs.LG

    Towards Autonomous Grading In The Real World

    Authors: Yakov Miron, Chana Ross, Yuval Goldfracht, Chen Tessler, Dotan Di Castro

    Abstract: In this work, we aim to tackle the problem of autonomous grading, where a dozer is required to flatten an uneven area. In addition, we explore methods for bridging the gap between a simulated environment and real scenarios. We design both a realistic physical simulation and a scaled real prototype environment mimicking the real dozer dynamics and sensory information. We establish heuristics and le… ▽ More

    Submitted 25 July, 2022; v1 submitted 13 June, 2022; originally announced June 2022.

    Comments: 7 pages, Accepted to IEEE-IROS2022

  12. arXiv:2203.01153  [pdf, other

    cs.RO cs.AI

    InsertionNet 2.0: Minimal Contact Multi-Step Insertion Using Multimodal Multiview Sensory Input

    Authors: Oren Spector, Vladimir Tchuiev, Dotan Di Castro

    Abstract: We address the problem of devising the means for a robot to rapidly and safely learn insertion skills with just a few human interventions and without hand-crafted rewards or demonstrations. Our InsertionNet version 2.0 provides an improved technique to robustly cope with a wide range of use-cases featuring different shapes, colors, initial poses, etc. In particular, we present a regression-based m… ▽ More

    Submitted 2 March, 2022; originally announced March 2022.

    Comments: Accepted to ICRA 2022, InsertionNet 1.0 : https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=9420246

  13. arXiv:2112.10877  [pdf, other

    cs.RO cs.AI cs.LG

    AGPNet -- Autonomous Grading Policy Network

    Authors: Chana Ross, Yakov Miron, Yuval Goldfracht, Dotan Di Castro

    Abstract: In this work, we establish heuristics and learning strategies for the autonomous control of a dozer grading an uneven area studded with sand piles. We formalize the problem as a Markov Decision Process, design a simulation which demonstrates agent-environment interactions and finally compare our simulator to a real dozer prototype. We use methods from reinforcement learning, behavior cloning and c… ▽ More

    Submitted 20 December, 2021; originally announced December 2021.

    Comments: 7 pages, paper submitted to IEEE International Conference on Robotics and Automation

  14. arXiv:2111.05694  [pdf, other

    cs.LG cs.AI

    LSP : Acceleration and Regularization of Graph Neural Networks via Locality Sensitive Pruning of Graphs

    Authors: Eitan Kosman, Joel Oren, Dotan Di Castro

    Abstract: Graph Neural Networks (GNNs) have emerged as highly successful tools for graph-related tasks. However, real-world problems involve very large graphs, and the compute resources needed to fit GNNs to those problems grow rapidly. Moreover, the noisy nature and size of real-world graphs cause GNNs to over-fit if not regularized properly. Surprisingly, recent works show that large graphs often involve… ▽ More

    Submitted 10 November, 2021; originally announced November 2021.

  15. arXiv:2111.01510  [pdf, other

    cs.RO cs.AI

    A Hybrid Approach for Learning to Shift and Grasp with Elaborate Motion Primitives

    Authors: Zohar Feldman, Hanna Ziesche, Ngo Anh Vien, Dotan Di Castro

    Abstract: Many possible fields of application of robots in real world settings hinge on the ability of robots to grasp objects. As a result, robot grasping has been an active field of research for many years. With our publication we contribute to the endeavor of enabling robots to grasp, with a particular focus on bin picking applications. Bin picking is especially challenging due to the often cluttered and… ▽ More

    Submitted 2 November, 2021; originally announced November 2021.

  16. arXiv:2110.00445  [pdf, ps, other

    stat.ML cs.LG

    Sim and Real: Better Together

    Authors: Shirli Di Castro Shashua, Dotan Di Castro, Shie Mannor

    Abstract: Simulation is used extensively in autonomous systems, particularly in robotic manipulation. By far, the most common approach is to train a controller in simulation, and then use it as an initial starting point for the real system. We demonstrate how to learn simultaneously from both simulation and interaction with the real environment. We propose an algorithm for balancing the large number of samp… ▽ More

    Submitted 5 October, 2021; v1 submitted 1 October, 2021; originally announced October 2021.

  17. arXiv:2108.04706  [pdf, other

    cs.CV cs.RO

    BIDCD -- Bosch Industrial Depth Completion Dataset

    Authors: Adam Botach, Yuri Feldman, Yakov Miron, Yoel Shapiro, Dotan Di Castro

    Abstract: We introduce BIDCD -- the Bosch Industrial Depth Completion Dataset. BIDCD is a new RGBD dataset of metallic industrial objects, collected with a depth camera mounted on a robotic manipulator. The main purpose of this dataset is to facilitate the training of domain-specific depth completion models, to be used in logistics and manufacturing tasks. We trained a State-of-the-Art depth completion mode… ▽ More

    Submitted 4 October, 2021; v1 submitted 10 August, 2021; originally announced August 2021.

  18. arXiv:2107.12674  [pdf, other

    cs.CV cs.LG

    Vision-Guided Forecasting -- Visual Context for Multi-Horizon Time Series Forecasting

    Authors: Eitan Kosman, Dotan Di Castro

    Abstract: Autonomous driving gained huge traction in recent years, due to its potential to change the way we commute. Much effort has been put into trying to estimate the state of a vehicle. Meanwhile, learning to forecast the state of a vehicle ahead introduces new capabilities, such as predicting dangerous situations. Moreover, forecasting brings new supervision opportunities by learning to predict richer… ▽ More

    Submitted 26 September, 2021; v1 submitted 27 July, 2021; originally announced July 2021.

    Comments: To be presented in the ROAD challenge & SRVU workshop (ICCV2021)

  19. arXiv:2104.14223  [pdf, other

    cs.RO cs.AI cs.LG

    InsertionNet -- A Scalable Solution for Insertion

    Authors: Oren Spector, Dotan Di Castro

    Abstract: Complicated assembly processes can be described as a sequence of two main activities: grasping and insertion. While general grasping solutions are common in industry, insertion is still only applicable to small subsets of problems, mainly ones involving simple shapes in fixed locations and in which the variations are not taken into consideration. Recently, RL approaches with prior knowledge (e.g.,… ▽ More

    Submitted 29 April, 2021; originally announced April 2021.

    Comments: Qualitative results can be found in our supplementary video on our website: https://sites.google.com/view/insertionnet/

  20. arXiv:2104.01646  [pdf, other

    cs.LG math.OC

    SOLO: Search Online, Learn Offline for Combinatorial Optimization Problems

    Authors: Joel Oren, Chana Ross, Maksym Lefarov, Felix Richter, Ayal Taitler, Zohar Feldman, Christian Daniel, Dotan Di Castro

    Abstract: We study combinatorial problems with real world applications such as machine scheduling, routing, and assignment. We propose a method that combines Reinforcement Learning (RL) and planning. This method can equally be applied to both the offline, as well as online, variants of the combinatorial problem, in which the problem components (e.g., jobs in scheduling problems) are not known in advance, bu… ▽ More

    Submitted 18 May, 2021; v1 submitted 4 April, 2021; originally announced April 2021.

  21. arXiv:2008.07861  [pdf, other

    cs.CV

    Depth Completion with RGB Prior

    Authors: Yuri Feldman, Yoel Shapiro, Dotan Di Castro

    Abstract: Depth cameras are a prominent perception system for robotics, especially when operating in natural unstructured environments. Industrial applications, however, typically involve reflective objects under harsh lighting conditions, a challenging scenario for depth cameras, as it induces numerous reflections and deflections, leading to loss of robustness and deteriorated accuracy. Here, we developed… ▽ More

    Submitted 18 August, 2020; originally announced August 2020.

    Comments: 17 pages, 4 figures

  22. arXiv:1908.08379  [pdf, other

    cs.LG stat.ML

    Practical Risk Measures in Reinforcement Learning

    Authors: Dotan Di Castro, Joel Oren, Shie Mannor

    Abstract: Practical application of Reinforcement Learning (RL) often involves risk considerations. We study a generalized approximation scheme for risk measures, based on Monte-Carlo simulations, where the risk measures need not necessarily be \emph{coherent}. We demonstrate that, even in simple problems, measures such as the variance of the reward-to-go do not capture the risk in a satisfactory manner. In… ▽ More

    Submitted 22 August, 2019; originally announced August 2019.

  23. arXiv:1607.01381  [pdf, other

    stat.ML cs.AI cs.IR

    One-Shot Session Recommendation Systems with Combinatorial Items

    Authors: Yahel David, Dotan Di Castro, Zohar Karnin

    Abstract: In recent years, content recommendation systems in large websites (or \emph{content providers}) capture an increased focus. While the type of content varies, e.g.\ movies, articles, music, advertisements, etc., the high level problem remains the same. Based on knowledge obtained so far on the user, recommend the most desired content. In this paper we present a method to handle the well known user-… ▽ More

    Submitted 5 July, 2016; originally announced July 2016.

  24. arXiv:1502.02259  [pdf, other

    stat.ML cs.LG

    Contextual Markov Decision Processes

    Authors: Assaf Hallak, Dotan Di Castro, Shie Mannor

    Abstract: We consider a planning problem where the dynamics and rewards of the environment depend on a hidden static parameter referred to as the context. The objective is to learn a strategy that maximizes the accumulated reward across all contexts. The new model, called Contextual Markov Decision Process (CMDP), can model a customer's behavior when interacting with a website (the learner). The customer's… ▽ More

    Submitted 8 February, 2015; originally announced February 2015.

  25. arXiv:1301.0104  [pdf, other

    cs.LG stat.ML

    Policy Evaluation with Variance Related Risk Criteria in Markov Decision Processes

    Authors: Aviv Tamar, Dotan Di Castro, Shie Mannor

    Abstract: In this paper we extend temporal difference policy evaluation algorithms to performance criteria that include the variance of the cumulative reward. Such criteria are useful for risk management, and are important in domains such as finance and process control. We propose both TD(0) and LSTD(lambda) variants with linear function approximation, prove their convergence, and demonstrate their utility… ▽ More

    Submitted 1 January, 2013; originally announced January 2013.

    Journal ref: JMLR Workshop and Conference Proceedings 28 (3): 495-503, 2013

  26. arXiv:1206.6404  [pdf

    cs.LG cs.CY math.OC stat.ML

    Policy Gradients with Variance Related Risk Criteria

    Authors: Dotan Di Castro, Aviv Tamar, Shie Mannor

    Abstract: Managing risk in dynamic decision problems is of cardinal importance in many fields such as finance and process control. The most common approach to defining risk is through various variance related criteria such as the Sharpe Ratio or the standard deviation adjusted reward. It is known that optimizing many of the variance related risk criteria is NP-hard. In this paper we devise a framework for l… ▽ More

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012)

  27. arXiv:1109.2296  [pdf, other

    cs.LG

    Bandits with an Edge

    Authors: Dotan Di Castro, Claudio Gentile, Shie Mannor

    Abstract: We consider a bandit problem over a graph where the rewards are not directly observed. Instead, the decision maker can compare two nodes and receive (stochastic) information pertaining to the difference in their value. The graph structure describes the set of possible comparisons. Consequently, comparing between two nodes that are relatively far requires estimating the difference between every pai… ▽ More

    Submitted 11 September, 2011; originally announced September 2011.

  28. arXiv:1105.2550   

    cs.LG

    A Maximal Large Deviation Inequality for Sub-Gaussian Variables

    Authors: Dotan Di Castro, Claudio Gentile, Shie Mannor

    Abstract: In this short note we prove a maximal concentration lemma for sub-Gaussian random variables stating that for independent sub-Gaussian random variables we have \[P<(\max_{1\le i\le N}S_{i}>ε>) \le\exp<(-\frac{1}{N^2}\sum_{i=1}^{N}\frac{ε^{2}}{2σ_{i}^{2}}>), \] where $S_i$ is the sum of $i$ zero mean independent sub-Gaussian random variables and $σ_i$ is the variance of the $i$th random variable.

    Submitted 25 July, 2011; v1 submitted 12 May, 2011; originally announced May 2011.

    Comments: This paper has been withdrawn by the authors due to a crucial error in the last sentence of the proof of Theorem 1: "we can take the infimum of the r.h.s. over s, which yields (1)." This statement is only true if a single value of s yields the supremum of (ε_i s - ρ_i(s)) simultaneously for every i

  29. arXiv:1005.0125  [pdf, ps, other

    cs.LG cs.AI

    Adaptive Bases for Reinforcement Learning

    Authors: Dotan Di Castro, Shie Mannor

    Abstract: We consider the problem of reinforcement learning using function approximation, where the approximating basis can change dynamically while interacting with the environment. A motivation for such an approach is maximizing the value function fitness to the problem faced. Three errors are considered: approximation square error, Bellman residual, and projected Bellman residual. Algorithms under the ac… ▽ More

    Submitted 2 May, 2010; originally announced May 2010.

  30. arXiv:0909.2934  [pdf, ps, other

    cs.LG cs.AI

    A Convergent Online Single Time Scale Actor Critic Algorithm

    Authors: D. Di Castro, R. Meir

    Abstract: Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their generality, good convergence properties, and possible biological relevance. In this paper, we introduce an online temporal difference based actor-critic algorithm which is proved to converge to a neighborhood of a local ma… ▽ More

    Submitted 16 September, 2009; originally announced September 2009.