-
Where to go next: Learning a Subgoal Recommendation Policy for Navigation Among Pedestrians arXiv.cs.RO Pub Date : 2021-02-25 Bruno Brito; Michael Everett; Jonathan P. How; Javier Alonso-Mora
Robotic navigation in environments shared with other robots or humans remains challenging because the intentions of the surrounding agents are not directly observable and the environment conditions are continuously changing. Local trajectory optimization methods, such as model predictive control (MPC), can deal with those changes but require global guidance, which is not trivial to obtain in crowded
-
LES: Locally Exploitative Sampling for Robot Path Planning arXiv.cs.RO Pub Date : 2021-02-25 Sagar Suhas Joshi; Seth Hutchinson; Panagiotis Tsiotras
Sampling-based algorithms solve the path planning problem by generating random samples in the search-space and incrementally growing a connectivity graph or a tree. Conventionally, the sampling strategy used in these algorithms is biased towards exploration to acquire information about the search-space. In contrast, this work proposes an optimization-based procedure that generates new samples to improve
-
Non-invasive Cognitive-level Human Interfacing for the Robotic Restoration of Reaching & Grasping arXiv.cs.RO Pub Date : 2021-02-25 Ali Shafti; A. Aldo Faisal
Assistive and Wearable Robotics have the potential to support humans with different types of motor impairments to become independent and fulfil their activities of daily living successfully. The success of these robot systems, however, relies on the ability to meaningfully decode human action intentions and carry them out appropriately. Neural interfaces have been explored for use in such system with
-
Structured Prediction for CRiSP Inverse Kinematics Learning with Misspecified Robot Models arXiv.cs.RO Pub Date : 2021-02-25 Gian Maria Marconi; Rafaello Camoriano; Lorenzo Rosasco; Carlo Ciliberto
With the recent advances in machine learning, problems that traditionally would require accurate modeling to be solved analytically can now be successfully approached with data-driven strategies. Among these, computing the inverse kinematics of a redundant robot arm poses a significant challenge due to the non-linear structure of the robot, the hard joint constraints and the non-invertible kinematics
-
Docking and Undocking a Modular Underactuated Oscillating Swimming Robot arXiv.cs.RO Pub Date : 2021-02-25 Gedaliah Knizhnik; Mark Yim
We describe a docking mechanism and strategy to allow modular self-assembly for the Modboat: an inexpensive underactuated oscillating swimming robot powered by a single motor. Because propulsion is achieved through oscillation, orientation can be controlled only in the average; this complicates docking, which requires precise position and orientation control. Given these challenges, we present a docking
-
$SE_2(3)$ based Extended Kalman Filter for Inertial-Integrated Navigation arXiv.cs.RO Pub Date : 2021-02-25 Yarong Luo; Chi Guo; Shenyong You; Jianlang Hu; Jingnan Liu
The error representation using the straight difference of two vectors in the inertial navigation system may not be reasonable as it does not take the direction difference into consideration. Therefore, we proposed to use the $SE_2(3)$ matrix Lie group to represent the state of the inertial-integrated navigation system which consequently leads to the common frame error representation. With the new velocity
-
CPG-ACTOR: Reinforcement Learning for Central Pattern Generators arXiv.cs.RO Pub Date : 2021-02-25 Luigi Campanaro; Siddhant Gangapurwala; Daniele De Martini; Wolfgang Merkt; Ioannis Havoutis
Central Pattern Generators (CPGs) have several properties desirable for locomotion: they generate smooth trajectories, are robust to perturbations and are simple to implement. Although conceptually promising, we argue that the full potential of CPGs has so far been limited by insufficient sensory-feedback information. This paper proposes a new methodology that allows tuning CPG controllers through
-
FAITH: Fast iterative half-plane focus of expansion estimation using event-based optic flow arXiv.cs.RO Pub Date : 2021-02-25 Raoul Dinaux; Nikhil Wessendorp; Julien Dupeyroux; Guido de Croon
Course estimation is a key component for the development of autonomous navigation systems for robots. While state-of-the-art methods widely use visual-based algorithms, it is worth noting that they all fail to deal with the complexity of the real world by being computationally greedy and sometimes too slow. They often require obstacles to be highly textured to improve the overall performance, particularly
-
Active Modular Environment for Robot Navigation arXiv.cs.RO Pub Date : 2021-02-25 Shota Kameyama; Keisuke Okumura; Yasumasa Tamura; Xavier Défago
This paper presents a novel robot-environment interaction in navigation tasks such that robots have neither a representation of their working space nor planning function, instead, an active environment takes charge of these aspects. This is realized by spatially deploying computing units, called cells, and making cells manage traffic in their respective physical region. Different from stigmegic approaches
-
Design and Control of a Highly Redundant Rigid-Flexible Coupling Robot to Assist the COVID-19 Oropharyngeal-Swab Sampling arXiv.cs.RO Pub Date : 2021-02-25 Yingbai HuChair of Robotics, Artificial Intelligence and Real-time Systems, Technische Universit München, München, GermanyShenzhen Institute of Artificial Intelligence and Robotics for Society, China; Jian LiRobotics and Intelligent Manufacturing & School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen, ChinaShenzhen Institute of Artificial Intelligence and Robotics for Society
The outbreak of novel coronavirus pneumonia (COVID-19) has caused mortality and morbidity worldwide. Oropharyngeal-swab (OP-swab) sampling is widely used for the diagnosis of COVID-19 in the world. To avoid the clinical staff from being affected by the virus, we developed a 9-degree-of-freedom (DOF) rigid-flexible coupling (RFC) robot to assist the COVID-19 OP-swab sampling. This robot is composed
-
Defining Preferred and Natural Robot Motions in Immersive Telepresence from a First-Person Perspective arXiv.cs.RO Pub Date : 2021-02-25 Katherine J. Mimnaugh; Markku Suomalainen; Israel Becerra; Eliezer Lozano; Rafael Murrieta-Cid; Steven M. LaValle
This paper presents some early work and future plans regarding how the autonomous motions of a telepresence robot affect a person embodied in the robot through a head-mounted display. We consider the preferences, comfort, and the perceived naturalness of aspects of piecewise linear paths compared to the same aspects on a smooth path. In a user study, thirty-six subjects (eighteen females) watched panoramic
-
A Simulation-based End-to-End Learning Framework for Evidential Occupancy Grid Mapping arXiv.cs.RO Pub Date : 2021-02-25 Raphael van Kempen; Bastian Lampe; Timo Woopen; Lutz Eckstein
Evidential occupancy grid maps (OGMs) are a popular representation of the environment of automated vehicles. Inverse sensor models (ISMs) are used to compute OGMs from sensor data such as lidar point clouds. Geometric ISMs show a limited performance when estimating states in unobserved but inferable areas and have difficulties dealing with ambiguous input. Deep learning-based ISMs face the challenge
-
Strapdown Inertial Navigation System Initial Alignment based on Group of Double Direct Spatial Isometries arXiv.cs.RO Pub Date : 2021-02-25 Lubin Chang; Fangjun Qin; Jiangning Xu
The task of strapdown inertial navigation system (SINS) initial alignment is to calculate the attitude transformation matrix from body frame to navigation frame. In this paper, such attitude transformation matrix is divided into two parts through introducing the initial inertially fixed navigation frame as inertial frame. The attitude changes of the navigation frame corresponding to the defined inertial
-
Ensuring Progress for Multiple Mobile Robots via Space Partitioning, Motion Rules, and Adaptively Centralized Conflict Resolution arXiv.cs.RO Pub Date : 2021-02-25 Claire LiangCornell University Department of Computer Science; Wil ThomasonCornell University Department of Computer Science; Elizabeth RicciCornell University Department of Computer Science; Soham SankaranCornell University Department of Computer SciencePashi Corp
In environments where multiple robots must coordinate in a shared space, decentralized approaches allow for decoupled planning at the cost of global guarantees, while centralized approaches make the opposite trade-off. These solutions make a range of assumptions - commonly, that all the robots share the same planning strategies. In this work, we present a framework that ensures progress for all robots
-
Real-Time Ellipse Detection for Robotics Applications arXiv.cs.RO Pub Date : 2021-02-25 Azarakhsh Keipour; Guilherme A. S. Pereira; Sebastian Scherer
We propose a new algorithm for real-time detection and tracking of elliptic patterns suitable for real-world robotics applications. The method fits ellipses to each contour in the image frame and rejects ellipses that do not yield a good fit. It can detect complete, partial, and imperfect ellipses in extreme weather and lighting conditions and is lightweight enough to be used on robots' resource-limited
-
Imitation Learning for Robust and Safe Real-time Motion Planning: A Contraction Theory Approach arXiv.cs.RO Pub Date : 2021-02-25 Hiroyasu Tsukamoto; Soon-Jo Chung
This paper presents Learning-based Autonomous Guidance with Robustness, Optimality, and Safety guarantees (LAG-ROS), a real-time robust motion planning algorithm for safety-critical nonlinear systems perturbed by bounded disturbances. The LAG-ROS method consists of three phases: 1) Control Lyapunov Function (CLF) construction via contraction theory; 2) imitation learning of the CLF-based robust feedback
-
Learning Inverse Kinodynamics for Accurate High-Speed Off-Road Navigation on Unstructured Terrain arXiv.cs.RO Pub Date : 2021-02-25 Xuesu Xiao; Joydeep Biswas; Peter Stone
This paper presents a learning-based approach to consider the effect of unobservable world states in kinodynamic motion planning in order to enable accurate high-speed off-road navigation on unstructured terrain. Existing kinodynamic motion planners either operate in structured and homogeneous environments and thus do not need to explicitly account for terrain-vehicle interaction, or assume a set of
-
Adapting legacy robotic machinery to industry 4: a ciot experiment version 1 arXiv.cs.RO Pub Date : 2021-02-25 Hadi Alasti
This paper presents an experimental adaptation of a non-collaborative robot arm to collaborate with the environment, as one step towards adapting legacy robotic machinery to fit in industry 4.0 requirements. A cloud-based internet of things (CIoT) service is employed to connect, supervise and control a robotic arm's motion using the added wireless sensing devices to the environment. A programmable
-
Theory and Analysis of Optimal Planning over Long and Infinite Horizons for Achieving Independent Partially-Observable Tasks that Evolve over Time arXiv.cs.RO Pub Date : 2021-02-25 Anahita Mohseni-Kabir; Manuela Veloso; Maxim Likhachev
We present the theoretical analysis and proofs of a recently developed algorithm that allows for optimal planning over long and infinite horizons for achieving multiple independent tasks that are partially observable and evolve over time.
-
Software Engineering for Robotic Systems:a systematic mapping study arXiv.cs.RO Pub Date : 2021-02-24 Marcela G. dos Santos; Fabio Petrillo
Robots are being applied in a vast range of fields, leading researchers and practitioners to write tasks more complex than in the past. The robot software complexity increases the difficulty of engineering the robot's software components with quality requirements. Researchers and practitioners have applied software engineering (SE) approaches and robotic domains to address this issue in the last two
-
Task-Agnostic Morphology Evolution arXiv.cs.RO Pub Date : 2021-02-25 Donald J. Hejna III; Pieter Abbeel; Lerrel Pinto
Deep reinforcement learning primarily focuses on learning behavior, usually overlooking the fact that an agent's function is largely determined by form. So, how should one go about finding a morphology fit for solving tasks in a given environment? Current approaches that co-adapt morphology and behavior use a specific task's reward as a signal for morphology optimization. However, this often requires
-
The Catenary Robot: Design and Control of a Cable Propelled by Two Quadrotors arXiv.cs.RO Pub Date : 2021-02-24 Diego S. D'antonio; Gustavo A. Cardona; David Saldaña
Transporting objects using aerial robots has been widely studied in the literature. Still, those approaches always assume that the connection between the quadrotor and the load is made in a previous stage. However, that previous stage usually requires human intervention, and autonomous procedures to locate and attach the object are not considered. Additionally, most of the approaches assume cables
-
AutoPreview: A Framework for Autopilot Behavior Understanding arXiv.cs.RO Pub Date : 2021-02-25 Yuan Shen; Niviru Wijayaratne; Peter Du; Shanduojiao Jiang; Katherine Driggs Campbell
The behavior of self driving cars may differ from people expectations, (e.g. an autopilot may unexpectedly relinquish control). This expectation mismatch can cause potential and existing users to distrust self driving technology and can increase the likelihood of accidents. We propose a simple but effective framework, AutoPreview, to enable consumers to preview a target autopilot potential actions
-
Gaze-Informed Multi-Objective Imitation Learning from Human Demonstrations arXiv.cs.RO Pub Date : 2021-02-25 Ritwik Bera; Vinicius G. Goecks; Gregory M. Gremillion; Vernon J. Lawhern; John Valasek; Nicholas R. Waytowich
In the field of human-robot interaction, teaching learning agents from human demonstrations via supervised learning has been widely studied and successfully applied to multiple domains such as self-driving cars and robot manipulation. However, the majority of the work on learning from human demonstrations utilizes only behavioral information from the demonstrator, i.e. what actions were taken, and
-
Bias-reduced multi-step hindsight experience replay arXiv.cs.RO Pub Date : 2021-02-25 Rui Yang; Jiafei Lyu; Yu Yang; Jiangpeng Ya; Feng Luo; Dijun Luo; Lanqing Li; Xiu Li
Multi-goal reinforcement learning is widely used in planning and robot manipulation. Two main challenges in multi-goal reinforcement learning are sparse rewards and sample inefficiency. Hindsight Experience Replay (HER) aims to tackle the two challenges with hindsight knowledge. However, HER and its previous variants still need millions of samples and a huge computation. In this paper, we propose \emph{Multi-step
-
A Comprehensive Survey on the Multiple Travelling Salesman Problem: Applications, Approaches and Taxonomy arXiv.cs.RO Pub Date : 2021-02-25 Omar Cheikhrouhou; Ines Khoufi
The Multiple Travelling Salesman Problem (MTSP) is among the most interesting combinatorial optimization problems because it is widely adopted in real-life applications, including robotics, transportation, networking, etc. Although the importance of this optimization problem, there is no survey dedicated to reviewing recent MTSP contributions. In this paper, we aim to fill this gap by providing a comprehensive
-
Scene Retrieval for Contextual Visual Mapping arXiv.cs.RO Pub Date : 2021-02-25 William H. B. Smith; Michael Milford; Klaus D. McDonald-Maier; Shoaib Ehsan
Visual navigation localizes a query place image against a reference database of place images, also known as a `visual map'. Localization accuracy requirements for specific areas of the visual map, `scene classes', vary according to the context of the environment and task. State-of-the-art visual mapping is unable to reflect these requirements by explicitly targetting scene classes for inclusion in
-
Neuroevolution of a Recurrent Neural Network for Spatial and Working Memory in a Simulated Robotic Environment arXiv.cs.RO Pub Date : 2021-02-25 Xinyun Zou; Eric O. Scott; Alexander B. Johnson; Kexin Chen; Douglas A. Nitz; Kenneth A. De Jong; Jeffrey L. Krichmar
Animals ranging from rats to humans can demonstrate cognitive map capabilities. We evolved weights in a biologically plausible recurrent neural network (RNN) using an evolutionary algorithm to replicate the behavior and neural activity observed in rats during a spatial and working memory task in a triple T-maze. The rat was simulated in the Webots robot simulator and used vision, distance and accelerometer
-
The Logical Options Framework arXiv.cs.RO Pub Date : 2021-02-24 Brandon Araki; Xiao Li; Kiran Vodrahalli; Jonathan DeCastro; Micah J. Fry; Daniela Rus
Learning composable policies for environments with complex rules and tasks is a challenging problem. We introduce a hierarchical reinforcement learning framework called the Logical Options Framework (LOF) that learns policies that are satisfying, optimal, and composable. LOF efficiently learns policies that satisfy tasks by representing the task as an automaton and integrating it into learning and
-
Deep Reinforcement Learning for Safe Landing Site Selection with Concurrent Consideration of Divert Maneuvers arXiv.cs.RO Pub Date : 2021-02-24 Keidai Iiyama; Kento Tomita; Bhavi A. Jagatia; Tatsuwaki Nakagawa; Koki Ho
This research proposes a new integrated framework for identifying safe landing locations and planning in-flight divert maneuvers. The state-of-the-art algorithms for landing zone selection utilize local terrain features such as slopes and roughness to judge the safety and priority of the landing point. However, when there are additional chances of observation and diverting in the future, these algorithms
-
R2LIVE: A Robust, Real-time, LiDAR-Inertial-Visual tightly-coupled state Estimator and mapping arXiv.cs.RO Pub Date : 2021-02-24 Jiarong Lin; Chunran Zheng; Wei Xu; Fu Zhang
In this letter, we propose a robust, real-time tightly-coupled multi-sensor fusion framework, which fuses measurement from LiDAR, inertial sensor, and visual camera to achieve robust and accurate state estimation. Our proposed framework is composed of two parts: the filter-based odometry and factor graph optimization. To guarantee real-time performance, we estimate the state within the framework of
-
Iterative Refinement for Real-Time Multi-Robot Path Planning arXiv.cs.RO Pub Date : 2021-02-24 Keisuke Okumura; Yasumasa Tamura; Xavier Defago
We study the iterative refinement of path planning for multiple robots, known as multi-agent pathfinding (MAPF). Given a graph, agents, their initial locations, and destinations, a solution of MAPF is a set of paths without collisions. Iterative refinement for MAPF is desirable for three reasons: 1) optimization is intractable, 2) sub-optimal solutions can be obtained instantly, and 3) it is anytime
-
Mobile Recharger Path Planning and Recharge Scheduling in a Multi-Robot Environment arXiv.cs.RO Pub Date : 2021-02-24 Tanmoy Kundu; Indranil Saha
In many multi-robot applications, mobile worker robots are often engaged in performing some tasks repetitively by following pre-computed trajectories. As these robots are battery-powered, they need to get recharged at regular intervals. We envision that in the future, a few mobile recharger robots will be employed to supply charge to the energy-deficient worker robots recurrently, to keep the overall
-
A Trident Quaternion Framework for Inertial-based Navigation Part II: Error Models and Application to Initial Alignment arXiv.cs.RO Pub Date : 2021-02-24 Wei Ouyang; Yuanxin Wu
This work deals with error models for trident quaternion framework proposed in the companion paper "A Trident Quaternion Framework for Inertial-based Navigation Part I: Motion Representation and Computation" and further uses them to investigate the static and in-motion alignment for land vehicles. Specifically, the zero-velocity and odometer velocity measurements are applied in the static and in-motion
-
A Trident Quaternion Framework for Inertial-based Navigation Part I: Rigid Motion Representation and Computation arXiv.cs.RO Pub Date : 2021-02-24 Wei Ouyang; Yuanxin Wu
Strapdown inertial navigation research involves the parameterization and computation of the attitude, velocity and position of a rigid body in a chosen reference frame. The community has long devoted to finding the most concise and efficient representation for the strapdown inertial navigation system (INS). The current work is motivated by simplifying the existing dual quaternion representation of
-
Learning to Shift Attention for Motion Generation arXiv.cs.RO Pub Date : 2021-02-24 You Zhou; Jianfeng Gao; Tamim Asfour
One challenge of motion generation using robot learning from demonstration techniques is that human demonstrations follow a distribution with multiple modes for one task query. Previous approaches fail to capture all modes or tend to average modes of the demonstrations and thus generate invalid trajectories. The other difficulty is the small number of demonstrations that cannot cover the entire working
-
Safe Learning-based Gradient-free Model Predictive Control Based on Cross-entropy Method arXiv.cs.RO Pub Date : 2021-02-24 Lei Zheng; Rui Yang; Zhixuan Wu; Jiesen Panb; Hui Cheng
In this paper, a safe and learning-based control framework for model predictive control (MPC) is proposed to optimize nonlinear systems with a gradient-free objective function under uncertain environmental disturbances. The control framework integrates a learning-based MPC with an auxiliary controller in a way of minimal intervention. The learning-based MPC augments the prior nominal model with incremental
-
Spatio-Temporal Look-Ahead Trajectory Prediction using Memory Neural Network arXiv.cs.RO Pub Date : 2021-02-24 Nishanth Rao; Suresh Sundaram
Prognostication of vehicle trajectories in unknown environments is intrinsically a challenging and difficult problem to solve. The behavior of such vehicles is highly influenced by surrounding traffic, road conditions, and rogue participants present in the environment. Moreover, the presence of pedestrians, traffic lights, stop signs, etc., makes it much harder to infer the behavior of various traffic
-
Towards Optimized Distributed Multi-Robot Printing: An Algorithmic Approach arXiv.cs.RO Pub Date : 2021-02-24 Kedar Karpe; Avinash Sinha; Shreyas Raorane; Ayon Chatterjee; Pranav Srinivas; Lorenzo Sabattini
This paper presents a distributed multi-robot printing method which utilizes an optimization approach to decompose and allocate a printing task to a group of mobile robots. The motivation for this problem is to minimize the printing time of the robots by using an appropriate task decomposition algorithm. We present one such algorithm which decomposes an image into rasterized geodesic cells before allocating
-
PixSet : An Opportunity for 3D Computer Vision to Go Beyond Point Clouds With a Full-Waveform LiDAR Dataset arXiv.cs.RO Pub Date : 2021-02-24 Jean-Luc Déziel; Pierre Merriaux; Francis Tremblay; Dave Lessard; Dominique Plourde; Julien Stanguennec; Pierre Goulet; Pierre Olivier
Leddar PixSet is a new publicly available dataset (dataset.leddartech.com) for autonomous driving research and development. One key novelty of this dataset is the presence of full-waveform data from the Leddar Pixell sensor, a solid-state flash LiDAR. Full-waveform data has been shown to improve the performance of perception algorithms in airborne applications but is yet to be demonstrated for terrestrial
-
4D Panoptic LiDAR Segmentation arXiv.cs.RO Pub Date : 2021-02-24 Mehmet Aygün; Aljoša Ošep; Mark Weber; Maxim Maximov; Cyrill Stachniss; Jens Behley; Laura Leal-Taixé
Temporal semantic scene understanding is critical for self-driving cars or robots operating in dynamic environments. In this paper, we propose 4D panoptic LiDAR segmentation to assign a semantic class and a temporally-consistent instance ID to a sequence of 3D points. To this end, we present an approach and a point-centric evaluation metric. Our approach determines a semantic class for every point
-
Memory-based Deep Reinforcement Learning for POMDP arXiv.cs.RO Pub Date : 2021-02-24 Lingheng Meng; Rob Gorbet; Dana Kulić
A promising characteristic of Deep Reinforcement Learning (DRL) is its capability to learn optimal policy in an end-to-end manner without relying on feature engineering. However, most approaches assume a fully observable state space, i.e. fully observable Markov Decision Process (MDP). In real-world robotics, this assumption is unpractical, because of the sensor issues such as sensors' capacity limitation
-
GEM: Glare or Gloom, I Can Still See You -- End-to-End Multimodal Object Detector arXiv.cs.RO Pub Date : 2021-02-24 Osama Mazhar; Jens Kober; Robert Babuska
Deep neural networks designed for vision tasks are often prone to failure when they encounter environmental conditions not covered by the training data. Efficient fusion strategies for multi-sensor configurations can enhance the robustness of the detection algorithms by exploiting redundancy from different sensor streams. In this paper, we propose sensor-aware multi-modal fusion strategies for 2D object
-
GDR-Net: Geometry-Guided Direct Regression Network for Monocular 6D Object Pose Estimation arXiv.cs.RO Pub Date : 2021-02-24 Gu Wang; Fabian Manhardt; Federico Tombari; Xiangyang Ji
6D pose estimation from a single RGB image is a fundamental task in computer vision. The current top-performing deep learning-based methods rely on an indirect strategy, i.e., first establishing 2D-3D correspondences between the coordinates in the image plane and object coordinate system, and then applying a variant of the P$n$P/RANSAC algorithm. However, this two-stage pipeline is not end-to-end trainable
-
Contingency Model Predictive Control for Linear Time-Varying Systems arXiv.cs.RO Pub Date : 2021-02-24 John P. Alsterda; J. Christian Gerdes
We present Contingency Model Predictive Control (CMPC), a motion planning and control framework that optimizes performance objectives while simultaneously maintaining a contingency plan -- an alternate trajectory that avoids a potential hazard. By preserving the existence of a feasible avoidance trajectory, CMPC anticipates emergency and keeps the controlled system in a safe state that is selectively
-
Annotating Motion Primitives for Simplifying Action Search in Reinforcement Learning arXiv.cs.RO Pub Date : 2021-02-24 Isaac J. Sledge; Darshan W. Bryner; Jose C. Principe
Reinforcement learning in large-scale environments is challenging due to the many possible actions that can be taken in specific situations. We have previously developed a means of constraining, and hence speeding up, the search process through the use of motion primitives; motion primitives are sequences of pre-specified actions taken across a state series. As a byproduct of this work, we have found
-
Learning to Drop Points for LiDAR Scan Synthesis arXiv.cs.RO Pub Date : 2021-02-23 Kazuto Nakashima; Ryo Kurazume
Generative modeling of 3D scenes is a crucial topic for aiding mobile robots to improve unreliable observations. However, despite the rapid progress in the natural image domain, building generative models is still challenging for 3D data, such as point clouds. Most existing studies on point clouds have focused on small and uniform-density data. In contrast, 3D LiDAR point clouds widely used in mobile
-
State Augmented Constrained Reinforcement Learning: Overcoming the Limitations of Learning with Rewards arXiv.cs.RO Pub Date : 2021-02-23 Miguel Calvo-Fullana; Santiago Paternain; Luiz F. O. Chamon; Alejandro Ribeiro
Constrained reinforcement learning involves multiple rewards that must individually accumulate to given thresholds. In this class of problems, we show a simple example in which the desired optimal policy cannot be induced by any linear combination of rewards. Hence, there exist constrained reinforcement learning problems for which neither regularized nor classical primal-dual methods yield optimal
-
A Robotic Model of Hippocampal Reverse Replay for Reinforcement Learning arXiv.cs.RO Pub Date : 2021-02-23 Matthew T. Whelan; Tony J. Prescott; Eleni Vasilaki
Hippocampal reverse replay is thought to contribute to learning, and particularly reinforcement learning, in animals. We present a computational model of learning in the hippocampus that builds on a previous model of the hippocampal-striatal network viewed as implementing a three-factor reinforcement learning rule. To augment this model with hippocampal reverse replay, a novel policy gradient learning
-
Grounded Relational Inference: Domain Knowledge Driven Explainable Autonomous Driving arXiv.cs.RO Pub Date : 2021-02-23 Chen Tang; Nishan Srishankar; Sujitha Martin; Masayoshi Tomizuka
Explainability is essential for autonomous vehicles and other robotics systems interacting with humans and other objects during operation. Humans need to understand and anticipate the actions taken by the machines for trustful and safe cooperation. In this work, we aim to enable the explainability of an autonomous driving system at the design stage by incorporating expert domain knowledge into the
-
Good Actors can come in Smaller Sizes: A Case Study on the Value of Actor-Critic Asymmetry arXiv.cs.RO Pub Date : 2021-02-23 Siddharth Mysore; Bassel Mabsout; Renato Mancuso; Kate Saenko
Actors and critics in actor-critic reinforcement learning algorithms are functionally separate, yet they often use the same network architectures. This case study explores the performance impact of network sizes when considering actor and critic architectures independently. By relaxing the assumption of architectural symmetry, it is often possible for smaller actors to achieve comparable policy performance
-
Interpretability in Contact-Rich Manipulation via Kinodynamic Images arXiv.cs.RO Pub Date : 2021-02-23 Ioanna Mitsioni; Joonatan Mänttäri; Yiannis Karayiannidis; John Folkesson; Danica Kragic
Deep Neural Networks (NNs) have been widely utilized in contact-rich manipulation tasks to model the complicated contact dynamics. However, NN-based models are often difficult to decipher which can lead to seemingly inexplicable behaviors and unidentifiable failure cases. In this work, we address the interpretability of NN-based models by introducing the kinodynamic images. We propose a methodology
-
Resilient Path Planning of UAVs against Covert Attacks on UWB Sensors arXiv.cs.RO Pub Date : 2021-02-23 Jiayi He; Xin Gong; Yukang Cui; Tingwen Huang
In this letter, a resilient path planning scheme is proposed to navigate a UAV to the planned (nominal) destination with minimum energy-consumption in the presence of a smart attacker. The UAV is equipped with two sensors, a GPS sensor, which is vulnerable to the spoofing attacker, and a well-functioning Ultra-Wideband (UWB) sensor, which is possible to be fooled. We show that a covert attacker can
-
Design and Integration of a Drone based Passive Manipulator for Capturing Flying Targets arXiv.cs.RO Pub Date : 2021-02-23 B. V. Vidyadhara; Lima Agnel Tony; Mohitvishnu S. Gadde; Shuvrangshu Jana; V. P. Varun; Aashay Anil Bhise; Suresh Sundaram; Debasish Ghose
In this paper, we present a novel passive single Degree-of-Freedom (DoF) manipulator design and its integration on an autonomous drone to capture a moving target. The end-effector is designed to be passive, to disengage the moving target from a flying UAV and capture it efficiently in the presence of disturbances, with minimal energy usage. It is also designed to handle target sway and the effect of
-
Mathematical Properties of Generalized Shape Expansion-Based Motion Planning Algorithms arXiv.cs.RO Pub Date : 2021-02-23 Adhvaith Ramkumar; Vrushabh Zinage; Satadal Ghosh
Motion planning is an essential aspect of autonomous systems and robotics and is an active area of research. A recently-proposed sampling-based motion planning algorithm, termed 'Generalized Shape Expansion' (GSE), has been shown to possess significant improvement in computational time over several existing well-established algorithms. The GSE has also been shown to be probabilistically complete. However
-
An Interaction-aware Evaluation Method for Highly Automated Vehicles arXiv.cs.RO Pub Date : 2021-02-23 Xinpeng Wang; Songan Zhang; Kuan-Hui Lee; Huei Peng
It is important to build a rigorous verification and validation (V&V) process to evaluate the safety of highly automated vehicles (HAVs) before their wide deployment on public roads. In this paper, we propose an interaction-aware framework for HAV safety evaluation which is suitable for some highly-interactive driving scenarios including highway merging, roundabout entering, etc. Contrary to existing
-
Smart Navigation for an In-pipe Robot Through Multi-phase Motion Control and Particle Filtering Method arXiv.cs.RO Pub Date : 2021-02-23 Saber Kazeminasab; Vahid Janfaza; Moein Razavi; M. Katherine Banks
In-pipe robots are promising solutions for condition assessment, leak detection, water quality monitoring in a variety of other tasks in pipeline networks. Smart navigation is an extremely challenging task for these robots as a result of highly uncertain and disturbing environment for operation. Wireless communication to control these robots during operation is not feasible if the pipe material is
-
SeqNet: Learning Descriptors for Sequence-based Hierarchical Place Recognition arXiv.cs.RO Pub Date : 2021-02-23 Sourav Garg; Michael Milford
Visual Place Recognition (VPR) is the task of matching current visual imagery from a camera to images stored in a reference map of the environment. While initial VPR systems used simple direct image methods or hand-crafted visual features, recent work has focused on learning more powerful visual features and further improving performance through either some form of sequential matcher / filter or a
-
ROAD: The ROad event Awareness Dataset for Autonomous Driving arXiv.cs.RO Pub Date : 2021-02-23 Gurkirt Singh; Stephen Akrigg; Manuele Di Maio; Valentina Fontana; Reza Javanmard Alitappeh; Suman Saha; Kossar Jeddisaravi; Farzad Yousefi; Jacob Culley; Tom Nicholson; Jordan Omokeowa; Salman Khan; Stanislao Grazioso; Andrew Bradley; Giuseppe Di Gironimo; Fabio Cuzzolin
Humans approach driving in a holistic fashion which entails, in particular, understanding road events and their evolution. Injecting these capabilities in an autonomous vehicle has thus the potential to take situational awareness and decision making closer to human-level performance. To this purpose, we introduce the ROad event Awareness Dataset (ROAD) for Autonomous Driving, to our knowledge the first
-
Explore the Context: Optimal Data Collection for Context-Conditional Dynamics Models arXiv.cs.RO Pub Date : 2021-02-22 Jan Achterhold; Joerg Stueckler
In this paper, we learn dynamics models for parametrized families of dynamical systems with varying properties. The dynamics models are formulated as stochastic processes conditioned on a latent context variable which is inferred from observed transitions of the respective system. The probabilistic formulation allows us to compute an action sequence which, for a limited number of environment interactions
Contents have been reproduced by permission of the publishers.