Crafted vs. Learned Representations in Predictive Models - A Case Study on Cyclist Path Prediction

Journal article (2021)

Authors

E.A.I. Pool Intelligent Vehicles - Mechanical, Maritime and Materials Engineering

J.F.P. Kooij Intelligent Vehicles - Mechanical, Maritime and Materials Engineering

D. Gavrila Intelligent Vehicles - Mechanical, Maritime and Materials Engineering

Research Group

Intelligent Vehicles (Mechanical, Maritime and Materials Engineering) (TU Delft)

DOI: https://doi.org/10.1109/TIV.2021.3064253

Intelligent vehicles Dynamics Vulnerable road users (VRUs) Vehicle dynamics Context modeling Predictive models Roads Active safety Motion prediction Recurrent neural networks

To reference this document use:

http://resolver.tudelft.nl/uuid:b634c2db-227b-4b29-81ba-84c075a03bf8

More Info

expand_more

Published Date

2021

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Mechanical, Maritime and Materials Engineering

Department

Cognitive Robotics

Research Group

Intelligent Vehicles

Abstract

This paper compares two models for context-based path prediction of objects with switching dynamics: a Dynamic Bayesian Network (DBN) and a Recurrent Neural Network (RNN). These models are instances of two larger model categories, distinguished by whether expert knowledge is explicitly crafted into the state representation (and thus is interpretable) or whether the representation is learned from data, respectively. Both have shown state-of-the-art performance in previous work. In order to provide a fair comparison, we ensure that both models are treated similarly with respect to the use of context cues and parameter estimation. Specifically, we describe (1) how to integrate the context cues (used previously by the DBN) into the RNN, and (2) how to optimize the DBN with back-propagation similar to the RNN, while keeping an interpretable state representation. Experiments are performed on a scenario where a cyclist might turn left at an intersection in front of the ego-vehicle. Results show that the RNN successfully leverages the context cues, and that optimizing the DBN improves its performance with respect to existing work. While the RNN outperforms the optimized DBN in predictive log-likelihood by a significant margin, both models attain similar average Euclidean distance errors (23-39 cm for DBN and 31-34 cm for RNN, predicting 1 s ahead).

Files

09372805.pdf

(pdf | 0.912 Mb)

Unknown license

Download not available

Crafted_vs_Learned_Representat... (pdf)

(pdf | 1.95 Mb)

- Embargo expired in 08-09-2021

Unknown license