Fast discovery of influential outcomes for risk-aware MPDM

Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), 2017

PDF thumbnail
(PDF, 1.5 MB )

Abstract

In the Multi-Policy Decision Making (MPDM) framework, a robot’s policy is elected by sampling from the distribution of current states, predicting future outcomes through forward simulation, and selecting the policy with the best expected performance. Electing the best plan depends on sampling initial conditions with influential (very high costs) outcomes. Discovering these configurations through random sampling may require drawing many samples, which becomes a performance bottleneck.

In this paper, we describe a risk-aware approach which augments this sampling with an optimization process that helps discover those influential outcomes. We describe how we overcome several practical difficulties with this approach, and demonstrate significant performance improvements on a real robot platform navigating a semi-crowded, highly dynamic environment.

Media

Fast discovery of influential outcomes for risk-aware MPDM
Movie 8.6 MB


bibtex

@inproceedings{mehta2017icra,
    TITLE      = {Fast discovery of influential outcomes for risk-aware {MPDM}},
    AUTHOR     = {Dhanvin Mehta and Gonzalo Ferrer and Edwin Olson},
    BOOKTITLE  = {Proceedings of the {IEEE} International Conference on Robotics and
                 Automation ({ICRA})},
    MONTH      = {May},
    YEAR       = {2017},
}