In this TDP we describe our framework for learning new behaviors from scratch. It has been the core feature to advance from 5th to 2nd place in RoboCup 2017. The content is based on a paper presented at the RoboCup Symposium 2017, but adds some new results achieved after publishing the paper.