planning and acting in partially observable stochastic domains