I’m trying to start and finish a project within the next month. It involves a lot of thinking and programming: I’m trying to solve a factored MDP with a large action space. It’s challenging because the actions can’t be factored, which makes it almost a flat MDP.
I’m hoping that the results will be easily interpretable. There is no point in finding an optimal solution if I can’t explain it :-)
I’ll try to report my progress as I go.