Directory Structure ------ . └── Search-based Planning ├── bfs.py # breadth-first searching ├── dfs.py # depth-first searching ├── dijkstra.py # dijkstra's algorithm └── a_star.py # a* algorithm └── Stochastic Shortest Path ├── value_iteration.py # value iteration ├── policy_iteration.py # policy iteration ├── Q-value_iteration.py # Q-value iteration └── Q-policy_iteration.py # Q-policy iteration └── Sampling-based Planning ├── Sarsa.py # SARSA : on-policy TD control └── Q-learning.py # Q-learning : off-policy TD control └── Model-free Control