Sen descrición

Huiming Zhou a335e148bf Update README.md %!s(int64=5) %!d(string=hai) anos
Model-free Control e477cb2e4c update %!s(int64=5) %!d(string=hai) anos
Search-based Planning eae82b5c46 update %!s(int64=5) %!d(string=hai) anos
Stochastic Shortest Path f8430b8273 update %!s(int64=5) %!d(string=hai) anos
README.md a335e148bf Update README.md %!s(int64=5) %!d(string=hai) anos

README.md

Directory Structure

.
└── Search-based Planning
    ├── bfs.py                          # breadth-first searching
    ├── dfs.py                          # depth-first searching
    ├── dijkstra.py                     # dijkstra's algorithm
    └── a_star.py                       # a* algorithm
└── Stochastic Shortest Path
    ├── value_iteration.py              # value iteration
    ├── policy_iteration.py             # policy iteration
    ├── Q-value_iteration.py            # Q-value iteration
    └── Q-policy_iteration.py           # Q-policy iteration
└── Sampling-based Planning
    ├── Sarsa.py                        # SARSA : on-policy TD control
    └── Q-learning.py                   # Q-learning : off-policy TD control
└── Model-free Control