No description

yue qi 3c37d35dec A*3d 5 years ago
.idea 022d9f4727 update 5 years ago
Model-free Control 5f8ba1601f regulate format 5 years ago
Sampling-based Planning b25bfec8d3 A*changed 5 years ago
Search-based Planning 3c37d35dec A*3d 5 years ago
Stochastic Shortest Path 5f8ba1601f regulate format 5 years ago
LICENSE ae02c14de5 Create LICENSE 5 years ago
README.md 2e53436228 Update README.md 5 years ago
nano.save bc92b39c90 update path 5 years ago
nano.save.1 bc92b39c90 update path 5 years ago

README.md

Directory Structure

.
├── Search-based Planning
│   ├── bfs.py                                  # breadth-first
│   ├── dfs.py                                  # depth-first
│   ├── dijkstra.py                             # dijkstra
│   ├── a_star.py                               # a*
│   ├── ara_star.py                             # ara*
│   ├── queue.py                                # FIFO, FILO, Priority queues
│   ├── env.py                                  # environment: grid world, motions
│   └── plotting.py                             # animation
├── Stochastic Shortest Path
│   ├── value_iteration.py                      # value iteration
│   ├── policy_iteration.py                     # policy iteration
│   ├── Q-value_iteration.py                    # Q-value iteration
│   └── Q-policy_iteration.py                   # Q-policy iteration
├── Model-free Control
│   ├── Sarsa.py                                # SARSA : on-policy TD control
│   └── Q-learning.py                           # Q-learning : off-policy TD control
└── Sampling-based Planning
    ├── rrt_2D
    │   ├── rrt.py                              # rrt : goal-biased rrt
    │   └── rrt_star.py
    └── rrt_3D
        ├── rrt3D.py                            # rrt3D : goal-biased rrt3D
        └── rrtstar3D.py

Animations

DFS & BFS (Dijkstra)

  • Blue: starting state
  • Green: goal state
[Animations: dfs, bfs]
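
The two searches differ only in how the frontier is ordered: a FIFO queue gives breadth-first search, a FILO stack gives depth-first search. The following is a minimal sketch of that idea, not the code in `bfs.py` or `dfs.py`. The function name `grid_search`, the `fifo` flag, and the grid layout (a list of rows where `1` marks an obstacle, 4-connected moves) are assumptions made for illustration.

```python
from collections import deque


def grid_search(grid, start, goal, fifo=True):
    """Search a 4-connected grid; fifo=True gives BFS, fifo=False gives DFS.

    grid is a list of rows where 1 marks an obstacle (assumed layout).
    Returns the path from start to goal as a list of (row, col), or None.
    """
    frontier = deque([start])
    parent = {start: None}  # also serves as the visited set
    while frontier:
        # The only difference between BFS and DFS is which end we pop from.
        node = frontier.popleft() if fifo else frontier.pop()
        if node == goal:
            path = []
            while node is not None:
                path.append(node)
                node = parent[node]
            return path[::-1]
        r, c = node
        for dr, dc in ((-1, 0), (1, 0), (0, -1), (0, 1)):
            nxt = (r + dr, c + dc)
            inside = 0 <= nxt[0] < len(grid) and 0 <= nxt[1] < len(grid[0])
            if inside and grid[nxt[0]][nxt[1]] == 0 and nxt not in parent:
                parent[nxt] = node
                frontier.append(nxt)
    return None
```

With unit step costs the BFS path is a shortest path, while the DFS path is only guaranteed to be valid, not short.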

A* and A* Variants

[Animations: astar, biastar]
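
A* expands nodes in order of f = g + h, where g is the cost so far and h is a heuristic estimate of the remaining cost. The sketch below is one possible minimal version, assuming a 4-connected grid with unit step cost and a Manhattan-distance heuristic. The grid layout (`1` marks an obstacle) and the function name are illustrative assumptions and are not taken from `a_star.py`.

```python
import heapq


def a_star(grid, start, goal):
    """A* on a 4-connected grid with unit step cost (assumed setup).

    Returns the path as a list of (row, col), or None if goal is unreachable.
    """
    def h(p):
        # Manhattan distance: admissible for 4-connected unit-cost moves.
        return abs(p[0] - goal[0]) + abs(p[1] - goal[1])

    open_heap = [(h(start), 0, start)]  # entries are (f, g, node)
    g_cost = {start: 0}
    parent = {start: None}
    while open_heap:
        _, g, node = heapq.heappop(open_heap)
        if node == goal:
            path = []
            while node is not None:
                path.append(node)
                node = parent[node]
            return path[::-1]
        if g > g_cost[node]:
            continue  # stale entry: a cheaper route to this node was found
        r, c = node
        for dr, dc in ((-1, 0), (1, 0), (0, -1), (0, 1)):
            nxt = (r + dr, c + dc)
            if not (0 <= nxt[0] < len(grid) and 0 <= nxt[1] < len(grid[0])):
                continue
            if grid[nxt[0]][nxt[1]] == 1:
                continue
            new_g = g + 1
            if new_g < g_cost.get(nxt, float("inf")):
                g_cost[nxt] = new_g
                parent[nxt] = node
                heapq.heappush(open_heap, (new_g + h(nxt), new_g, nxt))
    return None
```

Replacing `h` with a function that always returns 0 turns this sketch into Dijkstra's algorithm.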

Value/Policy/Q-value/Q-policy Iteration

  • Brown: losing states
[Animations: value iteration]
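
Value iteration repeatedly applies the Bellman optimality backup until the values stop changing, then reads off a greedy policy. The sketch below assumes a hypothetical transition format, `P[s][a] = [(prob, next_state, reward), ...]`, with terminal states (such as winning or losing states) mapped to an empty dict. That format, the discount `gamma`, and the tolerance `tol` are illustrative choices, not the interface of `value_iteration.py`.

```python
def value_iteration(P, gamma=0.9, tol=1e-8):
    """Value iteration on a small MDP (hypothetical input format).

    P[s][a] is a list of (prob, next_state, reward) tuples; terminal
    states map to an empty dict. Returns (V, greedy_policy).
    """
    def q_value(V, outcomes):
        # Expected one-step return of an action under the current values.
        return sum(p * (r + gamma * V[s2]) for p, s2, r in outcomes)

    V = {s: 0.0 for s in P}
    while True:
        delta = 0.0
        for s, actions in P.items():
            if not actions:
                continue  # terminal state keeps value 0
            best = max(q_value(V, outcomes) for outcomes in actions.values())
            delta = max(delta, abs(best - V[s]))
            V[s] = best  # in-place update
        if delta < tol:
            break
    policy = {
        s: max(actions, key=lambda a: q_value(V, actions[a]))
        for s, actions in P.items() if actions
    }
    return V, policy
```

As a usage example, a two-step chain where `right` leads toward a winning state (reward 1) and `down` leads to a losing state (reward -1) yields the policy `right` in both non-terminal states.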

SARSA (on-policy) & Q-learning (off-policy)

  • Brown: losing states
[Animations: value iteration]
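
The on-policy versus off-policy distinction shows up in the bootstrap target of the TD update. The following is a minimal sketch of the two update rules, assuming `Q` is a nested dict `Q[state][action]`. The function names, the step size `alpha`, and the discount `gamma` are illustrative and are not taken from `Sarsa.py` or `Q-learning.py`.

```python
def sarsa_update(Q, s, a, r, s2, a2, alpha=0.1, gamma=0.9):
    """On-policy TD update: bootstrap from the action a2 actually chosen in s2."""
    target = r + gamma * Q[s2][a2]
    Q[s][a] += alpha * (target - Q[s][a])


def q_learning_update(Q, s, a, r, s2, alpha=0.1, gamma=0.9):
    """Off-policy TD update: bootstrap from the greedy action in s2,
    regardless of which action the behaviour policy takes next."""
    target = r + gamma * max(Q[s2].values())
    Q[s][a] += alpha * (target - Q[s][a])
```

For terminal next states, this sketch assumes all entries of `Q[s2]` are kept at 0 so that the target reduces to the reward.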

License

MIT License