Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

update #1

Merged
merged 52 commits into from
Nov 8, 2023
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
52 commits
Select commit Hold shift + click to select a range
0a63915
New exploration seminar
Qwasser Apr 7, 2022
d665d37
fix q-learning file
Qwasser Apr 7, 2022
7f58abb
Add videos
Qwasser Apr 7, 2022
40c5f65
Clarify meaning of percentile in crossentropy_method
dniku Apr 10, 2022
ddcb71f
Added HW on continuous control (TD3 or SAC) (#478)
FortsAndMills Apr 17, 2022
cbc44b1
Fix formatting in README.md for Week 04 (#500)
AI-Ahmed Jul 3, 2022
1c938df
[week 06] Install dependencies for CartPole now that Colab no longer …
AI-Ahmed Sep 12, 2022
65977e8
[week 06] Install dependencies for CartPole now that Colab no longer …
AI-Ahmed Sep 12, 2022
fc71832
[week 06] download atari_wrappers automatically & install missing dep…
AI-Ahmed Sep 12, 2022
67e297a
[Week 06] Fix 'EntryPoints' object has no attribute 'get' (#509)
AI-Ahmed Oct 16, 2022
dfe8283
Add the video lecture of Pavel Shvechikov (#510)
AI-Ahmed Nov 12, 2022
f1d7764
Adding new blog (#504)
AI-Ahmed Nov 13, 2022
9d71e24
Downloading & Install `xvfb` for video game recording (#511)
AI-Ahmed Dec 4, 2022
df9c3cc
update week03 notebook - merge sem and hw notebooks, simplify, renew …
kharitonov-ivan Jan 31, 2023
f77ce53
update deps: gym -> gymnasium
kharitonov-ivan Jan 31, 2023
7a0f6c7
migrate from gym to gymnasium
kharitonov-ivan Feb 1, 2023
ee32d9f
update link
kharitonov-ivan Feb 1, 2023
011e75c
[Week 03] Improve grammar & rename binarizer->discretizer
dniku Feb 12, 2023
dcd0304
Merge pull request #517 from neer201/update-week03
kharitonov-ivan Feb 13, 2023
3e4b041
Fix typo (#518)
sashkent3 Feb 18, 2023
3f56b7b
fix regret calculation in week5 (#515)
Feb 19, 2023
9e0017f
Support MacOS in A2C homework multiprocessing (#520)
q0o0p Feb 22, 2023
9137d39
Support gym 0.26 in week6 seminar (#519)
q0o0p Feb 22, 2023
ec6576e
[Week 06] Simplify Colab initialization and do some cleanup in the no…
dniku Apr 10, 2023
7614100
fix: fix np float for current numpy version (#523)
dremovd Apr 26, 2023
c934d06
Update Roms.rar GDrive URL (#524)
dniku May 25, 2023
1fff643
move seminar interface to gymnasium, rename
laktionov Jun 25, 2023
818878b
move crossentropy_method to gymnasium
laktionov Jun 25, 2023
4a1c333
move crossentropy_method to gymnasium
laktionov Jun 25, 2023
5e8ee16
move deep_crossentropy_method to gymnasium
laktionov Jun 25, 2023
d80921a
update readme
laktionov Jun 25, 2023
f871ed2
done = terminated or truncated, apply formatter
laktionov Jun 29, 2023
70124e3
migrate to gymnasium
laktionov Jun 29, 2023
794fc4b
Merge pull request #526 from laktionov/week01_move_to_gymnasium
dniku Jul 1, 2023
59c7fa6
add context manager for recording, return s', change gym to gymnaasiu…
laktionov Jul 2, 2023
cebc0aa
apply formatter to notebooks
laktionov Jul 2, 2023
fe79654
add comment to play_and_record fun
laktionov Jul 2, 2023
262f540
Revert "add context manager for recording, return s', change gym to g…
laktionov Jul 2, 2023
788de21
Revert "add comment to play_and_record fun"
laktionov Jul 2, 2023
19bba0d
fix bug in seminar, return s', add context manager in video recording
laktionov Jul 2, 2023
e20a0cb
Merge pull request #527 from laktionov/week04_migrate_to_gymnasium
dniku Jul 2, 2023
55c5234
Fix: Update broken link in README.md and Notebook (#530)
umernaeem217 Aug 6, 2023
06be1bc
migrate to gymnasium with shimmy and fix bug in InverseDynamics (#532)
laktionov Aug 10, 2023
f5cc7d1
[Week08] Migrate to gymnasium (#528)
laktionov Aug 13, 2023
c598c63
[Week06] migrate to gymnasium (#531)
laktionov Aug 13, 2023
6caa4ff
[Week 10] Migrate to gymnasium (#534)
laktionov Sep 21, 2023
56df1f7
[Week09] Migrate to gymnasium (#533)
laktionov Sep 25, 2023
fcd3655
[Week06] fix truncation processing (#536)
laktionov Oct 4, 2023
ff7d7cd
fix termination and truncation processing (#537)
laktionov Oct 10, 2023
c0cf1c7
remove graph_size attr in plot_graph()
recycletechno Nov 2, 2023
a04133e
Merge pull request #538 from recycletechno/mdp-graph-size-fix
kharitonov-ivan Nov 5, 2023
240c989
Change deprecated github repo (#539)
Khaymon Nov 5, 2023
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion docker/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -7,7 +7,7 @@ _tl;dr [dockerhub url](https://hub.docker.com/r/justheuristic/practical_rl/)_
We recommend you to use either native docker (recommended for linux) or kitematic(recommended for windows).
* Installing [kitematic](https://kitematic.com/), a simple interface to docker (all platforms)
* Pure docker: Guide for [windows](https://docs.docker.com/docker-for-windows/), [linux](https://docs.docker.com/engine/installation/), or [macOS](https://docs.docker.com/docker-for-mac/).
* If you want to use your GPU make sure you have [nvidia-docker](https://github.com/NVIDIA/nvidia-docker) and [NVidia driver](https://www.nvidia.com/en-us/drivers/unix/) + [CUDA 10.2](https://developer.nvidia.com/cuda-downloads) installed
* If you want to use your GPU make sure you have [nvidia-container-toolkit](https://github.com/NVIDIA/nvidia-container-toolkit) and [NVidia driver](https://www.nvidia.com/en-us/drivers/unix/) + [CUDA 10.2](https://developer.nvidia.com/cuda-downloads) installed

Below are the instructions for both approaches.

Expand Down
2 changes: 1 addition & 1 deletion setup_colab.sh
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,7 @@ wget -q https://raw.githubusercontent.com/yandexdataschool/Practical_RL/master/x

# Download & import Atari ROMs (Colab stopped bundling them around the beginning of June 2021)

gdown -q https://drive.google.com/uc?id=1dCLEJcJGDDV4l5ssoexP2TEOVuBfyh7D
gdown -q https://drive.google.com/uc?id=1c6_W2Fig92hm5FRIc2Mpc_ZZyr6o52lF

# Alternative download:
# wget -q http://www.atarimania.com/roms/Roms.rar
Expand Down
8 changes: 4 additions & 4 deletions week01_intro/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -11,7 +11,7 @@


## More materials:
* __[recommended]__ - awesome openai post about evolution strategies - [blog post](https://blog.openai.com/evolution-strategies/), [article](https://arxiv.org/abs/1703.03864)
* __[recommended]__ - awesome openai post about evolution strategies - [blog post](https://openai.com/research/evolution-strategies), [article](https://arxiv.org/abs/1703.03864)
* __[recommended]__ - formal explanation of crossentropy method in [general](https://people.smp.uq.edu.au/DirkKroese/ps/CEEncycl.pdf) and for [optimization](https://people.smp.uq.edu.au/DirkKroese/ps/CEopt.pdf)
* Deep learning course (if you want to learn in parallel) - https://github.com/yandexdataschool/HSE_deeplearning
* Video on genetic algorithms (english) - [video](https://www.youtube.com/watch?v=ejxfTy4lI6I)
Expand All @@ -23,10 +23,10 @@


## Practice assignment
Instant dive in: [__seminar_gym_interface__](https://colab.research.google.com/github/yandexdataschool/Practical_RL/blob/master/week01_intro/seminar_gym_interface.ipynb), [__crossentropy_method__](https://colab.research.google.com/github/yandexdataschool/Practical_RL/blob/master/week01_intro/crossentropy_method.ipynb),
Instant dive in: [__seminar_gymnasium_interface__](https://colab.research.google.com/github/yandexdataschool/Practical_RL/blob/master/week01_intro/seminar_gymnasium_interface.ipynb), [__crossentropy_method__](https://colab.research.google.com/github/yandexdataschool/Practical_RL/blob/master/week01_intro/crossentropy_method.ipynb),
[__deep_crossentropy_method__](https://colab.research.google.com/github/yandexdataschool/Practical_RL/blob/master/week01_intro/deep_crossentropy_method.ipynb)

* Open `gym_interface.ipynb` and follow instructions from there
* Open `seminar_gymnasium_interface.ipynb` and follow instructions from there
* After you're done there, proceed to `crossentropy_method.ipynb`
* You can find homework and bonus assignment descriptions at the end of that notebook.
* Note: so far it's enough to say `pip install gym` on top of any data-science-stuffed python, but we'd appreciate if you gradually switch to [full installation](https://github.com/openai/gym#installing-everything).
* Note: so far it's enough to say `pip install gymnasium` on top of any data-science-stuffed python, but we'd appreciate if you gradually switch to [full installation](https://github.com/Farama-Foundation/Gymnasium).
Loading