inconsistency fix for stochastic deep sea #33

cgao3 · 2020-10-22T19:08:59Z

The description of stochastic sea environment says "adds N(0,1) noise to the end of states of the chain", but in line 125, noisy reward were only added when "column" is either 0 or "_size -1".
The description of stochastic sea environment says "act right with 1 - 1/N moves agent to right", but in line 121, i.e., when agent is at cell "(_size-1, _size-1)" and act right, there is no such stochasticity.

This pull request fixes these two inconsistency issues.
Without this pull request fix, expected value under optimal policy is more complicated; with these fixes,
expected value for optimal policy is simply given by (1-1/N)^N0.99 + (-0.01 + E[norm(0,1)])(1-(1-1/N)^N)

google-cla · 2020-10-22T19:09:03Z

Thanks for your pull request. It looks like this may be your first contribution to a Google open source project (if not, look below for help). Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

📝 Please visit https://cla.developers.google.com/ to sign.

Once you've signed (or fixed any issues), please reply here with @googlebot I signed it! and we'll verify it.

What to do if you already signed the CLA

Individual signers

It's possible we don't have your GitHub username or you're using a different email address on your commit. Check your existing CLA data and verify that your email is set on your git commits.

Corporate signers

Your company has a Point of Contact who decides which employees are authorized to participate. Ask your POC to be added to the group of authorized contributors. If you don't know who your Point of Contact is, direct the Google project maintainer to go/cla#troubleshoot (Public version).
The email used to register you as an authorized contributor must be the email used for the Git commit. Check your existing CLA data and verify that your email is set on your git commits.
The email used to register you as an authorized contributor must also be attached to your GitHub account.

ℹ️ Googlers: Go here for more info.

cgao3 · 2020-10-22T19:12:20Z

@googlebot I signed it!

@googlebot I signed it!

inconsistency fix for stochastic deep sea

4076cd5

google-cla bot added the cla: no label Oct 22, 2020

google-cla bot added cla: yes and removed cla: no labels Oct 22, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

inconsistency fix for stochastic deep sea #33

inconsistency fix for stochastic deep sea #33

Uh oh!

cgao3 commented Oct 22, 2020

Uh oh!

google-cla bot commented Oct 22, 2020

Uh oh!

cgao3 commented Oct 22, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

inconsistency fix for stochastic deep sea #33

Are you sure you want to change the base?

inconsistency fix for stochastic deep sea #33

Uh oh!

Conversation

cgao3 commented Oct 22, 2020

Uh oh!

google-cla bot commented Oct 22, 2020

What to do if you already signed the CLA

Individual signers

Corporate signers

Uh oh!

cgao3 commented Oct 22, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant