Update templates
parent
55b9e2523c
commit
b0ca171973
34 changed files with 536 additions and 707 deletions
@@ -21,31 +21,25 @@
</head>
<body>
<article>
<header>
<nav>
<nav>
<a href="../">Home</a>
<a href="../projects.html">Projects</a>
<a href="../archive.html">Archive</a>
<a href="../contact.html">Contact</a>
</nav>
<a href="../archive.html">Posts</a>
<a href="../projects.html">Projects</a>
<a href="../contact.html">Contact</a>
</nav>
<h1 class="title">Quick Notes on Reinforcement Learning</h1>
<p class="byline">November 21, 2018</p>
</header>
</article>
<article>
<section class="header">
</section>
<section>
<h2 id="introduction">Introduction</h2>
<h1 class="title">Quick Notes on Reinforcement Learning</h1>
<p class="byline">November 21, 2018</p>
</header>
<section>
<h2 id="introduction">Introduction</h2>
<p>In this series of blog posts, I intend to write up my notes as I go through Richard S. Sutton and Andrew G. Barto’s excellent <em>Reinforcement Learning: An Introduction</em> <a href="#ref-1">(1)</a>.</p>
<p>I will try to formalise the maths behind it a little, mainly because I would like to use these notes as a personal reference for the main concepts in RL. I will probably add a few remarks about possible implementations as I go.</p>
<h2 id="relationship-between-agent-and-environment">Relationship between agent and environment</h2>
@@ -133,10 +127,11 @@ q_{\pi}(s,a) &= \mathbb{E}_{\pi}\left[ \sum_{k=0}^{\infty} \gamma^k R_{t+k+1
<ol>
<li><span id="ref-1"></span>R. S. Sutton and A. G. Barto, Reinforcement learning: an introduction, Second edition. Cambridge, MA: The MIT Press, 2018.</li>
</ol>
</section>
</article>
</section>
</article>
<footer>
Site proudly generated by
<a href="http://jaspervdj.be/hakyll">Hakyll</a>