Update templates

Dimitri Lozeve 2020-08-30 23:41:53 +02:00
parent 55b9e2523c
commit b0ca171973
34 changed files with 536 additions and 707 deletions

@@ -21,31 +21,25 @@
</head>
<body>
<article>
<header>
<nav>
<nav>
<a href="../">Home</a>
<a href="../projects.html">Projects</a>
<a href="../archive.html">Archive</a>
<a href="../contact.html">Contact</a>
</nav>
<a href="../archive.html">Posts</a>
<a href="../projects.html">Projects</a>
<a href="../contact.html">Contact</a>
</nav>
<h1 class="title">Quick Notes on Reinforcement Learning</h1>
<p class="byline">November 21, 2018</p>
</header>
</article>
<article>
<section class="header">
</section>
<section>
<h2 id="introduction">Introduction</h2>
<h1 class="title">Quick Notes on Reinforcement Learning</h1>
<p class="byline">November 21, 2018</p>
</header>
<section>
<h2 id="introduction">Introduction</h2>
<p>In this series of blog posts, I intend to write my notes as I go through Richard S. Sutton's excellent <em>Reinforcement Learning: An Introduction</em> <a href="#ref-1">(1)</a>.</p>
<p>I will try to formalise the maths behind it a little, mainly because I would like to use these notes as a personal reference for the main concepts in RL. I will probably add a few remarks about possible implementations as I go.</p>
<h2 id="relationship-between-agent-and-environment">Relationship between agent and environment</h2>
@@ -133,10 +127,11 @@ q_{\pi}(s,a) &amp;= \mathbb{E}_{\pi}\left[ \sum_{k=0}^{\infty} \gamma^k R_{t+k+1
<ol>
<li><span id="ref-1"></span>R. S. Sutton and A. G. Barto, Reinforcement learning: an introduction, Second edition. Cambridge, MA: The MIT Press, 2018.</li>
</ol>
</section>
</article>
</section>
</article>
<footer>
Site proudly generated by
<a href="http://jaspervdj.be/hakyll">Hakyll</a>