<div dir="ltr"><div dir="ltr"><br></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Thu, Jul 18, 2019 at 11:58 AM John Clark &lt;<a href="mailto:johnkclark@gmail.com">johnkclark@gmail.com</a>&gt; wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><div dir="ltr"><div style="font-family:arial,helvetica,sans-serif"><span style="font-family:Arial,Helvetica,sans-serif">On Thu, Jul 18, 2019 at 10:43 AM Dave Sill &lt;<a href="mailto:sparge@gmail.com" target="_blank">sparge@gmail.com</a>&gt; wrote:</span><br></div></div><div class="gmail_quote"><div dir="ltr" class="gmail_attr"><br></div><br><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><div class="gmail_quote"><div><i></i></div><div><i><span class="gmail_default" style="font-family:arial,helvetica,sans-serif">&gt; </span>Pluribus isn't modifying its own code. When I said it'd say "I just pick the statistically best play", that was overly simplified. It's more like "I pick the statistically best play and continually look at my previous play and try different things and adjust the probabilities so I can do better next time".</i></div></div></div></blockquote><div><br></div><div><font size="4">Dave<span class="gmail_default" style="font-family:arial,helvetica,sans-serif">, a program is just code. If a program has changed its behavior then the code must have changed. If a human didn't change the code and the program received no new input from the outside world then by process of elimination it must have been the program itself that changed the code. 
And if that change resulted in it making more money playing poker then the program has become more intelligent.</span></font></div><div><br></div></div></div></blockquote><div><br></div><div>John, deep learning and reinforcement learning algos don't work the same way as classically programmed code. As Dave mentioned, the code for the algos absolutely does not change between iterations. The statistical model does. Weights within the model, which are data the code reads and updates, change based on feedback from each learning iteration, but the code itself remains untouched.</div></div></div>
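<div dir="ltr">To make the code-versus-weights distinction concrete, here is a minimal sketch. It is a toy three-action learner, not Pluribus's actual algorithm; the function names, payoffs, learning rate and step count are all made up for illustration. It snapshots the compiled bytecode of its functions before and after training, so you can check that the behavior changes while the code does not.</div>

```python
import random

# Toy illustration only: NOT how Pluribus works. All names and numbers
# here (choose, update, train, payoffs, lr, steps) are assumptions.

def choose(weights, rng):
    """Pick an action index with probability proportional to its weight."""
    return rng.choices(range(len(weights)), weights=weights, k=1)[0]

def update(weights, action, reward, lr=0.1):
    """Return new weights with the chosen action nudged toward the reward."""
    new = list(weights)
    new[action] += lr * (reward - new[action])
    new[action] = max(new[action], 0.01)  # keep every weight positive
    return new

def train(weights, payoffs, steps=2000, seed=0):
    """Play repeatedly; only the weights (data) are modified."""
    rng = random.Random(seed)
    for _ in range(steps):
        action = choose(weights, rng)
        reward = payoffs[action] + rng.gauss(0, 0.1)  # noisy feedback
        weights = update(weights, action, reward)
    return weights

# Snapshot the compiled bytecode before and after learning.
code_before = (choose.__code__.co_code, update.__code__.co_code)
initial = [0.5, 0.5, 0.5]
learned = train(initial, payoffs=[0.2, 0.8, 0.4])
code_after = (choose.__code__.co_code, update.__code__.co_code)

print("code unchanged:", code_before == code_after)
print("weights changed:", learned != initial)
print("learned weights:", [round(w, 2) for w in learned])
```

<div dir="ltr">After training, the learner favors the action with the highest average payoff, yet the bytecode comparison shows the functions are identical to what they were at the start. Everything that was learned lives in the list of weights.</div>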