<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Trung Huynh&#039;s tech blog</title>
	<atom:link href="http://www.trungh.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.trungh.com</link>
	<description>Even god made mistakes, please let me know what mistakes I have made</description>
	<lastBuildDate>Tue, 11 Oct 2011 14:20:51 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.3</generator>
		<item>
		<title>The “Michael Dell” Meeting</title>
		<link>http://www.trungh.com/2011/10/the-%e2%80%9cmichael-dell%e2%80%9d-meeting/</link>
		<comments>http://www.trungh.com/2011/10/the-%e2%80%9cmichael-dell%e2%80%9d-meeting/#comments</comments>
		<pubDate>Tue, 11 Oct 2011 11:55:20 +0000</pubDate>
		<dc:creator>trung</dc:creator>
				<category><![CDATA[post]]></category>
		<category><![CDATA[Business]]></category>
		<category><![CDATA[Work]]></category>

		<guid isPermaLink="false">http://www.trungh.com/2011/10/the-%e2%80%9cmichael-dell%e2%80%9d-meeting/</guid>
		<description><![CDATA[The below talk was given shortly after Steve Jobs returned in 1997 as Interim CEO, in response to Michael Dell’s suggestion in the press a few days previous that Apple should just shut down and return the cash to shareholders: And you know what? He’s right. The world doesn’t need another Dell or HP. It [...]]]></description>
			<content:encoded><![CDATA[<p>The below talk was given shortly after Steve Jobs returned in 1997 as Interim CEO, in response to  Michael Dell’s suggestion in the press a few days previous that Apple should just shut down and return the cash to shareholders:</p>
<blockquote><p>
And you know what? He’s right.</p>
<p>The world doesn’t need another Dell or HP.  It doesn’t need another manufacturer of plain, beige, boring PCs.  If that’s all we’re going to do, then we should really pack up now.
<p/>
<p>But we’re lucky, because Apple has a purpose.  Unlike anyone in the industry, people want us to make products that they love.  In fact, more than love.  Our job is to make products that people lust for.  That’s what Apple is meant to be.</p>
<p>What’s BMW’s market share of the auto market?  Does anyone know?  Well, it’s less than 2%, but no one cares.  Why?  Because either you drive a BMW or you stare at the new one driving by.  If we do our job, we’ll make products that people lust after, and no one will care about our market share.</p>
<p>Apple is a start-up.  Granted, it’s a startup with $6B in revenue, but that can and will go in an instant.  If you are here for a cushy 9-to-5 job, then that’s OK, but you should go.  We’re going to make sure everyone has stock options, and that they are oriented towards the long term.  If you need a big salary and bonus, then that’s OK, but you should go.  This isn’t going to be that place.  There are plenty of companies like that in the Valley.  This is going to be hard work, possibly the hardest you’ve ever done.  But if we do it right, it’s going to be worth it.
</p></blockquote>
<p> &#8211; Steve Jobs</p>
<p>The story is reblogged from <a href="http://blog.adamnash.com/2011/10/10/steve-jobs-bmw-ebay/">Adam Nash&#8217;s blog</a></p>]]></content:encoded>
			<wfw:commentRss>http://www.trungh.com/2011/10/the-%e2%80%9cmichael-dell%e2%80%9d-meeting/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>People think focus means saying yes to t&#8230;</title>
		<link>http://www.trungh.com/2011/10/focus-ability-to-say-no/</link>
		<comments>http://www.trungh.com/2011/10/focus-ability-to-say-no/#comments</comments>
		<pubDate>Wed, 05 Oct 2011 10:11:33 +0000</pubDate>
		<dc:creator>trung</dc:creator>
				<category><![CDATA[quote]]></category>
		<category><![CDATA[Work]]></category>

		<guid isPermaLink="false">http://www.trungh.com/2011/10/focus-ability-to-say-no/</guid>
		<description><![CDATA[People think focus means saying yes to the thing you’ve got to focus on. But that’s not what it means at all. It means saying no to the hundred other good ideas that there are. You have to pick carefully. Steve Jobs]]></description>
			<content:encoded><![CDATA[<p>People think focus means saying yes to the thing you’ve got to focus on. But that’s not what it means at all. It means saying no to the hundred other good ideas that there are. You have to pick carefully.</p>
<p><cite>Steve Jobs</cite></p>]]></content:encoded>
			<wfw:commentRss>http://www.trungh.com/2011/10/focus-ability-to-say-no/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Remembering that you are going to die is&#8230;</title>
		<link>http://www.trungh.com/2011/09/remembering-that-you-are-going-to-die-is-2/</link>
		<comments>http://www.trungh.com/2011/09/remembering-that-you-are-going-to-die-is-2/#comments</comments>
		<pubDate>Wed, 07 Sep 2011 20:25:23 +0000</pubDate>
		<dc:creator>trung</dc:creator>
				<category><![CDATA[quote]]></category>

		<guid isPermaLink="false">http://www.trungh.com/2011/09/remembering-that-you-are-going-to-die-is-2/</guid>
		<description><![CDATA[Remembering that you are going to die is the best way I know to avoid the trap of thinking you have something to lose. You are already naked. There is no reason not to follow your heart. Steve Jobs]]></description>
			<content:encoded><![CDATA[<p>Remembering that you are going to die is the best way I know to avoid the trap of thinking you have something to lose. You are already naked. There is no reason not to follow your heart.</p>
<p><cite>Steve Jobs</cite></p>]]></content:encoded>
			<wfw:commentRss>http://www.trungh.com/2011/09/remembering-that-you-are-going-to-die-is-2/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Success is the ability to go from one fa&#8230;</title>
		<link>http://www.trungh.com/2011/08/success-is-the-ability-to-go-from-one-fa/</link>
		<comments>http://www.trungh.com/2011/08/success-is-the-ability-to-go-from-one-fa/#comments</comments>
		<pubDate>Thu, 18 Aug 2011 10:03:56 +0000</pubDate>
		<dc:creator>trung</dc:creator>
				<category><![CDATA[quote]]></category>

		<guid isPermaLink="false">http://www.trungh.com/2011/08/success-is-the-ability-to-go-from-one-fa/</guid>
		<description><![CDATA[Success is the ability to go from one failure to another with no loss of enthusiasm. Winston Churchill]]></description>
			<content:encoded><![CDATA[<p>Success is the ability to go from one failure to another with no loss of enthusiasm.</p>
<p><cite>Winston Churchill</cite></p>]]></content:encoded>
			<wfw:commentRss>http://www.trungh.com/2011/08/success-is-the-ability-to-go-from-one-fa/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>THE FUTURE IS ALREADY HERE, IT’S JUST UN&#8230;</title>
		<link>http://www.trungh.com/2011/07/the-future-is-already-here-it%e2%80%99s-just-un/</link>
		<comments>http://www.trungh.com/2011/07/the-future-is-already-here-it%e2%80%99s-just-un/#comments</comments>
		<pubDate>Tue, 19 Jul 2011 10:46:39 +0000</pubDate>
		<dc:creator>trung</dc:creator>
				<category><![CDATA[quote]]></category>
		<category><![CDATA[Work]]></category>

		<guid isPermaLink="false">http://www.trungh.com/2011/07/the-future-is-already-here-it%e2%80%99s-just-un/</guid>
		<description><![CDATA[THE FUTURE IS ALREADY HERE, IT’S JUST UNEVENLY DISTRIBUTED WILLIAM GIBSON, 1994]]></description>
			<content:encoded><![CDATA[<p>THE FUTURE IS ALREADY HERE, IT’S JUST UNEVENLY DISTRIBUTED</p>
<p><cite>WILLIAM GIBSON, 1994</cite></p>]]></content:encoded>
			<wfw:commentRss>http://www.trungh.com/2011/07/the-future-is-already-here-it%e2%80%99s-just-un/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google vs Microsoft (search, google docs&#8230;</title>
		<link>http://www.trungh.com/2011/07/google-vs-microsoft-search-google-docs/</link>
		<comments>http://www.trungh.com/2011/07/google-vs-microsoft-search-google-docs/#comments</comments>
		<pubDate>Mon, 18 Jul 2011 10:00:08 +0000</pubDate>
		<dc:creator>trung</dc:creator>
				<category><![CDATA[status]]></category>
		<category><![CDATA[Business]]></category>
		<category><![CDATA[Start-up]]></category>
		<category><![CDATA[Work]]></category>

		<guid isPermaLink="false">http://www.trungh.com/2011/07/google-vs-microsoft-search-google-docs/</guid>
		<description><![CDATA[Google vs Microsoft (search, google docs, android, gmail) + Facebook (G+) + Twitter (Buzz) + Apple (Android) + Groupon (Google Deals) + Foursquare (Google Latitude) + Yahoo (Search, Google News, Google Talk) + BBC (Google News) + Vimeo (YouTube) +&#8230;, c&#8217;mon greedy Google, this is not your own world!]]></description>
			<content:encoded><![CDATA[<p>Google vs Microsoft (search, google docs, android, gmail) + Facebook (G+) + Twitter (Buzz) + Apple (Android) + Groupon (Google Deals) + Foursquare (Google Latitude) + Yahoo (Search, Google News, Google Talk) + BBC (Google News) + Vimeo (YouTube) +&#8230;, c&#8217;mon greedy Google, this is not your own world!</p>
<p><a href="http://www.trungh.com/wp-content/uploads/2011/07/tumblr_ljr6g0WWX21qfrwkj.jpg"><img src="http://www.trungh.com/wp-content/uploads/2011/07/tumblr_ljr6g0WWX21qfrwkj.jpg" alt="" title="tumblr_ljr6g0WWX21qfrwkj" width="286" height="176" class="alignnone size-full wp-image-315" /></a></p>]]></content:encoded>
			<wfw:commentRss>http://www.trungh.com/2011/07/google-vs-microsoft-search-google-docs/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>An Introduction to Sentiment Analysis</title>
		<link>http://www.trungh.com/2011/06/an-introduction-to-sentiment-analysis/</link>
		<comments>http://www.trungh.com/2011/06/an-introduction-to-sentiment-analysis/#comments</comments>
		<pubDate>Thu, 30 Jun 2011 09:42:24 +0000</pubDate>
		<dc:creator>trung</dc:creator>
				<category><![CDATA[post]]></category>
		<category><![CDATA[Opinion Mining]]></category>
		<category><![CDATA[Work]]></category>

		<guid isPermaLink="false">http://www.trungh.com/2011/06/an-introduction-to-sentiment-analysis/</guid>
		<description><![CDATA[]]></description>
			<content:encoded><![CDATA[<p><iframe src="http://portal.sliderocket.com:80/app/fullplayer.aspx?id=A6FC9082-0C6D-A294-AAEF-C695272A6D92" width="600" height="400" scrolling=no frameBorder="1" style="border:1px solid #333333;border-bottom-style:none"></iframe></p>]]></content:encoded>
			<wfw:commentRss>http://www.trungh.com/2011/06/an-introduction-to-sentiment-analysis/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>reassign user/group for sshfs mounted device</title>
		<link>http://www.trungh.com/2011/04/reassign-usergroup-for-sshfs-mounted-device/</link>
		<comments>http://www.trungh.com/2011/04/reassign-usergroup-for-sshfs-mounted-device/#comments</comments>
		<pubDate>Wed, 13 Apr 2011 10:16:45 +0000</pubDate>
		<dc:creator>trung</dc:creator>
				<category><![CDATA[post]]></category>
		<category><![CDATA[Linux]]></category>

		<guid isPermaLink="false">http://www.trungh.com/2011/04/reassign-usergroup-for-sshfs-mounted-device/</guid>
		<description><![CDATA[1sshfs -o allow_other,uid=&#123;new uid&#125;,gid=&#123;new gid&#125; &#123;remote address&#125; &#123;mounted address&#125;]]></description>
			<content:encoded><![CDATA[<div class="codecolorer-container bash default" style="overflow:auto;white-space:nowrap;border:1px solid #9F9F9F;width:435px;"><table cellspacing="0" cellpadding="0"><tbody><tr><td style="padding:5px;text-align:center;color:#888888;background-color:#EEEEEE;border-right: 1px solid #9F9F9F;font: normal 12px/1.4em Monaco, Lucida Console, monospace;"><div>1<br /></div></td><td><div class="bash codecolorer" style="padding:5px;font:normal 12px/1.4em Monaco, Lucida Console, monospace;white-space:nowrap">sshfs <span style="color: #660033;">-o</span> allow_other,<span style="color: #007800;">uid</span>=<span style="color: #7a0874; font-weight: bold;">&#123;</span>new uid<span style="color: #7a0874; font-weight: bold;">&#125;</span>,<span style="color: #007800;">gid</span>=<span style="color: #7a0874; font-weight: bold;">&#123;</span>new gid<span style="color: #7a0874; font-weight: bold;">&#125;</span> <span style="color: #7a0874; font-weight: bold;">&#123;</span>remote address<span style="color: #7a0874; font-weight: bold;">&#125;</span> <span style="color: #7a0874; font-weight: bold;">&#123;</span>mounted address<span style="color: #7a0874; font-weight: bold;">&#125;</span></div></td></tr></tbody></table></div>]]></content:encoded>
			<wfw:commentRss>http://www.trungh.com/2011/04/reassign-usergroup-for-sshfs-mounted-device/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>git/git-svn notes</title>
		<link>http://www.trungh.com/2011/03/gitgit-svn-notes-2/</link>
		<comments>http://www.trungh.com/2011/03/gitgit-svn-notes-2/#comments</comments>
		<pubDate>Wed, 23 Mar 2011 16:19:04 +0000</pubDate>
		<dc:creator>trung</dc:creator>
				<category><![CDATA[post]]></category>
		<category><![CDATA[Git]]></category>
		<category><![CDATA[Svn]]></category>

		<guid isPermaLink="false">http://www.trungh.com/2011/03/gitgit-svn-notes-2/</guid>
		<description><![CDATA[Untrack a file (works nice with git-svn): 1git update-index --assume-unchanged file_to_untrack]]></description>
			<content:encoded><![CDATA[<p>Untrack a file (works nice with git-svn):</p>
<div class="codecolorer-container bash default" style="overflow:auto;white-space:nowrap;border:1px solid #9F9F9F;width:435px;"><table cellspacing="0" cellpadding="0"><tbody><tr><td style="padding:5px;text-align:center;color:#888888;background-color:#EEEEEE;border-right: 1px solid #9F9F9F;font: normal 12px/1.4em Monaco, Lucida Console, monospace;"><div>1<br /></div></td><td><div class="bash codecolorer" style="padding:5px;font:normal 12px/1.4em Monaco, Lucida Console, monospace;white-space:nowrap"><span style="color: #c20cb9; font-weight: bold;">git</span> update-index <span style="color: #660033;">--assume-unchanged</span> file_to_untrack</div></td></tr></tbody></table></div>]]></content:encoded>
			<wfw:commentRss>http://www.trungh.com/2011/03/gitgit-svn-notes-2/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>An introduction to Q-learning</title>
		<link>http://www.trungh.com/2011/03/an-introduction-to-q-learing/</link>
		<comments>http://www.trungh.com/2011/03/an-introduction-to-q-learing/#comments</comments>
		<pubDate>Tue, 22 Mar 2011 18:05:35 +0000</pubDate>
		<dc:creator>trung</dc:creator>
				<category><![CDATA[post]]></category>
		<category><![CDATA[AI]]></category>

		<guid isPermaLink="false">http://www.trungh.com/2011/03/an-introduction-to-q-learing/</guid>
		<description><![CDATA[Q-learning [Watkins, 1989] is one of the most popular reinforcement learning methods. One of the advantages of Q-learning is its ability to compare the expected utility of the available actions without requiring a model of the environment. The basic content of Q-learning is inside the below equation: Where: is the Q-value at time , state [...]]]></description>
			<content:encoded><![CDATA[<p>Q-learning [Watkins, 1989] is one of the most popular reinforcement learning methods. One of the advantages of Q-learning is its ability to compare the expected utility of the available actions without requiring a model of the environment.</p>
<p>The basic content of Q-learning is inside the below equation:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=Q_%7Bt%2B1%7D%28a%2C%20s%29%3D%281-%5Calpha_%7Bt%7D%29Q_%7Bt%7D%28a%2Cs%29%2B%5Calpha_%7Bt%7D%5Br_%7Bt%7D%28s%29%2B%5Cgamma%5Cmax_%7Ba%5E%7B%27%7D%7D%7BQ_%7Bt%7D%28a%27%2Cs%27%29%7D%5D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Q_{t+1}(a, s)=(1-\alpha_{t})Q_{t}(a,s)+\alpha_{t}[r_{t}(s)+\gamma\max_{a^{&#039;}}{Q_{t}(a&#039;,s&#039;)}]' title='Q_{t+1}(a, s)=(1-\alpha_{t})Q_{t}(a,s)+\alpha_{t}[r_{t}(s)+\gamma\max_{a^{&#039;}}{Q_{t}(a&#039;,s&#039;)}]' class='latex' /></p>
<p>Where:</p>
<ul>
<li><img src='http://s.wordpress.com/latex.php?latex=Q_%7Bt%7D%28a%2Cs%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Q_{t}(a,s)' title='Q_{t}(a,s)' class='latex' /> is the Q-value at time <img src='http://s.wordpress.com/latex.php?latex=t&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='t' title='t' class='latex' />, state <img src='http://s.wordpress.com/latex.php?latex=s&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='s' title='s' class='latex' /> with action <img src='http://s.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' />.</li>
<li><img src='http://s.wordpress.com/latex.php?latex=r_%7Bt%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r_{t}' title='r_{t}' class='latex' /> is the reward.</li>
<li><img src='http://s.wordpress.com/latex.php?latex=%5Calpha&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\alpha' title='\alpha' class='latex' /> is the learning rate. The learning rate determines how fast and how important the new information is to be learned. If <img src='http://s.wordpress.com/latex.php?latex=%5Calpha&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\alpha' title='\alpha' class='latex' /> is 0, the agent does not learn anything. If <img src='http://s.wordpress.com/latex.php?latex=%5Calpha&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\alpha' title='\alpha' class='latex' /> is 1, only the new information is considered and all old information is discarded.</li>
<li><img src='http://s.wordpress.com/latex.php?latex=%5Cgamma&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\gamma' title='\gamma' class='latex' /> is the discount factor. The discount factor is in range [0..1] and is used to weight new term reinforcement more heavily than distant future reinforcement. The closer <img src='http://s.wordpress.com/latex.php?latex=%5Cgamma&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\gamma' title='\gamma' class='latex' /> is to 1, the greater the weight of future reinforcement.</li>
</ul>
<p>So what does the equation mean ? We now assume <img src='http://s.wordpress.com/latex.php?latex=%5Calpha%3D1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\alpha=1' title='\alpha=1' class='latex' /> and <img src='http://s.wordpress.com/latex.php?latex=%5Cgamma%3D1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\gamma=1' title='\gamma=1' class='latex' />, then the equation becomes:</p>
<p style="text-align: center; "><img src='http://s.wordpress.com/latex.php?latex=Q_%7Bt%2B1%7D%28a%2C%20s%29%3Dr_%7Bt%7D%28s%29%2Bmax_%7Ba%27%7D%7BQ_%7Bt%7D%28a%27%2Cs%27%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Q_{t+1}(a, s)=r_{t}(s)+max_{a&#039;}{Q_{t}(a&#039;,s&#039;)}' title='Q_{t+1}(a, s)=r_{t}(s)+max_{a&#039;}{Q_{t}(a&#039;,s&#039;)}' class='latex' /></p>
<p style="text-align: left;">It is now easy to see that the Q-value of state-action pair (<img src='http://s.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' />,<img src='http://s.wordpress.com/latex.php?latex=s&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='s' title='s' class='latex' />) is equal to the maximum Q-value of next state (for all next actions) adding the reward of action <img src='http://s.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' />. The learning method is obviously a dynamic algorithm that gives the optimal Q-value for state-action pairs.</p>
<p style="text-align: left;">When the discount factor is enabled (&lt;1),  it makes the reward reduced by time and hence the total reward at time <img src='http://s.wordpress.com/latex.php?latex=t&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='t' title='t' class='latex' /> is given by:</p>
<p style="text-align: center;"><img src='http://s.wordpress.com/latex.php?latex=R_%7Bt%7D%3Dr_%7Bt%7D%2B%5Cgamma%20r_%7Bt%2B1%7D%20%2B%20%5Cgamma%5E2%20r_%7Bt%2B2%7D%20%2B%20%5Cdots%20%2B%20%5Cgamma%5En%20r_%7Bt%2Bn%7D%20%2B%20%5Cdots&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='R_{t}=r_{t}+\gamma r_{t+1} + \gamma^2 r_{t+2} + \dots + \gamma^n r_{t+n} + \dots' title='R_{t}=r_{t}+\gamma r_{t+1} + \gamma^2 r_{t+2} + \dots + \gamma^n r_{t+n} + \dots' class='latex' /></p>
<p>The bellow java applet is a very good illustration of Q-learning (thank to Vander B. Frank):</p>
<p><applet code=BotQLearning.class width=600 height=300 archive="http://www.applied-mathematics.net/qlearning/BotQLearning.jar"> </applet> </p>
<p>For the detail of how the applet works, please reach the document of Vander B. Frank through this <a href="http://www.applied-mathematics.net/qlearning/qlearning.pdf">PDF</a>.</p>
<p><br/><br />
<strong>Bibliography</strong></p>
<p>1. Wikipedia: <em>Q-learning</em> [<a href="http://en.wikipedia.org/wiki/Q-learning">http://en.wikipedia.org/wiki/Q-learning</a>].<br />
2. Vander B. Frank: <em>Q-learning. <span style="font-style: normal;">IRIDIA, Universit Libre de Bruxelles. 7, 2003. [<a href="http://www.applied-mathematics.net/qlearning/qlearning.pdf">PDF</a>]</span></em><br />
<em><span style="font-style: normal;">3. Watkins, C.J.C.H. (1989). Learning from Delayed Rewards. PhD thesis, Cambridge University, Cambridge, England</span></em></p>]]></content:encoded>
			<wfw:commentRss>http://www.trungh.com/2011/03/an-introduction-to-q-learing/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

