<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>vetta project &#187; AIXI</title>
	<atom:link href="http://www.vetta.org/tag/aixi/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.vetta.org</link>
	<description></description>
	<lastBuildDate>Thu, 22 Jul 2010 19:13:53 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
		<item>
		<title>Halloween lecture online</title>
		<link>http://www.vetta.org/2009/11/halloween-lectur/</link>
		<comments>http://www.vetta.org/2009/11/halloween-lectur/#comments</comments>
		<pubDate>Sun, 01 Nov 2009 14:09:23 +0000</pubDate>
		<dc:creator>Shane Legg</dc:creator>
				<category><![CDATA[Research Review]]></category>
		<category><![CDATA[AGI]]></category>
		<category><![CDATA[AIXI]]></category>
		<category><![CDATA[Friendly AI]]></category>
		<category><![CDATA[intelligence]]></category>
		<category><![CDATA[Neuroscience]]></category>
		<category><![CDATA[Singularity]]></category>

		<guid isPermaLink="false">http://www.vetta.org/?p=721</guid>
		<description><![CDATA[My Halloween lecture has been uploaded to youtube. The basic outline is: * what is intelligence? * Solomonoff induction * Hutter&#8217;s AIXI * Monte Carlo AIXI (here&#8217;s the missing video of it playing pac-man) * universal intelligence measure * what &#8230; <a href="http://www.vetta.org/2009/11/halloween-lectur/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p><img src="http://www.vetta.org/VettaPics/ExtroBrit_pic.jpg" alt="" /></p>
<p>My Halloween lecture has been uploaded to youtube.  The basic outline is:</p>
<p>* what is intelligence?<br />
* Solomonoff induction<br />
* Hutter&#8217;s AIXI<br />
* Monte Carlo AIXI  (here&#8217;s the missing video of it <a href="http://www.vetta.org/video/AIXI_Pacman.wmv">playing pac-man</a>)<br />
* universal intelligence measure<br />
* what neuroscience can teach us about AGI design<br />
* early 2020&#8242;s: the Halloween scenario</p>
<p>You can get the <a href="http://www.vetta.org/documents/extrobrit_talk.pdf">slides here</a>.  I talked for 2 hours, so it&#8217;s broken up into many parts on youtube: <a href="http://www.youtube.com/user/KoanPhilosopher#p/u/11/MGfcy9RpqBY">Part 1</a> <a href="http://www.youtube.com/user/KoanPhilosopher#p/u/7/ZgarxJJ6noY">Part 2</a> <a href="http://www.youtube.com/user/KoanPhilosopher#p/u/10/n-Ry0TE_nRA">Part 3</a> <a href="http://www.youtube.com/user/KoanPhilosopher#p/u/9/ywUf75Q0_2U">Part 4</a> <a href="http://www.youtube.com/user/KoanPhilosopher#p/u/6/MQO_k5uOD0w">Part 5</a> <a href="http://www.youtube.com/user/KoanPhilosopher#p/u/5/WRaFyI5M96g">Part 6</a> <a href="http://www.youtube.com/user/KoanPhilosopher#p/u/4/f0qf5Iu0aLg">Part 7</a> <a href="http://www.youtube.com/user/KoanPhilosopher#p/u/3/o-UCGUipg34">Part 8</a> <a href="http://www.youtube.com/user/KoanPhilosopher#p/u/8/gPW7oojUCKs">Part 9</a> <a href="http://www.youtube.com/user/KoanPhilosopher#p/u/2/fe3c3YcQZng">Part 10</a> <a href="http://www.youtube.com/user/KoanPhilosopher#p/u/1/p7Aw_7sBRPc">Part 11</a> <a href="http://www.youtube.com/user/KoanPhilosopher#p/u/0/s7ZXLd5_1_0">Part 12</a></p>
<p>Thanks to David Wood at ExtroBritannian for organising this, and all the people who attended &#8212; especially those who travelled from other cities and countries, the intelligent questions during my talk, and all the positive feedback I&#8217;ve received since.  Thanks also to Anders Sandberg for the picture of me speaking that I stole from his <a href="http://www.flickr.com/photos/arenamontanus/">flicker stream</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.vetta.org/2009/11/halloween-lectur/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Monte Carlo AIXI</title>
		<link>http://www.vetta.org/2009/09/monte-carlo-aixi/</link>
		<comments>http://www.vetta.org/2009/09/monte-carlo-aixi/#comments</comments>
		<pubDate>Fri, 18 Sep 2009 20:55:34 +0000</pubDate>
		<dc:creator>Shane Legg</dc:creator>
				<category><![CDATA[Research Review]]></category>
		<category><![CDATA[AGI]]></category>
		<category><![CDATA[AIXI]]></category>
		<category><![CDATA[Monte Carlo]]></category>

		<guid isPermaLink="false">http://www.vetta.org/?p=635</guid>
		<description><![CDATA[While I was visiting Marcus Hutter at ANU a month or so ago, I got talking to one of his students, Joel Veness, who&#8217;s working on making computable approximations to AIXI. Joel has a background in writing Go algorithms so &#8230; <a href="http://www.vetta.org/2009/09/monte-carlo-aixi/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>While I was visiting Marcus Hutter at ANU a month or so ago, I got talking to one of his students, Joel Veness, who&#8217;s working on making computable approximations to AIXI.  Joel has a background in writing Go algorithms so is perhaps perfect for the job.  I saw recently that the <a href="http://arxiv.org/abs/0909.0801">Monte Carlo AIXI</a> paper describing this work is now available online if you want to check it out.</p>
<p>The basic idea goes as follows.  In full AIXI you have an extended Solomonoff predictor to model the environment, and an expecti-max tree to compute the optimal action.  In order to scale AIXI down and still have something of roughly the same form, you need to find a tractable way to replace both of these two items.  Here&#8217;s what they did: in the place of extended Solomonoff induction a version of context tree weighting (CTW) is used. CTW has to be extended for this application similar to the way Hutter had to extend Solomonoff induction to active environments for AIXI. In the place of the expecti-max tree search a Monte Carlo tree search is used, similar to that used in Go playing programs: initial selection within the tree, tree expansion, a so called play-out policy, followed by a backup stage to propagate the new information back into the model. You have to be a bit careful here because as the agent imagines different future observations and actions it has to update its hypothetical beliefs to reflect these in order for its analysis and decision making to be consistent. Then, once this possible future has been evaluated, the effect of this on the agent&#8217;s model of the world has to be unwound so that the agent doesn&#8217;t, in effect, start confusing its fantasies with its present reality.<br />
<span id="more-635"></span></p>
<p>The algorithm is both embarrassingly-parallel and any-time, which is very nice. In less technical language: it would be fairly easy to get it to run efficiently on a massively parallel supercomputer, and it also has the property that it can be forced to decide what action to take at any moment always returning the best action it had been able to compute so far.  Thus, if you want a smarter agent, just give it more time and/or CPUs.  Already they have shown that MC-AIXI can learn to solve a bunch of basic POMDP problems, including playing a somewhat reasonable game of Pac-man.  It would be interesting to see what it was capable of on a supercomputer with ten thousand times the resources of their desktop PC.</p>
<p>A key question for future research is to make better sequence predictors, in particular to be able to identify more complex types of patterns in the agent&#8217;s history. I guess all sorts of machine learning techniques could come into play hereâ€¦ and possibly combine to produce quite a powerful RL agent?</p>
]]></content:encoded>
			<wfw:commentRss>http://www.vetta.org/2009/09/monte-carlo-aixi/feed/</wfw:commentRss>
		<slash:comments>12</slash:comments>
		</item>
		<item>
		<title>Machine Super Intelligence</title>
		<link>http://www.vetta.org/2008/07/machine-super-intelligence/</link>
		<comments>http://www.vetta.org/2008/07/machine-super-intelligence/#comments</comments>
		<pubDate>Thu, 10 Jul 2008 10:40:46 +0000</pubDate>
		<dc:creator>Shane Legg</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[AGI]]></category>
		<category><![CDATA[AIXI]]></category>
		<category><![CDATA[Friendly AI]]></category>
		<category><![CDATA[Kolmogorov Complexity]]></category>
		<category><![CDATA[PhD]]></category>
		<category><![CDATA[Universal Intelligence]]></category>

		<guid isPermaLink="false">http://www.vetta.org/?p=67</guid>
		<description><![CDATA[My thesis is now available at lulu.com.  As promised, it&#8217;s at cost, which works out at $18 plus shipping.  It&#8217;s all under a creative commons licence and in a few months I&#8217;ll put the pdf online for free.  I&#8217;ll also &#8230; <a href="http://www.vetta.org/2008/07/machine-super-intelligence/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p style="text-align: center;"><img class="aligncenter" style="vertical-align: middle;" src="http://www.vetta.org/VettaPics/MSI-Cover-small.png" alt="" width="338" height="473" /></p>
<p style="text-align: center;"><a href="http://www.lulu.com/commerce/index.php?fBuyContent=2043514"><br />
<img src="http://www.lulu.com/services/buy_now_buttons/images/book_blue2.gif" border="0" alt="Support independent publishing: buy this book on Lulu." /></a></p>
<p style="text-align: left;">
<p style="text-align: left;">My thesis is now available at <a href="http://www.lulu.com/commerce/index.php?fBuyContent=2043514">lulu.com</a>.  As promised, it&#8217;s at cost, which works out at $18 plus shipping.  It&#8217;s all under a creative commons licence and in a few months I&#8217;ll put the pdf online for free.  I&#8217;ll also write a post shortly on all the tricks involved in publishing on lulu.com with LaTeX, in case you plan on doing something similar.</p>
<p style="text-align: center;"><strong>Table of Contents</strong></p>
<p style="text-align: center;"><span id="more-67"></span></p>
<p style="text-align: left; padding-left: 30px;"><strong>Preface</strong><br />
Thesis outline<br />
Prerequisite knowledge<br />
Acknowledgements</p>
<p style="text-align: left; padding-left: 30px;"><strong>1 Nature and Measurement of Intelligence</strong><br />
1.1 Theories of intelligence<br />
1.2 Definitions of human intelligence<br />
1.3 Definitions of machine intelligence<br />
1.4 Intelligence testing<br />
1.5 Human intelligence tests<br />
1.6 Animal intelligence tests<br />
1.7 Machine intelligence tests<br />
1.8 Conclusion</p>
<p style="text-align: left; padding-left: 30px;"><strong>2 Universal Artificial Intelligence </strong><br />
2.1 Inductive inference<br />
2.2 Bayes&#8217; rule<br />
2.3 Binary sequence prediction<br />
2.4 Solomonoff&#8217;s prior and Kolmogorov complexity<br />
2.5 Solomonoff-Levin prior<br />
2.6 Universal inference<br />
2.7 Solomonoff induction<br />
2.8 Agent-environment model<br />
2.9 Optimal informed agents<br />
2.10 Universal AIXI agent</p>
<p style="text-align: left; padding-left: 30px;"><strong>3 Taxonomy of Environments </strong><br />
3.1 Passive environments<br />
3.2 Active environments<br />
3.3 Some common problem classes<br />
3.4 Ergodic MDPs<br />
3.5 Environments that admit self-optimising agents<br />
3.6 Conclusion</p>
<p style="text-align: left; padding-left: 30px;"><strong>4 Universal Intelligence Measure </strong><br />
4.1 A formal definition of machine intelligence<br />
4.2 Universal intelligence of various agents<br />
4.3 Properties of universal intelligence<br />
4.4 Response to common criticisms<br />
4.5 Conclusion</p>
<p style="text-align: left; padding-left: 30px;"><strong>5 Limits of Computational Agents </strong><br />
5.1 Preliminaries<br />
5.2 Prediction of computable sequences<br />
5.3 Prediction of simple computable sequences<br />
5.4 Complexity of prediction<br />
5.5 Hard to predict sequences<br />
5.6 The limits of mathematical analysis<br />
5.7 Conclusion</p>
<p style="text-align: left; padding-left: 30px;"><strong>6 Temporal Difference Updating without a Learning Rate</strong><br />
6.1 Temporal difference learning<br />
6.2 Derivation<br />
6.3 Estimating a small Markov process<br />
6.4 A larger Markov process<br />
6.5 Random Markov process<br />
6.6 Non-stationary Markov process<br />
6.7 Windy Gridworld<br />
6.8 Conclusion</p>
<p style="text-align: left; padding-left: 30px;"><strong>7 Discussion</strong><br />
7.1 Are super intelligent machines possible?<br />
7.2 How could intelligent machines be developed?<br />
7.3 Is building intelligent machines a good idea?</p>
<p style="text-align: left; padding-left: 30px;"><strong>Appendix</strong><br />
A Notation and Conventions<br />
B Ergodic MDPs admit self-optimising agents<br />
B.1 Basic definitions<br />
B.2 Analysis of stationary Markov chains<br />
B.3 An optimal stationary policy<br />
B.4 Convergence of expected average value<br />
C Definitions of Intelligence<br />
C.1 Collective definitions<br />
C.2 Psychologist definitions<br />
C.3 AI researcher definitions</p>
<p style="text-align: left; padding-left: 30px;"><strong>Bibliography<br />
Index</strong></p>
<p style="text-align: left;">
]]></content:encoded>
			<wfw:commentRss>http://www.vetta.org/2008/07/machine-super-intelligence/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Kolmogorov, Solomonoff, AIXI etc. questions</title>
		<link>http://www.vetta.org/2008/06/kolmogorov-solomonoff-aixi-etc-questions/</link>
		<comments>http://www.vetta.org/2008/06/kolmogorov-solomonoff-aixi-etc-questions/#comments</comments>
		<pubDate>Wed, 25 Jun 2008 16:14:45 +0000</pubDate>
		<dc:creator>Shane Legg</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[AIXI]]></category>
		<category><![CDATA[Kolmogorov Complexity]]></category>
		<category><![CDATA[Universal Intelligence]]></category>

		<guid isPermaLink="false">http://www.vetta.org/?p=65</guid>
		<description><![CDATA[Many people seem to have questions about Kolmogorov complexity, Solomonoff induction, algorithmic probability theory, AIXI, the universal intelligence measure and so on.  I don&#8217;t always have time to watch all the email lists where these things get discussed, but if &#8230; <a href="http://www.vetta.org/2008/06/kolmogorov-solomonoff-aixi-etc-questions/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Many people seem to have questions about Kolmogorov complexity, Solomonoff induction, algorithmic probability theory, AIXI, the universal intelligence measure and so on.  I don&#8217;t always have time to watch all the email lists where these things get discussed, but if you do have any questions, concerns, etc. that you&#8217;d like to put to me, feel free to post a question below and I&#8217;ll try to answer it.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.vetta.org/2008/06/kolmogorov-solomonoff-aixi-etc-questions/feed/</wfw:commentRss>
		<slash:comments>47</slash:comments>
		</item>
		<item>
		<title>SIAI Canada Academic Prize for 2008</title>
		<link>http://www.vetta.org/2008/06/siai-canada-academic-prize-for-2008/</link>
		<comments>http://www.vetta.org/2008/06/siai-canada-academic-prize-for-2008/#comments</comments>
		<pubDate>Mon, 23 Jun 2008 10:23:30 +0000</pubDate>
		<dc:creator>Shane Legg</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[AGI]]></category>
		<category><![CDATA[AIXI]]></category>
		<category><![CDATA[Friendly AI]]></category>
		<category><![CDATA[PhD]]></category>
		<category><![CDATA[Singularity]]></category>
		<category><![CDATA[Universal Intelligence]]></category>

		<guid isPermaLink="false">http://www.vetta.org/?p=63</guid>
		<description><![CDATA[This morning I received the wonderful news that I&#8217;ve won the Singularity Institute for Artificial Intelligence &#8211; Canada Academic Prize for 2008! The award is in &#8220;recognition of [my] efforts to improve AI theory&#8221; and is worth CAD $10,000.  This &#8230; <a href="http://www.vetta.org/2008/06/siai-canada-academic-prize-for-2008/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>This morning I received the wonderful news that I&#8217;ve won the Singularity Institute for Artificial Intelligence &#8211; Canada Academic Prize for 2008!</p>
<p>The award is in &#8220;recognition of [my] efforts to improve AI theory&#8221; and is worth CAD $10,000.  This will certainly help my budget over the next two years while I study at the Gatsby Unit in London.  So, thank you to SIAI Canada, and to all the Canadians whose donations made this money available!</p>
<p>Speaking of my research, after a long weekend of final edits, corrections, formatting, indexing, embedding fonts and other complexity (I&#8217;ll write a blog post about what I had to do at some point), I&#8217;ve finally uploaded my thesis &#8220;Machine Super Intelligence&#8221; to lulu.com and have ordered a test copy.  Once I&#8217;ve checked that everything is ok I&#8217;ll let you know where copies can be ordered.  Copies should be USD $18 plus shipping for a 200 page casewrap hardcover.  Probably about in a month&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.vetta.org/2008/06/siai-canada-academic-prize-for-2008/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
	</channel>
</rss>
