<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>vetta project &#187; Friendly AI</title>
	<atom:link href="http://www.vetta.org/tag/friendly-ai/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.vetta.org</link>
	<description></description>
	<lastBuildDate>Thu, 22 Jul 2010 19:13:53 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
		<item>
		<title>Halloween lecture online</title>
		<link>http://www.vetta.org/2009/11/halloween-lectur/</link>
		<comments>http://www.vetta.org/2009/11/halloween-lectur/#comments</comments>
		<pubDate>Sun, 01 Nov 2009 14:09:23 +0000</pubDate>
		<dc:creator>Shane Legg</dc:creator>
				<category><![CDATA[Research Review]]></category>
		<category><![CDATA[AGI]]></category>
		<category><![CDATA[AIXI]]></category>
		<category><![CDATA[Friendly AI]]></category>
		<category><![CDATA[intelligence]]></category>
		<category><![CDATA[Neuroscience]]></category>
		<category><![CDATA[Singularity]]></category>

		<guid isPermaLink="false">http://www.vetta.org/?p=721</guid>
		<description><![CDATA[My Halloween lecture has been uploaded to youtube. The basic outline is: * what is intelligence? * Solomonoff induction * Hutter&#8217;s AIXI * Monte Carlo AIXI (here&#8217;s the missing video of it playing pac-man) * universal intelligence measure * what &#8230; <a href="http://www.vetta.org/2009/11/halloween-lectur/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p><img src="http://www.vetta.org/VettaPics/ExtroBrit_pic.jpg" alt="" /></p>
<p>My Halloween lecture has been uploaded to YouTube.  The basic outline is:</p>
<p>* what is intelligence?<br />
* Solomonoff induction<br />
* Hutter&#8217;s AIXI<br />
* Monte Carlo AIXI (here&#8217;s the missing video of it <a href="http://www.vetta.org/video/AIXI_Pacman.wmv">playing Pac-Man</a>)<br />
* universal intelligence measure<br />
* what neuroscience can teach us about AGI design<br />
* early 2020s: the Halloween scenario</p>
<p>You can get the <a href="http://www.vetta.org/documents/extrobrit_talk.pdf">slides here</a>.  I talked for 2 hours, so it&#8217;s broken up into many parts on YouTube: <a href="http://www.youtube.com/user/KoanPhilosopher#p/u/11/MGfcy9RpqBY">Part 1</a> <a href="http://www.youtube.com/user/KoanPhilosopher#p/u/7/ZgarxJJ6noY">Part 2</a> <a href="http://www.youtube.com/user/KoanPhilosopher#p/u/10/n-Ry0TE_nRA">Part 3</a> <a href="http://www.youtube.com/user/KoanPhilosopher#p/u/9/ywUf75Q0_2U">Part 4</a> <a href="http://www.youtube.com/user/KoanPhilosopher#p/u/6/MQO_k5uOD0w">Part 5</a> <a href="http://www.youtube.com/user/KoanPhilosopher#p/u/5/WRaFyI5M96g">Part 6</a> <a href="http://www.youtube.com/user/KoanPhilosopher#p/u/4/f0qf5Iu0aLg">Part 7</a> <a href="http://www.youtube.com/user/KoanPhilosopher#p/u/3/o-UCGUipg34">Part 8</a> <a href="http://www.youtube.com/user/KoanPhilosopher#p/u/8/gPW7oojUCKs">Part 9</a> <a href="http://www.youtube.com/user/KoanPhilosopher#p/u/2/fe3c3YcQZng">Part 10</a> <a href="http://www.youtube.com/user/KoanPhilosopher#p/u/1/p7Aw_7sBRPc">Part 11</a> <a href="http://www.youtube.com/user/KoanPhilosopher#p/u/0/s7ZXLd5_1_0">Part 12</a></p>
<p>Thanks to David Wood at ExtroBritannian for organising this, and to everyone who attended, especially those who travelled from other cities and countries.  I&#8217;m also grateful for the intelligent questions during my talk and all the positive feedback I&#8217;ve received since.  Thanks also to Anders Sandberg for the picture of me speaking that I stole from his <a href="http://www.flickr.com/photos/arenamontanus/">Flickr stream</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.vetta.org/2009/11/halloween-lectur/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>AGI: To create, or not to create?</title>
		<link>http://www.vetta.org/2008/09/agi-to-create-or-not-to-create/</link>
		<comments>http://www.vetta.org/2008/09/agi-to-create-or-not-to-create/#comments</comments>
		<pubDate>Sun, 07 Sep 2008 22:06:02 +0000</pubDate>
		<dc:creator>Shane Legg</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[AGI]]></category>
		<category><![CDATA[Ethics]]></category>
		<category><![CDATA[Friendly AI]]></category>

		<guid isPermaLink="false">http://www.vetta.org/?p=92</guid>
		<description><![CDATA[People interested in the technological singularity often have strangely contradictory attitudes regarding AGI development.  On one hand, progress towards AGI in terms of hardware, software, design and theory is all very exciting and generally super cool.  Yay, all hail AGI &#8230; <a href="http://www.vetta.org/2008/09/agi-to-create-or-not-to-create/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>People interested in the technological singularity often have strangely contradictory attitudes regarding AGI development.  On one hand, progress towards AGI in terms of hardware, software, design and theory is all very exciting and generally super cool.  Yay, all hail AGI progress!  On the other hand, many of these people, often the very same people, believe that the development of a powerful AGI might well spell the end of humanity.  Hssss, booo!  I&#8217;ll admit to being one of these somewhat conflicted people myself.</p>
<p>Now, I understand that a really wonderfully nice AGI is probably a very good thing, and a flawed one is probably bad news.  We can all support efforts to push AGI towards the more desirable types of outcomes.  But what about AGI research in general?  That is, the work that goes into trying to figure out how to make artificial systems more powerful and general, in other words, more intelligent.  Is this a good thing?  Is it a bad thing?</p>
<p>More pointedly: Imagine that you seriously thought that you might be able to build the first AGI.  Other people might think you&#8217;re deluded, and maybe they are right.  Nevertheless, from where you stand it looks like you have a real chance of making it happen.  Would you go ahead and actually try to do it?</p>
]]></content:encoded>
			<wfw:commentRss>http://www.vetta.org/2008/09/agi-to-create-or-not-to-create/feed/</wfw:commentRss>
		<slash:comments>14</slash:comments>
		</item>
		<item>
		<title>An imitation test for moral capacity</title>
		<link>http://www.vetta.org/2008/08/an-imitation-test-for-moral-capacity/</link>
		<comments>http://www.vetta.org/2008/08/an-imitation-test-for-moral-capacity/#comments</comments>
		<pubDate>Sat, 09 Aug 2008 12:58:54 +0000</pubDate>
		<dc:creator>Shane Legg</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[AGI]]></category>
		<category><![CDATA[Ethics]]></category>
		<category><![CDATA[Friendly AI]]></category>
		<category><![CDATA[Singularity]]></category>

		<guid isPermaLink="false">http://www.vetta.org/?p=80</guid>
		<description><![CDATA[Yudkowsky has been posting a lot on Overcoming Bias recently about his theory of metaethics.  Today he posted a summary of sorts.  Essentially he seems to be saying that morality is a big complex function computed by our brain that &#8230; <a href="http://www.vetta.org/2008/08/an-imitation-test-for-moral-capacity/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Yudkowsky has been posting a lot on <a href="http://www.overcomingbias.com/">Overcoming Bias</a> recently about his theory of metaethics.  Today he posted a <a href="http://www.overcomingbias.com/2008/08/rightness-redux.html">summary</a> of sorts.  Essentially he seems to be saying that morality is a big complex function computed by our brain that doesn&#8217;t derive from any single unifying principle.  Rather, this function is a mishmash of things and even we don&#8217;t really know what our own function is, in the sense that we are unable to write down an exact and complete formulation.  It&#8217;s just something that we intuitively use.</p>
<p>I&#8217;m not convinced that ethics can&#8217;t be derived from some deeper unifying principle.  I&#8217;m also not convinced that it can, lest you misunderstand me.  What I do accept is that if this is possible then finding such a principle and convincingly arguing for it is likely to be difficult in the extreme, and probably won&#8217;t happen before the singularity.  Nevertheless, I haven&#8217;t yet seen any argument so devastating to this possibility that I&#8217;m willing to move it from being extremely difficult to certainly impossible.  Any system of ethics that does derive from some unifying metaethical principle is almost certainly going to be different to our present (Western?) ethical notions.  I think some degree of this is acceptable, given that our ethical ideas do change a bit over time.  Furthermore, no matter how human we try to make the ethical system of a powerful AGI, post-singularity we are still going to be faced with ethical challenges that our pre-singularity ethics were never set up to deal with.  Thus, our ethics are going to have to be modified and updated in order to remain somewhat consistent and viable, otherwise we&#8217;ll end up with <a href="http://www.vetta.org/2008/05/aliens-may-be-free-from-original-sin/">this kind of nonsense</a>.</p>
<p><span id="more-80"></span>Anyway, let&#8217;s assume that this unifying principle either does not exist, or at least can&#8217;t be found.  How can we tell if an AGI is ethical given that we can&#8217;t explicitly and completely specify what this means?  This seems like the problem Turing faced when trying to determine whether a machine is intelligent or not.  He figured that he couldn&#8217;t explicitly and completely say what intelligence is, unlike the <a href="http://www.vetta.org/documents/UniversalIntelligence.pdf">research</a> by Hutter and myself, and thus he tried to dodge the issue in the obvious way by setting up an imitation game that doesn&#8217;t require an explicit description of intelligence.</p>
<p>Here we can do something similar: set up a group of people and the AGI, and have a panel of expert judges ask them ethical questions.  If the judges cannot tell which one is the machine, then it passes.  Given that the morality function varies between people, and that we can&#8217;t say explicitly and completely what our own function is, this seems to be about the best we could hope for.  Naturally, this doesn&#8217;t prove that the AGI, or indeed any of the humans participating, is &#8220;good&#8221;.  An evil genius could probably pass such a test.  Rather, it is simply designed to test whether the AGI is at least able to compute a version of the human morality function which is sufficiently similar to ours that it is able to pass as being human.  Whether the AGI (or human) actually takes its human-passable morality function and reliably and consistently seeks to follow it into the future is a whole other set of problems.  Thus, passing such a test is perhaps a necessary, but certainly not a sufficient, condition for having an ethical AGI.</p>
<p>I&#8217;m sure somebody must have proposed this idea before, but my half-hearted attempt to find it on Google didn&#8217;t turn up anything.  I should also point out that in order for this test to work you&#8217;d probably want the AGI to pass a more general Turing test first so that it doesn&#8217;t get singled out by the judges for various other reasons.  Only then should you bring in a group of expert ethicists to try to judge which of the test subjects was ethically inhuman.  We would also want to include among the test subjects a few very nice people and a couple of professional ethicists, as we wouldn&#8217;t want the AGI to be able to &#8220;fail&#8221; for being too nice or too consistently ethical.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.vetta.org/2008/08/an-imitation-test-for-moral-capacity/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
		<item>
		<title>Machine Super Intelligence</title>
		<link>http://www.vetta.org/2008/07/machine-super-intelligence/</link>
		<comments>http://www.vetta.org/2008/07/machine-super-intelligence/#comments</comments>
		<pubDate>Thu, 10 Jul 2008 10:40:46 +0000</pubDate>
		<dc:creator>Shane Legg</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[AGI]]></category>
		<category><![CDATA[AIXI]]></category>
		<category><![CDATA[Friendly AI]]></category>
		<category><![CDATA[Kolmogorov Complexity]]></category>
		<category><![CDATA[PhD]]></category>
		<category><![CDATA[Universal Intelligence]]></category>

		<guid isPermaLink="false">http://www.vetta.org/?p=67</guid>
		<description><![CDATA[My thesis is now available at lulu.com.  As promised, it&#8217;s at cost, which works out at $18 plus shipping.  It&#8217;s all under a Creative Commons licence and in a few months I&#8217;ll put the PDF online for free.  I&#8217;ll also &#8230; <a href="http://www.vetta.org/2008/07/machine-super-intelligence/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p style="text-align: center;"><img class="aligncenter" style="vertical-align: middle;" src="http://www.vetta.org/VettaPics/MSI-Cover-small.png" alt="" width="338" height="473" /></p>
<p style="text-align: center;"><a href="http://www.lulu.com/commerce/index.php?fBuyContent=2043514"><br />
<img src="http://www.lulu.com/services/buy_now_buttons/images/book_blue2.gif" border="0" alt="Support independent publishing: buy this book on Lulu." /></a></p>
<p style="text-align: left;">My thesis is now available at <a href="http://www.lulu.com/commerce/index.php?fBuyContent=2043514">lulu.com</a>.  As promised, it&#8217;s at cost, which works out at $18 plus shipping.  It&#8217;s all under a Creative Commons licence and in a few months I&#8217;ll put the PDF online for free.  I&#8217;ll also write a post shortly on all the tricks involved in publishing on lulu.com with LaTeX, in case you plan on doing something similar.</p>
<p style="text-align: center;"><strong>Table of Contents</strong></p>
<p style="text-align: center;"><span id="more-67"></span></p>
<p style="text-align: left; padding-left: 30px;"><strong>Preface</strong><br />
Thesis outline<br />
Prerequisite knowledge<br />
Acknowledgements</p>
<p style="text-align: left; padding-left: 30px;"><strong>1 Nature and Measurement of Intelligence</strong><br />
1.1 Theories of intelligence<br />
1.2 Definitions of human intelligence<br />
1.3 Definitions of machine intelligence<br />
1.4 Intelligence testing<br />
1.5 Human intelligence tests<br />
1.6 Animal intelligence tests<br />
1.7 Machine intelligence tests<br />
1.8 Conclusion</p>
<p style="text-align: left; padding-left: 30px;"><strong>2 Universal Artificial Intelligence</strong><br />
2.1 Inductive inference<br />
2.2 Bayes&#8217; rule<br />
2.3 Binary sequence prediction<br />
2.4 Solomonoff&#8217;s prior and Kolmogorov complexity<br />
2.5 Solomonoff-Levin prior<br />
2.6 Universal inference<br />
2.7 Solomonoff induction<br />
2.8 Agent-environment model<br />
2.9 Optimal informed agents<br />
2.10 Universal AIXI agent</p>
<p style="text-align: left; padding-left: 30px;"><strong>3 Taxonomy of Environments</strong><br />
3.1 Passive environments<br />
3.2 Active environments<br />
3.3 Some common problem classes<br />
3.4 Ergodic MDPs<br />
3.5 Environments that admit self-optimising agents<br />
3.6 Conclusion</p>
<p style="text-align: left; padding-left: 30px;"><strong>4 Universal Intelligence Measure</strong><br />
4.1 A formal definition of machine intelligence<br />
4.2 Universal intelligence of various agents<br />
4.3 Properties of universal intelligence<br />
4.4 Response to common criticisms<br />
4.5 Conclusion</p>
<p style="text-align: left; padding-left: 30px;"><strong>5 Limits of Computational Agents</strong><br />
5.1 Preliminaries<br />
5.2 Prediction of computable sequences<br />
5.3 Prediction of simple computable sequences<br />
5.4 Complexity of prediction<br />
5.5 Hard to predict sequences<br />
5.6 The limits of mathematical analysis<br />
5.7 Conclusion</p>
<p style="text-align: left; padding-left: 30px;"><strong>6 Temporal Difference Updating without a Learning Rate</strong><br />
6.1 Temporal difference learning<br />
6.2 Derivation<br />
6.3 Estimating a small Markov process<br />
6.4 A larger Markov process<br />
6.5 Random Markov process<br />
6.6 Non-stationary Markov process<br />
6.7 Windy Gridworld<br />
6.8 Conclusion</p>
<p style="text-align: left; padding-left: 30px;"><strong>7 Discussion</strong><br />
7.1 Are super intelligent machines possible?<br />
7.2 How could intelligent machines be developed?<br />
7.3 Is building intelligent machines a good idea?</p>
<p style="text-align: left; padding-left: 30px;"><strong>Appendix</strong><br />
A Notation and Conventions<br />
B Ergodic MDPs admit self-optimising agents<br />
B.1 Basic definitions<br />
B.2 Analysis of stationary Markov chains<br />
B.3 An optimal stationary policy<br />
B.4 Convergence of expected average value<br />
C Definitions of Intelligence<br />
C.1 Collective definitions<br />
C.2 Psychologist definitions<br />
C.3 AI researcher definitions</p>
<p style="text-align: left; padding-left: 30px;"><strong>Bibliography<br />
Index</strong></p>
]]></content:encoded>
			<wfw:commentRss>http://www.vetta.org/2008/07/machine-super-intelligence/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>SIAI Canada Academic Prize for 2008</title>
		<link>http://www.vetta.org/2008/06/siai-canada-academic-prize-for-2008/</link>
		<comments>http://www.vetta.org/2008/06/siai-canada-academic-prize-for-2008/#comments</comments>
		<pubDate>Mon, 23 Jun 2008 10:23:30 +0000</pubDate>
		<dc:creator>Shane Legg</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[AGI]]></category>
		<category><![CDATA[AIXI]]></category>
		<category><![CDATA[Friendly AI]]></category>
		<category><![CDATA[PhD]]></category>
		<category><![CDATA[Singularity]]></category>
		<category><![CDATA[Universal Intelligence]]></category>

		<guid isPermaLink="false">http://www.vetta.org/?p=63</guid>
		<description><![CDATA[This morning I received the wonderful news that I&#8217;ve won the Singularity Institute for Artificial Intelligence &#8211; Canada Academic Prize for 2008! The award is in &#8220;recognition of [my] efforts to improve AI theory&#8221; and is worth CAD $10,000.  This &#8230; <a href="http://www.vetta.org/2008/06/siai-canada-academic-prize-for-2008/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>This morning I received the wonderful news that I&#8217;ve won the Singularity Institute for Artificial Intelligence &#8211; Canada Academic Prize for 2008!</p>
<p>The award is in &#8220;recognition of [my] efforts to improve AI theory&#8221; and is worth CAD $10,000.  This will certainly help my budget over the next two years while I study at the Gatsby Unit in London.  So, thank you to SIAI Canada, and to all the Canadians whose donations made this money available!</p>
<p>Speaking of my research, after a long weekend of final edits, corrections, formatting, indexing, embedding fonts and other complications (I&#8217;ll write a blog post about what I had to do at some point), I&#8217;ve finally uploaded my thesis &#8220;Machine Super Intelligence&#8221; to lulu.com and have ordered a test copy.  Once I&#8217;ve checked that everything is OK I&#8217;ll let you know where copies can be ordered.  Copies should be USD $18 plus shipping for a 200-page casewrap hardcover.  Probably in about a month&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.vetta.org/2008/06/siai-canada-academic-prize-for-2008/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Thinking about ethical AGI, part 2</title>
		<link>http://www.vetta.org/2008/06/thinking-about-ethical-agi-part-2/</link>
		<comments>http://www.vetta.org/2008/06/thinking-about-ethical-agi-part-2/#comments</comments>
		<pubDate>Sun, 01 Jun 2008 16:04:49 +0000</pubDate>
		<dc:creator>Shane Legg</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[AGI]]></category>
		<category><![CDATA[Ethics]]></category>
		<category><![CDATA[Friendly AI]]></category>
		<category><![CDATA[future]]></category>
		<category><![CDATA[Singularity]]></category>

		<guid isPermaLink="false">http://www.vetta.org/?p=57</guid>
		<description><![CDATA[Currently the foremost thinker on the ethics and safety of artificial general intelligence is Eliezer Yudkowsky of the Singularity Institute for Artificial Intelligence.  On a few occasions I have tried to read some of his writings on this topic.  Every &#8230; <a href="http://www.vetta.org/2008/06/thinking-about-ethical-agi-part-2/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Currently the foremost thinker on the ethics and safety of artificial general intelligence is <span class="post-footers"> Eliezer Yudkowsky of the Singularity Institute for Artificial Intelligence.  On a few occasions I have tried to read some of his writings on this topic.  Every time, however, I would give up after about ten pages.  I found the ideas to be very jumbled up: a kind of patchwork that didn&#8217;t flow together to produce any kind of a whole.  I would read something that made good sense to me, followed by what I considered to be an unjustified leap in reasoning.  I also didn&#8217;t like his style of writing.  Rather than a dry academic style his writing was more folksy.  Maybe this style appeals to Hofstadter fans, though I&#8217;ve never been a fan of Hofstadter myself.  Moreover, I felt there was an unpleasant underlying tone: an attitude that seemed to say &#8220;if you object to this statement it is because you are either not intelligent enough or have not studied enough to understand why I am right&#8221;.</span></p>
<p>A few months ago I started reading the <a href="http://www.overcomingbias.com">Overcoming Bias</a> blog, on which Yudkowsky was discussing physics (I&#8217;m still not sure why he&#8217;s so involved with physics now, but he&#8217;s slowly getting around to explaining this).  Anyway, I have found his writings here to be much more to my liking.  His ideas seem clearer, more focused and better organised, and I find the style and tone to be much improved.  If you like interesting philosophical discussions and you haven&#8217;t seen the blog already, you might want to check it out.</p>
<p>Encouraged by this I decided to have another look at Yudkowsky&#8217;s writings on the ethics and safety of artificial general intelligence.  This time I went for one of his most recent pieces: <a href="http://singinst.org/AIRisk.pdf">Artificial Intelligence as a Positive and Negative Factor in Global Risk</a>, a book chapter he wrote for Global Catastrophic Risks.  If, like me, you were put off by some of his earlier writings, you might want to have a look at this newer document.  While I naturally don&#8217;t agree with everything in the chapter, in my opinion the points are clearly argued and fit together well.  Indeed, I found myself agreeing with most of his points.  In short, if you are new to the safety of powerful AI technologies, I&#8217;d suggest that you put this document at the top of your reading list.</p>
<p><em>In part 3 I&#8217;ll get back to my own thoughts on the matter&#8230;</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.vetta.org/2008/06/thinking-about-ethical-agi-part-2/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>
