<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>vetta project &#187; Ethics</title>
	<atom:link href="http://www.vetta.org/tag/ethics/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.vetta.org</link>
	<description></description>
	<lastBuildDate>Thu, 22 Jul 2010 19:13:53 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
		<item>
		<title>AGI: To create, or not to create?</title>
		<link>http://www.vetta.org/2008/09/agi-to-create-or-not-to-create/</link>
		<comments>http://www.vetta.org/2008/09/agi-to-create-or-not-to-create/#comments</comments>
		<pubDate>Sun, 07 Sep 2008 22:06:02 +0000</pubDate>
		<dc:creator>Shane Legg</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[AGI]]></category>
		<category><![CDATA[Ethics]]></category>
		<category><![CDATA[Friendly AI]]></category>

		<guid isPermaLink="false">http://www.vetta.org/?p=92</guid>
		<description><![CDATA[People interested in the technological singularity often have strangely contradictory attitudes regarding AGI development.  On one hand, progress towards AGI in terms of hardware, software, design and theory is all very exciting and generally super cool.  Yay, all hail AGI &#8230; <a href="http://www.vetta.org/2008/09/agi-to-create-or-not-to-create/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>People interested in the technological singularity often have strangely contradictory attitudes regarding AGI development.  On one hand, progress towards AGI in terms of hardware, software, design and theory is all very exciting and generally super cool.  Yay, all hail AGI progress!  On the other hand, many of these people, often the very same people, believe that the development of a powerful AGI might well spell the end of humanity.  Hssss, booo!  I&#8217;ll admit to being one of these somewhat contradicted people myself.</p>
<p>Now, I understand that a really wonderfully nice AGI is probably a very good thing, and a flawed one is probably bad news.  We can all support efforts to push AGI towards the more desirable types of outcomes.  But what about AGI research in general?  That is, the work that goes into trying to figure out how to make artificial systems more powerful and general, in other words, more intelligent.  Is this a good thing?  Is it a bad thing?</p>
<p>More pointedly: Imagine that you seriously thought that you might be able to build the first AGI.  Other people might think you&#8217;re deluded, and maybe they are right.  Nevertheless, from where you stand it looks like you have a real chance of making it happen.  Would you go ahead and actually try to do it?</p>
]]></content:encoded>
			<wfw:commentRss>http://www.vetta.org/2008/09/agi-to-create-or-not-to-create/feed/</wfw:commentRss>
		<slash:comments>14</slash:comments>
		</item>
		<item>
		<title>An imitation test for moral capacity</title>
		<link>http://www.vetta.org/2008/08/an-imitation-test-for-moral-capacity/</link>
		<comments>http://www.vetta.org/2008/08/an-imitation-test-for-moral-capacity/#comments</comments>
		<pubDate>Sat, 09 Aug 2008 12:58:54 +0000</pubDate>
		<dc:creator>Shane Legg</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[AGI]]></category>
		<category><![CDATA[Ethics]]></category>
		<category><![CDATA[Friendly AI]]></category>
		<category><![CDATA[Singularity]]></category>

		<guid isPermaLink="false">http://www.vetta.org/?p=80</guid>
		<description><![CDATA[Yudkowsky has been posting a lot on Overcoming Bias recently about his theory of metaethics.  Today he posted a summary of sorts.  Essentially he seems to be saying that morality is a big complex function computed by our brain that &#8230; <a href="http://www.vetta.org/2008/08/an-imitation-test-for-moral-capacity/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Yudkowsky has been posting a lot on <a href="http://www.overcomingbias.com/">Overcoming Bias</a> recently about his theory of metaethics.  Today he posted a <a href="http://www.overcomingbias.com/2008/08/rightness-redux.html">summary</a> of sorts.  Essentially he seems to be saying that morality is a big complex function computed by our brain that doesn&#8217;t derive from any single unifying principle.  Rather, this function is a mishmash of things and even we don&#8217;t really know what our own function is, in the sense that we are unable to write down an exact and complete formulation.  It&#8217;s just something that we intuitively use.</p>
<p>I&#8217;m not convinced that ethics can&#8217;t be derived from some deeper unifying principle.  I&#8217;m also not convinced that it can, lest you misunderstand me.  What I do accept is that if this is possible then finding such a principle and convincingly arguing for it is likely to be difficult in the extreme, and probably not something that is likely to happen before the singularity.  Nevertheless, I haven&#8217;t yet seen any argument so devastating to this possibility that I&#8217;m willing to move it from being extremely difficult to certainly impossible.  Any system of ethics that does derive from some unifying metaethical principle is almost certainly going to be different to our present (western?) ethical notions.  I think some degree of this is acceptable, given that our ethical ideas do change a bit over time.  Furthermore, no matter how human we try to make the ethical system of a powerful AGI, post-singularity we are still going to be faced with ethical challenges that our pre-singularity ethics were never set up to deal with.  Thus, our ethics are going to have to be modified and updated in order to remain somewhat consistent and viable, otherwise we&#8217;ll end up with <a href="http://www.vetta.org/2008/05/aliens-may-be-free-from-original-sin/">this kind of nonsense</a>.</p>
<p><span id="more-80"></span>Anyway, let&#8217;s assume that this unifying principle either does not exist, or at least can&#8217;t be found.  How can we tell if an AGI is ethical given that we can&#8217;t explicitly and completely specify what this means?  This seems like the problem Turing faced when trying to determine whether a machine is intelligent or not.  He figured that he couldn&#8217;t explicitly and completely say what intelligence is, unlike the <a href="http://www.vetta.org/documents/UniversalIntelligence.pdf">research</a> by Hutter and myself, and thus he tried to dodge the issue in the obvious way by setting up an imitation game that doesn&#8217;t require an explicit description of intelligence.</p>
<p>Here we can do something similar: set up a group of people and the AGI and ask them ethical questions from a panel of expert judges.  If the judges cannot tell which the machine is, then it passes.  Given that the morality function varies between people, and that we can&#8217;t say explicitly and completely what our own function is, this seems to be about the best we could hope for.  Naturally, this doesn&#8217;t prove that the AGI, or indeed any of the humans participating, are &#8220;good&#8221;.  An evil genius could probably pass such a test.  Rather, it is simply designed to test whether the AGI is at least able to compute a version of the human morality function which is sufficiently similar to ours that it is able to pass as being human.  Whether the AGI (or human) actually takes its human-passable morality function and reliably and consistently seeks to follow it into the future is a whole other set of problems.  Thus, passing such a test is perhaps a necessary, but certainly not a sufficient condition for having an ethical AGI.</p>
<p>I&#8217;m sure somebody must have proposed this idea before, but at least my half hearted attempt to find the idea on Google didn&#8217;t turn up anything.  I should also point out that in order for this test to work you&#8217;d probably want the AGI to pass a more general Turing test first so that it doesn&#8217;t get singled out by the judges for various other reasons.  Only then should you bring in a group of expert ethicists to try to judge which of the test subjects was ethically inhuman.  We would also want to include in the test subjects a few very nice people and a couple of professional ethicists as we wouldn&#8217;t want the AGI to be able to &#8220;fail&#8221; for being too nice or consistently ethical.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.vetta.org/2008/08/an-imitation-test-for-moral-capacity/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
		<item>
		<title>Thinking about ethical AGI, part 2</title>
		<link>http://www.vetta.org/2008/06/thinking-about-ethical-agi-part-2/</link>
		<comments>http://www.vetta.org/2008/06/thinking-about-ethical-agi-part-2/#comments</comments>
		<pubDate>Sun, 01 Jun 2008 16:04:49 +0000</pubDate>
		<dc:creator>Shane Legg</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[AGI]]></category>
		<category><![CDATA[Ethics]]></category>
		<category><![CDATA[Friendly AI]]></category>
		<category><![CDATA[future]]></category>
		<category><![CDATA[Singularity]]></category>

		<guid isPermaLink="false">http://www.vetta.org/?p=57</guid>
		<description><![CDATA[Currently the foremost thinker on the ethics and safety of artificial general intelligence is Eliezer Yudkowsky of the Singularity Institute for Artificial Intelligence.  On a few occasions I have tried to read some of his writings on this topic.  Every &#8230; <a href="http://www.vetta.org/2008/06/thinking-about-ethical-agi-part-2/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Currently the foremost thinker on the ethics and safety of artificial general intelligence is <span class="post-footers"> Eliezer Yudkowsky of the Singularity Institute for Artificial Intelligence.  On a few occasions I have tried to read some of his writings on this topic.  Every time, however, I would give up after about ten pages.  I found the ideas to be very jumbled up: a kind of patch work that didn&#8217;t flow together to produce any kind of a whole.  I would read something that made good sense to me, followed by what I considered to be an unjustified leap in reasoning.  I also didn&#8217;t like his style of writing.  Rather than a dry academic style his writing was more folksy.   Maybe this style appeals to Hofstadter fans, though I&#8217;ve never been a fan of the latter myself.  Moreover, I felt there was an unpleasant underlying tone: an attitude that seemed to say &#8220;if you object to this statement it is because you are either not intelligent enough or have not studied enough to understand why I am right&#8221;. </span></p>
<p>A few months ago I started reading the <a href="http://www.overcomingbias.com">Overcoming Bias</a> blog, on which Yudkowsky was discussing physics (I&#8217;m still not sure why he&#8217;s so involved with physics now, but he&#8217;s slowly getting around to explaining this).  Anyway, I have found his writings here to be much more to my liking.  His ideas seem clearer, more focused and organised and I find the style and tone to be much improved.  If you like some interesting philosophical discussions and you haven&#8217;t seen the blog already, you might want to check it out.</p>
<p>Encouraged by this I decided to have another look at Yudkowsky&#8217;s writings on the ethics and safety of artificial general intelligence.  This time I went for one of his most recent pieces: <a href="http://singinst.org/AIRisk.pdf">Artificial Intelligence as a Positive and Negative Factor in Global Risk</a>, a book chapter he did for Global Catastrophic Risks.  If, like me, you were put off by some of his earlier writings, you might want to have a look at this newer document.  While I naturally don&#8217;t agree with everything in the chapter, in my opinion the points are clearly argued and fit together well.  Indeed, I found myself agreeing with most of his points.  In short, if you are new to the safety of powerful AI technologies, I&#8217;d suggest that you put this document at the top of your reading list.</p>
<p><em>In part 3 I&#8217;ll get back to my own thoughts on the matter&#8230;</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.vetta.org/2008/06/thinking-about-ethical-agi-part-2/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>
