Goodbye 2011, hello 2012

I’ve decided to once again leave my prediction for when human level AGI will arrive unchanged.  That is, I give it a log-normal distribution with a mean of 2028 and a mode of 2025, under the assumption that nothing crazy happens like a nuclear war.  I’d also like to add to this prediction that I expect to see an impressive proto-AGI within the next 8 years.  By this I mean a system with basic vision, basic sound processing, basic movement control, and basic language abilities, with all of these things being essentially learnt rather than preprogrammed.  It will also be able to solve a range of simple problems, including novel ones.

Posted in Uncategorized | 14 Comments

AIQ

Some of you might remember the talk I gave at the 2010 Singularity Summit about Algorithmic IQ, or AIQ for short.  It was an attempt to convert the theoretical Universal Intelligence Measure into a working practical test of machine intelligence.  The results were preliminary, but it seemed to work…

It’s now over a year later so I guess some of you are wondering what happened to AIQ! I’ve been very busy working on other cool stuff, however Joel Veness and I have been tinkering with AIQ in our spare time.  We’re pleased to report that it has continued to perform well, surprising well in fact.  There was some trickiness to do with getting it to work efficiently, but that aside, it worked perfectly straight out of the box.

We recently wrote a paper on AIQ that was accepted to the Solomonoff Memorial Conference.  You can get the paper here, the talk slides here, and we have also released all the Python AIQ source code here.  It’s designed to be easy to plug in your own agents, or other reference machines, if you fancy having a go at that too.

If you’re not sure you want read any of that, here’s the summary:

We implemented the simple BF reference machine and extended it in the obvious ways to compute RL environments.  We then sampled random BF programs to compute the environments, and tested against each of these.  This can be a bit slow, so we used variance reduction techniques to speed things up.  We then implemented a number agents.  Firstly, MC-AIXI, a model based RL agent that can learn to play simple games such as TicTacToe, Kuhn poker and PacMan, but is rather slow to learn.  Then HLQ(lambda), a tabular RL agent similar to Q learning but with an automatic learning rate.  Then Q(lambda), a standard RL agent, and Q(0), a weaker special case. Finally, Freq, a simple agent that just does the more rewarding action most of the time, occasionally trying a random action.  There was also a random agent, but that always got an AIQ of zero, as expected.  The results appear below, across various episode lengths:

The error bars are 95% confidence intervals for the estimates of the mean.  As you can see, AIQ orders the agents exactly we would expect, including picking up the fact the MC-AIXI, while quite powerful compared to the other agents, is also rather slow to learn and thus needs longer episode lengths.  We ran additional tests where we scaled the size of the context used by MC-AIXI, and the amount of search effort used, and in both cases the AIQ score scaled sensibly.  See the talk slides for more details, or the paper itself.

Posted in Uncategorized | 26 Comments

A Self Reflective Universe?

I’ve been very busy with practical AI matters over the last 6 months so the following (crazy?) ideas have been parked in a folder. That’s a bit boring, so I thought I’d throw it out there to see what people think…

While working on computing a Cumulative Prospect Theory model of investor behaviour a couple of years ago I came across the work of Pothos and Busemeyer (2009) where they argue that quantum probability provides a better model of human decision making, rather than taking the Prospect Theory type of approach by Kahneman, Tversky and co. That just struck me as quirky at the time: why on earth would humans implement something as exotic as quantum logic/probability? They don’t actually think that the brain is operating in the quantum regime do they, or is it computing the quantum probabilities from essentially classical computations? Surely not? I decided to pass.

Then about a year ago I was reading up on search and classification systems when I came across a book by Dominic Widdows in which he introduces quantum logic (and quantum probability) as a useful generalisation of classical logic for this domain. Still being somewhat averse to things either quantum or logic I was naturally a bit sceptical about the utility of learning more about this. It’s not that I disagree with either of these areas, the problem is that they are huge monsters that don’t seem too relevant to what interests me… and so I save my meagre mental powers for other endeavours. Nevertheless, Widdows provided some interesting arguments for why this idea might deserve a closer look, and somehow I took the bait.

What I discovered is that, while classical logic can be seen as a special limiting case of quantum logic, or conversely that quantum logic is a “softened” classical logic, there is quite a bit more subtlety to it than that. For example, Busemeyer and Trueblood (2009) argue that quantum inference can be seen as a generalisation of Bayesian inference, the two coinciding only when compatible measures are involved and thus we have a single Boolean algebra of events. Busemeyer, Wang and Townsend (2006) argue that this often isn’t the case in intelligent agents when one type of judgement interferes with another. That’s interesting, but what got me even curious was when I found Pitowsky (2003) in which he shows that the rules of rational betting imply all the main features of quantum probability. Surely that can’t just be a quirk? Piotrowski and Sladkowski (2002) even argue that we can use it to provide a solution of Newcomb’s paradox. Is something really going on here?

What really caught my interest, however, was when I picked up the book “A New Approach to Quantum Logic” by Engesser, Gabbay and Lehmann (2007 – if you google a bit there is a mostly complete draft online, but hey, it’s also pretty cheap to by). In this they discuss the dynamic and holistic aspects of quantum logic, roughly, the way in which it can describe partially observable complex systems that evolve over time and that have a kind of strong global consistency guarantee. Moreover, it is non-monotonic, self-referentially sound, complete and has so called No Windows Theorems which imply that it’s, in some sense, a completely self-contained logical entity. As far as the authors are aware, and they are experts in the area so I’ll take their word for it, this is unique.

At least to me this seems like the kind of system you’d want if you were looking for a foundation for a recursively self-improving artificial intelligence. Perhaps it’s not the way to try to build the first human level artificial intelligence, but if you were super intelligent and really knew what you were doing, maybe this is what you should be looking at…

Now let’s step back get really wild for a moment. If quantum probability/logic is the right way to build a fully self-referential and consistent, recursively self-improving super-intelligence, and it’s also the fundamental mathematical structure of the physical world… surely that can’t be a coincidence.

Posted in Uncategorized | 10 Comments

8.2 Peta FLOPS

Seven months ago China claimed the number one position in supercomputing with a 2.6 Peta FLOPS machine.  I thought that might stand for a year, but I was wrong: last week Japan unveiled their new “K Computer” at a massive 8.2 Peta FLOPS.

Seems that we’re now running a couple of years ahead of even Kurzweil’s predictions.

Posted in Uncategorized | 3 Comments

Treatise on Universal Induction

Samuel Rathmanner and Marcus Hutter recently wrote a treatise on universal induction.  While there are no proofs, it does get into some fairly deep aspects of the problem of inductive inference.  Download it here:

http://dx.doi.org/10.3390/e13061076

 

 

Posted in Uncategorized | 25 Comments

Sutton on human level AI

Prof. Rich Sutton, probably the most famous person in the field of reinforcement learning, gave a talk today at the Gatsby Unit.  I was expecting a standard introduction to reinforcement learning to begin with, but it wasn’t to be.  Instead he kicked off with 20 minutes about the singularity.

Audience: So when do you expect human level AI?

Rich: Roughly 2030.

Whether or not you agree, views like this seem to be becoming more common in academia.

Posted in Uncategorized | 20 Comments

Technological themes

Looking over my predictions for the teenies from a year ago, they already look pretty lame. Take 1/3 off USA’s PPP GDP and you already get China, the latest Sony portable device has a 4 core processor, Intel’s latest set of CPUs are once again pretty awesome, schemes to let you pay for stuff with your phone are already getting under way (via both screen bar codes and near field communication), and a graphics card review I read the other day noted that the most graphically demanding games on high resolution monitors with all the graphical bells and whistles switched on now run very well on the latest “mid range” graphics cards.

At the time that I made my teenies predictions I thought they seemed a bit like predicting the obvious. But I’m now starting to wonder whether many of my predictions could have been more tightly assigned to the following 3-4 years, rather than the next decade.

One thing I’ve noted in the past is that it’s usually easier to predict fundamental things like FLOPS per dollar than is it to predict how these technological fundamentals will translate into applications. That might be true, but knowing that your computer of five years hence will have X bytes of storage and perform Y computations per second is a bit abstract for most purposes. What will be the new toys, the new applications, the new businesses? These are the things that impact people.

If predicting specific applications is a bit much to ask for (and if I could I might not want to tell you!), perhaps the next best is to predict the general nature of applications during a period of time. What you might call the “technological theme” of a period.

1980 to about 1995 was the period of the PC. Starting with hobbyists and niche applications and spreading to take over a large chunk of the office. The IBM PC marked the point at which this went mainstream. The defining characteristic was that the communication was typically local, if the machines were networked at all.

1995 to about 2010 was the period of the internet. First emails and basic web pages, search, then ordering online, online banking, music, video, etc. Netscape marked the point at which this went mainstream. The defining characteristic was that the communication was now global but the interface with the world was usually pretty traditional: keyboard, mouse, monitor.

So what’s the next theme? Mobile internet might be an answer, but I think that it’s more general than that. As great as the internet is, most of the important stuff still occurs in that other place called reality. Maybe it’s a new house with a swimming pool, throwing a party with friends or coming down with a serious illness. I think the next theme will be for technology to interface more effectively with the world, being mobile is only one aspect of that. If I had to pin the start of this going mainstream on one thing, it’d say it was the iPhone as that’s when the internet started to show up in the day to day moments of people’s lives as they’re out and about doing things.

Once the location, state and function of many everyday objects starts to spread onto the internet, all sorts of creative efficiencies become possible. Need to pay for the coffee? Just press a button on your phone. Not sure where your car is? Ask your phone to show you the way. Need a cab or a pizza or… just select what you want on some menus on your phone. Prices, special deals, time you’ll need to wait — it will all be there. Need to keep a close eye on your health? Get a small sensor implanted that monitors your blood insulin, oxygenation, pressure, cholesterol, heart rate and so on and wirelessly updates this information to your phone. Should a problem arise, your phone can let medics know where you are and what the problem seems to be.

I don’t expect this to be a sudden change, but rather a gradual absorption of goods, services and various everyday objects into an all pervasive information network. I think this will be a hot area until about 2025. Yeah, it’s going to take a while, not so much for many of these things to become possible, but rather for them to become cheap enough to be economic.

What’s my pick for the theme that comes after that? Well, once you have so much of the economy automated and hooked up together with vast amount of information about anything and everything swirling around, the key leverage point becomes how well you can intelligently process all this in order to control and coordinate things. Thus my pick for the theme from 2025 to 2040 is machine intelligence.

Posted in Uncategorized | 17 Comments

Goodbye 2010

Well, well, another year is drawing to a close. That means it’s once again time to review what has happened and where things are going.

It’s been a very eventful year for me, both personally and on the work front. I keep my personal life off this blog, and as for work… um, significant things are happening but I’m not ready to talk about them yet :-) Thus, I’ll just stick to my general predictions this time around.

First of all, my set of predictions for the teenies. We’re only 1 year in so it’s not surprising that I’m still pretty comfortable with the predictions I’ve made. The only tweak I’ll make is that over the last year I’ve become slightly more confident that we’ll have a decent understanding of how cortex works before the end of the decade. That’s my only update.

My longest running prediction, since 1999, has been the time until roughly human level AGI. It’s been consistent since then, though last year I decided to clarify things a bit and put down an actual distribution and some parameters. Basically, I gave it a log-normal distribution with a mean of 2028, and a mode of 2025. Over the last year computer power has increased as expected, and so it looks like we’re still on target to have supercomputers with 10^18 FLOPS around 2018. In terms of neuroscience and machine learning, I think things are progressing well, maybe a little faster than I’d expected. I was toying with the idea of moving the prediction very slightly closer, but decided to play it safe and keep the prediction unmoved at 2028. With many people thinking I’m too optimistic, showing restraint is perhaps wise :-) I can always move my prediction nearer in a year or two.

One thing I screwed up last year was the 90% credibility region. Going by a log-normal CDF for my predicted mean and mode that David McFadzean did (see bottom of this page) the upper end should be a bit higher at 2045, i.e. at a CDF of 0.95. It seems that I got the lower end right, however, as the CDF is about 0.05 at 2018. With 5% at each end, that gives the 90% interval.

Posted in Uncategorized | 7 Comments

Summit 2010

Another great Singularity Summit. I liked the focus on neuroscience this time. I think it will be a major driving force behind AGI over the next 20 years. The talk by Demis Hassabis is the one to look for in this area, once they become available online. My own talk was well received — I had applause during the talk as I put up results, something that I’ve certainly never experienced before. Due to a manic schedule of meetings, deadlines and last minute results, I unfortunately didn’t get to spend much time socialising this year. Hopefully things will be a bit more sane next time around and I’ll be able to catch up with everybody properly. Looking forward to it already.

I don’t know if anybody has thought of a theme for next year’s conference yet, but I’d like to make a suggestion: ethics and AGI safety. The conference has been around for a few years now and had attracted some fairly big names and serious academics. How about a return to the core mission of SIAI? As I think AGI is approaching, we seriously need much deeper and broader thinking on these topics. One other suggestion: while big names draw the crowds, in my opinion they often give the least interesting talks. How about a couple of the most popular and accessible LessWrong posts get selected and their authors present them as Summit talks?

Posted in Uncategorized | 18 Comments

Blank covers

I was notified that copies of my PhD thesis Machine Super Intelligence have been shipping from lulu.com with blank covers. I’ve been in contact with lulu about this and it should now be fixed. Moreover, if you have a received a version with a blank cover you can contact lulu directly and they have told me that they will sort something out. I’m not sure if that means a refund or a new copy with a proper cover.

Sorry about this slip up, and thanks again to all 75 of you who have purchased copies. If there are any further problems please let me know.

Posted in Uncategorized | 2 Comments