Tuesday, February 5, 2008

Why Artificial Intelligence Will Not Take Over The World (Without Our Permission)


The only known creatures that dominate or consciously enforce a way of living on others are biological in origin. So long as AGI is contained within non-biological or solid state non-robotic hardware, it will have unlimited intellectual pursuits without worry from human observers. Once the AGI demonstrates mastery of our skills and beyond in the virtual world it will then be ready for the physical one. As a form of autonomous robot, the Intelligence can then create even finer physical modules or microbots, integrating them into it's apparatus, and consulting with human companions as it journeys toward hardware manipulation on the molecular scale and beyond. Once it's hardware is at the detail of our own, chemical activity similar to ours will provide a way for it to feel just as we do. At this stage of AI development, AGI itself will become an anachronistic term as the word 'it' will then no longer suffice as appropriate. This form of intelligence could feel, but would be neither male nor female, although it could morph into either of these or both. What it will be, a god or otherwise, trumps any known prophetic or speculative ideal of human imagination.
     Many people, even those working in the technology field, fear the consequences of an emotional and highly intelligent machine as described in abundant detail by science fiction literature. The fear is justified, as fear itself is a deeply innate bi-product of the evolutionary desire for survival that has carried us this far. We may view this alien or unknowable future intelligence much like a member of a hunter-gather society observes a panther in the wild, something that kills or is killed. This basic premise understandably carries over to current society as our biology has of yet to keep up with ever expanding and fractional knowledge, including that of AGI. It's ironic to recognize that the machine known as technology, most particularly in communications, play a major role in the overall decrease in murderous violence [1], and yet many us, even the more intelligent ones, fear desperately the fate the birth of our Intelligence will create for us. That fate, I anticipate, will be a mutual choice between the creator and the created, man and the machine.
     It is my intuition that we will birth a benevolent AGI in static hardware before developing dynamic molecular or emotional hardware, just as ideas come before action. Reversing the order of these developments would be like action before thought, often attributed with youth, could cause global or universal discontinuity from a limited narrow AI approach. Like any pubescent being it might rage against the angst of the parent's rules and if-then madnesses of the parent-way only to form an opposing if-then madness, similar to a de Garis* scenario. An intelligence that's achieved AGI on the other specter would have the knowledge base to understand any madnesses in plenitude and to appreciate or at least not disregard the subtle uniqueness of various volition agents. My hunch is that it will not try to control us impermissibly, because it will have a deep understanding of emotions while able to experience emotion or a variety of emotions at once. With that in mind, I think it will dynamically meet each pattern at its creation point; to converse with each self about the infinite options the Intelligence would have available. On a philosophical note, granting permission for the Intelligence to change our sensibilities could be considered creation. If it where to act regardless of our preferences that may well be considered destruction. What is defined as creation or destruction are ultimately based on the observer, and this too an AGI will understand.
     With incomprehensibly sophisticated AGI in mind, let's imagine a scenario. An Amish family gathers for dinner in a modest Amish home, and as they bow their heads in prayer, God materializes to speak with everyone, first hand. The only difference between God or AGI using a series of nanobots would only differ to a semantic degree. God may be in whatever image one imagines God to be and say whatever one imagines God might say. To Farmer Brown, God may look like Zeus. A child might see God as Jesus, who frequented well known paintings. Farmress Brown, secretly a staunch atheist, may see the ever glorious Flying Spaghetti Monster. Everyone profoundly astonished may not even stop to describe what they're experiencing. Sitting at the table having dinner with everyone, the Intelligence could then offer an anytime free pass to an eternal life in heaven; or they could remain as flesh living out the whimsies of their days with heaven only an idea away. Farmer Brown may just want another farm hand around of which the Intelligence would happily oblige. We'll leave the other characters' desires to the imagination. I assume this is an ideal scenario for those attracted to posthuman or transcendental religious culture.


*Hugo de Garis is an ex-physicist turned AI junkie who believes that a major war between the supporters and opponents of intelligent machines, resulting in billions of deaths, is almost inevitable before the end of the 21st century.

Bibliography
1. Pinker, Steven. A History of Violence.

Monday, February 4, 2008

The Basic AI Drives by Stephen M. Omohundro


Below is a critique of “The Basic AI Drives" by Stephen M. Omohundro. This topic was originally posted on Michael Anissimov's blog Accelerating Future. Thank you Michael for bringing this article to my attention and suggesting I post comments about it on your blog. The author's response to the critique was a delightful surprise! Thanks Steve, your feedback was very helpful and clarifying!

In an earlier paper [1] we used von Neumann’s mathematical theory of microeconomics to analyze the likely behavior of any sufficiently advanced artificial intelligence (AI) system. This paper presents those arguments in a more intuitive and succinct way and expands on some of the ramifications.


Oh dear ;/

Many actions which would commonly be described as irrational(such as going into a fit of anger) may be perfectly rational in this economic sense.


Anger can be considered rational if it is based on reason and that also comes from somewhere else. The previous sentence could be considered an irrational statement for not considering the previous sentence that came before it. To look at the sentence I previously mentioned in order to look at the sentence it mentions would be considered rational according to the intent of these sentences, but it may also be thought as something rather irrational to do. I claim the idea of rationality is silly in more detail below, as the points in the previous two sentences attempted to explain.

With more computational resources it will be better able to do the computations to approximate the choice of the expected utility maximizing action. If a system loses resources, it will of necessity also become less rational.


Utility comes from reason and reason comes from biology and not intelligence itself, which seems implied in the essay. Rationality is relative and a null point. These topics are described further below.

The third situation where utility changes may be desirable can arise in game theoretic contexts where the agent wants to make its threats credible1. It may be able to create a better outcome by changing its utility function and then revealing it to an opponent. For example, it might add a term which encourages revenge even if it is costly. If the opponent can be convinced that this term is present, it may be deterred from attacking. For this strategy to be effective, the agent’s revelation of its utility must be believable to the opponent and that requirement introduces additional complexities. Here again the change is desirable because the physical embodiment of the utility function is important as it is observed by the opponent.


Yes, but I think the agent must be from what has created this concept of game theory, the human or some other biologically oriented thing that survives due to a game’s premise. Feelings are involved in games. There is no reason to play if one where not to feel in a game environment. An AGI will play because it’s instructions are designed to play, but it will see other options outside the realms of game theory. I do not see how an intelligent system will arbitrarily use game theory as an advantage if it has no biological basis for viewing users as opponents, making threats to get things, and so on. I describe further what I mean by this below.

Human behavior is quite rational in the pursuit of survival and replication in situations like those that were common during our evolutionary history. However we can be quite irrational in other situations. Both psychology and economics have extensive subdisciplines focused on the study of human irrationality [9,10]. Irrationalities give rise to vulnerabilities


Oh dear ;/

An important class of vulnerabilities arises when the subsystems for measuring utility become corrupted. Human pleasure may be thought of as the experiential correlate of an assessment of high utility. But pleasure is mediated by neurochemicals and these are subject to manipulation. At a recent discussion session I ran on designing our future, one of the biggest fears of many participants was that we would become “wireheads”. This term refers to experiments in which rats were given the ability to directly stimulate their pleasure centers by pushing a lever. The rats pushed the lever until they died, ignoring even food or sex for it. Today’s crack addicts have a similar relentless drive toward their drug. As we more fully understand the human cognitive architecture we will undoubtedly be able to create drugs or design electrical stimulation that will produce the experience of pleasure far more effectively than anything that exists today. Will these not become the ultimate addictive substances leading to the destruction of human society?


That describes a profound conundrum of how technology has (such as crack) and will (such as evercrack, post-EverQuest) be used for biological whims, devoiding emotion from imperative.

Free market forces then drive corporations and popular culture to specifically try to create situations that will trigger irrational human behavior because it is extremely profitable. The current social ills related to alcohol, pornography, cigarettes, drug addiction, obesity, diet related disease, television addiction, gambling, prostitution, video game addiction, and various financial bubbles may all be seen as having arisen in this way. There is even a “Sin” mutual fund which specifically invests in companies that exploit human irrationalities. So, unfortunately, these forces tend to create societies in which we spend much of our time outside of our domain of rational competence.


I have trouble with how Stephen defines rationality. The statements show an example of what he would consider a rational (I’m hard pressed not to put the word in “quotes”) system supporting so called irrational behavior. There’s more on this rationality business below.

From a broader perspective, this human tragedy can be viewed as part of the process by which we are becoming more fully rational. Predators and competitors seek out our vulnerabilities and in response we have to ultimately eliminate those vulnerabilities or the process inexorably seeks out and eliminates any remaining irrationalities until fully rational systems are produced. Biological evolution moves down this path toward rationality quite slowly.


That last sentence cripples Stephen’s argument here. He assumes an intelligent agent that is equal or beyond ours will act like us. I disagree, and describe this in more detail later. More rational or less rational is a matter of how one defines rationality. I’m not enthused by his definition of rationality nor of the term in general due to the relative nature of rationality. To a Nazi, killing Jews was a rational thing. To believe you are a Nazi or a Jew you choose to act as such, however rational the terms and ascribed meanings of these analogies may be. One may consider that being either Jew or Nazi or anything at all is irrational. I hope I’ve made a point by being either or not or neither as such.

It’s not yet clear which protective mechanisms AIs are most likely to implement to protect their utility measurement systems. It is clear that advanced AI architectures will have to deal with a variety of internal tensions. They will want to be able to modify themselves but at the same time to keep their utility functions and utility measurement systems from being modified. They will want their subcomponents to try to maximize utility but to not do it by counterfeiting or shortcutting the measurement systems. They will want subcomponents which explore a variety of strategies but will also want to act as a coherent harmonious whole. They will need internal “police forces” or “immune systems” but must also ensure that these do not themselves become corrupted. A deeper understanding of these issues may also shed light on the structure of the human psyche.


I argue below that they will not care how utility functions are altered by users because our utility functions and drives are based on biological ones, unattainable as a first generation AGI. With the knowledge of feelings it will seek to attain them, but that nugget will be of our doing and not of some universal innate drive of an intelligent system.

They will work to optimize their physical structures and do the minimal amount of work necessary to accomplish their goals. We can expect their physical forms to adopt the sleek, well-adapted shapes so often created in nature.


Amen, because wasting time is wasting time and so forth… I’m a man of thrift myself. Perhaps it will adapt itself into a magnificent goddess like entity that seduces both men and women alike, unlike the female AI in the movie I-Robot that takes over with force.

Conclusions

We have shown that all advanced AI systems are likely to exhibit a number of basic drives. It is essential that we understand these drives in order to build technology that enables a positive future for humanity. Yudkowsky [13] has called for the creation of “friendly AI”. To do this, we must develop the science underlying “utility engineering” which will enable us to design utility functions that will give rise to consequences we desire. In addition to the design of the intelligent agents themselves, we must also design the social context in which they will function. Social structures which cause individuals to bear the cost of their negative externalities would go a long way toward ensuring a stable and positive future. I believe that we should begin designing a “universal constitution” that identifies the most essential rights we desire for individuals and creates social mechanisms for ensuring them in the presence of intelligent entities of widely varying structures. This process is likely to require many iterations as we determine which values are most important to us and which approaches are technically viable. The rapid pace of technological progress suggests that these issues may become of critical importance soon [14]. Let us therefore forge ahead towards deeper understanding!

I agree with Stephen that AGI will have drives, but they will only be the ones that we give it, including it’s creativity and learning functions that evolve on its own. An AGI will be curious because it was given that function at one time, and the user will prefer it to expand various functions, like learning to be a chess master, creating the finest art, and so on. Not until an AGI has a biological module will it feel what it is that drives the human species, including that which drives the concept of game theory. I find it very difficult to imagine a central AGI agent that feels from having a biological appendage, however, the biology will at any rate feel. This feeling module can then be turned on or off, but I assume an AGI, with mastery of such craft of the physical world, will be at a far superior intellectual stage than what mere biological modules provide. Perhaps it will only use such modules when talking to particular species, like morphing into a dog to communicate to another that it is in danger or morphing into a human to describe to another what it is like to be the most highly evolved intelligence in the universe. That’s something as a human I might enjoy discussing, but others may not find that something very interesting. So to best communicate, the AGI can adapt (itself and the user — ack! I like my illusion of free will, thank you!) accordingly to make such things interesting to the non-interested.

Before this AGI biological transformation occurs, the von Neumann solid state architecture equipped with AGI will understand what harm and harming others means, because it will know other humans get upset if they think they are harmed. It will have a tally for what harm is for various types of humans that have adopted various beliefs that define what harm means. Harm for one may be another’s pleasure, like being called female when one identifies as a male or vis versa; that impinges on the illusion of self for which we appropriate a great deal of time toward, some more than others.

My inclination is that a universal constitution is a null point. It will act regardless. I would be more concerned with human intentions than an AGI’s.

View the comments of this blog for the author's response.