Current Posts

 

INDEX.

Welcome to Super-rational, a group discussion in the spirit of Enlightenment values and in celebration of the faculty of Reason. This blog is structured as an extended conversation, where each post is formulated as a response to the previous ones. This post, the INDEX, lists each post in chronological order so that the integrity of this conversation is maintained for a new reader.


  1. The Platonia Dilemma by Aziz

  2. Enlightened self-interest by araven

  3. I choose to defect by Aziz

  4. I choose to cooperate by Keith

  5. Right and wrong by Aziz

  6. Payoff and right by Keith

  7. Mapping to reality by Phlegm

  8. I choose to cooperate - because I can by Matoko

  9. Super? Super-rational by sourat



In addition, the right sidebar features a del.icio.us-powered list of links that pertain to Reason and the Enlightenment. Anyone may submit a link to this feed by registering a del.icio.us account and then tagging the link with "enlightenment" and "for:azizhp".

permalink | posted by Aziz P. | 1.03.2007 | 0 comments

 

Super? Super Rational.

The question of morality came up, but I think it is irrelevant here. Game theory and the PD are about strategy, a set of moves that maximizes your objective within the constraints of the problem...and in this problem the objective is to maximize your benefit (money).

We'll get back to this problem in a moment. Game theory and strategy can take several forms depending on the assumptions that are made: whether communication is allowed, the skill level of the other 'gamers', whether the game is co-operative or not, whether there is perfect information, whether the game is simultaneous or sequential, and so on. Fundamentally, it is making decisions about what you think the other gamer(s) will do.

IMHO, this single-shot problem has a Nash-esque equilibrium. In other words, your choice constitutes the best response to the actions of the other players: you do what is best for yourself, given the group. This is the rational equilibrium. I ran 10,000 simulations using a simple binomial(p) distribution to model the anticipated choices of the other gamers, and here are the results I got using Hofstadter's 20-player PD and its payoff matrix.





    p       average C payout    average D payout    average payout
    0.10    51.3                74.3                52.1
    0.25    42.7                75.6                48.3
    0.50    28.5                57.0                40.4
    0.75    14.2                38.1                30.3
    0.90     5.8                25.7                23.2


p is the anticipated probability that the other gamers choose Defect. As Hofstadter expected, everyone would choose to Cooperate in order to maximize the group payout, but only if you assume that every player is rational and informed.

A better assumption is that you can't predict what other gamers will do. In other words, you predict that other gamers will be equally likely to Cooperate or Defect (p = 0.50 above), in which case you Defect, since the average D payout of 57.0 beats the average C payout of 28.5. This may explain why Hofstadter's experiment resulted in so many Defectors in his sample of "super" rational thinkers.
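For anyone who wants to play with this, here is a minimal sketch of that kind of simulation (the per-pair payoffs of $3 / $0 / $5 / $1 are the ones quoted from Hofstadter's letter in Aziz's first post; this is illustrative only, so don't expect it to reproduce my table digit for digit):

    import random

    # Per-pair payoffs from Hofstadter's letter: both cooperate -> $3 each,
    # C against D -> $0, D against C -> $5, both defect -> $1 each.
    PAYOFF = {('C', 'C'): 3, ('C', 'D'): 0, ('D', 'C'): 5, ('D', 'D'): 1}

    def payout(my_move, others):
        """Total payout from playing my_move against each of the other players."""
        return sum(PAYOFF[(my_move, o)] for o in others)

    def simulate(p_defect, n_players=20, trials=10_000):
        """Average payouts of a Cooperator and a Defector when each of the
        other n_players - 1 gamers defects independently with probability p_defect."""
        c_total = d_total = 0
        for _ in range(trials):
            others = ['D' if random.random() < p_defect else 'C'
                      for _ in range(n_players - 1)]
            c_total += payout('C', others)
            d_total += payout('D', others)
        return c_total / trials, d_total / trials

    for p in (0.10, 0.25, 0.50, 0.75, 0.90):
        avg_c, avg_d = simulate(p)
        print(f"p={p:.2f}  avg C payout={avg_c:5.1f}  avg D payout={avg_d:5.1f}")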

Maybe they would all cooperate if they were super-super-rational. (This seems like the beginning of some bad time-travel movie.)


permalink | posted by sourat | 11.28.2006 | 0 comments

 

I choose to cooperate-- because i can.

Hofstadter was disappointed in his results--he says, "Cooperate was the answer I was hoping to receive from everyone." In the problem definition for the single-shot Platonia Dilemma, he deliberately selected individuals who he believed would be superrational. Not a random sample.

The answer closest to what i would have done was Scott Buresh's. Hofstadter says,
...he treated his own brain as a simulation of other people's brains and ran the simulation enough to get a sense of what a typical person would do.
The results of Buresh's simulation (or Einsteinian thought experiment) parallel Hofstadter's sampling results. In Buresh's simulation, he chose to cooperate roughly one third of the time. Two thirds of the chosen players chose to defect, including Buresh. This aligns with my own hypothesis, that reason is non-deterministic.

Phlegm is right: it is not possible to think of the single-shot PD as a black box, bereft of I/O. Aziz says he can choose to cooperate based on the mores and social values he has learned (input). Brenner (from Hofstadter's original matrix) cooperated and says he feared being labelled a defector in a published journal (output).

I do not, however, believe that this is a "moral" choice. Morals can contribute to the iterated PD. (Phlegm, you should read Maynard Smith for this.) But for the single shot, it is more about survival versus altruism.

Now, at this point i could launch into a long-winded discussion of the evolution of cooperation, but Aziz has strictly restricted me to the single shot.
So in closing, i say, i choose to cooperate--because I can. ;)

permalink | posted by jinnderella | 8.15.2006 | 0 comments

 

Mapping to Reality.

As my contribution to this is over two years in waiting, I'm going to step back before zoning in on particulars.

For starters, both the Prisoner's Dilemma and Platonia Dilemma are but two idealized, zero-sum games in all of game theory. They are idealized in that they assume the players are all completely rational (or Super-Rational) with substantiated beliefs about what the other players might do, and they are zero-sum in that the players' interests are diametrically opposed. Their one-shot versions favor defection as the Nash Equilibrium or winning strategy, while their iterated versions favor Tit-for-Tat. Okay, moving on...

Keep it in the Game
Mapping these games to morality is, to put it bluntly, just plain incorrect. As games, the decisions of each player should be confined to the rules of the game; you don't give up the ball in football or give up your queen in chess because it's the "right" thing to do. No, the goal of each game is simply to win, and you bound your decisions by the rules of the game. Morality has nothing to do with it; it's a game! The only moral defection is in breaking the game's rules to achieve victory, or in making decisions based upon external influences (such as throwing the fight because you've been bribed to lose). If a player wants to win the game for reasons beyond the scope of the game's rules, he is not in fact playing the same game. In other words, if you don't want to win, you aren't even playing.

If you want the PD to represent morality, I'd suggest you substitute "Good" and "Bad" with areas of grey for the dollar amounts in the matrix. There is then no moral high-ground to justify a player's less-than-optimal play; the implementation details of what is Good and Bad are safely compartmentalized.

Real Life is not Zero-Sum
In most real-life interactions, each player benefits to some degree; it's just a matter of how much. You might haggle for a better price for your new bazooka, but in the end both you and the arms dealer get what you want. It is easier to see how morality maps onto non-zero-sum games than onto zero-sum games, because it merely acts as a scalar modifier to the relative payoffs of each player. Since everyone still wins, it's easier to justify not winning as much--or in this case, paying just a bit more than you'd like for that bazooka. It is mostly in sports and intellectual quibbles that we see the zero-sum situation. I'd even go so far as to say it doesn't exist in war, as war often has no clear winner or loser.

Real Life is not Super-Rational
The Nash equilibrium for the one-shot PD is for every player to defect. For players to reach this equilibrium, they need to play based on the assumption that everyone else will make the best play. However, each player knows at a gut level that another player's best play may not be rational (in other words, it may be based on some moral high-ground as mentioned above), and thus adjusts his play accordingly. This has led to the development of behavioral game theory, which concentrates on interactions between real people, rather than Super-Rational people. The value of the Nash Equilibrium as an idealized outcome is simply to bound our expectations.
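To make the equilibrium talk concrete, here is a minimal sketch that finds the pure-strategy Nash equilibria of a 2x2 game by checking each cell for mutual best responses (the payoff numbers are those quoted in Aziz's original post, reused purely for illustration):

    # (row player's payoff, column player's payoff):
    # C/C -> (3,3), C/D -> (0,5), D/C -> (5,0), D/D -> (1,1)
    PAYOFFS = {
        ('C', 'C'): (3, 3), ('C', 'D'): (0, 5),
        ('D', 'C'): (5, 0), ('D', 'D'): (1, 1),
    }
    MOVES = ('C', 'D')

    def is_nash(row_move, col_move):
        """A cell is a pure-strategy Nash equilibrium if neither player can do
        better by unilaterally switching moves."""
        row_pay, col_pay = PAYOFFS[(row_move, col_move)]
        row_ok = all(PAYOFFS[(alt, col_move)][0] <= row_pay for alt in MOVES)
        col_ok = all(PAYOFFS[(row_move, alt)][1] <= col_pay for alt in MOVES)
        return row_ok and col_ok

    print([(r, c) for r in MOVES for c in MOVES if is_nash(r, c)])
    # [('D', 'D')] -- mutual defection is the only pure-strategy equilibrium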

There is no One-Shot Game
I truly don't believe there is much to contest in what I've said thus far; I think it is in line with what both Aziz and Keith have stated. In fact, I apologize if it merely sounds like I'm re-hashing what's already been covered. Nonetheless, the main point I wanted to make is the following: I don't think the one-shot game exists in real life.

"Oh, but it does. I interact with strangers all the time." Yes, we all interact with stangers, but do you approach each interaction fresh and without predispositions? No, and in fact this is the premise of my NPC Theory (which could probably use some revisions). Aziz says that Stranger's conglomerate strategy is Random (personal correspondence), but I disagree. I believe that we all have pretty good ideas how interactions with Stranger will go. Think of it as a class heirarchy, where, say, Stranger, Friend, Family, and so on all inherit from the common Person class (or as I put it in my NPC theory, each occupies a different role space).

John is a new Person we encounter in real life. He could be considered an instance of Stranger, and thus has some default properties based not only upon our entire lifetime of experience with Stranger, but also our observation of Stranger's interactions with other Persons. We might know nothing about John in particular, but we nonetheless have a framework within which to make decisions. Thus, while my interaction with John might be a one-shot case, my interaction with Stranger is not; my play is influenced by Stranger's past behavior, and John's play is influenced by his past experiences with other Strangers.
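To put the class-hierarchy analogy into code (a purely hypothetical sketch; the class names and the trust numbers are mine, not part of any formal statement of NPC Theory):

    class Person:
        """Base role: what we assume about any person before we know anything specific."""
        trust_prior = 0.5              # how likely we expect cooperation to be

        def expected_cooperation(self):
            return self.trust_prior

    class Stranger(Person):
        """Default properties accumulated over a lifetime of dealings with strangers."""
        trust_prior = 0.4

    class Friend(Person):
        trust_prior = 0.9

    class Family(Person):
        trust_prior = 0.95

    # John is met for the first time, so he starts out as an instance of Stranger:
    # a "one-shot" meeting, but one informed by our whole history with Strangers.
    john = Stranger()
    print(john.expected_cooperation())  # 0.4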

Granted, the properties of Stranger will vary depending on time and place. A Stranger in feudal Japan has different properties than a Stranger in modern New York. However, we know this, and unless we have been suddenly whisked away to an alien place or have utterly un-plastic minds, we have a good idea of how to interact with Stranger. When we don't, well, that's what we call culture shock.

In summary, I don't see one-shot, zero-sum, payoff matrices as relevant to morality, but I'd definitely be interested to see how game theory as a whole could be.

permalink | posted by Phlegm | 8.12.2006 | 0 comments

 

Payoff and Right.

Aziz writes:
I may have read Keith wrongly, but it seems that he explains Hofstadter's colleagues' defection as a simple function of humans being flawed. Namely, the "right" thing to do was to cooperate, and they were "wrong" to defect. This view is of course founded on the a priori assumption that cooperation still is the "right" course of action, and any deviation from that course is the result of a flawed analysis.

I did couch a potential explanation of their defection in those terms, but not because I was a priori associating cooperation with ‘right’ and defection with ‘wrong’. Hofstadter says the point of the Platonia Dilemma game is to maximize one’s individual return. According to the Platonia pay-off matrix, if everyone cooperates, each will receive the most money ($57). That maximal monetary return is the point of favoring cooperation in this analysis, regardless of whether we consider cooperation the ‘right thing to do.’ (I cooperated because I thought it was the ‘right thing to do,’ but Aziz appears to conflate my motives with my analysis of Hofstadter’s colleagues.)

As an individual entering into the Platonia Dilemma, I must seek to maximize my return. I recognize that if I cooperate AND everyone else cooperates, I’ll get the maximal return possible. However, I have to judge the likelihood that everyone else will cooperate, because my pay-off is dependent on the choices of others. Hofstadter thought that each actor would recognize that all the other actors are rational decision-makers and cooperate based on trust that others would as well. This meta-analysis constitutes ‘superrationality.’

What I was pointing out in my post is that we must consider our own limited rationality as an assumption in our reasoning. When we realize that it’s likely that not everyone will see things the same way, that someone will distrust others and therefore defect in their own self-interest, we end up with the results that Hofstadter got: lots of defections. These individuals were acting in their own self-interest based on the assumption that not everyone would cooperate. That is, they were rejecting the conjunctive assumption ‘AND everyone else cooperates.’ This is not a flawed analysis; it's a perfectly valid one. I’ll grant that my post may have made it sound flawed because I was talking about the "mental defect" of humans as limited creatures. However, that’s just one ground for believing the assumption that ‘someone’s going to defect’ and therefore rationally choosing to defect oneself in order to maximize return. There could be many other grounds for rejecting the assumption that others will cooperate. Since these grounds exist and will be exercised by humans in real-life situations, we do best to avoid single-shot Prisoner’s Dilemma AND Platonia Dilemma situations.
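To put a rough number on that conjunctive assumption, suppose each of the 19 other players cooperates independently with probability q (a back-of-the-envelope model of my own, using the pay-offs quoted from Hofstadter’s letter):

    \[
      E[\text{return} \mid C] = 3 \cdot 19\,q = 57q,
      \qquad
      E[\text{return} \mid D] = 5 \cdot 19\,q + 1 \cdot 19\,(1-q) = 19 + 76q,
    \]
    \[
      P(\text{all 19 others cooperate}) = q^{19} \approx 0.135 \quad \text{even at } q = 0.9 .
    \]

The full $57 requires every single other player to cooperate, and since 19 + 76q > 57q for every q, the defector’s expected return is always higher, exactly as Hofstadter’s letter warns. A little doubt, compounded nineteen times, is all it takes.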

In any event, I don’t think Aziz and I are in fundamental conflict on any of that. However, I am still confused about Aziz’s railing against identifying the Prisoner’s Dilemma with morality:

This is why I stated that morality and the PD are incompatible concepts. Morality is a set of values, the PD is a game based on a payoff matrix. The payoff matrix is not a moral matrix, it is usually rigidly defined according to some quantitative metric: a set number of dollars, for example (in the Platonia Dilemma). Of course the qualitative moral implications are related to the quantitative payoff, but that relationship is often quite complex.

Who’s trying to say that the Prisoner’s Dilemma == morality? All I’ve heard in the literature is the very plausible argument from Axelrod et al. that situations arising in natural selection that can be modeled by the Iterated Prisoner’s Dilemma give rise to seemingly altruistic behavior that, in humans, could explain in naturalistic terms the origin and evolutionary success of morality.

I like Keith's argument that we need to find a way to rephrase social problems so as to avoid the Prisoner's Dilemma entirely, but I think this is largely an impossible task, aside from ruinously forceful intrusion of the government into personal liberties. And even in that scenario, what the government decides is "right" is not necessarily so. You can never escape the Henkin paradox when dealing with issues such as Right and Wrong in social policy.

While I agree with the last sentence at least to the extent that we’ll rarely have unanimity on the right course of action, I disagree with Aziz’s pessimism that any government interventions must be ruinous to personal liberty and with the implication that we should do nothing even if the preponderance of evidence indicates we should act.

In fact, the Prisoner’s Dilemma shows us how to take appropriate action in a minimally interventionist way. The EPA’s SO2 emissions trading scheme is a great example. Here’s a pollutant that causes acid rain, a social negative only partially felt by the actor (the energy company) causing the problem by burning coal to generate electricity for sale (the pay-off that’s all theirs, creating an incentive to pump more SO2 into the atmosphere). The EPA could’ve banned SO2, but that would’ve (in the short term, for certain companies and localities) been economically disastrous. So they instituted caps and started an emissions credit trading market that changed the pay-off matrix such that emissions would be reduced without micromanaging regulation for each utility.
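As a toy illustration of what ‘changing the pay-off matrix’ means (a sketch with made-up numbers: the base payoffs are the per-pair amounts from Hofstadter’s letter, and t is a hypothetical per-defection cost standing in for the price of an emissions credit):

    # Per-pair payoffs from Hofstadter's letter: C/C -> 3 each, D vs C -> 5/0, D/D -> 1 each.
    # A regulator adds a cost t to every defection.
    BASE = {('C', 'C'): 3, ('C', 'D'): 0, ('D', 'C'): 5, ('D', 'D'): 1}

    def payoff(my_move, other_move, t):
        return BASE[(my_move, other_move)] - (t if my_move == 'D' else 0)

    def dominant_strategy(t):
        """Return the move that beats the alternative against every opposing move, if any."""
        for move, alt in (('C', 'D'), ('D', 'C')):
            if all(payoff(move, o, t) > payoff(alt, o, t) for o in 'CD'):
                return move
        return None

    for t in (0, 1, 2, 3):
        print(f"defection cost t={t}: dominant strategy = {dominant_strategy(t)}")
    # t=0 -> D, t=1 and t=2 -> None, t=3 -> C: once the cost exceeds the gain
    # from defecting, the matrix itself starts favoring cooperation.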

Of course, we’re all used to and appreciative of much more intrusive regulation on a personal level already. Enforcing property rights is one way the government right now avoids situations where the pay-off matrix in a single-shot interaction favors ‘defection’ as in stealing and plundering. We’ve accepted the restriction on personal liberty to not take another’s property and it’s served us well to have government enforce such regulations strictly.

The bottom line is that the intervention must be proportional to the harm. The real problem with government intervention is in the disagreements in assessing harm and therefore the level of regulation required to avoid it.

permalink | posted by Keith Gillette | 4.17.2004 | 0 comments

 

Right and wrong.

As Keith noted, the single-shot PD's self-interest matrix is biased towards defection. Note that tit-for-tat is only an optimal strategy for iterated PD, and so it is not really applicable to the single-shot case, since the interaction may never progress to "tat."

I may have read Keith wrongly, but it seems that he explains Hofstadter's colleagues' defection as a simple function of humans being flawed. Namely, the "right" thing to do was to cooperate, and they were "wrong" to defect. This view is of course founded on the a priori assumption that cooperation still is the "right" course of action, and any deviation from that course is the result of a flawed analysis. However, as Hofstadter himself points out, there is an analogy to the Henkin sentence "This sentence is true." It is equally defensible on the grounds of logic alone to believe the sentence to be "right" or "wrong."

This is why I stated that morality and the PD are incompatible concepts. Morality is a set of values, the PD is a game based on a payoff matrix. The payoff matrix is not a moral matrix, it is usually rigidly defined according to some quantitative metric: a set number of dollars, for example (in the Platonia Dilemma). Of course the qualitative moral implications are related to the quantitative payoff, but that relationship is often quite complex.

Take Kim's Wal-Mart example. You could make a moral argument that supporting that store helps fund social causes that negatively affect society. You could make an equally compelling moral argument that the money you save, partly used to fund social causes that benefit society, has more effect. My yearly donation of $15 to the Sierra Club has a real impact in terms of lobbyist salaries and influencing policy, for example, whereas since there are always people willing to defect and shop at Wal-Mart, the net effect of not shopping there is almost negligible (and Kim ends up having less money in her pocket anyway).

Does that mean that you should shop at Wal-Mart even if you believe in the social causes at risk by that store's methods of doing business? Not at all - see graphic at left (from article in Business Week, free registration required).

What is misleading in the Costco example, however, is the selective choice of metrics used to make the argument. Yes, profits per employee and per square foot are higher than Wal-Mart's, but what if you compare stock prices? COST | WMT

By which metrics do we determine the winner of the payoff matrix, then? Clearly, the "winner" is subjective. Wal-Mart "defects" by paying employees less, Costco cooperates by paying them more, and yet both claim victory. One consumer shops at Wal-Mart and uses a portion of the savings to donate to liberal organizations, another shops at Costco and pays more. Both will claim moral high ground. You could take the argument to an extreme and ask, why do we spend even one dollar on the space program or the NEA when we could be ending world hunger with the same monies?

Overall, these gray areas are inescapable. This is the real reason that Hofstadter's colleagues - including Axelrod himself - were perfectly justified in choosing to defect. Not because they were wrong, but because they were also right. Right and Wrong simply don't map cleanly onto the payoff matrix space.

I like Keith's argument that we need to find a way to rephrase social problems so as to avoid the Prisoner's Dilemma entirely, but I think this is largely an impossible task, aside from ruinously forceful intrusion of the government into personal liberties. And even in that scenario, what the government decides is "right" is not necessarily so. You can never escape the Henkin paradox when dealing with issues such as Right and Wrong in social policy.

It is tempting to try to relate these kinds of issues to simple one-dimensional morality issues like "do not steal" and "share," but in real life it is precisely the more complex, gray-area situations that we most need to solve, and those are not so easily characterized. In that sense, Keith's wish is granted, since the PD is simply inapplicable.

permalink | posted by Aziz P. | 4.12.2004 | 0 comments

 

I choose to cooperate.

I do so not because I think it'll necessarily bring me the greatest benefit in any given instance but because it's the right thing to do and I was taught to do the right thing.

I must grant the possibility (to Kim & Nigel) that if one were to psychoanalyze me, I may only be "doing the right thing" even when it doesn't appear to be in my self-interest because I'm subconsciously performing the hedonistic calculus that puts "feeling superior" for having done the right thing on the positive side of the balance, compensating for whatever exterior loss I may be facing. That would mean I'm still acting purely in my own self-interest, just with my own personal, internal, subjective pay-off matrix. It's quite hard to say.

It's also possible that I'm choosing to cooperate not based on any extensively reasoned basis at all. As Daniel Dennett argues compellingly in Darwin's Dangerous Idea, Herbert Simon's method of "satisficing" is the only practical way for a limited being such as a human to come to a decision in the first place. Reasoning through all the possibilities is, for most purposes, impossible. There's simply not enough time to calculate all the effects of a particular line of conduct. Humans have therefore developed a lot of shortcuts and "conversation stoppers" to end discussion and choose a path. As Aziz says, (human) reason is not logic. Suboptimal as it may be, our shortcut, satisficed decisions are "close enough" and better than the paralysis by analysis that gives us no decision at all. (Sabertooth tiger approaching. Hmmm. If I were to climb a tree I could escape being eaten which would be good? Which tree? Well, that one's too small. That one's too slippery. If only I had a large club with me. Perhaps I could fashion one out of some nearby debris. What if we invented a sort of poking device with a long stick and a sharpened rock? Eaten! Should’ve just RUN!)

Morality, a set of prescriptive rules handed down from generation to generation, is one of those satisficing intellectual shortcuts, a conversation stopper that keeps us making fairly successful decisions without laboring over them each time. It's what gives me that sense of "right" that impels me to cooperate. However, if we accept the Darwinian framework, that I was taught that cooperating was "the right thing" is in need of some explanation. For survival purposes, one would expect individuals to always act only in their own self-interest, which would on the surface imply non-cooperation. However, as Robert Axelrod has so elegantly shown using the iterated prisoner's dilemma (IPD), cooperation is an evolutionarily stable strategy that explains the existence of "altruism" on the very basis of self-interest. It is, as Kim says, the long view of "enlightened" self interest.

In the single-shot PD, however, the opposite is true and self-interest favors defection. Yet in the everyday course of life we more often encounter variations of the IPD, so we should be more inclined to act in accord with the "tit-for-tat" strategy of cooperating but punishing defections than with the purely selfish calculus of the one-shot pay-off matrix. One of the rules of that tit-for-tat strategy is to cooperate first, only defecting to punish past defections. Perhaps that's another reason why I choose to cooperate. While I may consciously know that I would be better off financially defecting in a one-shot PD situation, do I really know it's a one-shot PD? Even if I'm guaranteed never to meet the same actors in the same situation again, who's to say that I won't face off against someone watching this match in the future? People have long memories and don't often forget cheating. In an IPD world, grudges have useful survival benefit.
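Tit-for-tat itself really is as simple as it sounds. Here's a minimal sketch of it in an iterated PD (the payoff numbers are the per-pair amounts from Hofstadter's letter, reused purely for illustration):

    # (my payoff, opponent's payoff) per round
    PAYOFF = {('C', 'C'): (3, 3), ('C', 'D'): (0, 5),
              ('D', 'C'): (5, 0), ('D', 'D'): (1, 1)}

    def tit_for_tat(opponent_history):
        """Cooperate first, then simply copy the opponent's previous move."""
        return 'C' if not opponent_history else opponent_history[-1]

    def always_defect(opponent_history):
        return 'D'

    def play(strategy_a, strategy_b, rounds=10):
        score_a = score_b = 0
        hist_a, hist_b = [], []          # each strategy sees the *other's* past moves
        for _ in range(rounds):
            move_a, move_b = strategy_a(hist_b), strategy_b(hist_a)
            pay_a, pay_b = PAYOFF[(move_a, move_b)]
            score_a, score_b = score_a + pay_a, score_b + pay_b
            hist_a.append(move_a)
            hist_b.append(move_b)
        return score_a, score_b

    print(play(tit_for_tat, tit_for_tat))    # (30, 30): mutual cooperation every round
    print(play(tit_for_tat, always_defect))  # (9, 14): one sucker's payoff, then mutual defection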

The reason Hofstadter's colleagues disappointed him in his Platonia Dilemma experiment is that, unfortunately, we don't live in Plato's perfect world of forms. People are not perfect. We are not (all) superrational. We make decisions by satisficing. As Aziz says, it's entirely possible for even rational thinkers to be wrong. This mental defect (a side effect of being limited beings, really) is a basic assumption that must be included in any discussion regarding human action. When we do include this assumption and recognize not everyone will reason the same way we do, we end up with the "reverberant doubt" that causes defection in the Platonia Dilemma: Someone's going to defect. Someone's going to cheat! Fully understanding the implications of that assumption reveals the foolishness of both the trickle-down economics that Kim rails against and the expectation that any other group will consistently "do the right thing" when it's not in their self-interest (economic or otherwise) to do so. (Kim & I being the superrational exceptions to the rule. Or more likely, just the exceptions to the rule based on our personalities or the good job our parents did teaching us right from wrong or some other factor having nothing to do with Hofstadter's notion of superrationality.) But regardless of how many "good apples" cooperate, someone's going to cheat.

Given that, the answer to the conflict between morality and self-interest is not to fight human nature but to change the rules of the game. Aziz is correct when he says the demands of the PD and morality are often in conflict.* In particular, they are in conflict in single-shot PD scenario. So, we need to avoid single-shot PDs whenever possible. We must avoid "tragedy of the commons" situations (whose pay-off matrix is identical to the PD and results in the same kind of defection) by imposing external regulations or costs that change the matrix to favor cooperation. This is how we protect the "do gooders" from the less scrupulous "bad apples" who act only in their short-term self-interest. We invent strategies that maximize cooperation for non-zero-sum benefit. We carefully constrain zero-sum interactions. We find the defectors and punish them.

Of course, that leaves open the question of what we should do when we find ourselves in a single-shot PD situation. How do we act when they are in direct conflict and we don't have recourse to external structural changes to change the pay-off? How do we choose between "doing the right thing" and acting in our own self-interest? Between morality and the PD pay-off matrix?

What do we do? We satisfice. Aziz defects, knowing the cruel logic of the PD. Kim & I cooperate, assuming either the hedonic benefit of the high moral ground or slavishly following our internalized morality or allowing for the more pragmatic possibility there will be another match to play and what now appears a single-round PD is in fact an IPD in a larger context.

Given the particular situation and the particular pay-off, I expect that any of us may flip-flop our decision for one (satisficed and limited) reason or another.



*Postscript: While I agree that in the single-shot PD, the PD pay-off and demands of morality are in conflict, that's not the case in the IPD. I'm not sure what Aziz is referring to when he finds "attempts to unify morality and PD-strategy to be deeply flawed." The analyses I've read from Axelrod (cited by Aziz), reiterated and expanded on by Robert Wright (The Moral Animal, Nonzero), are explanations of the appearance of cooperation and morality based on the survival advantage granted by such win-win behavior in the commonly occurring IPD scenario. I've not heard of anything that equates defection in the single-shot PD with the moral choice of "doing the right thing", which is what Aziz seems to imply when he calls them orthogonal decision-making strategies.

I also would quibble with Aziz's assertion that the demands of morality are subjective. While I can certainly agree that the body of rules handed down from generation to generation is subjective in its purported moral content, there is a sense in which, to the extent those rules confer survival benefit, support the growth of life, and open up potential, they have "objective" value. I believe that the naturalization of ethics through the IPD points a way to a non-subjective morality, but that's a matter for a different blog.

permalink | posted by Keith Gillette | 4.05.2004 | 0 comments

 

I choose to defect.

Kim writes below that "Unquestionably in these one-shot Prisoner's Dilemma situations, cooperation is better." - but I can't bring myself to agree. The case of the chocolate donut is a misleading analogy, because ultimately the value of the donut is purely subjective. We aren't talking about competing for an astronaut slot on Space Station Alpha or the last ticket out of Ho Chi Minh City circa 1973. It's just a donut. The vast majority of Prisoner's Dilemma situations that we come across are of similarly less-than-weighty import, and I think that there is a clear reason to defect rather than cooperate.

I don't think you can unify the demands of morality (which are also subjective) with the simple strategy of playing a PD game. You have to choose. The PD almost always is in conflict with morality, and which choice you make is highly dependent on the desired outcome.

I find most of the attempts to unify morality and PD-strategy to be deeply flawed. These are orthogonal decision-making strategies.

permalink | posted by Aziz P. | 11.01.2003 | 0 comments

 

Enlightened self-interest.

It seems to me that Hofstadter was missing a piece, or else that decision-making of this kind is based upon iterative layers of rationality. I used to debate with my good friend Nigel about why people choose to do "good." His theory was that doing good resulted in either material or mental benefits to the do-gooder, and so the behavior is inherently selfish. His take was that all behavior is inherently selfish at some level. My take then was that people choose to do good because of an adherence to an ethical or moral structure. (Let me pause and define: for the sake of argument, by "good" here I mean "selfless," and the acts in question may or may not actually result in good to others, but the do-gooder must believe that they would, and that belief must be the motivation for the act.)

What I now believe is that adherence to an ethical or moral structure, and choosing to act selflessly in conformance therewith, is just a long-view "enlightened self-interest." On the other hand, I believe as well that some people adhere to their ethics and morals, benefiting others far more than themselves, even without the slightest hope that others will do the same. Some people are willing to "prime the pump" of social behavior for their whole lives, and get shafted as a result. So perhaps those people are engaging in species/genetic enlightened self-interest by defining "self" as broader than the individual human in question.

It's difficult to know if I'm conveying what I'm thinking, so I'll use an illustration. I walk into a donut shop and choose between a glazed and a chocolate glazed donut. If there are ethical implications to that choice, they're difficult to discern. If, instead, I'm neutral as to the choice, but the person behind me is telling her friend that she's been waiting all day to get a chocolate donut, and I see that there is only one left, I have a choice to either benefit her and choose the other donut, or harm her without benefiting myself. That one is easy. If, instead, I'm partial to the chocolate myself, I have a choice. Do I choose to satisfy my desire at the expense of the person who got there after me, or do I choose to leave her the donut anyway? No OBVIOUS benefit can accrue, since she probably won't know I've done her a favor. By Nigel's way of thinking, someone with a particular social bent that informs their ethics would leave the chocolate donut, and do so for the following reasons: 1. very little harm to self, moderate benefit to other; 2. get pleasant feeling and boost to self-worth for doing something "good." I see a third reason. 3. Do unto others as you would have them do unto you. A very old formulation, even an "old saw" that we teach to children as a rule to live by...but maybe there's more to it than that.

Unquestionably in these one-shot Prisoner's Dilemma situations, cooperation is better. In real life, cooperation is better too. The truth lies in a bunch of tired old sayings. The ones about one person making a difference, or every good movement starting with one person...those aren't just optimistic patter, those are true. In every case of a good idea that caught on, someone had to start. Each individual has hundreds or more choices a day to make, and many of those choices will be ones with impact on other people. In most of those cases, the choice will be between doing good for others, and not doing good for others. Many times doing good for others means an apparent direct harm to self, and a difficult-to-discern long-term benefit. Often those long-term benefits are bad bets.

Tradeoffs are present in every area of life. Currently as a nation we're engaged in discussing security and freedom, because those things are only marginally compatible. Some of us see no point in security if our freedom is sacrificed. Others see compromises to be made for those less-used freedoms. This works the same way. It's easy to make choices to eschew direct benefit when the return from indirect benefit is likely to be high. In other words, if I belong to a small group of friends, and we like to go out to lunch a lot, and we each pick up the tab once in a while, it's nice, feels good and friendly, lets us have a free lunch those times when we're feeling poor, lets us feel generous those times we're feeling well-off, and saves us all the hassle of splitting the bills. In a group like that, defection is unlikely, and everyone feels that they're probably coming out just fine in the end. In larger groups, defection becomes more likely. In vast groups like cities, you get a bunch of people watering their yards in a drought because A. "other people are, why should my lawn suffer?" or B. the net effect is large, but the individual effect is miniscule, or "what can it hurt, if we're all going to go thirsty, my ten gallons of lawn-water won't make a difference."

So when defection is common, and the group is large and somewhat faceless, the decision becomes one that seems to come down to arbitrarily adhering to an ethic of no apparent self-benefit or choosing the direct self-benefit. If I shop at Wal-mart, I support several conservative causes and several conservatives, that/who I believe are abhorrent. I can choose to go to a local store, benefit a local retailer, support my local economy, and not support school vouchers. I pay a little more to do that, and I may experience a little inconvenience (maybe several stops instead of one). The ethic is obvious here, but it's still very tempting to "defect" and join the masses at Wal-mart. My handful of money doesn't make much difference, but a LOT of people's handfuls of money make a big difference.

So the decision is between two "layers" of rationality, and/or the belief that ethical behavior is deserving of adherence because once you know the "right" thing to do, the "do unto others as you would have them do unto you" is ultimately a "winning" strategy, at least as good as the one that tries to gain every direct benefit. Does it result in the greatest return in the "real world?" That's hard to say. Each decision is unique, and each person must make each decision on an individual basis. Those decisions that leave me shaking my head (eschewing a very large direct benefit, or gambling on a losing long-term benefit) are difficult to make, but then when I manage to "do the right thing" anyway there are some comforting things to consider. I'm a little superstitious. When I flip through the channels on TV, I will sometimes turn off something that attracted my attention simply because it's SO bad (thank you Fox) that I'm appalled that it would be on. I might be interested in watching it, but I superstitiously believe that if I turn it off, then it's more likely that OTHER people will also be turning it off at the same time. As if I can shift the statistics all myself. If I turn something off, then there are probably a bunch of people who are similar to me, who might then also be turning it off, not because I influenced them, but simply because there's some likelihood that if one person does something and a lot of people are in a position to do the same thing, that more than one person will probably be doing it. Along the lines of the way politicians tally letters and emails. If one person writes a letter, the politician assumes that a certain arbitrary number of others probably felt the same, but just didn't write.

Choosing not to participate in something "everyone does" because it appears to have negative social value is a reasonable decision, even if it has direct harm and little likely indirect benefit. White plantation owners who freed their slaves before the civil war suffered financial and social harm to a huge extent, but they did the right thing, with little likelihood of indirect benefit. That was a large decision. Our lives are mostly composed of small decisions. How much easier, then, to laugh, give up the direct benefit (the "rational choice" to defect) and just do the thing we believe to have the most benefit to society and self if everyone else were to make the same decision? Why play chicken? Why live our lives in an arms race trying to acquire the most, or to prevent anyone else from acquiring more? How does that benefit us? In the final analysis, Nigel may have hit the nail on the head, why not do good for the mental benefit it brings just KNOWING you've done good? Maybe that's enough to compensate for the loss of some other benefit in most cases.

permalink | posted by araven | 7.07.2003 | 0 comments

 

The Platonia Dilemma.

I consider myself a reasonably intelligent guy, with rather strong opinions, and a sufficient amount of self-criticism. Note the presence of moderating adjectives in every sub-phrase - reasonably, rather, sufficient. Like a moth to flame, therefore, I am always drawn to writing by those I admire for the same qualities, only more so (without the adjectives). One of these is Kim, a dear ultra-liberal friend from college, still one of my closest (and always will be). Kim is formidably intelligent, with blazingly strong opinions, and fanatical self-criticism. I admire her greatly, because she tends to write things like this:

I've gotten used to hearing Republican claims and assuming that the claimers are just lying scum out for their own advantage, or people who haven't considered the issues thoroughly. I think most are, but some might just be superrational people who see an economic model that could be workable in a platonic world, just as my very liberal view of people acting decently and working hard just because it's the right thing to do may not play so well in the real world...but it requires my adherence anyway. Might make for more interesting discussions with my right-leaning friends, and non-friends. It's interesting to see that I've been long arguing against voodoo economics based on its inapplicability in the real world (actual rich people are the ones best at never letting a dime escape their clutches, so what exactly would "trickle?") and getting very angry when people argue that in the real world, lots of people will refuse to do the right thing environmentally or otherwise, so good people get the short end of the stick. I argue, "so what" be ethical anyway, and deny that in the real world MOST people are evil. Well, I guess the superrational Reaganomics fans think that surely MOST rich people aren't evil, and enough of them would take their ill-gotten capital gains and do good with them. Interesting.


The reference to "super-rational" is from Douglas Hofstadter's book, Metamagical Themas, in relation to the Prisoner's Dilemma[1]. A super-rational (SR) player is defined recursively as one who assumes the other players are also superrational, and chooses to cooperate in order to maximize gain. Unlike the "iterated" PD, where the best strategy is "TIT FOR TAT"[1], Hoftstatder explored the "one-shot" PD, and tried to rationalize a strategy that favors cooperation rather than defection (p.730):

I found that I could not accept the seemingly flawless logical conclusion that says a rational player in a noniterated situation will always defect. In turning this over in my mind and trying to articulate my objections clearly, I found myself inventing variation after variation after variation on the basic situation.


This led Hofstadter to come up with what he called the "Platonia Dilemma", based on the following payoff matrix[2]:

             C        D
    C      (3,3)    (0,5)
    D      (5,0)    (1,1)


and he then sent out a letter to 20 of his friends and acquaintances, chosen for their rationality and familiarity with the PD concept, including notables such as Martin Gardner and Bob Axelrod, whose own research (summarized in his amazing book, The Evolution of Cooperation) proved the superiority of the TIT FOR TAT strategy in the iterated PD case. The letter read (in part):

Each of you is to give me a single letter: "C" or "D", standing for "cooperate" or "defect". This will be used as your move in a Prisoner's Dilemma with each of the nineteen other players. The payoff matrix I am using is [see above].

Thus if everyone sends in "C", everyone will get $57, while if everyone sends in "D", everyone will get $19. You can't lose! And of course, anyone who sends in "D" will get at least as much as everyone else will. If, for example, 11 people send in "C" and 9 send in "D", then the 11 C-ers will get $3 apiece from each of the other C-ers (making $30) and zero from the D-ers. The D-ers, by contrast, will pick up $5 apiece from each of the C-ers, making $55, and $1 from each of the other D-ers, making $8, for a grand total of $63. No matter what the distribution is, D-ers always do better than C-ers. Of course, the more C-ers there are, the better everyone will do!

By the way, I should make it clear that in making your choice, you should not aim to be the winner, but simply to get as much money for yourself as possible. Thus you should be happier to get $30 (say, as a result of saying "C" along with 10 others, even though the D-ers get more than you) than to get $19 (by saying "D" along with everybody else, so nobody "beats" you).
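For readers who want to check the letter's arithmetic for themselves, here is a minimal sketch (the dollar amounts are exactly those in the letter above):

    def payout(my_move, n_other_cooperators, n_players=20):
        """Total dollars for one player, given how many of the other 19 cooperate."""
        n_other_defectors = (n_players - 1) - n_other_cooperators
        if my_move == 'C':
            return 3 * n_other_cooperators
        return 5 * n_other_cooperators + 1 * n_other_defectors

    print(payout('C', 19))   # 57 -- everyone cooperates
    print(payout('D', 0))    # 19 -- everyone defects
    print(payout('C', 10))   # 30 -- 11 C-ers, 9 D-ers: each C-er's take
    print(payout('D', 11))   # 63 -- 11 C-ers, 9 D-ers: each D-er's take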


Hofstadter set this PD up with a clear subtext that cooperation is preferred. Note that the payoff matrix does reward defectors, but only if there are very few. A SR thinker would presumably choose to cooperate to maximize the probability of a large payoff. His expectation was that most would choose to cooperate (p.746):

Any number of ideal rational thinkers faced with the same situation and undergoing similar throes of reasoning agony will necessarily come up with the identical answer eventually, so long as reasoning alone is the ultimate justification for their conclusion. Otherwise reasoning would be subjective, not objective as arithmetic is. A conclusion reached by reasoning would [then] be a matter of preference, not necessity. Now some people may believe this of reasoning, but rational thinkers understand that a valid argument must be universally compelling, otherwise it is simply not a valid argument.

If you'll grant this, then you are 90 percent of the way. All you need ask now is, "since we are going to submit the same letter, which one would be more logical? That is, which world is better for the individual rational thinker: one with all C's or one with all D's?" The answer is immediate: "I get $57 if we all cooperate, $19 if we all defect. Clearly I prefer $57, hence cooperating is preferred by this rational thinker. Since I am typical, cooperating must be preferred by all rational thinkers. So, I'll cooperate."


Italics are his emphasis, underlines mine. A clear flaw is the assumption that all players are SR. I have underlined the parts of his argument where this assumption is explicit. Another clear flaw is the assumption that the "throes of reasoning agony" will be correct - it is entirely possible for a rational thinker to simply be wrong. This can be due to flaws in logic, omission or ignorance of key facts, or flawed assumptions.[3]

When Hofstadter tallied up the responses, he found (much to his chagrin) that there were 14 defections (each earning $43) and only 6 cooperators (each earning $15). This, despite the fact that he had unconsciously (?) "biased" the sample of participants by selecting his own friends and acquaintances based on his evaluation of their "rationality" - even people like Bob Axelrod and Martin Gardner who are intimately familiar with the PD and game theory (both of whom chose to defect, BTW). Hofstadter writes:

It has disturbed me how vehemently and staunchly my clear-headed friends have been able to defend their decisions to defect. They seem to be able to digest my argument about superrationality, to mull it over, to begrudge some curious kind of validity to it, but ultimately to feel on a gut level that it is wrong, and to reject it. This has led me to consider the notion that my faith in the superrational argument might be similar to a self-fulfilling prophecy or self-supporting claim, something like being absolutely convinced beyond a shadow of a doubt that the Henkin sentence "This sentence is true" actually must be true - when, of course, it is equally defensible to believe it to be false. The sentence is undecidable; its truth value is stable, whichever way you wish it to go (in this way, it is the diametric opposite of the Epimenides sentence "This sentence is false", whose truth value flips faster than the tip of a happy pup's tail). One difference, though, between the Prisoner's Dilemma and oddball self-referential sentence is that whereas your beliefs about such sentences' truth values usually have inconsequential consequences, with the Prisoner's Dilemma, it's quite another matter.


Again, italics are his emphasis, underlines mine. The key to understanding why seemingly "rational" thinkers could take the same facts and arrive at different results is that reason is not logic. The mechanics and machinery of our intellect are ultimately Gödelian - no system of analytical statements and rules and formal descriptors can ever fully model it. In fact Hofstadter himself in the same book devotes much time to emphasizing this, that ultimately intellect must be an emergent and statistical property - even creating a wonderful symbolic model called "The Careenium" (a great pun which I won't spoil. Read the book :) to illustrate it.

This is why self-examination is essential. Thought and analysis are ultimately founded upon "gut instinct" as much as pure facts and figures and logic - we cannot escape it. This also is an argument for diversity - because in combining the analyses of two people, who arrive at different conclusions from different facts, we are able to better triangulate the reality which underlies all of existence, towards which we must each grope half-blinded when alone.

Self-examination of the kind that Kim engages in so directly and willingly is essential to improving ourselves and the world. And the lessons of the PD are one such route to that goal. Ultimately, though, we do have to apply reason as we understand it, not as we think others do.


[1] Steven Den Beste had a good article on TIT FOR TAT in the iterated PD (which does NOT apply to the one-shot PD, of course).
[2] I assume the reader is familiar with the basic concept of the PD, the "payoff matrix" representation, as well as the terms "cooperate" and "defect" in that context. If not, I highly recommend Hofstadter's book (Metamagical Themas) or good ol' Google.
[3] this paragraph is self-referential in classic Hofstadter tradition :) .

permalink | posted by Aziz P. | 2.17.2003 | 1 comments