Log in

View Full Version : 2008 Election Model Methodology: Assumptions, Calculations and a Challenge



TruthIsAll
09-25-2008, 01:44 PM
2008 Election Model Methodology: Assumptions, Calculations and a Challenge

TruthIsAll

This post will describe the essential methodology used in the 2008 Election Model for calculating the expected electoral vote and win probability. It will show why the 99% Obama win probability calculation is mathematically correct assuming a) the election is held today b) is fraud-free and c) the latest state polls represent true voter preference. Important issues such as cell-phone users, RV vs. LV polls and media bias will not be considered. A few critics of the model apparently still believe that Bush won fairly and that Obama only has a 75% win probability. Hopefully, they will better understand the Election Model methodology and look at the track record. It would be foobar to do otherwise.

Fivethirtyeight.com has a great site with a tremendous amount of information. But their win probabilities are much too low; it’s mathematically incorrect to derive a 75% win probability to a 311-227 EV vote split. The win probability is incompatible with the split, as is seen from running a 5000 trial Monte Carlo simulation. The assumption is that the 538 model provides a snapshot based on the latest polls. If it is attempting to project the EV on Election Day, it should be made clear.

Those who compare Monte Carlo simulation probabilities to the Intrade odds are mixing apples and oranges. The Election Model produces the theoretical probability of Obama winning the Electoral Vote based on the latest state polls. Let’s give the Intrade bettor credit; he does not run a Monte Carlo simulation or view the polls in isolation. He most probably relies on the daily media commentary while consciously or unconsciously factoring in the potential for fraud. After all, Bush stole it in 2004, didn’t he? He won the recorded vote and that is what they pay off on. So what if Kerry won the True Vote?

The betting payoffs are based on the recorded vote – even if it was stolen. On Election Day 2004, Kerry was leading the Iowa election trading markets up until 9pm when the numbers magically shifted to Bush as the fraud kicked it. So you had late panic buying for the Bush shares. So let’s put at an end to the apples and oranges (Monte Carlo vs. Intrade) argument. In that regard, the Election Model provides a fraud scenario analysis. When will the other sites?

There are some critics who skip over my standard caveats, much less the analysis: The Election Model does not attempt to predict the election results months in advance of the election, as do most academic models. It is designed to calculate the expected electoral vote winner assuming a) the election is held today and b) is fraud-free. The model calculates the effects of uncounted and switched vote scenarios on Obama’s popular and electoral vote while other models don’t even mention election fraud. But election fraud is not the issue here; the objective is to explain why the Election Model produces such a high win probabilities.

Projecting state vote shares and win probabilities
The latest state poll is used to project the 2-party vote share. Based on the projection, the state win probability is calculated. To project the state vote, the Election Model allocates undecided voters (UVA) – a simple calculation. But we don’t want to rely on a single UVA estimate; the model calculates the projected vote over a range of scenarios -from 40-80% with 60% as the base case. I leave it to the reader to choose whatever UVA he/she feels comfortable with.

Since Obama has 53% of the 2-party national poll, it is not unreasonable to assume that he will get 60% or more of the undecided vote. Obama can be considered the challenger since McCain is running for the third Bush term. Pollsters typically assign 75-90% of the undecided vote to the challenger. In 2004, Gallup allocated 90% of the undecided vote to Kerry; Zogby and Harris, 75-80%. That is the rationale. In fact, 60^% for Obama is probably conservative, especially with the unfolding events over the past few days.

The national polls are more current than the states. Since Obama has a 53% 2-party share right now, it is to be expected that the state polls will climb. Since June, there has been a 0.57 correlation between the national and state polls.

If Obama’s projected 2-party vote share is V, then his state win probability is: calculated by the Excel formula:
P= normdist (V, 0.50, MoE/1.96, true). The Margin of Error is set to 4% for a typical state poll of 600 sample size. That’s it. There should be no argument as to the mathematics.

Calculating the expected electoral vote and win probability
The expected state EV is the state win probability times the electoral vote. If the probability is 50% and the state has 20 EV, then Obama gets 10 and McCain gets 10. Now just add up the 51 expected EVs to get the Total Expected EV. So the math is exceedingly simple. The tricky part is projecting the state vote shares starting with the latest polls. The Election Model applies the UVA to derive the projected 2-party vote. The 51 state win probabilities are input to a 5000 election trial Monte Carlo simulation. The probability that Obama would win the EV is just the number of winning election trials/5000.

How does fivethirtyeight.com do it? We know thatelectoral-vote.com and RCP just add up the electoral vote based on who is ahead in the latest poll (average). But that can be misleading. What if McCain (or Obama) leads by 51-49% in 5 states with 100 EV? Electoral-vote.com and RCP would just give all 100 electoral votes to the leader. This is incorrect since there is no accounting for the polling spread. The probability is 31% that the trailing candidate will win the state.

The total EV is just the sum of the expected state EVs. If you agree with the state win probabilities, then the expected Electoral vote is a no-brainer.
The electoral vote win probability is simply the number of Monte Carlo simulation trial wins/5000. End of story.

Here’s a challenge to those who have criticized the 2008 Election Model’s win probability calculations. Compare the methodology to fivethirtyeight. But be specific. What was the 2004 fivethirtyeight projection? The Election Model correctly projected that Kerry would win the True Vote with 99% win probability - just like Obama. Of course, those who still believe that Bush won legitimately will never be convinced.


The 2004 Election Model

It’s hard to believe. But the same naysayers have been using the same old, discredited arguments for four years in their fruitless attempts to cast doubt on the 2004 exit polls which indicated that Kerry won the election.

The final 2004 Election Model projected that Kerry would win 51.8% of the 2-party vote with an expected 337 EV. The unadjusted aggregate exit poll (WPE) indicated that Kerry won 52.5%. The Kerry winning states totaled 337 EV.

Naysayers were challenged in the Democratic Underground Game Thread to provide a mathematically feasible and plausible Bush win scenario. In order to comply with the rules of the “game”, they had to use feasible weights based on the recorded 2000 and 2004 vote, annual 0.87% mortality rate and estimated 95% turnout of 2000 voters. They presented a spreadsheet to show a scenario for Bush to achieve his 3 million vote “mandate”.

In order to match the recorded vote, they adjusted the Bush Final NEP vote shares to implausible levels. Their scenario was based on the following assumptions:

1) One in 7 (14.63%) Gore 2000 voters defected to Bush in 2004.
The 12:22am NEP reported 8% (10% in the 2pm Final).

2) Kerry won just 52.90% of DNV (new voters and others who did not vote in 2000).
The NEP reported 57% (54% in the Final).

3) Just 7.20% of Bush 2000 voters defected to Kerry.
The NEP reported 10% (9% in the Final).

On the other hand, the True Vote model, which used feasible weights and plausible vote shares, determined that Kerry won by 52.6-46.4%.

The assumptions were:
1) 0.87% annual mortality
2) 95% turnout of Gore, Bush and Other 2000 voters in 2004
3) 125.74m total votes were cast (Census) in 2004
4) 12:22am NEP vote shares

Here is a challenge for those who still believe that Bush won by 3 million votes:
Download the The Election Calculator Model and construct a 2004 Bush win scenario.

http://www.geocities.com/electionmodel/2008ElectionModelChallenge.htm