The Texas Hold'em Strategy Guide and Online Poker Forum Community
Poker
TOOLS

Poker Forum

HOTFTR11 at Carbon Poker: $50 Money-Added Tourney - Dec 11th Subscribe to FTR web feed
Already Registered?      Username:    Password:   Remember Me         Forgot Password

  >    > 

Double or Nothings (DON) and Statistical Power

  
Page 1 of 1  ||  Post new topic  |  Post reply

Author Message
Nakamura
Post Posted: Sun, 08 Nov 2009, 11:11am    Post subject: Double or Nothings (DON) and Statistical Power Reply with quote
Full House
Full House

Joined: 24 Mar 2008
Posts: 925
WPP: 139
Location: Cape Town
With the rain beating down and feeling a bit sick, I was a bit bored today when I found my stats book. I've had something on my mind for a while now; I was wondering how many double or nothing tournaments do you need to play to be certain you are running at a particular level? If your mate is running at 7.5% over 2000 tournaments and you are running at 0% there is, statistically speaking, a good chance there is no difference in your ability. To understand why I have to give you a course in STATS101.

In statistics we create something called a null hypothesis (Ho), which assumes there is no difference between the data sets. In this case the null hypothesis would be that there is no difference between the mean percentage of tournaments your mate wins compared to you. The alternative hypothesis supposes that there are differences in the statistical behaviour of these two datasets.

The job of the statistician is to attempt to reject the null hypothesis, thereby accepting the alternative hypothesis. How do we do this? We use statistical tests to achieve this, but if our sample size is too small compared to the magnitude of the difference, we may not be able to detect the true difference.

Coming back to the hypotheses, since data is sampled at random, there is always a risk of reaching an incorrect conclusion. We can do this in two ways,

Type I error - The hypothesis is correct, but the test rejects it.
Type II error - The hypothesis is incorrect, but the test accepts it.

The probability of rejecting a false hypothesis is called the 'power' of a test and, you guessed it, there is a method for calculating this imaginatively called the power test.

Now, we are interested in the differences in ROI but we should look at the ITM %, since this the driving factor behind ROI. Conveniently, for DONs the result of any particular game is binary; either you lost the game or you finished ITM. So we can consider the ITM % as a proportion.

I think it suffices to say that there is a particular kind of power test for comparing proportions, which is based on the approximations with the normal distribution (the standard bell-shaped distribution that most random data conforms to). I have run this particular code in R, a free stats program that anyone can learn to use!

Here is the code:

Code:
> power.prop.test(power=0.85,p1=0.52,p2=0.53)

     Two-sample comparison of proportions power calculation

              n = 44778.2
             p1 = 0.52
             p2 = 0.53
      sig.level = 0.05
          power = 0.85
    alternative = two.sided

 NOTE: n is number in *each* group


Eezy, peezy, Lionel squeezy, huh?

I have set the desired power to 85% (although it's up to the designer of the experiment). We are using a 95% confidence interval, which is where the significance level of 0.05 comes in (don't worry too much about this - it's pretty industry standard). I have set the first ITM percentage at 52%, which is exactly 0% ROI and the second ITM percentage at 53% or 1.9% ROI.

The power test indicates that you would need have 44778 under your belt to be sure that an ROI of 0% is statistically different to an ROI of 1.9% (at the 95% confidence interval). Now we can run with those ITM percentage and ask how many tournaments would you need to have under your belt to be sure that an ROI of X is different to an ROI of 0%. Here is the result.

Code:

ITM %      ROI %         Tournaments
52           0.0              NA
53           1.9           44778
54           3.8           11181
55           5.8            4962
56           7.7            2786
57           9.6            1780
58          11.5            1233
59          13.5            904
60          15.4            690


Looking at it graphically, it looks like this.


Figure 1: The number of tournaments a player needs to play to be 95% certain that his ROI of 0% is statistically different to X ROI.

Obviously, the further apart the ROI figures are the fewer tournament you need to be certain that the ROI figures are in fact different. Interestingly this translates into a near perfect negative exponential line.

These results can perhaps be interpreted in a more practical manner with a player with a 5.8% ROI asking, "Can I be certain that my true ROI isn't 9.6%?" Since the difference of ITM finishes is 2% between those two ROI figures, we can simply read off the chart the numbers of tournaments you would need to play between an ITM% of 52 and and 54. In this case, you would need to play 11181 to be 95% certain that an ROI of 5.8% is statistically different to an ROI of 9.6%.

Well I hope this isn't too dense for folks to understand. I just did it as an exercise for myself and I hope some people find it interesting.

Edits: Just some housework with spelling.
View user's profile Send private message
DanAronG
Post Posted: Sun, 08 Nov 2009, 6:54pm    Post subject: Reply with quote
Flush
Flush

Joined: 17 Nov 2008
Posts: 436
WPP: 112
Location: Beachside
Intereseting stuff Nak. I often have ideas like this while at work and plan to do some math research for a post when I get home, but I never get round to doing it for one reason or another. A+ for effort.
View user's profile Send private message
Nakamura
Post Posted: Mon, 09 Nov 2009, 5:16am    Post subject: Reply with quote
Full House
Full House

Joined: 24 Mar 2008
Posts: 925
WPP: 139
Location: Cape Town
Thanks Dan.

Another thought came to mind, which is this stuff obviously also applies to HUSNGS, since the outcome of a game is also binary.
View user's profile Send private message
taipan168
Post Posted: Mon, 09 Nov 2009, 7:46am    Post subject: Reply with quote
Straight Flush
Straight Flush

Joined: 07 Jul 2005
Posts: 9467
WPP: 127
Location: Sydney
Thanks Rob, interesting analysis. When I have a little more time and headspace I'll run through your analysis in detail, I'm sure there are insights to be gained there!
View user's profile Send private message MSN Messenger
DanAronG
Post Posted: Mon, 09 Nov 2009, 8:14am    Post subject: Reply with quote
Flush
Flush

Joined: 17 Nov 2008
Posts: 436
WPP: 112
Location: Beachside
Nakamura wrote:
Thanks Dan.

Another thought came to mind, which is this stuff obviously also applies to HUSNGS, since the outcome of a game is also binary.


And 3-way (tripple-ups), which I think I mentioned earlier are seriously soft at low levels.
View user's profile Send private message
Arjonius
Post Posted: Mon, 09 Nov 2009, 2:46pm    Post subject: Reply with quote
Two Pair
Two Pair

Joined: 02 Jul 2005
Posts: 40
WPP: 68

What's a decent ROI in DoNs anyway?
View user's profile Send private message
DanAronG
Post Posted: Tue, 10 Nov 2009, 4:01am    Post subject: Reply with quote
Flush
Flush

Joined: 17 Nov 2008
Posts: 436
WPP: 112
Location: Beachside
One good thing that can be taken from this, is that if you are only just losing money or mayber breaking even, or even if you have lost considerably but don't have a huge data set, please continue playing.

It's probably not that you are crap at poker, it is likely you have just hit some variance and you are in fact a winning player. So continue playing for another, say 5000 SnG's before you either change your play or quit poker for good.


(Is that considered doing my bit for the poker community? I need donks in SnGs to make up for my small yet potentially large losses in ring games)
View user's profile Send private message
kevster
Post Posted: Tue, 10 Nov 2009, 9:09am    Post subject: Reply with quote
Full House
Full House

Joined: 04 May 2006
Posts: 607
WPP: 126
Location: Fold City
wow. just wow.

This post pleases me greatly. Very useful.
View user's profile Send private message
Nakamura
Post Posted: Tue, 10 Nov 2009, 12:11pm    Post subject: Reply with quote
Full House
Full House

Joined: 24 Mar 2008
Posts: 925
WPP: 139
Location: Cape Town
Thanks Kev.

I'm been bored while watching us clobber the hell out the Zimbabwe in the cricket, so I wondered what if we change the significance level. In other words, you lower the statistical significance level, which increases the chance of a Type II error (accepting the alternative hypothesis when it is actually incorrect). In our case, that would mean you accept that there is difference between two ROI's when, in fact, they are not different.

I played with significance levels of 80% and 70%, which, needless to say, are unusual in the stats realm. Here are the results displayed in a graphical layout (since I can't be bothered to align everything)

.
Figure 2. ROI versus number of tournaments at three different sigificance levels, 95%, 80%, 75.

It just so happens that if you want to be 80% certain that two ROIs are in fact different, you now only need a sample size 59.8% compared to that at the 95% sig. level. For example, when answering whether an ROI of 3.8% is different to 0%, you now only 6691 tournaments compared to 11181. However, the flip-side is that there is a greater chance you are concluding that two ROI's are different, when in fact they are not.

If you lower your standards even further, you can get away with less then half the number of tournaments you required at the 95% sig level, but now you are accepting a 30% risk of error.

I doubt this is the end of my thoughts, as I was thinking about the evolution of certainty between samples. For example, when am I am 10%, 20% certain etc., etc. But this is it for now.
View user's profile Send private message
Display posts from previous:   
All times are GMT - 4 Hours

  >    > 

Double or Nothings (DON) and Statistical Power

  

Post new topic   Reply to topic


 
Jump to:  

You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot rate topics in this forum
You cannot vote in polls in this forum


.  Forum style based on NoseBleed by mikelothar.com.   

All content
© FlopTurnRiver.com
Advertising  |   Partners  |   Testimonials  |   T&C  |   Contact Us  |   FTR News & Press  |   Site Map  |   Search FTR

Full Tilt  |   Titan Poker  |   UltimateBet  |   Poker Stars  |   Ladbrokes Bonus  |   Sportsbook  |   Cake Poker  

Play Texas Holdem Online, Online Texas Holdem Strategy, & Poker Forum
This is not a gambling website.