1. Introductory Notes To Probability Questions, Problems, Paradoxes
"The most important questions of life are, for the most part, really only problems of probability."
Pierre Simon de Laplace, “Théorie Analytique des Probabilités”
Free software accompanying this article:
• RandomDigits.EXE version 1.0/October 2004.
• • OccupancySaliuParadox.EXE version 1.0/November 2004.
It no longer comes as a surprise to me getting emails on various probability questions, and problems, and paradoxes. Ever since I published the best introduction to theory of probability, I made myself available as the Doctor in Occult Science of Randomness. In addition to emails, people try also to get me to participate in debates in newsgroups and other public forums. I know it because I see questions repeated in the newsgroups or forums I visit; I also receive direct questions on those repeated problems.
I believe that my site offers answers to every possible probability question or problem. The probability page offers the best theoretical template in solving just about every kind of probability question. Furthermore, my web site also offers the best software to deal with the problem. My software — which can only be freeware, by the decision of my Fear_Survival System — will do probability, statistics, and combinatorics calculations. Calculate and also generate all possible cases of a problem — and then count the favorable cases. That's what probability is all about: Rapport the favorable cases to total cases.
This article is about a famous paradox, which is constantly risen in public forums as 'probability question'. I am referring to the Monty Paradox, or the Monty Hall problem. I also offer a few more details on Ion saliu paradox (problem) of N cases (drawings); it responds to several questions in newsgroups.
2. The Monty Paradox And The TV Game Shows: How To Win?
Apparently, this paradox or probability problem was inspired by the Monty Hall's TV game show Let's Make A Deal. I have never, ever seen that TV show. I collected the necessary information over the Internet.
The host, Monty Hall (a.k.a. Monte Hall), offers the player the opportunity to win what is behind one of three doors. Typically there was a really nice prize (i.e. a car; I really need one!) behind one of the doors and a not-so-nice prize (i.e. a goat; don't like goat or mutton) behind the other two. After selecting a door, Monty would then proceed to open one of the doors you didn't select. It is important to note here that Monty would NOT open the door that concealed the car. Thus, the host always ELIMINATES one of the losing cases. At this point, he would then ask you if you wanted to switch to the other door before revealing what you had won.
In September of 1991 a reader of Marilyn Vos Savant's “Sunday Parade” column wrote in and asked the following question:
"Suppose you're on a game show, and you're given the choice of three doors: Behind one door is a car; behind the others, goats. You pick a door, say No. 1, and the host, who knows what's behind the other doors, opens another door, say No. 3, which has a goat. He then says to you, 'Do you want to pick door No. 2?' Is it to your advantage to take the switch?"
This problem was given the name The Monty Hall Paradox in honor of the long time host of the television game show "Let's Make a Deal." Articles about the controversy appeared in the New York Times and other papers around the country. Marilyn's answer was that the contestant should switch doors and she received nearly 10,000 responses from readers, most of them disagreeing with her. Several were from mathematicians and scientists whose responses ranged from hostility to disappointment at the nation's lack of mathematical skills.
Why were so many people convinced that Marilyn Vos Savant was wrong? They had all decided that it did not matter if the contestant switched or did not switch. There may be a reason so many disagreed with her. Omitting one phrase in the statement of this problem changes the answer completely and this might explain why many people have the wrong intuition about the solution. If the host (Monty Hall) does not know where the car is behind the other two doors, then the answer to the question is "IT DOESN'T MATTER IF THE CONTESTANT SWITCHES." The change in the statement of the problem is so slight that this might be the reason this problem is such a "paradox."
3. Theory Of Probability Applied To Monty Paradox
There is no doubt that the total number of possibilities is 6, or 3! (3 factorial or factorial of 3). No matter what, the Player has always 2 favorable situations, those starting with digit '1'. It doesn't matter what door number that is. Digit '1' in position #1 means that the Player guessed correctly the door number where the prize (car) hides. This situation occurs 2 out of 6 times, or 2/6 = 1/3. Of course, if the Player stays (sticks) with her choice, the Player wins. In other words, the Player loses if she changes (switches) her first selection.
In 4 out of 6 cases, the digit in the first position is NOT 1; therefore, the player makes a losing selection at the beginning of the game. The complementary situation is CHANGE the selection. In other words, the Player WINS if she changes (switches) her first selection. The winning situation occurs 4 out of 6 times, or 4/6 = 2/3.
This is a no-brainer, really. The Player is better served if changing his first choice. This was a TV game, not a casino game. No casino in the world would offer such a favorable-to-the-player game. If allowed to switch, I would bet on Dealer's hand on most occasions…
We can generalize this game. The probability to win in STAY situations is (3-1)! / 3! For N choices, the probability is:
P = (N – 1)! / N! = 1 / N
The probability to win in CHANGE situations is the complementary probability:
PC = {1 – [(N – 1)! / N!] = { 1 – (1 / N)}
If N tends to infinity, the probability to win in STAY cases tends to zero; the probability to win in CHANGE cases tends to 1 (certainty).
Let's analyze the N = 4 case in our quest to find the general solution. In this case, there are four doors. The prize is behind one of the doors. The other three doors represent a losing choice. Again, '1' represents the prize; the digit in the first position represents the Player's first selection. Here is what my program generated.
If the host eliminates only one losing door, the player has more nested choices to make. That's why the program analyzes also 'Stay Again' and 'Change Again' situations. Still — the winning probability for STAY is ALWAYS 6/18 = 1/3. It is the same probability as in the original problem with 3 doors (choices). No-brainer: The player should always change. The winning probability for the CHANGE situation is always 1 – 1/3 = 2/3.
If N > 4, we record the same situation for change, and change, and change, etc. The probability to win — IF STAY — is always 3! / (4! – 3!) = 6/18 = 1/3.
4. Addenda To The Ion Saliu Paradox Of N Trials
I understand this is a question for me now. Not because the question was repeated in newsgroups, but because I was asked directly to give an answer to it.
The answer to your question can be found in a recently discovered paradox. I named it 'Ion Saliu's problem (paradox) of N trials (draws)'.
“The limit of the degree of certainty DC is {1 — (1/e)} when N tends to infinity,
for an event of probability p = 1/N and a number of trials equal to N.
e represents the base of the natural logarithm and equals approximately 2.718281828...
The limit {1 — (1/e)} is approximately 0.63212055...”
Read: “Theory of Probability: Best introduction, formulae, algorithms, software” —
Read also: “The Fundamental Formula of Gambling: Theory of Probability, Mathematics, Chance” —
4.1. First, “a random sequence of digits (0-9)” can be in the 'Birthday Paradox' format; i.e. we draw 10 digits IN ONE DRAW. Simultaneous is the key word here. It does make a difference if the numbers are drawn simultaneously, or one at a time. There are some shrewd (or screwed?) gamblers who apply the 'Birthday Paradox' to roulette. But their calculations are wrong. Roulette draws one number at a time. The 'Birthday Paradox' requires at least TWO elements simultaneously (in the same sequence, or drawing). Read:
“The Birthday Paradox: Combinatorics, Probability, Software, Pick 3 Lottery, Roulette”.
www.saliu.com/birthday.html.
If you draw 10 digits in the same draw, the 'Birthday Paradox' shows that the probability of the 10 digits being unique is 0.04%. That means that in 99.96% of the situations, the 'draw' consists of at least two duplicate digits.
4.2. You probably mean drawing ONE DIGIT AT A TIME. The probability of each digit is 1/10. We always need a 3rd element: the degree of certainty. It is widely known as 'confidence level'. I don't like that term. Truth is an independent relation from any human. Truth has nothing to do with human confidence. Confident or not be you, o incredulous one, the truth has nothing to do with that. What matters is the degree of certainty that a relation exists. Our spite or joy seeing a relation is irrelevant to the relation.
The tool of choice to solve your problem is the Fundamental Formula of Gambling (FFG). One software package that incorporates this extraordinary formula is my freeware SuperFormula.EXE. It calculates easily many probability and statistics parameters, even for gigantic values.
4.3. What is the degree of certainty that all 10 digits will appear in 10 draws? Select option 'C' in SuperFormula, then '2' (the program calculates the probability p). Entries: p = 1/10, number of trials N = 10. The result: 65.13%. You draw a digit from 0 to 9. In most cases, you'll see that 6 or 7 DISTINCT (unique) digits come up. You can use another piece of free software I wrote and offer to the entire world, regardless of nationality, sex, religion, age, or preference for math, maths, or matematica: PermuteCombine.EXE. You can generate RANDOMLY 1000 pick-3 combinations (arrangements of 3 elements from 1 to 10 each). You'll notice most of the time the 1 – 1/e limit. There will be around 632 unique pick-3 combinations among the 1000 random combinations. I remember in the 1990's a group of Australian investors bought tickets in Virginia, USA lottery. A lotto game had a huge jackpot. The investors bought a number of tickets equal to total possible combinations in that lotto game (I forget now what game it was, how many numbers it had, etc.). The Aussies bought quick-picks: random combinations generated by the computer of the lottery commission. They thought they had covered every possible combination. They hadn't! Luckily, they won the jackpot. They were lucky, indeed. They hadn't covered around 37% of the combinations. So, they faced a scary probability to lose: 37%! I think they had bought over US$100 million worth of tickets!
4.4. You can rephrase your question. How many draws will it take to get ALL 10 digits with a degree of certainty of 90%? Use option 'F' in SuperFormula.EXE. Answer: 22 draws. How many draws will it take to get ALL 10 digits with a degree of certainty of 99%? Answer: 44 draws.
5. Another Paradox: "The Classical Occupancy Problem" Or "The Coupon Collector's Problem"
I wrote software to simulate "The Classical Occupancy Problem":
The program simulates a wide range of games, from coin tossing, dice throwing, 10 digits, 3-digit combinations of the pick-3 lottery, etc. The user can select how many times to run the program. For example, the 10-digit case, drawing one element at a time, from 0 to 9. The user can run the program 10,000 times. I did it many times, with just seconds per run. If the output is displayed on screen, the user can see also the individual elements generated randomly. When all 10 digits have been generated, the program ends the current run, and starts the next. If the output is directed to disk, the execution is much faster and more comprehensive. The program saves to file total trials for each run. The runs are sorted in ascending order in order to calculate the median. Two sample runs.
The 'median number of trials' is a useful parameter. It represents the number of trials for a degree of certainty equal to 50%. • SuperFormula.EXE has a special function: 'C = Calculate probability from median'. The number of trials in the output files generated by RandomDigits.EXEare, in fact, skips. The digits skip various numbers of draws before ALL of them appear. In the die throwing case, such probability for a median equal to 15 is: p = 1/23. In the 10-digit case, such probability for a median equal to 30 is: p = 1/44. These values get closer and closer to the reverse of N calculated by the Fundamental Formula of Gambling. It's option F in SuperFormula.EXE. FFG calculates N as:
poc = log(1 — p) / log(1 — DC),
The fast computers of the today and the near future make possible simulations for longer and longer runs of RandomDigits.EXE. Therefore much more precise relations will be found.
I wrote also software to simulate "Ion Saliu's Paradox of N Trials":
The player selects, for example, the 10 numbers case; the number of trials N = 10 in this case. The program generates randomly 10 numbers from 1 to 10. Each run of 10 trials will show also how many numbers are missing (numbers NOT drawn in a run of 10 trials). At the end, the program calculates the average of unique numbers per run. That value always tends to {1 — 1/e}.
6. The most precise method of calculating probabilities: Software to generate ALL the elements in lexicographical order
• PermuteCombine.EXE can generate any possible set of numbers, including the almighty exponents (Saliusian sets). Another program can take the output of PermuteCombine.EXE and count the favorable cases. For example, rolling the die 7 times. Count all occurrences of triple digits or two repeat-digits. That means that exactly 5 of the 6 point-faces were generated. Such case will calculate the probability that exactly one of the six digits is missing (equivalent to the probability of exactly 5 of 6 in 7 trials).
This method of calculating the probability is absolutely precise. The random simulation is approximate, given a degree of certainty.
• OccupancySaliuParadox.EXE version 1.0/November 2004 - Freeware.
The player selects, for example, the 10 numbers case; the number of trials N = 10 in this case. The program generates randomly 10 numbers from 1 to 10. Each run of 10 trials will show also how many numbers are missing (numbers NOT drawn in a run of 10 trials). At the end, the program calculates the average of unique numbers per run. That value always tends to {1 — 1/e}. For double-zero roulette, the average of unique numbers in 38 spins is between 24 and 25 for the majority of cases.
There is NO real paradox here. Just generate all possible cases and the problem becomes crystal-clear. There are three elements in the problem. Total number of permutations of 3 is 3! = 1 * 2 * 3 = 6. The easiest way is to nominate the three cases as 1, 2, 3. 1 represents the prize (i.e. the car); 2 and 3 represent the losing situations (i.e. the goats). Run my freeware PermuteCombine.EXE, option 'Permutations', then 'Numbers Sequentially'. The program generates all 6 permutations of 3 numbers. I wrote another program that generates the permutations and also evaluates the Win (W) / Loss (L) situations. Then the favorable cases are counted. Allow me, o wise reader, to present you with fragments of the output files. For every output is the child process of an input and they both show clearly the picture of an entity.
The original game: N = 3 (three doors or choices)
Stay Change
---- ------
1 2 3 W L
1 3 2 W L
2 1 3 L W
2 3 1 L W
3 1 2 L W
3 2 1 L W
The digit in position #1 represents the Player's choice. The Player always selects first. Then, the Host eliminates a losing possibility. The Player is greatly assisted by having to make one of two choices. That's what makes many people believe that the Player only faces a p = ½ choice. But they ignore what the total number of possibilities is, and the number of favorable cases is.
N = 4; four doors (choices); one prize (favorable case)
Stay Change
---- ------
1 2 3 4 W LL
1 2 4 3 W LL
1 3 2 4 W LL
1 3 4 2 W LL
1 4 2 3 W LL
1 4 3 2 W LL
Stay Again Change Again
========== ============
2 1 3 4 L WL W L
2 1 4 3 L WL W L
2 3 1 4 L WL L W
2 3 4 1 L WL L W
2 4 1 3 L WL L W
2 4 3 1 L WL L W
3 1 2 4 L LW W L
3 1 4 2 L LW W L
3 2 1 4 L WL L W
3 2 4 1 L WL L W
3 4 1 2 L WL L W
3 4 2 1 L WL L W
4 1 2 3 L LW W L
4 1 3 2 L LW W L
4 2 1 3 L WL L W
4 2 3 1 L WL L W
4 3 1 2 L WL L W
4 3 2 1 L WL L W
The favorable cases are W; a total of 6 wins; that is, 3! = 6.
Number of cases that qualifies as CHANGE AGAIN is 18; that is, 4! – 3! = 18.
Of course, this situation is more complex. The host can choose to eliminate all the losing situations but one — after the Player makes his first selection. In such a case, we are exactly as in N = 3. The probability to win IF STAY is 1/N; the probability to win IF CHANGE is {1 – (1/N)}.
“In a random sequence of digits (0-9), what is the expected length of the sequence until all 10 digits have appeared? I have no idea how to deal with this problem~~ ha”
www.saliu.com/theory-of-probability.html.
www.saliu.com/Saliu2.htm.
The probability of all 10 digits to appear is a variation of "The Classical Occupancy Problem". This is an old and difficult problem. I searched on the Internet and I also found references in old books. The honor of presenting the problem for the first time goes to the French mathematician Pierre Simon de Laplace, 1812. There have been several attempts to offer the most precise method of calculating the probability of N elements to appear with a degree of certainty. I believe every method is an approximation. The answer I have just given above — 44 draws for 10 digits — is also an approximation. I found some interesting new facts. It is undoubtedly accurate to say that a digit will appear within 44 trials with a degree of certainty of 99%. But that calculation refers to just one of the digits. There will be situations when more than one digit will skip more than 44 trials (drawings).
• RandomDigits.EXE version 1.0/October 2004 - Freeware.
~ 10 digits; lower bound = 0; upper bound = 9; total runs: 10,000
Minimum number of trials: 10
Maximum number of trials: 164
Median number of trials: 30
~ one die; lower bound = 1; upper bound = 6; total runs: 10,000
Minimum number of trials: 6
Maximum number of trials: 91
Median number of trials: 15
You can run for the pick 3 game: lower bound = 0; upper bound = 999. It takes much longer to generate all 1000 pick-3 straight combinations. I saw runs of over 9,000 draws (trials) before all combinations came out!
N = log(1 — DC) / log(1 — p).
It appears that the probability of the 'occupancy problem' is approximated by the formula 1/N:
for a DC = 0.99 (99%).
• • OccupancySaliuParadox.EXE version 1.0/November 2004 - Freeware.
This program supersedes RandomDigits.EXE — it does everything RandomDigits.EXE does, plus simulates 'Ion Saliu's Paradox'.
Software simulation can be the most accurate method of calculating the probability of just about any situation, or problem, or paradox. I wrote about it on the “Theory of Probability: Best introduction, formulae, algorithms, software” page. I even offer a mighty tool, free software to generate all elements in a set: PermuteCombine.EXE. After every element in the set has been generated, specialized software will count the number of favorable cases. One example is • BirthdayParadox.EXE. Let's take the dice-throwing situation. Let's simulate rolling 6 dice. The individual probability is p=1/6. What is the probability that all point faces appear in 6 throws? The programs counts the favorable cases (720) and reports them to total possibilities (46656). The probability is p=1.54%. The program can also generate all the elements in the set.
This program supersedes RandomDigits.EXE — it does everything RandomDigits.EXE does, plus simulates 'Ion Saliu's Paradox'.

Doctor in Paradoxical Science of Probability




Copyright ©1997-2007, Ion Saliu. All rights reserved worldwide. Reproduction, in any form, of the contents of this site is strictly prohibited. Read important copyright information regarding web site www.saliu.com