1 of 36

Powerball lottery

2 of 36

How to play

6 picks out of 69 numbers

One pick out of 26 numbers

3 of 36

4 of 36

5 of 36

6 of 36

Plan which data you will organize

  • Counting frequency
  • Counting frequency of powerball number
  • By No1~No5 ?
  • By draw?
  • Any game rule change?

7 of 36

Extract counting from the data

  • Import data by choosing csv file -> select powerball data (please download from the website)

8 of 36

  • Go “pivot” and choose Number_1 and create pivot table (select show table and show plot)
  • Store in “powerball_n1”

9 of 36

  • You can create separate five data of power_n1 … powerball_n5 which shows frequency distribution by Number_1~Number_5.

10 of 36

  • Collect all data(distribution) into one
  • Combine -> select two data one for ‘datasets’ and the other for ‘combine with’ -> combine type ‘bind columns’
  • Comined dataset as “powerball_n1_n2”
  • You should do the same thing with n3, n4, n5.

11 of 36

  • Transform->create-> total=n1+n2+n3+n4+n5

12 of 36

  • Now you have n1~n5 counting data and total counting data!
  • Let’s play with data

13 of 36

  • Top 10 frequent numbers
  • Top 10 least-frequent numbers

Click here to arrange the data

14 of 36

  • Is this frequency the normal that we can get from the data?
  • How we can evaluate this result?

Draw: 587

Probability: 5/69

15 of 36

Draw: 587

Probability: 5/69

You will pick Number 28?? Based on the data?

16 of 36

17 of 36

Standard for frequency?

18 of 36

Higher than the normal probablity

Lower than the normal probablity

19 of 36

Must-have numbers?

Must-not-have numbers?

20 of 36

Distribution of frequency

Normal distribution zone

21 of 36

  • Rule changed

261 draw

22 of 36

Data process – group by

23 of 36

-First, to figure out your data and its structure.

X1, X2, X3, X4, … Xn

24 of 36

-To understand the data by X1, use the function of ‘group by’

X1, X3, X4, … Xn

group by X1 for X2

X2

25 of 36

No

26 of 36

group by X1 and X2, data size=max 69*69 (in case of powerball)

27 of 36

28 of 36

X1, X2, X3, X4, … Xn

X1

X2

X3

X4

29 of 36

30 of 36

X1, X2, X3, X4, … Xn

X1

X2

X3

X4

Filter -> we should find each variable

Need to look at one variable

31 of 36

How can we find the frequency of the number set?

32 of 36

How many of (12, 3) in the data?

33 of 36

Filter 12 in gathered data (tidy data) -> save draw -> ‘innerjoin’ with original data

Innerjoin saves only shared data between two data (<->antijoin)

34 of 36

Filter 12 in gathered data and save only data with ‘12’

35 of 36

Innerjoin data with ‘12’ and original gathered data with ‘draw’

36 of 36

6 times showing the data has (12,3)