1 of 181

Probabaility, Statistics and Linear Programming

4th SEMESTER

BS-202

Department of Applied Science, BVCOE New Delhi Subject: PSLP

Lecture Notes

1

2 of 181

 

Discrete Uniform Distribution:

If the discrete random variable X assumes the values x1, x2, …,xk with equal probabilities, then X has the discrete uniform distribution given by:

Note:

·      f(x)=f(x;k)=P(X=x)

k is called the parameter of the distribution.

Department of Applied Science, BVCOE New Delhi Subject: PSLP

2

3 of 181

Example 1 :

Experiment: Tossing a balanced die.

  • Sample space: S={1,2,3,4,5,6}
  • Each sample point of S occurs with the same probability 1/6.
  • Let X= the number observed when tossing a balanced die.
  • The probability distribution of X is,

Department of Applied Science, BVCOE New Delhi Subject: PSLP

3

4 of 181

Result:

If the discrete random variable X has a discrete uniform distribution with parameter k, then the mean and the variance of X are;

Var(X) = σ2 =

Department of Applied Science, BVCOE New Delhi Subject: PSLP

4

5 of 181

Uniform Distribution

Also E(X)= (k+1)/2 (why?)

Hint: (1/k).k(k+1)/2 = (k+1)/2

Similarly, Var (X)=(k+1)(k-1)/12

Department of Applied Science, BVCOE New Delhi Subject: PSLP

5

6 of 181

Proof:

Department of Applied Science, BVCOE New Delhi Subject: PSLP

6

7 of 181

Solution:

E(X) = μ =

Var(X) = σ2 =

Example 5.3:

Find E(X) and Var(X) in Example 1.

Department of Applied Science, BVCOE New Delhi Subject: PSLP

7

8 of 181

Binomial Distribution:

Bernoulli Trial:

·      Bernoulli trial is an experiment with only two possible outcomes.

·      The two possible outcomes are labeled:

success (s) and failure (f)

·      The probability of success is P(s)=p and the probability of failure is P(f)= q = 1−p.

·      Examples:

1.    Tossing a coin (success=H, failure=T, and p=P(H))

2.    Inspecting an item (success=defective, failure=non- defective, and p=P(defective))

Department of Applied Science, BVCOE New Delhi Subject: PSLP

8

9 of 181

Bernoulli Process:

Bernoulli process is an experiment that must satisfy the following properties:

1.    The experiment consists of n repeated Bernoulli trials.

2.    The probability of success, P(s)=p, remains constant from trial to trial.

3.    The repeated trials are independent; that is the outcome of one trial has no effect on the outcome of any other trial.

Binomial Random Variable:

Consider the random variable :

X = The number of successes in n trials in a Bernoulli

process.

The random variable X has a binomial distribution with parameters n (number of trials) and p (probability of success), and we write:

X ~ Binomial(n,p) or X~b(x;n,p)

Department of Applied Science, BVCOE New Delhi Subject: PSLP

9

10 of 181

The probability distribution of X is given by:

Department of Applied Science, BVCOE New Delhi Subject: PSLP

10

11 of 181

We can write the probability distribution of X in table as follows.

x

f(x)=P(X=x)=b(x;n,p)

0

1

2

n − 1

n

Total

1.00

Department of Applied Science, BVCOE New Delhi Subject: PSLP

11

12 of 181

Example 2:

Suppose that 25% of the products of a manufacturing process are defective. Three items are selected at random, inspected, and classified as defective (D) or non-defective (N). Find the probability distribution of the number of defective items.

Solution:

·      Experiment: selecting 3 items at random, inspected, and classified as (D) or (N).

·      The sample space is

S={DDD,DDN,DND,DNN,NDD,NDN,NND,NNN}

·      Let X = the number of defective items in the sample

·      We need to find the probability distribution of X.

Department of Applied Science, BVCOE New Delhi Subject: PSLP

12

13 of 181

(1) First Solution:

Outcome

Probability

X

NNN

0

NND

1

NDN

1

NDD

2

DNN

1

DND

2

DDN

2

DDD

3

Department of Applied Science, BVCOE New Delhi Subject: PSLP

13

14 of 181

The probability distribution

of X is,

x

f(x)=P(X=x)

0

1

2

3

(2) Second Solution:

Bernoulli trial is the process of inspecting the item. The results are success=D or failure=N, with probability of success P(s)=25/100=1/4=0.25.

The experiments is a Bernoulli process with:

Department of Applied Science, BVCOE New Delhi Subject: PSLP

14

15 of 181

 ·      number of trials: n=3

·      Probability of success: p=1/4=0.25

·      X ~ Binomial(n,p)=Binomial(3,1/4)

·      The probability distribution of X is given by:

The probability distribution of X is,

x

f(x)=P(X=x)

=b(x;3,1/4)

0

27/64

1

27/64

2

9/64

3

1/64

Department of Applied Science, BVCOE New Delhi Subject: PSLP

15

16 of 181

Result:

The mean and the variance of the binomial distribution b(x;n,p) are:

μ = n p

σ2 = n p (1 p)

Department of Applied Science, BVCOE New Delhi Subject: PSLP

16

17 of 181

Example:

In the previous example, find the expected value (mean) and the variance of the number of defective items.

Solution:

·      X = number of defective items

·      We need to find E(X)=μ and Var(X)=σ2

·      We found that X ~ Binomial(n,p)=Binomial(3,1/4)

·      .n=3 and p=1/4

The expected number of defective items is

E(X)=μ = n p = (3) (1/4) = 3/4 = 0.75

The variance of the number of defective items is

Var(X)=σ2 = n p (1 −p) = (3) (1/4) (3/4) = 9/16 = 0.5625

Department of Applied Science, BVCOE New Delhi Subject: PSLP

17

18 of 181

Example:

In the previous example, find the following probabilities:

(1) The probability of getting at least two defective items.

(2) The probability of getting at most two defective items.

Solution:

X ~ Binomial(3,1/4)

x

.f(x)=P(X=x)=b(x;3,1/4)

0

27/64

1

27/64

2

9/64

3

1/64

Department of Applied Science, BVCOE New Delhi Subject: PSLP

18

19 of 181

  1. The probability of getting at least two defective items:

P(X≥2)=P(X=2)+P(X=3)= f(2)+f(3)=

(2) The probability of getting at most two defective item:

P(X≤2) = P(X=0)+P(X=1)+P(X=2)

= f(0)+f(1)+f(2) =

or

P(X≤2)= 1−P(X>2) = 1−P(X=3) = 1− f(3) =

Department of Applied Science, BVCOE New Delhi Subject: PSLP

19

20 of 181

�� Problems of Binomial Distribution.

  • Ten coins are rolled simultaneously. Find the probability of getting at least 7 heads? Ans. 11/64
  • In a lot of 200 articles, 10 are defective, find prob. of, no defective, one defective, atleast one defective in a random sample of 20 articles.
  • The sum of mean and variance of binomial distribution of 5 trials is 4.8, find the distribution. Ans. p=4/5, q=1/5.

Department of Applied Science, BVCOE New Delhi Subject: PSLP

20

21 of 181

solutions

Department of Applied Science, BVCOE New Delhi Subject: PSLP

21

22 of 181

Department of Applied Science, BVCOE New Delhi Subject: PSLP

22

23 of 181

Department of Applied Science, BVCOE New Delhi Subject: PSLP

23

24 of 181

Hypergeometric Distribution :

·      Suppose there is a population with 2 types of elements:

1-st Type = success

2-nd Type = failure

·      N= population size

·      K= number of elements of the 1-st type

·      NK = number of elements of the 2-nd type

Department of Applied Science, BVCOE New Delhi Subject: PSLP

24

25 of 181

· We select a sample of n elements at random from the population.

· Let X = number of elements of 1-st type (number of successes) in the sample.

·      We need to find the probability distribution of X.

There are to two methods of selection:

1. selection with replacement

2. selection without replacement

(1) If we select the elements of the sample at random and with replacement, then

X ~ Binomial(n,p); where

(2) Now, suppose we select the elements of the sample at random and without replacement. When the selection is made without replacement, the random variable X has a hyper geometric distribution with parameters N, n, and K. and we write X~h(x;N,n,K).

Department of Applied Science, BVCOE New Delhi Subject: PSLP

25

26 of 181

Note that the values of X must satisfy:

0≤x≤K and 0≤n−x≤ NK

0≤x≤K and nN+K≤ x≤ n

Department of Applied Science, BVCOE New Delhi Subject: PSLP

26

27 of 181

Hypergeometric Distribution

Also E(X)= (nK)/N

Hint: (1/k).k(k+1)/2 = (k+1)/2

Similarly, Var (X)=NK(N-K)(N-n)/N^2(N-1)

Department of Applied Science, BVCOE New Delhi Subject: PSLP

27

28 of 181

Example:

Lots of 40 components each are called acceptable if they contain no more than 3 defectives. The procedure for sampling the lot is to select 5 components at random (without replacement) and to reject the lot if a defective is found. What is the probability that exactly one defective is found in the sample if there are 3 defectives in the entire lot.

Solution:

Department of Applied Science, BVCOE New Delhi Subject: PSLP

28

29 of 181

·      Let X= number of defectives in the sample

·      N=40, K=3, and n=5

·      X has a hypergeometric distribution with parameters N=40, n=5, and K=3.

·      X~h(x;N,n,K)=h(x;40,5,3).

·      The probability distribution of X is given by:

But the values of X must satisfy:

0≤x≤K and nN+K≤ x≤ n ⇔ 0≤x≤3 and −42≤ x≤ 5

Therefore, the probability distribution of X is given by:

Department of Applied Science, BVCOE New Delhi Subject: PSLP

29

30 of 181

Now, the probability that exactly one defective is found in the sample is

.f(1)=P(X=1)=h(1;40,5,3)=

Department of Applied Science, BVCOE New Delhi Subject: PSLP

30

31 of 181

Question?

  • A bag contains 50 light bulbs of which 5 are defective and 45 are not. A Quality Control Inspector randomly samples 4 bulbs without replacement.  Let X = the number of defective bulbs selected. Find the probability mass function, f(x), of the discrete random variable X.

Department of Applied Science, BVCOE New Delhi Subject: PSLP

31

32 of 181

Result.

  • Note: One of the key features of the hypergeometric distribution is that it is associated with sampling without replacement. When the samples are drawn with replacement, the discrete random variable follows what is called the binomial distribution.
  • Note: In cases where the sample size is relatively large compared to the population, a discrete distribution called hypergeometric may be useful.

Department of Applied Science, BVCOE New Delhi Subject: PSLP

32

33 of 181

Question 1. A lake contains 600 fish, eighty (80) of which have been tagged by scientists. A researcher randomly catches 15 fish from the lake. Find a formula for the probability mass function of the number of fish in the researcher's sample which are tagged.

Question 2. Let the random variable X denote the number of aces in a five-card hand dealt from a standard 52-card deck. Find a formula for the probability mass function of X.

Department of Applied Science, BVCOE New Delhi Subject: PSLP

33

34 of 181

�Question 3: Determine whether the given scenario describes a binomial setting. Justify your answer.�(a) Genetics says that the genes children receive from their parents are independent from one child to another. Each child of a particular set of parents has probability 0.25 of having type O blood. Suppose these parents have 5 children. Count the number of children with type O blood.�

(b). Shuffle a standard deck of 52 playing cards. Turn over the first 10 cards, one at a time. Record the number of aces you observe.

Department of Applied Science, BVCOE New Delhi Subject: PSLP

34

35 of 181

�Question 4: Each child of a particular set of parents has probability 0.25 of having type O blood. Suppose these parents have 5 children. What’s the probability that exactly one of the five children has type O blood?��

Department of Applied Science, BVCOE New Delhi Subject: PSLP

35

36 of 181

Solution 3 (a)

  • Binary? “Success” 5 has type O blood. “Failure” 5 doesn’t have type O blood.
  • Independent? Knowing one child’s blood type tells you nothing about another child’s because they inherit genes independently from their parents.
  • Number? n = 5
  • Same probability? p = 0.25

Department of Applied Science, BVCOE New Delhi Subject: PSLP

36

37 of 181

Solution 3 (b)

  • Binary? “Success” get an ace. “Failure” don’t get an ace.
  • Independent? No. If the first card you turn over is an ace, then the next card is less likely to be an ace because you’re not replacing the top card in the deck. If the first card isn’t an ace, the second card is more likely to be an ace.

  • This is not a binomial setting because the independent condition is not met.

Department of Applied Science, BVCOE New Delhi Subject: PSLP

37

38 of 181

Poisson Distribution:

  • It is discrete distribution.
  • The discrete r. v. X is said to have a Poisson distribution with parameter (average) λ if the probability distribution of X is given by

where e = 2.71828.

We write:

X ~ Poisson ( λ )

  • The mean (average) of Poisson (λ) is μ= λ
  • The variance is:

Department of Applied Science, BVCOE New Delhi Subject: PSLP

38

39 of 181

  • The Poisson distribution is used to model a discrete r. v. which is a count of how many times a specified random event occurred in an interval of time or space.

Example:

  • No. of patients in a waiting room in an hour.
  • No. of serious injuries in a particular factory in a month.
  • No. of calls received by a telephone operator in a day.
  • No. of rates in each house in a particular city.

Note:

  • is the average (mean) of the distribution.

If X = The number of calls received in a month and

X ~ Poisson (λ)

Department of Applied Science, BVCOE New Delhi Subject: PSLP

39

40 of 181

then:

(i) Y = The no. calls received in a year.

Y ~ Poisson (λ*), where λ*=12λ

Y ~ Poisson (12λ)

  1. W = The no. calls received in a day.

W ~ Poisson (λ*), where λ*=λ/30

W ~ Poisson (λ/30)

Department of Applied Science, BVCOE New Delhi Subject: PSLP

40

41 of 181

Example

Suppose that the number of snake bites cases seen in a year has a Poisson distribution with average 6 bite cases.

1. What is the probability that in a year:

(i) The no. of snake bite cases will be 7?

(ii) The no. of snake bite cases will be less than 2?

Department of Applied Science, BVCOE New Delhi Subject: PSLP

41

42 of 181

2- What is the probability that in 2 years there will be 10 bite cases?

3- What is the probability that in a month there will be no snake bite cases?

Solution:

(1) X = no. of snake bite cases in a year.

X ~ Poisson (6) (λ=6)

(i)

(ii)

Department of Applied Science, BVCOE New Delhi Subject: PSLP

42

43 of 181

Y = no of snake bite cases in 2 years

Y ~ Poisson(12)

3- W = no. of snake bite cases in a month.

W ~ Poisson (0.5)

Department of Applied Science, BVCOE New Delhi Subject: PSLP

43

44 of 181

Note: Poisson distribution is for counts—if events happen at a constant rate over time, the Poisson distribution gives the probability of X number of events occurring in time T.

Department of Applied Science, BVCOE New Delhi Subject: PSLP

44

45 of 181

 

Department of Applied Science, BVCOE New Delhi Subject: PSLP

45

46 of 181

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Department of Applied Science, BVCOE New Delhi Subject: PSLP

46

47 of 181

�Continuous Random Variable

A continuous random variable can take any value in an interval, open or closed, so it has innumerable values

Examples: the height or weight of a chair

For such a variable X, the probability assigned to an exact value P(X = a) is always 0, though the probability for it to fall into interval [a, b], that is,

P(a ≤ X ≤ b), can be a positive number

47

48 of 181

Probability Distributions and Probability Density Functions�

Figure 4-1 Density function of a loading on a long, thin beam.

49 of 181

4.1 Probability Distributions Probability Density Functions�

Definition

50 of 181

Probability Distributions and Probability Density Functions�

Probability determined from the area under f(x).

51 of 181

Probability Distributions and Probability Density Functions�

52 of 181

Probability Distributions and Probability Density Functions�

Example:

53 of 181

Cumulative Distribution Functions�

Definition

54 of 181

Mean and Variance of a Continuous Random Variable�

Definition

55 of 181

Mean and Variance of a Continuous Random Variable�

Expected Value of a Function of a Continuous Random Variable

56 of 181

Question?

  •  

57 of 181

Solution.

58 of 181

59 of 181

4.3 Continuous Uniform Random Variable�

Definition

60 of 181

Continuous Uniform Random Variable�

Continuous uniform probability density function.

61 of 181

Continuous Uniform Random Variable�

Mean and Variance

62 of 181

Continuous Uniform Random Variable�

Example

63 of 181

Continuous Uniform Random Variable�

Figure 4-9 Probability for Example 4-9.

64 of 181

4.4Continuous Uniform Random Variable�

65 of 181

4.5 Normal Distribution�

Definition

66 of 181

Normal Distribution�

Figure : Normal probability density functions for selected values of the parameters μ and σ2.

67 of 181

Normal Distribution�

Some useful results concerning the normal distribution

68 of 181

4.6 Normal Distribution�

Definition : Standard Normal

69 of 181

Normal Distribution�

Example:

Figure 4-13 Standard normal probability density function.

70 of 181

Normal Distribution�

Standardizing

71 of 181

Normal Distribution�

Example:

72 of 181

Normal Distribution�

Figure 4-15 Standardizing a normal random variable.

73 of 181

Normal Distribution�

To Calculate Probability

74 of 181

Normal Distribution�

Example:

75 of 181

Normal Distribution�

Example (When prob. is given)

76 of 181

4-7 Normal Approximation to the Binomial and Poisson Distributions�

  • Under certain conditions, the normal distribution can be used to approximate the binomial distribution and the Poisson distribution.

77 of 181

4-7 Normal Approximation to the Binomial and Poisson Distributions�

Figure 4-19 Normal approximation to the binomial.

78 of 181

Normal Approximation to the Binomial and Poisson Distributions�

Example (Normal Distri. can be used to compute the longer calculations)

79 of 181

Normal Approximation to the Binomial and Poisson Distributions�

Normal Approximation to the Binomial Distribution

80 of 181

4-7 Normal Approximation to the Binomial and Poisson Distributions�

Example 4-18

81 of 181

4-7 Normal Approximation to the Binomial and Poisson Distributions�

Figure 4-21 Conditions for approximating hypergeometric and binomial probabilities.

82 of 181

4-7 Normal Approximation to the Binomial and Poisson Distributions�

Normal Approximation to the Poisson Distribution

83 of 181

4-7 Normal Approximation to the Binomial and Poisson Distributions�

Example 4-20

84 of 181

4-8 Exponential Distribution�

Definition

85 of 181

Poisson or not?��Which of the following is most likely to be well modelled by a Poisson distribution?��

  1. Number of trains arriving at Falmer every hour
  2. Number of lottery winners each year that live in Brighton
  3. Number of days between solar eclipses
  4. Number of days until a component fails

86 of 181

Are they Poisson? Answers:

  1. Number of trains arriving at Falmer every hour��NO, (supposed to) arrive regularly on a timetable not at random

  • Number of lottery winners each year that live in Brighton��Yes, is number of random events in fixed interval�
  • Number of days between solar eclipses��NO, solar eclipses are not random events and this is a time between random events, not the number in some fixed interval�
  • Number of days until a component fails�NO, random events, but this is time until a random event, not the number of random events

87 of 181

 

What is the probability distribution for the time to the first event?

 

Poisson - Discrete distribution: P(number of events)

Exponential - Continuous distribution: P(time till first event)

Time between random events / time till first random event ?

88 of 181

 

Occurrence

 

1) Time until the failure of a part.

 

2) Separation between randomly happening events

 

 

 

 

 

89 of 181

Relation to Poisson distribution�

 

 

 

 

 

 

 

 

 

 

 

90 of 181

Example

 

 

 

 

 

E.g. Probability the time till the next death is less than one year?

 

 

 

91 of 181

Exponential distribution��A certain type of component can be purchased new or used. 50% of all new components last more than five years, but only 30% of used components last more than five years. Is it possible that the lifetimes of new components are exponentially distributed?

  1. YES
  2. NO

Question from Derek Bruff

92 of 181

 

 

 

 

 

 

93 of 181

4-8 Exponential Distribution�

Mean and Variance

94 of 181

4-8 Exponential Distribution�

Example 4-21

95 of 181

4-8 Exponential Distribution�

Figure 4-23 Probability for the exponential distribution in Example 4-21.

96 of 181

4-8 Exponential Distribution�

Example 4-21 (continued)

97 of 181

4-8 Exponential Distribution�

Example 4-21 (continued)

98 of 181

4-8 Exponential Distribution�

Example 4-21 (continued)

99 of 181

4-8 Exponential Distribution�

Our starting point for observing the system does not matter.

  • An even more interesting property of an exponential random variable is the lack of memory property.

In Example 4-21, suppose that there are no log-ons from 12:00 to 12:15; the probability that there are no log-ons from 12:15 to 12:21 is still 0.082. Because we have already been waiting for 15 minutes, we feel that we are “due.” That is, the probability of a log-on in the next 6 minutes should be greater than 0.082. However, for an exponential distribution this is not true.

100 of 181

4-8 Exponential Distribution�

Example 4-22

101 of 181

4-8 Exponential Distribution�

Example 4-22 (continued)

102 of 181

4-8 Exponential Distribution�

Example 4-22 (continued)

103 of 181

Example: Reliability

The time till failure of an electronic component has an Exponential distribution and it is known that 10% of components have failed by 1000 hours.

(a) What is the probability that a component is still working after 5000 hours?

(b) Find the mean and standard deviation of the time till failure.

 

 

 

 

 

 

 

 

 

 

104 of 181

 

 

 

 

(b) Find the mean and standard deviation of the time till failure.

 

Answer:

 

 

105 of 181

4-8 Exponential Distribution�

Lack of Memory Property

106 of 181

4-8 Exponential Distribution�

Figure 4-24 Lack of memory property of an Exponential distribution.

107 of 181

4-9 Erlang and Gamma Distributions�

Erlang Distribution

The random variable X that equals the interval length until r counts occur in a Poisson process with mean λ > 0 has and Erlang random variable with parameters λ and r. The probability density function of X is

for x > 0 and r =1, 2, 3, ….

108 of 181

4-9 Erlang and Gamma Distributions�

Gamma Distribution

109 of 181

4-9 Erlang and Gamma Distributions�

Gamma Distribution

110 of 181

4-9 Erlang and Gamma Distributions�

Gamma Distribution

Figure 4-25 Gamma probability density functions for selected values of r and λ.

111 of 181

4-9 Erlang and Gamma Distributions�

Gamma Distribution

112 of 181

4-10 Weibull Distribution�

Definition

113 of 181

4-10 Weibull Distribution�

Figure 4-26 Weibull probability density functions for selected values of α and β.

114 of 181

4-10 Weibull Distribution�

115 of 181

4-10 Weibull Distribution�

Example 4-25

116 of 181

4-11 Lognormal Distribution�

117 of 181

4-11 Lognormal Distribution�

Figure 4-27 Lognormal probability density functions with θ = 0 for selected values of ω2.

118 of 181

4-11 Lognormal Distribution�

Example 4-26

119 of 181

4-11 Lognormal Distribution�

Example 4-26 (continued)

120 of 181

4-11 Lognormal Distribution�

Example 4-26 (continued)

121 of 181

122 of 181

���������� Normal Probability � Distribution s

123 of 181

Properties of Normal Distributions

A continuous random variable has an infinite number of possible values that can be represented by an interval on the number line.

Hours spent studying in a day

0

6

3

9

15

12

18

24

21

The time spent studying can be any number between 0 and 24.

The probability distribution of a continuous random variable is called a continuous probability distribution.

124 of 181

Properties of Normal Distributions

The most important probability distribution in statistics is the normal distribution.

A normal distribution is a continuous probability distribution for a random variable, x. The graph of a normal distribution is called the normal curve.

Normal curve

x

125 of 181

Properties of Normal Distributions

Properties of a Normal Distribution

  1. The mean, median, and mode are equal.
  2. The normal curve is bell-shaped and symmetric about the mean.
  3. The total area under the curve is equal to one.
  4. The normal curve approaches, but never touches the x-axis as it extends farther and farther away from the mean.
  5. Between μ − σ and μ + σ (in the center of the curve), the graph curves downward. The graph curves upward to the left of μ − σ and to the right of μ + σ. The points at which the curve changes from curving upward to curving downward are called the inflection points.

126 of 181

Properties of Normal Distributions

μ − 3σ

μ + σ

μ − 2σ

μσ

μ

μ + 2σ

μ + 3σ

Inflection points

Total area = 1

If x is a continuous random variable having a normal distribution with mean μ and standard deviation σ, you can graph a normal curve with the equation

x

127 of 181

Means and Standard Deviations

A normal distribution can have any mean and any positive standard deviation.

Mean: μ = 3.5

Standard deviation: σ ≈ 1.3

Mean: μ = 6,

μ -σ =7.9

Standard deviation: σ ≈ 1.9

The mean gives the location of the line of symmetry.

The standard deviation describes the spread of the data.

Inflection points

Inflection points

3

6

1

5

4

2

x

3

6

1

5

4

2

9

7

11

10

8

x

128 of 181

Means and Standard Deviations

Example:

  1. Which curve has the greater mean?
  2. Which curve has the greater standard deviation?

The line of symmetry of curve A occurs at x = 5. The line of symmetry of curve B occurs at x = 9. Curve B has the greater mean.

Curve B is more spread out than curve A, so curve B has the greater standard deviation.

3

1

5

9

7

11

13

A

B

x

129 of 181

Interpreting Graphs

Example:

The heights of fully grown magnolia bushes are normally distributed. The curve represents the distribution. What is the mean height of a fully grown magnolia bush? Estimate the standard deviation.

The heights of the magnolia bushes are normally distributed with a mean height of about 8 feet and a standard deviation of about 0.7 feet.

μ = 8

The inflection points are one standard deviation away from the mean.

σ ≈ 0.7

6

8

7

9

10

Height (in feet)

x

130 of 181

The Standard Normal Distribution

−3

1

−2

−1

0

2

3

z

The standard normal distribution is a normal distribution with a mean of 0 and a standard deviation of 1.

Any value can be transformed into a z-score by using the formula

The horizontal scale corresponds to z-scores.

131 of 181

The Standard Normal Distribution

If each data value of a normally distributed random variable x is transformed into a z-score, the result will be the standard normal distribution.

After the formula is used to transform an x-value into a z-score, the Standard Normal Table in Appendix B is used to find the cumulative area under the curve.

The area that falls in the interval under the nonstandard normal curve (the x-values) is the same as the area under the standard normal curve (within the corresponding z-boundaries).

−3

1

−2

−1

0

2

3

z

132 of 181

The Standard Normal Table

Properties of the Standard Normal Distribution

  1. The cumulative area is close to 0 for z-scores close to z = −3.49.
  2. The cumulative area increases as the z-scores increase.
  3. The cumulative area for z = 0 is 0.5000.
  4. The cumulative area is close to 1 for z-scores close to z = 3.49

z = −3.49

Area is close to 0.

z = 0

Area is 0.5000.

z = 3.49

Area is close to 1.

z

−3

1

−2

−1

0

2

3

133 of 181

The Standard Normal Table

Example:

Find the cumulative area that corresponds to a z-score of 2.71.

z

.00

.01

.02

.03

.04

.05

.06

.07

.08

.09

0.0

.5000

.5040

.5080

.5120

.5160

.5199

.5239

.5279

.5319

.5359

0.1

.5398

.5438

.5478

.5517

.5557

.5596

.5636

.5675

.5714

.5753

0.2

.5793

.5832

.5871

.5910

.5948

.5987

.6026

.6064

.6103

.6141

2.6

.9953

.9955

.9956

.9957

.9959

.9960

.9961

.9962

.9963

.9964

2.7

.9965

.9966

.9967

.9968

.9969

.9970

.9971

.9972

.9973

.9974

2.8

.9974

.9975

.9976

.9977

.9977

.9978

.9979

.9979

.9980

.9981

Find the area by finding 2.7 in the left hand column, and then moving across the row to the column under 0.01.

The area to the left of z = 2.71 is 0.9966.

Appendix B: Standard Normal Table

134 of 181

The Standard Normal Table

Example:

Find the cumulative area that corresponds to a z-score of −0.25.

z

.09

.08

.07

.06

.05

.04

.03

.02

.01

.00

−3.4

.0002

.0003

.0003

.0003

.0003

.0003

.0003

.0003

.0003

.0003

−3.3

.0003

.0004

.0004

.0004

.0004

.0004

.0004

.0005

.0005

.0005

Find the area by finding −0.2 in the left hand column, and then moving across the row to the column under 0.05.

The area to the left of z = −0.25 is 0.4013

−0.3

.3483

.3520

.3557

.3594

.3632

.3669

.3707

.3745

.3783

.3821

−0.2

.3859

.3897

.3936

.3974

.4013

.4052

.4090

.4129

.4168

.4207

−0.1

.4247

.4286

.4325

.4364

.4404

.4443

.4483

.4522

.4562

.4602

−0.0

.4641

.4681

.4724

.4761

.4801

.4840

.4880

.4920

.4960

.5000

Appendix B: Standard Normal Table

135 of 181

Guidelines for Finding Areas

Finding Areas Under the Standard Normal Curve

  1. Sketch the standard normal curve and shade the appropriate area under the curve.
  2. Find the area by following the directions for each case shown.
    1. To find the area to the left of z, find the area that corresponds to z in the Standard Normal Table.

1. Use the table to find the area for the z-score.

The area to the left of z = 1.23 is 0.8907. In table, 1.2 at 0.03=0.3907 +0.5

1.23

0

z

136 of 181

Guidelines for Finding Areas

Finding Areas Under the Standard Normal Curve

    • To find the area to the right of z, use the Standard Normal Table to find the area that corresponds to z. Then subtract the area from 1.

3. Subtract to find the area to the right of z = 1.23: 1 − 0.8907 = 0.1093.

1. Use the table to find the area for the z-score.

2. The area to the left of z = 1.23 is 0.8907.

1.23

0

z

137 of 181

Guidelines for Finding Areas

Finding Areas Under the Standard Normal Curve

    • To find the area between two z-scores, find the area corresponding to each z-score in the Standard Normal Table. Then subtract the smaller area from the larger area.

4. Subtract to find the area of the region between the two z-scores: 0.8907 − 0.2266 = 0.6641.

1. Use the table to find the area for the z-score.

3. The area to the left of z = −0.75 is 0.2266

(=0.5-0to 0.75 area=0.5-0.2734=0.2266)

2. The area to the left of z = 1.23 is 0.8907.

1.23

0

z

−0.75

138 of 181

Guidelines for Finding Areas

Example:

Find the area under the standard normal curve to the left of z = −2.33.

From the Standard Normal Table, the area is equal to 0.0099. (0.5-0.4901)

Always draw the curve!

−2.33

0

z

139 of 181

Guidelines for Finding Areas

Example:

Find the area under the standard normal curve to the right of z = 0.94.

From the Standard Normal Table, the area is equal to 0.1736.

Always draw the curve!

0.8264

1 − 0.8264 = 0.1736

0.94

0

z

140 of 181

Guidelines for Finding Areas

Example:

Find the area under the standard normal curve between z = −1.98 and z = 1.07.

From the Standard Normal Table, the area is equal to 0.8338.

Always draw the curve!

0.8577 − 0.0239 = 0.8338

0.8577

0.0239

1.07

0

z

−1.98

141 of 181

§ 5.2

Normal Distributions: Finding Probabilities

142 of 181

Probability and Normal Distributions

If a random variable, x, is normally distributed, you can find the probability that x will fall in a given interval by calculating the area under the normal curve for that interval.

P(x < 15)

μ = 10

σ = 5

15

μ =10

x

143 of 181

Probability and Normal Distributions

Same area

P(x < 15) = P(z < 1) = Shaded area under the curve

= 0.8413

15

μ =10

P(x < 15)

μ = 10

σ = 5

Normal Distribution

x

1

μ =0

μ = 0

σ = 1

Standard Normal Distribution

z

P(z < 1)

144 of 181

Probability and Normal Distributions

Example:

The average on a statistics test was 78 with a standard deviation of 8. If the test scores are normally distributed, find the probability that a student receives a test score less than 90.

P(x < 90) = P(z < 1.5) = 0.9332

The probability that a student receives a test score less than 90 is 0.9332.

μ =0

z

?

1.5

90

μ =78

P(x < 90)

μ = 78

σ = 8

x

145 of 181

Probability and Normal Distributions

Example:

The average on a statistics test was 78 with a standard deviation of 8. If the test scores are normally distributed, find the probability that a student receives a test score greater than than 85.

P(x > 85) = P(z > 0.88) = 1 − P(z < 0.88) = 1 − 0.8106 = 0.1894

The probability that a student receives a test score greater than 85 is 0.1894.

μ =0

z

?

0.88

85

μ =78

P(x > 85)

μ = 78

σ = 8

x

146 of 181

Probability and Normal Distributions

Example:

The average on a statistics test was 78 with a standard deviation of 8. If the test scores are normally distributed, find the probability that a student receives a test score between 60 and 80.

P(60 < x < 80) = P(−2.25 < z < 0.25) = P(z < 0.25) − P(z < −2.25)

The probability that a student receives a test score between 60 and 80 is 0.5865.

μ =0

z

?

?

0.25

−2.25

= 0.5987 − 0.0122 = 0.5865

60

80

μ =78

P(60 < x < 80)

μ = 78

σ = 8

x

147 of 181

§ 5.3

Normal Distributions: Finding Values

148 of 181

Finding z-Scores

Example:

Find the z-score that corresponds to a cumulative area of 0.9973.

z

.00

.01

.02

.03

.04

.05

.06

.07

.08

.09

0.0

.5000

.5040

.5080

.5120

.5160

.5199

.5239

.5279

.5319

.5359

0.1

.5398

.5438

.5478

.5517

.5557

.5596

.5636

.5675

.5714

.5753

0.2

.5793

.5832

.5871

.5910

.5948

.5987

.6026

.6064

.6103

.6141

2.6

.9953

.9955

.9956

.9957

.9959

.9960

.9961

.9962

.9963

.9964

2.7

.9965

.9966

.9967

.9968

.9969

.9970

.9971

.9972

.9973

.9974

2.8

.9974

.9975

.9976

.9977

.9977

.9978

.9979

.9979

.9980

.9981

Find the z-score by locating 0.9973 in the body of the Standard Normal Table. The values at the beginning of the corresponding row and at the top of the column give the z-score.

The z-score is 2.78.

Appendix B: Standard Normal Table

2.7

.08

149 of 181

Finding z-Scores

Example:

Find the z-score that corresponds to a cumulative area of 0.4170.

z

.09

.08

.07

.06

.05

.04

.03

.02

.01

.00

−3.4

.0002

.0003

.0003

.0003

.0003

.0003

.0003

.0003

.0003

.0003

−0.2

.0003

.0004

.0004

.0004

.0004

.0004

.0004

.0005

.0005

.0005

Find the z-score by locating 0.4170 in the body of the Standard Normal Table. Use the value closest to 0.4170.

−0.3

.3483

.3520

.3557

.3594

.3632

.3669

.3707

.3745

.3783

.3821

−0.2

.3859

.3897

.3936

.3974

.4013

.4052

.4090

.4129

.4168

.4207

−0.1

.4247

.4286

.4325

.4364

.4404

.4443

.4483

.4522

.4562

.4602

−0.0

.4641

.4681

.4724

.4761

.4801

.4840

.4880

.4920

.4960

.5000

Appendix B: Standard Normal Table

Use the closest area.

The z-score is −0.21.

−0.2

.01

150 of 181

Finding a z-Score Given a Percentile

Example:

Find the z-score that corresponds to P75.

The z-score that corresponds to P75 is the same z-score that corresponds to an area of 0.75.

The z-score is 0.67.

?

μ =0

z

0.67

Area = 0.75

151 of 181

Transforming a z-Score to an x-Score

To transform a standard z-score to a data value, x, in a given population, use the formula

Example:

The monthly electric bills in a city are normally distributed with a mean of $120 and a standard deviation of $16. Find the x-value corresponding to a z-score of 1.60.

We can conclude that an electric bill of $145.60 is 1.6 standard deviations above the mean.

152 of 181

Finding a Specific Data Value

Example:

The weights of bags of chips for a vending machine are normally distributed with a mean of 1.25 ounces and a standard deviation of 0.1 ounce. Bags that have weights in the lower 8% are too light and will not work in the machine. What is the least a bag of chips can weigh and still work in the machine?

The least a bag can weigh and still work in the machine is 1.11 ounces.

?

0

z

8%

P(z < ?) = 0.08

P(z < −1.41) = 0.08

−1.41

1.25

x

?

1.11

153 of 181

§ 5.4

Sampling Distributions and the Central Limit Theorem

154 of 181

Sampling Distributions

Population

Sample

A sampling distribution is the probability distribution of a sample statistic that is formed when samples of size n are repeatedly taken from a population.

Sample

Sample

Sample

Sample

Sample

Sample

Sample

Sample

Sample

155 of 181

Sampling Distributions

If the sample statistic is the sample mean, then the distribution is the sampling distribution of sample means.

Sample 1

Sample 4

Sample 3

Sample 6

The sampling distribution consists of the values of the sample means,

Sample 2

Sample 5

156 of 181

Properties of Sampling Distributions

Properties of Sampling Distributions of Sample Means

  1. The mean of the sample means, is equal to the population mean.

  • The standard deviation of the sample means, is equal to the population standard deviation, divided by the square root of n.

The standard deviation of the sampling distribution of the sample means is called the standard error of the mean.

157 of 181

Sampling Distribution of Sample Means

Example:

The population values {5, 10, 15, 20} are written on slips of paper and put in a hat. Two slips are randomly selected, with replacement.

    • Find the mean, standard deviation, and variance of the population.

Continued.

Population

5

10

15

20

158 of 181

Sampling Distribution of Sample Means

Example continued:

The population values {5, 10, 15, 20} are written on slips of paper and put in a hat. Two slips are randomly selected, with replacement.

    • Graph the probability histogram for the population values.

Continued.

This uniform distribution shows that all values have the same probability of being selected.

Population values

Probability

0.25

5

10

15

20

x

P(x)

Probability Histogram of Population of x

159 of 181

Sampling Distribution of Sample Means

Example continued:

The population values {5, 10, 15, 20} are written on slips of paper and put in a hat. Two slips are randomly selected, with replacement.

    • List all the possible samples of size n = 2 and calculate the mean of each.

15

10, 20

12.5

10, 15

10

10, 10

7.5

10, 5

12.5

5, 20

10

5, 15

7.5

5, 10

5

5, 5

Sample mean,

Sample

20

20, 20

17.5

20, 15

15

20, 10

12.5

20, 5

17.5

15, 20

15

15, 15

12.5

15, 10

10

15, 5

Sample mean,

Sample

Continued.

These means form the sampling distribution of the sample means.

160 of 181

Sampling Distribution of Sample Means

Example continued:

The population values {5, 10, 15, 20} are written on slips of paper and put in a hat. Two slips are randomly selected, with replacement.

    • Create the probability distribution of the sample means.

Probability Distribution of Sample Means

0.0625

1

20

0.1250

2

17.5

0.1875

3

15

0.2500

4

12.5

0.1875

3

10

0.1250

2

7.5

0.0625

1

5

161 of 181

Sampling Distribution of Sample Means

Example continued:

The population values {5, 10, 15, 20} are written on slips of paper and put in a hat. Two slips are randomly selected, with replacement.

    • Graph the probability histogram for the sampling distribution.

The shape of the graph is symmetric and bell shaped. It approximates a normal distribution.

Sample mean

Probability

0.25

P(x)

Probability Histogram of Sampling Distribution

0.20

0.15

0.10

0.05

17.5

20

15

12.5

10

7.5

5

162 of 181

The Central Limit Theorem

the sample means will have a normal distribution.

If a sample of size n ≥ 30 is taken from a population with any type of distribution that has a mean = μ and standard deviation = σ,

x

x

163 of 181

The Central Limit Theorem

If the population itself is normally distributed, with mean = μ and standard deviation = σ,

the sample means will have a normal distribution for any sample size n.

x

164 of 181

The Central Limit Theorem

In either case, the sampling distribution of sample means has a mean equal to the population mean.

Mean of the sample means

Standard deviation of the sample means

The sampling distribution of sample means has a standard deviation equal to the population standard deviation divided by the square root of n.

This is also called the standard error of the mean.

165 of 181

The Mean and Standard Error

Example:

The heights of fully grown magnolia bushes have a mean height of 8 feet and a standard deviation of 0.7 feet. 38 bushes are randomly selected from the population, and the mean of each sample is determined. Find the mean and standard error of the mean of the sampling distribution.

Mean

Standard deviation (standard error)

Continued.

166 of 181

Interpreting the Central Limit Theorem

Example continued:

The heights of fully grown magnolia bushes have a mean height of 8 feet and a standard deviation of 0.7 feet. 38 bushes are randomly selected from the population, and the mean of each sample is determined.

From the Central Limit Theorem, because the sample size is greater than 30, the sampling distribution can be approximated by the normal distribution.

The mean of the sampling distribution is 8 feet ,and the standard error of the sampling distribution is 0.11 feet.

167 of 181

Finding Probabilities

Example:

The heights of fully grown magnolia bushes have a mean height of 8 feet and a standard deviation of 0.7 feet. 38 bushes are randomly selected from the population, and the mean of each sample is determined.

Find the probability that the mean height of the 38 bushes is less than 7.8 feet.

The mean of the sampling distribution is 8 feet, and the standard error of the sampling distribution is 0.11 feet.

7.8

Continued.

168 of 181

Finding Probabilities

P ( < 7.8) = P (z < ____ )

?

−1.82

Example continued:

Find the probability that the mean height of the 38 bushes is less than 7.8 feet.

7.8

The probability that the mean height of the 38 bushes is less than 7.8 feet is 0.0344.

= 0.0344

P ( < 7.8)

169 of 181

Probability and Normal Distributions

Example:

The average on a statistics test was 78 with a standard deviation of 8. If the test scores are normally distributed, find the probability that the mean score of 25 randomly selected students is between 75 and 79.

0

z

?

?

0.63

−1.88

Continued.

P (75 < < 79)

75

79

78

170 of 181

Probability and Normal Distributions

Example continued:

Approximately 70.56% of the 25 students will have a mean score between 75 and 79.

= 0.7357 − 0.0301 = 0.7056

0

z

?

?

0.63

−1.88

P (75 < < 79)

P(75 < < 79) = P(−1.88 < z < 0.63) = P(z < 0.63) − P(z < −1.88)

75

79

78

171 of 181

Probabilities of x and x

Example:

The population mean salary for auto mechanics is μ = $34,000 with a standard deviation of σ = $2,500. Find the probability that the mean salary for a randomly selected sample of 50 mechanics is greater than $35,000.

0

z

?

2.83

= P (z > 2.83)

= 1 − P (z < 2.83)

= 1 − 0.9977

= 0.0023

The probability that the mean salary for a randomly selected sample of 50 mechanics is greater than $35,000 is 0.0023.

35000

34000

P ( > 35000)

172 of 181

Probabilities of x and x

Example:

The population mean salary for auto mechanics is μ = $34,000 with a standard deviation of σ = $2,500. Find the probability that the salary for one randomly selected mechanic is greater than $35,000.

0

z

?

0.4

= P (z > 0.4)

= 1 − P (z < 0.4)

= 1 − 0.6554

= 0.3446

The probability that the salary for one mechanic is greater than $35,000 is 0.3446.

(Notice that the Central Limit Theorem does not apply.)

35000

34000

P (x > 35000)

173 of 181

Probabilities of x and x

Example:

The probability that the salary for one randomly selected mechanic is greater than $35,000 is 0.3446. In a group of 50 mechanics, approximately how many would have a salary greater than $35,000?

P(x > 35000) = 0.3446

This also means that 34.46% of mechanics have a salary greater than $35,000.

You would expect about 17 mechanics out of the group of 50 to have a salary greater than $35,000.

34.46% of 50 = 0.3446 × 50 = 17.23

174 of 181

§ 5.5

Normal Approximations to Binomial Distributions

175 of 181

Normal Approximation

The normal distribution is used to approximate the binomial distribution when it would be impractical to use the binomial distribution to find a probability.

Normal Approximation to a Binomial Distribution

If np ≥ 5 and nq ≥ 5, then the binomial random variable x is approximately normally distributed with mean

and standard deviation

176 of 181

Normal Approximation

Example:

Decided whether the normal distribution to approximate x may be used in the following examples.

    • Thirty-six percent of people in the United States own a dog. You randomly select 25 people in the United States and ask them if they own a dog.

    • Fourteen percent of people in the United States own a cat. You randomly select 20 people in the United States and ask them if they own a cat.

Because np and nq are greater than 5, the normal distribution may be used.

Because np is not greater than 5, the normal distribution may NOT be used.

177 of 181

Correction for Continuity

The binomial distribution is discrete and can be represented by a probability histogram.

This is called the correction for continuity.

When using the continuous normal distribution to approximate a binomial distribution, move 0.5 unit to the left and right of the midpoint to include all possible x-values in the interval.

To calculate exact binomial probabilities, the binomial formula is used for each value of x and the results are added.

Exact binomial probability

c

P(x = c)

P(c0.5 < x < c + 0.5)

Normal approximation

c

c + 0.5

c − 0.5

178 of 181

Correction for Continuity

Example:

Use a correction for continuity to convert the binomial intervals to a normal distribution interval.

    • The probability of getting between 125 and 145 successes, inclusive.

The discrete midpoint values are 125, 126, …, 145.

The continuous interval is 124.5 < x < 145.5.

    • The probability of getting exactly 100 successes.

The discrete midpoint value is 100.

The continuous interval is 99.5 < x < 100.5.

    • The probability of getting at least 67 successes.

The discrete midpoint values are 67, 68, ….

The continuous interval is x > 66.5.

179 of 181

Guidelines

Using the Normal Distribution to Approximate Binomial Probabilities

In Words In Symbols

  1. Verify that the binomial distribution applies.
  2. Determine if you can use the normal distribution to approximate x, the binomial variable.
  3. Find the mean μ and standard deviationσ for the distribution.
  4. Apply the appropriate continuity correction. Shade the corresponding area under the normal curve.
  5. Find the corresponding z-value(s).
  6. Find the probability.

Specify n, p, and q.

Is np ≥ 5?

Is nq ≥ 5?

Add or subtract 0.5 from endpoints.

Use the Standard Normal Table.

180 of 181

Approximating a Binomial Probability

Example:

Thirty-one percent of the seniors in a certain high school plan to attend college. If 50 students are randomly selected, find the probability that less than 14 students plan to attend college.

np = (50)(0.31) = 15.5

nq = (50)(0.69) = 34.5

The variable x is approximately normally distributed with μ = np = 15.5 and

P(x < 13.5)

Correction for continuity

= P(z < −0.61)

10

15

x

20

μ= 15.5

13.5

= 0.2709

The probability that less than 14 plan to attend college is 0.2079.

181 of 181

Approximating a Binomial Probability

Example:

A survey reports that forty-eight percent of US citizens own computers. 45 citizens are randomly selected and asked whether he or she owns a computer. What is the probability that exactly 10 say yes?

np = (45)(0.48) = 12

nq = (45)(0.52) = 23.4

P(9.5 < x < 10.5)

Correction for continuity

5

10

x

15

μ = 12

9.5

= 0.0997

The probability that exactly 10 US citizens own a computer is 0.0997.

10.5

= P(−0.75 < z − 0.45)