1 of 58

Hypothesis Testing:�One-Sample Inference

2 of 58

Hypothesis Testing

Statistical Hypotheses

Definition

Statistical hypothesis testing and confidence interval estimation of parameters are the fundamental methods used at the data analysis stage of a comparative experiment, in which the engineer is interested, for example, in comparing the mean of a population to a specified value.

3 of 58

Hypothesis Testing

  • Null hypothesis (H0)
  • Alternative hypothesis (H1 or Ha)

4 of 58

Null hypothesis (H0)�

  • For superiority studies we think for example
  • Average systolic blood pressure (SBP) on Drug A is different than average SBP on Drug B
  • Null of that? Usually that there is no effect
  • Mean = 0
  • Sometimes compare to a fixed value so Null
  • Mean = 120

5 of 58

How to make conclusions based on data?

  • The purpose of most experiments is to prove or disprove a hypothesis.
  • This is done by collecting data, analyzing it and drawing a conclusion.
  • The original hypothesis is tested against the data to find out whether or not it is right.

6 of 58

Example of a hypothesis

  • 636 children from Peru had their lung capacity examined. Response: FEV (Forced Expiratory Volume).
  • Scientific question:
  • Do boys and girls have different lung capacity?
  • Hypothesis:
  • H0 : There is no difference in lung capacity for boys and girls.
  • We observe:
  • Girls : mean(FEV) = 1.54 Boys : mean(FEV) = 1.66.
  • Difference = 0.12. How to deal with statistical uncertainty?

7 of 58

Never “Accept” Anything

  • Reject the null hypothesis
  • Fail to reject the null hypothesis
  • Failing to reject the null hypothesis does
  • NOT mean the null (H0) is true
  • Failing to reject the null means
  • Not enough evidence in your sample to reject the null hypothesis

8 of 58

Hypothesis Testing

Statistical Hypotheses

null hypothesis

alternative hypothesis

One-sided Alternative Hypotheses

Two-sided Alternative Hypothesis

9 of 58

Hypothesis Testing

Statistical Hypotheses

Test of a Hypothesis

  • A procedure leading to a decision about a particular hypothesis

  • Hypothesis-testing procedures rely on using the information in a random sample from the population of interest.

  • If this information is consistent with the hypothesis, then we will conclude that the hypothesis is true; if this information is inconsistent with the hypothesis, we will conclude that the hypothesis is false.

10 of 58

Hypothesis Testing

Tests of Statistical Hypotheses

Decision criteria for testing H0:μ = 50 centimeters per second versus H1:μ ≠ 50 centimeters per second.

11 of 58

Hypothesis Testing

Tests of Statistical Hypotheses

Definitions

12 of 58

Hypothesis Testing

Tests of Statistical Hypotheses

Sometimes the type I error probability is called the significance level, or the α-error, or the size of the test.

13 of 58

Hypothesis Testing

Tests of Statistical Hypotheses

14 of 58

Hypothesis Testing

15 of 58

Hypothesis Testing

16 of 58

Hypothesis Testing

17 of 58

Hypothesis Testing

Definition

  • The power is computed as 1 - β, and power can be interpreted as the probability of correctly rejecting a false null hypothesis. We often compare statistical tests by comparing their power properties.

  • For example, consider the propellant burning rate problem when

we are testing H 0 : μ = 50 centimeters per second against H 1 : μ not equal 50 centimeters per second . Suppose that the true value of the mean is μ = 52. When n = 10, we found that β = 0.2643, so the power of this test is 1 - β = 1 - 0.2643 = 0.7357 when μ = 52.

18 of 58

Hypothesis Testing

One-Sided and Two-Sided Hypotheses

Two-Sided Test:

One-Sided Tests:

19 of 58

Hypothesis Testing

P-Values in Hypothesis Tests

Definition

20 of 58

Hypothesis Testing

P-Values in Hypothesis Tests

21 of 58

Tests on the Mean of a Normal Distribution, Variance Known

Hypothesis Tests on the Mean

We wish to test:

The test statistic is:

22 of 58

Tests on the Mean of a Normal Distribution, Variance Known

Hypothesis Tests on the Mean

Reject H0 if the observed value of the test statistic z0 is either:

z0 > zα/2 or z0 < -zα/2

Fail to reject H0 if

-zα/2 < z0 < zα/2

23 of 58

Tests on the Mean of a Normal Distribution, Variance Known

24 of 58

Tests on the Mean of a Normal Distribution, Variance Known

Hypothesis Tests on the Mean

25 of 58

Tests on the Mean of a Normal Distribution, Variance Known

Hypothesis Tests on the Mean (Continued)

26 of 58

Tests on the Mean of a Normal Distribution, Variance Known

Hypothesis Tests on the Mean (Continued)

27 of 58

Tests on the Mean of a Normal Distribution, Variance Known

P-Values in Hypothesis Tests

28 of 58

Tests on the Mean of a Normal Distribution, Variance Known

Type II Error and Choice of Sample Size

Finding the Probability of Type II Error β (Eq. 9-19)

29 of 58

Tests on the Mean of a Normal Distribution, Variance Known

Type II Error and Choice of Sample Size

Finding the Probability of Type II Error β

30 of 58

Tests on the Mean of a Normal Distribution, Variance Unknown

Hypothesis Tests on the Mean

One-Sample t-Test

31 of 58

Tests on the Mean of a Normal Distribution, Variance Unknown

Hypothesis Tests on the Mean

The reference distribution for H0: μ = μ0 with critical region for (a) H1: μ ≠ μ0 , (b) H1: μ > μ0, and (c) H1: μ < μ0.

32 of 58

Tests on the Mean of a Normal Distribution, Variance Unknown

P-value for a t-Test

The P-value for a t-test is just the smallest level of significance at which the null hypothesis would be rejected.

Notice that t0 = 2.72 in Example 9-6, and that this is between two tabulated values, 2.624 and 2.977. Therefore, the P-value must be between 0.01 and 0.005. These are effectively the upper and lower bounds on the P-value.

33 of 58

Tests on the Mean of a Normal Distribution, Variance Unknown

Type II Error and Choice of Sample Size

The type II error of the two-sided alternative (for example) would be

34 of 58

Hypothesis Tests on the Variance and Standard Deviation of a Normal Distribution

Hypothesis Test on the Variance

35 of 58

Hypothesis Tests on the Variance and Standard Deviation of a Normal Distribution

Hypothesis Test on the Variance

36 of 58

Hypothesis Tests on the Variance and Standard Deviation of a Normal Distribution

Hypothesis Test on the Variance

37 of 58

Hypothesis Tests on the Variance and Standard Deviation of a Normal Distribution

Hypothesis Test on the Variance

38 of 58

Hypothesis Tests on the Variance and Standard Deviation of a Normal Distribution

39 of 58

Hypothesis Tests on the Variance and Standard Deviation of a Normal Distribution

40 of 58

Hypothesis Tests on the Variance and Standard Deviation of a Normal Distribution

Type II Error and Choice of Sample Size

For the two-sided alternative hypothesis:

Operating characteristic curves are provided in Charts VIi and VIj:

41 of 58

Hypothesis Tests on the Variance and Standard Deviation of a Normal Distribution

42 of 58

Tests on a Population Proportion

Large-Sample Tests on a Proportion

Many engineering decision problems include hypothesis testing about p.

An appropriate test statistic is

43 of 58

Tests on a Population Proportion

44 of 58

Tests on a Population Proportion

45 of 58

Tests on a Population Proportion

Another form of the test statistic Z0 is

or

46 of 58

Tests on a Population Proportion

Type II Error and Choice of Sample Size

For a two-sided alternative (Eq. 9-42)

If the alternative is p < p0 (Eq. 9-43)

If the alternative is p > p0 (Eq. 9-44)

47 of 58

Tests on a Population Proportion

Type II Error and Choice of Sample Size

For a two-sided alternative (Eq. 9-45)

For a one-sided alternative (Eq. 9-46)

48 of 58

Tests on a Population Proportion

49 of 58

Tests on a Population Proportion

50 of 58

  • An engineer who is studying the tensile strength of a steel alloy intended for use in hip replacement knows that tensile strength is approximately normally distributed with σ = 60 psi. A random sample of 12 specimens has a mean tensile strength of x = 3450 psi.

(a) Test the hypothesis that mean strength is 3500 psi. Use α = 0.01.

(b) What is the smallest level of significance at which you would be willing to reject the null hypothesis?

(c) What is the β-error for the test in part (a) if the true mean is 3470?

(d) Suppose that you wanted to reject the null hypothesis with probability at least 0.8 if mean strength μ = 3470. What sample size should be used?

(e) Explain how you could answer the question in part (a) with a two-sided confidence interval on mean tensile strength.

51 of 58

  • Medical researchers have developed a new artificial heart constructed primarily of titanium and plastic. The heart will last and operate almost indefinitely once it is implanted in the patient’s body, but the battery pack needs to be recharged about every four hours. A random sample of 50 battery packs is selected and subjected to a life test. The average life of these batteries is 4.05 hours. Assume that battery life is normally distributed with standard deviation σ = 0.2 hour.

52 of 58

  • (a) Is there evidence to support the claim that mean battery life exceeds 4 hours? Use α = 0.05.
  • (b) What is the P-value for the test in part (a)?
  • (c) Compute the power of the test if the true mean battery life is 4.5 hours.
  • (d) What sample size would be required to detect a true mean battery life of 4.5 hours if you wanted the power of the test to be at least 0.9?
  • (e) Explain how the question in part (a) could be answered by constructing a one-sided confidence bound on the mean life.

53 of 58

54 of 58

55 of 58

56 of 58

57 of 58

58 of 58