Chapter 15��Statistical Analysis of Quantitative Data���Dr Sanaa Abujilban, PhD
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Statistical Analysis
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Descriptive Indexes
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Phases of Data Analysis
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Phases of Data Analysis:
5. Representation of data through graphs and tables
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Statistics
1. Univaraite “frequency distribution”.
2. Measures of central tendency (mode, median, mean), and
measures of variability (SD, variance (the degree that observations are dispersed around the central value), range).
3. Bivaraite analysis (contingency table, correlation- such as
Pearson or Product Moment Correlation Coefficient, and
Spearman’s correlation coefficient).
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Descriptive Statistics: Frequency Distributions
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Frequency Polygon
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Shapes of Distributions
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
All the distributions in Figure are
symmetric.
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Shapes of Distributions (cont.)�aspect of a distribution’s shape
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Normal Distribution; (sometimes�called a bell-shaped curve)
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Question
Is the following statement True or False?
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Answer
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Central Tendency
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Comparison of Measures of Central Tendency
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Distributions (cont.)
Category Percent
Under 35 9%
36-45 21
46-55 45
56-65 19
66+ 6
A Frequency Distribution Table
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Relationships of central tendency indexes in skewed distributions.
Relationships of central tendency indexes in skewed distributions.
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Variability
The degree to which scores in a distribution are spread out or dispersed
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Two distributions of different variability
School A has a wide range of scores
there are few students at either extreme
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Indexes of Variability
That express the extent to which scores deviate from one another
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Standard deviations in a normal distribution
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Standard deviations in a normal distribution.
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Bivariate Descriptive Statistics
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Contingency Table
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Correlation Coefficients
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Question
The researcher subtracts the lowest value of data from the highest value of data to obtain which of the following?
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Answer
d. Range
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Correlation Coefficients (cont.)
-1.00 to +1.00
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Correlation Coefficients (cont.)
Ex: r = -.45 is stronger than r = +.40
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Bivaraite Analysis : Correlation
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Bivaraite Analysis: Correlation
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Bivaraite Analysis �2. Correlation
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Index of Correlation
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
scatter plot
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Figure 1 - top left - is an example of �a positive correlation.�
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Figure 2 is an example of a zero correlation.
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Figure 3 is an example of �a negative correlation.
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Describing Risk
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Describing Risk (cont.)
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
The Odds Ratio (OR)
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Inferential Statistics
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Sampling Distribution of the Mean
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Statistical Inference—Two Forms
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Question
Is the following statement True or False?
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Answer
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Estimation of Parameters
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Confidence Intervals
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Inferential Statistics �Hypothesis Testing
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Hypothesis Testing
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Hypothesis Testing
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
critical region
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Hypothesis Testing (cont.)
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
If the value of the test statistic indicates that the null hypothesis is improbable, then the result is statistically significant
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
critical region and statistical tests
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Errors in Statistical Decisions
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Parametric and Nonparametric Tests
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Overview of Hypothesis-Testing Procedures
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Inferential Statistics
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Inferential Statistics
The most common used inferential statistics are:
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Commonly Used Bivariate Statistical Tests
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Question
Is the following statement True or False?
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Answer
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
t -Test
e.g., means for men vs. women
e.g., means for patients before and after surgery
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Analysis of Variance (ANOVA)
Tests the difference between more than 2 means (3 or more groups)
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Chi-Squared Test
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Correlation
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Effect Size
d ≤ .20, small effect
d = .50, moderate effect
d ≥ .80, large effect
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Multivariate Statistics
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Multiple Linear Regression
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Multiple Correlation Coefficient (R )
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Analysis of Covariance (ANCOVA)
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Visual respresentation of analysis of covariance
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Question
Which test would be used to compare the observed frequencies with expected frequencies within a contingency table?
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Answer
b. Chi-squared test
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Logistic Regression
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Factor Analysis
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Multivariate Analysis of Variance�
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Causal Modeling
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Univariate Tests
Level of Measurement | Test |
Categorical | Chi square test |
Non-categorical | One-sample t test |
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Bivariate Tests
VARIABLE 1 (IDV) | VARIABLE 2 (DV) | STATISTICAL TEST |
Categorical | Categorical | Chi square test for crosstables |
Categorical (2 groups or values) | Non-categorical | Independent samples t-test |
Categorical (> 2 groups or values) | Non-categorical | ANOVA |
Non-Categorical | Non-Categorical | Correlation/ Regression |
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
PREDICTOR VARIABLE (S) | OUTCOME VARIABLE | |
| Categorical | Continuous |
Categorical | Chi-square, Log linear, Logistic | T-test, ANOVA (Analysis of Varirance), Linear regression |
Continuous | Logistic regression | Linear regression, Pearson correlation |
Mixture of Categorical and Continuous | Logistic regression | Linear regression, Analysis of Covariance |
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Summary of Tests
- One -way ANOVA- is used for a quantitative dependent variable by a single factor (independent) variable.
- Two-way ANOVA- is used for one dependent variable by one or more factors and/or variables.
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Summary of Tests
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Guidelines for Critiquing Quantitative Analyses
1. Does the report include any descriptive statistics? Do these statistics sufficiently describe the major characteristics of the researcher’s data set?
2. Were indices of both central tendency and variability provided in the report? If not, how does the absence of this information affect the reader’s understanding of the research variables?
3. Were the correct descriptive statistics used (e.g., was a median used when a mean would have been more appropriate)?
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Guidelines for Critiquing Quantitative Analyses
4. Does the report include any inferential statistics? Was a statistical test performed for each of the hypotheses or research questions? If inferential statistics were not used, should they have been?
5. Was the selected statistical test appropriate, given the level of measurement of the variables?
6. Was a parametric test used? Does it appear that the assumptions for the use of parametric tests were met? If a nonparametric test was used, should a more powerful parametric procedure have been used instead?
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Guidelines for Critiquing Quantitative Analyses
7. Were any multivariate procedures used? If so, does it appear that the researcher chose the appropriate test? If multivariate procedures were not used, should they have been? Would the use of a multivariate procedure have improved the researcher’s ability to draw conclusions about the relationship between the dependent and independent variables?
8. In general, does the report provide a rationale for the use of the selected statistical tests? Does the report contain sufficient information for you to judge whether appropriate statistics were used?
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
Guidelines for Critiquing Quantitative Analyses
9. Was there an appropriate amount of statistical information reported? Are the findings clearly and logically organized?
10. Were the results of any statistical tests significant? What do the tests tell you about the plausibility of the research hypotheses?
11. Were tables used judiciously to summarize large masses of statistical information? Are the tables clearly presented, with good titles and carefully labeled column headings? Is the information presented in the text consistent with the information presented in the tables? Is the information totally redundant?
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins
End of Presentation
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins