The Effect of Beech Bark Disease on Radial Growth: A Statistical Investigation
By Julia Worland
Part 1:
Mathematical Concepts
1
The Normal Distribution
In a frequency distribution, data can be distributed in many different ways.
Properties of the Normal Distribution
The Sampling Distribution of the Mean of Quantitative Data
When a population has a mean of μ and a true standard deviation σ, we predict that sample means taken from the population has a sampling distribution with the same mean μ but whose standard deviation is σ / √n.
inversely proportional
mean = μ
SD = σ
mean = μ
SD = σ / √5
mean = μ
SD = σ / √30
Example:
μ(x̅) = μ
σ(x̅) = σ / √n
Null & Alternative Hypothesis
The Null Hypothesis:
The Alternative Hypothesis:
Ho:
parameter = hypothesized value
Ha:
parameter ≠ hypothesized value
hypothesis test for the mean of a single population:
Ho: µo = µ
Ha: µo ≠ µ
hypothesis test for the difference in mean of two populations
Ho: μ1=μ2 , μ1-μ2=0
Ha: μ1≠μ2 , μ1-μ2≠0
Student’s T-Model
Student’s T-model is used when estimating the mean of a normally-distributed population in situations where the sample size is small and the population's standard deviation is unknown
Characteristics:
the smaller a sample size, the taller the tails.
= degrees of freedom = n-1
= normal distribution
Assumptions for T-Models
Random Sampling
Population
Sample
T-Scores
1-sample t-score:
ȳ - μ
t = --------
SE
2-sample t-score
(ȳ1 - ȳ2) - 0 ← Ho
t = --------------------
SE (ȳ1 - ȳ2)
ȳ = average from a sample population
μ = average under null hypothesis
SE(ȳ) = s/√n
= an estimation of the SD(μ)
(ȳ1 - ȳ2) = difference of averages from 2 sampling populations
0 = difference of averages under null hypothesis
s1^2 s2^2
SE(y1-y2) = √ ------- + -------
n1 n2
= an estimation of the SD(μ1 - μ1)
T-Intervals
n-1 degrees of freedom
confidence level % of distribution area
{
t critical value
t-interval for a mean:
ȳ ± ME
t-interval for a difference in means:
(ȳ1-ȳ2) ± ME
s1^2 s2^2
n1 n2
Interpreting a confidence interval:
If we created an interval based around your sample statistic:
“We are confidence-level% confident that the true population parameter is within your interval.
If we created an interval based around your population:
“We are confidence-level% confident that a given sample statistic is within your interval.
P-Value
degrees of freedom = n-1 OR (n1 + n2) -2
= t-score of sample statistic
= p-value (2-tailed) = 0.0073462754
In null hypothesis significance testing, the p-value is the probability of obtaining test results at least as extreme as the results actually observed, under the assumption that the null hypothesis is correct.
the null hypothesis is rejected if the p-value is less than the probability of a type 1 error = 𝜶 = 0.05, by convention
How to calculate p-value:
Part 2: Application of Mathematical Concepts
2
Background
Neonectria
Cryptococcus fagisuga
Xylococcus betulae
Beech bark disease (BBD):
(Nbrazee, 2019) (Crandall Park Trees)
effects on tree health
(Koch, 2010)
uninfected beech tree
infected beech tree
Origin of the Idea
Beech trees in my backyard affected with BBD
The Lewis Deane Nature Preserve, Poultney, Vermont
Research Question, Variables, and Hypothesis
1. First, I chose to investigate the following question:
Is the mean growth rate of trees infected with beech bark disease different from and those without?
explanatory variable = infection by beech bark disease as my explanatory variable (No Infection or Infected)
dependent variable = average radial growth
Null Hypothesis: The mean growth rate of healthy trees is equal to the mean growth rate of trees infected with beech bark disease.
Ho:
x̅(growth rate healthy trees) = x̅(growth rate infected trees)
Ha:
x̅(growth rate healthy trees) = x̅(growth rate infected trees)
Alternative Hypothesis: The mean growth rate of healthy trees differs from the mean growth rate of trees infected with beech bark disease.
Data Collection
Trees were measured using a diameter tape.
Trees were ranked using Jacob M. Griffin 1-5 beech bark severity scale (Griffin, 2003):
Visible Signs of Infection:
white waxy covering
red/brown spots
cankering
foliage reduction
and color change
cracking
Calculating Growth Rates
Tree ID | healthy range | healthy growth | infected range | infected range | infected growth |
1 | | | | | |
2 | | | | | |
... | | | | | |
Tree ID | healthy growth rate | infected growth rate |
1 | | |
2 | | |
... | | |
growth
growth rate = -------------------
range of years
(final year - initial year)
Distributions
Assumptions / Conditions
T-Test and P-Value
degrees of freedom = 149.0085797
= t-score of 2.71802195653
= p-value = 0.0073462754 = 0.73462754%
(ȳ1 - ȳ2) - 0 ← Ho
t = ----------------
SE (ȳ1 - y2)
(0.2395 - 0.1654) - 0 ← Ho
t = -----------------------
0.2021^2 0.1614^2
√------- + ---------
81 109
= 2.71802195653
T-Interval
formula:
confidence interval = (ȳ1-ȳ2) ± ME
ME = t(df) * SE(ȳ1-ȳ2)
s1^2 s2^2
SE(ȳ1-ȳ2) = √ ------ + ------
n1 n2
my calculation:
ȳ1-ȳ2 = 0.2395 - 0.1654 = 0.0781
ME = t(df) * SE(ȳ1-ȳ2)
t(149.0085797) = 1.976012213.
0.2021^2 0.1614^2
SE(ȳ1-ȳ2) = √ -------- + --------
81 109
= 0.02726247274
ME = 1.976012213 * 0.02726247274
= 0.0538709791
Confidence interval
= 0.0781 ± 0.053870979 = ( 0.024229021 , 0.131970979 )
conclusion:
The Future of Null Hypothesis Significance Testing:
My Ideas
�Application to our beech bark disease study:
Perhaps, determining the true alpha level of our experiment and then re-evaluating our data could be the next step in strengthening our statistical analysis.
(On the Past and Future of Null Hypothesis Significance Testing 2001)
Common System
Proposed System
Question
↓
Hypothesis
↓
Design Experiment
↓
Collect Data
↓
Analyze Data
↓
Conclusion
Question
↓
Hypothesis
↓
Design Experiment
↓
Determine alpha
↓
Collect Data
↓
Analyze Data
↓
Conclusion
Sources:
Crandall Park Trees. (n.d.). Retrieved April 03, 2021, from http://mdocs.skidmore.edu/crandallparktrees/invasives/beech-bark-disease/
Griffin, J. M., Lovett, G. M., Arthur, M. A., & Weathers, K. C. (2003). The distribution and severity of beech bark disease in the Catskill Mountains, n.y. Canadian Journal of Forest Research, 33(9), 1754-1760. doi:10.1139/x03-093
Koch, J. L. (2010). Beech bark disease: The OLDEST "NEW" threat to American beech in the United States. Outlooks on Pest Management, 21(2), 64-68. doi:10.1564/21apr03
Nbrazee. (2019, November 26). Beech bark disease. Retrieved April 03, 2021, from https://ag.umass.edu/landscape/fact-sheets/beech-bark-disease
On the Past and Future of Null Hypothesis Significance Testing (Rep.). (2001, December). Retrieved https://www.ets.org/Media/Research/pdf/RR-01-24-Wainer.pdf
Images Used:
https://en.wikipedia.org/wiki/Histogram
https://www.sciencedirect.com/topics/mathematics/frequency-polygon
https://faculty.elgin.edu/dkernler/statistics/ch02/2-1.html
https://help.ezbiocloud.net/box-plot/
https://america.bioweb.co/products/forestry-suppliers-fabric-diameter-tape
https://geocuse.syr.edu/sag/physical/students-tree-dbh-measurement-1/
https://ag.umass.edu/landscape/fact-sheets/beech-bark-disease
http://magazine.nurserymag.com/article/february-2018/beech-bark-disease.aspx
https://www.dec.ny.gov/lands/120589.html
https://vtinvasives.org/invasive/beech-bark-disease
https://en.wikipedia.org/wiki/Beech_bark_disease
https://en.wikipedia.org/wiki/Cryptococcus_fagisuga
http://www.idtools.org/id/scales/factsheet.php?name=6846
https://vtinvasives.org/invasive/beech-bark-disease
https://www.mdpi.com/1999-4907/8/5/155/htm
https://america.bioweb.co/products/forestry-suppliers-fabric-diameter-tape
https://geocuse.syr.edu/sag/physical/students-tree-dbh-measurement-1/
https://ag.umass.edu/landscape/fact-sheets/beech-bark-disease
http://magazine.nurserymag.com/article/february-2018/beech-bark-disease.aspx
https://www.dec.ny.gov/lands/120589.html
https://vtinvasives.org/invasive/beech-bark-disease