1 of 23

The Effect of Beech Bark Disease on Radial Growth: A Statistical Investigation

By Julia Worland

2 of 23

Part 1:

Mathematical Concepts

1

3 of 23

The Normal Distribution

In a frequency distribution, data can be distributed in many different ways.

Properties of the Normal Distribution

  • symmetry about the center
  • 68% of values are within 1 SD of the mean
  • 95% of values are within 2 SD of the mean
  • 99.7% of values are within 3 SD of the mean
  • mean = median = mode

4 of 23

The Sampling Distribution of the Mean of Quantitative Data

When a population has a mean of μ and a true standard deviation σ, we predict that sample means taken from the population has a sampling distribution with the same mean μ but whose standard deviation is σ / √n.

inversely proportional

mean = μ

SD = σ

mean = μ

SD = σ / √5

mean = μ

SD = σ / √30

Example:

μ(x̅) = μ

σ(x̅) = σ / √n

5 of 23

Null & Alternative Hypothesis

The Null Hypothesis:

  • states that a population parameter is equal to a hypothesized value
  • based on previous analyses or specialized knowledge

The Alternative Hypothesis:

  • states that a population parameter is smaller, greater, or different than the hypothesized value in the null hypothesis

Ho:

parameter = hypothesized value

Ha:

parameter ≠ hypothesized value

hypothesis test for the mean of a single population:

Ho: µo = µ

Ha: µo ≠ µ

hypothesis test for the difference in mean of two populations

Ho: μ1=μ2 , μ1-μ2=0

Ha: μ1≠μ2 , μ1-μ2≠0

6 of 23

Student’s T-Model

Student’s T-model is used when estimating the mean of a normally-distributed population in situations where the sample size is small and the population's standard deviation is unknown

Characteristics:

  • unimodal
  • symmetric
  • bell-shaped models
  • change in shape depending on n.

the smaller a sample size, the taller the tails.

= degrees of freedom = n-1

= normal distribution

7 of 23

Assumptions for T-Models

  • Independence assumption: The sampled values must be independent of and not influence each other.

  • Randomization condition: The sample population used should be a simple random sample representative of the whole target population

  • 10% condition: the sample size, n, must be no larger than 10% of the entire target population

  • Nearly normal condition: The distribution of the variable measured should be generally normal in shape

Random Sampling

Population

Sample

8 of 23

T-Scores

1-sample t-score:

  • a measure of how unusual a mean is
  • how many standard errors a given average is above or below what we assume to be the true average of a population

ȳ - μ

t = --------

SE

2-sample t-score

  • a measure of how unusual a difference between two means is
  • how many standard errors a given difference in averages is above what we assume to be the true difference between averages

(ȳ1 - ȳ2) - 0 ← Ho

t = --------------------

SE (ȳ1 - ȳ2)

ȳ = average from a sample population

μ = average under null hypothesis

SE(ȳ) = s/√n

= an estimation of the SD(μ)

(ȳ1 - ȳ2) = difference of averages from 2 sampling populations

0 = difference of averages under null hypothesis

s1^2 s2^2

SE(y1-y2) = √ ------- + -------

n1 n2

= an estimation of the SD(μ1 - μ1)

9 of 23

T-Intervals

n-1 degrees of freedom

confidence level % of distribution area

{

t critical value

t-interval for a mean:

ȳ ± ME

  • ME = t(n-1) * SE(ȳ)
  • SE(ȳ) = s / √n

t-interval for a difference in means:

(ȳ1-ȳ2) ± ME

  • ME = t(n-1) * SE(ȳ1-ȳ2)

s1^2 s2^2

  • SE(ȳ1-ȳ2) = √ ------ + ------

n1 n2

Interpreting a confidence interval:

If we created an interval based around your sample statistic:

“We are confidence-level% confident that the true population parameter is within your interval.

If we created an interval based around your population:

“We are confidence-level% confident that a given sample statistic is within your interval.

10 of 23

P-Value

degrees of freedom = n-1 OR (n1 + n2) -2

= t-score of sample statistic

= p-value (2-tailed) = 0.0073462754

In null hypothesis significance testing, the p-value is the probability of obtaining test results at least as extreme as the results actually observed, under the assumption that the null hypothesis is correct.

the null hypothesis is rejected if the p-value is less than the probability of a type 1 error = 𝜶 = 0.05, by convention

How to calculate p-value:

  1. calculating a t-score
  2. create a t-distribution with the appropriate degrees of freedom
  3. find the area of the distribution that is at least as extreme as that z score

11 of 23

Part 2: Application of Mathematical Concepts

2

12 of 23

Background

Neonectria

Cryptococcus fagisuga

Xylococcus betulae

Beech bark disease (BBD):

  • a disease that causes mortality and defects in beech trees in the eastern United States, Canada and Europe.
  • involves scale insects (Cryptococcus fagisuga and Xylococcus betulae) and a fungal pathogen (Neonectria ditissima and Neonectria faginata).
  • occurs when the bark of a tree is attacked by the beech scale insect and is then ultimately destroyed by the fungus
  • occurs in 3 stages: the advancing front, the killing front, the aftermath zone

(Nbrazee, 2019) (Crandall Park Trees)

effects on tree health

  • greater levels of damage
  • poorer tree crown conditions
  • splitting during windy conditions
  • drought and cold sensitivity

(Koch, 2010)

uninfected beech tree

infected beech tree

13 of 23

Origin of the Idea

Beech trees in my backyard affected with BBD

The Lewis Deane Nature Preserve, Poultney, Vermont

14 of 23

Research Question, Variables, and Hypothesis

1. First, I chose to investigate the following question:

Is the mean growth rate of trees infected with beech bark disease different from and those without?

explanatory variable = infection by beech bark disease as my explanatory variable (No Infection or Infected)

dependent variable = average radial growth

Null Hypothesis: The mean growth rate of healthy trees is equal to the mean growth rate of trees infected with beech bark disease.

Ho:

x̅(growth rate healthy trees) = x̅(growth rate infected trees)

Ha:

x̅(growth rate healthy trees) = x̅(growth rate infected trees)

Alternative Hypothesis: The mean growth rate of healthy trees differs from the mean growth rate of trees infected with beech bark disease.

15 of 23

Data Collection

Trees were measured using a diameter tape.

Trees were ranked using Jacob M. Griffin 1-5 beech bark severity scale (Griffin, 2003):

  • 1 = no visible sign of infection
  • 2-4 = varying severities of beech bark disease
  • 5 = tree in question is dead due to BBD

Visible Signs of Infection:

white waxy covering

red/brown spots

cankering

foliage reduction

and color change

cracking

16 of 23

Calculating Growth Rates

Tree ID

healthy range

healthy growth

infected range

infected range

infected growth

1

2

...

Tree ID

healthy growth rate

infected growth rate

1

2

...

growth

growth rate = -------------------

range of years

(final year - initial year)

17 of 23

Distributions

18 of 23

Assumptions / Conditions

  • Independence assumption: While clumping of similar growth rates and years of infection can occur, we can assume that, in general, the growth rates of beech trees are independent of one another.

  • Randomization condition: We assume that random techniques were employed and a representative sample was selected.

  • 10% condition: The 207 trees used in the study are less than 10% of all American beech trees.

  • Nearly normal condition: The distributions of both healthy and infected growth rates are unimodal, but both are positively skewed. However, past studies suggested radial tree growth to be a normal variable).

19 of 23

T-Test and P-Value

degrees of freedom = 149.0085797

= t-score of 2.71802195653

= p-value = 0.0073462754 = 0.73462754%

(ȳ1 - ȳ2) - 0 ← Ho

t = ----------------

SE (ȳ1 - y2)

(0.2395 - 0.1654) - 0 ← Ho

t = -----------------------

0.2021^2 0.1614^2

√------- + ---------

81 109

= 2.71802195653

20 of 23

T-Interval

formula:

confidence interval = (ȳ1-ȳ2) ± ME

ME = t(df) * SE(ȳ1-ȳ2)

s1^2 s2^2

SE(ȳ1-ȳ2) = √ ------ + ------

n1 n2

my calculation:

ȳ1-ȳ2 = 0.2395 - 0.1654 = 0.0781

ME = t(df) * SE(ȳ1-ȳ2)

t(149.0085797) = 1.976012213.

0.2021^2 0.1614^2

SE(ȳ1-ȳ2) = √ -------- + --------

81 109

= 0.02726247274

ME = 1.976012213 * 0.02726247274

= 0.0538709791

Confidence interval

= 0.0781 ± 0.053870979 = ( 0.024229021 , 0.131970979 )

conclusion:

  • We are 95% confident that the true difference in mean growth rates between healthy beech trees and those infected by beech bark disease is between 0.0242 and 0.132 cm in diameter per year.
  • If repeated infinitely, 95% of intervals would capture the true difference in means.
  • This interval does not include 0, so we conclude that it is unlikely that the true mean growth rates are equal.

21 of 23

The Future of Null Hypothesis Significance Testing:

My Ideas

  • redefine α
  • make it commonplace to provide justification for α-levels (own experiment or site past experiments)
  • Depending on the alpha level these prior experiments produce and depending on how exact experimenters need their analysis to be, the experiment may need to be redesigned

�Application to our beech bark disease study:

Perhaps, determining the true alpha level of our experiment and then re-evaluating our data could be the next step in strengthening our statistical analysis.

(On the Past and Future of Null Hypothesis Significance Testing 2001)

Common System

Proposed System

Question

Hypothesis

Design Experiment

Collect Data

Analyze Data

Conclusion

Question

Hypothesis

Design Experiment

Determine alpha

Collect Data

Analyze Data

Conclusion

22 of 23

Sources:

Crandall Park Trees. (n.d.). Retrieved April 03, 2021, from http://mdocs.skidmore.edu/crandallparktrees/invasives/beech-bark-disease/

Griffin, J. M., Lovett, G. M., Arthur, M. A., & Weathers, K. C. (2003). The distribution and severity of beech bark disease in the Catskill Mountains, n.y. Canadian Journal of Forest Research, 33(9), 1754-1760. doi:10.1139/x03-093

Koch, J. L. (2010). Beech bark disease: The OLDEST "NEW" threat to American beech in the United States. Outlooks on Pest Management, 21(2), 64-68. doi:10.1564/21apr03

Nbrazee. (2019, November 26). Beech bark disease. Retrieved April 03, 2021, from https://ag.umass.edu/landscape/fact-sheets/beech-bark-disease

On the Past and Future of Null Hypothesis Significance Testing (Rep.). (2001, December). Retrieved https://www.ets.org/Media/Research/pdf/RR-01-24-Wainer.pdf

23 of 23

Images Used:

https://en.wikipedia.org/wiki/Histogram

https://www.sciencedirect.com/topics/mathematics/frequency-polygon

https://faculty.elgin.edu/dkernler/statistics/ch02/2-1.html

http://sites.utexas.edu/sos/guided/descriptive/descriptivec/frequency/#:~:text=Relative%20frequencies%20are%20more%20commonly,to%20the%20overall%20sample%20size.&text=Pie%20charts%20represent%20relative%20frequencies,whole%20pie%20each%20category%20represents.

https://help.ezbiocloud.net/box-plot/

https://america.bioweb.co/products/forestry-suppliers-fabric-diameter-tape

https://geocuse.syr.edu/sag/physical/students-tree-dbh-measurement-1/

https://www.hortweek.com/beech-bark-disease-measures-strategies-best-protect-woodland-forests-against-beech-bark-disease/ornamentals/article/1590240

https://ag.umass.edu/landscape/fact-sheets/beech-bark-disease

http://magazine.nurserymag.com/article/february-2018/beech-bark-disease.aspx

https://www.dec.ny.gov/lands/120589.html

https://vtinvasives.org/invasive/beech-bark-disease

https://en.wikipedia.org/wiki/Beech_bark_disease

https://en.wikipedia.org/wiki/Cryptococcus_fagisuga

http://www.idtools.org/id/scales/factsheet.php?name=6846

https://vtinvasives.org/invasive/beech-bark-disease

https://www.mdpi.com/1999-4907/8/5/155/htm

https://america.bioweb.co/products/forestry-suppliers-fabric-diameter-tape

https://geocuse.syr.edu/sag/physical/students-tree-dbh-measurement-1/

https://www.hortweek.com/beech-bark-disease-measures-strategies-best-protect-woodland-forests-against-beech-bark-disease/ornamentals/article/1590240

https://ag.umass.edu/landscape/fact-sheets/beech-bark-disease

http://magazine.nurserymag.com/article/february-2018/beech-bark-disease.aspx

https://www.dec.ny.gov/lands/120589.html

https://vtinvasives.org/invasive/beech-bark-disease