�Experimental Design, Describing Data and Statistical Analysis
AP Biology
Part 1: Experimental Design
The goal of scientific investigation is to obtain and interpret reliable scientific data.
In biology, we engage in the scientific method in order to answer questions about the natural world.
Scientific Method
Make observations – Identify an event you’d like to understand more about.
Formulate a question – What testable, data-driven question can you ask about the event?
Form a hypothesis – Write a statement that could answer the question. The statement should be both testable and falsifiable.
Test the hypothesis – Collect data and observations that helps you determine whether your hypothesis is accurate.
Data Analysis – Graphs, computations, and/or statistical tests that allow the scientist to determine patterns.
Draw conclusions – Interpret your data and observations, attempted to support or refute your hypothesis.
In science, nothing can be proven, only supported!
Variables
Variables – Things that change in an experiment.
Changes can be a part of the experimental design OR something you’re measuring in order to answer your question.
Independent variables are controlled or chosen by the scientist. On a graph, placed on the x-axis.
Dependent variables are measured or observed by the scientist. On a graph, placed on the y-axis.
Use the sentence: “The _______ depends on the ______.” to help you figure out which is which.
Practice identifying variables
Let’s start by considering some biological questions (pay attention to the IV and DV here):
How do solutes dissolved in water affect water’s properties of adhesion and cohesion?
What temperature is ideal for an enzyme-mediated rate of reaction?
How do toxins impact the rate of mitosis in onions?
Are mealworms more attracted to grains or meats?
How do environmental conditions impact the rate of transpiration?
Constants
A constant is something that should be kept the same for the entire experiment, especially between different trials.
E.g., temperature, humidity, light levels, etc.
Might also be called a “controlled variable”
Something that, if not maintained at a consistent level throughout the experiment could reasonably impact your data.
Controls
A control is something used as a standard of comparison in an experiment.
It is often the same conditions as your experimental conditions, just without the independent variable.
A basis for comparison that allows you to claim that changes in your dependent variable are likely due to your independent variable.
Any time you are interpreting data, you should intentionally and explicitly look at the control and state, “… as compared to the control” or use numbers to do this explicitly. Ex: “The number of drops of acetone that could be held on a penny was 6 less than the control.”
Positive & Negative Controls
There are two types of controls: positive and negative. Some labs have one or the other, some have both; this just depends on your experimental design.
A positive control is designed where a “known” response is expected. For example, we expect bacteria to grow on nutrient agar.
A negative control is designed where no response is expected. This shows that your experimental setup is working properly. For example, we expect that antibiotics to kill bacteria grown on nutrient agar that has been supplemented with antibiotics.
Positive and Negative Controls
Negative Control: sterile disc in distilled water
Positive Control: sterile disc in known antibiotic that works
Test Sample: (experimental) sterile disc in new antibiotic to test
Types of Data & Observations
Scientific observations can take two forms: Qualitative or Quantitative.
Qualitative data refers to the qualities of something (e.g., color, shape, texture, odor).
Ex: The liquid turned orange after we added the iron.
Quantitative data refers to the quantity of something (e.g., amount or value)
Ex: The mass increased 4 grams.
Collecting Data
We need to have a sufficiently large sample size. 40 is ideal, but it’s pretty unrealistic for our class. The data need to be normally distributed.
In some applications, repetition (multiple trials) is more realistic than collecting many individuals.
Trials occur when you do the EXACT same thing repeatedly. No variations at all.
Why are large, unbiased samples so important?
The goal is that your sample is indicative of the entire population. So the 10 plants you find the transpiration of are indicative of every plant of that species, in those conditions, on the planet. Crazy.
The goal: the sample mean and spread (called Standard Deviation) is the same as the population mean and spread.
This is where a bell curve comes from:
A large, unbiased sample allows us to make inferences of the population.
The individuals in the sample are a reasonable approximation of the population. Of course there will be variation, but as long as we have a large, unbiased sample, results should be legitimate and variation should produce a bell curve around the mean.
Part 2: Describing Data
Visual Descriptions of Data are Graphs
Use whatever graph best allows for the visualization of relationships
Bar
Line
And all of their many variations
Descriptive Statistics –
Complete calculations to understand the intricacies of the data. Your calculator will help!
Mean
Median
Standard Error of the Mean
Interquartile Range (IQR)
We will often graph a dataset’s descriptive statistics rather than the raw data in order to show relationships more clearly.
Graphing Requirements
A good title (Y vs. X of _____)
Label your axes with units, and use evenly spaced and scaled numbers to spread out the data.
Clearly mark data points.
When making a line graph, draw a line of best fit
Any extrapolation beyond the last data point should be shown with a dashed line.
What kind of graph do I make?�Bar Graphs…
Behold: the return of the BAR GRAPH! The only rule to graphing is that you make the one that represents your data the best.
Bar graph: Used for categorical independent variables.
Since we always do multiple trials, the top of the bar represents the MEAN of the trials of that data.
Might have error bars, which show the variation in the data.
Box & Whisker �Plot
Box & Whisker plots are modified bar graphs that show how the data is spread out using quartiles.
Your calculator will find your quartiles for you. ☺
Line Graphs
Line graph: Used for continuous independent variables.
Might have two y-axes for multiple scales
Might plot the means, and have error bars to show variation in data.
Might have a log scale
Constructing a Data Table
The rules are much the same as graphing: have a good title, have labels and units clearly indicated, and of course, USE A RULER.
Depending on your data, your independent variable may go on the left or across the top. Either is appropriate.
I have the data. Now what? Descriptive statistics describe your data.
Using your TI-84 to lighten the load:
97.8 | 98.1 | 98.7 | 98.4 | 98.3 | 97.9 | 97.4 |
97.7 | 98.2 | 98.3 | 97.6 | 98.9 | 98.6 | 98.5 |
Using your TI-84 to lighten the load:
To have your calculator compute descriptive statistics for this list of data:
Press “Stat”
Press over to highlight “Calc” along the top.
The top option is “1-var Stats” – select that.
Your screen will read like this:
Ensure the correct List is specified; leave FreqList blank
Press down twice to highlight “Calculate” and hit enter.
Using your TI-84 to lighten the load:
Showing descriptive statistics graphically: box and whisker plot
Showing descriptive statistics graphically: mean±2SEM
Part 3: Statistical Analysis
How do we go about answering biological questions?
Now, to statistical tests…
Taking your data to court:
Taking your data to court:
It’s your goal to attack the null hypothesis, showing you have significant evidence against it. In doing so, you lend support to (BUT DO NOT PROVE!) the alternate hypothesis.
Ultimately, you obtain a probability, or p-value: the probability of getting this data by chance alone; that your two datasets are from the same population, or your data are within reason for an expected outcome.
Biologists are typically willing to allow 5% error. So, we look for a p-value of 0.05 or less in order to reject the null hypothesis and accept the alternate hypothesis.
Two non-biological questions:
Do vehicles get better highway MPG when using cruise control compared with when not using cruise control?
Null: There is not a significant difference in the gas mileage using cruise control vs. not.
Alternate: There is a significant different in gas mileage using cruise control vs. not.
Is this weird looking coin I just found in the hallway a fair coin? Or is it “loaded”?
Null: There is not a significant difference in the number of heads vs. tails flipped on this coin compared to a fair coin.
Alternate: There is a significant…..
So, now we test the hypotheses we wrote:
There are loads of different statistical tests. You need to calculate two this year:
One when you are comparing two datasets to one another (like the gas mileage question) 🡪 two sample t-test.
One when you are comparing a single dataset to expected values (like the fair coin question: Heads vs. Tails) 🡪 Chi-Square test.
You will need to be able to interpret p-values of all tests, but these are the only two I’ll ask you to calculate.
Practice! From earlier:
Do vehicles get better highway MPG when using cruise control compared with when not using cruise control?
Null: There is not a significant difference in the gas mileage using cruise control vs. not.
Alternate: There is a significant different in gas mileage using cruise control vs. not.
Enter your data into two lists (Stat; 1:Edit) in your calculator – one list for each treatment.
MPG with cruise control | MPG without cruise control | ||
26 | 27 | 22 | 19 |
25 | 26 | 23 | 24 |
28 | 24 | 22 | 21 |
24 | 26 | 24 | 22 |
28 | 27 | 21 | 23 |
Two Sample T-test:
Now, perform the t-test:
Stat, over to TESTS
4:2-SampTTest…
Data
List 1: L1
List 2: L2
Freq1: 1
Freq2: 1
µ1:≠ µ2
Pooled: No
Calculate, hit enter.
T-test:
More on a t-test
Shift gears to the other test: Chi-square Goodness of Fit (GOF)
Use this test for data when you are comparing one (observed) data set to predicted (expected) values.
The coin flip example.
1:2:1 ratios of progeny – think Punnett squares!
Evaluating whether organisms are attracted or repelled by something in their environment.
Does this population meet the predictions of the Hardy-Weinberg equilibrium?
Statistical hypotheses are generally the same!
Null: There is no difference between the expected and observed data.
Alternate: There is a difference between the expected and observed data.
Calculating Chi-square
Is this weird looking coin I just found in the hallway a fair coin? Or is it “loaded”?
Null: There is not a significant difference in the number of heads vs. tails flipped on this coin compared to a fair coin.
Alternate: There is a significant…..
Enter the data and do the test:
Flip | Observed | Expected |
Heads | 12 | 15 |
Tails | 18 | 15 |
How to enter data for X2GOF…
Enter your data into two lists:
Observed data in List 1
Expected data in List 2
Then:
STAT, over to TESTS
Down to D:X2GOF-Test
Observed: L1
Expected: L2
df:? (enter the number of options -1)
Calculate and hit enter.
More on Chi-square
So, what is our statistical conclusion?
What is our scientific conclusion?
Consider sample size: add a zero to everything and rerun the test:
The proportions have NOT changed.
What happened to the results?
What does this indicate about the importance sample size? When could patterns in data be overlooked?
Flip | Observed | Expected |
Heads | 120 | 150 |
Tails | 180 | 150 |
Doing Chi-square by hand:
Looks scary, just algebra. And you don’t have to be very precise.
Use the table if you want!
So, now what?
How many degrees of freedom?
What’s the critical value? Assume p=0.05
Categories (outcome) | Expected (e) | Observed (o) | o-e | (o-e)2 | |
Heads | 15 | 12 |
|
|
|
Tails | 15 | 18 |
|
|
|
| | | | Sum: | |
But what does it all mean?!?!
If the Chi-square value is greater than the critical value, p is less than 0.05 and the H0 may be rejected. The expected and observed values are sufficiently different to reject the null hypothesis.
If the Chi-square value is less than the critical value, p is greater than 0.05 and the H0 may not be rejected: the expected and observed values are too similar to reject the null hypothesis.
Why ? Such torture…
Remember, the goal is to answer questions about the natural world.
Quantitative evidence helps us come to justifiable answers to our questions.
Just claiming two experimental conditions yield “different” results is insufficient. How different? Different enough? Statistics allows us to defensibly answer questions.