What experiment could be designed to test whether the two populations are one species or two?


Table of Contents

When to use a t-test
What type of t-test should I use?
One-sample, two-sample, or paired t-test?
One-tailed or two-tailed t-test?
Performing a t-test
T-test formula
T-test function in statistical software
Interpreting test results
Presenting the results of a t-test

A t-test is a statistical test that is used to compare the means of two groups. It is often used in hypothesis testing to determine whether a process or treatment actually has an effect on the population of interest, or whether two groups are different from one another.

You want to know whether the mean petal length of iris flowers differs according to their species. You find two different species of irises growing in a garden and measure 25 petals of each species. You can test the difference between these two groups using a t-test and null and alternative hypotheses.

The null hypothesis (H0) is that the true difference between these group means is zero.
The alternative hypothesis (Ha) is that the true difference is different from zero.

When to use a t-test

A t-test can only be used when comparing the means of two groups (a.k.a. pairwise comparison). If you want to compare more than two groups, or if you want to do multiple pairwise comparisons, use an ANOVA test or a post-hoc test.

The t-test is a parametric test of difference, meaning that it makes the same assumptions about your data as other parametric tests. The t-test assumes your data:

are independent
are (approximately) normally distributed
have a similar amount of variance within each group being compared (a.k.a. homogeneity of variance)

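If your data do not fit these assumptions, you can try a nonparametric alternative to the t-test, such as the Wilcoxon rank-sum test (also called the Mann-Whitney U test) for two independent groups, or the Wilcoxon signed-rank test for paired data. If only the equal-variance assumption is violated, Welch’s t-test is a common choice.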
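The checks described above can be sketched in R. This is a minimal illustration, not a complete diagnostic workflow. It assumes R's built-in iris data as a stand-in for the article's flower.data (which is not reproduced here), and it uses the Shapiro-Wilk test and an F test as one possible way to check normality and equal variances.

```r
# Assumption: the built-in iris data stands in for the article's flower.data.
flowers <- droplevels(subset(iris, Species %in% c("setosa", "virginica")))

# Normality within each group (Shapiro-Wilk); a small p-value suggests
# the group departs from a normal distribution.
normality_p <- tapply(flowers$Petal.Length, flowers$Species,
                      function(x) shapiro.test(x)$p.value)

# Homogeneity of variance (F test comparing the two group variances).
variance_check <- var.test(Petal.Length ~ Species, data = flowers)

# Nonparametric alternative for two independent groups (Wilcoxon rank-sum).
# exact = FALSE avoids a warning about ties in the measurements.
rank_sum <- wilcox.test(Petal.Length ~ Species, data = flowers, exact = FALSE)

print(normality_p)
print(variance_check$p.value)
print(rank_sum$p.value)
```

Independence cannot be tested this way; it depends on how the data were collected.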

What type of t-test should I use?

When choosing a t-test, you will need to consider two things: whether the groups being compared come from a single population or two different populations, and whether you want to test the difference in a specific direction.

One-sample, two-sample, or paired t-test?

If the groups come from a single population (e.g. measuring before and after an experimental treatment), perform a paired t-test.
If the groups come from two different populations (e.g. two different species, or people from two separate cities), perform a two-sample t-test (a.k.a. independent t-test).
If there is one group being compared against a standard value (e.g. comparing the acidity of a liquid to a neutral pH of 7), perform a one-sample t-test.
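In R, all three variants go through the same t.test() function and differ only in their arguments. The sketch below uses small, hypothetical measurements invented purely for illustration; the variable names are not from the article.

```r
# Hypothetical measurements, invented for illustration only.
ph_samples <- c(6.8, 7.1, 6.9, 7.3, 6.7, 7.0, 6.6, 7.2)
before <- c(12.1, 11.4, 13.0, 12.7, 11.9, 12.3)
after  <- c(11.2, 11.0, 12.1, 12.5, 11.1, 11.8)
city_a <- c(170, 165, 180, 175, 168, 172)
city_b <- c(160, 158, 171, 166, 162, 169)

# One-sample: one group compared against a standard value (mu).
one_sample <- t.test(ph_samples, mu = 7)

# Paired: the same units measured before and after a treatment.
paired <- t.test(before, after, paired = TRUE)

# Two-sample: two independent groups.
two_sample <- t.test(city_a, city_b)

# Each result records which test was run.
print(one_sample$method)
print(paired$method)
print(two_sample$method)
```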

One-tailed or two-tailed t-test?

If you only care whether the two populations are different from one another, perform a two-tailed t-test.
If you want to know whether one population mean is greater than or less than the other, perform a one-tailed t-test.
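In R's t.test(), the choice between one-tailed and two-tailed is made with the alternative argument. The following sketch uses hypothetical values invented for illustration; it shows that when the observed difference lies in the hypothesized direction, the one-tailed p-value is half the two-tailed one.

```r
# Hypothetical measurements, invented for illustration only.
group_a <- c(5.1, 4.9, 5.6, 5.8, 5.3, 5.5)
group_b <- c(4.6, 4.8, 5.0, 4.7, 5.2, 4.5)

# Two-tailed: is there any difference between the group means?
two_tailed <- t.test(group_a, group_b, alternative = "two.sided")

# One-tailed: is the mean of group_a greater than the mean of group_b?
one_tailed <- t.test(group_a, group_b, alternative = "greater")

print(two_tailed$p.value)
print(one_tailed$p.value)
```

The direction of a one-tailed test should be chosen before looking at the data, not after.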

In your test of whether petal length differs by species:

Your observations come from two separate populations (separate species), so you perform a two-sample t-test.
You don’t care about the direction of the difference, only whether there is a difference, so you choose to use a two-tailed t-test.

Performing a t-test

The t-test evaluates the difference between two group means by dividing that difference by the pooled standard error of both groups. You can calculate it manually using a formula, or use statistical analysis software.

T-test formula

The formula for the two-sample t-test (a.k.a. the Student’s t-test) is shown below.

$$t = \frac{\bar{x}_1 - \bar{x}_2}{\sqrt{s^2\left(\frac{1}{n_1} + \frac{1}{n_2}\right)}}$$

In this formula, t is the t-value, x̄1 and x̄2 are the means of the two groups being compared, s² is the pooled variance of the two groups, and n1 and n2 are the number of observations in each of the groups. The denominator as a whole is the pooled standard error of the difference.

A larger absolute t-value shows that the difference between group means is large relative to the pooled standard error, which is stronger evidence of a difference between the groups.

You can compare your calculated t-value against the values in a critical value chart to determine whether your t-value is greater than what would be expected by chance. If so, you can reject the null hypothesis and conclude that the two groups are in fact different.
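The manual calculation can be sketched in R as follows. This assumes the built-in iris data as a stand-in for the article's data and the equal-variance (Student's) form of the test, in which t is the difference in means divided by the square root of the pooled variance times (1/n1 + 1/n2). The critical value comes from qt() rather than a printed chart.

```r
# Assumption: the built-in iris data stands in for the article's flower.data.
flowers <- droplevels(subset(iris, Species %in% c("setosa", "virginica")))
x1 <- flowers$Petal.Length[flowers$Species == "setosa"]
x2 <- flowers$Petal.Length[flowers$Species == "virginica"]
n1 <- length(x1)
n2 <- length(x2)

# Pooled variance: the two group variances weighted by their degrees of freedom.
s2 <- ((n1 - 1) * var(x1) + (n2 - 1) * var(x2)) / (n1 + n2 - 2)

# t-value: difference in means divided by the pooled standard error.
t_manual <- (mean(x1) - mean(x2)) / sqrt(s2 * (1 / n1 + 1 / n2))

# Two-tailed critical value at a significance level of 0.05.
t_crit <- qt(0.975, df = n1 + n2 - 2)

# Built-in equivalent; var.equal = TRUE requests Student's t-test.
t_builtin <- unname(t.test(x1, x2, var.equal = TRUE)$statistic)

print(c(manual = t_manual, builtin = t_builtin, critical = t_crit))
print(abs(t_manual) > t_crit)  # TRUE means the null hypothesis is rejected
```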

T-test function in statistical software

Most statistical software (R, SPSS, etc.) includes a t-test function. This built-in function will take your raw data and calculate the t-value. It will then compare it to the critical value, and calculate a p-value. This way you can quickly see whether your groups are statistically different.

In your comparison of flower petal lengths, you decide to perform your t-test using R. The code looks like this:

t.test(Petal.Length ~ Species, data = flower.data)


Interpreting test results

If you perform the t-test for your flower hypothesis in R, you will receive the following output:

The output provides:

An explanation of what is being compared, called data in the output table.
The t-value: -33.719. Note that it’s negative; this is fine! In most cases, we only care about the absolute value of the difference, or the distance from 0. It doesn’t matter which direction.
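The degrees of freedom: 30.196. Degrees of freedom is related to your sample size, and shows how many ‘free’ data points are available in your test for making comparisons. All else being equal, more degrees of freedom give the test more power to detect a real difference. The value here is not a whole number because R’s t.test() runs Welch’s t-test by default, which does not assume equal variances and adjusts the degrees of freedom accordingly.
The p-value: 2.2e-16 (i.e. 0.00000000000000022, with 15 zeros between the decimal point and the first 2). This is the probability of seeing a t-value at least as extreme as this one if the null hypothesis were true. R prints p-values below this threshold as “< 2.2e-16”.
A statement of the alternative hypothesis (Ha). In this test, the Ha is that the difference is not 0.
The 95% confidence interval. This is a range of plausible values for the true difference in means: if you repeated the study many times, about 95% of the intervals calculated this way would contain the true difference. This can be changed from 95% if you want a larger or smaller interval, but 95% is very commonly used.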
The mean petal length for each group.

From the output table, we can see that the difference in means for our sample data is −4.084 (1.456 − 5.540), and the 95% confidence interval for the true difference in means runs from −4.331 to −3.836. Because this interval does not include 0, the data support a real difference between the two species. Our p-value of 2.2e–16 is much smaller than 0.05, so we can reject the null hypothesis of no difference and say with a high degree of confidence that the true difference in means is not equal to zero.
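The same quantities can also be read directly from the object that t.test() returns, rather than from the printed table. The sketch below assumes the built-in iris data as a stand-in for the article's flower.data, so its numbers will not match the values quoted above.

```r
# Assumption: the built-in iris data stands in for the article's flower.data.
flowers <- droplevels(subset(iris, Species %in% c("setosa", "virginica")))
result <- t.test(Petal.Length ~ Species, data = flowers)

print(result$statistic)  # t-value
print(result$parameter)  # degrees of freedom
print(result$p.value)    # p-value
print(result$conf.int)   # confidence interval for the difference in means
print(result$estimate)   # mean of each group

# Difference in sample means (first group minus second group).
diff_means <- unname(result$estimate[1] - result$estimate[2])

# If the interval excludes 0, the test rejects the null at the matching level.
ci_excludes_zero <- result$conf.int[1] > 0 || result$conf.int[2] < 0
print(diff_means)
print(ci_excludes_zero)
```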

Presenting the results of a t-test

When reporting your t-test results, the most important values to include are the t-value, the p-value, and the degrees of freedom for the test. These will communicate to your audience whether the difference between the two groups is statistically significant (i.e. unlikely to have arisen by chance alone).
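One way to assemble these three values into a single reporting string is sketched below. The layout "t(df) = value, p ..." is one common convention, not a requirement, and the built-in iris data is again an assumed stand-in for the article's flower.data.

```r
# Assumption: the built-in iris data stands in for the article's flower.data.
flowers <- droplevels(subset(iris, Species %in% c("setosa", "virginica")))
result <- t.test(Petal.Length ~ Species, data = flowers)

# format.pval() prints p-values below eps as "<eps" instead of a long decimal.
p_text <- format.pval(result$p.value, digits = 2, eps = 0.001)

# Combine degrees of freedom, t-value, and p-value into one string.
report <- sprintf("t(%.2f) = %.2f, p %s",
                  result$parameter, result$statistic, p_text)
print(report)
```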

You can also include the summary statistics for the groups being compared, namely the mean and standard deviation. In R, the code for calculating the mean and the standard deviation from the data looks like this:

library(dplyr)  # provides %>%, group_by() and summarize()
flower.data %>%
  group_by(Species) %>%
  summarize(mean_length = mean(Petal.Length), sd_length = sd(Petal.Length))