Skip to content
Calcipedia
ANOVA Calculator instructional illustration

ANOVA Calculator

Compare three or more independent groups with a one-way ANOVA calculator that returns the F statistic, p-value, ANOVA table, group means, effect sizes.

Last updated

Hypothesis testing

ANOVA calculator

Compare three or more independent groups with a one-way analysis of variance. Enter raw values for each group and the calculator returns the F statistic, p-value, ANOVA table, group summaries, and effect-size context so you can see whether the group means differ beyond random variation.

Quick examples

Groups

Each group should contain at least two numeric values. Balanced groups are ideal, but unequal sample sizes are allowed.

Significance level

How to read the result

  • Use raw values from independent groups, not percentages or matched pairs.
  • If the p-value is below α, at least one group mean is different.
  • After a significant result, follow up with a post-hoc test such as Tukey HSD to locate the difference.

Result

F = 141.56

The one-way ANOVA suggests that at least one group mean differs from the others (F(2, 12) = 141.56, p < .001). The largest mean is Treatment B and the smallest mean is Control; use the pairwise guide below as a screen, then confirm with a planned post-hoc test such as Tukey HSD or Games-Howell when reporting formal results.

141.56

F statistic

< .001

p-value

0.96

η² effect size

0.95

ω² effect size

Reject H₀

Decision at α = 0.05

1.54

Largest mean gap

1.92

Largest / smallest variance

Balanced

Sample balance

Key ANOVA totals

Between groups SS5.95
Within groups SS0.25
Total SS6.2
df between / within / total2 / 12 / 14
MS between2.97
MS within0.02

ANOVA source table

SourceSSdfMSFp
Between groups5.9522.97141.56< .001
Within groups0.25120.02
Total6.214

Group summary

GroupnMeanSDVariance
Control54.060.110.01
Treatment A54.90.160.02
Treatment B55.60.160.02

Pairwise mean-gap guide

Exploratory pairwise checks using the ANOVA pooled error term and a Bonferroni-adjusted p-value. Use this as a screen before reporting a planned post-hoc test.

PairMean gapSEt guideBonferroni pScreen at 0.05
Control vs Treatment B-1.540.09-16.8< .001Likely gap
Control vs Treatment A-0.840.09-9.17< .001Likely gap
Treatment A vs Treatment B-0.70.09-7.64< .001Likely gap
Reject H₀ at α = 0.05

Assumptions and follow-up

  • Use raw numeric observations from independent groups rather than percentages or repeated measures.
  • Each group should be approximately normal, especially when sample sizes are small.
  • Group variances should be broadly similar; if they are not, Welch's ANOVA is usually the safer alternative.
  • A significant result tells you that at least one group mean differs, not which groups differ.
← All Statistics calculators

Hypothesis Testing

ANOVA calculator: compare three or more group means with an F test

A one-way ANOVA calculator compares the means of three or more independent groups and tells you whether the differences are large enough to matter statistically. It returns the F statistic, p-value, ANOVA table, group summaries, and effect-size context from raw data, which makes it a practical companion when you want a single test instead of several pairwise t-tests.

What one-way ANOVA tests

ANOVA stands for analysis of variance. In a one-way ANOVA, the question is whether the mean of one group is different from the others when you have one independent variable and three or more groups. The null hypothesis says all population means are equal; the alternative says at least one group mean is different.

This is the right tool when you have independent samples and want to compare more than two means at the same time. If you only have two groups, a t-test is the simpler comparison. If your measurements are paired or repeated on the same subjects, a repeated-measures approach is the better fit.

How the F ratio is built

One-way ANOVA separates the total spread in the data into between-group variation and within-group variation. The between-group sum of squares measures how far each group mean sits from the grand mean, while the within-group sum of squares measures how spread out the values are inside each group.

The test statistic is the F ratio: F = MS_between / MS_within. If the group means are very similar compared with the spread inside the groups, the F value stays close to 1. If the group means are much farther apart than the within-group variation, the F value grows and the right-tailed p-value shrinks.

The same decomposition also appears in the ANOVA table, which lists the source of variation, sums of squares, degrees of freedom, and mean squares. That table is what lets you check the arithmetic instead of treating the output as a black box.

SS_between = Σ n_i( x̄_i − x̄ )²

Between-group sum of squares, using each group mean and the grand mean.

SS_within = ΣΣ (x_ij − x̄_i)²

Within-group sum of squares, using each observation around its own group mean.

F = MS_between / MS_within

The ANOVA F ratio compares explained variation to residual variation.

η² = SS_between / SS_total

Eta-squared effect size, which shows how much of the total variation is associated with group membership.

Worked example

Take the default example on the page: three groups labelled Control, Treatment A, and Treatment B. The Control values cluster around 4, Treatment A values cluster around 5, and Treatment B values cluster around 5.5. That separation produces a very large F statistic and a very small p-value, which is exactly what you would expect when the group means are clearly different relative to the variation inside each group.

The helpful part of the result sheet is not just the headline F test. The group-summary table shows the mean and standard deviation for each group, while the ANOVA table shows how much variation sits between groups and how much stays within groups. That makes it easier to explain why the result is significant instead of simply reporting that it is.

Reading the result

If the p-value is below your chosen alpha level, usually 0.05, you reject the null hypothesis and conclude that at least one group mean differs. If the p-value is above alpha, the data do not provide strong evidence against equal means.

A significant ANOVA does not tell you which groups differ. It only tells you that the means are not all the same. If you need pairwise detail, a post-hoc test such as Tukey HSD is the usual follow-up because it controls the family-wise error rate across multiple comparisons.

Effect size adds useful context. Eta-squared tells you the proportion of total variation associated with group membership, while omega-squared gives a slightly more conservative estimate of practical impact. Those values do not replace the p-value, but they help you decide whether the difference looks meaningful as well as statistically detectable.

Using the pairwise mean-gap guide

Many ANOVA calculator searches do not stop at the omnibus F test. People also want to know which groups might be driving the result. The calculator now adds an exploratory pairwise mean-gap guide that sorts every group pair by the size of the mean difference, shows the standard error from the ANOVA pooled error term, and applies a simple Bonferroni p-value screen.

That guide is not a replacement for a planned post-hoc test in a formal report. Treat it as a fast interpretation layer: it points you toward the largest candidate gaps, while the assumptions panel tells you whether Tukey HSD, Games-Howell, Welch's ANOVA, or a non-parametric alternative may be more appropriate before you publish or submit the analysis.

Assumptions and alternatives

A standard one-way ANOVA assumes independent observations, approximately normal group distributions, and broadly similar variances. The test is fairly robust to mild violations, especially with balanced and moderately sized samples, but large variance differences can distort the result.

If the groups have noticeably unequal variances, Welch's ANOVA is often the safer alternative. If the data are strongly non-normal or ordinal, a non-parametric test such as Kruskal-Wallis may be more appropriate. Those alternatives answer a similar comparison question, but they relax different assumptions.

Unequal sample sizes are allowed, but balanced groups are easier to interpret and usually give a more stable estimate of the F ratio. That is why many ANOVA examples keep the groups even when the maths itself does not require it.

Sources and scope

This page is a calculation aid for introductory statistics, teaching, and research planning. It does not run post-hoc tests, and it does not replace software for more advanced factorial or repeated-measures designs.

For a deeper reference on one-way ANOVA calculations and the ANOVA table, see the NIST/SEMATECH handbook and the OpenStax one-way ANOVA chapter linked below.

Further reading

Frequently asked questions

How many groups do I need for ANOVA?

You need at least three groups for a one-way ANOVA, because the point of the test is to compare three or more means at the same time. If you only have two groups, a t-test is simpler and gives the same basic comparison more directly.

What does a significant ANOVA mean?

A significant ANOVA means the F test found enough evidence to say the group means are not all the same. It does not tell you which groups differ from one another; it only tells you that at least one mean is different. A post-hoc test such as Tukey HSD is the usual next step when you need that detail.

Can ANOVA handle unequal sample sizes?

Yes. One-way ANOVA does not require identical sample sizes in each group. Balanced samples are usually easier to interpret and can give a more stable estimate, but unequal group sizes are common in real data and are still valid as long as the other assumptions are reasonable.

What assumptions does one-way ANOVA make?

One-way ANOVA assumes independent observations, approximately normal group distributions, and broadly similar variances across groups. The test is fairly robust to mild assumption violations, but large departures from normality or very uneven variances can make the p-value less trustworthy.

When should I use Welch's ANOVA instead?

Use Welch's ANOVA when the groups have clearly different variances or the sample sizes are unbalanced enough that the equal-variance assumption looks weak. Welch's version is designed to be more robust in that situation and is often a better default when the groups are not nicely matched.

Do I need a post-hoc test after ANOVA?

You need a post-hoc test when the overall ANOVA is significant and you want to know which specific group pairs differ. Common choices are Tukey HSD and Games-Howell, depending on whether the variances are similar. If the overall ANOVA is not significant, post-hoc testing usually does not add much.

Is the pairwise table the same as Tukey HSD?

No. The pairwise table is an exploratory screening guide based on the ANOVA pooled error term and a Bonferroni-adjusted p-value. Tukey HSD and Games-Howell are formal post-hoc procedures with their own assumptions. Use the table to see which gaps deserve attention, then use the appropriate planned post-hoc method when you need report-ready pairwise conclusions.

Can I use ANOVA for repeated measures or paired data?

Not with this calculator. One-way ANOVA assumes independent groups, so repeated measurements on the same subjects or matched pairs need a repeated-measures or paired analysis instead. Treating linked observations as independent would understate the true error and can distort the p-value.

Also in Statistics

Related

More from nearby categories

These related calculators come from the same leaf category, nearby sibling categories, or the same top-level topic.