What are the assumptions of a chi-square test?
The core assumptions are independence of observations, mutually exclusive categories, and expected counts that are not too small. In practice, the usual rule of thumb is that expected counts should generally be at least 5 in most cells, with cells below 1 being a stronger warning that the chi-square approximation may be unreliable. The test also assumes you are working with raw counts rather than percentages or rates.
What is the difference between a goodness-of-fit test and a test of independence?
A goodness-of-fit test compares one observed categorical distribution with an expected distribution you provide. A test of independence starts from a contingency table and asks whether two categorical variables are related. The formula for the chi-square statistic is the same, but the null hypothesis and the way expected counts are created are different.
Can I use percentages instead of counts in a chi-square calculator?
Not directly. Chi-square tests are based on observed and expected frequencies, so the safest input is raw counts. If you only have percentages, convert them back to counts using the sample size first. Feeding percentages straight into the formula can distort both the chi-square statistic and the effect-size interpretation.
What if some expected counts are below 5?
Small expected counts make the usual chi-square p-value less reliable. If only a few cells dip below 5, the result can still be a reasonable approximation, but you should treat it with caution. For sparse 2×2 tables, Fisher's exact test is often the preferred follow-up. In larger tables, you may need to combine sparse categories or collect more data before making a firm conclusion.
What do the degrees of freedom mean in a chi-square test?
Degrees of freedom determine which chi-square distribution should be used to convert the test statistic into a p-value. For goodness-of-fit, df usually equals the number of categories minus one. For independence tables, df equals (rows - 1) × (columns - 1). More degrees of freedom generally mean a broader chi-square reference distribution.
What does a non-significant chi-square result mean?
A non-significant result means the observed differences are not large enough, relative to the sample size and expected variation, to reject the null hypothesis at your chosen alpha level. It does not prove that the categories are identical or that the variables are independent. The sample may simply be too small, too noisy, or too sparse to detect a real effect.
What is Cramér's V or phi in the result sheet?
Those are effect-size measures that describe how strong the departure from independence appears to be. For 2×2 tables the common effect-size measure is phi; for larger contingency tables it is Cramér's V. They help answer a different question from the p-value: not just whether the association is statistically detectable, but whether it looks negligible, small, medium, or large in magnitude.
Is the chi-square test always right-tailed?
Yes. The chi-square statistic is built from squared deviations, so it cannot be negative. Larger chi-square values indicate greater departure from the null hypothesis, which means evidence always accumulates in the right tail of the chi-square distribution.
Should I use Yates correction for a 2×2 table?
Yates correction is a continuity adjustment sometimes used for 2×2 chi-square tests, especially with small samples. It makes the test more conservative by reducing the chi-square statistic. Some analysts prefer it, while others move directly to Fisher's exact test for sparse 2×2 tables. This calculator reports the standard uncorrected chi-square result, so use judgment when counts are small.
Which significance level should I choose for a chi-square test?
Use the alpha level that was chosen before the analysis. Many introductory examples use α = 0.05, but α = 0.01 is more conservative and α = 0.10 is sometimes used for exploratory screening. The calculator lets you compare common thresholds, but the interpretation should come from your study plan, course instructions, or reporting standard rather than from trying several cutoffs until one looks significant.
How do I report a chi-square result in words?
Report the test type, degrees of freedom, sample size, chi-square statistic, p-value, and effect size. A compact sentence might say, χ²(1, N = 70) = 23.33, p < .001, phi = 0.58, followed by a short explanation of which expected-versus-observed cells contributed most. If expected counts are sparse, mention that limitation and consider Fisher's exact test for a 2×2 table.
When is Fisher's exact test better than chi-square?
Fisher's exact test is usually preferred for 2×2 tables when expected counts are small, because it does not rely on the same large-sample approximation. Chi-square is fast and useful for moderate-to-large samples, but Fisher's exact test is the safer option when one or more expected cells fall below 5 and the table is small enough for an exact calculation.
Can a chi-square test prove causation?
No. A chi-square test can show that observed counts differ from expectation or that two categorical variables are associated more than chance alone would suggest. It does not prove a causal relationship. Study design, bias control, confounding, measurement quality, and subject-matter knowledge still matter.