Mixed-Design ANOVA Calculator – Free Online Split-Plot ANOVA Tool | Effect Size & P-Value

Mixed-Design ANOVA Calculator

Free online split-plot (mixed) ANOVA for between-subjects and within-subjects factors. Computes F-statistics for both main effects and the interaction, with Greenhouse–Geisser and Huynh–Feldt sphericity corrections, partial eta squared, and APA-ready output.

Mixed-Design ANOVA Split-Plot Between × Within Sphericity Correction Free Online

📊 1. Enter Your Data

Each row below represents one between-subjects group (e.g., Treatment vs Control). Inside each group, enter the within-subjects measurements (e.g., Time 1, Time 2, Time 3) — one row of comma-separated values per subject, one subject per line. Group names are editable. Click + Add Group to add more between-subjects groups.

Within-Subjects Level Names (comma-separated)

e.g., Pre, Mid, Post • Baseline, Week 4, Week 8 • T1, T2, T3, T4

Within-Subjects Factor Name

e.g., Time, Trial, Condition

Upload CSV or Excel file:

Supports .csv, .txt, .xlsx, .xls — headers detected automatically.

How upload works for Mixed ANOVA: Click columns that should each become a between-subjects cluster (one column per group). Each selected column becomes one group. Within each group, the column's values are split evenly across the within-subjects levels you specify. For correct results, make sure each column has n_subjects × k_within_levels values stacked subject-by-subject (Subject 1 T1, Subject 1 T2, Subject 1 T3, Subject 2 T1, ...).

Choose a sample dataset:

Selecting a dataset pre-fills the Type tab. Click Run Analysis below to compute.

Significance Level (α)

Between-Subjects Factor Name

e.g., Group, Treatment, Condition

📐 5. Formulas & Technical Notes

A. Formulas (full)

Sums of Squares — Three-Way Partition

SS_total = Σ(Yijk − Ȳ...)²

Between-Subjects partition
  SS_A      = b·n · Σ_i (Ȳi.. − Ȳ...)²
  SS_S(A)   = b   · Σ_i Σ_k (Ȳi.k − Ȳi..)²        ← subjects-within-groups (error 1)

Within-Subjects partition
  SS_B      = a·n · Σ_j (Ȳ.j. − Ȳ...)²
  SS_AB     = n   · Σ_i Σ_j (Ȳij. − Ȳi.. − Ȳ.j. + Ȳ...)²
  SS_B×S(A) = SS_total − SS_A − SS_S(A) − SS_B − SS_AB   ← within-subjects error (error 2)

Degrees of Freedom

df_A       = a − 1                       (between groups)
df_S(A)    = a(n − 1)                    (subjects within groups)
df_B       = b − 1                       (within-subjects levels)
df_AB      = (a − 1)(b − 1)              (interaction)
df_B×S(A)  = a(n − 1)(b − 1)             (within-subjects error)

F-Statistics

F_A   = MS_A   / MS_S(A)        →  tests Group main effect
F_B   = MS_B   / MS_B×S(A)      →  tests within-subjects (Time) main effect
F_AB  = MS_AB  / MS_B×S(A)      →  tests Group × Time interaction

Effect Size — Partial Eta Squared

η²_p(A)  = SS_A  / (SS_A  + SS_S(A))
η²_p(B)  = SS_B  / (SS_B  + SS_B×S(A))
η²_p(AB) = SS_AB / (SS_AB + SS_B×S(A))

Benchmarks (Cohen, 1988): 0.01 small · 0.06 medium · 0.14 large

Sphericity Corrections

Greenhouse–Geisser ε̂ = (Σλi)² / [(b−1) · Σλi²]
Huynh–Feldt ε̃        = min(1, n·(b−1)·ε̂ − 2) / [(b−1)·(n−1 − (b−1)·ε̂)]

Corrected df:  df1' = ε·df1   df2' = ε·df2
Where λi are the eigenvalues of the within-subjects covariance matrix.

Where:

a         = number of between-subjects groups
b         = number of within-subjects levels
n         = subjects per group  (assumed balanced)
Yijk      = score for subject k in group i at level j
Ȳ...      = grand mean
Ȳi..      = mean of group i
Ȳ.j.      = mean of within-subjects level j
Ȳij.      = mean of cell (i, j)
Ȳi.k      = mean for subject k in group i across within levels
SS        = sum of squares
MS        = mean square = SS / df
ε         = sphericity epsilon (1 = perfect sphericity)

B. Assumptions & Diagnostic Notes

Independence of observations between groups. Subjects in different between-subjects groups must be independent.
Normality of residuals within each cell. Each combination of group × time should have approximately normally distributed scores. Robust to mild violations when group sizes are equal and n > 15.
Homogeneity of between-subjects variances. Check with Levene's test or compare cell SDs. Largest/smallest SD ratio < 2 is acceptable.
Sphericity (within-subjects). Tested by Mauchly's W when b ≥ 3. When violated, apply Greenhouse–Geisser or Huynh–Feldt corrections (both reported above).
Balanced design preferred. Equal n per group simplifies the SS partition. This tool assumes balanced n.
If assumptions fail: Consider linear mixed-effects models (R lme4) or robust ANOVA (Wilcox WRS2 package).

🎯 6. When to Use a Mixed-Design ANOVA

Decision Checklist & Real-World Examples

This free mixed-design ANOVA tool is designed for studies that combine a between-subjects factor (independent groups) with a within-subjects factor (repeated measures on the same subjects). It is one of the most commonly used designs in clinical trials, education research, sports science, and experimental psychology.

Use this test when ALL of these are true:

You have at least 2 independent groups (between-subjects factor)
Each subject is measured 2 or more times on the same outcome (within-subjects factor)
Your dependent variable is continuous (interval or ratio)
You want to test both main effects and their interaction
Sample is approximately balanced and residuals are roughly normal
Don't use if all measurements are on the same single group → use Repeated-Measures ANOVA
Don't use if groups are measured only once → use One-Way ANOVA
Don't use if data are heavily skewed and n is small → use a nonparametric mixed model or linear mixed-effects model

Real-world examples

Medical research. Three drug groups (Placebo, Drug A, Drug B) measured at Pre, Mid, and Post — does blood pressure change differently across drugs over time?
Education. Traditional vs Flipped classroom measured Q1 → Q4 — do learning trajectories diverge?
Sports science. Control vs HIIT training measured at Week 0, 4, and 8 — does HIIT improve sprint time faster than control?
Psychology. CBT vs Mindfulness vs Waitlist measured Pre, Post, and Follow-up — which therapy produces the largest reduction in anxiety, and does that benefit persist?

Sample size guidance

For 80% power to detect a medium interaction effect (Cohen's f = 0.25) at α = 0.05, you typically need n ≈ 24 per group with 2 groups and 3 time points (≈ 48 total). For small interaction effects (f = 0.10), expect n ≥ 80 per group.

Decision tree

Two or more groups → measured once → One-Way / Two-Way ANOVA
One group → measured repeatedly → Repeated-Measures ANOVA
Multiple groups → measured repeatedly → Mixed-Design ANOVA (this tool)
Non-normal data, complex random effects → Linear Mixed-Effects Model (LMM)

📖 7. How to Use This Mixed-Design ANOVA Calculator

Step-by-Step Walkthrough

Step 1 — Enter your data. Use the Type / Paste tab (default). Each group is one row; within each group, paste comma-separated values per subject, one subject per line. For three time points, each subject row should be exactly 3 numbers (e.g., 52, 48, 55).
Step 2 — Edit group names. Click any group name field to rename it (e.g., from "Group 1" to "Placebo").
Step 3 — Set within-subjects level names. Enter comma-separated labels matching the number of values per row (e.g., Pre, Mid, Post).
Step 4 — Pick a sample dataset (optional). Five built-in datasets pre-fill the form so you can see the tool in action.
Step 5 — Configure alpha. 0.05 is standard. Use 0.01 for more conservative inference.
Step 6 — Run analysis. Click the green button. Nothing runs automatically on page load.
Step 7 — Read summary cards. Three F-tests are shown: Group, Time, and Group × Time. Color = significance.
Step 8 — Inspect the full ANOVA table. Each row gives df, SS, MS, F, p, and partial η².
Step 9 — Check four visualizations. Interaction plot, error-bar chart, individual trajectories, and box plots together give a complete picture.
Step 10 — Export. Click Download Doc for a plain-text report or Download PDF for a print-ready PDF.

❓ 8. Frequently Asked Questions

Q1. What is a mixed-design ANOVA and when should I use it?

A mixed-design ANOVA (also called split-plot ANOVA) is used when your study has at least one between-subjects factor (different groups of people) and at least one within-subjects factor (repeated measures on each person). Use it when you want to test whether a treatment effect changes over time across different groups — e.g., does blood pressure change differently across three drug conditions when measured at pre, mid, and post-treatment.

Q2. What is the difference between mixed ANOVA and repeated-measures ANOVA?

A repeated-measures ANOVA has only within-subjects factors (one group measured multiple times). A mixed-design ANOVA adds a between-subjects factor (separate groups), so you can test main effects of group, time, and their interaction.

Q3. What is the interaction effect in a mixed ANOVA?

The interaction effect (Group × Time) tests whether the within-subjects effect differs across groups. A significant interaction means the pattern of change over time is not the same across groups, which is often the most important finding — it tells you that the treatment is doing something different in different groups.

Q4. What is sphericity and why does it matter?

Sphericity is the assumption that variances of the differences between all pairs of within-subjects conditions are equal. When sphericity is violated, F-tests become liberal (too many false positives). This tool reports Greenhouse–Geisser and Huynh–Feldt corrections automatically when the within-subjects factor has 3 or more levels.

Q5. How do I interpret partial eta squared in a mixed ANOVA?

Partial eta squared (η²_p) is the proportion of variance in the outcome explained by an effect, after partialling out other effects. Cohen's (1988) benchmarks: 0.01 = small, 0.06 = medium, 0.14 = large. A partial η² of 0.25 for an interaction is a large effect.

Q6. What are the assumptions of a mixed-design ANOVA?

Independence of observations between groups, normality of residuals within each cell, homogeneity of variance across between-subjects groups (Levene's test), and sphericity for the within-subjects factor. The test is robust to mild violations when sample sizes are equal across groups.

Q7. How large a sample do I need for a mixed-design ANOVA?

As a rule of thumb, aim for at least 15–20 participants per between-subjects group. For 80% power to detect a medium interaction effect (f = 0.25, α = 0.05) with 2 groups and 3 time points, you typically need n ≈ 24 per group (≈ 48 total).

Q8. What should I do if sphericity is violated?

Apply the Greenhouse–Geisser epsilon (more conservative) or Huynh–Feldt epsilon (less conservative when ε > 0.75) to adjust the degrees of freedom. Both corrections are shown automatically in this tool. As a rule of thumb, use Greenhouse–Geisser when ε ≤ 0.75 and Huynh–Feldt when ε > 0.75.

Q9. How do I report a mixed-design ANOVA in APA 7th edition format?

Report each effect with F(df1, df2) = value, p = value, partial η² = value. Example: "There was a significant Group × Time interaction, F(2, 36) = 6.42, p = .004, η²_p = .26." See Section 3 for five full APA, thesis, plain-language, abstract, and pre-registration templates.

Q10. Can I use this calculator for my published research or thesis?

This tool is designed for educational use, class assignments, and exploratory analysis. For published research, verify results with peer-reviewed software (R afex, SPSS GLM Repeated Measures, JASP). Cite as: STATS UNLOCK. (2025). Mixed-design ANOVA calculator. Retrieved from https://statsunlock.com.

📚 9. References

The following references support the statistical methods used in this mixed-design ANOVA calculator, covering partial eta squared effect sizes, sphericity correction, and best practices in hypothesis testing and APA-format reporting.

Fisher, R. A. (1925). Statistical methods for research workers. Oliver and Boyd. https://psychclassics.yorku.ca/Fisher/Methods/
Greenhouse, S. W., & Geisser, S. (1959). On methods in the analysis of profile data. Psychometrika, 24(2), 95–112. https://doi.org/10.1007/BF02289823
Huynh, H., & Feldt, L. S. (1976). Estimation of the Box correction for degrees of freedom from sample data in randomized block and split-plot designs. Journal of Educational Statistics, 1(1), 69–82. https://doi.org/10.3102/10769986001001069
Mauchly, J. W. (1940). Significance test for sphericity of a normal n-variate distribution. The Annals of Mathematical Statistics, 11(2), 204–209. https://doi.org/10.1214/aoms/1177731915
Cohen, J. (1988). Statistical power analysis for the behavioral sciences (2nd ed.). Lawrence Erlbaum Associates. https://doi.org/10.4324/9780203771587
Field, A. (2018). Discovering statistics using IBM SPSS statistics (5th ed.). SAGE Publications. SAGE link
Maxwell, S. E., & Delaney, H. D. (2004). Designing experiments and analyzing data: A model comparison perspective (2nd ed.). Lawrence Erlbaum Associates. https://doi.org/10.4324/9781410609243
Howell, D. C. (2013). Statistical methods for psychology (8th ed.). Cengage Learning.
Keselman, H. J., Algina, J., & Kowalchuk, R. K. (2001). The analysis of repeated measures designs: A review. British Journal of Mathematical and Statistical Psychology, 54(1), 1–20. https://doi.org/10.1348/000711001159357
Lakens, D. (2013). Calculating and reporting effect sizes to facilitate cumulative science: A practical primer for t-tests and ANOVAs. Frontiers in Psychology, 4, 863. https://doi.org/10.3389/fpsyg.2013.00863
Singmann, H., Bolker, B., Westfall, J., & Aust, F. (2024). afex: Analysis of factorial experiments [R package]. https://cran.r-project.org/package=afex
American Psychological Association. (2020). Publication manual of the American Psychological Association (7th ed.). APA. https://doi.org/10.1037/0000165-000
Bakeman, R. (2005). Recommended effect size statistics for repeated measures designs. Behavior Research Methods, 37(3), 379–384. https://doi.org/10.3758/BF03192707
R Core Team. (2024). R: A language and environment for statistical computing. R Foundation for Statistical Computing. https://www.R-project.org/
NIST/SEMATECH. (2013). e-Handbook of statistical methods. National Institute of Standards and Technology. https://www.itl.nist.gov/div898/handbook/