Exploratory Factor Analysis (EFA) Calculator
Identify latent factors behind your variables. Computes the correlation matrix, KMO, Bartlett's test of sphericity, eigenvalues, factor loadings, varimax rotation, and communalities — with publication-ready output.
📥 Input Your Data
Enter at least 3 variables (clusters) measured on the same set of cases. Each variable should be a continuous score from N respondents/observations.
Sample 1 (Big-5 Personality) is pre-loaded.
Format — comma-separated numbers (default), e.g. 52, 48, 55, 61, 47, …. The variable name on the left is editable.
Enter values in the cells below. Each column = one variable.
👀 Data Preview
Live summary of the variables you've entered.
Exploratory Factor Analysis discovers a small number of latent factors that explain the correlations among many observed variables. Use it to develop scales, validate questionnaires, or reduce many measurements into a handful of interpretable constructs.
📊 Sample & Factorability
Before extracting factors, EFA requires that the data are factorable: enough common variance among the variables, an adequate sample size, and a non-identity correlation matrix.
📐 Eigenvalues & Variance Explained
Each eigenvalue represents how much variance one factor accounts for. The cumulative percentage tells you how much of the total information is captured by the retained factors.
🎯 Rotated Factor Loadings
Each value is the correlation between a variable and a factor after rotation. Loadings ≥ |0.40| are considered salient (highlighted). Each variable usually loads strongly on a single factor when rotation succeeds.
🔗 Communalities (h²)
Communality is the proportion of a variable's variance accounted for by all extracted factors. Values above 0.50 are good; below 0.30 suggest the variable does not fit the factor solution well.
📈 Visualizations (4 Plots)
Four colourful plots help you read the EFA solution at a glance.
1. Scree Plot
2. Variance Explained
3. Factor Loadings Heatmap
4. Communalities (h²)
🧠 Interpretation Results — Detailed
A long-form, plain-language interpretation with everything a researcher needs to draw conclusions and report findings responsibly.
✍️ How to Write Your Results in Research
Five ready-to-use reporting templates. Click 📋 Copy to paste straight into your paper or thesis.
🏁 Conclusion
A complete, paper-ready conclusion that wraps the analysis, the practical meaning, and the next steps.
✅ Assumption Checks
EFA assumes specific data conditions. Each is verified below with a Pass / Warning / Fail badge.
📖 How to Use This Tool — Step-by-Step Guide
Step 1 — Enter Your Data. Use the Type tab to paste comma-separated values (e.g. 52, 48, 55, 61, …) for each variable, or use the Upload tab to load a CSV/Excel file and tick the columns that should become clusters, or use the Manual Entry grid for small datasets.
Step 2 — Choose a Sample Dataset. Five samples are available — Big-5 Personality, Job Satisfaction, Camera Trap Habitat Variables, Customer Experience, and Plant Diversity Indices. Sample 1 is pre-loaded so you can run the tool immediately.
Step 3 — Configure Settings. Choose the number of factors (Auto = Kaiser's rule), the rotation method (Varimax recommended for orthogonal factors), the extraction method (Principal Axis Factoring is the default for true EFA), and the loading cutoff used to mark "salient" loadings.
Step 4 — Run the Analysis. Click the green "Run Exploratory Factor Analysis" button. The KMO, Bartlett's test, eigenvalues, rotated loadings, communalities, and four plots appear instantly.
Step 5 — Read the Summary Cards. KMO, Bartlett's χ², number of factors retained, and total variance explained are shown at a glance with colour cues.
Step 6 — Read the Tables. Sample & Factorability → Eigenvalues → Rotated Loadings → Communalities. Use the loading cutoff to identify which variables define each factor.
Step 7 — Examine the Four Plots. Scree plot for retention, Variance bar chart for cumulative %, Heatmap to visually scan the loading pattern, and Communalities bar to check fit per variable.
Step 8 — Check Assumptions. Sample size adequacy, factorability, and absence of pathological multicollinearity are reported with badges.
Step 9 — Read the Detailed Interpretation. Eight in-depth paragraphs cover what was found, how to label factors, what the loadings mean, practical vs statistical importance, and limitations.
Step 10 — Export. Use Download Doc for a plain-text report, Download PDF for a print-ready PDF, or Copy Summary for a one-line takeaway you can paste into chat or email.
❓ Frequently Asked Questions
Q1. What is exploratory factor analysis (EFA) and when should I use it?
EFA is a multivariate technique that uncovers the latent factors generating the correlations among observed variables. Use it when you do not yet have a confirmed structure for a measurement instrument — for example, when developing a personality scale, a customer-satisfaction questionnaire, or grouping ecological habitat variables into broader constructs. EFA tells you how many underlying factors exist and which variables belong to each.
Q2. How is EFA different from PCA?
Principal Component Analysis (PCA) explains total variance using uncorrelated linear combinations and is purely a data-reduction tool. EFA explains shared (common) variance and assumes latent factors generate the correlations. PCA is appropriate when the goal is dimensionality reduction; EFA is appropriate when you want to identify or interpret underlying constructs.
Q3. What is the KMO measure and how do I interpret it?
The Kaiser–Meyer–Olkin (KMO) statistic compares squared partial correlations to squared correlations. It ranges from 0 to 1. Kaiser's interpretive guide: ≥ .90 marvelous, .80–.89 meritorious, .70–.79 middling, .60–.69 mediocre, .50–.59 miserable, < .50 unacceptable. A KMO below .60 suggests the data are not factorable and EFA should not proceed.
Q4. What is Bartlett's test of sphericity?
Bartlett's test checks whether the observed correlation matrix is significantly different from an identity matrix (a matrix in which all variables are uncorrelated). A significant result (p < .05) means there are meaningful correlations among variables, and factor analysis is justified. A non-significant result is a red flag — there is nothing to factor.
Q5. What does the scree plot show and how do I read it?
The scree plot displays eigenvalues in descending order. The "elbow" — the point where the curve flattens — is often used to choose the number of factors. Combined with Kaiser's rule (retain factors with eigenvalue > 1), it provides a practical guide. Modern best practice also uses parallel analysis as a more accurate criterion.
Q6. What is varimax rotation and why is it used?
Varimax is an orthogonal rotation that maximizes the variance of squared loadings within each factor, producing a "simple structure" where each variable loads strongly on one factor and weakly on the others. This dramatically improves interpretability while keeping factors uncorrelated. If you expect factors to be correlated (e.g., aspects of psychological wellbeing), use an oblique rotation such as Promax instead.
Q7. How do I interpret a factor loading?
A factor loading is the correlation between a variable and a factor. Conventional benchmarks: |.32| poor, |.45| fair, |.55| good, |.63| very good, |.71| excellent (Comrey & Lee, 1992). In practice, |λ| ≥ .40 is the most common threshold for declaring a loading "salient". Negative loadings indicate the variable is inversely related to the factor.
Q8. What is communality and why does it matter?
Communality (h²) is the proportion of a variable's variance explained by the retained factors together. It is the sum of squared loadings across factors. Values above .50 indicate the variable is well represented by the factor solution; values below .30 suggest the variable is largely unique and may need to be dropped.
Q9. How many factors should I retain?
Use multiple criteria together: Kaiser's rule (eigenvalue > 1), the scree plot elbow, parallel analysis, theoretical interpretability, and total variance explained (typically ≥ 60% for social-science data). When criteria disagree, prefer the most interpretable solution. Always report the rationale for the retained number of factors.
Q10. What sample size do I need for EFA?
Common rules: at least 5–10 cases per variable, with a minimum of 100–200 cases. Higher communalities (≥ .60) and over-determined factors (≥ 4 strong loadings per factor) allow smaller samples (n ≈ 100). Low communalities or weakly defined factors require n ≥ 300. Cases-to-variables ratios below 5:1 should be avoided.
📚 References
All references for this exploratory factor analysis calculator, in APA 7th-edition format with DOIs/URLs.
- Bartlett, M. S. (1950). Tests of significance in factor analysis. British Journal of Statistical Psychology, 3(2), 77–85. https://doi.org/10.1111/j.2044-8317.1950.tb00285.x
- Cattell, R. B. (1966). The scree test for the number of factors. Multivariate Behavioral Research, 1(2), 245–276. https://doi.org/10.1207/s15327906mbr0102_10
- Comrey, A. L., & Lee, H. B. (1992). A first course in factor analysis (2nd ed.). Psychology Press. https://doi.org/10.4324/9781315827506
- Costello, A. B., & Osborne, J. W. (2005). Best practices in exploratory factor analysis: Four recommendations for getting the most from your analysis. Practical Assessment, Research, and Evaluation, 10(7), 1–9. https://doi.org/10.7275/jyj1-4868
- Fabrigar, L. R., Wegener, D. T., MacCallum, R. C., & Strahan, E. J. (1999). Evaluating the use of exploratory factor analysis in psychological research. Psychological Methods, 4(3), 272–299. https://doi.org/10.1037/1082-989X.4.3.272
- Field, A. (2018). Discovering statistics using IBM SPSS statistics (5th ed.). SAGE. https://us.sagepub.com/en-us/nam/discovering-statistics-using-ibm-spss-statistics
- Hair, J. F., Black, W. C., Babin, B. J., & Anderson, R. E. (2019). Multivariate data analysis (8th ed.). Cengage. https://www.cengage.com/c/multivariate-data-analysis-8e-hair
- Horn, J. L. (1965). A rationale and test for the number of factors in factor analysis. Psychometrika, 30(2), 179–185. https://doi.org/10.1007/BF02289447
- Kaiser, H. F. (1958). The varimax criterion for analytic rotation in factor analysis. Psychometrika, 23(3), 187–200. https://doi.org/10.1007/BF02289233
- Kaiser, H. F. (1974). An index of factorial simplicity. Psychometrika, 39(1), 31–36. https://doi.org/10.1007/BF02291575
- MacCallum, R. C., Widaman, K. F., Zhang, S., & Hong, S. (1999). Sample size in factor analysis. Psychological Methods, 4(1), 84–99. https://doi.org/10.1037/1082-989X.4.1.84
- Revelle, W. (2024). psych: Procedures for psychological, psychometric, and personality research [R package]. Northwestern University. https://CRAN.R-project.org/package=psych
- Tabachnick, B. G., & Fidell, L. S. (2019). Using multivariate statistics (7th ed.). Pearson. https://www.pearson.com/en-us/subject-catalog/p/using-multivariate-statistics
- Thompson, B. (2004). Exploratory and confirmatory factor analysis: Understanding concepts and applications. American Psychological Association. https://doi.org/10.1037/10694-000
- Watkins, M. W. (2018). Exploratory factor analysis: A guide to best practice. Journal of Black Psychology, 44(3), 219–246. https://doi.org/10.1177/0095798418771807









