What if Levene’s Test is “Significant”? (2024)

An assumption required for ANOVA is hom*ogeneity of variances. We often run Levene’s test to check if this holds. But what if it doesn't? This tutorial walks you through.

  • SPSS ANOVA Dialogs I
  • Results I - Levene’s Test “Significant“
  • SPSS ANOVA Dialogs II
  • Results II - Welch and Games-Howell Tests
  • Plan B - Kruskal-Wallis Test

Example Data

All analyses in this tutorial use staff.sav, part of which is shown below. We encourage you to download these data and replicate our analyses.

What if Levene’s Test is “Significant”? (1)

Our data contain some details on a sample of N = 179 employees. The research question for today is:is salary associated with region?We'll try to support this claim by rejecting the null hypothesis that all regions have equal mean population salaries. A likely analysis for this is an ANOVA but this requires a couple of assumptions.

ANOVA Assumptions

An ANOVA requires 3 assumptions:

  1. independent observations;
  2. normality: the dependent variable must follow a normal distribution within each subpopulation.
  3. hom*ogeneity: the variance of the dependent variable must be equal over all subpopulations.

With regard to our data, independent observations seem plausible: each record represents a distinct person and people didn't interact in any way that's likely to affect their answers.

Second, normality is only needed for small sample sizes of, say, N < 25 per subgroup. We'll inspect if our data meet this requirement in a minute.

Last, hom*ogeneity is only needed if sample sizes are sharply unequal. If so, we usually run Levene's test. This procedure tests if 2+ population variances are all likely to be equal.

Quick Data Check

Before running our ANOVA, let's first see if the reported salaries are even plausible. The best way to do so is inspecting a histogram which we'll create by running the syntax below.

*Run basic histogram on salary.

frequencies salary
/format notable
/histogram.

Result

What if Levene’s Test is “Significant”? (2)
  • Note that our histogram reports N = 175 rather than our N = 179 respondents. This implies that salary contains 4 missing values.
  • The frequency distribution, however, looks plausible: there's no clear outliers or other abnormalities that should ring any alarm bells.
  • The distribution shows some positive skewness. However, this makes perfect sense and is no cause for concern.

Let's now proceed to the actual ANOVA.

SPSS ANOVA Dialogs I

After opening our data in SPSS, let's first navigate toAnalyze What if Levene’s Test is “Significant”? (3) General Linear Model What if Levene’s Test is “Significant”? (4) Univariate as shown below.

What if Levene’s Test is “Significant”? (5)

Let's now fill in the dialog that opens as shown below.

What if Levene’s Test is “Significant”? (6)

Completing these steps results in the syntax below. Let's run it.

*ANOVA with descriptive statistics, Levene's test and effect size: (partial) eta squared.

UNIANOVA salary BY region
/METHOD=SSTYPE(3)
/INTERCEPT=INCLUDE
/PRINT ETASQ DESCRIPTIVE hom*oGENEITY
/CRITERIA=ALPHA(.05)
/DESIGN=region.

Results I - Levene’s Test “Significant”

The very first thing we inspect are the sample sizes used for our ANOVA and Levene’s test as shown below.

What if Levene’s Test is “Significant”? (7)
  • First off, note that our Descriptive Statistics table is based on N = 171 respondents (bottom row). This is due to some missing values in both region and salary.
  • Second, sample sizes for “North” and “East” are rather small. We may therefore need the normality assumption. For now, let's just assume it's met.
  • Next, our sample sizes are sharply unequal so we really need to meet the hom*ogeneity of variances assumption.
  • However, Levene’s test is statistically significant because its p < 0.05: we reject its null hypothesis of equal population variances.

The combination of these last 2 points implies thatwe can not interpret or report the F-testshown in the table below.

What if Levene’s Test is “Significant”? (8)

What if Levene’s Test is “Significant”? (9) As discussed, we can't rely on this p-value for the usual F-test.

What if Levene’s Test is “Significant”? (10) However, we can still interpret eta squared (often written as η2). This is a descriptive statistic that neither requires normality nor hom*ogeneity. η2 = 0.046 implies a small to medium effect size for our ANOVA.

Now, if we can't interpret our F-test, then how can we know if our mean salaries differ? Two good alternatives are:

  • running an ANOVA with the Welch statistic or
  • a Kruskal-Wallis test.

Let's start off with the Welch statistic.

SPSS ANOVA Dialogs II

For inspecting the Welch statistic, first navigate toAnalyze What if Levene’s Test is “Significant”? (11) Compare Means What if Levene’s Test is “Significant”? (12) One-Way ANOVA as shown below.

What if Levene’s Test is “Significant”? (13)

Next, we'll fill out the dialogs that open as shown below.

What if Levene’s Test is “Significant”? (14)

This results in the syntax below. Again, let's run it.

*ANOVA with Welch statistic and Games-Howell post hoc tests.

ONEWAY salary BY region
/STATISTICS hom*oGENEITY WELCH
/MISSING ANALYSIS
/POSTHOC=GH ALPHA(0.05).

Results II - Welch and Games-Howell Tests

As shown below, the Welch test rejects the null hypothesis of equal population means.

What if Levene’s Test is “Significant”? (15)

This table is labelled “Robust Tests...” because it's robust to a violation of the hom*ogeneity assumption as indicated by Levene’s test. So we now conclude that mean salaries are not equal over all regions.

But precisely which regions differ with regard to mean salaries? This is answered by inspecting post hoc tests. And if the hom*ogeneity assumption is violated, we usually prefer Games-Howell as shown below.

What if Levene’s Test is “Significant”? (16)

Note that each comparison is shown twice in this table. The only regions whose mean salaries differ “significantly” are North and Top 4 City.

Plan B - Kruskal-Wallis Test

So far, we overlooked one issue: some regions have sample sizes of n = 15 or n = 16. This implies that the normality assumption should be met as well. A terrible idea here is to run

  • a Kolmogorov-Smirnov test or
  • a Shapiro-Wilk test

for each region separately. Neither test rejects the null hypothesis of a normally distributed dependent variable but this is merely due to insufficient sample sizes.

A much better idea is running a Kruskal-Wallis test. You could do so with the syntax below.

*Kruskal-Wallis test from Analyze - Nonparametric Tests - Legacy Dialogs - K Independent Samples.

NPAR TESTS
/K-W=salary BY region(1 5)
/STATISTICS DESCRIPTIVES
/MISSING ANALYSIS.

Result

What if Levene’s Test is “Significant”? (17)

Sadly, our Kruskal-Wallis test doesn't detect any difference between mean salary ranks over regions, H(4) = 6.58, p = 0.16.

In short, our analyses come up with inconclusive outcomes and it's unclear precisely why. If you've any suggestions, please throw us a comment below. Other than that,

Thanks for reading!

What if Levene’s Test is “Significant”? (2024)

FAQs

What if Levene’s Test is “Significant”? ›

Levene test is only a kind of signal whether to go for a parametric or non-parametric test of association. In case it shows a significant P-value it means we need to run a non-parametric test such as Chi-square.

What should I do if Levene's test is significant? ›

When Levene's test is significant, modified procedures are used that do not assume equality of variance. Levene's test may also test a meaningful question in its own right if a researcher is interested in knowing whether population group variances are different.

What happens if Levene's test is not met? ›

The Levene's test uses an F-test to test the null hypothesis that the variance is equal across groups. A p value less than . 05 indicates a violation of the assumption. If a violation occurs, it is likely that conducting the non-parametric equivalent of the analysis is more appropriate.

What does it mean if Levene's test is insignificant? ›

If this test is nonsignificant, that means you have hom*ogeneity of variance between the two groups on the dependent or outcome variable. If Levene's test is significant, this means that the two groups did not show hom*ogeneity of variance on the dependent or outcome variable.

What if Levene's test is significant in Ancova? ›

If the Levene test is positive (P<0.05) then the variances in the groups are different (the groups are not hom*ogeneous), and therefore the assumptions for ANCOVA are not met.

When the Levene test yields AP value of less than 0.05 we report? ›

For this test, a p-value of less than 0.05 indicates that there is, in fact, enough variance in the sample to account for possible mean differences. The p-value reported for Levene's Test for Equality of Variance in the table above is p = 0.000, which is well below the 0.05 threshold.

Is Levene's test sensitive to sample size? ›

Yes, the original Levene test may be influenced by small sample sizes, as large differences in sample sizes can cause the test to reject the null hypothesis even when it is true (as mentioned in the paper).

How to interpret Levene test results? ›

If the p-value for the Levene test is greater than . 05, then the variances are not significantly different from each other (i.e., the hom*ogeneity assumption of the variance is met). If the p-value for the Levene's test is less than . 05, then there is a significant difference between the variances.

Do I need to report Levene's test? ›

In order to provide enough information for readers to fully understand the results when you have run an independent t-test, you should include the result of normality tests, Levene's Equality of Variances test, the two group means and standard deviations, the actual t-test result and the direction of the difference (if ...

What to do if hom*ogeneity of variance is not met? ›

There are two tests that you can run that are applicable when the assumption of hom*ogeneity of variances has been violated: (1) Welch or (2) Brown and Forsythe test. Alternatively, you could run a Kruskal-Wallis H Test. For most situations it has been shown that the Welch test is best.

What is the difference between F test and Levene's test? ›

2 tests are commonly used to check for hom*ogeneity of variance: Fisher's F test and Levene's test. Fisher's F test, which is introduced here, is restricted to comparison of two variances/groups while Levene's test can assess more than two variances/groups.

What is the alternative to the Levene's test? ›

Welch's t-test gives you less work and saves time.

Instead of doing the two steps process of checking the equal variance with Levene's test and then checking the equal mean with t-test, you can directly check if the two data sets have equal means by using the Welch's t-test.

What do you do if Levene's test is significant? ›

If Levene's Test was significant (variances are different), then read the bottom line. If Levene's Test is not significant (variances are equal), then read the top line. In this example, we read the bottom line. Our “t” value is 5.842, with 2124 degrees of freedom.

Why do we need Levene's test? ›

Levene's test ( Levene 1960) is used to test if k samples have equal variances. Equal variances across samples is called hom*ogeneity of variance. Some statistical tests, for example the analysis of variance, assume that variances are equal across groups or samples. The Levene test can be used to verify that assumption.

Does Levene's test require normal distribution? ›

In conclusion, the Levene test for variance is an important tool for experimental design and data analysis. It allows us to determine whether or not the variances of two or more groups are equal and can be used with data that is not normally distributed.

What does a non significant Levene's test tell you? ›

The levene's test is for checking the equality of variances. A non-significant p value of levene's test show that the variences are indeed equal and there is no difference in variances of both groups. However, this has nothing to do with the p-value obtained from applying independent samples test.

What is a violation of the Levene's test? ›

> As the Levene's test for equality of variance is significant, it indicates that the group variances are unequal in the population. > Due to the significance of the Levene's test, the hom*ogeneity of variance is violated, meaning there is a greater probability of rejecting the null hypothesis.

What if the Manova Levene's test is significant? ›

Levene's test should be non-‐significant for all dependent variables if the assumption of hom*ogeneity of variance has been met. We can see here that the assumption has been met (p > . 05 in all cases), which strengthens the case for assuming that the multivariate test statistics are robust.

What would you conclude if Levene's test had a tiny p-value? ›

If the resulting p-value of Levene's test is less than some significance level (typically 0.05), the obtained differences in sample variances are unlikely to have occurred based on random sampling from a population with equal variances.

How to interpret results of Levene's test in R? ›

Interpretation. If the results of the Levene's test are significant, it means that the variances are signicantly different. If the test produces non-significant results, the variances are similar or not significantly different.

What is the null hypothesis of the Levene's test? ›

The null hypothesis for Levene's is that the variances are equal across all samples.

What are the disadvantages of Levene's test? ›

It relies too much on p-values, and therefore, sample sizes. If the sample size is large, Levene's will have a smaller p-value than if the sample size is small, given the same variances.So it's very likely that you're overstating a problem with the assumption in large samples and understating it in small samples.

What does f mean in Levene's test? ›

A Levene's Test for Equality of of Variances: This section has the test results for Levene's Test. From left to right: F is the test statistic of Levene's test. Sig. is the p-value corresponding to this test statistic.

How do I report Levene's test? ›

If Levene's test for equality of variances is significant, report the statistics for the row equal variances not assumed with the altered degrees of freedom rounded to the nearest whole number.

What to do if hom*ogeneity of variance is violated? ›

There are two tests that you can run that are applicable when the assumption of hom*ogeneity of variances has been violated: (1) Welch or (2) Brown and Forsythe test. Alternatively, you could run a Kruskal-Wallis H Test. For most situations it has been shown that the Welch test is best.

What is the purpose of the Levene's test and why is it important for analysis of variance? ›

Levene's test ( Levene 1960) is used to test if k samples have equal variances. Equal variances across samples is called hom*ogeneity of variance. Some statistical tests, for example the analysis of variance, assume that variances are equal across groups or samples. The Levene test can be used to verify that assumption.

When to use equal variances not assumed? ›

Check hom*ogeneity of variances
  1. If your Sig. value is larger than . 05, you should use the first line in the table: Equal Variances assumed.
  2. If your Sig. value is p=. 05 or less, you should use the information in the second line of the t-test table: equal variances not assumed.

Top Articles
Latest Posts
Article information

Author: Nicola Considine CPA

Last Updated:

Views: 6177

Rating: 4.9 / 5 (69 voted)

Reviews: 92% of readers found this page helpful

Author information

Name: Nicola Considine CPA

Birthday: 1993-02-26

Address: 3809 Clinton Inlet, East Aleisha, UT 46318-2392

Phone: +2681424145499

Job: Government Technician

Hobby: Calligraphy, Lego building, Worldbuilding, Shooting, Bird watching, Shopping, Cooking

Introduction: My name is Nicola Considine CPA, I am a determined, witty, powerful, brainy, open, smiling, proud person who loves writing and wants to share my knowledge and understanding with you.