
Normality Test Picker

Many statistical tests assume your data come from a normal (bell-curve) distribution. Shapiro-Wilk, Anderson-Darling, Lilliefors, and Jarque-Bera each check this differently. Paste your data to pick the right test for your sample size, see the verdict and Q-Q plot, and get a recommendation on parametric vs non-parametric.

New to normality testing? Read the 4-min primer

What it is. A normality test is a formal check of the question “do my data look like they came from a bell-shaped curve?”. The test returns a p-value: a small p means “the shape is suspicious”; a large p means “no strong evidence against normal”. The Q-Q plot is the visual companion - sort your data, plot them against where a normal sample would land, and read the shape.

How to read p. p < 0.05 is the conventional reject line. p = 0.034 means: if the truth were a perfect normal, we'd see data at least this far from normal only 3.4% of the time - so we doubt the truth. p = 0.42 means we have no quarrel with normal. Failing to reject is not proof of normality; with small n, the test is often just under-powered.

Picking the right test. Small to moderate samples (n ≤ 50) → Shapiro-Wilk, the most powerful general test. Care about tail behaviour or outliers? → Anderson-Darling, weighted toward the tails. Want simple intuition? → Jarque-Bera, built from skew and kurtosis. Estimating the mean & SD from the same data? → Lilliefors (a corrected Kolmogorov-Smirnov). The Q-Q plot beats them all when n is small or huge.
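
A minimal R sketch of running all four tests on one vector (the nortest and tseries packages are an assumption of this sketch; only shapiro.test() ships with base R):

    # Assumes the 'nortest' and 'tseries' packages are installed:
    # install.packages(c("nortest", "tseries"))
    library(nortest)    # ad.test(), lillie.test()
    library(tseries)    # jarque.bera.test()

    set.seed(42)
    x <- rnorm(40, mean = 10, sd = 2)   # toy data - paste your own vector here

    shapiro.test(x)       # Shapiro-Wilk: best default for n <= 50
    ad.test(x)            # Anderson-Darling: tail-weighted
    lillie.test(x)        # Lilliefors: KS with estimated mean / SD
    jarque.bera.test(x)   # Jarque-Bera: skewness + kurtosis, asymptotic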

When normality matters. t-tests, ANOVA and regression CIs assume residuals are roughly normal - but they're robust enough that mild deviations don't matter once n > 30 (CLT). Variance / SD CIs and exact tail probabilities are far more sensitive. For n > 5000, every formal test will reject; trust the Q-Q plot, not the p-value.
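
A quick simulation sketch of that last point (t with 20 degrees of freedom is an arbitrary stand-in for "practically normal" data; nortest is assumed because shapiro.test() caps n at 5000):

    set.seed(1)
    x <- rt(100000, df = 20)   # nearly normal: excess kurtosis is only 0.375

    nortest::ad.test(x)        # with n = 100,000 this typically rejects (p << 0.05)
    qqnorm(x); qqline(x)       # ...yet the Q-Q plot is essentially a straight line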

4 tests + Q-Q plot · one tool · Shapiro-Wilk · Anderson-Darling · Lilliefors · Jarque-Bera · Runs in your browser

Try a real-world example - pick one below to load it.

📊 Small sample

Twenty values from a standard normal. Tests should not reject; the Q-Q plot should look like a clean diagonal.

R code (runnable) - reproduce in R:
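
A sketch of the same scenario in R (the seed is arbitrary, so the numbers will not match the tool's preset exactly):

    set.seed(2024)
    x <- rnorm(20)          # twenty draws from a standard normal

    shapiro.test(x)         # expect p well above 0.05: no evidence against normality
    qqnorm(x); qqline(x)    # points should hug the diagonal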

        

Read more: Anatomy of normality testing
Live recap - your inputs plugged in
Pick a scenario or paste data to see the derivation chain.
W = (Σ aᵢ x⁽ᵢ⁾)² / Σ (xᵢ − x̄)² H₀: data is normal ⇒ W ≈ 1
Shapiro-Wilk. Compares the order-statistics-weighted sum to the sample variance. The aᵢ weights are the expected normal order statistics; deviations from a normal shape pull W below 1. Royston AS R94 gives p-values up to n = 5000.
IN: x = your raw data (n ≥ 3). OUT: W in the headline; the p-value drives the REJECT / KEEP verdict pill against your α.
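
A sketch of that verdict logic in R (α = 0.05 is assumed as the default threshold):

    x <- rnorm(40)          # stand-in for your raw data, 3 <= n <= 5000
    alpha <- 0.05
    res <- shapiro.test(x)

    res$statistic           # W: close to 1 under normality
    res$p.value
    if (res$p.value < alpha) "REJECT normality" else "KEEP - no evidence against"
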
A² = −n − (1/n) Σᵢ (2i−1) · [ln Φ(z₍ᵢ₎) + ln(1 − Φ(z₍ₙ₊₁₋ᵢ₎))], with z₍ᵢ₎ the standardized values in ascending order;  A* = A² · (1 + 0.75/n + 2.25/n²)
Anderson-Darling. A weighted KS-style distance, weighted to be more sensitive in the tails. Stephens (1986) gives the small-sample correction A* and a closed-form p-value approximation. Reach for this when outliers / tail behaviour matter.
IN: x = your raw data (n ≥ 5). OUT: A² in the headline; the recap shows the modified A* and a tail-sensitive flag.
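
A sketch of the computation by hand, checked against nortest::ad.test() (the package is an assumption; its reported statistic is the uncorrected A²):

    x <- rnorm(50)                      # example data
    n <- length(x)
    z <- sort((x - mean(x)) / sd(x))    # standardized values, ascending
    i <- seq_len(n)

    A2    <- -n - mean((2 * i - 1) * (log(pnorm(z)) + log(1 - pnorm(rev(z)))))
    Astar <- A2 * (1 + 0.75 / n + 2.25 / n^2)

    c(A2 = A2, Astar = Astar)
    nortest::ad.test(x)$statistic       # should agree with A2 up to rounding
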
D = maxᵢ |Fₙ(xᵢ) − Φ(zᵢ)| where zᵢ = (xᵢ − x̄) / s
Lilliefors. The Kolmogorov-Smirnov statistic, but with critical values corrected for the fact that the mean and SD were estimated from the data themselves. Dallal-Wilkinson (1986) gives an analytic approximation to the p-value.
IN: x = your raw data (n ≥ 4). OUT: D in the headline; p-value compared to your α.
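
A hand-computation sketch (the headline formula shows only the upper gap; implementations such as nortest::lillie.test() take the larger of the two one-sided gaps, as below):

    x <- rnorm(50)                        # example data
    n <- length(x)
    z <- sort((x - mean(x)) / sd(x))      # standardized values, ascending
    p <- pnorm(z)
    i <- seq_len(n)

    Dplus  <- max(i / n - p)              # empirical CDF above Phi
    Dminus <- max(p - (i - 1) / n)        # Phi above empirical CDF
    D      <- max(Dplus, Dminus)

    D
    nortest::lillie.test(x)$statistic     # should match D
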
JB = (n/6) · (S² + (K − 3)² / 4) JB ~ χ²(2) under H₀
Jarque-Bera. Combines sample skewness S and excess kurtosis K−3 into a single χ²(2) statistic. Asymptotic - needs n ≥ 30 to be reliable. Cheap and intuitive, but lower power than Shapiro-Wilk at small n.
IN: x = your raw data (n ≥ 4 in practice; trust at n ≥ 30). OUT: JB + skew + excess kurtosis in the headline / recap.
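
A sketch of JB from first principles, compared with tseries::jarque.bera.test() (the package is an assumption; the two should agree):

    x <- rnorm(200)                      # example data; asymptotic, so prefer n >= 30
    n <- length(x)
    m <- mean(x)
    m2 <- mean((x - m)^2)                # (1/n) central moments, as in the JB formula

    S  <- mean((x - m)^3) / m2^1.5       # sample skewness
    K  <- mean((x - m)^4) / m2^2         # sample kurtosis (3 under normality)
    JB <- n / 6 * (S^2 + (K - 3)^2 / 4)

    c(JB = JB, p = pchisq(JB, df = 2, lower.tail = FALSE))
    tseries::jarque.bera.test(x)         # should agree
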
Q-Q plot: sample order stats vs. Φ⁻¹((i − 0.5)/n) ⇒ straight line if normal
Q-Q plot is the test you should always look at. Points on a straight line ⇒ normal. Curves at the ends ⇒ heavy / light tails. S-shape ⇒ bimodal or skewed. For very large n, formal tests will reject everything; the plot is the only honest verdict.
IN: x = your raw data. OUT: the chart on the right; toggle to histogram + normal-density overlay for a second view.
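
A sketch that builds the plot exactly as the formula above reads, plus base R's built-in shortcut:

    x <- rnorm(100)                               # example data
    n <- length(x)
    theo <- qnorm((seq_len(n) - 0.5) / n)         # Phi^-1((i - 0.5)/n)

    plot(theo, sort(x),
         xlab = "Theoretical normal quantiles",
         ylab = "Sample order statistics")
    abline(lm(sort(x) ~ theo))                    # reference line: straight => normal

    qqnorm(x); qqline(x)                          # base R equivalent
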
Caveats - when this is the wrong tool
If you have… → use instead:
Ordered or categorical data → normality is undefined for ordinal / categorical scales. Use a χ² goodness-of-fit test or non-parametric methods.
n < 8 → tests have almost no power; non-rejection is meaningless. Read the Q-Q plot, then assume nothing.
n > 5000 → formal tests over-reject - they catch tiny, irrelevant deviations. The Q-Q plot is the only honest answer.
Need an equivalence-style "data is close enough to normal" → use a Bayes factor or a TOST-style equivalence test - coming in a later batch.
Multivariate normality → Mardia's test or Henze-Zirkler - out of scope here. Use the MVN R package.
Regression / ANOVA assumption check → run the test on the residuals, not the raw outcome: shapiro.test(residuals(fit)) - see the sketch below.
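
A minimal sketch of that residual check (mtcars and the mpg ~ wt + hp model are purely illustrative):

    fit <- lm(mpg ~ wt + hp, data = mtcars)   # any fitted model
    r <- residuals(fit)

    shapiro.test(r)                           # formal test on the residuals
    qqnorm(r); qqline(r)                      # and the visual check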

Numerical accuracy: Φ(z) accurate to ~7.5 × 10⁻⁸ (Hart's algorithm); Shapiro-Wilk via Royston AS R94; Anderson-Darling via Stephens (1986); Lilliefors p via Dallal-Wilkinson (1986).