P-Value Calculator

Z & T Tests Chi-Square & F One & Two-Tailed Free ยท No Signup

Free online p-value calculator for Z-test, T-test, Chi-square, and F-test. One-tailed and two-tailed tests with significance classification, interactive distribution curve, and Python scipy export.

P-Value Calculator
Standard normal test statistic

Result

๐Ÿ“Š

Enter parameters and click Calculate

Compute p-values for Z, T, Chi-square, or F test statistics.

Distribution Visualization

Python Compiler

What Is a P-Value?

A p-value is the probability of observing a test statistic at least as extreme as the one computed from sample data, assuming the null hypothesis (Hโ‚€) is true. It is a fundamental concept in hypothesis testing and helps researchers decide whether to reject Hโ‚€.

Fail to reject Hโ‚€ p/2 p/2 โˆ’z* +z* 0
๐Ÿ“

Probability Measure

The p-value quantifies how likely observed data (or more extreme) would occur if Hโ‚€ were true. Ranges from 0 to 1.

โš–๏ธ

Evidence Against Hโ‚€

A smaller p-value provides stronger evidence against the null hypothesis. It does NOT measure the probability that Hโ‚€ is true.

๐ŸŽฏ

Decision Tool

Compare p-value to significance level ฮฑ. If p โ‰ค ฮฑ, reject Hโ‚€. Common thresholds: 0.05, 0.01, 0.10.

P-Value Interpretation Guide

The significance of a p-value depends on the chosen ฮฑ level. Below are commonly used thresholds and their interpretations.

p < 0.001

Highly Significant โ€” Very strong evidence against the null hypothesis. Often denoted with *** in research papers.

p < 0.05

Significant โ€” Sufficient evidence to reject Hโ‚€ at the standard 5% level. The most common threshold in science.

p < 0.10

Marginal โ€” Weak evidence against Hโ‚€. Sometimes called โ€œtrending toward significance.โ€ Use with caution.

p โ‰ฅ 0.10

Not Significant โ€” Insufficient evidence to reject Hโ‚€. Does not prove Hโ‚€ is true, only that data is consistent with it.

Test Distributions

Z-Test:  p = P(Z โ‰ฅ |z|) for standard normal. Used when ฯƒ is known or n is large (โ‰ฅ30).
T-Test:  p = P(T โ‰ฅ |t|) with df = n โˆ’ 1. Used when ฯƒ is unknown and sample is small.
Chi-Square:  p = P(ฯ‡ยฒ โ‰ฅ xยฒ) with df = k โˆ’ 1. Used for goodness-of-fit and independence tests.
F-Test:  p = P(F โ‰ฅ f) with dfโ‚ and dfโ‚‚. Used for ANOVA and comparing two variances.
DistributionParametersCommon Use
Z (Normal)None (standard)Large samples, known ฯƒ, proportions
t (Studentโ€™s)df = n โˆ’ 1Small samples, unknown ฯƒ
ฯ‡ยฒ (Chi-square)df = k โˆ’ 1Categorical data, goodness-of-fit
F (Fisher)dfโ‚, dfโ‚‚ANOVA, variance comparison

One-Tailed vs Two-Tailed Tests

One-Tailed (right) ฮฑ
Two-Tailed ฮฑ/2 ฮฑ/2

One-Tailed (Directional)

Tests for an effect in a specific direction (e.g., mean is greater than or less than a value). All ฮฑ is concentrated in one tail, giving more power for that direction.

Two-Tailed (Non-directional)

Tests for any difference in either direction. The ฮฑ is split between both tails (ฮฑ/2 each). More conservative but appropriate when the direction is not pre-specified.

Important: The choice between one-tailed and two-tailed tests must be made before looking at the data. Choosing after seeing results inflates the false positive rate.

Common Misconceptions About P-Values

โŒ โ€œP is the probability Hโ‚€ is trueโ€

Incorrect. The p-value is the probability of the data (or more extreme) given Hโ‚€ is true. It is P(data | Hโ‚€), not P(Hโ‚€ | data). Bayesian methods are needed for the latter.

โŒ โ€œ1 โˆ’ p = probability Hโ‚ is trueโ€

Incorrect. A p-value of 0.03 does not mean there is a 97% chance the alternative is true. The p-value tells you about the data under Hโ‚€, not the probability of any hypothesis.

โŒ โ€œNot significant = no effectโ€

Incorrect. Failure to reject Hโ‚€ does not prove Hโ‚€. It may simply mean the sample size was too small to detect the effect. โ€œAbsence of evidence is not evidence of absence.โ€

โœ… Correct Interpretation

Correct: โ€œAssuming Hโ‚€ is true, the probability of observing a test statistic as extreme as or more extreme than the one observed is p.โ€ Always interpret in context.

Frequently Asked Questions

A p-value is the probability of observing results at least as extreme as the data, assuming the null hypothesis is true. A small p-value (e.g., less than 0.05) provides evidence against the null hypothesis. It is not the probability that the null hypothesis is true.
Use a one-tailed test when you have a directional hypothesis (e.g., the mean is greater than a value). Use two-tailed when you test for any difference in either direction. The choice must be made before looking at the data.
Use Z for large samples with known ฯƒ. Use t for small samples with unknown ฯƒ. Use chi-square for goodness-of-fit and independence tests. Use F for comparing variances or ANOVA.
The most common is ฮฑ = 0.05 (95% confidence). For high-stakes decisions, use 0.01. For exploratory research, 0.10 is sometimes used. The threshold should be chosen before conducting the test.
No. Statistical significance does not equal practical significance. A large sample can produce a tiny p-value for a trivially small effect. Always report effect sizes alongside p-values to assess real-world importance.
Common mistakes include thinking p is the probability Hโ‚€ is true, or that 1 โˆ’ p is the probability Hโ‚ is true, or that a non-significant result proves no effect exists. The p-value only measures evidence against Hโ‚€ under the assumption Hโ‚€ is true.

Support This Free Tool

Every coffee helps keep the servers running. Every book sale funds the next tool I'm dreaming up. You're not just supporting a site โ€” you're helping me build what developers actually need.

500K+ users
200+ tools
100% private
Privacy Guarantee: Private keys you enter or generate are never stored on our servers. All tools are served over HTTPS.