Statistics Problem Calculator
What are the mean, median, mode, and standard deviation of your data?
Paste or type your dataset and get every core descriptive statistic in one pass: mean, median, mode, standard deviation, variance, and range. Results update live as you edit. Useful for homework, lab reports, quality checks, and quick data sanity checks.
—
Send feedback
💡 Share your idea or report a problem
✓ Thanks! We'll take a look.
Learn more
How It Works
The formula, explained simply
Think of descriptive statistics as a short biography of a dataset. The mean tells you where the center of gravity is — if you had to balance the entire number line on a single point using your data as weights, the mean is where you place your finger. The median tells you a different story: it finds the physical middle of the sorted list, immune to the weight of extreme values. When those two numbers diverge significantly, it signals that outliers exist and are pulling the average away from the typical case.
Standard deviation measures how spread out the values are around the mean. A low standard deviation means the values cluster tightly — you can predict new observations with confidence. A high standard deviation means the data scatters widely and the mean alone gives an incomplete picture. Variance is simply the standard deviation squared; it is the raw unit of spread before taking the square root makes it human-readable in the original units.
Range gives you the bluntest possible summary: the distance from the smallest to the largest value. It is sensitive to a single outlier at either extreme and tells you nothing about what happens in between. Mode counts which value shows up most often — useful in categorical or discrete data but often meaningless in continuous measurements where every value is unique. Together, these six statistics give you a fast, complete orientation to any unfamiliar dataset.
When To Use This
Right tool, right situation
Use this tool when you need a fast, reliable summary of a dataset before analysis, before presenting results, or before checking your manual work. It is appropriate for homework problems, quick lab data checks, comparing two small datasets side by side, and sanity-checking output from larger software. The results are mathematically exact given the inputs — there is no estimation or modeling involved.
This tool is not appropriate when you need inferential statistics — hypothesis tests, confidence intervals, p-values, or regression coefficients. Those require assumptions about distribution shape and involve probability calculations beyond descriptive summaries. If you are trying to answer whether two groups differ in a statistically meaningful way, you need a t-test or ANOVA, not just the mean and standard deviation of each group.
Also stop trusting these results if your dataset has a structure the tool cannot see: time-series ordering, grouped categories, or hierarchical nesting. A mean calculated across all rows of a panel dataset ignores that some rows belong to the same subject measured repeatedly — that requires a different approach entirely.
Common Mistakes
Why results sometimes look wrong
The most common mistake is using population standard deviation on a sample — and it happens because many calculators label the population formula as simply SD without flagging the distinction. The result looks plausible but is systematically too small, which makes your data appear less variable than it actually is. In a sample of 10 values, the error is about 5%; in a sample of 5, it is closer to 10%.
A second mistake is treating the mean as the right central-tendency statistic for skewed data. If your dataset has a few very large or very small outliers, the mean shifts toward them while the median stays put. Reporting mean salary at a company where one executive earns ten times everyone else misleads the reader about what a typical employee earns. Always compare mean and median — a gap of more than 10-15% of the mean is a signal to investigate the distribution further.
A third mistake specific to this tool: pasting data that includes a header row or row labels. If your spreadsheet column starts with the word Score or Date, that token is silently dropped during parsing, which is correct — but you should verify the reported count matches how many values you expected to include. A mismatch means something was filtered out.
The Math
Worked examples and deeper derivation
Mean: sum all values, divide by count. If your values are x1 through xn, mean equals (x1 + x2 + ... + xn) divided by n. Median: sort the values ascending. If count is odd, take the middle value. If count is even, average the two middle values. Mode: tally the frequency of each unique value and report the value with the highest count.
Sample standard deviation uses the formula: SD equals the square root of the sum of squared differences from the mean, divided by n-1. The n-1 denominator (Bessel's correction) compensates for the tendency of a sample to underestimate how spread out the full population actually is. Population standard deviation uses n in the denominator instead, giving a smaller result. Variance is whichever of those squared results before taking the square root.
Range is simply max minus min — no weighting, no averaging. It is the fastest spread measure to compute mentally and the fastest to mislead you if your dataset has even one rogue outlier. For datasets with extreme values, interquartile range (IQR) — the span from the 25th to the 75th percentile — is more informative, though that requires sorting the dataset into quartiles beyond what this tool reports.
Expert Unlock
The thing most explanations skip
Sample standard deviation with Bessel's correction (n-1) is an unbiased estimator of population variance, but it is not an unbiased estimator of population standard deviation — because the square root is a concave function, Jensen's inequality guarantees that the expected value of the square root is less than the square root of the expected value. In practice this bias is negligible for n above 30, but for small samples (n under 10) the corrected SD still slightly underestimates the true population SD. Some fields use correction factors to compensate, but the tool follows the standard academic and industry convention of n-1 as-is.
Why does my standard deviation not match what my textbook says?
Need something this doesn't cover?
Suggest a tool — we'll build it →