Statistics Problem Calculator

What are the mean, median, mode, and standard deviation of your data?

Paste or type your dataset and get every core descriptive statistic in one pass: mean, median, mode, standard deviation, variance, and range. Results update live as you edit. Useful for homework, lab reports, quality checks, and quick data sanity checks.

Updated July 2026 · How this works

Your Data Values

Treat as Population (not Sample)

—

See a way to make this better?

Worth knowing

Learn more

How It Works

The formula, explained simply

Think of descriptive statistics as a short biography of a dataset. The mean tells you where the center of gravity is — if you had to balance the entire number line on a single point using your data as weights, the mean is where you place your finger. The median tells you a different story: it finds the physical middle of the sorted list, immune to the weight of extreme values. When those two numbers diverge significantly, it signals that outliers exist and are pulling the average away from the typical case.

Standard deviation measures how spread out the values are around the mean. A low standard deviation means the values cluster tightly — you can predict new observations with confidence. A high standard deviation means the data scatters widely and the mean alone gives an incomplete picture. Variance is simply the standard deviation squared; it is the raw unit of spread before taking the square root makes it human-readable in the original units.

Range gives you the bluntest possible summary: the distance from the smallest to the largest value. It is sensitive to a single outlier at either extreme and tells you nothing about what happens in between. Mode counts which value shows up most often — useful in categorical or discrete data but often meaningless in continuous measurements where every value is unique. Together, these six statistics give you a fast, complete orientation to any unfamiliar dataset.

When To Use This

Right tool, right situation

Use this tool when you need a fast, reliable summary of a dataset before analysis, before presenting results, or before checking your manual work. It is appropriate for homework problems, quick lab data checks, comparing two small datasets side by side, and sanity-checking output from larger software. The results are mathematically exact given the inputs — there is no estimation or modeling involved.

This tool is not appropriate when you need inferential statistics — hypothesis tests, confidence intervals, p-values, or regression coefficients. Those require assumptions about distribution shape and involve probability calculations beyond descriptive summaries. If you are trying to answer whether two groups differ in a statistically meaningful way, you need a t-test or ANOVA, not just the mean and standard deviation of each group.

Also stop trusting these results if your dataset has a structure the tool cannot see: time-series ordering, grouped categories, or hierarchical nesting. A mean calculated across all rows of a panel dataset ignores that some rows belong to the same subject measured repeatedly — that requires a different approach entirely.

Common Mistakes

Why results sometimes look wrong

The most common mistake is using population standard deviation on a sample — and it happens because many calculators label the population formula as simply SD without flagging the distinction. The result looks plausible but is systematically too small, which makes your data appear less variable than it actually is. In a sample of 10 values, the error is about 5%; in a sample of 5, it is closer to 10%.

A second mistake is treating the mean as the right central-tendency statistic for skewed data. If your dataset has a few very large or very small outliers, the mean shifts toward them while the median stays put. Reporting mean salary at a company where one executive earns ten times everyone else misleads the reader about what a typical employee earns. Always compare mean and median — a gap of more than 10-15% of the mean is a signal to investigate the distribution further.

A third mistake specific to this tool: pasting data that includes a header row or row labels. If your spreadsheet column starts with the word Score or Date, that token is silently dropped during parsing, which is correct — but you should verify the reported count matches how many values you expected to include. A mismatch means something was filtered out.

∑

The Math

Worked examples and deeper derivation

Mean: sum all values, divide by count. If your values are x1 through xn, mean equals (x1 + x2 + ... + xn) divided by n. Median: sort the values ascending. If count is odd, take the middle value. If count is even, average the two middle values. Mode: tally the frequency of each unique value and report the value with the highest count.

Sample standard deviation uses the formula: SD equals the square root of the sum of squared differences from the mean, divided by n-1. The n-1 denominator (Bessel's correction) compensates for the tendency of a sample to underestimate how spread out the full population actually is. Population standard deviation uses n in the denominator instead, giving a smaller result. Variance is whichever of those squared results before taking the square root.

Range is simply max minus min — no weighting, no averaging. It is the fastest spread measure to compute mentally and the fastest to mislead you if your dataset has even one rogue outlier. For datasets with extreme values, interquartile range (IQR) — the span from the 25th to the 75th percentile — is more informative, though that requires sorting the dataset into quartiles beyond what this tool reports.

Student checking homework before submission

Dataset: 78, 85, 90, 72, 88, 91, 76, 84, 69, 95 — sample mode, 10 test scores

Mean comes out to 82.8, median is 84.5, mode is null (all unique), SD is approximately 8.35. The gap between mean and median tells the student the distribution skews slightly left — a couple of low scores pull the mean down. Submitting these numbers with confidence takes under 20 seconds.

Quality control check on a production line measurement

Dataset: 10.01, 9.98, 10.02, 9.99, 10.03, 9.97, 10.01 — population mode, 7 measurements

Mean is 10.0014, SD in population mode is approximately 0.0021. A standard deviation under 0.005 on a 10mm part tells the QC engineer the line is running tight. Switching to population mode matters here because these 7 readings represent the entire current production batch, not a sample drawn from a larger run.

Real estate agent comparing days-on-market across listings

Dataset: 14, 22, 8, 45, 19, 11, 33, 27, 6, 52 — sample mode, 10 listings

Mean is 23.7 days, median is 20.5 days, SD is approximately 15.7 days. The large standard deviation relative to the mean signals high spread — a few slow-moving properties are inflating the average. The agent learns to quote the median (20.5 days) rather than the mean when marketing their turnaround speed, because the median is more representative of a typical listing.

Expert Unlock

The thing most explanations skip

Sample standard deviation with Bessel's correction (n-1) is an unbiased estimator of population variance, but it is not an unbiased estimator of population standard deviation — because the square root is a concave function, Jensen's inequality guarantees that the expected value of the square root is less than the square root of the expected value. In practice this bias is negligible for n above 30, but for small samples (n under 10) the corrected SD still slightly underestimates the true population SD. Some fields use correction factors to compensate, but the tool follows the standard academic and industry convention of n-1 as-is.

Why does my standard deviation not match what my textbook says?

What is the difference between sample standard deviation and population standard deviation?

Sample standard deviation divides by n-1 (Bessel's correction) to account for the fact that a sample tends to underestimate the true spread of a population. Population standard deviation divides by n and is only appropriate when your data includes every possible value in the group you care about — for example, all employees at a specific company, not a random selection of them. Most homework problems and real-world analyses use the sample version unless the problem explicitly states otherwise.

Why is mode showing null when I have a clear dataset?

Mode reports null when every value in the dataset appears exactly once — there is no single most-frequent value, so no mode exists by the standard definition. This is mathematically correct and common in small datasets with continuous measurements. If you want the most central value in that situation, median is the better statistic to report.

Can I paste data copied from Excel or Google Sheets?

Yes — values separated by tabs, commas, spaces, semicolons, or line breaks all parse correctly. A column of numbers copied directly from a spreadsheet will work without any reformatting. The calculator ignores blank lines and non-numeric text automatically, so column headers are stripped out before calculation.

Need something this doesn't cover?

Suggest a tool — we'll build it →