Help With Statistics
What are the mean, median, mode, and spread of your data?
Paste or type a list of numbers and get every core descriptive statistic in one shot. Mean, median, mode, range, standard deviation, variance, and count — calculated instantly so you can move on to interpreting your data instead of grinding through arithmetic.
—
Send feedback
💡 Share your idea or report a problem
✓ Thanks! We'll take a look.
Learn more
How It Works
The formula, explained simply
Imagine pouring a bag of marbles onto a table and asking two questions: where is the center of the pile, and how spread out is it? Mean, median, and mode each answer the first question differently. Standard deviation and variance answer the second. Together they give you a complete picture of shape, center, and spread — the three pillars of descriptive statistics.
The mean is the balance point. If you placed each data value on a number line as a weight, the mean is the point where the line tips neither left nor right. It is sensitive to every value, which makes it accurate for symmetric data but misleading when one extreme number pulls it away from the bulk of values. The median ignores magnitude and focuses on position — the value at the exact middle after sorting. When the mean and median are far apart, that gap itself is a signal: your data is skewed.
Standard deviation translates variance — a squared unit that is hard to interpret — back into the original unit of measurement. A standard deviation of 5 kilograms means values typically sit about 5 kg away from the mean. Variance is standard deviation squared, useful in further statistical calculations but less intuitive as a standalone number. The choice between population and sample formulas is not about data size — it is about whether your data is the whole group or a representation of a larger one.
When To Use This
Right tool, right situation
Use this tool whenever you need a fast, reliable summary of a numeric data set: a homework problem, a quick report, a sanity check before running a more complex analysis, or a first look at survey results. It handles the arithmetic correctly every time, which removes transcription errors from manual calculation — the most common source of wrong answers in introductory statistics work.
This tool is appropriate for data sets where all values are numeric, the list is finite, and you want descriptive statistics rather than inferential ones. It does not compute confidence intervals, p-values, t-tests, or regression. If you are trying to determine whether two groups differ significantly, or whether a relationship between variables is real, you need inferential statistics tools beyond what descriptive statistics alone can tell you.
Do not rely on this tool when your data has ordinal or categorical values that happen to be coded as numbers — for example, survey questions where 1 means Strongly Disagree and 5 means Strongly Agree. Treating those as numbers and computing a mean is technically possible but statistically contested. Median and mode are safer summaries for ordinal data. The tool will calculate a mean regardless, so that judgment call rests with you.
Common Mistakes
Why results sometimes look wrong
The most common mistake is using population standard deviation when sample standard deviation is required. If your data is a survey sample, a batch test, or any subset drawn from a larger population, always use the sample formula (divide by n-1). Using population formula on sample data systematically underestimates variability, which matters when you use the standard deviation to draw conclusions about the larger group.
A second mistake is treating mode as always meaningful. When data values are continuous measurements — lengths, times, temperatures — mode is usually uninformative because exact repetition is rare by chance. Mode is most useful when data is discrete and repetition is expected, such as ratings on a 1-10 scale or survey response categories. Reporting mode for continuous measurement data without rounding first often produces a misleading result.
A third mistake is confusing a large standard deviation with bad data. High standard deviation means high variability — which is a property of the data, not a problem with the measurement. A wide spread might be exactly what you expect from a diverse population. The mistake is assuming the goal is always a small standard deviation. In quality control, a small std dev is good. In ecological diversity measurement, a large one might be the intended outcome.
The Math
Worked examples and deeper derivation
Mean is calculated as the sum of all values divided by the count: mean = (x1 + x2 + ... + xn) / n. Median is determined by sorting the values in ascending order and taking the middle value. For an even count, median is the average of the two middle values. Mode is identified by counting how often each value appears and selecting the most frequent.
Variance measures the average squared distance from the mean. For a population: variance = sum of (xi - mean) squared, divided by n. For a sample: the denominator becomes n-1. Squaring the differences removes negatives and amplifies larger deviations. Standard deviation is simply the square root of variance, returning the result to the original unit scale. Range is the maximum value minus the minimum value — a single-number summary of total spread.
The relationship between mean and median encodes skew. When mean exceeds median, the distribution likely has a long right tail — a few high values pulling the average up. When median exceeds mean, a long left tail of low values is pulling the average down. When they are nearly equal, the data is roughly symmetric. This mean-median gap is one of the fastest diagnostics in descriptive statistics and requires no additional calculation once you have both numbers.
Expert Unlock
The thing most explanations skip
The population versus sample distinction is not just a formula detail — it reflects a deeper question about what your data represents. The n-1 denominator in Bessel's correction is not a rounding convenience. It corrects for the fact that when you estimate the mean from a sample, you introduce a bias: sample values tend to be closer to their own sample mean than to the true population mean. Dividing by n-1 inflates variance slightly to compensate for that systematic underestimate. For large n, the correction is negligible. For n below 30, it can matter meaningfully. If your data set has fewer than 10 values and you are generalizing to a population, the correction deserves explicit acknowledgment in any report.
Mean came out different from what I expected — what am I missing?
Need something this doesn't cover?
Suggest a tool — we'll build it →