Help With Statistics

What are the mean, median, mode, and spread of your data?

Paste or type a list of numbers and get every core descriptive statistic in one shot. Mean, median, mode, range, standard deviation, variance, and count — calculated instantly so you can move on to interpreting your data instead of grinding through arithmetic.

Updated July 2026 · How this works

Your Data Set

Calculation Type

—

See a way to make this better?

Worth knowing

Learn more

How It Works

The formula, explained simply

Imagine pouring a bag of marbles onto a table and asking two questions: where is the center of the pile, and how spread out is it? Mean, median, and mode each answer the first question differently. Standard deviation and variance answer the second. Together they give you a complete picture of shape, center, and spread — the three pillars of descriptive statistics.

The mean is the balance point. If you placed each data value on a number line as a weight, the mean is the point where the line tips neither left nor right. It is sensitive to every value, which makes it accurate for symmetric data but misleading when one extreme number pulls it away from the bulk of values. The median ignores magnitude and focuses on position — the value at the exact middle after sorting. When the mean and median are far apart, that gap itself is a signal: your data is skewed.

Standard deviation translates variance — a squared unit that is hard to interpret — back into the original unit of measurement. A standard deviation of 5 kilograms means values typically sit about 5 kg away from the mean. Variance is standard deviation squared, useful in further statistical calculations but less intuitive as a standalone number. The choice between population and sample formulas is not about data size — it is about whether your data is the whole group or a representation of a larger one.

When To Use This

Right tool, right situation

Use this tool whenever you need a fast, reliable summary of a numeric data set: a homework problem, a quick report, a sanity check before running a more complex analysis, or a first look at survey results. It handles the arithmetic correctly every time, which removes transcription errors from manual calculation — the most common source of wrong answers in introductory statistics work.

This tool is appropriate for data sets where all values are numeric, the list is finite, and you want descriptive statistics rather than inferential ones. It does not compute confidence intervals, p-values, t-tests, or regression. If you are trying to determine whether two groups differ significantly, or whether a relationship between variables is real, you need inferential statistics tools beyond what descriptive statistics alone can tell you.

Do not rely on this tool when your data has ordinal or categorical values that happen to be coded as numbers — for example, survey questions where 1 means Strongly Disagree and 5 means Strongly Agree. Treating those as numbers and computing a mean is technically possible but statistically contested. Median and mode are safer summaries for ordinal data. The tool will calculate a mean regardless, so that judgment call rests with you.

Common Mistakes

Why results sometimes look wrong

The most common mistake is using population standard deviation when sample standard deviation is required. If your data is a survey sample, a batch test, or any subset drawn from a larger population, always use the sample formula (divide by n-1). Using population formula on sample data systematically underestimates variability, which matters when you use the standard deviation to draw conclusions about the larger group.

A second mistake is treating mode as always meaningful. When data values are continuous measurements — lengths, times, temperatures — mode is usually uninformative because exact repetition is rare by chance. Mode is most useful when data is discrete and repetition is expected, such as ratings on a 1-10 scale or survey response categories. Reporting mode for continuous measurement data without rounding first often produces a misleading result.

A third mistake is confusing a large standard deviation with bad data. High standard deviation means high variability — which is a property of the data, not a problem with the measurement. A wide spread might be exactly what you expect from a diverse population. The mistake is assuming the goal is always a small standard deviation. In quality control, a small std dev is good. In ecological diversity measurement, a large one might be the intended outcome.

∑

The Math

Worked examples and deeper derivation

Mean is calculated as the sum of all values divided by the count: mean = (x1 + x2 + ... + xn) / n. Median is determined by sorting the values in ascending order and taking the middle value. For an even count, median is the average of the two middle values. Mode is identified by counting how often each value appears and selecting the most frequent.

Variance measures the average squared distance from the mean. For a population: variance = sum of (xi - mean) squared, divided by n. For a sample: the denominator becomes n-1. Squaring the differences removes negatives and amplifies larger deviations. Standard deviation is simply the square root of variance, returning the result to the original unit scale. Range is the maximum value minus the minimum value — a single-number summary of total spread.

The relationship between mean and median encodes skew. When mean exceeds median, the distribution likely has a long right tail — a few high values pulling the average up. When median exceeds mean, a long left tail of low values is pulling the average down. When they are nearly equal, the data is roughly symmetric. This mean-median gap is one of the fastest diagnostics in descriptive statistics and requires no additional calculation once you have both numbers.

Grading a quiz — class scores analysis

Scores: 72, 85, 90, 68, 77, 85, 91, 64, 80, 85 — Population

Mean is 79.7, median is 81, mode is 85 (three students). The mean pulled below the median by the two low scores at 64 and 68 — a classic left-skew signal. A teacher using only the mean might underestimate how well most of the class performed.

Survey sample — customer satisfaction ratings

Ratings 1-10: 7, 8, 6, 9, 8, 10, 5, 8, 7, 9, 6, 8 — Sample

With Sample selected, the standard deviation uses n-1 (Bessel's correction), giving a slightly larger spread estimate than population formula would. This is appropriate because the 12 respondents represent a larger customer base. The sample std dev of about 1.38 tells you most customers rate within 1.4 points of the mean — tight agreement.

Quality control in manufacturing — part measurements

Measured diameters in mm: 10.01, 9.99, 10.02, 10.00, 9.98, 10.01, 10.03, 9.97 — Population

Mean is 10.0013 mm, very close to the 10 mm target. Standard deviation of about 0.019 mm tells the engineer whether the process is within tolerance. If the spec allows plus or minus 0.05 mm, a std dev of 0.019 means nearly all parts fall inside the window — a healthy process. Range of 0.06 mm is the first signal that one part may be drifting near the edge.

Expert Unlock

The thing most explanations skip

The population versus sample distinction is not just a formula detail — it reflects a deeper question about what your data represents. The n-1 denominator in Bessel's correction is not a rounding convenience. It corrects for the fact that when you estimate the mean from a sample, you introduce a bias: sample values tend to be closer to their own sample mean than to the true population mean. Dividing by n-1 inflates variance slightly to compensate for that systematic underestimate. For large n, the correction is negligible. For n below 30, it can matter meaningfully. If your data set has fewer than 10 values and you are generalizing to a population, the correction deserves explicit acknowledgment in any report.

Mean came out different from what I expected — what am I missing?

What is the difference between population and sample standard deviation?

Population standard deviation divides the sum of squared differences by n, treating your data as the complete group. Sample standard deviation divides by n-1 (Bessel's correction), which produces a slightly larger number and is used when your data is a subset drawn from a bigger population. If you measured every unit in a batch, use population. If your 50 survey responses represent millions of customers, use sample.

How is median different from mean and when does it matter more?

The mean adds all values and divides by count, so a single extreme outlier can drag it far from the center. The median is the literal middle value when data is sorted, and outliers cannot move it. Median matters more when data is skewed — income distributions, house prices, and response times are classic cases where median gives a more honest picture of what a typical value looks like.

What does it mean if my data has no mode?

No mode means every value in the data set appears exactly once — there is no single most frequent number. This is common in small, continuous, or highly varied data sets. Mode is most useful for categorical or discrete data like survey ratings, test scores, or counts, where repetition is expected. For continuous measurements like heights or weights, mode is often uninformative.

Need something this doesn't cover?

Suggest a tool — we'll build it →