Calibration Curve Calculator
How reliable is your calibration curve, and what is your unknown concentration?
Paste in your calibration standards and instrument readings, and get a linear regression equation, R-squared fit quality, and calculated concentration for any unknown sample — in seconds.
—
Send feedback
💡 Share your idea or report a problem
✓ Thanks! We'll take a look.
Learn more
How It Works
The formula, explained simply
A calibration curve works like a translator between two languages your instrument does not speak natively. The instrument produces a signal — absorbance, peak area, voltage — but what you care about is concentration. The curve builds a Rosetta Stone: measure a set of known concentrations, record the signals, and fit a line through the pairs. Once the line exists, you can run any unknown sample, read its signal, and reverse-translate back into concentration.
The math behind the translation is ordinary least squares linear regression. For each pair of concentration x and signal y values, the algorithm finds the slope and intercept that minimize the sum of squared vertical distances from the data points to the fitted line. The result is the equation y = slope * x + intercept. To find an unknown concentration, you rearrange it: x = (y - intercept) / slope. Every calculation in this tool uses that exact inversion.
R-squared measures how well the line describes your data. A value of 1.000 means every data point sits exactly on the line — perfect linearity. A value of 0.990 means the line explains 99% of the variance in your responses, with 1% unexplained by linear behavior. That remaining 1% sounds trivial, but in analytical chemistry it compounds into meaningful concentration errors, especially near the ends of your working range. This is why professional methods set minimum R-squared thresholds rather than eyeballing the fit.
When To Use This
Right tool, right situation
Use this calculator any time you need to back-calculate a concentration from an instrument signal using a linear standard curve — UV-Vis absorbance, HPLC peak area, fluorescence intensity, ion-selective electrode voltage, or any other response that follows Beer-Lambert or a linear detector law across your working range. It is appropriate for protein assays, nutrient analyses, heavy metal quantitation, and pharmaceutical potency checks where linearity has been established.
Do not use this tool when your response-concentration relationship is non-linear. Enzyme-linked immunosorbent assays (ELISA) and many immunoassays follow a sigmoidal 4-parameter logistic curve — fitting a line to this data and reading concentrations from the midpoint of the curve will give systematically wrong results. Similarly, if your signal detector is known to saturate at high concentrations, a linear fit is inappropriate and a weighted or polynomial regression is needed.
This tool is also not appropriate for method validation work — it calculates the regression but does not compute prediction intervals, limit of detection, limit of quantitation, or matrix-matched correction factors. For regulatory submissions or ISO-accredited reports, use validated software with full uncertainty propagation and audit trail functionality.
Common Mistakes
Why results sometimes look wrong
The most common mistake is treating R-squared as a pass/fail stamp rather than a diagnostic. A high R-squared does not mean your concentrations are accurate — it means your data is internally consistent. If all your standards were prepared from a degraded stock solution, your R-squared could be 0.9999 and every result would be systematically wrong. R-squared catches random scatter; it cannot catch systematic errors in your standards.
A second mistake is using too few standards placed too close together. Four standards across a narrow range — say 1, 2, 3, and 4 mg/L — will almost always produce a high R-squared simply because there is little room for curvature to appear. When samples then fall at 8 or 12 mg/L, the extrapolated concentrations can be substantially wrong even though the curve looked excellent within range. Always bracket your expected sample concentrations with your standards, and include a point near the upper end of your working range.
A third mistake specific to this tool is entering standards in the wrong column order — response first, concentration second — which inverts the slope and produces nonsense concentration results. The input expects concentration on the left of the comma and instrument response on the right. If your calculated unknown concentration comes out as a tiny fraction of what you expect, check the order of your columns before assuming the chemistry is wrong.
The Math
Worked examples and deeper derivation
The linear regression equations used here are the closed-form ordinary least squares solutions. For n data points with concentrations x and responses y:
Slope (m) = (n * sum(xy) - sum(x) * sum(y)) / (n * sum(x^2) - sum(x)^2)
Intercept (b) = (sum(y) - m * sum(x)) / n
R-squared is calculated as 1 - (SS_residuals / SS_total), where SS_residuals is the sum of squared differences between each actual y and the predicted y at that x, and SS_total is the sum of squared differences between each actual y and the mean y.
The unknown concentration is recovered by inverting the line equation: if the signal is y_unknown, then x_unknown = (y_unknown - b) / m.
One practical implication of this math is that R-squared is not symmetric — it measures how well x predicts y, not how well y predicts x. Calibration curves use x (concentration) to predict y (response) during construction, but then invert the relationship to predict x from y for unknowns. The propagation of uncertainty from regression errors into the final concentration estimate requires additional calculations — prediction intervals — that go beyond what this tool computes. For regulated methods, consult your method documentation for the correct confidence interval approach.
Expert Unlock
The thing most explanations skip
Ordinary least squares regression assumes that all error lies in the y-direction (the response) and that x values (concentrations) are known exactly. In practice, gravimetric or volumetric preparation of standards always carries some error, which means both axes have uncertainty. At low concentrations, the signal-to-noise ratio is worse, producing larger relative errors in y. This violates the homoscedasticity assumption of OLS and means that high-concentration standards exert disproportionate leverage on the slope estimate. For methods spanning more than two orders of magnitude, weighted least squares regression — where each point is weighted by 1/y or 1/y^2 — gives a more accurate slope at low concentrations where regulatory limits often sit. The R-squared from an OLS fit on wide-range data will often look excellent while the low-end accuracy is meaningfully worse than the headline number suggests.
What does R-squared tell me about trusting my unknown concentration result?
Need something this doesn't cover?
Suggest a tool — we'll build it →