A sample problem should make this all clear: in the lab, the boiling point of a liquid, which has a theoretical value of 54.0° C, was measured by a student four times. Determine, for each measurement, the error, percent error, deviation, and percent deviation.
Observed value | Error | Percent error | Deviation | Percent deviation |
54.9 | 0.9 | 2.0% | 0.5 | 0.9% |
54.4 | 0.4 | 0.7% | 0.0 | 0.0% |
54.1 | 0.1 | 0.2% | -0.3 | -0.6% |
54.2 | 0.2 | 0.4% | -0.2 | -0.4% |
We show the calculations for the first data point as an example:
One standard deviation (sometimes expressed as "one sigma") away from the mean in either direction on the horizontal axis (the red area on the above graph) accounts for somewhere around 68 percent of the data points. Two standard deviations, or two sigmas, away from the mean (the red and green areas) account for roughly 95 percent of the data points And three standard deviations (the red, green and blue areas) account for about 99 percent of the data points.
If this curve were flatter and more spread out, the standard deviation would have to be larger in order to account for those 68 percent or so of the points. That's why the standard deviation can tell you how spread out the examples in a set are from the mean.
How do you calculate the standard deviation? It's not too difficult, but it IS tedious, unless you have a calculator that handles statistics. The formula for the standard deviation is as follows:
Basically, what this says is as follows:
If you have a statistics-capable calculator, this is really easy to do, since there is a button (usually labeled "SD") that allows you to do this. We, however, don't have a stats calculator (well, we do, but we're pretending!), so we have to do it the hard way.
In this example, the student has measured the percentage of chlorine (Cl) in an experiment a total of five times. The arithmetic mean is calculated to be 19.71. The student wishes to find out the standard deviation for the data set, with particular interest in the range of values from one sigma below the mean to one sigma above the mean:
Trial | 1 | 2 | 3 | 4 | 5 |
Percentage of Cl | 19.82 | 19.57 | 19.68 | 19.71 | 19.75 |
d | 0.11 | -0.14 | -0.03 | 0.00 | 0.04 |
d2 | 0.0121 | 0.0196 | 0.0009 | 0.0000 | 0.0016 |
Adding up all of the d2= 0.0342
Dividing 0.0342 by 4 (or 5-1) = 0.0086
Taking the square root of 0.0086= 0.09
This means that the standard deviation for this problem is 0.09, and that if we keep doing the experiment, most (68% or so) of the data points should be between 19.62 (19.71 - 0.09) and 19.80 (19.71+0.09). The lower the standard deviation, the better (in this case) the measurements are.