Histograms and Cumulative Frequency
Histograms are similar to bar charts apart from the consideration of areas. In a bar chart, all of the bars are the same width and the only thing that matters is the height of the bar. In a histogram, the area is the important thing.
Example
Draw a histogram for the following information.
Height (feet) | Frequency | Relative Frequency |
0 - 2 | 0 | 0 |
2 - 4 | 1 | 1 |
4 - 5 | 4 | 8 |
5 - 6 | 8 | 16 |
6 - 8 | 2 | 2 |
(Ignore relative frequency for now). It is difficult to draw a bar chart for this information, because the class divisions for the height are not the same.
When drawing a histogram, the y-axis is labelled ‘frequency density’ or "relative frequency". You must work out the relative frequency before you can draw a histogram. To do this, first decide upon a standard width for the groups. Some of the heights are grouped into 2s (0-2, 2-4, 6-8) and some into 1s (4-5, 5-6). Most are 2s, so we shall call the standard width 2. To make the areas match, we must double the values for frequency which have a class division of 1 (since 1 is half of 2). Therefore the figures in the 4-5 and the 5-6 columns must be doubled. If any of the class divisions were 4 (for example if there was a 8-12 group), these figures would be halved. This is because the area of this "bar" will be twice the standard width of 2 unless we half the frequency.
Cumulative Frequency
The cumulative frequency is the running total of the frequencies. On a graph, it can be represented by a cumulative frequency polygon, where straight lines join up the points, or a cumulative frequency curve.
Example
Height (cm) | Frequency | Cumulative Frequency |
0 - 100 | 4 | 4 |
100 - 120 | 6 | 10 (= 4 + 6) |
120 - 140 | 3 | 13 (= 4 + 6 + 3) |
140 - 160 | 2 | 15 (= 4 + 6 + 3 + 2) |
160 - 180 | 6 | 21 |
180 - 220 | 4 | 25 |
These data are used to draw a cumulative frequency polygon by plotting the cumulative frequencies against the upper class boundaries.