Calculating Spread for Grouped Data
How do we measure the spread of data presented in a grouped frequency table? For example, data on phone battery duration grouped into hour intervals (6-10 hours, 11-15 hours, etc.).
Since we don't know the exact value of each data point within a class interval (e.g., in the 11-15 hours class, we don't know if the duration was exactly 11 hours, 12 hours, or something else), we need to make an assumption.
The most common assumption is that all data within a class interval are evenly distributed. Therefore, we can represent all data in that class using the midpoint () of that class.
Formulas for Variance and Standard Deviation of Grouped Data
Using the midpoint () and frequency () of each class, the formulas are slightly different:
-
Variance () The commonly used (and easier to compute) formula is the computational formula adapted for grouped data:
This formula essentially calculates the average of the squared midpoints weighted by frequency, minus the square of the average midpoint weighted by frequency (the mean of the grouped data).
-
Standard Deviation () Just like with ungrouped data, the standard deviation is the square root of the variance:
Calculating Variance and Standard Deviation of Phone Battery Duration
Suppose a study on phone battery duration yielded the following data:
Battery duration (hours) | Frequency () |
---|---|
6-10 | 2 |
11-15 | 10 |
16-20 | 18 |
21-25 | 45 |
26-30 | 5 |
Let's determine the variance and standard deviation for this battery duration data.
Create a Helper Table
We need to calculate the midpoint () for each class, then compute and .
Battery duration (hours) | Midpoint, | Frequency, | ||
---|---|---|---|---|
6-10 | 2 | |||
11-15 | 10 | |||
16-20 | 18 | |||
21-25 | 45 | |||
26-30 | 5 | |||
Total |
Calculate Variance
Plug the total values from the table into the variance formula:
So, the variance of the battery duration data is approximately 19.37 (in units of hours squared).
Calculate Standard Deviation
Take the square root of the variance:
The standard deviation of the phone battery duration is approximately 4.4 hours. This gives us an idea that the average deviation of battery duration from the mean (which can be calculated as hours) is about 4.4 hours.
The smaller the standard deviation, the more uniform the phone battery durations were in the study.