What Are Percentiles?
You're already familiar with quartiles, which divide data into 4 equal parts, right? Well, percentiles are like quartiles' sibling, but they're even more detailed!
If quartiles divide data into 4 chunks, percentiles divide ordered data into 100 equal chunks. That's a lot, huh? Like dividing a chocolate bar into 100 tiny squares.
Each chunk is separated by a percentile value. There are 99 percentile values, starting from , , , ..., up to .
- (10th Percentile) means this value separates the smallest 10% of the data from the remaining 90%.
- (50th Percentile) is exactly the same as the Median or the Second Quartile ( ), because it divides the data right in the middle (50% below, 50% above).
- (85th Percentile) means this value separates the smallest 85% of the data from the largest 15%.
Percentiles are very useful for seeing the position of a specific value relative to the entire dataset, like class rankings for test scores or a child's growth compared to peers of the same age.
How to Find Percentile Values for Grouped Data
Just like finding quartiles for grouped data, we also use interpolation to find the value of a percentile () when the data is grouped.
The steps are very similar:
Find the Percentile Class Position
First, we determine which data point corresponds to the i-th percentile. The formula is:
- = Which percentile are we looking for? (e.g., 10, 50, 85)
- = Total number of data points
Once we have the position, we look at the cumulative frequency table () to find out which class interval this percentile falls into.
Calculate the Percentile Value using the Interpolation Formula
Once we know the class, we use this magic interpolation formula:
Where:
- = Value of the i-th Percentile (what we're looking for)
- = Lower boundary of the i-th percentile class
- = Which percentile (e.g., 10, 85)
- = Total frequency
- = Cumulative frequency BEFORE the i-th percentile class
- = Frequency of the i-th percentile class
- = Class width
Notice, the formula is very similar to the quartile formula, the only difference is the part (quartiles use ).
Finding Math Test Scores
For example, let's say we have the math test scores of 40 students:
Test Score | Frequency () | Cumulative Frequency () | Lower Boundary () | Class Width () |
---|---|---|---|---|
61-70 | 4 | 4 | 10 | |
71-80 | 10 | 14 | 10 | |
81-90 | 16 | 30 | 10 | |
91-100 | 10 | 40 | 10 | |
Total | 40 |
We want to find the value of the 85th Percentile ().
-
Find the Position of :
Position of = the -th data point.
-
Determine the Class of :
Look at the column. Where is the 34th data point? The 81-90 class has (not enough). The 91-100 class has (data points 31 through 40 are here). So, the class is 91-100.
-
Gather Ingredients for the Formula:
- (Lower boundary of class 91-100) = 90.5
- = 85
- = 40
- (Cumulative frequency before class 91-100) = 30
- (Frequency of class 91-100) = 10
- (Class width) = 10
-
Calculate :
So, the 85th Percentile value is 94.5. This means 85% of the students scored 94.5 or less, and 15% scored above 94.5.
Exercise
Try calculating the value of the 20th Percentile () from the math test score data above!
Answer Key
-
Position of :
Position of = the -th data point.
-
Class of :
Look at . The 8th data point is in the 71-80 class (because the previous class's is 4, and this class's is 14).
-
Formula Ingredients:
- = 70.5
- = 20
- = 40
- = 4
- = 10
- = 10
-
Calculate :
The 20th Percentile value is 74.5.