Grouped-data questions about medians and percentiles almost always come down to one column you can build yourself: the cumulative frequency, or running total of how many observations fall at or below each class boundary. Add that column and an ogive, the graph of that running total, and finding the median, quartiles, and percentiles becomes a procedure you can repeat.
When To Use Cumulative Frequency
Reach for cumulative frequency whenever a question asks for ordered position in a data set rather than a class-by-class count: a median, a quartile, or a percentile. It is the right tool when raw data are large and a grouped table is easier to read than a long list, and it is what an ogive is built from. If you only need to know how many observations land in one class, plain frequency is enough; the moment "at or below" enters the question, switch to cumulative frequency.
The Procedure, Step By Step
If the class frequencies are , then the cumulative frequency up to class is
Each row adds one more class to the total. If the cumulative frequency is at the end of a class, then observations are in that class or below it. For ungrouped data, cumulative frequency is just a running count; for grouped data, it is a running count by class interval.
To locate a value, work from the total frequency :
- the median is around the th value
- the first quartile is around the th value
- the third quartile is around the th value
- the th percentile is around the th value
An ogive turns those positions into a reading. Plot the upper class boundary on the horizontal axis and the cumulative frequency on the vertical axis, then join the points; the curve only rises, because cumulative frequency never decreases. Start from a vertical position, move across to the curve, and drop down to the horizontal axis to estimate the value.
A Complete Pass Through One Table
Suppose test scores for students are grouped like this:
| Score | Frequency | Cumulative frequency |
|---|---|---|
| 0-10 | ||
| 10-20 | ||
| 20-30 | ||
| 30-40 | ||
| 40-50 | ||
| 50-60 |
The total frequency is .
The median
The median is the th value. Reading the cumulative frequencies, the total is up to 20-30 and up to 30-40, so the th value lies in the - class. If you want a grouped-data estimate, use interpolation only when it is reasonable to treat the values as spread fairly evenly through that class:
Here is the lower boundary, is the cumulative frequency before the class, is the class frequency, and is the class width. So
That estimate is not exact. It depends on the assumption that the values inside the - class are spread fairly smoothly.
The 75th percentile
The th percentile is the th value. The total is up to 30-40 and up to 40-50, so the th value lies in the - class. Using the same interpolation idea,
On an ogive, you would mark on the cumulative-frequency axis, move across to the curve, and read down to about on the score axis.
Where Each Step Tends To Break, And How To Check It
Step: building the column. The frequent slip is confusing frequency with cumulative frequency. Frequency counts one class; cumulative frequency counts that class plus all earlier ones. Self-check: the last cumulative entry must equal .
Step: locating the position. The position comes from the total . Use the wrong total and every later step is off. Self-check: re-derive the position from before reading the table.
Step: reading off the value. A grouped estimate lands inside a class, not on an exact original value, and it depends on how the data are distributed in the interval. Self-check: confirm your answer falls between the class boundaries you used.
Step: drawing the ogive. For grouped data, plot against class boundaries, especially upper class boundaries; plotting against midpoints changes the meaning. Self-check: the leftmost point should sit at the first cumulative frequency, the rightmost at .
Where Cumulative Frequency Shows Up
Cumulative frequency appears in exam-score summaries, income distributions, quality-control data, and any setting where percentiles or medians matter more than individual bin counts. It is especially handy when raw data are large and a grouped table reads more clearly than the full list of observations.
Run The Procedure Once More
Build a cumulative frequency column for a small grouped table with , then locate where the th, th, and th values fall. Draw the ogive, read off the median and one percentile, and compare those readings with the table-based estimates. Matching the two is the cleanest sign the procedure has clicked.
Frequently Asked Questions
- What is cumulative frequency?
- Cumulative frequency is the running total of frequencies, so it shows how many observations are at or below a given value or class boundary.
- What is an ogive?
- An ogive is a graph of cumulative frequency against value or class boundary. It is commonly used to read medians, quartiles, and percentiles.
- Can you find exact percentiles from grouped data?
- Usually only approximately. An estimate inside a class depends on an interpolation assumption, which treats values as spread fairly smoothly through that class.
Need help with a problem?
Upload your question and get a verified, step-by-step solution in seconds.
Open GPAI Solver →