Cumulative frequency is the running total in a frequency table. It tells you how many observations are at or below a value or class boundary, which is why it is useful for finding the median, quartiles, and percentiles.
An ogive is the graph of that running total. Once you can read the table and the graph together, grouped-data questions become much easier.
Cumulative Frequency Definition
If the class frequencies are , then the cumulative frequency up to class is
Each row adds one more class to the total. If the cumulative frequency is at the end of a class, then observations are in that class or below it.
For ungrouped data, cumulative frequency is just a running count. For grouped data, it is a running count by class interval.
How an Ogive Helps You Read Percentiles
An ogive plots cumulative frequency against class boundaries. For grouped continuous data, you usually plot:
- the upper class boundary on the horizontal axis
- the cumulative frequency on the vertical axis
Then you join the points with a smooth or piecewise line. The curve rises because cumulative frequency never decreases.
The main use of an ogive is reading positions in the ordered data set. If the total frequency is , then:
- the median is around the th value
- the first quartile is around the th value
- the third quartile is around the th value
- the th percentile is around the th value
On the graph, you start from that vertical position, move across to the ogive, then drop down to the horizontal axis to estimate the value.
Worked Example: Median and 75th Percentile
Suppose test scores for students are grouped like this:
| Score | Frequency | Cumulative frequency |
|---|---|---|
| 0-10 | ||
| 10-20 | ||
| 20-30 | ||
| 30-40 | ||
| 40-50 | ||
| 50-60 |
The total frequency is .
Find the median from the table
The median is the th value.
Look at the cumulative frequencies:
- up to 20-30, the total is
- up to 30-40, the total is
So the th value lies in the - class.
If you want a grouped-data estimate, use interpolation only if it is reasonable to treat the values as spread fairly evenly through that class. Then
Here:
- is the lower boundary of the class
- is the cumulative frequency before the class
- is the class frequency
- is the class width
So
That estimate is not exact. It depends on the assumption that the values inside the - class are spread fairly smoothly.
Estimate the 75th percentile
The th percentile is the th value.
From the cumulative frequencies:
- up to 30-40, the total is
- up to 40-50, the total is
So the th value lies in the - class.
Using the same interpolation idea,
On an ogive, you would mark on the cumulative-frequency axis, move across to the curve, and then read down to about on the score axis.
Common Mistakes with Cumulative Frequency
Mixing up frequency and cumulative frequency
Frequency tells you how many observations are in one class. Cumulative frequency tells you how many observations are in that class and all earlier classes together.
Using the wrong position
For the median or a percentile, the position comes from the total frequency . If you use the wrong total, every later step is off.
Treating grouped estimates as exact
An ogive or interpolation gives an estimate inside a class, not an exact original data value. That estimate depends on how the data are distributed inside the interval.
Plotting the wrong horizontal values
For grouped data, ogives are usually plotted against class boundaries, especially upper class boundaries. Plotting against class midpoints changes the meaning.
When Cumulative Frequency Is Used
Cumulative frequency is used whenever you need ordered position in a data set rather than just class-by-class counts. That includes exam-score summaries, income distributions, quality-control data, and any situation where percentiles or medians matter more than individual bin counts.
It is especially useful when raw data are large and a grouped table is easier to read than a long list of observations.
Try a Similar Cumulative Frequency Problem
Take any small grouped table and add a cumulative frequency column before drawing an ogive. Then read the median and one percentile from the graph and compare them with the table-based estimate.
If you want one more check, try your own version with and ask where the th, th, and th values would fall. That is a simple way to make the idea stick.
Need help with a problem?
Upload your question and get a verified, step-by-step solution in seconds.
Open GPAI Solver →