Cumulative frequency is the running total in a frequency table. It tells you how many observations are at or below a value or class boundary, which is why it is useful for finding the median, quartiles, and percentiles.

An ogive is the graph of that running total. Once you can read the table and the graph together, grouped-data questions become much easier.

Cumulative Frequency Definition

If the class frequencies are f1,f2,,fkf_1, f_2, \dots, f_k, then the cumulative frequency up to class kk is

Fk=f1+f2++fkF_k = f_1 + f_2 + \cdots + f_k

Each row adds one more class to the total. If the cumulative frequency is 2828 at the end of a class, then 2828 observations are in that class or below it.

For ungrouped data, cumulative frequency is just a running count. For grouped data, it is a running count by class interval.

How an Ogive Helps You Read Percentiles

An ogive plots cumulative frequency against class boundaries. For grouped continuous data, you usually plot:

  • the upper class boundary on the horizontal axis
  • the cumulative frequency on the vertical axis

Then you join the points with a smooth or piecewise line. The curve rises because cumulative frequency never decreases.

The main use of an ogive is reading positions in the ordered data set. If the total frequency is NN, then:

  • the median is around the N/2N/2th value
  • the first quartile is around the N/4N/4th value
  • the third quartile is around the 3N/43N/4th value
  • the ppth percentile is around the (p/100)N(p/100)Nth value

On the graph, you start from that vertical position, move across to the ogive, then drop down to the horizontal axis to estimate the value.

Worked Example: Median and 75th Percentile

Suppose test scores for 4040 students are grouped like this:

Score Frequency Cumulative frequency
0-10 22 22
10-20 55 77
20-30 99 1616
30-40 1212 2828
40-50 88 3636
50-60 44 4040

The total frequency is N=40N = 40.

Find the median from the table

The median is the N/2=20N/2 = 20th value.

Look at the cumulative frequencies:

  • up to 20-30, the total is 1616
  • up to 30-40, the total is 2828

So the 2020th value lies in the 3030-4040 class.

If you want a grouped-data estimate, use interpolation only if it is reasonable to treat the values as spread fairly evenly through that class. Then

medianL+N/2Fbeforefw\text{median} \approx L + \frac{N/2 - F_{\text{before}}}{f} \cdot w

Here:

  • L=30L = 30 is the lower boundary of the class
  • Fbefore=16F_{\text{before}} = 16 is the cumulative frequency before the class
  • f=12f = 12 is the class frequency
  • w=10w = 10 is the class width

So

median30+20161210=30+401233.3\text{median} \approx 30 + \frac{20 - 16}{12} \cdot 10 = 30 + \frac{40}{12} \approx 33.3

That estimate is not exact. It depends on the assumption that the values inside the 3030-4040 class are spread fairly smoothly.

Estimate the 75th percentile

The 7575th percentile is the (75/100)40=30(75/100) \cdot 40 = 30th value.

From the cumulative frequencies:

  • up to 30-40, the total is 2828
  • up to 40-50, the total is 3636

So the 3030th value lies in the 4040-5050 class.

Using the same interpolation idea,

P7540+3028810=42.5P_{75} \approx 40 + \frac{30 - 28}{8} \cdot 10 = 42.5

On an ogive, you would mark 3030 on the cumulative-frequency axis, move across to the curve, and then read down to about 42.542.5 on the score axis.

Common Mistakes with Cumulative Frequency

Mixing up frequency and cumulative frequency

Frequency tells you how many observations are in one class. Cumulative frequency tells you how many observations are in that class and all earlier classes together.

Using the wrong position

For the median or a percentile, the position comes from the total frequency NN. If you use the wrong total, every later step is off.

Treating grouped estimates as exact

An ogive or interpolation gives an estimate inside a class, not an exact original data value. That estimate depends on how the data are distributed inside the interval.

Plotting the wrong horizontal values

For grouped data, ogives are usually plotted against class boundaries, especially upper class boundaries. Plotting against class midpoints changes the meaning.

When Cumulative Frequency Is Used

Cumulative frequency is used whenever you need ordered position in a data set rather than just class-by-class counts. That includes exam-score summaries, income distributions, quality-control data, and any situation where percentiles or medians matter more than individual bin counts.

It is especially useful when raw data are large and a grouped table is easier to read than a long list of observations.

Try a Similar Cumulative Frequency Problem

Take any small grouped table and add a cumulative frequency column before drawing an ogive. Then read the median and one percentile from the graph and compare them with the table-based estimate.

If you want one more check, try your own version with N=50N = 50 and ask where the 2020th, 2525th, and 4545th values would fall. That is a simple way to make the idea stick.

Need help with a problem?

Upload your question and get a verified, step-by-step solution in seconds.

Open GPAI Solver →