A box and whisker plot compresses a whole data set into five landmarks. Built from the five-number summary, it lets you see center, spread, and skew in a single glance without scanning a long list of values.
The Five-Number Summary And IQR
A box and whisker plot is built from five values: the minimum, the first quartile , the median, the third quartile , and the maximum. The box runs from to , so it contains the middle of the data, and the line inside the box is the median. The whiskers show how far the data extends beyond that middle half.
The width of the box is the interquartile range:
A larger means the middle half of the data is more spread out. Many box plots also use the rule to mark possible outliers; in that version, the whiskers stop at the most extreme non-outlier values instead of the absolute minimum and maximum.
Why The Box Sits Where It Does
The quartiles are not arbitrary. They are the cut points that split ordered data into four equal parts, so the box from to literally brackets the central half of the values. The median splits that central half again, which is why its position inside the box hints at skew: if it sits off-center, the data is not balanced around the middle.
This is also why two correct box plots of the same data can differ. The whisker rule, not the data, decides how far the tails are drawn.
Worked Example: From Data To Box Plot
Use the ordered data set
There are values, so the median is the average of the two middle values:
Because there is an even number of data points, split the list into two equal halves. The lower half is , so
The upper half is , so
That gives the five-number summary:
Now compute the interquartile range:
If you use the common outlier rule, the fences are
and
All the data values fall between and , so there are no possible outliers under that rule. For this data set, the box would run from to , the median line would be at , and the whiskers would reach and .
Practice: Read And Build One Yourself
Take the sorted set and write its five-number summary before drawing anything. With seven values the median is the fourth value, ; the lower half gives , and the upper half gives , so . Check your work: the box should run to with the median line at .
To read any finished plot, start with the median line to locate the center, compare the box width and whisker lengths for spread, then look for asymmetry that signals skew.
Calculation Pitfalls To Watch
The most common slip is reading the edges of the box as the minimum and maximum. They usually represent and , not the endpoints of the full data set.
A second pitfall is assuming every box plot uses the same whisker rule. Some whiskers extend to the true minimum and maximum; others stop at the most extreme non-outlier values under the rule.
A third is forgetting that quartiles depend on ordered data. If the values are not sorted first, the quartiles and median will be wrong. Box and whisker plots are most useful exactly when outliers or skew matter, because the median and quartiles stay more stable than the mean alone.
Frequently Asked Questions
- What does a box and whisker plot show?
- It shows the center, spread, and possible skew of a data set in one glance. The plot is built from the five-number summary: the minimum, first quartile, median, third quartile, and maximum. The box covers the middle 50 percent of the data and the whiskers show how far the rest extends.
- What is the five-number summary in a box plot?
- It is the minimum, the first quartile Q1, the median, the third quartile Q3, and the maximum. The box runs from Q1 to Q3 with the median line inside it, and the width of the box is the interquartile range, which measures the spread of the middle half of the data.
- How do you find outliers using the 1.5 IQR rule?
- Compute the interquartile range, then set fences at Q1 minus 1.5 times IQR and Q3 plus 1.5 times IQR. Values outside those fences are flagged as possible outliers, and in that version of the plot the whiskers stop at the most extreme values that are not outliers.
- Why can two box plots of the same data look different?
- Because they may use different whisker rules. Some box plots extend the whiskers to the absolute minimum and maximum, while others use the 1.5 times IQR rule and stop the whiskers at the most extreme non-outlier values. Both can be correct, so check which rule was used.
- What does the box in a box plot represent?
- The box runs from the first quartile to the third quartile, so it contains the middle 50 percent of the data, with the median drawn as a line inside it. A wider box means a larger interquartile range, meaning the middle half of the values is more spread out.
Need help with a problem?
Upload your question and get a verified, step-by-step solution in seconds.
Open GPAI Solver →