The five-number summary of a data set (that is, of a set of values) is the minimum value, the maximum value, and, between these two extremes, the quartile values Q1, Q2, and Q3.
Content Continues Below
The terminology for box-and-whisker plots can vary somewhat, and you may encounter some different words, depending on your textbook and instructor.
The top end of your box, at Q3, may be called the "upper hinge"; the lower end, at Q1, may be called the "lower hinge".
The lower hinge is also called "the 25th percentile"; the median is "the 50th percentile"; the upper hinge is "the 75th percentile". This means that 25%, 50% and 75% of the data, respectively, is at or below that point.
The distance between the hinges may be referred to as the "H-spread" or, as you will see on the following page, the "Interquartile Range", abbreviated "IQR". ("Hinge" actually has a different technical definition, but the term is sometimes used informally. You should use whichever terminology your book or instructor uses.)
Also, some books and software will include the overall median (that is, Q2) when computing Q1 and Q3 for data sets with an odd number of elements. The Texas Instruments calculators do not include Q2 in this case, so you may encounter a book answer that doesn't match the calculator answer. And different software packages use all different sorts of formulas. Be careful to use the formula from your book when doing your homework!
Additionally, the box-and-whisker plot may include a cross or an "X" marking the mean value of the data, in addition to the line inside the box that marks the median. The difference between the "X" and the median line can then be used as a measure of "skew".
Please don't ask me to explain "skew".
77, 79, 80, 86, 87, 87, 94, 99
These data are already listed in numerical order, so my first step is to find the median. Since there are eight data points, the median will be the average of the two middle values:
(86 + 87) ÷ 2 = 86.5 = Q2
This splits the list into two halves:
77, 79, 80, 86
...and:
87, 87, 94, 99
Since the halves of the data set each contain an even number of values, the sub-medians (that is, the first and third quartiles) will be the averages of the middle two values of each half.
Q1 = (79 + 80) ÷ 2 = 79.5
Q3 = (87 + 94) ÷ 2 = 90.5
The minimum value is 77 and the maximum value is 99, so I have:
min: 77
Q1: 79.5
Q2: 86.5
Q3: 90.5
max: 99
Then my plot looks like this:
(Yes, my solution above started with me referring to the listed values as "these data are", rather than (as has become common) "this data is". Technically, "data" is plural; a single value would be a "datum". No, I don't usually use the terms correctly, either.)
Content Continues Below
As you can see, you only need the five values listed above (min, Q1, Q2, Q3, and max) in order to draw your box-and-whisker plot. This set of five values has been given the name "the five-number summary".
If you are asked to produce the five-number summary, they're wanting the same values as I found in the example above, but without doing the associated box-and-whisker plot.
79, 53, 82, 91, 87, 98, 80, 93
The five-number summary consists of the numbers I need for the box-and-whisker plot: the minimum value, Q1 (being the bottom of the box), Q2 (being the median of the entire set), Q3 (being the top of the box), and the maximum value (which is also Q4).
So I need to order the set, find the median and the sub-medians (or hinges), note the largest and smallest values, and then list the required values.
I'll start with ordering the list:
53, 79, 80, 82, 87, 91, 93, 98
Now I can see that the minimum is 53 and the maximum is 98.
To find the median, I'll take the average of the middle two values of the ordered list:
(82 + 87) ÷ 2 = 84.5 = Q2
The median splits the data into two halves. The lower half of the list is:
53, 79, 80, 82
With an even number of values, the first quartile's value will be the average of the middle two values of this half of the list:
Q1 = (79 + 80) ÷ 2 = 79.5
I'll do the same thing with the upper half of the list, which is:
87, 91, 93, 98
Q3 = (91 + 93) ÷ 2 = 92
This gives me all the values for the five-number summary. Then my answer (with labelling, just to be clear) is:
five-number summary:
min: 53
Q1: 79.5
;Q2: 84.5
Q3: 92
max: 98
Part of the point of a box-and-whisker plot is to show how spread out your values are. But what if one or another of your values is way out of line? For this, we need to consider "outliers".
You can use the Mathway widget below to practice finding the five-number summary (or skip the widget and continue with the lesson). Try the entered exercise, or type in your own exercise. Then click the button and scroll down to "Find the Five Number Summary" to compare your answer to Mathway's.
Please accept "preferences" cookies in order to enable this widget.
(Click "Tap to view steps" to be taken directly to the Mathway site for a paid upgrade.)
URL: https://www.purplemath.com/modules/boxwhisk2.htm
© 2024 Purplemath, Inc. All right reserved. Web Design by