Percentile Rank

Content code

m1371

Slug (identifier)

percentile-rank

Parent content

Measures of Position

Grades

Secondary IV

Topic

Mathematics

Tags

percentile rank

quartile

percentile

median

Content

Contenu

Corps

In an ordered distribution, a percentile rank is a measure of position used to compare one data value to the others.

Content

Corps

The percentile rank of a data value |x| is the percentage of data with a value less than or equal to |x.| The percentile rank of |x| is denoted by |R_{100}(x).|
Percentiles correspond to the |99| values that divide an ordered distribution into |100| groups, each containing approximately |1\ \%| of the data. Percentiles are numbered from |C_1| to |C_{99}.|

Content

Corps

Data that have the same value are assigned the same percentile rank.

Links

Type

Anchor

Title

Finding the Percentile Rank of a Data Value

Anchor

finding-percentile-rank

Type

Anchor

Title

Finding a Data Value From its Percentile Rank

Anchor

finding-value-percentile-rank

Title (level 2)

Finding the Percentile Rank of a Data Value

Title slug (identifier)

finding-percentile-rank

Contenu

Corps

When looking for the percentile rank of a data value |x,| we use the following formula.

Content

Corps

||R_{100}(x)=\dfrac{\begin{gather}\text{Number of data values}\\\text{that are less than }x\end{gather}+\dfrac{1}{2}\left(\begin{gathered}\text{Number of data values}\\\text{that are equal to }x\end{gathered}\right)}{\begin{gather}\text{Total number of data values}\end{gather}}\times100||

Content

Corps

If the answer is not a whole number, then we must round up to the nearest one.

Content

Corps

In a class of |30| students, Martine scored |83\ \%| on her last math exam, which was the 4th best grade in the group. In addition, she was the only student to get this grade.

What is Martine's percentile rank?

According to the information provided, |26| students |(30-4)| scored less than |83\ \%.| Also, only one student (Martine) scored exactly |83\ \%.|

We can calculate Martine's percentile rank using the formula.||\begin{align}R_{100}(\text{Martine})&=\dfrac{\begin{gather}\text{Number of grades}\\\text{less than }83\ \%\end{gather}+\dfrac{1}{2}\left(\begin{gathered}\text{Number of grades}\\\text{equal to }83\ \%\end{gathered}\right)}{\begin{gather}\text{Total number of grades}\end{gather}}\times100\\&=\dfrac{26+\dfrac{1}{2}(1)}{30}\times100\\&=88.\overline{3}\end{align}||Rounding up to the nearest whole number gives |89.|

Answer: Martine occupies the 89th percentile rank, which we denote |R_{100}(\text{Martine})=89.|

Content

Corps

The context of a situation must always be taken into account when studying the percentile rank of a data value. The better or higher performing the data value being studied is, when compared with the other data, the higher its percentile rank will be.

In the previous example, students were classed according to their score out of |100.| So the higher a student's grade, the better they performed, and therefore the higher their percentile rank.

On the other hand, we could be interested in race participants, where competitors are classed according to the time they take to cover the distance. In this type of situation, the shorter a participant's time, the better they perform, and therefore the higher their percentile rank.

Content

Corps

There were |312| participants in the last great cycling tour of Lac Mégantic. Danny crossed the finish line at the same time as |5| other people, meaning they shared the 89th position in the standings.

What is Danny's percentile rank?

According to the information provided, |218| cyclists |(312-88-5-1)| crossed the finish line after Danny. From the 312 cyclists, we have to subtract the 88 who arrived before Danny, the |5| who arrived at the same time as him and Danny himself.

Danny's percentile rank is calculated using this formula:||\begin{align}R_{100}(\text{Danny})&=\dfrac{\begin{gather}\text{Number of cyclists}\\\text{that finished after the }89^\text{th}\ \text{position}\end{gather}+\dfrac{1}{2}\left(\begin{gathered}\text{Number of cyclists}\\\text{in }89^\text{th}\ \text{position}\end{gathered}\right)}{\begin{gather}\text{Total number of cyclists}\end{gather}}\times100\\&=\dfrac{218+\dfrac{1}{2}(6)}{312}\times100\\&=70.8\overline{3}\end{align}||Rounding up to the nearest whole number gives |71.|

Image

A representation of the percentile ranks.

Corps

Answer: Danny occupies the 71st percentile, which is denoted by |R_{100}(\text{Danny})=71.|

Contenu

Title

Why Take Half of the Data Equal to x When Calculating the Percentile Rank?

Content

Corps

The percentile rank of a data value |x| has been defined as the percentage of data values that are less than or equal to |x.| So why is only half of the data that is equal to |x| used in the formula?||R_{100}(x)=\dfrac{\begin{gather}\text{Number of data }\\\text{values less than }x\end{gather}+\color{#ec0000}{\boldsymbol{\dfrac{1}{2}}\left(\color{black}{\begin{gathered}\text{Number of data}\\\text{values equal to }x\end{gathered}}\right)}}{\begin{gather}\text{Total number of data values}\end{gather}}\times100||It is because the percentile rank, as a measure of position, is an estimate that applies especially to distributions that contain a large number of data values.

If we include all the data values that are equal to |x| in the calculation, we might get a different percentile rank. In the previous example, Danny would have been in the 72nd percentile.||\begin{align}R_{100}(\text{Danny})&=\dfrac{\begin{gather}\text{Number of cyclists}\\\text{that finished after the }89^\text{th}\ \text{position}\end{gather}+\begin{gathered}\text{Number of cyclists}\\\text{at the }89^\text{th}\ \text{position}\end{gathered}}{\begin{gather}\text{Total number of cyclists}\end{gather}}\times100\\&=\dfrac{218+6}{312}\times100\\&\approx71.79\ \rightarrow\ 72\end{align}||

Image

Corps

We might also exclude all data with a value equal to |x| from the calculation. In this case, in the previous example, Danny would have ended up in the 70th percentile.||\begin{align}R_{100}(\text{Danny})&=\dfrac{\begin{gather}\text{Number of cyclists that}\\\text{finished after the }89^\text{th}\ \text{position}\end{gather}}{\begin{gather}\text{Total number of cyclists}\end{gather}}\times100\\ &=\dfrac{218}{312}\times100\\&\approx69.87\ \rightarrow\ 70\end{align}||

Image

Corps

That being said, the formula used most often in Quebec secondary schools is the one that contains the fraction |\boldsymbol{\color{#ec0000}{\dfrac{1}{2}}}.|

Content

Corps

It is possible to convert percentile ranks into quarters and the percentiles into quartiles.

Columns number

2 columns

Format

50% / 50%

First column

Corps

The percentile ranks from 1 to 25 correspond to the 1st quarter.
The percentile ranks from 26 to 50 correspond to the 2nd quarter.
The percentile ranks from 51 to 75 correspond to the 3rd quarter.
The percentile ranks from 76 to 25 correspond to the 4th quarter.

Second column

Corps

The 25th percentile |(C_{25})| corresponds to the 1st quartile |(Q_1).|
The 50th percentile |(C_{50})| corresponds to the median |(Q_2).|
The 75th percentile |(C_{75})| corresponds to the 3rd quartile |(Q_3).|

Corps

These 2 concepts can be compared using a box and whisker plot.

Image

A box and whisker plot showing the percentile ranks.

Title (level 2)

Finding a Data Value From its Percentile Rank

Title slug (identifier)

finding-value-percentile-rank

Contenu

Corps

When we are looking for the data value |x| that occupies a certain percentile rank, we must first calculate its position in the distribution using the following formula:

Content

Corps

||\begin{gathered}\text{Position of }x\\\text{in the distribution}\end{gathered}=\dfrac{R_{100}(x)}{100}\times\begin{gather}\text{Total number}\\\text{of data values}\end{gather}||

Content

Corps

If the answer is not a whole number, round down to the nearest whole number.

Content

Corps

A survey measured the height, in centimetres, of the |124| 15-year-olds at Juliette's high school. Here is the resulting sample.||150,155,156,\!\!\!\!\!\!\underbrace{\ldots}_{\large{68\ \text{students}}}\!\!\!\!\!,167,167,168,169,170,170,170,171,\!\!\!\!\!\!\underbrace{\ldots}_{\large{42\ \text{students}}}\!\!\!\!\!,182,185,188||What is Juliette's height if she occupies the 61st percentile?

Calculate Juliette’s position in the list.

||\begin{aligned}\begin{gathered}\text{Juliette's position}\\\text{in the school}\end{gathered}&=\dfrac{R_{100}(\text{Juliette})}{100}\times\begin{gather}\text{Total number}\\\text{of data}\end{gather}\\&=\dfrac{61}{100}\times124\\&=75.64\end{aligned}||Rounding down to the nearest whole number, we find that Juliette would be the 75th data value. The position of the data value we're looking for is always found by counting from the worst-performing data value, which in this case is |150.| Therefore, the 75th data value is |169.|

Check the answer.

Check to make sure that |169| indeed occupies the 61st percentile.||\begin{align}R_{100}(\text{Juliette})&=\dfrac{\begin{gather}\text{Number of data values}\\\text{less than }169\end{gather}+\dfrac{1}{2}\left(\begin{gathered}\text{Number of data values}\\\text{equal to }169\end{gathered}\right)}{\begin{gather}\text{Total number of data values}\end{gather}}\times100\\&=\dfrac{74+\dfrac{1}{2}(1)}{124}\times100\\&\approx60.08\end{align}||Rounding up to the nearest whole number gives |61.| This is indeed the data value we're looking for: |169.|

Answer: Juliette’s height is |169\ \text{cm}.|

Content

Corps

It is always best to check that the position obtained using the formula actually indicates the data value that occupies the percentile rank provided. It is possible that the data value you are looking for occupies the next position in the distribution.

Contenu

Title

Example where the calculated position does not correspond to the percentile rank provided

Content

Corps

In a class of |30| students, Martine was ranked at the 89th percentile for her last math exam. What's more, she is the only student in her class to achieve this rank. Here is the distribution of exam results for all the students in her class:

|48,| |50,| |51,| |52,| |54,| |55,| |56,| |58,| |59,| |60,| |62,| |63,| |64,| |66,| |67,| |68,| |70,| |71,| |72,| |74,| |75,| |76,| |78,| |79,| |80,| |82,| |83| |84,| |86,| |87|

What was Martine's grade on her math exam?

Calculate the position of Martine in the list.

||\begin{aligned}\begin{gathered}\text{Martine's position}\\\text{in the group}\end{gathered}&=\dfrac{R_{100}(\text{Martine})}{100}\times\begin{gather}\text{Total number}\\\text{of grades}\end{gather}\\&=\dfrac{89}{100}\times30\\&=26.7\end{aligned}||Rounding down to the nearest whole number, we find that Martine's grade is the 26th grade (82) in the distribution.

Check the answer.

Check to see if |82| indeed occupies the 89th percentile.||\begin{align}R_{100}(\text{Martine})&=\dfrac{\begin{gather}\text{Number of grades }\\\text{less than }82\ \%\end{gather}+\dfrac{1}{2}\left(\begin{gathered}\text{Number of grades }\\\text{equal to }82\ \%\end{gathered}\right)}{\begin{gather}\text{Total number of grades}\end{gather}}\times100\\&=\dfrac{25+\dfrac{1}{2}(1)}{30}\times100\\&=85\end{align}||Since the percentile rank of the data value |82| is |85| and not |89,| it means that |82| is not the data value we are looking for.

Calculate the percentile rank of the data value at the following position: |\boldsymbol{(83)}|

||\begin{align}R_{100}(\text{Martine})&=\dfrac{\begin{gather}\text{Number of grades }\\\text{less than }83\ \%\end{gather}+\dfrac{1}{2}\left(\begin{gathered}\text{Number of grades}\\\text{equal to }83\ \%\end{gathered}\right)}{\begin{gather}\text{Total number of grades}\end{gather}}\times100\\&=\dfrac{26+\dfrac{1}{2}(1)}{30}\times100\\&=88.\overline{3}\end{align}||Rounding up to the nearest whole number gives |89.| This means that |83| is the data value we are looking for.

Answer: Martine got |83\ \%| on the last mathematics exam.

Corps

To find the position of a data value in a condensed data distribution, we must use cumulative frequencies.

Content

Corps

Here are the best results from a strongman competition where participants had to lift a |200\ \text{kg}| weight as long as possible.

Recorded Time That |\boldsymbol{250}| Athletes Lifted a Weight of |\boldsymbol{200}\ \textbf{kg}|
Time (s)	Number of participants	Time (s)	Number of participants
\|8.9\|	\|10\|	\|9.6\|	\|24\|
\|9\|	\|15\|	\|9.7\|	\|22\|
\|9.1\|	\|23\|	\|9.8\|	\|20\|
\|9.2\|	\|26\|	\|9.9\|	\|17\|
\|9.3\|	\|27\|	\|10\|	\|10\|
\|9.4\|	\|25\|	\|10.1\|	\|5\|
\|9.5\|	\|24\|	\|10.2\|	\|2\|

What data value occupies the 25th percentile?

Solution

Corps

Calculate the position of the data value |\boldsymbol{(x)}| in the list.

||\begin{aligned}\begin{gathered}\text{Position of }x\\\text{in the distribution}\end{gathered}&=\dfrac{R_{100}(x)}{100}\times\begin{gather}\text{Total number }\\\text{of data values }\end{gather}\\&=\dfrac{25}{100}\times250\\&=62.5\end{aligned}||Rounding down to the nearest whole number gives |62.| We now need to find the 62nd data value using a cumulative frequency table.

Recorded Time That |\boldsymbol{250}| Athletes Lifted a Weight of |\boldsymbol{200}\ \textbf{kg}|
Time (s)	Number of participants	Cumulative frequency
\|8.9\|	\|10\|	\|10\|
\|9\|	\|15\|	\|25\|
\|9.1\|	\|23\|	\|48\|
\|9.2\|	\|26\|	\|\boldsymbol{74}\|
\|9.3\|	\|27\|
\|9.4\|	\|25\|
\|9.5\|	\|24\|

Therefore, the 62nd data value is associated with a result of |9.2\ \text{s}.|

Check the answer.

We check whether |9.2\ \text{s}| is indeed in the 25th percentile.||\begin{align}R_{100}(9.2\ \text{s})&=\dfrac{\begin{gather}\text{Number of data values}\\\text{less than }9.2\ \text{s}\end{gather}+\dfrac{1}{2}\left(\begin{gathered}\text{Number of data values}\\\text{equal to }9.2\ \text{s}\end{gathered}\right)}{\begin{gather}\text{Total number of data values}\end{gather}}\times100\\&=\dfrac{48+\dfrac{1}{2}(26)}{250}\times100\\&=24.4\end{align}||Rounding up to the nearest whole number, we get |25,| which implies that |9.2\ \text{s}| matches the data value we're looking for.

Answer: The data value that occupies the 25th percentile corresponds to a result of |9.2\ \text{s}.|

Title (level 2)

Time (s)	Number of participants	Time (s)	Number of participants
\|8.9\|	\|10\|	\|9.6\|	\|24\|
\|9\|	\|15\|	\|9.7\|	\|22\|
\|9.1\|	\|23\|	\|9.8\|	\|20\|
\|9.2\|	\|26\|	\|9.9\|	\|17\|
\|9.3\|	\|27\|	\|10\|	\|10\|
\|9.4\|	\|25\|	\|10.1\|	\|5\|
\|9.5\|	\|24\|	\|10.2\|	\|2\|

Time (s)	Number of participants	Cumulative frequency
\|8.9\|	\|10\|	\|10\|
\|9\|	\|15\|	\|25\|
\|9.1\|	\|23\|	\|48\|
\|9.2\|	\|26\|	\|\boldsymbol{74}\|
\|9.3\|	\|27\|
\|9.4\|	\|25\|
\|9.5\|	\|24\|