Which of the following is the correct formula for the standard deviation of grouped data multiple choice question?

Return to Stat Topics

Grouped Data Problems

Find the mean and standard deviation of the following quantitative frequency distributions. These problems were adapted from those on pages 146 to 148 of Michael Sullivan, Fundamentals of Statistics, 2nd edition, Pearson Education, Inc. 2008.

1) A sample of college students was asked how much they spent monthly on a cell phone plan (to the nearest dollar).

Monthly Cell Phone Plan Cost ($)

Number of Students

10 - 19

8

20 - 29

16

30 - 39

21

40 - 49

11

50 - 59

4

2) The following data represent the difference in scores between the winning and losing teams in a sample of 15 college football bowl games from 2004-2005.

Point Difference

Number of Bowl Games

1 - 5

8

6 - 10

0

11 - 15

2

16 - 20

3

21 - 25

1

26 - 30

0

31 - 35

1

3) The following table represents the distribution of the annual number of days over 100 degrees Fahrenheit for Dallas-Fort Worth for a sample of 80 years from 1905 to 2004.

Days Above 100 Degrees

Number of Years

0 - 9

25

10 - 19

33

20 - 29

14

30 - 39

5

40 - 49

2

50 - 59

1

4) The following table shows the distribution of the number of hours worked each week (on average) for a sample of 100 community college students.

Hours Worked per Week

Number of Students

0 - 9

24

10 - 19

14

20 - 29

39

30 - 39

18

40 - 49

5

5) The following data represents the age distribution of a sample of 100 people covered by health insurance (private or government). The sample was taken in 2003.

Age

Number

25 - 34

23

35 - 44

29

45 - 54

28

55 - 64

20

6) The following data represent the high temperature distribution in degrees Fahrenheit for a sample of 40 days from the month of August in Chicago since 1872.

Temperature

Days

60 - 69

3

70 - 79

15

80 - 89

17

90 - 99

5

7) The following data represent the annual rainfall distribution in St. Louis, Missouri, for a sample of 25 years from 1870 to 2004.

Rainfall (inches)

Number of Years

20 - 24

1

25 - 29

3

30 - 34

5

35 - 39

8

40 - 44

5

45 - 49

2

50 - 54

0

55 - 59

1

8) The following data represent the age distribution of a sample of 70 women having multiple-delivery births in 2002.

Age

Number

15 - 19

1

20 - 24

5

25 - 29

16

30 - 34

28

35 - 39

17

40 - 44

3

Published on September 17, 2020 by Pritha Bhandari. Revised on May 25, 2022.

The standard deviation is the average amount of variability in your dataset. It tells you, on average, how far each value lies from the mean.

A high standard deviation means that values are generally far from the mean, while a low standard deviation indicates that values are clustered close to the mean.

What does standard deviation tell you?

Standard deviation is a useful measure of spread for normal distributions.

In normal distributions, data is symmetrically distributed with no skew. Most values cluster around a central region, with values tapering off as they go further away from the center. The standard deviation tells you how spread out from the center of the distribution your data is on average.

Many scientific variables follow normal distributions, including height, standardized test scores, or job satisfaction ratings. When you have the standard deviations of different samples, you can compare their distributions using statistical tests to make inferences about the larger populations they came from.

Example: Comparing different standard deviationsYou collect data on job satisfaction ratings from three groups of employees using simple random sampling.

The mean (M) ratings are the same for each group – it’s the value on the x-axis when the curve is at its peak. However, their standard deviations (SD) differ from each other.

The standard deviation reflects the dispersion of the distribution. The curve with the lowest standard deviation has a high peak and a small spread, while the curve with the highest standard deviation is more flat and widespread.

The empirical rule

The standard deviation and the mean together can tell you where most of the values in your distribution lie if they follow a normal distribution.

The empirical rule, or the 68-95-99.7 rule, tells you where your values lie:

  • Around 68% of scores are within 1 standard deviation of the mean,
  • Around 95% of scores are within 2 standard deviations of the mean,
  • Around 99.7% of scores are within 3 standard deviations of the mean.
Example: Standard deviation in a normal distributionYou administer a memory recall test to a group of students. The data follows a normal distribution with a mean score of 50 and a standard deviation of 10.

Following the empirical rule:

  • Around 68% of scores are between 40 and 60.
  • Around 95% of scores are between 30 and 70.
  • Around 99.7% of scores are between 20 and 80.

Which of the following is the correct formula for the standard deviation of grouped data multiple choice question?

The empirical rule is a quick way to get an overview of your data and check for any outliers or extreme values that don’t follow this pattern.

For non-normal distributions, the standard deviation is a less reliable measure of variability and should be used in combination with other measures like the range or interquartile range.

Standard deviation formulas for populations and samples

Different formulas are used for calculating standard deviations depending on whether you have data from a whole population or a sample.

Population standard deviation

When you have collected data from every member of the population that you’re interested in, you can get an exact value for population standard deviation.

The population standard deviation formula looks like this:

FormulaExplanation
Which of the following is the correct formula for the standard deviation of grouped data multiple choice question?
  • Which of the following is the correct formula for the standard deviation of grouped data multiple choice question?
    = population standard deviation
  • Which of the following is the correct formula for the standard deviation of grouped data multiple choice question?
    = sum of…
  • Which of the following is the correct formula for the standard deviation of grouped data multiple choice question?
    = each value
  • Which of the following is the correct formula for the standard deviation of grouped data multiple choice question?
    = population mean
  • Which of the following is the correct formula for the standard deviation of grouped data multiple choice question?
    = number of values in the population

Sample standard deviation

When you collect data from a sample, the sample standard deviation is used to make estimates or inferences about the population standard deviation.

The sample standard deviation formula looks like this:

FormulaExplanation
Which of the following is the correct formula for the standard deviation of grouped data multiple choice question?
  • Which of the following is the correct formula for the standard deviation of grouped data multiple choice question?
    = sample standard deviation
  • Which of the following is the correct formula for the standard deviation of grouped data multiple choice question?
    = sum of…
  • Which of the following is the correct formula for the standard deviation of grouped data multiple choice question?
    = each value
  • Which of the following is the correct formula for the standard deviation of grouped data multiple choice question?
     = sample mean
  • Which of the following is the correct formula for the standard deviation of grouped data multiple choice question?
    = number of values in the sample

With samples, we use n – 1 in the formula because using n would give us a biased estimate that consistently underestimates variability. The sample standard deviation would tend to be lower than the real standard deviation of the population.

Reducing the sample n to n – 1 makes the standard deviation artificially large, giving you a conservative estimate of variability.

While this is not an unbiased estimate, it is a less biased estimate of standard deviation: it is better to overestimate rather than underestimate variability in samples.

Steps for calculating the standard deviation

The standard deviation is usually calculated automatically by whichever software you use for your statistical analysis. But you can also calculate it by hand to better understand how the formula works.

There are six main steps for finding the standard deviation by hand. We’ll use a small data set of 6 scores to walk through the steps.

Data set
46 69 32 60 52 41

Step 1: Find the mean

To find the mean, add up all the scores, then divide them by the number of scores.

Mean (x̅)
Which of the following is the correct formula for the standard deviation of grouped data multiple choice question?

Step 2: Find each score’s deviation from the mean

Subtract the mean from each score to get the deviations from the mean.

Since  = 50, here we take away 50 from each score.

ScoreDeviation from the mean
46 46 – 50 = -4
69 69 – 50 = 19
32 32 – 50 = -18
60 60 – 50 = 10
52 52 – 50 = 2
41 41 – 50 = -9

Step 3: Square each deviation from the mean

Multiply each deviation from the mean by itself. This will result in positive numbers.

Squared deviations from the mean
(-4)2 = 4 × 4 = 16
192 = 19 × 19 = 361
(-18)2 = -18 × -18 = 324
102 = 10 × 10 = 100
22 = 2 × 2 = 4
(-9)2 = -9 × -9 = 81

Step 4: Find the sum of squares

Add up all of the squared deviations. This is called the sum of squares.

Sum of squares
16 + 361 + 324 + 100 + 4 + 81 = 886

Step 5: Find the variance

Divide the sum of the squares by n – 1 (for a sample) or N (for a population) – this is the variance.

Since we’re working with a sample size of 6, we will use  n – 1, where n = 6.

Variance
Which of the following is the correct formula for the standard deviation of grouped data multiple choice question?

Step 6: Find the square root of the variance

To find the standard deviation, we take the square root of the variance.

Standard deviation
Which of the following is the correct formula for the standard deviation of grouped data multiple choice question?

From learning that SD = 13.31, we can say that each score deviates from the mean by 13.31 points on average.

Why is standard deviation a useful measure of variability?

Although there are simpler ways to calculate variability, the standard deviation formula weighs unevenly spread out samples more than evenly spread samples. A higher standard deviation tells you that the distribution is not only more spread out, but also more unevenly spread out.

This means it gives you a better idea of your data’s variability than simpler measures, such as the mean absolute deviation (MAD).

The MAD is similar to standard deviation but easier to calculate. First, you express each deviation from the mean in absolute values by converting them into positive numbers (for example, -3 becomes 3). Then, you calculate the mean of these absolute deviations.

Unlike the standard deviation, you don’t have to calculate squares or square roots of numbers for the MAD. However, for that reason, it gives you a less precise measure of variability.

Let’s take two samples with the same central tendency but different amounts of variability. Sample B is more variable than Sample A.

ValuesMeanMean absolute deviationStandard deviation
Sample A66, 30, 40, 64 50 15 17.8
Sample B51, 21, 79, 49 50 15 23.7

For samples with equal average deviations from the mean, the MAD can’t differentiate levels of spread. The standard deviation is more precise: it is higher for the sample with more variability in deviations from the mean.

By squaring the differences from the mean, standard deviation reflects uneven dispersion more accurately. This step weighs extreme deviations more heavily than small deviations.

However, this also makes the standard deviation sensitive to outliers.

Frequently asked questions about standard deviation

What is the empirical rule?

The empirical rule, or the 68-95-99.7 rule, tells you where most of the values lie in a normal distribution:

  • Around 68% of values are within 1 standard deviation of the mean.
  • Around 95% of values are within 2 standard deviations of the mean.
  • Around 99.7% of values are within 3 standard deviations of the mean.

The empirical rule is a quick way to get an overview of your data and check for any outliers or extreme values that don’t follow this pattern.

Sources in this article

We strongly encourage students to use sources in their work. You can cite our article (APA Style) or take a deep dive into the articles below.

This Scribbr article

Bhandari, P. (May 25, 2022). How to Calculate Standard Deviation (Guide) | Formulas & Examples. Scribbr. Retrieved October 4, 2022, from https://www.scribbr.com/statistics/standard-deviation/

Is this article helpful?

You have already voted. Thanks :-) Your vote is saved :-) Processing your vote...

What is the formula for the standard deviation for grouped data?

The standard deviation formula for grouped data is: σ² = (Σ(Fi * Mi2) - (n * μ2)) / (n - 1) , where σ² is the variance. To obtain the standard deviation, take the square root of the variance.

What is the formula of standard deviation in grouped and ungrouped data?

Here D=X–A and A are any assumed mean other than zero. ... Standard Deviation..

What is the formula used to find the mean of a grouped data?

To calculate the mean of grouped data, the first step is to determine the midpoint of each interval or class. These midpoints must then be multiplied by the frequencies of the corresponding classes. The sum of the products divided by the total number of values will be the value of the mean.

What is the standard deviation of a group of numbers?

Standard deviation is a measure of dispersion of data values from the mean. The formula for standard deviation is the square root of the sum of squared differences from the mean divided by the size of the data set.