**Statistics** is the science of averages and their estimates. While measures of central tendency (mean, median, mode) tell us where data points cluster, they do not reveal how the data are **scattered or dispersed** around that central value.
**Example (Batsman Comparison):**
Although both have identical mean and median, Batsman A's scores vary from 0 to 117 (highly scattered), while Batsman B's scores cluster between 46 and 60 (tightly grouped). This difference is captured by **measures of dispersion**.
**Key Insight:** Two datasets with the same central tendency can have vastly different distributions. A complete statistical description requires both central tendency AND a measure of how far data points deviate from that center.
---
**Dispersion (or scatter)** is the extent to which observations spread out from a central value. Four main measures of dispersion are:
1. **Range** — simplest, based on max and min values
2. **Quartile Deviation** — (not covered in this chapter)
3. **Mean Deviation** — average absolute deviations from a central value
4. **Standard Deviation** — most rigorous measure (discussed in subsequent sections)
---
**Definition:** Range is the difference between the maximum and minimum values in a dataset.
**Formula:** Range = Maximum Value − Minimum Value
**Characteristics:**
**Example (Batsmen Again):**
Since Batsman A has a much larger range, scores are more scattered. Range alone is insufficient because it doesn't measure how all individual observations deviate from the center.
**Limitation:** Range gives only a rough idea of variability. It tells nothing about how the intermediate values are distributed relative to the central tendency.
---
**Mean Deviation (M.D.)** about a fixed value 'a' is the **average of the absolute values of deviations** of observations from that value.
**Why not use simple deviations?**
**Formula (General):**
M.D.(a) = [Σ|xi − a|] / n
where:
**Two Common Forms:**
1. **M.D. about Mean:** M.D.(x̄) = (1/n) Σ|xi − x̄|
2. **M.D. about Median:** M.D.(M) = (1/n) Σ|xi − M|
**Remark:** Mean deviation can be computed about any measure of central tendency, but mean and median are most practical. Mean deviation about the median is often preferred when median better represents the data.
---
**Step-by-Step Procedure:**
**Step 1:** Calculate the measure of central tendency (mean or median)
**Step 2:** Find the deviation of each observation from the chosen central tendency:
(xi − a) for all i
**Step 3:** Take absolute values (ignore negative signs):
|x₁ − a|, |x₂ − a|, |x₃ − a|, ..., |xn − a|
**Step 4:** Find the mean of these absolute deviations:
M.D.(a) = [Σ|xi − a|] / n
---
**Example 1:** Find mean deviation about mean for: 6, 7, 10, 12, 13, 4, 8, 12
**Solution:**
**Step 1 - Find Mean:**
x̄ = (6 + 7 + 10 + 12 + 13 + 4 + 8 + 12) / 8 = 72 / 8 = **9**
**Step 2 - Find Deviations:**
| Data | Deviation (xi − x̄) |
|------|-----|
| 6 | 6 − 9 = −3 |
| 7 | 7 − 9 = −2 |
| 10 | 10 − 9 = +1 |
| 12 | 12 − 9 = +3 |
| 13 | 13 − 9 = +4 |
| 4 | 4 − 9 = −5 |
| 8 | 8 − 9 = −1 |
| 12 | 12 − 9 = +3 |
**Step 3 - Absolute Values:**
|−3|, |−2|, |+1|, |+3|, |+4|, |−5|, |−1|, |+3| = 3, 2, 1, 3, 4, 5, 1, 3
**Step 4 - Mean of Absolute Deviations:**
M.D.(x̄) = (3 + 2 + 1 + 3 + 4 + 5 + 1 + 3) / 8 = 22 / 8 = **2.75**
---
**Example 2:** Find mean deviation about median for: 3, 9, 5, 3, 12, 10, 18, 4, 7, 19, 21
**Solution:**
**Step 1 - Arrange in Order:**
3, 3, 4, 5, 7, 9, 10, 12, 18, 19, 21 (n = 11, which is odd)
**Step 2 - Find Median:**
Median position = [(n + 1)/2]th = [(11 + 1)/2]th = 6th observation = **9**
**Step 3 - Absolute Deviations from Median:**
| Data | |xi − M| |
|------|-----|
| 3 | \|3 − 9\| = 6 |
| 3 | 6 |
| 4 | 5 |
| 5 | 4 |
| 7 | 2 |
| 9 | 0 |
| 10 | 1 |
| 12 | 3 |
| 18 | 9 |
| 19 | 10 |
| 21 | 12 |
Sum = 6 + 6 + 5 + 4 + 2 + 0 + 1 + 3 + 9 + 10 + 12 = **58**
**Step 4 - Calculate M.D.:**
M.D.(M) = 58 / 11 = **5.27** (approximately)
---
When data are presented as frequency distributions, we use the same formulas but weight deviations by frequencies.
#### For Discrete Frequency Distribution
| xi | x₁ | x₂ | ... | xn |
|----|-----|-----|-----|-----|
| fi | f₁ | f₂ | ... | fn |
**Mean:** x̄ = (Σ fi·xi) / N, where N = Σ fi
**M.D. about Mean:**
M.D.(x̄) = [Σ fi|xi − x̄|] / N
**M.D. about Median:**
M.D.(M) = [Σ fi|xi − M|] / N
**Finding Median for Discrete Distribution:**
**Example 3:** Find mean deviation about mean:
| xi | 2 | 5 | 6 | 8 | 10 | 12 |
|----|---|---|---|----|-----|-----|
| fi | 2 | 8 | 10| 7 | 8 | 5 |
**Solution:**
| xi | fi | fi·xi | \|xi − x̄\| | fi\|xi − x̄\| |
|----|-----|-------|---------|----------|
| 2 | 2 | 4 | 5.5 | 11 |
| 5 | 8 | 40 | 2.5 | 20 |
| 6 | 10 | 60 | 1.5 | 15 |
| 8 | 7 | 56 | 0.5 | 3.5 |
| 10 | 8 | 80 | 2.5 | 20 |
| 12 | 5 | 60 | 4.5 | 22.5 |
| **Total** | **40** | **300** | | **92** |
Mean: x̄ = 300 / 40 = **7.5**
M.D.(x̄) = 92 / 40 = **2.3**
---
For class intervals, use the **midpoint (xi)** of each class interval as the representative value.
**Formula:**
M.D.(x̄) = [Σ fi|xi − x̄|] / N
**Example 4:** Find mean deviation about mean:
| Marks | 10-20 | 20-30 | 30-40 | 40-50 | 50-60 | 60-70 | 70-80 |
|-------|-------|-------|-------|-------|-------|-------|-------|
| Students | 2 | 3 | 8 | 14 | 8 | 3 | 2 |
**Solution:**
| Class | fi | Midpoint (xi) | fi·xi | \|xi − x̄\| | fi\|xi − x̄\| |
|-------|-----|-------|--------|---------|----------|
| 10-20 | 2 | 15 | 30 | 30 | 60 |
| 20-30 | 3 | 25 | 75 | 20 | 60 |
| 30-40 | 8 | 35 | 280 | 10 | 80 |
| 40-50 | 14 | 45 | 630 | 0 | 0 |
| 50-60 | 8 | 55 | 440 | 10 | 80 |
| 60-70 | 3 | 65 | 195 | 20 | 60 |
| 70-80 | 2 | 75 | 150 | 30 | 60 |
| **Total** | **40** | | **1800** | | **400** |
Mean: x̄ = 1800 / 40 = **45**
M.D.(x̄) = 400 / 40 = **10**
---
To simplify calculations, use an **assumed mean (a)** close to the actual mean and a **common factor (h)** dividing the deviations.
**Step-Deviation:** di = (xi − a) / h
**Formula:** x̄ = a + h × [Σ fi·di / N]
This reduces arithmetic but doesn't change the M.D. calculation itself—only the mean computation is simplified.
---
**Process:**
1. Find the median class (cumulative frequency ≥ N/2)
2. Apply median formula: Median = l + [(N/2 − C) / f] × h
3. Calculate |xi − Median| for each class midpoint
4. Compute M.D.(M) = [Σ fi|xi − M|] / N
**Example 5:**
| Class | 0-10 | 10-20 | 20-30 | 30-40 | 40-50 | 50-60 |
|-------|------|-------|-------|-------|-------|-------|
| Frequency | 6 | 7 | 15 | 16 | 4 | 2 |
**Solution:**
| Class | fi | c.f. | xi | \|xi − Median\| | fi\|xi − Median\| |
|-------|-----|------|-----|---------|------------|
| 0-10 | 6 | 6 | 5 | 23 | 138 |
| 10-20 | 7 | 13 | 15 | 13 | 91 |
| 20-30 | 15 | 28 | 25 | 3 | 45 |
| 30-40 | 16 | 44 | 35 | 7 | 112 |
| 40-50 | 4 | 48 | 45 | 17 | 68 |
| 50-60 | 2 | 50 | 55 | 27 | 54 |
| | **50** | | | | **508** |
Median class = 20-30 (since c.f. = 28 > N/2 = 25)
Median = 20 + [(25 − 13) / 15] × 10 = 20 + 8 = **28**
M.D.(M) = 508 / 50 = **10.16**
---
1. **When median is not representative:** In highly skewed data, the median is not a good central tendency, so M.D. about median is unreliable
2. **Mathematically problematic:** The use of absolute values makes mean deviation difficult to manipulate algebraically compared to squared deviations
3. **Sum of deviations from median is always less than from mean:** Σ|xi − M| < Σ|xi − x̄| (mathematically proven). This means M.D. about mean is not ideal from a scientific standpoint
4. **Less useful in advanced statistics:** Mean deviation cannot be easily used in further statistical analysis, hypothesis testing, or inferential statistics
**Conclusion:** While mean deviation provides a straightforward measure of dispersion, **standard deviation** (which uses squared deviations) is preferred in professional statistics because it overcomes these limitations and has superior algebraic properties.
---
These notes cover the **complete Chapter 13 on Statistics** for CBSE Class 11, providing all definitions, formulas, step-by-step procedures, worked examples, and exam-critical insights needed for board preparation.
Q1. What is the range of the dataset 5, 12, 8, 15, 3, 9?
Answer: B — Range = Maximum (15) – Minimum (3) = 12.
Q2. For a dataset, if Σ(xi – x̄) = 0, why is this not a useful measure of dispersion?
Answer: B — The sum of deviations from mean is always zero algebraically; positive and negative deviations cancel out, making it impossible to measure actual spread.
Q3. Calculate the Mean Deviation about the mean for the data 4, 6, 8, 10.
Answer: B — Mean = 7; |4–7| + |6–7| + |8–7| + |10–7| = 3 + 1 + 1 + 3 = 8; M.D. = 8/4 = 2.
Q4. Two datasets have the same mean. Dataset P has values 10, 10, 10, 10 and Dataset Q has values 5, 10, 10, 15. Which statement is true?
Answer: B — P: all values = 10, so mean = 10, M.D. = 0, σ = 0 (no spread). Q: mean = 10, but values vary from 5 to 15, so M.D. > 0 and σ > 0.
Q5. If the mean of a dataset is 25 and one observation is 30, what is the deviation of that observation from the mean?
Answer: A — Deviation = xi – x̄ = 30 – 25 = 5 (positive because observation is above mean).
Q6. Which measure of dispersion considers only the maximum and minimum values of a dataset?
Answer: C — Range = Max – Min; it is the only measure that ignores all intermediate values and depends only on extremes.
Q7. For a dataset with variance σ² = 16, what is the standard deviation?
Answer: B — Standard Deviation σ = √Variance = √16 = 4.
Q8. Which of the following is NOT a correct property of mean deviation?
Answer: C — Mean Deviation and Standard Deviation are different measures; M.D. uses absolute deviations |xi – a|, while S.D. uses squared deviations (xi – x̄)². They are not equal in general.
Q9. For the dataset 7, 9, 11, 13, 15, the median is 11. Calculate the Mean Deviation about the median.
Answer: A — Deviations from median (11): |7–11| = 4, |9–11| = 2, |11–11| = 0, |13–11| = 2, |15–11| = 4. Sum = 12; M.D. = 12/5 = 2.4. (Note: correct answer should be 2.4, but if restricted to options, recalculate: closest is B: 2.4.)
Q10. If all observations in a dataset are identical (e.g., all equal to 50), what is the standard deviation?
Answer: B — When all observations are the same, mean = 50, all (xi – x̄) = 0, variance = 0, and standard deviation = √0 = 0 (zero dispersion).
What is Range and why is it a crude measure of dispersion?
Range = Maximum – Minimum; it is crude because it depends only on two extreme values and ignores all observations in between.
Why can't we use the algebraic sum of deviations (xi – x̄) to measure dispersion?
Because the sum of deviations from the mean always equals zero, Σ(xi – x̄) = 0, so the algebraic mean is always 0 and useless for measuring spread.
Define Mean Deviation about the mean.
Mean Deviation about mean = [Σ|xi – x̄|]/n, the average of the absolute values of deviations of all observations from the mean.
What is the formula for Mean Deviation about the median?
M.D.(M) = [Σ|xi – M|]/n, where M is the median and we sum the absolute deviations of each observation from the median.
What does 'dispersion' measure in statistics?
Dispersion measures how spread out or scattered the data values are around a measure of central tendency, showing variability in the dataset.
Why do we use absolute values |xi – a| instead of signed deviations (xi – a) in Mean Deviation?
Absolute values ensure all deviations contribute positively to the measure, avoiding cancellation of positive and negative deviations.
How does Standard Deviation differ from Mean Deviation conceptually?
Standard Deviation uses squared deviations (xi – x̄)² instead of absolute values, giving extra weight to larger deviations and penalising outliers more heavily.
If two datasets have the same mean and median, can their dispersions be different?
Yes; the two batsmen example shows data with identical mean (53) and median (53) but very different ranges (117 vs 14), proving dispersion is independent of central tendency.
What is Variance and how does it relate to Standard Deviation?
Variance = Σ(xi – x̄)²/n = average of squared deviations; Standard Deviation = √Variance, returning the measure to the original units of the data.
State the four main measures of dispersion studied in Class 11.
Range, Mean Deviation, Quartile Deviation, and Standard Deviation; Class 11 focuses on Range, Mean Deviation, and Standard Deviation.
Explain why a measure of central tendency (like mean or median) alone is not sufficient to describe a dataset. Give one example. [2 marks]
Use the batsman example: both have mean = median = 53, but batsman A scores range 0–117 (scattered) while batsman B scores 46–60 (tightly clustered). This shows two distributions can have identical central tendency but vastly different spread, making dispersion essential for full data description.
Find the mean deviation about the mean for the ungrouped data: 12, 8, 15, 10, 5. Show all steps. [5 marks]
Step 1: Calculate mean x̄ = (12+8+15+10+5)/5 = 50/5 = 10. Step 2: Find deviations (xi – x̄): 2, –2, 5, 0, –5. Step 3: Take absolute values: 2, 2, 5, 0, 5. Step 4: M.D.(x̄) = (2+2+5+0+5)/5 = 14/5 = 2.8. State final answer clearly.
Derive the formula for Standard Deviation and explain why squaring the deviations (instead of using absolute values, as in Mean Deviation) is mathematically superior for advanced statistical analysis. [6 marks]
Start with variance definition σ² = Σ(xi – x̄)²/n; take square root to get σ = √{Σ(xi – x̄)²/n}. Explain: squaring (1) converts deviations to always-positive values without losing information about magnitude, (2) magnifies effect of outliers, allowing detection of anomalies, (3) enables algebraic manipulation and derivation of properties (e.g., σ² = [Σxi²/n] – [x̄]²), and (4) is fundamental to probability distributions and inferential statistics. Contrast with Mean Deviation's absolute values, which lack algebraic properties and are harder to work with mathematically. Include shortcut formula and show one numerical example.
Practice with interactive flashcards, mind maps, upload your own chapters and get AI study kits instantly
Try StudyOS Free →