Bio Statistics
Bio Statistics
What Is Statistics:Introduction 1
List ways that statistics is
used.
Know the differences
between descriptive and
inferential statistics.
Understand the differences
between a sample and a
population.
Explain the difference
between qualitative and
quantitative variables.
Probability 4
Analysis of Variance: 8
Explain the concept of central
tendency.
Identify and compute the arithmetic
mean.
Determine the median.
Identify the mode.
Explain and apply measures of
dispersion.
Compute and explain the variance and the
standard deviation.
Compute the mean and standard deviation of
grouped data.
1
Learning Objectives
LO1: List ways that statistics is used.
LO2: Know the differences between descriptive
and Inferential statistics.
LO3: Understand the differences between a
sample and a population.
LO4 Explain the difference between qualitative
and quantitative variables.
1.1. What is meaning by Statistics?
If you look through your university Catalog, you will find that
statistics is required for many college programs. Why is this
so? What are the differences in the statistics courses taught in
the Bio-statistics College, Engineering College, the Psychology
or Sociology Departments in the Liberal Arts College, and the
College of Business? The biggest difference is the examples
used. The course content is basically the same.
Example 1:
The grades of students in biostatistics are below:
C, C, D, A, C, C, C, C, C, C, C, A, A, D, C, C, B, B, C, C, D,
D, A, B, B, B, C, C, D, D, C, B, B, D, D, C, C, D, A, C, C,
C, C, B, C C, A, D, D, B, B, B, C, B, B, C, B, B, A, D
Construct a frequency distribution table for the above
categorical data and comment.
Solution: Table frequency distribution of shirts sold by
the departmental shore
GRADE Frequency (# of
grade)
B 15
C 26
D 12
A 7
Total 60
Example 2:
Construct relative and percent frequency distribution table
for the frequency distribution gives Example(1)
GRADE Frequency (# of
grade)
B 15
C 26
D 12
A 7
Total 60
Solution
Bar Charts
Type one Simple bar diagram
Simple bar diagram is most popular diagrammatical
representation of qualitative data. By simple bar diagram,
only one qualitative variable can be exhibited.
Firs a frequency distribution table of qualitative data is
constructed.
Then a bar of a fixed width above each category is
drowning.
Actually bars are looked like rectangles of equal width
over classes.
The height of bar over each class is equal to the class
frequency .
Example: 3
a. Construct a bar diagram with The frequency distribution.
b. Construct a bar diagram with The precentege frequency
distribution.
Grade Frequency (# number of
shirts)
B 15
C 26
D 12
A 7
Total 60
Solution
a. The bar diagram of the above data .
Frequency (# of shirts)
30
25
20
15
Frequency (# of shirts)
10
0
small medium large Extra large
b. Percent bar
Percent
50
45
40 43
35
30
25
25 Percent
20
15 20
10 12
5
0
Small Mediam Large Extra-large
Example 4
Small 40 50 60 80 100
Extra-Large 20 40 40 60 90
Solution
small 15
medium 26
large 12
Extra large 7
Total 60
Solution
small 15 0.25 25 90
large 12 0.2 20 72
large
20%
medium
43%
pie giagram
2. Continuous data.
Example:6
Number Frequency
of
children:
X
0 1
1 5
2 4
3 6
4 16
5 9
6 4
Total 45
0 1 0.02 2
1 5 0.11 11
2 4 0.09 9
3 6 0.13 13
4 16 0.36 36
5 9 0.20 20
6 4 0.09 9
Total 45 1 100
𝑟𝑎𝑛𝑔𝑒
3. 𝑊𝑖𝑑𝑡ℎ 𝑜𝑓 𝑡ℎ𝑒 𝑐𝑙𝑎𝑠𝑠 =
𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑐𝑙𝑎𝑠𝑠
b) Inclusive method
Frequency Distribution
Example:
The following table gives the hemoglobin level (g/dl) of a
sample of 50 men.
We wish to summarize these data using the following
class
From frequencies: The number of people whose
hemoglobin levels are between 17.0 and 17.9 = 10
From cumulative frequencies: The number of people
whose hemoglobin levels are less than or equal to 15.9 =
23 The number of people whose hemoglobin levels are
less than or equal to 17.9 = 49
From percentage frequencies: The percentage of
people whose hemoglobin levels are between 17.0 and
17.9 = 20%
From cumulative percentage frequencies: The
percentage of people whose hemoglobin levels are less
than or equal to 14.9 = 16% The percentage of people
whose hemoglobin levels are less than or equal to 16.9 =
78%
Example 7
The following data relate to the audit-time of 20 clients:
10, 15, 20, 28, 13, 18, 24, 29, 12, 16,
23, 34, 14, 17, 22, 17, 21, 16, 18, 19
Solution
Histograms
Frequency polygons
Histogram
Assignment One
G, P, A, P, P, A, A, A, A, A, A, A, A, A, A,
A, P, G, G, P, A, A, P, G, P, A, A, P, G, A,
A, G, P, A, A, A, A, G, P, P, A, A, G, P, P,
P, A, G, A, A, A, A, P, P, G
Learning Objectives
When you have completed
this chapter, you will be
able to:
LO1 Explain the concept of central tendency.
LO2 Identify and compute the arithmetic mean.
LO3 Determine the median.
LO4 Identify the mode.
LO5 Explain and apply measures of dispersion.
LO6 Compute and explain the variance and the standard
deviation.
LO7 Compute the mean and standard deviation of grouped
data.
3.1. Introduction to the concept of central
tendency
Chapter 2 began our study of descriptive statistics. To summarize
raw data into a meaningful form, we organized qualitative data into
a frequency table and portrayed the results in a bar chart. In a
similar fashion, we organized quantitative data into a frequency
distribution and portrayed the results in a histogram. We also
looked at other graphical techniques such as pie charts to portray
qualitative data and frequency polygons to portray quantitative
data.
This chapter is concerned with two numerical ways of describing
quantitative variables, namely, measures of location and
measures of dispersion. Measures of location are often referred
to as averages. The purpose of a measure of location is to pinpoint
the centre of a distribution of data. An average is a measure of
location that shows the central value of the data. Averages appear
daily on TV, on various websites, in the newspaper, and in other
journals. Here are some examples:
The average U.S. home changes ownership every 11.8 years.
An American receives an average of 568 pieces of mail per
year.
The average American home has more TV sets than people.
There are 2.73 TV sets and 2.55 people in the typical home.
We begin by discussing measures of location. There is not
just one measure of location; in fact, there are many. We
will consider three: the arithmetic mean, the median,
and the mode, . The arithmetic mean is the most widely
used and widely reported measure of location. We study the
mean as both a population parameter and a sample statistic.
3.2. Arithmetic mean: of the Population
For raw data—that is, data that have not been grouped in a
frequency distribution— the population mean is the sum of
all the values in the population divided by the number of
values in the population. To find the population mean, we
use the following formula
𝐒𝐮𝐦 𝐨𝐟 𝐚𝐥𝐥 𝐭𝐡𝐞 𝐯𝐚𝐥𝐮𝐞𝐬 𝐢𝐧 𝐭𝐡𝐞 𝐩𝐨𝐩𝐮𝐥𝐚𝐭𝐢𝐨𝐧
𝐩𝐨𝐩𝐮𝐥𝐚𝐭𝐢𝐨𝐧 𝐦𝐞𝐚𝐧 =
𝐍𝐮𝐦𝐛𝐞𝐫 𝐨𝐟 𝐯𝐚𝐥𝐮𝐞𝐬 𝐢𝐧 𝐭𝐡𝐞 𝐩𝐨𝐩𝐮𝐥𝐚𝐭𝐢𝐨𝐧
Instead of writing out in words the full directions for
computing the population mean (or any other measure), it is
more convenient to use the shorthand symbols of
mathematics. The mean of the population using
mathematical symbols is:
where:
𝝁 = Represents the population mean. It is the Greek lowercase
letter “mu.”
𝑵 = is the number of values in the population.
𝑿 = represents any particular value.
𝜮 = is the Greek capital letter “sigma” and indicates the
operation of adding.
𝜮𝑿 = is the sum of the X values in the population.
Any measurable characteristic of a population is called a
parameter. The mean of a population is an example of
a parameter.
The Sample Mean
For raw data—that is, ungrouped data—the mean is the
sum of all the sampled values divided by the total number
of sampled values. To find the mean for a sample:
∑( 𝑥 − 𝑥̅ )
MEDIAN: The midpoint of the values after they have
been ordered from the smallest to the largest, or the
largest to the smallest.
3.4. Mode
Where:
Solution
Sample Variance The formula for the population mean We
just changed the symbols for the sample mean
themconversion from the population variance to the sample
variance is not as direct. It requires a change in the
denominator. Instead of substituting n (number in the
sample) for N (number in the population), the denominator
is n - 1. Thus the formula for the sample variance is:
Where:
𝑠 2 is the sample variance.
𝑥 is the value of each observation in the sample.
𝑥̅ is the mean of the sample.
𝑛 is the number of observations in the sample.
Example
The hourly wages for a sample of part-time employees at
Home Depot are: $12, $20, $16, $18, and $19. What is the
sample variance?
Solution
Sample Standard Deviation The sample standard
deviation is used as an estimator of the population
standard deviation. As noted previously, the population
standard deviation is the square root of the population
variance. Likewise, the sample standard deviation is the
square root of the sample variance. The sample standard
deviation is most easily determined by:
3.7. The Mean and Standard Deviation
of Grouped Data
Arithmetic mean of grouped data: To approximate
the arithmetic mean of data organized into a frequency
distribution, we begin by assuming the observations in
each class are represented by the midpoint of the class.
The mean of a sample of data organized in a frequency
distribution is computed by:
Where:
̅ = The designation for the sample mean.
𝒙
𝑴 = The midpoint of each class
𝒇 = The frequency in each class
𝒇𝑴 = The frequency in each class times the midpoint of
the class.
∑ 𝒇𝑴 = The sum of these products
𝒏 = The total number of frequencies
Standard Deviation: To calculate the standard
deviation of data grouped into a frequency distribution,
we use this formula
Where :
𝒔 = The symbol for the sample standard deviation
𝑴= is the midpoint of the class
𝒇 = Is the class frequency
𝒏= is the number of observations in the sample.
̅ = The designation for the sample mean.
𝑿
Example
Determine the sample mean, sample variance and the sample
standard deviation of the following frequency distribution.
Solution
class frequency midpoint FM
0 to 5 2 2.5 5
5 to 10 7 7.5 52.5
10 to 15 12 12.5 150
15 to 20 6 17.5 105
20 to 25 3 22.5 67.5
Total 30 380
∑ 𝑓𝑀 380
̅=
𝒎𝒆𝒂𝒏 = 𝒙 = = 12.6
𝑛 30
class 𝒇 𝑴 FM ̅
𝒎−𝒙 ̅ )𝟐
(𝑴 − 𝑿 ̅ )𝟐
𝒇(𝑴 − 𝑿
0 to 5 2 2.5 5 -10.17 103.4289 517.1445
5 to 10 7 7.5 52.5 -5.17 26.7289 1403.267
10 to 15 12 12.5 150 -0.17 0.0289 4.335
15 to 20 6 17.5 105 4.83 23.3289 2449.535
20 to 25 3 22.5 67.5 9.83 96.6289 6522.451
Total 30 380 10896.73
̅ )𝟐
∑ 𝒇(𝑴 − 𝑿 𝟏𝟎𝟖𝟗𝟔. 𝟕𝟑
𝒔𝒂𝒎𝒑𝒍𝒆 𝒗𝒂𝒓𝒊𝒂𝒏𝒄𝒆 = 𝒔𝟐 = = = 𝟑𝟕𝟓. 𝟕𝟓
∑𝒇 − 𝟏 𝟐𝟗
̅ )𝟐
∑ 𝒇(𝑴− 𝑿
𝐬𝐚𝐦𝐩𝐥𝐞 𝐬𝐭𝐚𝐧𝐝𝐚𝐫𝐝 𝐝𝐞𝐯𝐢𝐚𝐭𝐢𝐨𝐧 = 𝐬 = √ ∑ 𝒇−𝟏
= √𝟑𝟕𝟓. 𝟕𝟓 = 19.38427
4 Probability - Two Events
10.1 Recap: Basic Probability for One Event
In this section we revise the use of probabilities for single events, remembering that:
Example 1
A tube of sweets contains 10 red sweets, 7 blue sweets, 8 green sweets and 5 orange
sweets. If a sweet is taken at random from the tube, what is the probability that it is:
(a) red,
(b) orange,
(c) green or red,
(d) not blue ?
Solution
166
MEP Y8 Practice Book A
(c) There are 8 green sweets and 10 red sweets in the tube, so
8 + 10
p (green or red) =
30
18
=
30
3
=
5
(d) There are 23 sweets that are not blue in the tube, so
23
p ( not blue) =
30
Example 2
Nine balls, each marked with a number from 1 to 9, are placed in a bag and one
ball is taken out at random. What is the probability that the number on the ball is:
(a) odd,
(b) a multiple of 3,
(c) a 5,
(d) not a 7 ?
Solution
There are 9 possible outcomes in each case.
(a) There are 5 possible odd numbers, so
5
p (odd ) =
9
167
04.1 MEP Y8 Practice Book A
Exercises
1. There are 16 girls and 8 boys in the tennis club. One of these is chosen at
random to enter a competition. What is the probability that a girl is chosen?
2. A bag contains 8 blue balls, 7 green balls and 5 red balls. A ball is taken at
random from the bag. What is the probability that the ball is:
(a) red,
(b) blue,
(c) green,
(d) yellow?
4. If you roll a fair dice, what is the probability that the number you get is:
(a) 5
(b) an odd number,
(c) a number greater than 1,
(d) a multiple of 4 ?
5. Ishmail writes a computer program that produces at random one of the digits
0, 1, 2, 3, 4, 5, 6, 7, 8, 9.
What is the probability that the program produces:
(a) an even number,
(b) a multiple of 4,
(c) a number less than 7,
(d) a multiple of 5 ?
6. The police line up 10 people in an identity parade; only one of the people is
the criminal. A witness does not recognise the criminal and so chooses a
person at random. What is the probability that:
(a) the criminal is chosen,
(b) the criminal is not chosen ?
168
MEP Y8 Practice Book A
7. There are 18 boys and 17 girls in a class. One of these pupils is selected at
random to represent the class. What is the probability that the pupil selected
is a girl?
8. In Hannah's purse there are three £1 coins, five 10p coins and eight 2p coins.
If she takes a coin at random from her purse, what is the probability that it is:
(a) a £1 coin,
(b) a 2p coin,
(c) not a £1 coin,
(d) a £1 coin or a 10p coin ?
9. Some of the children in a class write down the first letter of their surname on
a card; these cards are shown below:
W M G S J
S E S A
W H H E
M T S I
(a) One of these cards is taken at random. What is the probability that the
letter on it is:
(i) W,
(ii) S or T,
(iii) J or M,
(iv) not H
(v) a vowel ?
(b) Which letter is the most likely to be chosen?
10. Rachel buys a new CD, on which is her favourite track, 8 other tracks she
likes and 2 tracks that she does not like. She sets her CD player to play at
random. What is the probability that the first track it plays is:
(a) Rachel's favourite,
(b) a track that she likes,
(c) a track that she does not like ?
169
MEP Y8 Practice Book A
Example 1
Caitlin and Dave each buy a chocolate bar from a vending machine that sells Aero,
Bounty, Crunchie and Dime bars.
List the possible pairs of bars which Caitlin and Dave can choose.
Solution
Caitlin Dave
A A
A B
A C
A D
B A
B B
B C
B D A = Aero
B = Bounty
C A
C = Crunchie
C B
D = Dime
C C
C D
D A
D B
D C
D D
170
MEP Y8 Practice Book A
Example 2
A fair dice is rolled and an unbiased coin is tossed. Draw a table to show the
possible outcomes.
Solution
Possibilities DICE
for coin Possibilities
1 2 3 4 5 6
for dice
H H1 H2 H3 H4 H5 H6
COIN
T T1 T2 T3 T4 T5 T6
Example 3
Draw a table to show all the possible total scores when two fair dice are thrown at
the same time.
Solution
DICE B
1 2 3 4 5 6
The table shows that there are 36
1 2 3 4 5 6 7
possible outcomes, and gives the
D 2 3 4 5 6 7 8 total score for each outcome.
I
C 3 4 5 6 7 8 9
From the table it can be seen that
E
4 5 6 7 8 9 10 there are 6 outcomes that give a
A 5 6 7 8 9 10 11 score of 7.
6 7 8 9 10 11 12
Example 4
Use a tree diagram to show the possible outcomes when two unbiased coins are
tossed.
OUTCOMES
Coin B
H HH
Solution
Coin A H
The diagram shows that there
are 4 possible outcomes. T HT
H TH
T
T TT
171
04.2 MEP Y8 Practice Book A
Example 5
In a drawer there are some white socks and some black socks. Tim takes out one
sock and then a second. Draw a tree diagram to show the possible outcomes.
OUTCOMES
Second Sock
Solution B BB
First Sock
There are four possible B
outcomes, of which two will
W BW
will produce two socks of the
same colour. B WB
W
W WW
Exercises
1. Copy and complete the table to show COIN B
all possible outcomes when 2 fair
coins are tossed. H T
H
COIN A
T
SPINNER B
1 2 3 4
S 1
P
I 2
N A
N 3
E
R 4
172
MEP Y8 Practice Book A
5. In a bag there are red and blue counters. Two counters are taken out of the
bag at random.
(a) Copy and complete the tree diagram below, to show all outcomes:
OUTCOMES
2nd Counter
RR
1st Counter
R
6. (a) Draw a tree diagram to show all possible outcomes when two
unbiased coins are tossed.
(b) Extend your tree diagram to show the possible outcomes when three
unbiased coins are tossed.
(c) How many outcomes are there when three unbiased coins are tossed?
(d) How many outcomes are there when four unbiased coins are tossed?
7. In a jar there are three different types of sweets, eclairs, mints and toffees;
two sweets are taken at random.
(a) Draw a tree diagram to show the possible outcomes.
(b) How many of the outcomes include a toffee?
(c) How many of the outcomes include a mint and a toffee?
173
04.2 MEP Y8 Practice Book A
8. A red dice, a blue dice and a green dice are put into a bag; all the dice are
fair. One is then taken out and rolled. The colour of the dice and the score
shown are recorded.
(a) How many possible outcomes are there?
(b) How many outcomes include a 5 ?
9. In a game, two fair dice are rolled and the scores are multiplied together.
(a) Draw a table to show the possible outcomes and their scores.
(b) How many ways are there of scoring 12 ?
(c) How many ways are there of scoring 18 ?
10. A bag contains a mixture of red, green and white balls. Three balls are
taken at random from the bag.
(a) Write down all possible outcomes.
(b) How many outcomes include a red ball?
(c) How many outcomes include a red or a white ball?
(d) How many outcomes include a red and a green ball?
Example 1
Look at the list of chocolate bars which can be chosen by Caitlin and Dave in
Example 1 of section 10.2. What is the probability that they both choose the
same type of chocolate bar?
Solution
There are 16 different outcomes and all are equally likely.
In 4 of these outcomes both Caitlin and Dave choose the same type of bar.
So
4 1
p (same type) = or
16 4
174
MEP Y8 Practice Book A
Example 2
When two unbiased coins are tossed, determine the probability of obtaining:
(a) two heads,
(b) two tails,
(c) a head and a tail.
Solution
The table shows the possible outcomes:
H T
In this situation there are 4 outcomes that are
equally likely. H HH HH
T TH TH
(a) Here 1 of the 4 outcomes gives 2 heads, so
1
p (2 heads) =
4
Example 3
Two fair dice are rolled at the same time. What is the probability that the total
score is:
(a) 6,
(b) greater than 9,
(c) less than 7 ?
Solution 1 2 3 4 5 6
The table show the possible outcomes. 1 2 3 4 5 6 7
There are 36 equally likely scores. 2 3 4 5 6 7 8
3 4 5 6 7 8 9
(a) There are 5 outcomes that give a
score of 6, so 4 5 6 7 8 9 10
5 6 7 8 9 10 11
5
p (6) = 6 7 8 9 10 11 12
36
175
04.3 MEP Y8 Practice Book A
Exercises
1. Use information from the table in Example 3 to answer this question:
When two fair dice are thrown, what is the probability that the total score is:
(a) 9, (b) an odd number,
(c) greater than 10, (d) less than 8 ?
176
MEP Y8 Practice Book A
5. The diagram shows two spinners that are spun at the same time:
1 2 0 1
3 4 –1 2
7. Two unbiased coins are tossed at the same time. What is the probability of
obtaining:
(a) at least one head,
(b) no heads ?
8. Three unbiased coins are tossed at the same time. Use a tree diagram to show
the outcomes and determine the probability of obtaining:
(a) 3 heads,
(b) at least 1 head,
(c) at least 2 heads.
9. Two fair dice are rolled and the scores on each dice are multiplied together to
give a total score. What is the probability of getting a total score:
(a) of 12,
(b) of 20,
(c) greater than 25,
(d) less than 30,
(e) that is an even number ?
10. If 4 unbiased coins are tossed at the same time, what is the probability of
obtaining the same number of heads as tails?
177
MEP Y8 Practice Book A
p (A) A
p ( B)
B A B p ( A) × p ( B)
p ( B) p (A) A B A p ( B) × p ( A)
B
p ( B)
B B B p ( B) × p ( B)
Note: Here we have an experiment with two possible outcomes, A and B, and the
experiment is repeated once. It is assumed that the probability of either A
or B remains the same when the experiment is repeated; in this case, we say
that A and B are independent events.
Example 1
Two fair dice are rolled. Use a tree diagram to determine the probability of
obtaining:
(a) 2 sixes, (b) 1 six, (c) no sixes.
Solution
The tree diagram is shown below:
OUTCOMES PROBABILITIES
1 1 1 1
6 6, 6 × =
6 6 6 36
1 6 5
6 6 1 5 5
NOT 6 6, NOT 6 × =
6 6 36
5 1
6 NOT 6, 6 5 1 5
6 6 × =
NOT 6 6 36
6 5
6 5 5 25
NOT 6 NOT 6, NOT 6 × =
6 6 36
1 36
(a) p (2 sixes) = total =
36
=1
36
Note that these probabilities
5 5 10 5
(b) p (1 six) = + = = add up to 1. This will always
36 36 36 18 be so when the probabilities
25 are added from the outcome
(c) p ( no sixes) = of the tree diagram.
36 This is a very useful means
of checking your working.
178
MEP Y8 Practice Book A
Example 2
A bag contains 4 red balls and 3 green balls. A ball is taken out at random, and
then put back; a second ball is then taken from the bag. What is the probability
that:
(a) both balls are the same colour,
(b) at least one of the balls is green,
(c) the balls are of different colours?
Solution
Use a tree diagram:
2nd Ball OUTCOMES PROBABILITIES
4 4 4 16
1st Ball 7 R R R × =
7 7 49
4 R 3
7 7 4 3 12
G R G × =
7 7 49
3 4
R G R 3 4 12
7 7 × =
7 7 49
G 3
7 3 3 9
G G G × =
7 7 49
49
total = =1
49
16 9
= +
49 49
25
=
49
= p ( R G ) + p (G R )
12 12
= +
49 49
24
=
49
Note: In probability questions of this type, 'or' means adding the probabilities.
Example 3
On her way to work, Sylvia drives through three sets of traffic lights. The
probability of each set of lights being green is 0.3. What is the probability that
they are all green?
Solution
p (all green) = p (1st green and 2nd green and 3rd green )
Remember A tree diagram is drawn when it will help you to analyse a problem;
so if it will help, draw one. On the other hand, if you are able to solve a problem
without one (see Example 3 above), then do so.
Example 4
A P
B
Q R
C
D
The diagram shows a model railway track. At each of the junctions P, Q and R,
2
the probability of a train going straight ahead is and the probability of it
3
1
branching to the right is .
3
180
MEP Y8 Practice Book A
Solution
1 2 2
(a) p ( right and straight and straight ) = × ×
3 3 3
4
=
27
Exercises
1. A bag contains 3 red balls and 2 blue balls. A ball is taken at random from
the bag and then put back. A second ball is then taken out of the bag.
What is the probability that:
(a) both balls are red,
(b) both balls are the same colour,
(c) at least one of the balls is red ?
2. Repeat question 1 for a bag with 7 red balls and 3 blue balls.
3. Two fair dice are rolled at the same time. Use a tree diagram to determine
the probability of obtaining:
(a) two even numbers,
(b) at least one even number,
(c) no even numbers.
4. Two fair dice are rolled at the same time. Use a tree diagram to determine
the probability of obtaining:
(a) two multiples of 3,
(b) exactly one multiple of 3,
(c) less than two multiples of 3.
181
04.4 MEP Y8 Practice Book A
2
5. A coin has been weighted, so that the probability of getting a head is and
5
3
the probability of getting a tail is ; the coin is thrown twice. Determine
5
the probability of obtaining:
(a) 2 heads, (b) no heads, (c) at least one head.
(c) no reds.
9. A bag contains 1 red ball, 2 green balls and 4 yellow balls. A ball is taken
from the bag at random. The ball is then put back, and a second ball is
taken at random from the bag.
What is the probability that:
(a) both balls are the same colour,
(b) no yellow balls are taken out,
(c) at least one yellow ball is taken out?
10. Each of 10 balls is marked with a different number from 1 to 10. One ball
is taken at random and then replaced. A second ball is then taken at
random. Determine the probability that:
(a) both balls taken are marked with the number 5,
(b) both balls taken have even numbers,
(c) both balls taken have numbers which are multiples of 3,
(d) at least one of the balls taken has a number greater than 2.
11. On his way to work, Paul has to pass through 2 sets of traffic lights. The
probability that the first set of lights is green is 0.5, and the probability that
the second set of lights is green is 0.4.
What is the probability that both sets of lights are green?
182
MEP Y8 Practice Book A
12. On her way to the theatre, Sheila passes through 3 sets of traffic lights. The
1
probability that each set of lights is green is .
3
(a) What is the probability that none of the lights is green?
(b) What is the probability that two sets of lights are green and the other
set is not green?
13. A B C
P
D E
Q
R
The diagram shows a section of a railway track. At each of the junctions
3
B, C, D and E, the probability of going straight on is .
4
The train starts at A.
(a) What is the probability that it reaches P?
(b) What is the probability that it reaches Q?
14. Z
A (food)
X
R B (nothing)
W
C (water)
Y
A rat leaves position R and starts walking towards B. If it reaches B it gets
nothing, if it reaches A it gets food and if it reaches C it gets water.
At each of the junctions W, X, Y and Z, the probability of going straight on
is 0.6 and the probability of branching off is 0.4.
(a) What is the probability that the rat gets food?
(b) What is the probability that the rat gets water?
(c) What is the probability that it gets nothing?
15. When two fair dice are thrown, what is the probability that the score on the
second dice is higher than the score on the first dice?
183
MEP Y8 Practice Book A
EITHER the first ball was red, so the probability that the second ball is red is
7
, since there are 3 blue balls but only 7 red balls left.
10
OR the first ball was blue, so the probability that the second is red is
8 4
= , since there are 8 red balls but only 2 blue balls left.
10 5
Tree diagrams are very useful for this type of problems.
Example 1
A bag contains 7 yellow balls and 5 red balls. One ball is taken from the bag at
random, and is not replaced. A second ball is then taken from the bag.
Determine the probability that:
(a) both balls are red, (b) both balls are the same colour,
(c) the balls are different colours, (d) at least one ball is yellow.
Solution
The tree diagram below shows the probabilities and outcomes:
2nd Ball OUTCOMES PROBABILITIES
6 7 6 42
1st Ball Y Y Y × =
11 12 11 132
7 Y 5
12 11 7 5 35
R Y R × =
12 11 132
5 7
Y R Y 5 7 35
12 11 × =
12 11 132
R 4
11 5 4 20
R R R × =
12 11 132
132
total = =1
132
184
MEP Y8 Practice Book A
20
(a) p ( both red ) =
132
5
=
33
(b) p ( both the same colour ) = p (Y Y) + p ( R R )
42 20
= +
132 132
62
=
132
31
=
66
(c) p (different colours)
35 35
= 1 − p (same colour ) or p (Y R ) + p ( R Y) = +
132 132
31 70
= 1− =
66 132
35 35
= =
66 66
42 35 35
= + + or = 1 − p (R R)
132 132 132
112 20
= = 1−
132 132
28 112
= =
33 132
28
=
33
Example 2
There are 4 boys and 5 girls who are hoping to be selected for a school quiz team.
Two of them are selected at random to be in the team.
Determine the probability that:
(a) 2 boys are chosen,
(b) at least 1 girl is chosen,
(c) 1 girl and 1 boy are chosen.
185
04.5 MEP Y8 Practice Book A
Solution
The tree diagram below shows the outcomes and the probabilities:
OUTCOMES PROBABILITIES
4 5 4 20 5
G G G × = =
8 9 8 72 18
5 G 4
9 8 5 4 20 5
B G B × = =
9 8 72 18
4 5
G B G 4 5 20 5
9 8 × = =
9 8 72 18
B 3
8 4 3 12 3
B B B × = =
9 8 72 18
18
total = =1
18
3
(a) p (2 boys) =
18
1
=
6
5 5 5
(b) p (at least 1 girl) = + +
18 18 18
15
=
18
5
=
6
5 5
(c) p (1 boy and 1 girl) = +
18 18
10
=
18
5
=
9
Note: The questions in Examples 1 and 2 could have been answered without the
use of tree diagrams, but a tree diagram helps greatly with the analysis of
the problem; the same is true for the next example.
Example 3
1
The probability that Ravi does his homework is if he goes out with his friends
10
3
and of he does not go out with his friends. The probability that Ravi goes out
5
3
with his friends is . What is the probability that Ravi does his homework?
4
186
MEP Y8 Practice Book A
Solution
Solution 1
p ((goes out and does homework ) or (does not go out and does homework ))
=
p (goes out ) × p (does homework ) + p (does not go out ) × p (does homework )
3 1 1 3
= × + ×
4 10 4 5
3 3
= +
40 20
9
=
40
Solution 2
OUTCOMES PROBABILITIES
1 3 1 3
does O D × =
10 4 10 40
3 out 9
4 10 3 9 27
does not O D' × =
4 10 40
1 3
does O' D 1 3 3
4 5 × =
4 5 20
not out 2
5 1 2 2 1
does not O' D' × = =
4 5 20 10
Note: O' means does not go out, and D' means does not do homework.
3 3
p (does homework ) = +
40 20
9
=
40
Exercises
1. A bag contains 3 pink balls and 2 blue balls. One ball is taken out at random
and not replaced. A second ball is then taken out.
Determine the probability that:
(a) both balls are pink,
(b) both balls are the same colour,
(c) at least one ball is blue.
187
04.5 MEP Y8 Practice Book A
2. In Tim's drawer there are 6 black socks and 5 white socks. He takes out two
socks at random. What is the probability that he has taken two socks of the
same colour?
3. In a tennis club there are 5 boys and 3 girls in a training squad. Two are
chosen at random to represent the club.
Determine the probability that they are:
(a) both boys,
(b) both girls,
(c) a boy and a girl.
4. Tara has five 10p coins and four 20p coins in her purse. She takes out two
coins at random. What is the probability that she takes out at least 30p?
5. There are 8 footballs in a store cupboard; one is yellow and the others are
white. A pupil takes 2 footballs out of the cupboard at random. What is the
probability that one of them is the yellow ball?
2 1
6. The probability of Jeremy passing a maths exam is if he revises and if
3 3
1
he does not revise. The probability that he revises is . What is the
4
probability of Jeremy passing the maths exam?
7. The probability of Jenny getting to work on time is 0.8 if she gets up before
7 a.m. and 0.4 if she does not get up before 7 a.m. The probability that
Jenny gets up before 7 a.m. is 0.7. What is the probability that Jenny is late
for work?
8. Ian is an inept mountaineer who tends to fall from rock faces. The
probability that he falls is 0.2 if the weather is dry but rises to 0.5 if it is
wet. The probability of wet weather is 0.3. Determine the probability that
Ian falls.
10. Peter and Jane play a game in which they each in turn take a counter at
random from a bag containing 7 red counters and 3 yellow counters. The
winner is the first to get a red counter. Jane goes first. By drawing a tree
diagram, determine the probability that Peter wins the game.
188
Zamzam University science & Technology
Learning Objectives
When you have completed this chapter, you will be able to:
1. Define a parameter, statistic and with examples
2. Define a point estimate. Define level of confidence.
3. Construct a confidence interval for the population mean
when the population standard deviation is known.
4. Construct a confidence interval for a population mean when
the population standard deviation is unknown.
5. Construct a confidence interval for a population proportion.
6. Determine the sample size for attribute and variable
sampling
Introduction
This chapter considers several important aspects of
sampling. We begin by studying point estimates. A point
estimate is a single value (point) derived from a sample
and used to estimate a population value. For example,
suppose we select a sample of 50 junior executives and
ask how many hours they worked last week. Compute the
mean of this sample of 50 and use the value of the sample
mean as a point estimate of the unknown population
mean. However, a point estimate is a single value. A more
informative approach is to present a range of values in
which we expect the population parameter to occur. Such
a range of values is called a confidence interval.
1. Parameter and statistics
Population: is the totality or collection of all objects or
individuals on which observations are taken on the
basis of some characteristic of the objects in any field.
Parameter: any numerical value describing a
characteristic of population is called a parameter.
It is customary to represent parameters by
Greek letters. By tradition the arithmetic mean
of population is denoted by a Greek letter µ(mu)
similarly, population variance 𝜎 2 ,
Correlation coefficient (ρ), regression
coefficient (β), proportion (𝜋) etc.
Statistic: any numerical value describing a
characteristic of a sample is called a statistic.
Confidence intervals
b. Where:
̅
𝒙 is sample mean
𝒛 is normal distribution
𝜶 is level of confidence
𝜎 is Population Standard Deviation
𝑛 is sample size
𝜇 is population mean
c. Use Z-distribution: If the population
standard deviation is known or the sample
is greater than 30
Example 1
Find the reliability factor, 𝑍𝛼⁄2 to estimate
Example 2
A college admissions officer for a Health
Sciences HS program has determined that
historically applicants have undergraduate
grade point averages that are normally
distributed with standard deviation 0.45. From
a random sample of 49 applications from the
current year, the sample mean grade point
average is 2.90.
a. Find a 95% confidence interval for the
population mean
Lecturer: Abdirahman Macalim Ibrahim Abdi
Zamzam University science & Technology
b. Give comment
Solution
Interpretation
Based on a sample
of 49 applicants, a 95% confidence
interval for the unknown applicants on
HELTHE SCIESE have undergraduate
grade point averages mean extends
from approximately 2.8 grade to
approximately 3 grade.
where
̅
𝒙 is sample mean
Student’s t Table
Example 3
Example 4
Solition
and 33%.
Where:
Example 5
Solution
𝟐
𝒛. 𝝈 𝟐(1.96)(6250)
𝒏= ⟨ ⟩ = ⟨ ⟩ = 𝟔𝟎𝟏
𝑬 𝟓𝟎𝟎
where :
p is estimate from a pilot study or some source,
otherwise, 0.50 is used
z - the z - value for the desired confidence level
E - the maximum allowable error
Example 6
The American Kennel Club wanted to estimate the
proportion of children that have a dog as a pet. If the club
wanted the estimate to be within 3% of the population
proportion, how many children would they need to
contact? Assume a 95% level of confidence and that the
club estimated that 30% of the children have a dog as a
pet.
Solution
2
1.96
n (.30)(.70) 897
.03
completed
this chapter, you will be able to:
Define a hypothesis and hypothesis
testing.
Describe the five-step hypothesis-
testing procedure.
Distinguish between a one-tailed
and a two-tailed test of hypothesis.
Define Type I and Type II errors.
Conduct a test of hypothesis about
a population mean.
Conduct a test of hypothesis about
a population proportion.
Level of Significance:
Upper-Tail Tests
Lower-Tail Tests
Two-Tail Tests
Example1
A claim: the mean body temperature of health adults is
equal to 98.6℉. A sample data: 𝑛 = 106 , 𝑥̅ = 98.20℉. Assume
that 𝜎 = 0.62 and significance level is 𝛼 = 0.05
Solution
𝐻0 : 𝜇 = 98.6℉
𝐻1 ∶ 𝜇 ≠ 98.6℉
𝜶 = 𝟎. 𝟎𝟓
̅
𝒙− 𝝁 𝟗𝟖. 𝟐 − 𝟗𝟖. 𝟔
𝒛= 𝝈 = = −𝟔. 𝟔𝟒𝟐
𝟎. 𝟔𝟐
√𝒏 √𝟏𝟎𝟔
Example 2
The waiting time for patient of hospital follows a normal
distribution with a mean of 3 minutes and population
standard deviation of 1 minute. The department of ENT
sampled 50 patient and found that the mean waiting time
was 2.75 minutes. At the .05 significance level, can we
Example 3
The average cost of a hospital room in New
York City is assumed to be $ = 168 per night. A
random sample of 25 hospitals gives 𝑥̅ = $172.5
and 𝑠 = $15.4. Test the hypothesis at 𝛼 = 0.05
level (Assume the population distribution is
normal)
Solution
Example 4
MetLife claims that 8% of the Dhaka city residents
have graduates in Health Science. To test this claim, a
random sample of 500 were surveyed and found that
25 have graduate in Health Science. Test the
hypothesis at 𝛼 = 0.05 significance level.
Solution
Hypothesis Tests
Decision Rules
Example1
𝐻0 : 𝜇1 ≤ 𝜇2
𝐻1: 𝜇1 > 𝜇2
Step 2: the Level of Significance
𝜶 = 𝟎. 𝟎𝟓
𝟔𝟏.𝟒−𝟔𝟎.𝟔
𝒁= 𝟐 𝟐
= 𝟑. 𝟏𝟖𝟕
√(𝟏.𝟐) +(𝟏.𝟏)
𝟒𝟓 𝟑𝟗
Where:
𝒙 𝒏𝒖𝒎𝒃𝒆𝒓 𝒐𝒇 𝒔𝒖𝒄𝒄𝒆𝒔𝒔𝒇𝒖𝒍 𝒊𝒏 𝒇𝒊𝒔𝒕 𝐨𝐮𝐭𝐜𝐨𝐦𝐞𝐬
̂𝒙 =
𝒑
𝒏 𝒏𝒖𝒎𝒃𝒆𝒓 𝒐𝒇 𝒇𝒊𝒔𝒕 𝒔𝒂𝒎𝒑𝒍𝒆 𝒛𝒊𝒔𝒆
Example 2
𝐻0 : 𝜇𝑥 = 𝜇𝑦
𝐻0 : 𝜇𝑥 ≠ 𝜇𝑦
𝜶 = 𝟎. 𝟎𝟓
𝒏𝒙 = 𝟏𝟎𝟐𝟑𝟗
𝒙 = 𝟏𝟎𝟏
𝒏𝒚 = 𝟗𝟖𝟕𝟕
𝒚 = 𝟓𝟔
𝒙 𝟏𝟎𝟏
̂𝒙 =
𝒑 = = 𝟎. 𝟎𝟎𝟗𝟖𝟔
𝒏𝒙 𝟏𝟎𝟐𝟑𝟗
𝒚 𝟓𝟔
̂𝒚 =
𝒑 = = 𝟎. 𝟎𝟎𝟓𝟔𝟕
𝒏𝒚 𝟗𝟖𝟕𝟕
𝒏𝒙 𝒑 ̂𝒚
̂ 𝒙 + 𝒏𝒚 𝒑 (𝟏𝟎𝟐𝟑𝟗)(𝟎. 𝟎𝟎𝟗𝟖𝟔) + (𝟗𝟖𝟕𝟕)(𝟎. 𝟎𝟎𝟓𝟔𝟕)
̂𝟎 =
𝒑 =
𝒏𝒙 + 𝒏𝒚 𝟏𝟎𝟐𝟑𝟗 + 𝟗𝟖𝟕𝟕
𝟏𝟓𝟕
= = 𝟎. 𝟎𝟎𝟕𝟖
𝟐𝟎𝟏𝟏𝟔
p̂x − p̂y
Z=
p̂0 (1 − p̂0 ) p̂0(1 − p̂0)
√ +
nx ny
0.00986 − 0.00567
= = 3.377
( ) ( )
√0.0078 1 − 0.0078 + 0.0078 1 − 0.0078
10239 9877
Where:
𝑠12is the variance (standard deviations
squared)of the fist sample
𝑠22 is the variance (standard deviations squared)
of the second sample
Where:
̅𝟏 is the mean of the first sample.
𝒙
̅𝟐 is the mean of the second sample.
𝒙
𝒏𝟏 is the number of observations in the first
sample.
𝒏𝟐 is the number of observations in the second
sample.
𝒔𝟐 is the pooled estimate of the population
variance.
The number of degrees of freedom in the test is the
total number of items sampled minus the total
number of samples. Because there are two samples,
there are n1 + n2 - 2 degrees of freedom.
Example3
Solution
Step 1: the Null Hypothesis (H0) and the Alternate
Hypothesis (H1)
𝑯 𝟎 : 𝝁𝟏 ≤ 𝝁𝟐
Students who claimed to have voted in the last
presidential election is less than or equal students, who
did not vote
𝑯 𝟏 : 𝝁𝟏 > 𝝁𝟐
Students who claimed to have voted in the last
presidential election is greater than students, who did
not vote.
𝜶 = 𝟎. 𝟎𝟓
𝟐
(𝒏𝟏 − 𝟏 )𝒔𝟏 𝟐 + (𝒏𝟐 − 𝟏 )𝒔𝟐 𝟐
𝒔𝒑 =
𝒏𝟏 + 𝒏𝟐 − 𝟐
𝒙 ̅𝟐
̅𝟏 − 𝒙 𝟐.𝟕𝟏− 𝟐.𝟕𝟗
𝒕= 𝟏 𝟏
= = -0.3994886
𝟏 𝟏
√𝒔𝒑 𝟐 (𝒏 + 𝒏 √𝟎.𝟑𝟒𝟗( + )
𝟏 𝟐 𝟏𝟒 𝟐𝟑
Degrees of freedom 𝒏𝟏 + 𝒏𝟐 − 𝟐 = 𝟏𝟒 + 𝟐𝟑 − 𝟐 = 𝟑𝟓
Example 5
𝑯 𝟎 : 𝝁𝟏 = 𝝁𝟐
The average hospital room rate in Mogadishu is equal to
average room rate in Hargiesa.
𝑯 𝟏 : 𝝁𝟏 ≠ 𝝁𝟐
𝜶 = 𝟎. 𝟎𝟓
𝒏𝟏 = 𝟓𝟎
̅𝟏 = 𝟖𝟖. 𝟒𝟐
𝒙
𝒔𝟏 = 𝟓. 𝟔𝟐
𝒏𝟐 =50
̅𝟐 = 80.61
𝒙
𝒔𝟐 = 4.83
𝑥̅ 1 − 𝑥̅2 88.42− 80.61
𝑡= 𝑠 2 𝑠 2
= 2 2
= 7.452419
√1 + 2 √5.62 + 4.83
𝑛1 𝑛2 50 50
Example
Advertisements by Sylph Fitness Center claim that
completing its course will result in losing weight. A random
sample of eight recent participants showed the following
weights before and after completing the course. At the .01
significance level, can we conclude the students lost
weight?
a. State the null hypothesis and the alternate
hypothesis.
b. What is the critical value of t?
c. What is the computed value of t?
Solution
Lecturer:
Abdirahman Ibrahim Abdi
Zamzam University science & Technology
Characteristics of F-Distribution
Thus, the right-tail critical value is the only one required. The
critical value of F for a two tailed test is found by dividing the
significance level in half and then referring to the appropriate
degrees of freedom in Appendix B.4. An example will illustrate.
Decision Rules: Two Variances
Example: F Test
Solution
Example 2
Solution
ANOVA Assumptions:
H0: µ1 = µ2 =…= µk
Example :
ANOVA TABLE
Example
E 𝐻0: µ1 = µ2 = µ3
𝐻1: µ1 = µ2 = µ3
Scatter diagram
Karl Pearson's coefficient of
correlation.
Method of least squares.
1. Correlation
Correlation: The degree of relationship between the
variables under consideration is measure through the
correlation analysis.
The measure of correlation called the correlation
coefficient
The degree of relationship is expressed by coefficient
which range from correlation( −1 ≤ 𝑟 ≥ +1). The
direction of change is indicated by a sign.
The correlation analysis enables us to have an idea
about the degree & direction of the relationship
between the two variables under study.
Correlation Analysis is the study of the relationship
between variables. It is also defined as group of
techniques to measure the association between two
variables.
Correlation is a statistical tool that helps to measure
and analyze the degree of relationship between two
variables.
Correlation analysis deals with the association
between two or more variables
2. Types of Correlation
As X is decreasing, Y is decreasing
As X is decreasing, Y is increasing
Type II
Type III
𝑋 = 1, 2, 3, 4, 5, 6, 7, 8,
𝑌 = 3 + 2𝑥
( −𝑣𝑒) 𝑜𝑟 ( + 𝑣𝑒)
Example
Coefficient of Determination
Example