True or False? In a histogram, the height of the bar represents the frequency of the class.
❌ False. Correct reasoning: The AREA of the bar represents the frequency. The height represents the Frequency Density ($k \times \frac{\text{Frequency}}{\text{Class Width}}$).
In a histogram question, you are given that a specific bar (width w, height h) represents frequency f. How do you find the frequency of another bar?
1. Calculate the area of the known bar ($A_1 = w \times h$). 2. Establish the ratio $k = \frac{\text{Frequency}}{\text{Area}}$. 3. Calculate the area of the target bar ($A_2$). 4. Target Frequency = $k \times A_2$.
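The four steps above can be sketched in Python. This is a minimal illustration only: the function name and the bar dimensions are made up for the example and do not come from any exam question.

```python
def target_frequency(known_width, known_height, known_freq,
                     target_width, target_height):
    """Scale a known bar's frequency to another bar using area ratios."""
    known_area = known_width * known_height      # step 1: A1 = w * h
    k = known_freq / known_area                  # step 2: k = frequency / area
    target_area = target_width * target_height   # step 3: A2
    return k * target_area                       # step 4: frequency = k * A2


# Hypothetical bars: a 2 x 6 bar represents 24 items; the target bar is 4 x 2.5.
print(target_frequency(2, 6, 24, 4, 2.5))
```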
True or False? In a histogram, the height of the bar always represents the frequency of the class.
❌ False ✅ Correct: The area of the bar represents the frequency (Frequency = Area / k). The height represents the Frequency Density.
What is the formula for Frequency Density?
$\text{Frequency Density} = \frac{\text{Frequency}}{\text{Class Width}}$
What is the first step when asked to draw a stem and leaf diagram from raw data?
Order the data (unless specifically asked for an unordered diagram, which is rare).
When calculating Frequency Density for a histogram, how do you determine the class width for a class written as 10≤x<15 versus 10−14 (where data is rounded to the nearest integer)?
1. For 10≤x<15: Width = 15 − 10 = 5. 2. For 10−14 (rounded): Use class boundaries. Lower bound 9.5, upper bound 14.5. Width = 14.5 − 9.5 = 5.
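A small Python sketch of the two width rules above, followed by the frequency density calculation. The function names and the `rounded_to_integer` flag are hypothetical, chosen only for this example.

```python
def class_width(lower, upper, rounded_to_integer=False):
    """Width of a class; widen by 0.5 each side if values were rounded to integers."""
    if rounded_to_integer:
        # Class limits such as 10-14 become boundaries 9.5 and 14.5.
        return (upper + 0.5) - (lower - 0.5)
    # Continuous notation such as 10 <= x < 15 already states the boundaries.
    return upper - lower


def frequency_density(frequency, width):
    """Frequency density = frequency / class width."""
    return frequency / width


print(class_width(10, 15))                           # 10 <= x < 15
print(class_width(10, 14, rounded_to_integer=True))  # 10-14, rounded data
print(frequency_density(20, class_width(10, 15)))    # hypothetical frequency of 20
```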
True or False? On a box plot, the ’whiskers’ must always extend to the minimum and maximum values of the dataset.
❌ False ✅ Correct: If there are outliers, the whiskers extend to the highest/lowest values that are NOT outliers (or the outlier boundaries, depending on data availability). Outliers are marked separately with crosses.
True or False? In a back-to-back stem and leaf diagram, the leaves on the left-hand side are read from left to right (e.g., ∣5∣1 represents 51).
❌ False ✅ Correct: Leaves on the left are read outwards from the stem. If the stem is 5 and the leaf is 1 (written as 1∣5∣), it represents 51, but the sequence 321∣5∣ implies data points 51, 52, 53.
True or False? When calculating the mean of grouped data, you must always use linear interpolation.
❌ False ✅ Correct Method: Use the midpoint (x) of each class interval multiplied by its frequency (f), then divide the total by the total frequency: $\bar{x} = \frac{\sum fx}{\sum f}$.
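As a rough illustration of the midpoint method, the sketch below estimates a grouped mean. The class boundaries and frequencies are invented for the example, and the function name is hypothetical.

```python
def grouped_mean(classes):
    """Estimate the mean from (lower_boundary, upper_boundary, frequency) tuples."""
    total_f = sum(f for _, _, f in classes)
    # Each class is represented by its midpoint x = (lower + upper) / 2.
    total_fx = sum(((lower + upper) / 2) * f for lower, upper, f in classes)
    return total_fx / total_f


# Hypothetical table: 0-10 with frequency 4, 10-20 with frequency 6.
print(grouped_mean([(0, 10, 4), (10, 20, 6)]))
```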
What is the first step to calculate the combined mean of two data sets, Group A (size $n_1$, mean $\bar{x}_1$) and Group B (size $n_2$, mean $\bar{x}_2$)?
Calculate the total sum of values for both groups: $\sum x_{\text{total}} = n_1\bar{x}_1 + n_2\bar{x}_2$.
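A short sketch of the full calculation: the first step is the total sum from the card above, and the second step (dividing by the combined size $n_1 + n_2$) completes the mean. The group sizes and means are made-up values.

```python
def combined_mean(n1, mean1, n2, mean2):
    """Combined mean of two groups from their sizes and means."""
    total_sum = n1 * mean1 + n2 * mean2   # step 1: total of all values
    return total_sum / (n1 + n2)          # step 2: divide by the combined size


# Hypothetical groups: 10 values with mean 6.0 and 30 values with mean 8.0.
print(combined_mean(10, 6.0, 30, 8.0))
```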
When comparing two data sets (e.g., marks of two classes), which two key features must be mentioned to get full marks?
1. A measure of Location (Median or Mean). 2. A measure of Spread (IQR or Standard Deviation).
True or False? When comparing two data sets, it is valid to compare the Mean of one set with the Median of the other.
❌ False ✅ Correct: You must compare like with like. Use Mean & Standard Deviation together (for symmetrical data). Use Median & IQR together (for skewed data or data with outliers).
Which measure of location (Mean, Median, or Mode) is preferred when the data is skewed or contains outliers?
The Median
When should you use the Median and IQR instead of the Mean and Standard Deviation to summarise a data set?
When the data is skewed or contains extreme values (outliers).
Data x is coded using $y = \frac{x - a}{b}$. You have calculated the mean of the coded data ($\bar{y}$). How do you find the mean of the original data ($\bar{x}$)?
$\bar{x} = b\bar{y} + a$
Data x is coded using $y = \frac{x - a}{b}$. You have calculated the standard deviation of the coded data ($\sigma_y$). How do you find the standard deviation of the original data ($\sigma_x$)?
$\sigma_x = b \times \sigma_y$
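The two decoding rules can be checked numerically. The sketch below assumes b > 0 (as the cards implicitly do) and uses invented data; `decode_mean` and `decode_sd` are hypothetical helper names.

```python
import math
import statistics


def decode_mean(y_mean, a, b):
    """Undo the coding y = (x - a) / b for the mean."""
    return b * y_mean + a


def decode_sd(y_sd, b):
    """Undo the coding for the standard deviation (assumes b > 0); a has no effect."""
    return b * y_sd


# Hypothetical data coded with a = 100, b = 10.
x = [110, 120, 130, 150]
a, b = 100, 10
y = [(value - a) / b for value in x]

# The decoded statistics should agree with those computed directly from x.
print(math.isclose(decode_mean(statistics.mean(y), a, b), statistics.mean(x)))
print(math.isclose(decode_sd(statistics.pstdev(y), b), statistics.pstdev(x)))
```

Subtracting a shifts every value by the same amount, so it changes the mean but not the spread; only the scale factor b affects the standard deviation.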
When calculating the Upper Quartile ($Q_3$) for a discrete data list of size n, what is the first step?
Calculate $\frac{3n}{4}$. If the result is not an integer: Round UP to choose that position. If the result is an integer: Find the midpoint between that position and the next one.
True or False? Standard Deviation can be negative if the data values are all negative.
❌ False ✅ Correct Concept: Standard Deviation is always positive (or zero).
You are calculating the Lower Quartile ($Q_1$) for a discrete ordered list of n data values. You calculate $\frac{n}{4}$ and get a whole number (integer). What do you do?
Find the midpoint between the $\frac{n}{4}$th value and the next value (the $\left(\frac{n}{4} + 1\right)$th value).
How does the Standard Deviation ($\sigma_y$) change if the data is coded as $y = \frac{x - a}{b}$?
$\sigma_y = \frac{\sigma_x}{b}$
You are calculating the Upper Quartile ($Q_3$) for a discrete ordered list of n data values. You calculate $\frac{3n}{4}$ and get a decimal (non-integer). What do you do?
Round up to the next whole number and pick that data point.
How do you determine the position of the Lower Quartile ($Q_1$) from a discrete ordered list of n data points (Stem and Leaf)?
Calculate $\frac{n}{4}$. If whole number: Pick the midpoint between this term and the next. If decimal: Round UP to the next whole number and pick that term.
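The quartile position rule used in these cards can be written as one function. This is a sketch of that specific convention only (other textbooks and software use different quartile definitions); the function name and the `quarter` parameter are hypothetical.

```python
def quartile(ordered_values, quarter):
    """Quartile of an ordered discrete list; quarter=1 gives Q1, quarter=3 gives Q3."""
    n = len(ordered_values)
    numerator = quarter * n            # position is numerator / 4
    if numerator % 4 == 0:
        # Whole-number position: midpoint of this term and the next one.
        position = numerator // 4
        return (ordered_values[position - 1] + ordered_values[position]) / 2
    # Decimal position: round UP and pick that term (positions are 1-based).
    position = numerator // 4 + 1
    return ordered_values[position - 1]


eight = [1, 2, 3, 4, 5, 6, 7, 8]     # n/4 = 2 and 3n/4 = 6 are whole numbers
nine = [1, 2, 3, 4, 5, 6, 7, 8, 9]   # n/4 = 2.25 and 3n/4 = 6.75 are decimals
print(quartile(eight, 1), quartile(eight, 3))
print(quartile(nine, 1), quartile(nine, 3))
```

Integer arithmetic (`%` and `//`) is used instead of floating-point division so that the whole-number check is exact.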
True or False? $S_{xx} = \sum x^2 - (\sum x)^2$
❌ False ✅ Correct Formula: $S_{xx} = \sum x^2 - \frac{(\sum x)^2}{n}$
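A quick numerical check of the corrected formula, compared against the definition of $S_{xx}$ as the sum of squared deviations from the mean. The data values are arbitrary.

```python
def sxx(values):
    """S_xx = sum(x^2) - (sum(x))^2 / n."""
    n = len(values)
    return sum(v * v for v in values) - sum(values) ** 2 / n


def sxx_from_deviations(values):
    """Equivalent definition: sum of squared deviations from the mean."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values)


data = [2, 4, 6]
print(sxx(data), sxx_from_deviations(data))
```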
True or False? In a table where ’Age’ is given as 25, 26, 27 (discrete integers), the class width for the group ’25’ is 0.
❌ False ✅ Correct Width: 1 year (Boundary is 25.0 to 26.0)
True or False? When using linear interpolation to find the median of grouped data, you use the stated class limits (e.g., 10-14, 15-19).
❌ False. Correct reasoning: You must use the Class Boundaries (e.g., 9.5 and 14.5, 14.5 and 19.5) to close the gaps.
True or False? For grouped data with class intervals 10-14 and 15-19, the upper class boundary for the first class is 14.
❌ False ✅ Correct Boundary: 14.5
To use linear interpolation to find a median or percentile, what must you calculate for the frequency table first?
Cumulative Frequency
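A minimal sketch of linear interpolation on a grouped table, building the cumulative frequency as it goes. It assumes the classes are already given by their class boundaries and that the target position is fraction × n (so n/2 for the median), which is a common convention for grouped data; check which one your course uses. The table is invented.

```python
def interpolated_value(classes, fraction):
    """Estimate a percentile from (lower_boundary, upper_boundary, frequency) tuples.

    fraction=0.5 gives the median. Assumes the target position is fraction * n.
    """
    n = sum(f for _, _, f in classes)
    target = fraction * n
    cumulative = 0  # cumulative frequency before the current class
    for lower, upper, f in classes:
        if f > 0 and cumulative + f >= target:
            # Go the same proportion of the way through the class width
            # as the target is through the class frequency.
            return lower + (target - cumulative) * (upper - lower) / f
        cumulative += f
    raise ValueError("fraction must be between 0 and 1")


# Hypothetical table for classes 10-14, 15-19, 20-24 (rounded data),
# written with boundaries 9.5, 14.5, 19.5, 24.5.
table = [(9.5, 14.5, 4), (14.5, 19.5, 10), (19.5, 24.5, 6)]
print(interpolated_value(table, 0.5))
```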
Using measures of location (Mean, Median, Mode), what is the typical condition for a distribution to be Negatively Skewed?
Mean < Median < Mode
What is the formula for calculating Skewness involving standard deviation (given in textbook)?
$\frac{3(\text{Mean} - \text{Median})}{\text{Standard Deviation}}$ (Positive result = positive skew; Negative result = negative skew)
Using quartiles ($Q_1, Q_2, Q_3$), what is the condition for a distribution to be Positively Skewed?
$Q_3 - Q_2 > Q_2 - Q_1$
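Both skewness checks can be sketched in a few lines. The data and quartile values are invented, the function names are hypothetical, and the negative-skew branch is simply the mirror image of the quartile condition above.

```python
import statistics


def skew_coefficient(values):
    """3 * (mean - median) / standard deviation."""
    mean = statistics.mean(values)
    median = statistics.median(values)
    sd = statistics.pstdev(values)  # population standard deviation
    return 3 * (mean - median) / sd


def quartile_skew(q1, q2, q3):
    """Compare the upper and lower halves of the interquartile range."""
    upper, lower = q3 - q2, q2 - q1
    if upper > lower:
        return "positive"
    if upper < lower:
        return "negative"
    return "symmetrical"


data = [1, 2, 3, 4, 10]         # one large value pulls the mean above the median
print(skew_coefficient(data) > 0)
print(quartile_skew(2, 3, 6))   # hypothetical quartiles
```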
True or False? If a dataset contains an outlier calculated using Mean ± 2σ, you should automatically remove it from the dataset before calculating the mean.
❌ False ✅ Correct: You only remove anomalies (clearly impossible values or errors). Legitimate extreme values (e.g., a very tall person) should be kept unless the question instructs you to ’clean’ the data.
You are asked to identify outliers in a dataset. How do you decide which formula to use?
Check the question carefully. Common definitions include: 1. $Q_3 + k(\text{IQR})$ and $Q_1 - k(\text{IQR})$ (often k = 1.5). 2. $\mu \pm 2\sigma$. The exam question will explicitly state which rule to apply.
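The two common outlier rules can be sketched as follows. The default multipliers (k = 1.5 and 2 standard deviations) are only the usual choices mentioned above; the question's stated rule takes priority. All names and numbers here are illustrative.

```python
def iqr_fences(q1, q3, k=1.5):
    """Outlier boundaries Q1 - k*IQR and Q3 + k*IQR."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


def sd_fences(mean, sd, multiplier=2):
    """Outlier boundaries mean - m*sd and mean + m*sd."""
    return mean - multiplier * sd, mean + multiplier * sd


def find_outliers(values, low, high):
    """Values lying strictly outside the boundaries."""
    return [v for v in values if v < low or v > high]


low, high = iqr_fences(10, 20)   # hypothetical quartiles
print(low, high)
print(find_outliers([3, 12, 18, 40], low, high))
```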