Mean (Average)

Median

Mode

Comparing Means of Distributions

Impact of Outliers

Mean as the Balance Point

When summarizing data, the choice between mean, median, and mode depends on the data distribution and the presence of outliers. Understanding these measures and their implications is key to accurate data interpretation.

Interquartile Range (IQR)

Range

Variance

Standard Deviation

Calculating Standard Deviation Step by Step

  1. Find the Mean: Sum all the data points and divide by the number of points.
  2. Calculate Each Point's Deviation from the Mean: Subtract the mean from each data point.
  3. Square Each Deviation: Square each result from step 2.
  4. Sum the Squared Deviations: Add up all the squared deviations.
  5. Divide by (N-1) for a Sample, or (N) for a Population: This gives the variance.
  6. Take the Square Root of the Variance: This gives the standard deviation.

Understanding the Concepts

Each of these measures serves to describe the variability or dispersion within a dataset, and the choice among them depends on the specific context of the analysis.

Dividing by ( n - 1 ) in Variance Calculation

When calculating the variance for a sample (as opposed to a whole population), we use ( n - 1 ) in the denominator instead of ( n ). This is known as Bessel's correction.

Box and Whisker Plots

Box and whisker plots, also known simply as box plots, are a graphical representation of a statistical distribution's five-number summary: minimum, first quartile (Q1), median, third quartile (Q3), and maximum.

Box and whisker plots are a staple in exploratory data analysis, providing a quick visual summary of key data characteristics.