Continuous random variables can take any value in an interval. Their distributions are described by probability density functions (PDFs): probabilities are areas under the curve, never the height at a point.
The normal distribution is the cornerstone of statistics. Its bell-curve shape emerges naturally when many small independent effects are added, and it underpins virtually all classical inference methods.
The exponential and uniform distributions complete the toolkit for modeling waiting times, lifetimes, and situations where all outcomes are equally likely.
PDFs and CDFs
A probability density function (PDF) satisfies and . Probabilities are areas under the curve: . The PDF height is not a probability — it is a density (probability per unit length).
For a continuous distribution, for any single point . This is why for continuous distributions: adding or removing a single endpoint changes nothing.
The cumulative distribution function (CDF) is . It is non-decreasing from to , right-continuous, and gives the probability of being at or below .
The relationship between PDF and CDF: (derivative of CDF is PDF) and (antiderivative of PDF is CDF).
Quantiles and percentiles: the -th quantile satisfies . For the standard normal, — this is where of the distribution lies below.
💡Explain it simply
A PDF is like a topographic map of probability. The height at any point shows how densely packed the probability is there. But the actual probability of a region is the volume (area) under the curve over that region — not the height at a single point.
The CDF is the running total: tells you the total probability that has accumulated by the point . It starts at (nothing accumulated yet) and ends at (all probability accounted for).
The normal distribution
The normal (Gaussian) distribution has PDF . It is symmetric about ; the inflection points are at .
The empirical (68-95-99.7) rule: ; ; .
Standardisation: any is transformed to . Standard normal probabilities are read from -tables or computed with software.
The normal distribution is additive: if and are independent, then .
Why normal? By the Central Limit Theorem, the sum (or average) of many independent, identically distributed random variables is approximately normal regardless of the underlying distribution. This explains the normal's ubiquity in nature and statistics.
💡Explain it simply
The bell curve is the shape that emerges whenever many small, independent random factors add together. Heights, measurement errors, test scores — they all pile up near the average and thin out toward the extremes. That's the normal distribution, and it appears everywhere precisely because of this 'sum of many small effects' mechanism.
Normal probability using z-scores
- Exam scores: . Find .
- Standardise: , .
- .
- About of students score between and .
The exponential and uniform distributions
The exponential distribution : for . Mean , variance . It models the waiting time between events in a Poisson process at rate .
Memoryless property: . The past waiting time provides no information about the future wait. This is the continuous analogue of the geometric distribution's memorylessness.
The uniform distribution : for . Mean , variance . Every sub-interval of equal length is equally probable.
The chi-squared distribution is the sum of squared independent standard normals. Used in goodness-of-fit tests and inference about variances. Mean , variance .
The -distribution with degrees of freedom is the ratio where and independently. It is bell-shaped but heavier-tailed than the normal, converging to as .
💡Explain it simply
Exponential: you're waiting for a bus that comes randomly at an average rate of once every minutes. The exponential distribution models how long you wait. The memoryless property says: having already waited minutes tells you nothing — the next bus is still just as far away as when you started.
Uniform: you pick a random number between and . Every tiny interval of the same width has the same probability. Completely flat.
Exponential waiting time
- Calls arrive at rate per minute. What is ?
- .
- About chance of waiting more than minute.
- Mean wait: min.
Normal approximation to the binomial
When is large and is not extreme, can be approximated by .
Validity condition: and . When is close to or , use the Poisson approximation instead.
Continuity correction: because the binomial is discrete, is approximated by using the normal. For , use .
💡Explain it simply
For large , a binomial histogram starts to look like a smooth bell curve. The normal approximation exploits this: instead of summing many binomial terms, you compute one area under the normal curve. The continuity correction patches up the gap between the 'staircase' (binomial) and the 'smooth curve' (normal).
Common Mistakes to Avoid
- Computing for a continuous random variable. Point probabilities are always zero. Always specify an interval.
- Reading the PDF height as a probability. can exceed ; it is a density, not a probability.
- Applying the normal approximation when or . Use exact binomial or Poisson instead.
- Forgetting the -table gives left-tail probabilities. For , use . For , use .
- Confusing the rate with the mean in the exponential distribution. Mean , not .