Probability

Experiments, Sample Spaces and Events

  • For any experiment, outcome is uncertain.

  • “Sample Space - The set of all possible outcomes for an experiment.”

  • A set of possible outcomes i.e. a subset of points in a sample space in a probabilistic experiment is called an event.

Calculations involving sample spaces

  • The total probability of all points in a sample space adds up to 1.

Mutually Exclusive Events

  • “Mutually Exclusive Events - A set of events are mutually exclusive if the occurrence of one event precludes the occurrence of any other event.”

  • If the events are mutually exclusive then, \(P(E_1\) or \(E_2) = P(E_1) + P(E_2)\)

  • If the events are not mutually exclusive then, \(P(E_1\) or \(E) = P(E_1) + P(E_2) - P(E_1\) and \(E_2)\).

Complimentary Events

  • “Two events are complementary events if they have no points in common and together include all points in an experiment’s sample space.”

  • The complement of event \(E\) is \(\bar{E}\).

  • \(E\) and \(\bar{E}\) are mutually exclusive events.

  • \(P(E\) or \(\bar{E}) = 1 = P(E) + P(\bar{E})\).

Conditional Probability

  • “Given two events \(A\) and \(B\) the conditional probability of event \(A\) given that event \(B\) has occurred is written as \(P(A|B)\). Intuitively, once we know that event \(B\) has occurred, this is the chance that event \(A\) will occur.”

  • \(P(A|B) = \frac{P(A and B)}{P(B)}\)

  • “Given two events A and B we define the joint probability of A and B to be the probability that events A and B both occur.”

Independents Events

  • “Independent Events - Two events are independent if the occurrence of one of the events does not change our estimate of the probability of the other event. In short, events A and B are independent if and only if P(A|B) = P(A).”

  • For two independent events \(E_1\) and \(E_2\), \(P(E_1\) and \(E_2) = P(E_1) \times P(E_2)\)

Random Variables

  • “Random Variable - A function that associates a numerical value with every possible outcome of an experiment.”

  • Random variables can be discrete or continuous.

  • “Expected Value of a Random Variable - The average value you would expect to see of a random variable if you perform an experiment many times.”

  • “The expected value of a discrete random variable is found simply by multiplying each value of the random variable by its probability and then adding up the products.”

  • For discrete random variables, \(E(X) = \sum_{i=1}^{n} p_i x_i\).

  • “The expected value of a discrete random variable tells you, on average, what value that variable has.”

  • “Variance of a Random Variable - A measure of a random variable’s spread about its mean defined as the average squared deviation of a random variable from its mean. \(\sigma^2(X) = \sum_{i=1}^{n} p_i (x_i - E(X))^2\).”

  • The standard deviation of a random variable \(\sigma(X)\) is simply the square root of the variance.

Continuous Random Variables

  • “Continuous Random Variable - A continuous random variable is used to describe an uncertain quantity (such as height, weight, or return on a stock) which can assume an infinite number of values and is defined over an interval or intervals of values.”

  • “Probability density function (PDF) - A continuous random variable’s pdf tells us the relative likelihoods of the random variable’s possible values. The area under a pdf between a and b is the probability that the continuous random variable assumes a value between a and b.”

  • A PDF has the following properties:

    • It is always nonnegative.

    • The height of a PDF for a value x of a continuous random variable represents the relative likelihood that the random variable assumes a value near x.

    • The total area under the PDF must equal 1.

    • The probability of an event involving a continuous random variable corresponds to the area under the PDF. The total probability of all possible outcomes must equal 1, in line with the total area of 1 under the PDF.

Normal Random Variable

  • “Normal Random Variable - A continuous random variable that describes many quantities such as height, weight, or monthly sales of a product. A normal random variable is specified by its mean and standard deviation. The Excel function =NORMDIST(x, mean, standard deviation, True) gives the probability that a normal random variable with a given mean and standard deviation is ≤ x.”

  • “The normal random variable, often called the bell curve, has the following PDF: \(f(x) = \frac{1}{\sigma \sqrt{2\pi}} e^{- \frac{(x - \mu)^2}{2 \sigma^2}}\). Where, \(e=2.7182\).”

  • The normal random variable has several important properties:

    • A normal random variable assumes a mean value μ.

    • A normal random variable has a variance of \(\sigma^2\) and a standard deviation of \(\sigma\).

    • There is a 68% chance that a normal random variable assumes a value within σ of the mean, a 95% chance that it assumes a value within 2σ of the mean, and a 99.7% chance that it assumes a value within 3σ of the mean.

    • The normal PDF is symmetric about the mean: It looks the same to the left of the mean as it does to the right.