Posts

Statistics for Data Science

  This is a WIP  Written for audience who want to learn Statistics for Data Science Mean Median Mode Mean = Average = Arithmetic Mean = Sum of all numbers / number of numbers Mean indicates a typical number, middle number, measure of Central Tendency Data set = {4, 3, 1, 6, 1, 7} Mean = 22/6 = 3 2/3 = 3.666 Median is the middle number. Order all your numbers in a set and find the middle number. For even number of numbers, find the mean of two middle numbers. Median = Middle wo numbers 3 & 4 =  (3+4)/2 = 3.5 Mode is the most frequently occurring number in the data set. Mode = 1 Arithmetic Mean, Geometric Mean and Harmonic Mean Think about it this way. The arithmetic mean of a bunch of numbers is the number a that satisfies x₁ + x₂ + x₃ + .... = a + a + a + ... + a The geometric mean is the number b that satisfies x₁ * x₂ * x₃ * ... = b * b * b * ... * b There is also a harmonic mean, which is the number h that satisfies 1/x₁ + 1/x₂ + 1/x₃ + ... = 1/h + 1/h + ... + 1/h. Pearson Corre