A statistician's toolbox

Vocabulary

  • census: recensement
  • average: moyenne
  • mean: moyenne
  • median: médiane
  • standard deviation: écart-type
  • variance: variance
  • box plot: boîte à moustaches
  • scatter plot: nuage de points

Formulas

Let x1,x2,...,xn1,xnRx_1, x_2, ... , x_{n-1}, x_n\in\mathbb{R} be real numbers.

  • The mean is often denoted by x\overline x and is given by
x=1n×k=1nxk.{\displaystyle \overline x = \frac{1}{n}\times\sum_{k=1}^n x_k.}
  • The variance measures the dispersion of the values xkx_k. It is denoted by the the letter VV and is given by
V=1n×k=1n(xxk)2.{\displaystyle V = \frac{1}{n}\times\sum_{k=1}^n (\overline x -x_k)^2.}
  • The standard deviation is denoted by the Greek letter σ\sigma and is given by
σ=V.{\displaystyle \sigma = \sqrt{V}.}

Let y1,y2,...,yn1,ynRy_1, y_2, ... , y_{n-1}, y_n\in\mathbb{R} be real numbers. We now have two samples of data xx and yy.

  • We can compute their covariance
Cov(x,y)=1nk=1n(xxk)(yyk).{\displaystyle \mathrm{Cov}(x,y) = \frac{1}{n}\sum_{k=1}^n (\overline x-x_k)(\overline y-y_k).}

If the respective standard deviations of the values xx and yy are denoted by σx\sigma_x and σy\sigma_y, we then obtain their

  • Pearson correlation coefficient, denoted by rr, and given by
r=Cov(x,y)σxσy.{\displaystyle r = \frac{\mathrm{Cov}(x,y)}{\sigma_x\sigma_y}.}

We always have 1r1-1\leq r \leq 1. If the values xx and yy are mostly independant, we obtain r0r\approx0, whereas a value r±1r\approx\pm1 means that the values xx and yy are correlated.

Application

Three students, Alice, Beatriz, and Charles, are in the same class. Out of guilt, they admitted that they cheated during their year together. By analyzing their marks, can you understand who cheated?

Alice Beatriz Charles
12 2 15
15 12 12
3 12 6
12 6 16
3 5 6
18 17 18
8 5 9
8 16 11
4 9 0
7 13 14
9 13 12
8 14 2
12 10 14
17 17 12
16 9 9
14 5 12
9 9 16
16 10 9
16 9 8
1 15 7

Correlations

Sometimes two things can be correlated (have r1r\approx1), just by coincidence.

results matching ""

    No results matching ""