Statistics

Awantik Das edited this page Dec 1, 2017 · 2 revisions
Statistics is the branch of mathematics, and the set of techniques, that we use to understand data.
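  • Central Tendencies - Understanding where the data is centered

    • mean - the arithmetic average of the data. Every value contributes, so a single extreme value (an outlier) can shift it noticeably.
    • median - the middle element of the sorted data (the average of the two middle elements when the count is even). It is robust to outliers, because extreme values barely move it.
    • quantile - cut points that divide the sorted data into equal-sized parts. The quartiles sit at 25%, 50% & 75% of the data; the 50% quartile is the median.
    • mode - the most frequently occurring element.
  • Dispersion - Understanding how spread out the data is

    • variance - the average of the squared distances from the mean.
    • standard deviation - the square root of the variance, expressed in the same units as the data.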
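The measures listed above can be tried out with Python's standard `statistics` module. This is a minimal sketch: the `data` list is a made-up sample (with one deliberate outlier), and the population variants `pvariance` and `pstdev` are used as an assumption so that the variance matches "average of the squared distances from the mean". `statistics.quantiles` needs Python 3.8 or newer.

```python
import statistics

# Hypothetical sample; 100 is a deliberate outlier.
data = [2, 4, 4, 5, 7, 9, 100]

mean = statistics.mean(data)                  # pulled upward by the outlier
median = statistics.median(data)              # middle of the sorted data
mode = statistics.mode(data)                  # most frequent element
quartiles = statistics.quantiles(data, n=4)   # 25%, 50% and 75% cut points

# Population variance: average squared distance from the mean.
variance = statistics.pvariance(data)
# Standard deviation: square root of the variance.
std_dev = statistics.pstdev(data)

# Dropping the outlier changes the mean a lot and the median only a little.
trimmed = data[:-1]
trimmed_mean = statistics.mean(trimmed)
trimmed_median = statistics.median(trimmed)

print("mean:", mean, "-> without outlier:", trimmed_mean)
print("median:", median, "-> without outlier:", trimmed_median)
print("mode:", mode)
print("quartiles:", quartiles)
print("variance:", variance, "std dev:", std_dev)
```

Comparing the two printed lines for mean and median shows why the median is usually preferred when outliers are present.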
  • Correlation

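    • co-variance - how two variables vary in tandem from their means. It is positive when both tend to be above or below their means together, and negative when one tends to be above while the other is below.
    • Simpson's Paradox - correlations can be misleading when confounding variables are ignored. A trend that holds within every group can weaken or even reverse once the groups are pooled.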
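As a rough illustration of both points, the sketch below computes co-variance and the Pearson correlation by hand, then shows Simpson's Paradox on a tiny dataset. The two groups and their numbers are invented for the example, and the helper names `covariance` and `correlation` are hypothetical, not taken from any library.

```python
from math import sqrt

def covariance(xs, ys):
    """Population co-variance: mean product of deviations from each mean."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    return sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / n

def correlation(xs, ys):
    """Pearson correlation: co-variance scaled by both standard deviations."""
    return covariance(xs, ys) / sqrt(covariance(xs, xs) * covariance(ys, ys))

# Invented data: within each group, y rises with x.
group_a_x, group_a_y = [1, 2, 3, 4], [10, 11, 12, 13]
group_b_x, group_b_y = [6, 7, 8, 9], [3, 4, 5, 6]

corr_a = correlation(group_a_x, group_a_y)
corr_b = correlation(group_b_x, group_b_y)

# Pooling the groups ignores the grouping variable (the confounder).
pooled_x = group_a_x + group_b_x
pooled_y = group_a_y + group_b_y
corr_pooled = correlation(pooled_x, pooled_y)

print("group A correlation:", corr_a)
print("group B correlation:", corr_b)
print("pooled correlation:", corr_pooled)
```

Each group shows a positive correlation, yet the pooled correlation is negative, because group membership shifts both x and y and is hidden once the data is merged.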
  • Correlation and causation

    • correlation is not causation. When x & y are correlated, x may cause y, y may cause x, or neither may cause the other: a third variable may drive both, or the correlation may be a coincidence.
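The "neither causes the other" case can be simulated. In this sketch, a hidden variable `z` drives both `x` and `y`; all names, the noise level, the sample size and the seed are assumptions chosen for illustration, and `correlation` is a hypothetical helper defined inline.

```python
import random
from math import sqrt

def correlation(xs, ys):
    """Pearson correlation computed from deviations around each mean."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    dx = [x - mean_x for x in xs]
    dy = [y - mean_y for y in ys]
    cov = sum(a * b for a, b in zip(dx, dy))
    return cov / sqrt(sum(a * a for a in dx) * sum(b * b for b in dy))

rng = random.Random(0)  # fixed seed so the run is repeatable

# z is a hidden common cause; x and y never influence each other.
z = [rng.gauss(0, 1) for _ in range(1000)]
x = [zi + rng.gauss(0, 0.5) for zi in z]
y = [zi + rng.gauss(0, 0.5) for zi in z]

corr_xy = correlation(x, y)

# Removing z's contribution leaves only independent noise.
x_resid = [xi - zi for xi, zi in zip(x, z)]
y_resid = [yi - zi for yi, zi in zip(y, z)]
corr_resid = correlation(x_resid, y_resid)

print("correlation of x and y:", corr_xy)
print("correlation after removing z:", corr_resid)
```

x and y come out strongly correlated even though neither causes the other; once the shared cause is subtracted out, the correlation falls to roughly zero.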
