Posts with the Statistics And Data Science tag
4 September, 2021
– It's possible to generate random variables from a specific distribution by using the inverse of the cumulative distribution function. We'll see how to do this with a few specific examples in R.
24 June, 2021
– Mean squared error (MSE) is an important measure of models that predict continuous variables. When we're using it to evaluate our model, however, we need to be careful that we're using the test MSE rather than the training MSE. We'll go over the differences here, using some examples in R.
15 February, 2021
– In part two, we dive deeper into some aspects of regression modelling, including multiple regression and evaluating model fit.
31 January, 2021
– This is the first of two posts describing the development and evaluation of a regression model of NZ's greenhouse gas emissions.
25 January, 2021
– A statistical interaction occurs when the effect of one variable is moderated by another variable. Here, we'll show how the relationship between a car's fuel efficiency and engine displacement changes with the number of cylinders.
21 January, 2021
– The chi-squared test is used to determine the statistical significance of a categorical explanatory variable's effect on a categorical response variable. We'll show to how to conduct a simple chi-squared test in Python.
24 February, 2020
– In The Undoing Project, Michael Lewis's excellent book about two groundbreaking behavioural psychologists, we get a glimpse at some of the mechanisms that drive our judgements and decisions. By coming up with interesting questions to pose to ...