2023-10-02
(A.K.A Hypothesis Testing)
The Washington Post has been keeping a database on use of fatal force by police since 2015. This database tracks all incidents where someone was shot by police, and records various characteristics such as the seniority level of the officer, date, location, gender, age and ethnicity of the victim along with information on if the person was armed, fleeing or acting erratic.
(board work)
To decide whether variability in data is due to
we set up a sham randomized experiment as a comparison.
If the actual experiment has more extreme results than any of the sham experiments, we are led to believe that it is the explanatory variable which is causing the result and not just variability inherent to the data.
Turning to something equally important, but less potentially distressful, let’s do an experiment on sex discrimination in promotional decisions
(in class)
Open the collaborative notes and work with your neighbor to answer the following questions under the “Inference using Randomization” section.
Errors do occur, just like rare events, and the dataset at hand might lead us to the wrong conclusion. While a given dataset may not always lead us to a correct conclusion, statistical inference gives us tools to control and evaluate how often these errors occur.