HW 07: Bivariate Modeling

The starting point for most models

Purpose

We’ve been visually exploring relationships between two variables by creating appropriate plots to assess how the distribution of a primary outcome (response/dependent) variable changes according to the level of a predictor (explanatory/independent/covariate) variable. We can learn a lot by conducting exploratory data analysis, and if description is the goal then this is where your work can stop.

However, if you want to make conclusions or inference about a relationship, then formal statistical analysis techniques are needed. We start here by formally testing if relationships or associations between two measures exist, then later will see how additional third variables can potentially disrupt or enhance any association that you may find.


Submission instructions

  • Use the QMD template answer the questions.
  • This is a big assignment, so it will be broken into two stages.
    • First draft will be on analyses 1-2, to make sure you are making correct progress.
    • Upload a PDF of your work to the assignment in Canvas

Instructions

In this assignment you will practice FOUR(4) different types of bivariate analysis:

  1. (Q~B) Quantitative Outcome ~ Binary Categorical Explanatory == Two-sample t-tests for a difference in means
  2. (Q~C) Quantitative Outcome ~ Categorical Explanatory == ANOVA
  3. (B~C) Binary Outcome ~ Categorical (or Binary) Explanatory == \(\chi^{2}\) test of Association.
  4. (Q~Q) Quantitative Outcome ~ Quantitative Explanatory == Correlation analysis

For each analysis you will do the following steps:

  1. State which variable (including the variable name from your codebook) will be your explanatory variable and which will be your response variable.

    • Remember, you have some variables in your codebook that can act as both categorical and quantitative.
    • Decide which of those variables makes sense to “explain” the other. Don’t just blindly pick a bunch of variables.
    • Think about the relationship among your variables, keeping in mind your original research questions. You may use gender as your categorical explanatory variable if you are struggling to find an explanatory and response relationship that makes sense.
  2. Create an appropriate bivariate plot to visualize the relationship you are exploring. Calculate appropriate summary statistics. Summarize the relationship between the explanatory and outcome variables in short paragraph form.

  3. Write the relationship you want to examine in the form of a research question using symbols and words.

    • Define the parameters being tested. (\(\rho\), \(p_{1}\), \(\mu_{1}\), \(\rho_{1}\) etc)
    • Translate the null and alternative hypotheses into \(H_{0}\) and \(H_{A}\) with symbols.
  4. State and justify the analysis model. Check assumptions.

    • Even if these assumptions are potentially violated, for the purposes of this assignment, acknowledge this limitation and continue with the prescribed analysis. (If a non-parametric test is appropriate, state it here.)
  5. Conduct the test and make a decision about the plausibility of the alternative hypothesis.

  6. Write a conclusion in context of the problem that includes a point estimate, confidence interval, and p-value.