Research Project

Overview

Throughout the entire semester you will work with a co-author to analyze a data set and present the results of your research. Regular homework assignments will serve as a first draft of exploration into your research questions, and helps you build your story. However expect to visualize and analyze your data outside the those assignments.

You will build up your project in stages, revising multiple times. Here is the general outline, with each stage is explained in detail further below.

Your research team will produce two separate deliverable: a research poster and a short journal article called a research brief.

Deliverable: Research Poster

You will be organizing your findings using a Google Slides template that will lead you through an organized approach to reporting research. This short format will also help you concisely explain your research findings in a way will be easier to translate (fit) onto a poster.

  • These slides stay in Google (are not submitted to Canvas) and are subject to the mastery based assessment discussed at the bottom of this page.
  • The contents of each slide are specified in the slide template and explained below.
    • You are welcome to have “staging” slides where you can dump content, thoughts, analyses that you may end up using. These extra slides should be at the END of the required slides.
  • There are also example slides from prior students as a references.

Deliverable: Research Brief

You will also disseminate your findings by submitting a short journal article format called a research brief to the internal Journal of Math 615. These reports will be compiled into an internal journal with selected images decorating the cover and published on the course website for future cohorts.

See the Guide for Authors for more details.

Peer Review

After stage 3 and 5 you will submit PDF draft copies of your research brief to the journal editor (your instructor) via Canvas for feedback, peer review and scoring. It is expected that you revise your work after each set of reviews.

Peer reviews will be done in Canvas using a scoring rubric. Reviewers will be randomly assigned and blinded to the authors.

Learning Resources

Details for each stage

Stage 1: Choose a data set and identify a few research topics

Open source data repositories to peruse:

Sources not allowed:

  • Kaggle
  • UCI (or any other) Machine learning repository
Criteria for choosing a data set
  • You either know something about the topic or it is something you want to learn about
  • File type must be a .txt, .csv, .xlsx or .xls file
  • File size is less than 1 Gig
  • A codebook or data dictionary that fully explains what each variable means is available.
  • There are at least 200 rows (observations), but ideally between 500-10,000.
  • There are 10 or more unique and interesting variables
    • At least 4 quantitative variables
    • Variables are not functions of each other (e.g. weight in lbs and weight in kg)

Your data set must be approved before you are allowed to work with it. Use this submission link to upload the data (a sample or a codebook will also be sufficient) and tell me a little about it. Questions you should be prepared to answer:

  1. What is the data set you are interested in analyzing?
  2. How many observations (rows) and variables (columns) does it contain?
  3. Where did you get it from?
  4. Why are you interested in this topic?
  5. What do you think you want to research with this data?

The sooner your data is approved the sooner you can work with it!

Stage 2: Introduce your research question and variables of interest

  1. Make a copy of this Template and save it in the Project -00 Poster prep folder in our Google Drive.
  2. Name your file using the last names of your research team. E.g. Donatello, Raymond
  3. Fill out the required information on slides 1 -7. These slides for this stage are pale yellow in the template.
  • Slide 01. Title
  • Slide 02. Introduction
  • Slide 03. Background & lit review.
  • Slide 04. Explain the research problem / topic area
  • Slide 05. State your Research Question as a testable hypothesis.
  • Slide 06. Introduce the data. Where does it come from, how was it collected. How many records?
  • Slide 07. Description of measures (variables) being used.
Pay attention to slide Numbers!
  • Please ensure that all content is on the slide as expected.
  • Slides will not be filled in in direct numerical order.

Stage 3: Exploratory data analysis

Poster Preparation slides

Fill out the required information on slides 9, 10, 11 and 13. These slides for this stage are pale blue in the template.

  • Slide 09. Fully describe your primary response variable
  • Slide 10. Fully describe your primary explanatory variable
  • Slide 11. Fully describe the relationship between your primary explanatory and primary response variables
  • Slide 13. Fully describe the relationship of a third variable to either your explanatory or response variables

All using appropriate summary statistics, graphic and an explanatory sentence.

Research Brief

  • Write the first two sections of the manuscript: Introduction & Background, Study Design and Data collection methods.
  • Submit a PDF to Canvas by the due date.

Stage 4: Bivariate Inference

Poster Preparation slides only

Fill out the required information on slides 8, 12 and 14. Note that these directly correspond to your variables used on slide 11 and 13. If you decided to change variables since the last time, you will need to update the corresponding univariate descriptions and graphs. These slides are pale purple in the template.

  • Slide 08. Describe in a few sentences what analysis tools you will use to answer your research question
  • Slide 12. Analyze the relationship discussed in slides 11
  • Slide 14. Analyze the relationship discussed in slides 13

Conclusions should be written in clear English, in the context of the problem, includes summary statistics, confidence interval and a p-value. Refer to the homework for proper writing style.

Stage 5: Multivariable modeling & summarizing findings

Poster Preparation slides

Fill out the required information on slides 15-21. These slides are pale green in the template. You are trying to understand the relationship between your explanatory and response variable, in presence of information contained in other variables.

Slide 15: Model Building

  • Build a multivariable model by adding additional predictors to the model.
  • Explain your model building process in a few bullet points.
    • What variables did you test as other explanatory variables?
    • Which ones did you examine as confounders, or as effect moderators?
    • How did you determine your final model?
  • See the lecture notes (ASCN Ch 10) on model building as guidance.
  • Include any variables that were found to be significantly associated with the outcome
  • If you found a moderator, your model should include an interaction term with your moderating variable.
  • If you have a confounding variable, you still need to keep your primary explanatory variable in the model.

Slide 16: Multivariable Model - Summary of results

  • A table or plot of the regression coefficients (or Odds Ratios) must be presented.
  • At least one coefficient, the primary explanatory variable, must be interpreted in context of the problem.

Slide 17. Model Assessment

  • If using a linear or log-linear model;
    • present and interpret the model diagnostic plots
    • report and interpret \(R^{2}\)
  • If using a logistic regression model;
    • describe the distribution of predicted probabilities and note if there are any concerns
    • report and interpret the model accuracy, and the cutpoint used. (Ref: ASCN Ch 12.5)

Slide 18. Discussion

  • Here you will explain what your graphical and inferential results tell you about your topic.
  • Discuss if your research hypothesis was supported, if it was not, why you think that might be
  • Explain the overall story/trend/what you learned when you consider your univariate, bivariate & multivariate results about your topic.
  • Compare your results to previous research results. Do they agree or disagree?

Slide 19. Implications

  • What are the practical implications of your results?
  • What could others do with your findings?
  • What future research needs to be conducted?
    • This needs to be more specific than “other variables could be explored”. Which variables and why? What other research articles indicate that those other variables are relevant?

Slide 20. Limitations

  • Who are the results of this study generalizable to? (i.e. a subset of individuals?)
  • Were there any model assumptions that were not upheld?
  • If this is an observational study, you should make a statement about the findings are associations and not causal in nature
  • Are there other factors that could explain your response variable that you did not include in your model?

Slide 21. References

  • You can use smaller font to get all references on one slide.
  • Use references from research plan, and any additional references gathered along the way.
  • Make sure these are correctly done in APA format.
  • Proper citations for

Research Brief

  • Write the remaining sections of the manuscript: Data Preparation and Statistical Analysis Methods, Results, Discussion/Conclusion.
  • Submit a PDF to Canvas and print a copy for your instructor by the due date.

Stage 6: Dissemination

Research Poster

  • You will transfer all findings into a research poster, print the poster, and then present your research to your classmates during our class final period in a poster symposium format.
  • Full guidelines including examples and evaluation criteria are written in this blog post.
  • Submit the poster file as printed to Canvas by the due date.

Draft version

This draft is graded based on how complete the poster is. You should consider this a draft that you would circulate to your colleagues for final review and comments. There is a rubric in Canvas with details.

Save your poster as a PDF and upload to the Poster-Draft folder in Google Drive.

Final Version

Upload your final poster as it is printed in PDF format Canvas.

Presentation at the Poster Symposium

When not presenting, you will walk around and learn about others research. Ask the presenters questions and fill out an evaluation form as you go. Poster scoring follows the evaluation criteria and will be done via Google Forms. The link to this semesters form is in Canvas. Printed copies will be available upon request.

Research Brief

Following the Guide for Authors you will submit both the self contained HTML file and the PDF file to Canvas by the due date.

Project Grading Method

Poster Preparation Slides

This work will be done through a series of revisions gaining feedback from the instructor at each stage.

  • Each slide will be marked one of the following categories
    • Not Aeesssible: No content presented.
    • NR: Needs revision. This is C level work.
    • ME: Meets expectation. This is B level work.
    • E : Exemplary. This is A level work.

Rubric / Assessment form

Each team has their own ‘project assessment’ Google spreadsheet (same location as your poster) that shows you what achievement stage you are at for each item. It is locked to only view mode and only you and I have access to it. The Assessment item descriptions are a work in progress. I may add, combine, or reword items as I do my revisions to better fit what I am actually looking for. At the bottom of this file you will find a personalized grading rubric containing an estimate of your final score.

I will update your status column for the slides that are being assessed at that stage.

You can request a reassessment of prior work at (nearly) any time. (You can ask for it, I just may not get to it until the next time I’m reviewing slides.)

You are responsible for keeping track of your status and asking for reassessment.

Research Brief

The research brief will be graded in Canvas using a rubric.