Online Courses and Tutorials provides you with the latest online courses information by assisting over 45,000 courses and 1 million students.

Learn programming, marketing, data science and more.

Get started today

Skip to main content

Data Analysis and Interpretation

About This Specialization Learn SAS or Python programming, expand your knowledge of analytical methods and applications, and conduct original research to inform complex decisions. The Data Analysis and Interpretation Specialization takes you from data novice to data expert in just four project-based courses. You will apply basic data science tools, including data management and visualization, modeling, and machine learning using your choice of either SAS or Python, including pandas and Scikit-learn. Throughout the Specialization, you will analyze a research question of your choice and summarize your insights. In the Capstone Project, you will use real data to address an important issue in society, and report your findings in a professional-quality report. You will have the opportunity to work with our industry partners, DRIVENDATA and The Connection. Help DRIVENDATA solve some of the world's biggest social challenges by joining one of their competitions, or help The Connection be…

Regression Modeling in Practice

Regression Modeling in Practice

About this course: This course focuses on one of the most important tools in your data analysis arsenal: regression analysis. Using either SAS or Python, you will begin with linear regression and then learn how to adapt when two variables do not present a clear linear relationship. You will examine multiple predictors of your outcome and be able to identify confounding variables, which can tell a more compelling story about your results. You will learn the assumptions underlying regression analysis, how to interpret regression coefficients, and how to use regression diagnostic plots and other tools to evaluate the quality of your regression model. Throughout the course, you will share with others the regression models you have developed and the stories they tell you.

Created by:  Wesleyan University

  • Jen Rose
    Taught by:  Jen Rose, Research Professor

  • Lisa Dierker
    Taught by:  Lisa Dierker, Professor
Basic Info
Commitment4 weeks, 4 - 5 hours per week
How To PassPass all graded assignments to complete the course.
User Ratings
Average User Rating 4.3See what learners said
Introduction to Regression
This session starts where the Data Analysis Tools course left off. This first set of videos provides you with some conceptual background about the major types of data you may work with, which will increase your competence in choosing the statistical analysis that’s most appropriate given the structure of your data, and in understanding the limitations of your data set. We also introduce you to the concept of confounding variables, which are variables that may be the reason for the association between your explanatory and response variable. Finally, you will gain experience in describing your data by writing about your sample, the study data collection procedures, and your measures and data management steps.
4 videos5 readings
  1. Reading: Some Guidance for Learners New to the Specialization
  2. Video: Lesson 1: Observational Data
  3. Video: Lesson 2: Experimental Data
  4. Video: Lesson 3: Confounding Variables
  5. Video: Lesson 4: Introduction to Multivariate Methods
  6. Reading: Getting Set up for Assignments
  7. Reading: Tumblr Instructions
  8. Reading: How to Write About Data
  9. Reading: Writing About Your Data: Example Assignment
Graded: Writing About Your Data
Basics of Linear Regression
In this session, we discuss more about the importance of testing for confounding, and provide examples of situations in which a confounding variable can explain the association between an explanatory and response variable. In addition, now that you have statistically tested the association between an explanatory variable and your response variable, you will test and interpret this association using basic linear regression analysis for a quantitative response variable. You will also learn about how the linear regression model can be used to predict your observed response variable. Finally, we will also discuss the statistical assumptions underlying the linear regression model, and show you some best practices for coding your explanatory variables Note that if your research question does not include one quantitative response variable, you can use one from your data set just to get some practice with the tool.
8 videos8 readings
  1. Reading: SAS or Python - Which to Choose?
  2. Reading: Getting Started with SAS
  3. Reading: Getting Started with Python
  4. Reading: Course Codebooks and Data
  5. Reading: Uploading Your Own Data to SAS
  6. Reading: SAS Program Code for Video Examples
  7. Video: SAS Lesson 1: More on Confounding Variables
  8. Video: SAS Lesson 2: Testing a Basic Linear Regression Mode
  9. Video: SAS Lesson 3: Categorical Explanatory Variables
  10. Reading: Python Program Code for Video Examples
  11. Video: Python Lesson 1: More on Confounding Variables
  12. Video: Python Lesson 2: Testing a Basic Linear Regression Model
  13. Video: Python Lesson 3: Categorical Explanatory Variables
  14. Video: Lesson 4: Linear Regression Assumptions
  15. Reading: Outlier Decision Tree
  16. Video: Lesson 5: Centering Explanatory Variables
Graded: Test a Basic Linear Regression Model
Multiple Regression
Multiple regression analysis is tool that allows you to expand on your research question, and conduct a more rigorous test of the association between your explanatory and response variable by adding additional quantitative and/or categorical explanatory variables to your linear regression model. In this session, you will apply and interpret a multiple regression analysis for a quantitative response variable, and will learn how to use confidence intervals to take into account error in estimating a population parameter. You will also learn how to account for nonlinear associations in a linear regression model. Finally, you will develop experience using regression diagnostic techniques to evaluate how well your multiple regression model predicts your observed response variable. Note that if you have not yet identified additional explanatory variables, you should choose at least one additional explanatory variable from your data set. When you go back to your codebooks, ask yourself a few questions like “What other variables might explain the association between my explanatory and response variable?”; “What other variables might explain more of the variability in my response variable?”, or even “What other explanatory variables might be interesting to explore?” Additional explanatory variables can be either quantitative, categorical, or both. Although you need only two explanatory variables to test a multiple regression model, we encourage you to identify more than one additional explanatory variable. Doing so will really allow you to experience the power of multiple regression analysis, and will increase your confidence in your ability to test and interpret more complex regression models. If your research question does not include one quantitative response variable, you can use the same quantitative response variable that you used in Module 2, or you may choose another one from your data set.
10 videos2 readings
  1. Reading: SAS Program Code for Video Examples
  2. Video: SAS Lesson 1: Multiple Regression
  3. Video: SAS Lesson 2: Confidence Intervals
  4. Video: SAS Lesson 3: Polynomial Regression
  5. Video: SAS Lesson 4: Evaluating Model Fit, pt. 1
  6. Video: SAS Lesson 5: Evaluating Model Fit, pt. 2
  7. Reading: Python Program Code for Video Examples
  8. Video: Python Lesson 1: Multiple Regression
  9. Video: Python Lesson 2: Confidence Intervals
  10. Video: Python Lesson 3: Polynomial Regression
  11. Video: Python Lesson 4: Evaluating Model Fit, pt. 1
  12. Video: Python Lesson 5: Evaluating Model Fit, pt. 2
Graded: Test a Multiple Regression Model
Logistic Regression
In this session, we will discuss some things that you should keep in mind as you continue to use data analysis in the future. We will also teach also you how to test a categorical explanatory variable with more than two categories in a multiple regression analysis. Finally, we introduce you to logistic regression analysis for a binary response variable with multiple explanatory variables. Logistic regression is simply another form of the linear regression model, so the basic idea is the same as a multiple regression analysis. But, unlike the multiple regression model, the logistic regression model is designed to test binary response variables. You will gain experience testing and interpreting a logistic regression model, including using odds ratios and confidence intervals to determine the magnitude of the association between your explanatory variables and response variable. You can use the same explanatory variables that you used to test your multiple regression model with a quantitative outcome, but your response variable needs to be binary (categorical with 2 categories). If you have a quantitative response variable, you will have to bin it into 2 categories. Alternatively, you can choose a different binary response variable from your data set that you can use to test a logistic regression model. If you have a categorical response variable with more than two categories, you will need to collapse it into two categories.
7 videos6 readings
  1. Reading: SAS Program Code for Video Examples
  2. Video: SAS Lesson 1: Categorical Explanatory Variables with More Than Two Categories
  3. Reading: Python Program Code for Video Examples
  4. Video: Python Lesson 1: Categorical Explanatory Variables with More Than Two Categories
  5. Video: Lesson 2: A Few Things to Keep in Mind
  6. Video: SAS Lesson 3: Logistic Regression for a Binary Response Variable, pt 1
  7. Video: SAS Lesson 4: Logistic Regression for a Binary Response Variable, pt. 2
  8. Video: Python Lesson 3: Logistic Regression for a Binary Response Variable, pt. 1
  9. Video: Python Lesson 4: Logistic Regression for a Binary Response Variable, pt. 2
  10. Reading: Week 1 Video Credits
  11. Reading: Week 2 Video Credits
  12. Reading: Week 3 Video Credits
  13. Reading: Week 4 Video Credits
Graded: Test a Logistic Regression Model
How It Works
Each course is like an interactive textbook, featuring pre-recorded videos, quizzes and projects.
Help from Your Peers
Help from Your Peers
Connect with thousands of other learners and debate ideas, discuss course material, and get help mastering concepts.
Earn official recognition for your work, and share your success with friends, colleagues, and employers.
Wesleyan University
At Wesleyan, distinguished scholar-teachers work closely with students, taking advantage of fluidity among disciplines to explore the world with a variety of tools. The university seeks to build a diverse, energetic community of students, faculty, and staff who think critically and creatively and who value independence of mind and generosity of spirit.


Popular posts from this blog

An Introduction to Interactive Programming in Python (Part 1)

About this course: This two-part course is designed to help students with very little or no computing background learn the basics of building simple interactive applications. Our language of choice, Python, is an easy-to learn, high-level computer language that is used in many of the computational courses offered on Coursera. To make learning Python easy, we have developed a new browser-based programming environment that makes developing interactive applications in Python simple. These applications will involve windows whose contents are graphical and respond to buttons, the keyboard and the mouse. In part 1 of this course, we will introduce the basic elements of programming (such as expressions, conditionals, and functions) and then use these elements to create simple interactive applications such as a digital stopwatch. Part 1 of this class will culminate in building a version of the classic arcade game "Pong".
Who is this class for: Recommended Background - A knowledge o…

Introduction to Data Science in Python

About this course: This course will introduce the learner to the basics of the python programming environment, including how to download and install python, expected fundamental python programming techniques, and how to find help with python programming questions. The course will also introduce data manipulation and cleaning techniques using the popular python pandas data science library and introduce the abstraction of the DataFrame as the central data structure for data analysis. The course will end with a statistics primer, showing how various statistical measures can be applied to pandas DataFrames. By the end of the course, students will be able to take tabular data, clean it,  manipulate it, and run basic inferential statistical analyses. This course should be taken before any of the other Applied Data Science with Python courses: Applied Plotting, Charting & Data Representation in Python, Applied Machine Learning in Python, Applied Text Mining in Python, Applied Social Ne…

Learn to Program and Analyze Data with Python

About This Specialization This Specialization builds on the success of the Python for Everybody course and will introduce fundamental programming concepts including data structures, networked application program interfaces, and databases, using the Python programming language. In the Capstone Project, you’ll use the technologies learned throughout the Specialization to design and create your own applications for data retrieval, processing, and visualization. Created by: 5 courses Follow the suggested order or choose your own. Projects Designed to help you practice and apply the skills you learn. Certificates Highlight your new skills on your resume or LinkedIn. Courses