In this capstone course, you will apply the data science skills and techniques you learned in the previous courses of the IBM Data Science with R or IBM Data Analytics with Excel and R Professional Certificate programs.
For the capstone project, you will take on the role of a data scientist who has recently joined an organization and is presented with a challenge that requires data collection, analysis, basic hypothesis testing, visualization, and modeling on real-world datasets. You will collect and understand data from multiple sources, wrangle and prepare data with Tidyverse, perform exploratory data analysis with SQL, Tidyverse, and ggplot2, model data with linear regression, create charts and plots to visualize the data, and build an interactive dashboard.
The project culminates in a presentation of your data analysis report, including an executive summary for the organization's stakeholders.
- Prepare data for modeling by handling missing values, formatting and normalizing data, binning, and converting categorical values into numeric values.
- Perform exploratory data analysis using descriptive statistics, data grouping, and correlation statistics.
- Lectures: 0
- Quizzes: 0
- Duration: 2 weeks
- Skill level: All levels
- Language: English
- Students: 0
- Assessments: Yes