11 Data Science Capstone

This is the final course of the Data Science Specialization in R. It combines all the knowledge and skills learned during the course - from understanding data science, to installing R and RStudio, loading, subsetting, wrangling, exploring, using statistical inference, training, and testing our data sets based on applicable machine learning.

The capstone is a partnership between Johns Hopkins University and Swiftkey. I have used this product a while back in 2013-2015. I was amazed at the innovation on digital keyboards. The ability to slide your finger across the keyboard without lifting it. It then predicts the word with high accuracy. This course provides a blueprint on how to achieve the word prediction technology behind it.

The capstone will be evaluated based on the following assessments:

  1. An introductory quiz to test whether you have downloaded and can manipulate the data.
  2. An intermediate R markdown report that describes in plain language, plots, and code your exploratory analysis of the course data set.
  3. Two natural language processing quizzes, where you apply your predictive model to real data to check how it is working.
  4. A Shiny app that takes as input a phrase (multiple words), one clicks submit, and it predicts the next word.
  5. A 5 slide deck created with R presentations pitching your algorithm and app to your boss or investor.