Week 2: Introduction to Classification

Learning Objectives

Concepts

Without any programming, you should be able to: * Explain the difference between a regression problem and a classification problem including differences in the data formatting, training process, loss functions, and evaluation of results. * Graphically describe what is happening when linear regression, LASSO regression, logistic regression, or ridge regression or training on categorical data and how classifications are chosen for the test data. * Graphically explain how the k-Nearest Neighbors algorithms is trained to perform classification and how the classifications are chosen for the test data.

Implementation

Using the Python programming language, you should be able to: * Convert a categorical data set into the form needed for classification including converting categorical data given in words to numeric categories. * Implement linear regression and ridge regression for classification and make improvements to the accuracy of the initial training. * Implement k-Nearest Neighbors for classification and make improvements to the accuracy of the initial training.