Project 2 - Dimensionality Reduction and Unsupervised Learning (Due 10/17)

Objective:

The objective of this project is, first of all, to practice the usage of dimensionality reduction as one of the preprocessing steps and study its impact to the classification performance. You need to use both supervised dimensionality reduction method (i.e., FLD) and unsupervised method (i.e., PCA and t-SNE) for that purpose. The second objective is to get an in-depth understanding of unsupervised learning algorithms and how to apply that for classification purpose. The third objective is to extend the horizon of machine learning applications and solve a seemingly quite unrelated problem -- image compression. Think hard if image compression should be solved using supervised learning or unsupervised learning, and what are the features in this application.

Data Sets:

Two datasets will be used. The first is the popular MNIST. The second is a beautiful color image of flowers.

Performance Metrics:

Besides the three metrics used in Project 1, i.e., 1) overall classification accuracy, 2) classwise classification accuracy (or confusion matrix), and 3) run time, we'll introduce a fourth metric that measures the quality of the compressed image as compared to the original full-color image - 4) root mean squared error (RMSE).

Tasks: