Skip to content

NithinMathewJosephAston/Dimensionality_reduction

Repository files navigation

Dimensionality_reduction

Task 1:

The file winequality-red-reduced.csv is used for this task. Implementation of sparse PCA to fit a model with three components to the raw data as mentioned in the coursework. Sparse PCA is a version of PCA which attempts to find sparse components. The idea is similar to factor analysis. Sparsity is controlled by a parameter alpha. For this task, set alpha to 5 as per the coursework. Followed by generating the components found by sparse PCA. Inorder to understand the advantage of scaling we iterate the same process but scale the data first so that it has mean 0 and variance 1, which is then succeeded by sparse PCA. We can observe magnitude variation in the array of the output generated between the two different iterations.

Before scaling:

image

After scaling:

image

Task 2:

The file winequality-red-full.csv is used for this task. It contains the full dataset in which each row has 12 features. Will be using this dataset to predict the quality of the wine from the other variables using PCA and the effect that reducing the dimensionality of the dataset will have on the performance of the regression algorithm.

image

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors