Practical 2
Aim: Perform the following data pre-processing (feature selection/elimination) tasks using Python.
Theory: Feature selection reduces the dimensionality of the data before predictive analysis by keeping only the most informative variables. The approach described here ranks the variables and selects them in that order, and the selection is independent of the classifier used. The rank of a feature indicates how useful and important it is expected to be for classification.
Dataset Description:
Various data pre-processing techniques:
Univariate Feature Selection: Univariate feature selection examines each feature individually to determine the strength of the relationship of the feature with the response variable.
Recursive Feature Elimination: RFE fits a model and removes the weakest feature or features, repeating until the specified number of features is reached. RFE requires the number of features to keep to be specified; however, it is often not known in advance how many features are useful.
PCA: Principal Component Analysis (PCA) is used for dimensionality reduction. It uses simple matrix operations from linear algebra and statistics to project the original data onto the same number of dimensions or fewer.
Correlation Matrix: This matrix shows the relationship between each and every pair of features in the dataset, i.e. it shows the correlation of each feature with all other features in the dataset, including itself.
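The Recursive Feature Elimination and PCA descriptions above can be sketched as follows. This is only a minimal illustration, not the practical's own code: the Iris dataset bundled with scikit-learn, the logistic regression estimator, and the choice of keeping 2 features or components are all assumptions made for the example.

```python
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA
from sklearn.feature_selection import RFE
from sklearn.linear_model import LogisticRegression
from sklearn.preprocessing import StandardScaler

# Assumption: Iris stands in for the practical's dataset.
data = load_iris()
X, y = data.data, data.target

# Standardize so that coefficients and principal components are comparable.
X_scaled = StandardScaler().fit_transform(X)

# Recursive Feature Elimination: fit the model, drop the weakest feature,
# and repeat until only n_features_to_select features remain.
rfe = RFE(estimator=LogisticRegression(max_iter=1000), n_features_to_select=2)
rfe.fit(X_scaled, y)
kept = [name for name, keep in zip(data.feature_names, rfe.support_) if keep]
print("RFE kept:", kept)
print("RFE ranking (1 = kept):", rfe.ranking_)

# PCA: project the data onto 2 new axes (principal components).
pca = PCA(n_components=2)
X_pca = pca.fit_transform(X_scaled)
print("PCA output shape:", X_pca.shape)
print("Explained variance ratio:", pca.explained_variance_ratio_)
```

Note that RFE keeps a subset of the original columns, while PCA produces new columns that are combinations of all the original features.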
Task 1 Univariate Feature Selection:
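A possible way to carry out this task with scikit-learn is sketched below. The dataset (Iris), the ANOVA F-test scoring function `f_classif`, and `k=2` are assumptions chosen for illustration; the original practical may have used a different dataset or test.

```python
from sklearn.datasets import load_iris
from sklearn.feature_selection import SelectKBest, f_classif

# Assumption: Iris stands in for the practical's dataset.
data = load_iris()
X, y = data.data, data.target

# Score each feature on its own against the target and keep the best k.
selector = SelectKBest(score_func=f_classif, k=2)
X_selected = selector.fit_transform(X, y)

# Print the per-feature scores so the ranking is visible.
for name, score in zip(data.feature_names, selector.scores_):
    print(f"{name}: {score:.2f}")

selected = [
    name for name, keep in zip(data.feature_names, selector.get_support()) if keep
]
print("Selected features:", selected)
```

Each feature is scored independently of the others, so two highly correlated features can both be selected.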
Task 3 Heatmap:
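One way to draw a correlation heatmap is sketched below, again assuming the Iris dataset as a stand-in and using plain matplotlib. The output file name `correlation_heatmap.png` is hypothetical.

```python
import matplotlib
matplotlib.use("Agg")  # render to a file; remove this line to use plt.show()
import matplotlib.pyplot as plt
import numpy as np
from sklearn.datasets import load_iris

# Assumption: Iris stands in for the practical's dataset.
data = load_iris()
names = data.feature_names

# Pearson correlation between every pair of features (columns).
corr = np.corrcoef(data.data, rowvar=False)

fig, ax = plt.subplots(figsize=(6, 5))
image = ax.imshow(corr, cmap="coolwarm", vmin=-1, vmax=1)
ax.set_xticks(range(len(names)))
ax.set_yticks(range(len(names)))
ax.set_xticklabels(names, rotation=45, ha="right")
ax.set_yticklabels(names)

# Write the correlation value inside each cell.
for i in range(len(names)):
    for j in range(len(names)):
        ax.text(j, i, f"{corr[i, j]:.2f}", ha="center", va="center")

fig.colorbar(image, ax=ax, label="Pearson correlation")
fig.tight_layout()
fig.savefig("correlation_heatmap.png")
```

The diagonal is always 1 because each feature is perfectly correlated with itself, and the matrix is symmetric.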