EMPLOYEE PERFORMANCE ANALYSIS
4. Data Pre-Processing
c. Outlier Handling: Some features contain outliers, so we impute these outliers using the
interquartile range (IQR) method, because the features are not normally distributed.
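The IQR rule can be sketched as follows. The text does not say which value replaces an outlier, so this sketch assumes capping at the IQR fences (Q1 - 1.5*IQR and Q3 + 1.5*IQR); the function name cap_outliers_iqr, the multiplier k=1.5 and the sample values are illustrative assumptions, not taken from the project.

```python
import numpy as np

def cap_outliers_iqr(values, k=1.5):
    """Cap values outside [Q1 - k*IQR, Q3 + k*IQR] at the nearest fence.

    Assumption: capping is used as the imputation strategy; replacing
    outliers with the median would be another reasonable choice.
    """
    values = np.asarray(values, dtype=float)
    q1, q3 = np.percentile(values, [25, 75])
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return np.clip(values, lower, upper), (lower, upper)

# Hypothetical feature values with one obvious outlier (40).
sample = np.array([1, 2, 2, 3, 3, 3, 4, 4, 5, 40])
capped, (lower, upper) = cap_outliers_iqr(sample)
print("fences:", lower, upper)
print("capped:", capped)
```

Because the fences are built from quartiles rather than the mean and standard deviation, the rule does not rely on the feature being normally distributed.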
d. Feature Transformation: YearsSinceLastPromotion shows some skewness and kurtosis,
so we apply a square root transformation.
● Square root transformation: The square root transformation is one of the standard
transformations. It is used for count data (data that follow a Poisson distribution)
or small whole numbers. Each data point is replaced by its square root. Negative data
is first made non-negative by adding a constant, and then transformed.
● Q-Q Plot: A Q-Q (quantile-quantile) plot is a probability plot, a graphical method for
comparing two probability distributions by plotting their quantiles against each other.
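The transformation and the Q-Q comparison described above can be sketched as below. This is a minimal illustration, not the project's code: the years array is made-up count data standing in for YearsSinceLastPromotion, and the helper names (skewness, sqrt_transform, qq_points) are assumptions. The Q-Q points are computed by hand against a standard normal; in practice a library routine such as scipy.stats.probplot could be used to draw the plot.

```python
import numpy as np
from statistics import NormalDist

def skewness(x):
    """Sample skewness: third central moment over variance**1.5."""
    x = np.asarray(x, dtype=float)
    centred = x - x.mean()
    return (centred ** 3).mean() / (centred ** 2).mean() ** 1.5

def sqrt_transform(x):
    """Square root transform; shifts the data first if it has negatives."""
    x = np.asarray(x, dtype=float)
    shift = max(0.0, -x.min())  # constant added so every value is >= 0
    return np.sqrt(x + shift)

def qq_points(x):
    """Return (theoretical normal quantiles, sorted sample values)."""
    x = np.sort(np.asarray(x, dtype=float))
    n = len(x)
    probs = (np.arange(1, n + 1) - 0.5) / n  # plotting positions
    theoretical = np.array([NormalDist().inv_cdf(p) for p in probs])
    return theoretical, x

# Hypothetical right-skewed count data (value, how many times it occurs).
years = np.repeat([0, 1, 2, 3, 4, 5, 7, 9, 11, 15],
                  [5, 30, 20, 10, 6, 4, 3, 2, 1, 1])
years_sqrt = sqrt_transform(years)

print("skewness before:", round(skewness(years), 2))
print("skewness after: ", round(skewness(years_sqrt), 2))

# Points for a Q-Q plot of the transformed feature against a normal.
theoretical, observed = qq_points(years_sqrt)
```

If the transformed feature were close to normal, plotting observed against theoretical would give points lying near a straight line.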
e. Scaling the Data: The features are scaled with a standard scaler.
● Standard Scaling: Standardization rescales each feature by subtracting its mean and
dividing by its standard deviation, so that the scaled feature has mean 0 and standard
deviation 1. It is usually applied on the assumption that the feature is roughly
normally distributed.
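As a rough sketch of what standard scaling computes, the snippet below standardizes each column with NumPy. The standardize helper and the small example matrix are assumptions for illustration; scikit-learn's StandardScaler performs the equivalent computation (using the population standard deviation) and would be the usual choice in a real pipeline.

```python
import numpy as np

def standardize(X):
    """Scale each column to mean 0 and standard deviation 1."""
    X = np.asarray(X, dtype=float)
    mean = X.mean(axis=0)
    std = X.std(axis=0)  # population standard deviation (ddof=0)
    return (X - mean) / std, mean, std

# Hypothetical two-feature data on very different scales.
X = np.array([[1.0, 200.0],
              [2.0, 300.0],
              [3.0, 400.0],
              [4.0, 500.0]])
X_scaled, mean, std = standardize(X)
print(X_scaled.mean(axis=0))  # approximately 0 for every column
print(X_scaled.std(axis=0))   # approximately 1 for every column
```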
SMOTE: SMOTE (Synthetic Minority Oversampling Technique) is one of the most commonly
used oversampling methods for the class imbalance problem. It aims to balance the class
distribution by increasing the number of minority class examples. Rather than replicating
existing examples, SMOTE synthesises new minority instances between existing minority
instances.
3. Splitting Training And Testing Data: 80% of the data is used for training and 20% is
used for testing.
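The two steps above can be illustrated with a simplified NumPy sketch. It is not the reference SMOTE implementation: smote_sketch, split_80_20, the neighbour count k=5, the seeds and the toy data are all assumptions made for this example. In practice imbalanced-learn's SMOTE class and scikit-learn's train_test_split are the usual tools.

```python
import numpy as np

def smote_sketch(X_min, n_new, k=5, seed=0):
    """Create n_new synthetic rows by interpolating between minority rows.

    Simplified illustration: each synthetic row lies on the line segment
    between a random minority row and one of its k nearest minority
    neighbours.
    """
    rng = np.random.default_rng(seed)
    X_min = np.asarray(X_min, dtype=float)
    n = len(X_min)
    k = min(k, n - 1)
    # Pairwise Euclidean distances among the minority rows.
    dists = np.linalg.norm(X_min[:, None, :] - X_min[None, :, :], axis=2)
    np.fill_diagonal(dists, np.inf)  # a row is not its own neighbour
    neighbours = np.argsort(dists, axis=1)[:, :k]
    base = rng.integers(0, n, size=n_new)
    chosen = neighbours[base, rng.integers(0, k, size=n_new)]
    gap = rng.random((n_new, 1))  # position along the segment, in [0, 1)
    return X_min[base] + gap * (X_min[chosen] - X_min[base])

def split_80_20(X, y, seed=0):
    """Shuffle the rows, then use 80% for training and 20% for testing."""
    rng = np.random.default_rng(seed)
    order = rng.permutation(len(X))
    cut = int(round(0.8 * len(X)))
    train, test = order[:cut], order[cut:]
    return X[train], X[test], y[train], y[test]

# Hypothetical imbalanced data: 90 majority rows, 10 minority rows.
rng = np.random.default_rng(42)
X_major = rng.normal(0.0, 1.0, size=(90, 2))
X_minor = rng.normal(3.0, 1.0, size=(10, 2))

# Synthesise enough minority rows to match the majority class.
X_new = smote_sketch(X_minor, n_new=len(X_major) - len(X_minor))
X_bal = np.vstack([X_major, X_minor, X_new])
y_bal = np.concatenate([np.zeros(len(X_major), dtype=int),
                        np.ones(len(X_minor) + len(X_new), dtype=int)])

# Order follows the text: oversample first, then split 80/20.
X_train, X_test, y_train, y_test = split_80_20(X_bal, y_bal)
print(len(X_train), len(X_test))
```

Because every synthetic row is an interpolation between two existing minority rows, the new points stay inside the region already covered by the minority class instead of duplicating rows exactly.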