Projects

Project

Importance of EDA in Data Science

Project

html5 bootstrap template by colorlib.com

Why is EDA so important?

The aim of this project was to highlight the importance of EDA in Data Science. Many upcoming data scientists tend to overlook this important step of Data Science and straight away go for Machine Learning. This method is actually wrong and should not be followed.

To highlight the importance of EDA, I took the spaceship titanic competition dataset and performed EDA on it. Then I highlighted the insights and conclusions that I gained from the EDA to prove its importance. I also handled missing values and outliers in the dataset which helped me create better visualizations such as Correlation Heatmap. I have received Kaggle Bronze Medal for this project and it was appreciated by some learners and community members for how neat and organized the code is.


Notebook