Skip to content

A collection of Exploratory Data Analysis (EDA) projects demonstrating skills in data cleaning, visualization, and statistical analysis using Python.

License

Notifications You must be signed in to change notification settings

navya99g/Exploration_Data_Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Exploratory Data Analysis (EDA) 📊🔍

Introduction

Exploratory Data Analysis (EDA) is a crucial step in data science that helps uncover patterns, detect anomalies, and summarize key characteristics of a dataset. This repository contains various EDA projects covering different domains, including global happiness, hotel bookings, and the Titanic disaster.

Projects 🚀

1️⃣ World Happiness Analysis (EDA) 🌍😊

This project explores the World Happiness Report 2019 to identify factors affecting happiness across different countries.

🔹 Key Insights:

  • Higher GDP per capita, social support, and healthy life expectancy are positively correlated with happiness.
  • Countries in North America, Australia, and Europe score higher in happiness compared to Asia and Africa.
  • Visualizations include happiness distribution, GDP correlation, and global happiness maps.

World Happiness Analysis


2️⃣ Hotel Booking Analysis (EDA) 🏨📊

This project examines hotel booking data to analyze booking trends, customer behavior, and cancellation rates.

🔹 Key Insights:

  • City Hotels have a higher cancellation rate than Resort Hotels.
  • Families with children prefer Resort Hotels.
  • Peak booking months occur in July and August.

🔗 Hotel Booking Analysis


3️⃣ Titanic Survival Analysis (EDA) 🚢⚓

This project explores survival patterns in the Titanic dataset, analyzing how factors like gender, class, and age influenced survival rates.

🔹 Key Insights:

  • Women had a 74% survival rate, while men had only 19%.
  • First-class passengers had a higher chance of survival (63%) than those in third-class (24%).
  • Children (0-12 years) had better survival rates than adults.
  • Passengers who paid higher fares were more likely to survive.

🔗 Titanic Survival Analysis


Tools Used 🔧

  • Python Libraries: pandas, numpy, matplotlib, seaborn, plotly
  • Statistical Analysis: Chi-square tests, correlation analysis
  • Visualization: Count plots, heatmaps, box plots, regression plots

Next Steps 📈

  • Expand trend analysis across multiple datasets.
  • Implement predictive modeling in another repository (Predictive Analysis ).
  • Conduct hypothesis testing to validate insights.

Acknowledgments

  • Datasets sourced from Kaggle.
  • Inspired by real-world data science applications.

📌 Check out the full analyses in their respective repositories! 🚀

About

A collection of Exploratory Data Analysis (EDA) projects demonstrating skills in data cleaning, visualization, and statistical analysis using Python.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published