LAB01
LAB01
Inde Project
Market (Weather Data (Healthcare
x Requirements
Analysis) Analysis) Data Analysis)
2. Find a Dataset:
!mkdir ~/.kaggle
import pandas as pd
2. Data Exploration:
stock_data.head()
stock_data.isnull().sum()
3. Data Cleaning:
stock_data['Date'] = pd.to_datetime(stock_data['Date'])
1. Basic Statistics:
stock_data.describe()
stock_data['Moving_Avg'] = stock_data['Close'].rolling(window=20).mean()
plt.legend()
plt.show()
y = stock_data['Close']
predictions = model.predict(X_test)
weather_data = pd.read_csv('path_to_weather_data.csv')
2. Data Exploration:
weather_data.head()
weather_data.isnull().sum()
3. Data Cleaning:
weather_data.fillna(weather_data.mean(), inplace=True)
o Convert Date column to datetime:
weather_data['Date'] = pd.to_datetime(weather_data['Date'])
1. Basic Statistics:
weather_data.describe()
plt.plot(weather_data['Date'], weather_data['Temperature'],
label='Temperature')
plt.show()
3. Correlation Matrix:
weather_data.corr()
y = weather_data['Temperature']
predictions = model.predict(X_test)
health_data = pd.read_csv('path_to_health_data.csv')
2. Data Exploration:
health_data.head()
health_data.isnull().sum()
3. Data Cleaning:
1. Basic Statistics:
health_data.describe()
health_data['Diagnosis'].value_counts().plot(kind='bar')
plt.show()
3. Correlation Matrix:
health_data.corr()
python
Copy code
y = health_data['Outcome']
predictions = model.predict(X_test)
Step 5: Data Visualization with Google Looker or Power BI
o Open Power BI, click on "Get Data," and select your CSV file.