0% found this document useful (0 votes)
14 views

Assignment R

The document is an assignment on statistical analysis using R, submitted by Vaibhav Keshari for the course B.A. Program. It covers various topics including data exploration, data frame operations, vector and matrix operations, correlation and regression analysis, as well as data visualization techniques using ggplot2. The assignment includes practical examples and code snippets demonstrating the analysis of the 'airquality' dataset.
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
14 views

Assignment R

The document is an assignment on statistical analysis using R, submitted by Vaibhav Keshari for the course B.A. Program. It covers various topics including data exploration, data frame operations, vector and matrix operations, correlation and regression analysis, as well as data visualization techniques using ggplot2. The assignment includes practical examples and code snippets demonstrating the analysis of the 'airquality' dataset.
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 9

Assignment: Statistical Analysis R

Submitted By:
 Name: Vaibhav Keshari
 Roll Number: 1021
 Subject: Roman with R
 Course: B.A. Program

Table of Contents
1. Introduction
2. Data Exploration
3. Data Frame Operations
4. Vector Operations
5. Matrix and Array Operations
6. Correlation Analysis
7. Regression Analysis
8. Time Series Analysis
9. Pie Chart Visualization
10. Data Visualization with ggplot2
11. Export and Load Operations
12. Conclusion
1.Data Exploration

# Load the dataset and explore its structure

data("airquality")

head(airquality)

summary(airquality)

str(airquality)

Data Frame Operations

# Data Frame Operations airquality_subset <- airquality[, c("Ozone", "Wind", "Temp")]


airquality_subset$ozone_to_temp <- airquality_subset$Ozone / airquality_subset$Temp
high_temp_days <- subset(airquality, Temp > 80) print(head(high_temp_days))
Vector Operations

temperatures <- airquality$Temp squared_temps <- temperatures^2 mean_temp <-


mean(temperatures, na.rm = TRUE) sd_temp <- sd(temperatures, na.rm = TRUE) print(mean_temp)
print(sd_temp)
Matrix and Array Operations

airquality_matrix <- as.matrix(airquality[1:3, 1:3]) transposed_matrix <- t(airquality_matrix)


airquality_array <- array(1:27, dim = c(3, 3, 3)) print(transposed_matrix) print

(airquality_array)
Correlation Analysis

cor_matrix <- cor(airquality[, c("Ozone", "Wind", "Temp")], use = "complete.obs") print(cor_matrix)


library(corrplot) corrplot(cor_matrix, method = "circle")

Regression Analysis

regression_model <- lm(Temp ~ Ozone + Wind, data = airquality) print(summary(regression_model))


plot(airquality$Wind, airquality$Temp, main = "Regression Plot: Temp vs Wind") abline(lm(Temp ~
Wind, data = airquality), col = "red")
Pie Chart Visualization

Pie Chart month_counts <- table(airquality$Month) pie(month_counts, labels =


names(month_counts), main = "Month Distribution")
ggplot2 Visualization

library(ggplot2) ggplot(airquality, aes(x = Ozone, y = Temp, color = as.factor(Month))) +


geom_point(size = 3, na.rm = TRUE) + theme_minimal() + labs(title = "Ozone vs Temperature", x =
"Ozone", y = "Temperature", color = "Month")

You might also like