Coding self-assessment

Harris Coding Camp

Summer 2023

As part of the statistics curriculum, you will be asked to analyze data using the programming language R.
R is an open-source language that is widely used by data analysts and data scientists. In coding camp and
coding lab, we provide an introduction to R coding focused on data analysis.
This is a self-assessment. If you feel comfortable completing this assignment by yourself (with the help of
Google), then you are free to skip the coding camp and coding lab. Otherwise, you can use this to pick the
right track for you.

Task 1: [1]
1. Install R and RStudio.
2. Install the packages readxl and tidyverse.
3. Adjust the following code block to read in the provided data set,
incarceration_counts_and_rates_by_type_over_time.xlsx

library(tidyverse)
library(readxl)
setwd(<Put path to file here>)
incarceration_data <- read_xlsx("incarceration_counts_and_rates_by_type_over_time.xlsx",
                                range = "A7:CO10") %>%
  rename("type" = ...1) %>%
  pivot_longer(`1925`:`2016`, names_to = "year", values_to = "counts")

4. What does the code library(readxl) do and why is it necessary?

5. Why do you need to set a working directory (setwd())?
6. How many vectors are there in the dataset? How many observations?

7. Briefly explain the difference between vectors, lists, and data frames.
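
As a starting point for question 7, the three structures can be built by hand and inspected. This is only a minimal sketch in base R; the object names v, l, and df are made up for illustration and the values are typed in, not read from the data set.

```r
# Atomic vector: every element has the same type.
v <- c(1925, 1926, 1927)

# List: elements may have different types and lengths.
l <- list(year = 1925, type = "State prisons", flags = c(TRUE, FALSE))

# Data frame: a list of equal-length vectors, one per column.
df <- data.frame(year = v, counts = c(85239, 91188, 101624))

str(v)        # structure of the vector
str(l)        # structure of the list
str(df)       # structure of the data frame
is.list(df)   # a data frame is itself a list of columns
```

Running str() on each object is a quick way to see how R stores it.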

If you had trouble with readxl, we provide a csv file as well. You can load the data with the following code:

incarceration_data <- read_csv("incarceration_counts_and_rates_by_type_over_time.csv")

[1] Copying and pasting from the PDF will create syntax issues; in particular, it changes the type of quotes used. We provide
this code in a text file. Alternatively, you can retype the code, or copy and paste it and then fix the syntax issues.

Task 2:
We want to analyze state prison counts by decade. We’ll prepare the data in the following ways. Store the
following changes in a new tibble (data frame) called state_data.

1. Add a column called decade that reflects which decade the observation comes from.
2. Filter the data so that you only have data from State prisons.
3. Use select to reorder the columns so that your data is organized as below:

## # A tibble: 10 x 4
##    type          counts decade  year
##    <chr>          <dbl>  <dbl> <dbl>
##  1 State prisons  85239   1920  1925
##  2 State prisons  91188   1920  1926
##  3 State prisons 101624   1920  1927
##  4 State prisons 108157   1920  1928
##  5 State prisons 107532   1920  1929
##  6 State prisons 117268   1930  1930
##  7 State prisons 124118   1930  1931
##  8 State prisons 125721   1930  1932
##  9 State prisons 125962   1930  1933
## 10 State prisons 126258   1930  1934

4. Finally, find the mean, standard deviation, maximum, and minimum of counts for all observations from
State prisons.
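
One possible way to approach these steps is sketched below on a tiny hand-typed tibble instead of the real file. The names toy_data and toy_state are hypothetical, the Federal prisons value is a made-up placeholder, and the sketch assumes year arrives as character (pivot_longer() stores former column names as text), so it is converted first.

```r
library(dplyr)

# Toy stand-in for incarceration_data (not the real file).
toy_data <- tibble(
  type   = c("State prisons", "State prisons", "State prisons", "Federal prisons"),
  year   = c("1929", "1930", "1931", "1930"),
  counts = c(107532, 117268, 124118, 1000)  # last value is a made-up placeholder
)

toy_state <- toy_data %>%
  mutate(year = as.numeric(year),          # convert the character year to a number
         decade = year - year %% 10) %>%   # e.g. 1929 becomes 1920
  filter(type == "State prisons") %>%      # keep State prisons only
  select(type, counts, decade, year)       # reorder the columns

# Summary statistics of counts for the filtered rows.
toy_state %>%
  summarize(mean = mean(counts), sd = sd(counts),
            max = max(counts), min = min(counts))
```

Subtracting the remainder of a division by 10 is just one way to get the decade; floor(year / 10) * 10 gives the same result.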

Task 3:
In this section, you’ll use group_by() and summarize() to answer questions about state prison counts by
decade.

1. Which decade saw the largest percentage growth in State prisons? Measure percent growth as
$\frac{C_{de} - C_{ds}}{C_{ds}}$ (where $C_{de}$ is the count at the end of the decade and $C_{ds}$ is the count at the start of the
decade). You may consider using the first() and last() functions.

## # A tibble: 10 x 2
##    decade percentage_growth
##     <dbl>             <dbl>
##  1   1920            0.262
##  2   1930            0.365
##  3   1940           -0.0490
##  4   1950            0.245
##  5   1960           -0.0644
##  6   1970            0.581
##  7   1980            1.15
##  8   1990            0.725
##  9   2000            0.129
## 10   2010           -0.0553
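
The growth formula can be tried out on a small sample before running it on the full data. The sketch below is an illustration only: it retypes the ten State prison rows printed in Task 2 (1925 to 1934) into a hypothetical tibble, toy_state, and growth is a made-up name. Because the sample stops at 1934, its 1930 value will not match the full-decade figure in the table above.

```r
library(dplyr)

# The ten State prison rows shown in Task 2, typed in by hand.
toy_state <- tibble(
  year   = 1925:1934,
  counts = c(85239, 91188, 101624, 108157, 107532,
             117268, 124118, 125721, 125962, 126258)
) %>%
  mutate(decade = year - year %% 10)

growth <- toy_state %>%
  arrange(year) %>%        # first() and last() depend on row order
  group_by(decade) %>%
  summarize(percentage_growth = (last(counts) - first(counts)) / first(counts))

growth
```

Sorting by year before grouping matters, since first() and last() simply take the first and last rows of each group.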

Task 4:
You want to make a graph visualizing the change in incarceration counts in the United States over time.

incarceration_data %>%
  ggplot(???) +
  geom_???() +
  labs(???)

Adjust the code above in order to reproduce the following graph, including the choice of both axes, labels
on both axes, choice of line type and title.
[Figure: graph titled "Incarceration counts (total population on a single day) over time". x-axis: year (ticks at 1925, 1950, 1975, 2000); y-axis: counts (ticks at 0e+00, 5e+05, 1e+06); legend "type": Federal prisons, Local jails, State prisons.]

Task 5:
Miscellaneous tasks – We leave the data behind and test skills.

1. Take numbers <- rep(seq(-9, 10, 1), 10). Show that the mean of the vector numbers is 0.5 and
the sum of the components of numbers is 100.
2. Combine the strings assigned to left and right into a single string using an R function.

left <- "Harris"

right <- "School of Public Policy"

3. Use the ifelse() function to add a column called index to incarceration_data, assigning high if
counts >= 300000 and low otherwise.
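
The three items above can be explored with base R alone. This is a rough sketch, not the only approach: combined and toy are hypothetical names, and the toy data frame holds illustrative counts, not the real incarceration_data.

```r
# Item 1: mean and sum of the repeated sequence.
numbers <- rep(seq(-9, 10, 1), 10)
mean(numbers)  # 0.5
sum(numbers)   # 100

# Item 2: paste() joins strings, separated by a single space by default.
left <- "Harris"
right <- "School of Public Policy"
combined <- paste(left, right)

# Item 3: ifelse() is vectorized, so it labels every row at once.
toy <- data.frame(counts = c(85239, 300000, 1000000))  # illustrative values only
toy$index <- ifelse(toy$counts >= 300000, "high", "low")
toy
```

With the tidyverse loaded, the same ifelse() call can be placed inside mutate() to add the column in a pipeline.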

Task 6:
1. For loops: Take numbers <- rep(seq(-9, 10, 1), 10). Using a for-loop, save the square of each
number in a new vector called numbers_squared.
2. For loops: Take numbers. Using a for-loop, save the square of each number and add random noise
using a call to rnorm(1, sd = 5) in a new vector called noisy_numbers_squared.
You should be able to reproduce the graph below:

numbers_data <- tibble(numbers = numbers,
                       noisy_numbers_squared = noisy_numbers_squared)

numbers_data %>%
  ggplot(aes(x = numbers, y = noisy_numbers_squared)) +
  geom_point() +
  geom_smooth()

## `geom_smooth()` using method = 'loess' and formula = 'y ~ x'

[Figure: scatter plot of noisy_numbers_squared (y-axis) against numbers (x-axis) with a smoothed curve, as produced by the code above.]
3. Functions: Write a function called notice_gpa that takes gpa as an input and does the following:

• if gpa is less than 2, prints: “Your GPA is gpa. You are on academic probation.”
• else if gpa is greater than or equal to 3.5, prints: “Your GPA is gpa. You made the Dean’s list.
Congrats!”
• otherwise, prints: “Your GPA is gpa”.

notice_gpa <- function(gpa) {
  if (...) {
    ...
  } else if (...) {
    ...
  } else {
    ...
  }
}

# When running each of the following, you should get different results!
notice_gpa(1.9)
notice_gpa(3.5)
notice_gpa(3)
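
For the two for-loop items at the start of Task 6, a common pattern is to create the result vectors first and then fill them one index at a time. The sketch below is one way to do this. The set.seed() call is an assumption added here so the random noise is reproducible; it is not part of the original task.

```r
numbers <- rep(seq(-9, 10, 1), 10)

# Preallocate the result vectors so the loop does not grow them step by step.
numbers_squared <- numeric(length(numbers))
noisy_numbers_squared <- numeric(length(numbers))

set.seed(1)  # assumption: fixed seed for reproducible noise
for (i in seq_along(numbers)) {
  numbers_squared[i] <- numbers[i]^2
  # One fresh random draw per element, as the task describes.
  noisy_numbers_squared[i] <- numbers[i]^2 + rnorm(1, sd = 5)
}

head(numbers_squared)
```

seq_along(numbers) is used in place of 1:length(numbers) because it also behaves correctly when the vector is empty.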
