0% found this document useful (0 votes)

72 views10 pages

CITS1401 Project#02, Sem2, 2024

Python project

Uploaded by

usamak.com

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

72 views10 pages

CITS1401 Project#02, Sem2, 2024

Python project

Uploaded by

usamak.com

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

CITS1401 Computational Thinking

with Python

Project 2, Semester 2, 2024

Submission deadline: 18th Oct. 2024, 11:59 PM
Total Marks: 30 (Value: 20%)

Project Submission Guidelines

You should construct a Python 3 program containing your solution to the given problem and
submit your program electronically on Moodle.
• The name of the file containing your code should be your student ID e.g. 12345678.py. No
other method of submission is allowed. Please note that this is an individual project.
• Your program will be automatically run on Moodle for sample test cases provided in the
project sheet if you click the “check” link. However, this does not test all required criteria
and your submission will be thoroughly tested manually for grading purposes after the due
date. Remember you need to submit the program as a single file and copy-paste the same
program in the provided text box.
• You have only one attempt to submit so don’t submit if you are not satisfied with your
attempt.
• All open submissions at the time of the deadline will be automatically submitted. There is
no way in the system to open the closed submission and reverse your submission.
• You must submit your project before the deadline listed above. Following UWA policy, a
late penalty of 5% will be deducted for each day (or part day), after the deadline, that the
assignment is submitted.
• No submissions will be allowed after 7 days following the deadline, including special
consideration cases.
You are expected to have read and understood the University's guidelines on academic conduct.
In accordance with this policy, you may discuss with other students the general principles
required to understand this project, but the work you submit must be the result of your own
effort. Plagiarism detection, and other systems for detecting potential malpractice, will
therefore be used. Besides, if what you submit is not your own work then you will have learnt
little and will therefore, likely, fail the final exam.
Project Overview
This project focuses on analysing hospital data from multiple countries, examining various key
factors such as patient mortality rates, disease-specific admissions, staff numbers and patient
demographics etc. The dataset includes information about each hospital's ID, country, the
number of staff and patients, male and female patient distribution, hospital category, bed
availability, and mortality figures for 2022 and 2023. The dataset includes a diverse range of
hospital categories, such as research, children, general, rehabilitation, teaching, speciality,
psychiatric, day surgery, standalone emergency, and outpatient hospitals, with each category
including a varying number of hospitals.

Dataset
The dataset for this project comprises two key files: a CSV and a TXT file. The CSV file
includes information on countries, hospital IDs, hospital categories, and the number of deaths
recorded in 2022 and 2023, among other details. Meanwhile, the TXT file provides a detailed
account of patient admissions for each country and corresponding hospitals, specifying the
number of cases for Covid, Stroke, and Cancer in 2022.
You are required to write a Python 3 program that will read two different files: a CSV file and
a TXT file. After reading the data file(s), your program will perform four different tasks
outlined below.

Tasks

i. Task 1: Generate Country-Specific Hospital Data

Create a list of three dictionaries [Country_to_hospitals, Country_to_death,

Country_to_covid_stroke] to map each country to its hospital IDs, deaths in 2022, and
the total number of patients admitted for covid and stroke in 2022, respectively. The
dictionaries will have country names as keys and lists as values, where each entry in the
lists corresponds to a specific hospital. The order of the entries in all lists across the
dictionaries must be perfectly aligned to ensure consistency across the data. For example:
o Country_to_hospitals['argentina'] = ['H1', 'H2', 'H3']
o Country_to_death['argentina'] = [10, 15, 8] (number of deaths for H1,
H2, H3 in 2022)
o Country_to_covid_stroke['argentina'] = [50, 60, 45] (covid and stroke
admissions for H1, H2, H3 in 2022)
ii. Task 2: Calculate Cosine Similarity

For each country, compute the Cosine Similarity between the number of deaths in 2022
and the total number of patients admitted for covid and stroke. Store these results in a
dictionary (Cosine_dict) with the country names as the keys and the Cosine Similarity
as the values.

iii. Task 3: Analyze Variance in Cancer Admissions

Create a dictionary (Variance_dict) that stores the variance in cancer patient admissions
across hospitals in each country for a specified hospital category (e.g., 'children'). The
key is the country name, and the value is the variance in the number of cancer admissions
for the specified hospital category.

iv. Task 4: Generate Hospital Category Statistics

Create a nested dictionary (Category_Country_dict) to store data for each hospital

category. The outer dictionary will use the hospital category as the key, while the value
will be another dictionary that maps country names to lists of key statistics, including:
o The average number of female patients treated.
o The maximum number of staff in hospitals.
o The percentage change between the average number of deaths from 2022 to 2023
Input
Your program must define the function main with the following syntax:
def main (CSVfile, TXTfile, category):
The input arguments for this function are:
• CSVfile: The name of the CSV file (as string) containing records of hospital data
across different countries. The first row of the CSV file contains the column headings.
• TXTfile: The name of the TXT file (as string) containing records of patients
admitted for covid, stroke, and cancer across various hospitals in different countries.
• category: A string representing the category of hospitals to be analysed. The CSV
file contains information about hospitals from multiple categories.

Output
The following four outputs are expected:
i. OP1= list of dictionary items: [Country_to_hospitals, Country_to_death,
Country_to_covid_stroke].
a) Country_to_hospitals maps each country name to a list of hospitals within that
country. In this dictionary, the key is the name of the country, and the value is a list of
hospital IDs in that country.
b) Country_to_death maps each country name to a list of the number of deaths in
2022 for each hospital in that country. The key is the name of the country, and the value
is a list where each element represents the death count for a corresponding hospital in
that country.
c) Country_to_covid_stroke maps each country name to a list of the total number
of patients admitted for covid and stroke in each hospital in that country. The key is the
name of the country, and the value is a list where each element represents the total
number of covid and stroke admissions for a hospital in that country.
Important Note: It is essential that the order of the entries in the values (lists) for all three
dictionaries (Country_to_hospitals, Country_to_death, and
Country_to_covid_stroke) is perfectly aligned. For example, the first entry in
Country_to_death['brazil']and Country_to_covid_stroke['brazil'] should
relate to the first hospital ID in Country_to_hospitals['brazil'], and so on for the
remaining entries.

ii. OP2= Cosine_dict: A dictionary where the key is the country name, and the value is the
Cosine Similarity between the number of deaths in 2022 and the total number of patients
admitted for covid and stroke in that country.

iii. OP3= Variance_dict: A dictionary where the key is the country name, and the value is
the variance in the number of Cancer patient admissions for a specified category,
such as 'children' across hospitals within that country.

iv. OP4= Category_Country_dict: A nested dictionary that stores information for each
hospital category ‘C’. The outer dictionary uses the hospital category as the key, and the
value is another dictionary ‘D’.
a) The dictionary ‘D’ uses the country name as the key, and the value is a list containing
the following data for hospitals in category ‘C’ within that country:
• The average number of female patients treated in hospitals under category ‘C’.
• The maximum number of staff working in hospitals within category ‘C’.
• The percentage change between the average number of deaths from 2022 to 2023
for hospitals in category ‘C’.

All returned numeric outputs must contain values rounded to four decimal places (if required
to be rounded off). Do not round the values during calculations. Instead, round them only at
the time when you save them into the final output variables.

Requirements
i. You are not allowed to import any external or internal module in python.
ii. Ensure your program does NOT call the input() function at any time. Calling the
input() function will cause your program to hang, waiting for input that automated
testing system will not provide (in fact, what will happen is that if the marking program
detects the call(s), it will not test your code at all which may result in zero grade).
iii. Your program should also not call print()function at any time except for the case of
graceful termination (if needed). If your program encounters an error state and exits
gracefully, it should return a cosine-similarity/variance/mean/percentage-change value of
zero and print an appropriate error message. At no point should you print the program’s
outputs or provide a printout of the program’s progress in calculating such outputs. Outputs
should be returned by the program instead.
iv. Do not assume that the input file names will end in .csv or .txt. File name suffixes such
as .csv and .txt are not mandatory in systems other than Microsoft Windows. Do not
enforce within your program that the file must end with a specific extension, nor should
you attempt to add an extension to the provided file name. Doing so can result in loss of
marks.

Examples

Download hospital_data.csv and disease.txt files from the folder of Project 2

on LMS or Moodle. An example of how you can call your program from the Python shell and
examine the results it returns, is provided below:

>> OP1, OP2, OP3, OP4 = main('hospital_data.csv', 'disease.txt',

'children')
Few details of the returned output variables are:

>> len(OP1)

>> OP1[0]['afghanistan']

['4eb9d3e5cf79b91', 'bba52b87bb6a32f','8a9190a50adf241']

>> OP1[1]['afghanistan']

[20, 2, 12]

>> OP1[2]['afghanistan']

[830, 3898, 6854]

>> len (OP2)

>> OP2['afghanistan']

0.5746

>> OP2['albania']

0.9257

>> OP3['afghanistan']

785004.5

>> OP3['brunei darussalam']

24420.5

>> len(OP4['children'])

>> OP4['children']['canada']

[3925.4, 4448, 22.0588]

Assumptions

Your program can assume the following:

1. All string data in the CSV file and TXT file is case-insensitive, which means “albania”
is same as “AlbAnia” or “covid” is same as “Covid”. Your program needs to handle the
situation to consider both strings to be the same.
2. In the CSV file, the order of columns can be different than the order provided in the
sample file. Also, there can be extra or less columns in the testing files. Moreover, rows
can be in random order except the first row which contains the headings.
3. There can be missing or invalid data in the CSV file; and in such an instance, the entire
row should be ignored. Some examples of invalid data can be negative or zero number
of staff, null/empty values in the required columns. You need to think of other invalid
cases yourself. In case any part of the calculation cannot be performed due to zero
values or other boundary conditions, do a graceful termination by printing an error
message and returning a zero value (for numbers), None for (string) or empty list
depending on the expected outcome. Your program must not crash.
4. All hospital IDs will be unique. There will be no missing or invalid data in the TXT
file.
5. The necessary formulas are provided at the end of this document.

Important grading instruction

Note that you have not been asked to write specific functions. The task has been left to you.
However, it is essential that your program defines the top-level function main(CSVfile,
TXTfile, category) (commonly referred to as ‘main()’ in the project documents to
save space when writing it. Note that when main() is written it still implies that it is defined
with its three input arguments). The idea is that within main(), the program calls the other
functions. (Of course, these functions may then call further functions.) This is important
because when your code is tested on Moodle, the testing program will call your main()
function. So, if you fail to define main(), the testing program will not be able to test your
code and your submission will be graded zero. Don’t forget the submission guidelines provided
at the start of this document.
Marking rubric
24 out of 30 marks will be awarded automatically based on how well your program completes
a number of tests, reflecting normal use of the program, and how the program handles various
states including, but not limited to, different numbers of rows in the input file and / or any error
states. You need to think creatively what your program may face. Your submission will be
graded by data files other than the provided data file. Therefore, you need to be creative to
investigate corner or worst cases. I have provided few guidelines from ACS Accreditation
manual at the end of the project sheet which will help you to understand the expectations.

6 out of 30 marks will be awarded on style (3/6) “the code is clear to read” and efficiency (3/6)
“your program is well constructed and run efficiently”. For style, think about use of comments,
sensible variable names, your name at the top of the program, student ID, etc. (Please watch
the lectures where this is discussed).

Style Rubric:
0 Gibberish, impossible to understand
1 Style is really poor or fair.
2 Style is good or very good, with small lapses.
3 Excellent style, really easy to read and follow

Your program will be traversing text files of various sizes (possibly including large csv files)
so you need to minimise the number of times your program looks at the same data items.
Efficiency rubric:
0 Code too complicated to judge efficiency or wrong problem tackled
1 Very poor efficiency, additional loops, inappropriate use of readline()
2 Acceptable or good efficiency with some lapses
3 Excellent efficiency, should have no problem on large files, etc.

Automated testing is being used so that all submitted programs are being tested the same way.
Sometimes it happens that there is one mistake in the program that means that no tests are
passed. If the marker can spot the cause and fix it readily, then they are allowed to do that and
your - now fixed - program will score whatever it scores from the tests, minus 4 marks, because
other students will not have had the benefit of marker intervention. Still, that's way better than
getting zero. On the other hand, if the bug is hard to fix, the marker needs to move on to other
submissions.
Extract from Australian Computing Society Accreditation manual 2019
As per Seoul Accord section D, a complex computing problem will normally have some or all
of the following criteria:
 involves wide-ranging or conflicting technical, computing, and other issues.
 has no obvious solution and requires conceptual thinking and innovative analysis to
formulate suitable abstract models.
 a solution requires the use of in-depth computing or domain knowledge and an
analytical approach that is based on well-founded principles.
 involves infrequently encountered issues.
 are outside problems encompassed by standards and standard practice for professional
computing.
 involves diverse groups of stakeholders with widely varying needs.
 has significant consequences in a range of contexts.
 is a high-level problem possibly including many component parts or sub-problems.
 identification of a requirement or the cause of a problem is ill defined or unknown.

Necessary formulas

1) Cosine similarity, 𝒄𝒄𝒄𝒄𝒄𝒄(𝜽𝜽)

∑𝑛𝑛𝑖𝑖=1 𝑥𝑥𝑖𝑖 ⋅ 𝑦𝑦𝑖𝑖
𝑐𝑐𝑐𝑐𝑐𝑐(𝜃𝜃) =
𝑛𝑛 𝑛𝑛
��𝑖𝑖=1 𝑥𝑥𝑖𝑖2 ��𝑖𝑖=1 𝑦𝑦𝑖𝑖2

where,
𝑥𝑥𝑖𝑖 = first set of data
𝑦𝑦𝑖𝑖 = second set of data
n = the number of samples

2) Variance, 𝑆𝑆 2

∑(𝑥𝑥𝑖𝑖 − 𝑥𝑥̅ )2
𝑆𝑆 2 =
𝑛𝑛 − 1

where,
𝑆𝑆 2 = sample variance
𝑥𝑥𝑖𝑖 = the value of the one observation
𝑥𝑥̅ = the mean value of all observations
n = the number of observations
3) Average Percentage Change:

Percentage change between the average number of deaths from 2022 to 2023, for all hospitals
of specific hospital type within a country.

𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃 𝐶𝐶ℎ𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎 𝑖𝑖𝑖𝑖 𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴 𝐷𝐷𝐷𝐷𝐷𝐷𝐷𝐷ℎ𝑠𝑠 (𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃)

(𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴 𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑ℎ𝑠𝑠 𝑖𝑖𝑖𝑖 2023) − (𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴 𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑ℎ𝑠𝑠 𝑖𝑖𝑖𝑖 2022)

𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃 = × 100%
𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴 𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑ℎ𝑠𝑠 𝑖𝑖𝑖𝑖 2022

Genius 7 On Bill Acceptor Manual
100% (1)
Genius 7 On Bill Acceptor Manual
10 pages
QMM1001 Applied Activity 2
No ratings yet
QMM1001 Applied Activity 2
2 pages
IP Projects For Class Xii
0% (1)
IP Projects For Class Xii
20 pages
Extended - Basic Eda Python Fellow
No ratings yet
Extended - Basic Eda Python Fellow
22 pages
Pandas Practicals - Term-1
100% (1)
Pandas Practicals - Term-1
18 pages
FINE Marine PDF
No ratings yet
FINE Marine PDF
8 pages
How To Perform An ATC Check Automatically During Transport TASK Release - SAP Blogs
No ratings yet
How To Perform An ATC Check Automatically During Transport TASK Release - SAP Blogs
8 pages
VR Integrated Heritage Recreation - Using Blender and Unreal Engine 4
100% (1)
VR Integrated Heritage Recreation - Using Blender and Unreal Engine 4
336 pages
New A Class w177 PDF
No ratings yet
New A Class w177 PDF
104 pages
Essential Software Assignment 3
No ratings yet
Essential Software Assignment 3
2 pages
r.jeevitha
No ratings yet
r.jeevitha
16 pages
Maheswari Public School Kalwar Road: Project File Session 2023-24
No ratings yet
Maheswari Public School Kalwar Road: Project File Session 2023-24
28 pages
Informatics Practices Project 12 New
No ratings yet
Informatics Practices Project 12 New
31 pages
COVID 19 Pandemic Analysis
No ratings yet
COVID 19 Pandemic Analysis
26 pages
Python Codes and Comments
No ratings yet
Python Codes and Comments
5 pages
CS685: Data Mining: Assignment 1 (100 Marks) Due On: 13th September, 2021, 11:00pm
No ratings yet
CS685: Data Mining: Assignment 1 (100 Marks) Due On: 13th September, 2021, 11:00pm
2 pages
COMP551 Fall 2020 P1
No ratings yet
COMP551 Fall 2020 P1
4 pages
Artificial Intelligence Project Report
No ratings yet
Artificial Intelligence Project Report
15 pages
Report - Data Visualization and Exploration
No ratings yet
Report - Data Visualization and Exploration
14 pages
DIY Project - Data Mining and Analytics2
No ratings yet
DIY Project - Data Mining and Analytics2
1 page
Computer Science Ip
No ratings yet
Computer Science Ip
16 pages
Assignment Sujith S
No ratings yet
Assignment Sujith S
13 pages
Assignment Statistics and Probability H118-HW1
No ratings yet
Assignment Statistics and Probability H118-HW1
2 pages
mini
No ratings yet
mini
6 pages
I.p Project
No ratings yet
I.p Project
24 pages
Assignment - Ipynb - Colaboratory
No ratings yet
Assignment - Ipynb - Colaboratory
14 pages
covid data report
No ratings yet
covid data report
21 pages
Green Minimalist Healthcare Flyer
No ratings yet
Green Minimalist Healthcare Flyer
37 pages
DAV Guidelines
No ratings yet
DAV Guidelines
4 pages
COVID 19 Pandemic Analysis class 12 practicals (1) (2)
No ratings yet
COVID 19 Pandemic Analysis class 12 practicals (1) (2)
29 pages
Unraveling The Interplay of Age, Hospitalization and Mortality 2
No ratings yet
Unraveling The Interplay of Age, Hospitalization and Mortality 2
30 pages
Project Documentaiotn - InDIA Abellllll
No ratings yet
Project Documentaiotn - InDIA Abellllll
27 pages
AI Practical Project
No ratings yet
AI Practical Project
15 pages
R Programming Exercice
No ratings yet
R Programming Exercice
5 pages
csv file (1)
No ratings yet
csv file (1)
20 pages
Syadatajveez
No ratings yet
Syadatajveez
21 pages
SBA 2020-2021 v2.1
No ratings yet
SBA 2020-2021 v2.1
12 pages
Case Study Guidelines
No ratings yet
Case Study Guidelines
7 pages
COMP2501 - Assignment - 1 - Questions - RMD 2
No ratings yet
COMP2501 - Assignment - 1 - Questions - RMD 2
7 pages
Screenshot 2024-11-07 at 8.59.45 PM
No ratings yet
Screenshot 2024-11-07 at 8.59.45 PM
15 pages
Python Practical Questions@Subas
No ratings yet
Python Practical Questions@Subas
7 pages
Prog Assignment 3
No ratings yet
Prog Assignment 3
10 pages
DS Lab-5 GP-Anirudh 180905452 B2 59
No ratings yet
DS Lab-5 GP-Anirudh 180905452 B2 59
27 pages
R Project
No ratings yet
R Project
2 pages
Final Group Project
No ratings yet
Final Group Project
26 pages
Ashutosh Project
No ratings yet
Ashutosh Project
19 pages
Project File -A
No ratings yet
Project File -A
20 pages
COVID-19 Clinical Trials EDA Pandas
No ratings yet
COVID-19 Clinical Trials EDA Pandas
30 pages
SC Cat
No ratings yet
SC Cat
6 pages
Covid 19 India Dashboard Using Python and Voila
No ratings yet
Covid 19 India Dashboard Using Python and Voila
6 pages
QA Unit 4
No ratings yet
QA Unit 4
2 pages
CS2B - April23 - EXAM - Clean Proof - v2
No ratings yet
CS2B - April23 - EXAM - Clean Proof - v2
8 pages
Problem IMPLEMENTATION START (1) 2
No ratings yet
Problem IMPLEMENTATION START (1) 2
8 pages
Programming Assignment 3
No ratings yet
Programming Assignment 3
5 pages
AP Computer Science Principles: Student-Crafted Practice Tests For Excellence
From Everand
AP Computer Science Principles: Student-Crafted Practice Tests For Excellence
Sama Alshatali
No ratings yet
Ip Project
No ratings yet
Ip Project
23 pages
Project Description
No ratings yet
Project Description
3 pages
Corona Virus in India
No ratings yet
Corona Virus in India
29 pages
manishadav
No ratings yet
manishadav
27 pages
HY Exam Revision (11/9/2024)
No ratings yet
HY Exam Revision (11/9/2024)
15 pages
IP Project Covid-19 Impact
No ratings yet
IP Project Covid-19 Impact
25 pages
Manya IP Project
No ratings yet
Manya IP Project
25 pages
obermeyer-sample
No ratings yet
obermeyer-sample
8 pages
Name
No ratings yet
Name
23 pages
PA Assignment 1 Oct2021
No ratings yet
PA Assignment 1 Oct2021
8 pages
Preventive Maintenance Guide
No ratings yet
Preventive Maintenance Guide
144 pages
Gmail - Ethiopian Trip Reminder
No ratings yet
Gmail - Ethiopian Trip Reminder
3 pages
Solid Works Praktikum
No ratings yet
Solid Works Praktikum
104 pages
B.Sc. Visual Communication Syllabus - 2020
No ratings yet
B.Sc. Visual Communication Syllabus - 2020
43 pages
POP Mdel Paper 3 Solution
No ratings yet
POP Mdel Paper 3 Solution
46 pages
Ertiga 2021 11 17 2021 12 17
No ratings yet
Ertiga 2021 11 17 2021 12 17
6 pages
Bosch CCTV Divar MR Operating Manual PDF
100% (1)
Bosch CCTV Divar MR Operating Manual PDF
136 pages
FortiOS v4.0 MR3 Patch Release 12 Release Notes
No ratings yet
FortiOS v4.0 MR3 Patch Release 12 Release Notes
36 pages
ICT NOTES
No ratings yet
ICT NOTES
17 pages
Bridge Design Manual
100% (3)
Bridge Design Manual
937 pages
AES 17 Conference Mp3 and AAC Explained AES17
No ratings yet
AES 17 Conference Mp3 and AAC Explained AES17
12 pages
Palo Alto Lab
100% (1)
Palo Alto Lab
87 pages
2024 12 31 Statement
No ratings yet
2024 12 31 Statement
3 pages
Assignment 1 Automata
No ratings yet
Assignment 1 Automata
10 pages
KPIs and Individual Accomplishment
No ratings yet
KPIs and Individual Accomplishment
6 pages
Ricef Document
No ratings yet
Ricef Document
8 pages
Time Table for Winter 2024 Theory Examination 1586
No ratings yet
Time Table for Winter 2024 Theory Examination 1586
5 pages
Azure Course Content PDF
No ratings yet
Azure Course Content PDF
2 pages
DATA MINING Project Report
No ratings yet
DATA MINING Project Report
28 pages
Guide To en 1991 1 4 Wind Actions
100% (4)
Guide To en 1991 1 4 Wind Actions
22 pages
Assignment 1 Report: Name: Manuja Sellahewa Student ID: Lab Class: Teacher:Miss - Lushaka Nissansala
No ratings yet
Assignment 1 Report: Name: Manuja Sellahewa Student ID: Lab Class: Teacher:Miss - Lushaka Nissansala
9 pages
Low Voltage - Reading Material
No ratings yet
Low Voltage - Reading Material
47 pages
Servicenow-Certification-Faq 20200128
No ratings yet
Servicenow-Certification-Faq 20200128
17 pages
BTCS-403_M2019
No ratings yet
BTCS-403_M2019
2 pages
ESIME-ZAC MAESTRIA Basilio Ortiz Manuel
No ratings yet
ESIME-ZAC MAESTRIA Basilio Ortiz Manuel
150 pages

CITS1401 Project#02, Sem2, 2024

Uploaded by

CITS1401 Project#02, Sem2, 2024

Uploaded by

CITS1401 Computational Thinking

Project 2, Semester 2, 2024

Project Submission Guidelines

i. Task 1: Generate Country-Specific Hospital Data

Create a list of three dictionaries [Country_to_hospitals, Country_to_death,

iii. Task 3: Analyze Variance in Cancer Admissions

iv. Task 4: Generate Hospital Category Statistics

Create a nested dictionary (Category_Country_dict) to store data for each hospital

Download hospital_data.csv and disease.txt files from the folder of Project 2

>> OP1, OP2, OP3, OP4 = main('hospital_data.csv', 'disease.txt',

[830, 3898, 6854]

>> len (OP2)

>> OP3['brunei darussalam']

[3925.4, 4448, 22.0588]

Your program can assume the following:

Important grading instruction

1) Cosine similarity, 𝒄𝒄𝒄𝒄𝒄𝒄(𝜽𝜽)

𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃 𝐶𝐶ℎ𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎 𝑖𝑖𝑖𝑖 𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴 𝐷𝐷𝐷𝐷𝐷𝐷𝐷𝐷ℎ𝑠𝑠 (𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃)

(𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴 𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑ℎ𝑠𝑠 𝑖𝑖𝑖𝑖 2023) − (𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴 𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑ℎ𝑠𝑠 𝑖𝑖𝑖𝑖 2022)

You might also like