Week1 1 Rev
• What is R?
• Why do we need to learn R programming for business analytics?
• What will you learn from this course?
• Summary of key class details
• It’s open-source
• No fees or licenses are needed
• It’s platform-independent. R runs on all operating systems
• It has more than 10,000 packages
• It’s useful for visualizing datasets
• It supports the latest cutting-edge techniques (e.g., machine learning, natural language processing)
• It’s a practical tool that could help you get your future job
Real example
• Collaboration with an online dating platform in South Korea
Real example
• Check the types of variables in the dataset
✓ Datasets contain many types of variables
✓ Numeric variables (e.g., age, attraction score)
✓ String variables (e.g., occupation)
✓ List variables (e.g., religion, preferred age)
• Example:
• Religion – 1: Catholic, 2: Buddhism, etc.
• Preferred age – 1: 4 years older than the user, 2: 2 or 3 years older than the user, etc.
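The variable types listed above can be inspected in R roughly as follows. This is a minimal sketch: the data frame `users`, its column names, and all values are made up for illustration and are not the platform's data. Storing the coded `religion` column as a factor is one common choice, not necessarily the one used in the project.

```r
# Hypothetical toy data; names and values are illustrative only.
users <- data.frame(
  age        = c(28, 34, 31),                      # numeric variable
  occupation = c("teacher", "engineer", "nurse"),  # string (character) variable
  religion   = c(1, 2, 1),                         # coded category: 1 = Catholic, 2 = Buddhism
  stringsAsFactors = FALSE
)

# Show the structure (type of every column) at a glance.
str(users)

# Attach readable labels to the numeric codes by converting to a factor.
users$religion <- factor(users$religion,
                         levels = c(1, 2),
                         labels = c("Catholic", "Buddhism"))

# Collect the class of each column in a named character vector.
var_types <- sapply(users, class)
print(var_types)
```

Converting codes to a labelled factor keeps tables and charts readable, since R then prints the category names instead of the raw numbers.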
Real example
• Clean and organize the dataset
✓ R provides several packages to organize datasets
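A typical cleaning pass might look like the sketch below. It uses only base R so that it runs without installing anything; packages such as dplyr and tidyr are commonly used for the same steps. The data frame `raw` and its values are hypothetical.

```r
# Hypothetical messy data; values are illustrative only.
raw <- data.frame(
  id         = c(1, 2, 2, 3, 4),
  age        = c(28, 34, 34, NA, 41),
  occupation = c(" Teacher", "engineer", "engineer", "NURSE ", "teacher"),
  stringsAsFactors = FALSE
)

clean <- raw[!duplicated(raw), ]                       # drop exact duplicate rows
clean <- clean[!is.na(clean$age), ]                    # drop rows with a missing age
clean$occupation <- tolower(trimws(clean$occupation))  # trim spaces, unify case
clean <- clean[order(clean$age), ]                     # sort by age
rownames(clean) <- NULL                                # reset row numbering

print(clean)
```

Whether to drop or fill in missing values depends on the analysis; dropping them here is only the simplest option.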
Real example
• Check statistics and distributions
✓ To understand the data
✓ To get an idea
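Checking statistics and distributions can be sketched in base R as below. The vector `scores` holds made-up attraction scores on an assumed 1 to 5 scale; it is not the platform's data.

```r
# Hypothetical attraction scores; values are illustrative only.
scores <- c(3, 5, 4, 4, 2, 5, 3, 4, 4, 1)

# Basic descriptive statistics.
stats <- c(mean = mean(scores), median = median(scores), sd = sd(scores))
print(round(stats, 2))
print(summary(scores))

# Frequency of each score value.
freq <- table(scores)
print(freq)

# Histogram of the distribution, one bar per score value.
hist(scores, breaks = 0:5, main = "Attraction score", xlab = "Score")
```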
Real example
• Provide charts and graphs
✓ To persuade the online dating platform
Example
• rmarkdown-cheatsheet (rstudio.com)
• We have data about 53940 diamonds
• Only 126 are larger than 2.5 carats
• The distribution is shown below
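The two counts above can be reproduced with a short script. This sketch assumes the ggplot2 package is installed, since the `diamonds` dataset ships with it; the variable names `n_total` and `n_large` are illustrative, and the base R histogram is only a stand-in for whatever plot the slide showed.

```r
# Assumes ggplot2 is installed; the diamonds dataset is part of that package.
if (requireNamespace("ggplot2", quietly = TRUE)) {
  diamonds <- ggplot2::diamonds

  n_total <- nrow(diamonds)             # number of diamonds in the dataset
  n_large <- sum(diamonds$carat > 2.5)  # diamonds larger than 2.5 carats

  cat("We have data about", n_total, "diamonds\n")
  cat("Only", n_large, "are larger than 2.5 carats\n")

  # Distribution of carat for the remaining diamonds (carat <= 2.5).
  smaller <- diamonds$carat[diamonds$carat <= 2.5]
  hist(smaller, breaks = 50, main = "Diamond sizes", xlab = "Carat")
} else {
  message("ggplot2 is not installed; run install.packages(\"ggplot2\") first.")
}
```

In an R Markdown document the same values would typically be embedded in the text with inline code, so the sentence updates automatically when the data changes.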
Course schedule
Contacting us
• Email
✓ Instructor: [email protected]
✓ TA: [email protected]
✓ Begin the subject line with [ISOM3390]…
Course website
• http://canvas.ust.hk
Participation (10%)
• In-Class Participation (10%)
✓ Students are expected to attend the lectures (5%)
✓ Students are also expected to attend the lab sessions (5%)
Homework (20%)
• There will be hands-on homework assignments (using the Canvas website)
• The due date of each homework assignment will be announced upon its release on Canvas
Late policy
• A 20% penalty will be applied for each day or part of a day that an assignment is late
• For instance,
✓ if you are 1 day late in submission → 80% of your points for the submission
✓ if you are 2 days late in submission → 60% (reduction of 2 × 20%) of your points for the submission
✓ if you are late by 5 days → 0% (reduction of 5 × 20%), so you are better off NOT submitting the deliverable
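The late-penalty arithmetic can be written as a small R function. This is a sketch of the rule as stated above, not official grading code: the name `late_percent` is hypothetical, and capping the result at 0% for submissions more than 5 days late is an assumption.

```r
# Percentage of earned points kept after a late submission.
# days_late may be fractional: any part of a day counts as a full day.
late_percent <- function(days_late) {
  full_days <- ceiling(pmax(days_late, 0))  # round partial days up
  pmax(100 - 20 * full_days, 0)             # 20 points off per day, never below 0 (assumed cap)
}

# On time, 1 day, 1.5 days, 2 days, and 5 days late.
kept <- late_percent(c(0, 1, 1.5, 2, 5))
print(kept)
```

For example, a homework scored 90 and submitted 2 days late would count as `90 * late_percent(2) / 100`, which is 54 points.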