L1 - Introduction To Data Science
R is one of the most popular languages among data scientists. The following are
some of the key reasons why we will use R:
R has been a reliable and useful tool in academia for many years. Traditionally,
R was used for academic research because it provides a wide range of
statistical tools for analysis. With advances in data science and the
growing need to analyze data, R became a popular choice in industry as well.
R is well suited to data wrangling. It offers several ready-made packages,
such as dplyr and tidyr, that make data wrangling a lot easier.
R provides the ggplot2 package, which is well known for its
visualizations. The package produces polished graphics that can represent a
wide range of data operations. Furthermore, ggplot2 plots can be given a degree
of interactivity through companion packages such as plotly, so that users can
understand the data embedded in the visualization more clearly.
R offers machine learning packages for various operations. Whether for
boosting, building random forests, or performing regression and classification,
R provides a wide array of packages.
Domain knowledge