Step 2 Big Data Analytics and Machine Learning
Step 2 Big Data Analytics and Machine Learning
Ibagué, Tolima
26 de Septiembre de 2024
Table of contents
Introduction ........................................................................................................................................... 3
Methodology .......................................................................................................................................... 4
Activity 1 ............................................................................................................................................ 4
Activity 2 ............................................................................................................................................ 5
Activity 3. ........................................................................................................................................... 6
Activity 4. ........................................................................................................................................... 8
Activity 5 ................................................................................................................................................ 9
Activity 6 ............................................................................................................................................ 9
Conclusions .......................................................................................................................................... 10
Bibliographic References .................................................................................................................... 11
Introduction
Big Data and Machine Learning are two fundamental pillars of the current digital
transformation. Big Data refers to the management of large volumes of data coming from
various sources such as social media, sensors, online transactions, mobile devices, among
others. These data are massive, generated rapidly, and in very diverse formats, presenting both
an opportunity and a challenge for analysis and management.
On the other hand, Machine Learning is a branch of artificial intelligence that allows machines
to learn from data, identify patterns, make predictions, and improve performance without being
explicitly programmed. Using algorithms, machines analyze large amounts of data and uncover
complex patterns that enable predictions or decision automation.
The combination of Big Data and Machine Learning enables organizations to extract value from
their data efficiently, resulting in better strategic decisions, optimized processes, and
personalized products or services. This synergy is applied across multiple areas such as
healthcare, commerce, finance, and the tech industry, revolutionizing how we interact with
technology and manage information.
Methodology
Activity 1. Conceptual Map For the development of this exercise, it is necessary to review the
references in the Learning Environment (Unit 1 - Historical interpretation and review of Big
DataContents and bibliographic references) After reviewing the suggested references, the
student must make a conceptual map with the following concepts: • Business Analytics. • Data
and Statistical Methods. For the construction of the conceptual map, tools such as Cmaptools,
GoCongr, PowerPoint, among others, can be used; then, this conceptual map has to be shared
in the discussion forum.
Activity 2. Description of Data domain Using an illustrative scheme, you should portray a Venn
Diagram of the 5 Vs attributes of Big Data, including the following points for statistics domain:
• Description of Data processing. • Data analysis. • Data visualization.
Machine Learning has made it possible to automate complex tasks that previously required
human intervention, improving efficiency and reducing errors. Additionally, organizations can
personalize products and services based on analysis of large amounts of behavioral data,
improving the customer experience.
Companies that adopt Big Data and Machine Learning technologies gain a significant
competitive advantage as they can respond faster to market trends, identify opportunities for
innovation, and detect potential risks more accurately.
Although Big Data and Machine Learning offer great benefits, they also present challenges,
such as the need for powerful infrastructures to store and process data. Additionally, handling
large volumes of sensitive data raises privacy and security concerns, which must be
appropriately addressed.
Bibliographic References
Géron, A. (2019). Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow (2nd
ed.). O'Reilly Media.
Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep Learning. MIT Press.
Russell, S., & Norvig, P. (2021). Artificial Intelligence: A Modern Approach (4th ed.). Pearson.
HubSpot. (2023). ¿Qué es Big Data? Todo lo que necesitas saber. HubSpot.
IEBSchool. (s.f.). Las 5 V’s del Big Data y cómo aplicarlas en la empresa.
PowerData. (s.f.). ¿Qué es Big Data y cómo puede mejorar tu negocio?. PowerData.
Sutton, R. S., & Barto, A. G. (2018). Reinforcement Learning: An Introduction (2nd ed.). MIT
Press.
Zaharia, M., Wendell, P., Das, T., & Xin, R. (2020). Learning Spark: Lightning-fast Data
Analytics (2nd ed.). O'Reilly Media.