0% found this document useful (0 votes)
18 views

Step 2 Big Data Analytics and Machine Learning

Base de datos

Uploaded by

romacer96
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
18 views

Step 2 Big Data Analytics and Machine Learning

Base de datos

Uploaded by

romacer96
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 11

BIG DATA INTEGRATION

Step 2 Big Data Analytics and Machine Learning

Julián Esteban Cerquera Arévalo


Cluster_ 203008077A_1704

Universidad Nacional Abierta y a Distancia (UNAD)


Escuela de Ciencias Básicas, Tecnología e Ingeniería (ECBTI)
Tutor: JORGE LUIS QUINTERO

Ibagué, Tolima
26 de Septiembre de 2024
Table of contents

Introduction ........................................................................................................................................... 3
Methodology .......................................................................................................................................... 4
Activity 1 ............................................................................................................................................ 4
Activity 2 ............................................................................................................................................ 5
Activity 3. ........................................................................................................................................... 6
Activity 4. ........................................................................................................................................... 8
Activity 5 ................................................................................................................................................ 9
Activity 6 ............................................................................................................................................ 9
Conclusions .......................................................................................................................................... 10
Bibliographic References .................................................................................................................... 11
Introduction

Big Data and Machine Learning are two fundamental pillars of the current digital
transformation. Big Data refers to the management of large volumes of data coming from
various sources such as social media, sensors, online transactions, mobile devices, among
others. These data are massive, generated rapidly, and in very diverse formats, presenting both
an opportunity and a challenge for analysis and management.
On the other hand, Machine Learning is a branch of artificial intelligence that allows machines
to learn from data, identify patterns, make predictions, and improve performance without being
explicitly programmed. Using algorithms, machines analyze large amounts of data and uncover
complex patterns that enable predictions or decision automation.

The combination of Big Data and Machine Learning enables organizations to extract value from
their data efficiently, resulting in better strategic decisions, optimized processes, and
personalized products or services. This synergy is applied across multiple areas such as
healthcare, commerce, finance, and the tech industry, revolutionizing how we interact with
technology and manage information.
Methodology

Activity 1. Conceptual Map For the development of this exercise, it is necessary to review the
references in the Learning Environment (Unit 1 - Historical interpretation and review of Big
DataContents and bibliographic references) After reviewing the suggested references, the
student must make a conceptual map with the following concepts: • Business Analytics. • Data
and Statistical Methods. For the construction of the conceptual map, tools such as Cmaptools,
GoCongr, PowerPoint, among others, can be used; then, this conceptual map has to be shared
in the discussion forum.

Image 1. From the autor.


Image 2. From the author

Activity 2. Description of Data domain Using an illustrative scheme, you should portray a Venn
Diagram of the 5 Vs attributes of Big Data, including the following points for statistics domain:
• Description of Data processing. • Data analysis. • Data visualization.

Image 3. From the author.


Activity 3. Description of Data training, validation, and test Taking into account the
bibliographic references and others sources, the student must create a presentation of 3 slides
including the explanation of the Data training, validation and test.

Image 4. From the author.


Activity 4. The distinction of Machine Learning for computer processing Based on the
references, you have to make a comparison chart where you include and explain the importance
of Machine Learning for computer processing (Unsupervised, Supervised and Reinforcement
Learning).

Image 5. From the autor.


Activity 5. Pass and obtain accreditation Big Data 101, for the IBM certification. For the
development of this exercise, it is necessary to review the references in the Learning
Environment (Unit 1 - Historical interpretation and review of Big DataContents and
bibliographic references). After reviewing the suggested references, the student will go on the
Cognitive Class platform as a continuation of your academic progression. The task involves
your enrollment in a course that builds upon the concepts and lessons covered in our prior
learning guide activities. By successfully completing this course, you will not only exhibit your
comprehensive understanding of the material but also obtain the esteemed IBM certification,
symbolizing your mastery in this domain.

Image 6. From the autor.

Activity 6. Socialization in the Forum


Conclusions

Machine Learning has made it possible to automate complex tasks that previously required
human intervention, improving efficiency and reducing errors. Additionally, organizations can
personalize products and services based on analysis of large amounts of behavioral data,
improving the customer experience.

Companies that adopt Big Data and Machine Learning technologies gain a significant
competitive advantage as they can respond faster to market trends, identify opportunities for
innovation, and detect potential risks more accurately.

Although Big Data and Machine Learning offer great benefits, they also present challenges,
such as the need for powerful infrastructures to store and process data. Additionally, handling
large volumes of sensitive data raises privacy and security concerns, which must be
appropriately addressed.
Bibliographic References

Alpaydin, E. (2020). Introduction to Machine Learning (4th ed.). MIT Press.

ArcGIS. (s.f.). Realizar análisis de big data. Esri.

Géron, A. (2019). Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow (2nd
ed.). O'Reilly Media.

Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep Learning. MIT Press.
Russell, S., & Norvig, P. (2021). Artificial Intelligence: A Modern Approach (4th ed.). Pearson.

HubSpot. (2023). ¿Qué es Big Data? Todo lo que necesitas saber. HubSpot.

IEBSchool. (s.f.). Las 5 V’s del Big Data y cómo aplicarlas en la empresa.

KeepCoding. (2022). ¿Cómo funciona la estadística en Big Data?. KeepCoding.

PowerData. (s.f.). ¿Qué es Big Data y cómo puede mejorar tu negocio?. PowerData.

Sutton, R. S., & Barto, A. G. (2018). Reinforcement Learning: An Introduction (2nd ed.). MIT
Press.

Tableau. (s.f.). ¿Qué es la visualización de datos?. Tableau.

Zaharia, M., Wendell, P., Das, T., & Xin, R. (2020). Learning Spark: Lightning-fast Data
Analytics (2nd ed.). O'Reilly Media.

You might also like