0% found this document useful (0 votes)

254 views

Data Analysis and Visualization Summer Training Report

data analysis and visualization summer training report

Uploaded by

shashank78199

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

254 views

Data Analysis and Visualization Summer Training Report

data analysis and visualization summer training report

Uploaded by

shashank78199

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 14

SUMMER TRAINING REPORT

Data Analysis and Visualization

Submitted in partial fulfillment of the

Requirements for the award of

Degree of Bachelor of Technology in Computer Science and Engineering

Submitted By

Name: Himanshu Sharma

Enrolment No. 00227902721 Semester: 7TH

SUBMITTED TO:

Dr. Shailendra Kumar

Department of Computer Science and

Engineering
TRINITY INSTITUTE OF INNOVATIONS IN
PROFESSIONAL STUDIES,
Greater Noida, (U P)
CERTIFICATE
CERTIFICATE
DECLARATION

I hereby declare that the Industrial Training Report on data Analysis and Visualization

Industry is an authentic record of my own work as requirements of Minor Industrial Training

during the period from 1st sept to 26th sept or the award of degree of B. Tech. CSE,

Trinity Institute of Innovations in Professional Studies, Greater Noida, (U P), under the

guidance of Dr Shailendra Kumar.

(Signature of student)
Himanshu Sharma
00227902721

Date: 26 /10/2024

Certified that the above statement made by the student is correct to the best of our knowledge and

belief.

Examined by:

(Signature)

Dr Shailendra Kumar

(Signature)
Head of Department
ACKNOWLEDGEMENT

First and foremost, I wish to express my sincere thanks and gratitude to my esteemed Mentor

“Accenture North America” who has contributed so much for successful completion of my

Industrial Training by his thoughtful reviews and valuable guidance.

Next I would like to tender my sincere thanks to “Gunjan Arya” (Head of CSE Department)

for his co-operation and encouragement.

(Signature of student)

Himanshu Sharma

00227902721
LIST OF CONTENTS

Content Page No.

Certificate by Company/Industry i
Declaration by student ii
Acknowledgement iii
Table of Contents iv
List of Tables v
List of Figures vi
Abbreviations and Nomenclature (If any) vii

1. Chapters 1-10
1.1 Introduction 1

1.2 Theory 2

LIST OF FIGURES

Figure Caption Page

No. No.
1.1 Figure caption title 2
1.2 Figure caption title 3

2.1 Figure caption title 4

2.2 Figure caption title 6

ABBREVIATIONS AND NOMENCLATURE

DATA UNDERSTANDING - Understanding data and data analysis involves grasping various
concepts, including the types of data, methods of collection, and how to interpret findings. Data
can be quantitative (numerical) or qualitative (categorical), and it can be gathered through
surveys, experiments, or observational studies. Structuring this data effectively, often in tables
or databases, is crucial for analysis.

DATA CLEANING - Data cleaning is a crucial step in data analysis that ensures the accuracy
and reliability of the dataset. It involves identifying and correcting errors or inconsistencies,
starting with the detection of missing values, which can be addressed by removing records or
imputing values.

DATA MODELING - Data modeling is the process of creating a conceptual representation of

data and its relationships within a system. It serves as a blueprint for organizing and managing
data in databases and applications.

1. Conceptual Models: High-level representations that outline the overall structure and
relationships without delving into technical details.
2.Logical Models: More detailed than conceptual models, these specify the data elements,
attributes, and relationships while remaining independent of a specific database management
system.
3.Physical Models: These represent how data is stored in the database, including specific
tables, columns, data types, and indexing strategies.

DATA VISUALIZATION - Data visualization is the graphical representation of information

and data, allowing for easier interpretation and insights. By using visual elements like charts,
graphs, and maps, data visualization helps communicate complex data in a clear and accessible
way.

Types of Visualizations
1.Charts: Bar charts, line charts, pie charts, and scatter plots are commonly used to represent
relationships and trends.
2.Maps: Geographic data is effectively displayed using heat maps and choropleth maps show
patterns across different locations.
3.Dashboards: Combining multiple visualizations into a single interface allows for
comprehensive data analysis briefly.
INTRODUCTION

Accenture is a global professional services firm specializing in consulting, digital

transformation, technology, and operations. With a commitment to delivering high-quality
services and innovative solutions, Accenture operates in multiple sectors such as finance,
healthcare, and retail. I chose to intern at Accenture due to its strong reputation for fostering
skill development and its focus on cutting-edge technologies, making it an ideal environment
for learning about data analysis and visualization.
My academic background in data science, combined with hands-on experience in data analysis
and visualization, has equipped me with the skills necessary to excel in this role. I have
developed a strong proficiency in tools such as Python, R, and Tableau, which allow me to
manipulate large datasets and create compelling visualizations that tell a story.
During my studies, I undertook several projects where I analysed complex datasets, identifying
trends and patterns that informed key decisions. These experiences have not only honed my
technical abilities but also deepened my understanding of how data can drive business
strategies. I am particularly inspired by Accenture’s focus on using advanced analytics and
technology to solve real-world challenges and enhance client outcomes.
I am eager to collaborate with talented professionals and contribute to projects that impact
diverse industries. I believe that my analytical mindset, attention to detail, and passion for data-
driven solutions will enable me to make meaningful contributions to your team. This internship
represents an invaluable opportunity for me to learn from industry leaders while applying my
skills to contribute to Accenture's mission of delivering high-impact insights. I look forward to
the possibility of being part of such a forward-thinking organization and helping clients
navigate their data journeys.

SKILLS I HAVE ACQUIRED IN THIS TRAINING

Data Analysis
1. Data Cleaning:
o Removing duplicates, correcting errors, and handling missing values.
2. Exploratory Data Analysis (EDA):
o Using statistics and visualization (e.g., histograms, scatter plots) to understand
patterns and trends.
3. Statistical Analysis:
o Descriptive Statistics: Mean, median, mode, standard deviation.
o Inferential Statistics: Making predictions or inferences about a population based
on a sample.
4. Data Visualization:
o Creating graphs and charts to represent data visually (e.g., bar charts, line
graphs).
5. Predictive Analysis:
o Using models (e.g., regression analysis, machine learning) to predict future
outcomes.
Data Cleaning
1. Identifying Missing Values
 Detection: Use techniques to find missing data points.
 Handling: Options include removing records, imputing values (mean, median, mode), or leaving
them as-is depending on the analysis context.
2. Removing Duplicates
 Detection: Identify duplicate entries that may skew results.
 Resolution: Remove duplicates to maintain a unique dataset.
3. Correcting Errors
 Typos and Inconsistencies: Standardize formats (e.g., date formats, categorical labels) and
correct spelling mistakes.
 Outliers: Identify and assess outliers to determine if they are valid data points or errors that need
correction.
4. Standardizing Data
 Ensure uniformity in data formats (e.g., converting all text to lowercase, ensuring consistent
units of measurement).
5. Validating Data Accuracy
 Cross-check data against reliable sources to confirm its accuracy, especially for critical fields.

Data Modelling
1.Entities and Attributes
 Entities: These are the primary objects or concepts that hold data (e.g., customers,
orders).
 Attributes: These are the characteristics or properties of entities (e.g., a customer’s name,
email, and purchase history).
2. Relationships
 Data modelling defines how entities relate to each other, such as one-to-one, one-to-
many, or many-to-many relationships. Understanding these relationships is crucial for
accurately representing the data structure.
3. Normalization
 This process involves organizing data to minimize redundancy and improve data
integrity. It helps ensure that dependencies are properly maintained, and that data is
stored efficiently.
4. Data Modelling Tools
 Various tools are available for data modelling, including ERD (Entity-Relationship
Diagram) software like Lucid chart, MySQL Workbench, and modelling languages like
UML (Unified Modelling Language).
5. Importance of Data Modelling
 Effective data modelling leads to better database design, enhances data consistency,
supports improved data management, and provides a clear framework for understanding
and analysing data relationships.
What Have I Created
In this project we worked on data sets, and we did data analysis, data visualization, modelling of
data and cleaning it. Let’s take a look we I have done in this project.

Process 1.
PROCESS 2.
In this we did Requirement Gathering.

PROCESS 3.
In this we have different data sets and we will perform data cleaning.
Data sets Before Data Cleaning and Modelling

Data sets After Data cleaning and modelling

Description of Internship Content

During my internship, I was involved in multiple projects focused on analyzing consumer

behavior data for a retail client. One of the key projects was to analyze sales data from the last
five years to identify trends and forecast future performance. The project’s primary goal was to
provide insights that would assist the client in optimizing inventory and marketing strategies.I
have understand very much about data analysis and data sets and working with them and also, I
have learned many related concepts like data cleaning and data understanding and modelling of
data. This internship helps me with my future understanding of data analysis and visualization.
For the start the whole team who are working on these kinds of projects help us intern to
understand the core and complex concepts of data analysis in such an easy way.

Internship Experience

The internship experience was both challenging and rewarding. One major challenge I faced
was dealing with incomplete data that hindered initial analysis. To overcome this, I developed a
systematic approach to identify missing values, use statistical methods for imputation, and
document my findings for transparency.

This internship has profoundly impacted my career development by enhancing my analytical

thinking and problem-solving skills. I learned how to present data findings in a narrative form
that communicates effectively with non-technical stakeholders. Moreover, I appreciated
Accenture’s management style that emphasized collaboration and innovation, fostering an
inclusive work culture.
Summary

In summary, my internship at Accenture provided me with valuable experience in data analysis

and visualization, along with essential soft skills such as communication and teamwork. I
learned the importance of data integrity, the power of visualization in storytelling, and effective
stakeholder engagement. Moving forward, my goal is to pursue a career in data science,
leveraging the skills acquired during my internship to contribute to data-driven solutions in
organizations.

This experience has set a strong foundation for my professional journey, and I am enthusiastic
about continuing to develop my skills in the field of data analytics.
REFERENCE
For online/Google Search

Website - https://www.accenture.com/gb-en/careers/local/virtual-experience-program
For Help - https://www.kaggle.com/datasets/ahmedsamir11111/project-data-analysis-
using-excel
GitHub - https://github.com/Mabrar92/Data-Analysis-Projects-Portfolio
Book reference - https://files.eric.ed.gov/fulltext/ED536788.pdf

Test Development: Fundamentals for Certification and Evaluation
From Everand
Test Development: Fundamentals for Certification and Evaluation
Melissa Fein
No ratings yet
Harsh It
No ratings yet
Harsh It
16 pages
Report File (VJ)
No ratings yet
Report File (VJ)
56 pages
CC6 Week 4 Chapter 2
No ratings yet
CC6 Week 4 Chapter 2
21 pages
Unit 2 Data Gathering
No ratings yet
Unit 2 Data Gathering
14 pages
Chapter 2 - Introduction to Data Science
No ratings yet
Chapter 2 - Introduction to Data Science
37 pages
Session1-DataCharacteristics
No ratings yet
Session1-DataCharacteristics
41 pages
Report On Summer Internship
No ratings yet
Report On Summer Internship
30 pages
Google Certificate Notes
No ratings yet
Google Certificate Notes
36 pages
Downloadable Official CompTIA Data+ Student Guide 3
50% (2)
Downloadable Official CompTIA Data+ Student Guide 3
426 pages
Intro To Data Analytics - Cleanup & Transformation
No ratings yet
Intro To Data Analytics - Cleanup & Transformation
30 pages
Chapter 2-2
No ratings yet
Chapter 2-2
34 pages
Unit 1
No ratings yet
Unit 1
61 pages
Chapter 2 Data Science
No ratings yet
Chapter 2 Data Science
55 pages
Chapter 2. Introduction to Data Science
No ratings yet
Chapter 2. Introduction to Data Science
41 pages
Data Smith Experience
No ratings yet
Data Smith Experience
15 pages
TRAINING Report
No ratings yet
TRAINING Report
32 pages
Lecture 2
No ratings yet
Lecture 2
14 pages
Unit 1 - Exploratory Data Analysis Fundamentals
No ratings yet
Unit 1 - Exploratory Data Analysis Fundamentals
47 pages
Lec 1
No ratings yet
Lec 1
32 pages
data scince report
No ratings yet
data scince report
11 pages
UNIT I - Introduction - DataScience - New
No ratings yet
UNIT I - Introduction - DataScience - New
34 pages
Chapter Two Data Science: by Abdulaziz Oumer
No ratings yet
Chapter Two Data Science: by Abdulaziz Oumer
29 pages
Data Analytics For IOT
No ratings yet
Data Analytics For IOT
57 pages
Big Data
No ratings yet
Big Data
10 pages
Chapter 2 Data Science
No ratings yet
Chapter 2 Data Science
37 pages
Data Tracks
No ratings yet
Data Tracks
8 pages
Fda 1
No ratings yet
Fda 1
5 pages
CH 2 Data Science
No ratings yet
CH 2 Data Science
28 pages
Unit 1 Part 1
No ratings yet
Unit 1 Part 1
18 pages
Data Science PPT Module 1
100% (1)
Data Science PPT Module 1
24 pages
UNIT - 2 .DataScience 04.09.18
No ratings yet
UNIT - 2 .DataScience 04.09.18
53 pages
INTERNSHIP
No ratings yet
INTERNSHIP
7 pages
Data Visulization Report
No ratings yet
Data Visulization Report
21 pages
Data Analytics III-i
No ratings yet
Data Analytics III-i
85 pages
Data Processing
No ratings yet
Data Processing
26 pages
Chapter 2
No ratings yet
Chapter 2
27 pages
YashJakhar_report.docx
No ratings yet
YashJakhar_report.docx
20 pages
Chapter 2
No ratings yet
Chapter 2
30 pages
ETCh2
No ratings yet
ETCh2
36 pages
DA-1,2,3[1]_merged
No ratings yet
DA-1,2,3[1]_merged
39 pages
Chapter Two
No ratings yet
Chapter Two
14 pages
L 4 and 5-Data Cleaning DS-Sa
No ratings yet
L 4 and 5-Data Cleaning DS-Sa
44 pages
DOC-20231118-WA0008new Unit 3
No ratings yet
DOC-20231118-WA0008new Unit 3
15 pages
Introduction To Data Science, Evolution of Data Science
No ratings yet
Introduction To Data Science, Evolution of Data Science
11 pages
Unit 3 DW
No ratings yet
Unit 3 DW
19 pages
21ai402 Data Analytics Unit-3
No ratings yet
21ai402 Data Analytics Unit-3
150 pages
Data_Mining_Warehousing Unit II
No ratings yet
Data_Mining_Warehousing Unit II
39 pages
Chapter - 2 - Data Science
No ratings yet
Chapter - 2 - Data Science
32 pages
4 z
No ratings yet
4 z
33 pages
Chapter 2 Data Science
No ratings yet
Chapter 2 Data Science
33 pages
Data Mining Practical 123
No ratings yet
Data Mining Practical 123
26 pages
R15a0530 Bda PDF
No ratings yet
R15a0530 Bda PDF
43 pages
Data Science Roles, Stages in A Data Science Project
No ratings yet
Data Science Roles, Stages in A Data Science Project
14 pages
1 Da
No ratings yet
1 Da
12 pages
Module1_Introduction to Data Processing Updated
No ratings yet
Module1_Introduction to Data Processing Updated
44 pages
Unit 3 Data Analytics
No ratings yet
Unit 3 Data Analytics
16 pages
EmgTech Chapter 02
No ratings yet
EmgTech Chapter 02
52 pages
DA Assignment 20241015 091512 0000
No ratings yet
DA Assignment 20241015 091512 0000
19 pages
"Big Data Science" Basic Concepts and Applications
From Everand
"Big Data Science" Basic Concepts and Applications
Sukanta Bhattacharya
No ratings yet
Financial Time Series Forecasting Using CNN and Transformer
No ratings yet
Financial Time Series Forecasting Using CNN and Transformer
4 pages
Ra 7394 Consumer Act of The Philippines Summary
No ratings yet
Ra 7394 Consumer Act of The Philippines Summary
6 pages
31-07-22 - Inc - Jr.iit - Star Co-Sc (Model-A) - Jee Adv - 2017 (P-I) - Wat-5 - QP
No ratings yet
31-07-22 - Inc - Jr.iit - Star Co-Sc (Model-A) - Jee Adv - 2017 (P-I) - Wat-5 - QP
19 pages
Experimental Basis of Percutaneous Laser Disc Decompression (PLDD) : A Review of Literature
No ratings yet
Experimental Basis of Percutaneous Laser Disc Decompression (PLDD) : A Review of Literature
5 pages
Cata Macrom 03-2
No ratings yet
Cata Macrom 03-2
16 pages
NCM 417 - Midterm Exam 2015
No ratings yet
NCM 417 - Midterm Exam 2015
6 pages
Reported Speech For All Tenses
No ratings yet
Reported Speech For All Tenses
7 pages
Why LC/MS/MS?: Technology Transfer Workshop
No ratings yet
Why LC/MS/MS?: Technology Transfer Workshop
47 pages
2 Software Design Processes and Management: 1. The Figure Below Indicates The Errors in The Diagram
No ratings yet
2 Software Design Processes and Management: 1. The Figure Below Indicates The Errors in The Diagram
10 pages
PCM 12TH Holiday Homework 2024-25 - 143
No ratings yet
PCM 12TH Holiday Homework 2024-25 - 143
5 pages
PowerUp PDF
No ratings yet
PowerUp PDF
1 page
Crs Clock Recovery
No ratings yet
Crs Clock Recovery
18 pages
Ntermittent Laudication Lass: Liz Bouch (Senior Specialist Physiotherapist) Manchester Royal Infirmary BACPAR - Nov 14
No ratings yet
Ntermittent Laudication Lass: Liz Bouch (Senior Specialist Physiotherapist) Manchester Royal Infirmary BACPAR - Nov 14
21 pages
Airwaves Issue 9 2011
No ratings yet
Airwaves Issue 9 2011
4 pages
Azrin & Lindsley 1956 - The Reinf of Cooperation Between Children - LIMPO
No ratings yet
Azrin & Lindsley 1956 - The Reinf of Cooperation Between Children - LIMPO
3 pages
Lonox: Physical Properties
No ratings yet
Lonox: Physical Properties
7 pages
EF3e Elem Filetest 5a
No ratings yet
EF3e Elem Filetest 5a
7 pages
Schedule Cracker Manual 1.2
No ratings yet
Schedule Cracker Manual 1.2
45 pages
8 Personal Narrative Essay
100% (1)
8 Personal Narrative Essay
15 pages
Aibt Chcprp003 Learner Workbook Ecec v1.0
No ratings yet
Aibt Chcprp003 Learner Workbook Ecec v1.0
58 pages
Fs2-Learning Julie Valdevieso FSM 4d
100% (1)
Fs2-Learning Julie Valdevieso FSM 4d
4 pages
Unit7 P
No ratings yet
Unit7 P
10 pages
Mindfulness Journal
100% (10)
Mindfulness Journal
12 pages
FoodChem1052007756-760
No ratings yet
FoodChem1052007756-760
6 pages
Maintaining IT Equipment and Consumables
No ratings yet
Maintaining IT Equipment and Consumables
154 pages
Presentation On Diagnostic Agent
No ratings yet
Presentation On Diagnostic Agent
31 pages
Experiences of LGBTQ
No ratings yet
Experiences of LGBTQ
14 pages
FDTL Quick Reference Guide (Easa-European Aviation Safety Agency)
No ratings yet
FDTL Quick Reference Guide (Easa-European Aviation Safety Agency)
18 pages
SCIENCE 9 3rd QUARTER REVIEWER VOLCANOES
No ratings yet
SCIENCE 9 3rd QUARTER REVIEWER VOLCANOES
6 pages
Rules and Procedures of Branch Change
No ratings yet
Rules and Procedures of Branch Change
3 pages

Data Analysis and Visualization Summer Training Report

Uploaded by

Data Analysis and Visualization Summer Training Report

Uploaded by

SUMMER TRAINING REPORT

Data Analysis and Visualization

Submitted in partial fulfillment of the

Degree of Bachelor of Technology in Computer Science and Engineering

Name: Himanshu Sharma

Enrolment No. 00227902721 Semester: 7TH

Dr. Shailendra Kumar

Department of Computer Science and

Industry is an authentic record of my own work as requirements of Minor Industrial Training

guidance of Dr Shailendra Kumar.

Industrial Training by his thoughtful reviews and valuable guidance.

for his co-operation and encouragement.

Content Page No.

Figure Caption Page

2.1 Figure caption title 4

2.2 Figure caption title 6

DATA MODELING - Data modeling is the process of creating a conceptual representation of

DATA VISUALIZATION - Data visualization is the graphical representation of information

Accenture is a global professional services firm specializing in consulting, digital

SKILLS I HAVE ACQUIRED IN THIS TRAINING

Data sets After Data cleaning and modelling

During my internship, I was involved in multiple projects focused on analyzing consumer

This internship has profoundly impacted my career development by enhancing my analytical

In summary, my internship at Accenture provided me with valuable experience in data analysis

You might also like