0% found this document useful (0 votes)
254 views

Data Analysis and Visualization Summer Training Report

data analysis and visualization summer training report

Uploaded by

shashank78199
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
254 views

Data Analysis and Visualization Summer Training Report

data analysis and visualization summer training report

Uploaded by

shashank78199
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 14

SUMMER TRAINING REPORT

On

Data Analysis and Visualization

Submitted in partial fulfillment of the


Requirements for the award of

Degree of Bachelor of Technology in Computer Science and Engineering

Submitted By

Name: Himanshu Sharma

Enrolment No. 00227902721 Semester: 7TH

SUBMITTED TO:

Dr. Shailendra Kumar

Department of Computer Science and


Engineering
TRINITY INSTITUTE OF INNOVATIONS IN
PROFESSIONAL STUDIES,
Greater Noida, (U P)
CERTIFICATE
CERTIFICATE
DECLARATION

I hereby declare that the Industrial Training Report on data Analysis and Visualization

Industry is an authentic record of my own work as requirements of Minor Industrial Training

during the period from 1st sept to 26th sept or the award of degree of B. Tech. CSE,

Trinity Institute of Innovations in Professional Studies, Greater Noida, (U P), under the

guidance of Dr Shailendra Kumar.

(Signature of student)
Himanshu Sharma
00227902721

Date: 26 /10/2024

Certified that the above statement made by the student is correct to the best of our knowledge and

belief.

Examined by:

(Signature)

Dr Shailendra Kumar

(Signature)
Head of Department
ACKNOWLEDGEMENT

First and foremost, I wish to express my sincere thanks and gratitude to my esteemed Mentor

“Accenture North America” who has contributed so much for successful completion of my

Industrial Training by his thoughtful reviews and valuable guidance.

Next I would like to tender my sincere thanks to “Gunjan Arya” (Head of CSE Department)

for his co-operation and encouragement.

(Signature of student)

Himanshu Sharma

00227902721
LIST OF CONTENTS

Content Page No.

Certificate by Company/Industry i
Declaration by student ii
Acknowledgement iii
Table of Contents iv
List of Tables v
List of Figures vi
Abbreviations and Nomenclature (If any) vii

1. Chapters 1-10
1.1 Introduction 1

1.2 Theory 2

LIST OF FIGURES

Figure Caption Page


No. No.
1.1 Figure caption title 2
1.2 Figure caption title 3

2.1 Figure caption title 4

2.2 Figure caption title 6


ABBREVIATIONS AND NOMENCLATURE

DATA UNDERSTANDING - Understanding data and data analysis involves grasping various
concepts, including the types of data, methods of collection, and how to interpret findings. Data
can be quantitative (numerical) or qualitative (categorical), and it can be gathered through
surveys, experiments, or observational studies. Structuring this data effectively, often in tables
or databases, is crucial for analysis.

DATA CLEANING - Data cleaning is a crucial step in data analysis that ensures the accuracy
and reliability of the dataset. It involves identifying and correcting errors or inconsistencies,
starting with the detection of missing values, which can be addressed by removing records or
imputing values.

DATA MODELING - Data modeling is the process of creating a conceptual representation of


data and its relationships within a system. It serves as a blueprint for organizing and managing
data in databases and applications.

1. Conceptual Models: High-level representations that outline the overall structure and
relationships without delving into technical details.
2.Logical Models: More detailed than conceptual models, these specify the data elements,
attributes, and relationships while remaining independent of a specific database management
system.
3.Physical Models: These represent how data is stored in the database, including specific
tables, columns, data types, and indexing strategies.

DATA VISUALIZATION - Data visualization is the graphical representation of information


and data, allowing for easier interpretation and insights. By using visual elements like charts,
graphs, and maps, data visualization helps communicate complex data in a clear and accessible
way.

Types of Visualizations
1.Charts: Bar charts, line charts, pie charts, and scatter plots are commonly used to represent
relationships and trends.
2.Maps: Geographic data is effectively displayed using heat maps and choropleth maps show
patterns across different locations.
3.Dashboards: Combining multiple visualizations into a single interface allows for
comprehensive data analysis briefly.
INTRODUCTION

Accenture is a global professional services firm specializing in consulting, digital


transformation, technology, and operations. With a commitment to delivering high-quality
services and innovative solutions, Accenture operates in multiple sectors such as finance,
healthcare, and retail. I chose to intern at Accenture due to its strong reputation for fostering
skill development and its focus on cutting-edge technologies, making it an ideal environment
for learning about data analysis and visualization.
My academic background in data science, combined with hands-on experience in data analysis
and visualization, has equipped me with the skills necessary to excel in this role. I have
developed a strong proficiency in tools such as Python, R, and Tableau, which allow me to
manipulate large datasets and create compelling visualizations that tell a story.
During my studies, I undertook several projects where I analysed complex datasets, identifying
trends and patterns that informed key decisions. These experiences have not only honed my
technical abilities but also deepened my understanding of how data can drive business
strategies. I am particularly inspired by Accenture’s focus on using advanced analytics and
technology to solve real-world challenges and enhance client outcomes.
I am eager to collaborate with talented professionals and contribute to projects that impact
diverse industries. I believe that my analytical mindset, attention to detail, and passion for data-
driven solutions will enable me to make meaningful contributions to your team. This internship
represents an invaluable opportunity for me to learn from industry leaders while applying my
skills to contribute to Accenture's mission of delivering high-impact insights. I look forward to
the possibility of being part of such a forward-thinking organization and helping clients
navigate their data journeys.

SKILLS I HAVE ACQUIRED IN THIS TRAINING


Data Analysis
1. Data Cleaning:
o Removing duplicates, correcting errors, and handling missing values.
2. Exploratory Data Analysis (EDA):
o Using statistics and visualization (e.g., histograms, scatter plots) to understand
patterns and trends.
3. Statistical Analysis:
o Descriptive Statistics: Mean, median, mode, standard deviation.
o Inferential Statistics: Making predictions or inferences about a population based
on a sample.
4. Data Visualization:
o Creating graphs and charts to represent data visually (e.g., bar charts, line
graphs).
5. Predictive Analysis:
o Using models (e.g., regression analysis, machine learning) to predict future
outcomes.
Data Cleaning
1. Identifying Missing Values
 Detection: Use techniques to find missing data points.
 Handling: Options include removing records, imputing values (mean, median, mode), or leaving
them as-is depending on the analysis context.
2. Removing Duplicates
 Detection: Identify duplicate entries that may skew results.
 Resolution: Remove duplicates to maintain a unique dataset.
3. Correcting Errors
 Typos and Inconsistencies: Standardize formats (e.g., date formats, categorical labels) and
correct spelling mistakes.
 Outliers: Identify and assess outliers to determine if they are valid data points or errors that need
correction.
4. Standardizing Data
 Ensure uniformity in data formats (e.g., converting all text to lowercase, ensuring consistent
units of measurement).
5. Validating Data Accuracy
 Cross-check data against reliable sources to confirm its accuracy, especially for critical fields.

Data Modelling
1.Entities and Attributes
 Entities: These are the primary objects or concepts that hold data (e.g., customers,
orders).
 Attributes: These are the characteristics or properties of entities (e.g., a customer’s name,
email, and purchase history).
2. Relationships
 Data modelling defines how entities relate to each other, such as one-to-one, one-to-
many, or many-to-many relationships. Understanding these relationships is crucial for
accurately representing the data structure.
3. Normalization
 This process involves organizing data to minimize redundancy and improve data
integrity. It helps ensure that dependencies are properly maintained, and that data is
stored efficiently.
4. Data Modelling Tools
 Various tools are available for data modelling, including ERD (Entity-Relationship
Diagram) software like Lucid chart, MySQL Workbench, and modelling languages like
UML (Unified Modelling Language).
5. Importance of Data Modelling
 Effective data modelling leads to better database design, enhances data consistency,
supports improved data management, and provides a clear framework for understanding
and analysing data relationships.
What Have I Created
In this project we worked on data sets, and we did data analysis, data visualization, modelling of
data and cleaning it. Let’s take a look we I have done in this project.

Process 1.
PROCESS 2.
In this we did Requirement Gathering.

PROCESS 3.
In this we have different data sets and we will perform data cleaning.
Data sets Before Data Cleaning and Modelling

Data sets After Data cleaning and modelling


Description of Internship Content

During my internship, I was involved in multiple projects focused on analyzing consumer


behavior data for a retail client. One of the key projects was to analyze sales data from the last
five years to identify trends and forecast future performance. The project’s primary goal was to
provide insights that would assist the client in optimizing inventory and marketing strategies.I
have understand very much about data analysis and data sets and working with them and also, I
have learned many related concepts like data cleaning and data understanding and modelling of
data. This internship helps me with my future understanding of data analysis and visualization.
For the start the whole team who are working on these kinds of projects help us intern to
understand the core and complex concepts of data analysis in such an easy way.

Internship Experience

The internship experience was both challenging and rewarding. One major challenge I faced
was dealing with incomplete data that hindered initial analysis. To overcome this, I developed a
systematic approach to identify missing values, use statistical methods for imputation, and
document my findings for transparency.

This internship has profoundly impacted my career development by enhancing my analytical


thinking and problem-solving skills. I learned how to present data findings in a narrative form
that communicates effectively with non-technical stakeholders. Moreover, I appreciated
Accenture’s management style that emphasized collaboration and innovation, fostering an
inclusive work culture.
Summary

In summary, my internship at Accenture provided me with valuable experience in data analysis


and visualization, along with essential soft skills such as communication and teamwork. I
learned the importance of data integrity, the power of visualization in storytelling, and effective
stakeholder engagement. Moving forward, my goal is to pursue a career in data science,
leveraging the skills acquired during my internship to contribute to data-driven solutions in
organizations.

This experience has set a strong foundation for my professional journey, and I am enthusiastic
about continuing to develop my skills in the field of data analytics.
REFERENCE
For online/Google Search

Website - https://www.accenture.com/gb-en/careers/local/virtual-experience-program
For Help - https://www.kaggle.com/datasets/ahmedsamir11111/project-data-analysis-
using-excel
GitHub - https://github.com/Mabrar92/Data-Analysis-Projects-Portfolio
Book reference - https://files.eric.ed.gov/fulltext/ED536788.pdf

You might also like