0% found this document useful (0 votes)
35 views2 pages

Mohit_Chatterjee

Uploaded by

shirish patne
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
35 views2 pages

Mohit_Chatterjee

Uploaded by

shirish patne
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

Mohit Chatterjee - Data Engineer

+91-7999744064 | [email protected] | LinkedIn | GitHub

Technical Skills and Tools

Big Data AWS


Data Processing Hadoop | Apache Spark EC2 | S3 |EMR | Step Function | SNS | Lambda
Data Ingestion Kafka
NoSQL Database Cassandra Programming Language
Hadoop Distribution EMR Python | Java

Other Skills
SQL | PySpark | Pandas | Numpy | Machine Learning (Classification and Regression ) | Statistics | ETL
Development | Terraform | Azure Data Bricks | tableau | Jira | Git | Jupyter Notebook | Excel | DataBricks |
Spyder | SQL Workbench| MYSQL | Data Analysis | Anaconda | Power BI | Intellij Idea | software engineer |
git(continuous integration and continuous deployement) | CI/CD | Jenkins | Confluence | BitBucket

Experience
DATA ENGINEER – ACCENTURE AUG/2022 – PRESENT
• Collaborate with teams to understand big data semantics and requirements, ultimately improving
operational efficiency by 30% and reducing big data-related errors by 40%.
• Orchestrated a comprehensive data attribute mapping initiative, enabling seamless integration of
diverse vocabularies, optimizing global accessibility and facilitating cross-functional collaboration,
leading to a 30% increase in data accuracy and streamlined data management processes.
• Created a Python-based utility for generating CSVW representations from data sheets, converting over
100 sheets into triple format, which enhanced data integration by 30%.
• Orchestrated end-to-end automation of the data pipeline and automated the whole pipeline process,
resulting in a 60% enhancement in operational efficiency.
• Created and maintained over 50 Python/PySpark scripts for data loading and transformation, leading
to a 25% improvement in data processing efficiency.
• Configured the MDG pipeline, enabling the extraction, transformation, and loading of over 500 GB of
data into the cloud environment, reducing ETL processing time by 40%.
• Executed rigorous data validation protocols through complex SQL queries on large data tables, ensuring
over 99% accuracy in reporting and maintaining data integrity across 50+ datasets used for strategic
decision-making.
• Tools & Technologies Used : Python, SQL, AWS EMR, AWS S3, AWS STEP FUNCTION, AWS LAMBDA,
PySpark, Jira, agile methodology

PYTHON DEVELOPER – ACCENTURE SEPT/2021 – JUL/2022

• Implemented automation workflows that reduced manual effort by 40% and improved operational
efficiency by automating key processes across development and production environments.
• Built a Python application that extracted and categorized over 10,000+ JIRA records, streamlining data
analysis and aligning with business requirements.
• Utilized an automation tool to deploy 100% of the data to both UAT and production environments
(AWS), improving deployment speed by 30% and minimizing errors.
• Constructing an automation release tool projected to enhance client profit by 20%.
• Authored complex SQL scripts for comprehensive end-to-end data validation, ensuring data integrity
across 500+ GB datasets and increasing accuracy by 25%.

Page 1
• Tools & Technologies Used : Python, SQL, TKinter, Pandas

Personal Project
Investment Prediction Mar/2023–Apr/2023

• Acquired and preprocessed historical data, improving data relevance and accuracy for predictive
modeling by 20%.
• Conducted model training sessions, optimizing machine learning algorithms, resulting in a 15% increase
in model performance.
• Directed the development and integration of a machine learning model for market trend forecasting,
achieving 85% prediction accuracy.

APS Fault Detection Dec/2022–Dec/2022

• Leveraged sensor big-data from vehicle systems, gathering critical insights that led to a 10% increase in
operational efficiency.
• Employed advanced feature selection techniques, identifying key data attributes and improving model
performance by 20%.
• Engineered and deployed a machine learning algorithm to predict market trends with 90% accuracy.

Cricket Score Predictor Oct/2022–Oct/2022

• Developed a predictive model to forecast cricket team runs, incorporating extensive historical data and
achieving an 80% forecast accuracy.
• Gathered and analyzed extensive historical cricket match data, processing over 1 million data points
and incorporating relevant features to enhance predictive accuracy by 20%.
• Developed a machine learning framework for dependable market trend forecasting, reaching 85%
accuracy while reducing forecasting time by 30%.

Education
• FULL STACK DATA SCIENCE, INEURON (PWSKILLS) May/2022 – Jul/2023
Internship
• BACHELOR OF TECHNOLOGY, RGPV UNIVERSITY Aug/2017–Aug/2021
Electronic and Communication
• SENIOR SECONDARY, RGPV UNIVERSITY Apr/2016–May/2017
Electronic and Communication
• HIGHER secondary, RGPV UNIVERSITY Apr/2016–May/2017
Electronic and Communication

Courses Certificate

• Python, Coursera • AWS Certified Data Engineer


• SQL, Coursera • Full Stack Data Science, Ineuron
• Full Stack Data Science, Ineuron • Data Analysis Using Python, IBM
• Machine Learning, Tata Steel • Python For Data Science, IBM

Page 2

You might also like