Mohit_Chatterjee
Mohit_Chatterjee
Other Skills
SQL | PySpark | Pandas | Numpy | Machine Learning (Classification and Regression ) | Statistics | ETL
Development | Terraform | Azure Data Bricks | tableau | Jira | Git | Jupyter Notebook | Excel | DataBricks |
Spyder | SQL Workbench| MYSQL | Data Analysis | Anaconda | Power BI | Intellij Idea | software engineer |
git(continuous integration and continuous deployement) | CI/CD | Jenkins | Confluence | BitBucket
Experience
DATA ENGINEER – ACCENTURE AUG/2022 – PRESENT
• Collaborate with teams to understand big data semantics and requirements, ultimately improving
operational efficiency by 30% and reducing big data-related errors by 40%.
• Orchestrated a comprehensive data attribute mapping initiative, enabling seamless integration of
diverse vocabularies, optimizing global accessibility and facilitating cross-functional collaboration,
leading to a 30% increase in data accuracy and streamlined data management processes.
• Created a Python-based utility for generating CSVW representations from data sheets, converting over
100 sheets into triple format, which enhanced data integration by 30%.
• Orchestrated end-to-end automation of the data pipeline and automated the whole pipeline process,
resulting in a 60% enhancement in operational efficiency.
• Created and maintained over 50 Python/PySpark scripts for data loading and transformation, leading
to a 25% improvement in data processing efficiency.
• Configured the MDG pipeline, enabling the extraction, transformation, and loading of over 500 GB of
data into the cloud environment, reducing ETL processing time by 40%.
• Executed rigorous data validation protocols through complex SQL queries on large data tables, ensuring
over 99% accuracy in reporting and maintaining data integrity across 50+ datasets used for strategic
decision-making.
• Tools & Technologies Used : Python, SQL, AWS EMR, AWS S3, AWS STEP FUNCTION, AWS LAMBDA,
PySpark, Jira, agile methodology
• Implemented automation workflows that reduced manual effort by 40% and improved operational
efficiency by automating key processes across development and production environments.
• Built a Python application that extracted and categorized over 10,000+ JIRA records, streamlining data
analysis and aligning with business requirements.
• Utilized an automation tool to deploy 100% of the data to both UAT and production environments
(AWS), improving deployment speed by 30% and minimizing errors.
• Constructing an automation release tool projected to enhance client profit by 20%.
• Authored complex SQL scripts for comprehensive end-to-end data validation, ensuring data integrity
across 500+ GB datasets and increasing accuracy by 25%.
Page 1
• Tools & Technologies Used : Python, SQL, TKinter, Pandas
Personal Project
Investment Prediction Mar/2023–Apr/2023
• Acquired and preprocessed historical data, improving data relevance and accuracy for predictive
modeling by 20%.
• Conducted model training sessions, optimizing machine learning algorithms, resulting in a 15% increase
in model performance.
• Directed the development and integration of a machine learning model for market trend forecasting,
achieving 85% prediction accuracy.
• Leveraged sensor big-data from vehicle systems, gathering critical insights that led to a 10% increase in
operational efficiency.
• Employed advanced feature selection techniques, identifying key data attributes and improving model
performance by 20%.
• Engineered and deployed a machine learning algorithm to predict market trends with 90% accuracy.
• Developed a predictive model to forecast cricket team runs, incorporating extensive historical data and
achieving an 80% forecast accuracy.
• Gathered and analyzed extensive historical cricket match data, processing over 1 million data points
and incorporating relevant features to enhance predictive accuracy by 20%.
• Developed a machine learning framework for dependable market trend forecasting, reaching 85%
accuracy while reducing forecasting time by 30%.
Education
• FULL STACK DATA SCIENCE, INEURON (PWSKILLS) May/2022 – Jul/2023
Internship
• BACHELOR OF TECHNOLOGY, RGPV UNIVERSITY Aug/2017–Aug/2021
Electronic and Communication
• SENIOR SECONDARY, RGPV UNIVERSITY Apr/2016–May/2017
Electronic and Communication
• HIGHER secondary, RGPV UNIVERSITY Apr/2016–May/2017
Electronic and Communication
Courses Certificate
Page 2