Prasanth Vemula - AI ML Engineer Big Data New

Dowtya Sri Prasanth Vemula is a highly skilled AI Engineer with a Master's in Artificial Intelligence and Machine Learning and extensive experience in machine learning, data science, and cloud platforms. He has worked on optimizing ML workflows, developing scalable data pipelines, and deploying production ML models, significantly improving data processing and model accuracy. His professional experience includes roles at Benchmark Gensuite and Salesforce, where he led various AI projects and mentored teams, alongside a strong academic background in teaching and project development.

Uploaded by

afreensmaill

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

29 views4 pages

Prasanth Vemula - AI ML Engineer Big Data New

Uploaded by

afreensmaill

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 4

Dowtya Sri Prasanth Vemula

6145304641
[email protected]

EDUCATION:
 Master of Science in Artificial Intelligence and Machine Learning - University of North Texas
Relevant Courses: Machine Learning, Deep Learning, Feature Engineering, Data Visualization, Software
Development for AI, Reinforcement Learning using Neural Networks, Methods in Empirical Analysis, Statistics,
Data Structures and Algorithms and Problem Solving.
 Bachelor of Technology in Computer Science and Engineering - Amrita Vishwa Vidyapeetham
 Relevant Courses: Software Engineering, Database Management Systems, Data Structures and Algorithms, Big
Data Analytics, System Design
SKILLS:
 Machine Learning & Data Science: Transfer Learning, Model Evaluation (accuracy, precision, recall), API
Development
 Programming Languages: Python, SQL, Java, Bash, JavaScript, Scala, C, C++, C#, R, CUDA
 Cloud Platforms & APIs: AWS (EC2, S3, Lambda, RDS, SageMaker, Bedrock, CloudWatch, Glue, Redshift), Azure
(OpenAI, Cognitive Services, Machine Learning, AKS, Functions, DevOps, AutoML), GCP (Vertex AI, Cloud
Functions, Cloud Storage, BigQuery, Cloud Run, Cloud AI Platform, Document AI, Cloud Vision API, Rest Api
Design)
 Computer Vision: Object Detection (YOLO, Faster R-CNN), Image Segmentation (U-Net, Mask R-CNN, CLIPSeg),
Image Classification (ResNet, Inception), Feature Extraction, Image Augmentation, Transfer Learning
 Frameworks & Packages: TensorFlow, Pandas, NumPy, scikit-learn, Matplotlib, Seaborn, Keras, SciPy, OpenCV,
StatsModels, PySpark, PyTorch, NLP (BERT, openNER, spaCy, GLiNER, openNLP), LangChain
 Machine Learning & Statistics: Regression, Classification, SVM, Decision Trees, Random Forests, Clustering,
Gradient Boosted Trees, Neural Networks (LSTM, GANs, CNN, Transfer Learning), PCA, K-NN, XGBoost
 MLOps Tools: Docker, Kubernetes, Terraform , Ansible, Jenkins, Grafana, FastAPI, MLflow, Ray, Databricks,
Snowflake, Azure DevOps
 Generative AI & Vector Databases: GANs, VANs, Diffusion, RAG, LLMs, Transformers, Prompt Engineering,
Knowledge Graphs (Neo4j), Vector Databases (Pinecone, Weaviate, Milvus, Vespa, Chroma)
PROFESSIONAL EXPERIENCE:
Senior Ai Engineer -Benchmark Gensuite Nov 2024 – Present
Responsibilities:
 Optimized supervised and unsupervised Machine Learning workflows by designing a Platform Agent with LangGraph and
LangChain, processing terabytes of data on Apache Spark and Hadoop HDFS, reducing manual query handling by 35% and
accelerating model training pipelines.
 Engineered scalable big data pipelines by implementing Apache Spark jobs integrated with Teradata for data
warehousing, transforming raw datasets into feature sets for ML models, boosting data processing throughput by 50%.
 Deployed production ML models by containerizing inference services with Docker and Kubernetes and exposing APIs via
FastAPI and Python, achieving 99.8% uptime and reducing client-reported issues by 25%.
 Developed advanced embedding pipelines by leveraging AWS Bedrock Titan v2 and Cohere multilingual models alongside
Apache Spark and Hadoop, automating data chunking and model routing, improving text-processing throughput by 50%
and semantic search relevance by 45%.
 Refactored risk-classification logic by centralizing rule sets in PySpark and Teradata SQL, standardizing checks across 68
risk types, reducing regression bug reports by 60% and ensuring consistency in production.
 Architected a dual-agent document-processing system by orchestrating Hadoop-based ETL pipelines with AWS Textract
for OCR and OpenSearch Serverless for embeddings, reducing LLM call volume by 55% and cutting inference costs by
40%.
 Monitored and analyzed production ML model performance by querying Spark SQL and Pandas-based data analysis
pipelines, detecting anomalies in production logs, and automating re-training triggers to maintain 98% model accuracy.
 Presented machine learning solutions and big data insights to clients by developing interactive dashboards in AWS
CloudWatch and Tableau, enhancing stakeholder understanding and driving a 25% improvement in decision-making
speed.
 Led knowledge-transfer workshops on Hadoop, Apache Spark, and Teradata for cross-functional teams, mentoring new
engineers to accelerate skill acquisition by 70% and promote best practices in ML and big data processing.
 Streamlined CI/CD pipelines for ML model deployment by configuring Jenkins and GitLab to automate Docker builds,
tests, and Kubernetes rollouts, reducing deployment time from 3 hours to 45 minutes and ensuring reproducibility.

Environment: Python, Django, Pandas, Linux/Ubuntu, Machine learning, Artificial Intelligence, MLLib, NLP, Azure, RabbitMQ,
Restful web service, Shell Scripting, Docker, Bit Bucket, Github, CI/CD Pipelines, MySQL, Flask API, MySQL, RabbitMQ, Azure,
Tensorflow, Keras, Pandas, PyTest Responsible for systems implementation and business analysis related to streamlining
operations for ALM operations, especially related to mortgage-based products and their derivatives.
Generative Data Engineer - Salesforce Dec 2023 – Nov 2024
Responsibilities:
 Integrated large-scale Salesforce datasets into AWS S3 by deploying MuleApps in AnyPoint, automating data
ingestion pipelines with Apache Spark on Hadoop and Teradata connectors, reducing manual entry by 50% and
improving data analysis accuracy.
 Automated CPS property data generation by orchestrating Airflow DAGs with YAML configurations and Ninja
templates, loading terabytes of raw data from S3 into Snowflake and Teradata, increasing processing speed by
30% and cutting errors by 90%.
 Designed and deployed RAG pipelines for GenAI chatbots by integrating LangChain, AWS Bedrock models, and
OpenSearch Serverless on Hadoop clusters, improving contextual relevance by 35% and enabling scalable ML
model inference.
 Engineered a dynamic Router Query Engine using PySpark and Teradata SQL for big data routing logic, calling
specialized modules to process ML tasks, reducing query misrouting by 25% and enhancing system reliability.
 Developed data persistence strategies in AWS S3 with Snowflake and Teradata for streaming contexts, utilizing
Spark Structured Streaming on Hadoop to support real-time ML model updates with low-latency data availability.
 Built production-ready JavaScript RAG web apps by leveraging Node.js, React, and Elasticsearch on Hadoop HDFS,
delivering conversational ML interfaces with 35% higher user engagement.
Environment: Python, FastAPI, Streamlit, LangChain, LangGraph, AWS Bedrock (Titan v2, Cohere Multilingual, Claude via
RAG), AWS OpenSearch Serverless/Elasticsearch, AWS Textract, Mistral OCR, AWS Cognito, AWS API Gateway, Amazon S3,
KNN/ANN algorithms, AWS CloudWatch
AI Engineer Intern - Amity Tech Corporation Sep 2023 - Dec 2023
 Engineered AI chatbot leveraging GPT-3.5 and LangChain with Streamlit, deploying models on Hugging Face Spaces and
Docker, cutting client response times by 40% and boosting satisfaction by 35%.
 Optimized semantic search pipelines by building embedding-based retrieval using Cohere LLMs and AnnoyIndex on
Apache Spark, achieving a 25% increase in retrieval accuracy and reducing query latency by 20%.
 Developed end-to-end ML pipelines in Python and Hadoop for document ingestion, vector indexing, and query
processing, enabling scalable AI-driven support solutions with 50% faster resolution.
 Built interactive dashboards with Pandas, NumPy, and matplotlib in Streamlit, presenting ML performance metrics and
data analysis insights to stakeholders, enhancing transparency.
 Collaborated with cross-functional teams to integrate Python-based Machine Learning models with production-grade
RESTful APIs, supporting deployment on cloud platforms with 99% uptime.

Environment: Python, OpenAI GPT-3.5 API, LangChain, Streamlit, Hugging Face Spaces, Cohere LLMs, AnnoyIndex, semantic-
search & embedding pipelines, Python data-processing libraries (Pandas, NumPy), machine-learning frameworks (scikit-
learn), interactive dashboarding tools.
Graduate Teaching Assistant - Dept. of CSE, University of North Texas, Denton, TX. Aug 2022 – Sep 2023
 Tutored 200+ graduate and undergraduate students in statistics, probability, and advanced computer science
principles, incorporating machine learning concepts into lessons, which boosted average grades by 15% and
enhanced student comprehension.
 Led a wildfire spread prediction project by applying meteorological and topographical data to machine learning
models, improving next-day prediction accuracy by 20% through feature engineering and model tuning using
Python and scikit-learn.
 Developed and optimized Python-based workflows for satellite image processing in a geo-spatial lab, utilizing QIS,
GDAL, and NumPy, increasing image processing efficiency by 25% and enabling real-time analysis of large
datasets.
 Incorporated machine learning techniques into geospatial data visualization by leveraging Python libraries
(Matplotlib, Plotly) to create interactive dashboards, enabling students to visualize complex data sets and
increasing their analytical capabilities by 30%.
 Mentored students on research methodologies and project development in computational science, guiding them
in applying advanced algorithms and statistical models, which led to the publication of 2 student research papers
in peer-reviewed journals.
 Collaborated with faculty and industry professionals to incorporate the latest advancements in machine learning
and geospatial analytics, elevating the quality of teaching materials and project assignments.
 Implemented practical, hands-on assignments for students, introducing Python’s TensorFlow and Keras
frameworks for real-world data modeling, enhancing their exposure to state-of-the-art machine learning
techniques by 40%.
Site Reliability Engineer Intern - Honeywell, Bangalore, India Jan 2022 –
Jul 2022
 Developed JavaScript-based RAG web applications with Node.js, Express, and React, integrating Elasticsearch
on Hadoop for vector retrieval, increasing user engagement by 35%.
 Built retriever engines using LangChain.js and Redis for conversational memory, optimizing big data query
processing on Teradata and Hadoop clusters for 25% faster response.
 Implemented Docker and Kubernetes-based CI/CD pipelines in GitLab for continuous deployment of ML
inference services, reducing release cycle time by 40%.
Environment: JavaScript (ES6+), Node.js, Express.js, React.js, LangChain.js/RAG frameworks, Elasticsearch / OpenSearch
(vector retrieval), Redis (conversation memory), Docker, Kubernetes, Git & CI/CD pipelines.
Teaching Assistant - Dept. of CSE, Amrita Vishwa Vidyapeetham, Coimbatore, India. Jun 2018 – Dec 2021
 Illustrated fundamentals of supervised and unsupervised Machine Learning by mapping Scratch and Flogarithm
activities to K-means clustering and decision-tree workflows in visual programming environments, enhancing
student mastery of ML basics by 45%.
 Designed interactive lab sessions simulating big-data processing pipelines with pseudo–Apache Spark jobs
implemented in Python, demonstrating parallel data transformations and boosting engagement and proficiency
in programming basics by 30%.
 Mentored cohorts of 50+ undergraduates in C, Python, Data Structures, and Algorithms through hands-on coding
exercises and Teradata-style SQL query workshops, improving exam pass rates by 25% and solidifying
foundational data-analysis skills.
 Developed Python-based algorithmic assignment templates and leveraged a simulated Hadoop HDFS
environment to teach distributed file storage and map-reduce concepts, increasing student confidence in Big
Data workflows by 40%.
 Facilitated weekly solution walkthroughs and technical presentations on computational thinking and algorithm
design using structured slide decks and live coding demos, strengthening both peer-to-peer communication and
student feedback scores by 35%
ACADEMIC PROJECT EXPERIENCE
Intelligent Resume Matching Platform May 2024 – Present
 Developed an automated resume screening tool with Streamlit, LangChain, and Pinecone, enhancing
candidate-job match accuracy by 50% and optimizing high-volume HR workflows.
 Built a vectorized resume matching system using Sentence Transformers and Pinecone, reducing candidate
screening time by 40% through efficient, similarity-based retrieval.
Customer Care Call Summary Alert Jun 2024 – Jul 2024
 Developed an AI-driven call summarization and alert system with Streamlit, OpenAI's Whisper, and Zapier,
automating customer service call summaries and delivering real-time alerts to stakeholders, leading to a 30%
reduction in manual reporting time.
 Designed a zero-shot NLP architecture using LangChain and Whisper to transcribe, summarize, and notify
stakeholders with Zapier integration, resulting in improved response times and streamlined communication
workflows.
Invoice Extraction Chatbot Dec 2023 - Jun 2024
 Led the design of an OCR and NLP pipeline using PyPDF and OpenAI models, cutting manual processing time by
50%.
 Created a Streamlit interface for invoice uploads, enhancing user accessibility and processing efficiency by 40%.

AI Powered Support ChatBot For A Website Jun 2023 – Dec 2023

 Developed an AI-powered support chatbot using Streamlit, LangChain, and Pinecone, resulting in a 40%
increase in response accuracy and improved customer query resolution speed.
 Engineered a modular NLP and vector storage architecture with Sentence Transformers and Pinecone, enhancing
user satisfaction by 35% through fast and precise information retrieval.
Automatic Ticket Classification Tool Dec 2022 – Jun 2023
 Created an automated ticket classification system using SVM and Pinecone, reducing manual sorting by 70%.
 Designed a Streamlit interface for efficient ticket processing, enhancing categorization accuracy and scalability.
CSV Data AnalysisTool Dec 2020 – Jun 2021
 Engineered a web-based data analysis app with Streamlit and Pandas, improving insight extraction by 60%.
 Developed NLP-enhanced query processing with LangChain, boosting decision-making efficiency by 40%.

RidgeBot Deployment Quick Start Guide v4.2.2 Latest
No ratings yet
RidgeBot Deployment Quick Start Guide v4.2.2 Latest
98 pages
Security Pillar AWS Well-Architected Framework
No ratings yet
Security Pillar AWS Well-Architected Framework
137 pages
Yugandar - Generative AI Architect
No ratings yet
Yugandar - Generative AI Architect
8 pages
Cognito TechTalk
No ratings yet
Cognito TechTalk
71 pages
SAA-C03 AWS Certified Solutions Architect - Associate Updated Dumps
No ratings yet
SAA-C03 AWS Certified Solutions Architect - Associate Updated Dumps
90 pages
Vrealize Operations Manager 84 Config Guide
No ratings yet
Vrealize Operations Manager 84 Config Guide
884 pages
Amazon: Exam Questions AWS-Certified-Developer-Associate
No ratings yet
Amazon: Exam Questions AWS-Certified-Developer-Associate
6 pages
AMAZON
100% (1)
AMAZON
71 pages
AWS Well-Architected Partner Program APFP Guide Mar 2024
No ratings yet
AWS Well-Architected Partner Program APFP Guide Mar 2024
13 pages
AWS Solution Architect Associate - Update1
100% (1)
AWS Solution Architect Associate - Update1
133 pages
Devops Essentials Slides - 1524580554 PDF
100% (3)
Devops Essentials Slides - 1524580554 PDF
120 pages
Cortex Cloud Documentation
No ratings yet
Cortex Cloud Documentation
329 pages
Priya AIML Resumee
No ratings yet
Priya AIML Resumee
5 pages
GenAI Resume 02 Nov 20
No ratings yet
GenAI Resume 02 Nov 20
5 pages
Lakshmi Sampath Potluri AI ML Engineer
No ratings yet
Lakshmi Sampath Potluri AI ML Engineer
7 pages
Adish CV
No ratings yet
Adish CV
1 page
Ashika Resume DS
No ratings yet
Ashika Resume DS
7 pages
Rajesh DataEngineer
No ratings yet
Rajesh DataEngineer
7 pages
Srikanth - Bellary Architect Resume
No ratings yet
Srikanth - Bellary Architect Resume
6 pages
AWS Proposal
No ratings yet
AWS Proposal
7 pages
Data Scientist ML Resume
No ratings yet
Data Scientist ML Resume
5 pages
3-7 Year
No ratings yet
3-7 Year
2 pages
AI Engineer JD
100% (1)
AI Engineer JD
1 page
Cloud Computing, CS596-015: Amazon EC2 & Amazon Web Services (AWS)
No ratings yet
Cloud Computing, CS596-015: Amazon EC2 & Amazon Web Services (AWS)
92 pages
Builder Portfolio
No ratings yet
Builder Portfolio
35 pages
AWS Terminologies
No ratings yet
AWS Terminologies
41 pages
ShelfSync Presentation
No ratings yet
ShelfSync Presentation
20 pages
Aws Lab Manual
No ratings yet
Aws Lab Manual
44 pages
Satyanarayana Parakota
No ratings yet
Satyanarayana Parakota
8 pages
Kamlesh-AI - ML Architect
No ratings yet
Kamlesh-AI - ML Architect
8 pages
Devika.v ML
No ratings yet
Devika.v ML
7 pages
Ritishsajjagcp
No ratings yet
Ritishsajjagcp
7 pages
Satya Sandeep - Data Engineer Resume
No ratings yet
Satya Sandeep - Data Engineer Resume
8 pages
Ali Kone
No ratings yet
Ali Kone
6 pages
Elie Niring
No ratings yet
Elie Niring
6 pages
MLops 12 Draft
No ratings yet
MLops 12 Draft
5 pages
Mehdi RESUME
No ratings yet
Mehdi RESUME
8 pages
Abubakar Python Ai
No ratings yet
Abubakar Python Ai
4 pages
Prabhash Chandra Karan
No ratings yet
Prabhash Chandra Karan
5 pages
Anshul Yadav: Profile
No ratings yet
Anshul Yadav: Profile
6 pages
Supriya Data Engineer Resume
No ratings yet
Supriya Data Engineer Resume
4 pages
Third Semester MCA
No ratings yet
Third Semester MCA
22 pages
Suzy Napier3
No ratings yet
Suzy Napier3
4 pages
Aws EventBridge
No ratings yet
Aws EventBridge
4 pages
Resume 2
No ratings yet
Resume 2
4 pages
Anantha Sai Ram Padala - 9+ Yrs - AIML Engineer - 1st
No ratings yet
Anantha Sai Ram Padala - 9+ Yrs - AIML Engineer - 1st
6 pages
Data Scientist/ Machine Learning Engineer: Summary
No ratings yet
Data Scientist/ Machine Learning Engineer: Summary
4 pages
Ihumza980
No ratings yet
Ihumza980
4 pages
A Shlash-ML Data Engineer CV
No ratings yet
A Shlash-ML Data Engineer CV
5 pages
Senior Data Scientist Role Overview
No ratings yet
Senior Data Scientist Role Overview
3 pages
Latest Aditya Sharma Resume - 1719950877610 - Aditya Sharma
No ratings yet
Latest Aditya Sharma Resume - 1719950877610 - Aditya Sharma
2 pages
Corporate Deck
No ratings yet
Corporate Deck
8 pages
Tanner Scadden
No ratings yet
Tanner Scadden
2 pages
Cutshort cv1 YVR0
No ratings yet
Cutshort cv1 YVR0
2 pages
CV Parmar Jaimin Kanubhai Aug
No ratings yet
CV Parmar Jaimin Kanubhai Aug
3 pages
Naukri NiteshRanjanPanda (6y 5m)
No ratings yet
Naukri NiteshRanjanPanda (6y 5m)
3 pages
4 Ways To Structure Your Terraform Projects 1675057399
No ratings yet
4 Ways To Structure Your Terraform Projects 1675057399
4 pages
Resume Complete 2022 Data Eng
No ratings yet
Resume Complete 2022 Data Eng
2 pages
Tianbo Song
No ratings yet
Tianbo Song
2 pages
Biren Resume Senior AI Engineer
No ratings yet
Biren Resume Senior AI Engineer
2 pages
Aashish Arora DS Noida
No ratings yet
Aashish Arora DS Noida
2 pages
Lovey Mishra
No ratings yet
Lovey Mishra
2 pages
Updated - AWS Solution Architect (Associate) B
No ratings yet
Updated - AWS Solution Architect (Associate) B
6 pages
Aws Solution Architect Associate Course Agenda
No ratings yet
Aws Solution Architect Associate Course Agenda
9 pages
Kranthi New2
No ratings yet
Kranthi New2
3 pages
Reddit Redacted
No ratings yet
Reddit Redacted
1 page
Jayasree Yedlapally: Data Architecture Engineering - Senior
No ratings yet
Jayasree Yedlapally: Data Architecture Engineering - Senior
5 pages
Sachit Resume
No ratings yet
Sachit Resume
2 pages
Hardik-Sankhla 21EJDDS137
No ratings yet
Hardik-Sankhla 21EJDDS137
1 page
Jim Xiang: - Santa Clara, CA
No ratings yet
Jim Xiang: - Santa Clara, CA
5 pages
Shana Kallem - Atpco - Cloud Engineer Intern
No ratings yet
Shana Kallem - Atpco - Cloud Engineer Intern
1 page
Jaideep Resume Aiml
No ratings yet
Jaideep Resume Aiml
1 page
Technical Skills
No ratings yet
Technical Skills
5 pages
Technology Skills
No ratings yet
Technology Skills
6 pages
Shreyas Kulkarni 1309036282
No ratings yet
Shreyas Kulkarni 1309036282
2 pages
M.Eldeeb CV
No ratings yet
M.Eldeeb CV
2 pages
Advanced Software Development Methodologies PDF
No ratings yet
Advanced Software Development Methodologies PDF
6 pages
SHANA KALLEM - Goldman Sachs - Data Engineering
No ratings yet
SHANA KALLEM - Goldman Sachs - Data Engineering
1 page
JD - Software Engineer
No ratings yet
JD - Software Engineer
1 page
CPS Abhi Kavathiya
No ratings yet
CPS Abhi Kavathiya
2 pages
JD - ML Computer Vision
No ratings yet
JD - ML Computer Vision
2 pages
Currículum de Iago
No ratings yet
Currículum de Iago
1 page
Sanket Nikam Resume
No ratings yet
Sanket Nikam Resume
2 pages
Prathamesh Chikane Resume v15.7
No ratings yet
Prathamesh Chikane Resume v15.7
1 page
Gowrishankar - Yarra Resume
No ratings yet
Gowrishankar - Yarra Resume
3 pages
Rakesh Kumar - Data Scientist
No ratings yet
Rakesh Kumar - Data Scientist
3 pages
Using AWS in The Context of Australian Privacy Considerations
No ratings yet
Using AWS in The Context of Australian Privacy Considerations
12 pages