Yhills Intern-8
Yhills Intern-8
SUMMER INTERNSHIP
Bachelor of Technology
By
Gayathri Karri
(Roll No. 21L31A0598)
Mr.V.Nagu
Endorsements
Faculty Guide
Principal
CERTIFICATE FROM INTERN ORGANIZATION
This is to certify that K .Gayathri Reg.No 21L31A0598 of Vignan’s Institute of Information
Technology underwent internship in Yhills Edutech from 09-05-2023 to 05-07-2023. The overall
performance of the intern during her internship is found to be satisfactory.
First and foremost, I would like to thank the entire Yhills team for their guidance, support, and
encouragement throughout my internship journey. Their expertise, knowledge, and willingness to share
their experiences have been instrumental in enhancing my skills and understanding of the industry.
I am also grateful to my fellow interns at Yhills for their camaraderie and collaboration. Their enthusiasm
and shared experiences made my internship period enjoyable and fostered a positive learning
environment.
Furthermore, I would like to acknowledge the entire Yhills organization for creating a supportive and
inclusive work culture. The collaborative atmosphere, open communication, and emphasis on personal
growth have made my internship experience truly fulfilling.
Last but not least, I would like to express my heartfelt appreciation to the management team at Yhills for
providing me with this internship opportunity. I am grateful for their trust in my abilities and for entrusting
me with challenging responsibilities.
In conclusion, I am immensely grateful to Yhills for providing me with a remarkable internship experience.
The skills, knowledge, and connections I have gained during my time here will undoubtedly shape my
future endeavors. Thank you for the incredible opportunity and for being an integral part of my
professional journey.
My heartful thanks to my internship instructor Ms.Premlatha who helped me a lot to gain some
knowledge on Artificial Intelligence.
Contents
This internship report provides an overview of the internship experience at Yhills Edutech. The report
highlights the learning objectives and outcomes achieved during the internship period. It includes a
description of the business sector and the organization where the internship took place. Furthermore,
it summarizes all the activities undertaken by the intern throughout the internship duration.
1. Gain a comprehensive understanding of the educational technology sector and its trends.
Outcome: Developed a strong knowledge base of the latest advancements and innovations in the
edutech industry.
Yhills Edutech is a leading company operating in the educational technology sector. Founded in [year], the
organization is dedicated to revolutionizing education by leveraging technology to create innovative learning
solutions. Yhills Edutech aims to make education more accessible, engaging, and personalized for learners
of all ages. The company offers a wide range of products and services, including online learning platforms,
interactive educational content, data-driven learning analytics tools, and curriculum development resources.
Vision: Yhills Edutech envisions a world where education transcends boundaries and empowers individuals
to reach their full potential through innovative technology-driven solutions.
Mission: The mission of Yhills Edutech is to transform education by developing and providing cutting-edge
technology solutions that enhance the learning experience and promote lifelong learning.
Yhills Edutech has a comprehensive policy to guide the intern's role within the organization. This policy
emphasizes providing a valuable learning experience, mentorship, and professional development
opportunities. The organization aims to create a supportive and inclusive work environment where interns
can contribute to meaningful projects, gain practical skills, and receive guidance from experienced
professionals. The policy also emphasizes open communication, respect, and adherence to ethical standards.
D. Organizational Structure:
Yhills Edutech follows a hierarchical organizational structure that supports effective decision-making and
streamlined operations. The structure consists of various departments, including:
1. Executive Leadership: Comprising the CEO, CTO, CFO, and other key executives responsible for setting
the strategic direction of the organization.
2. Department Heads: Leading different functional areas such as Content Development, Technology,
Marketing, Sales, and Operations.
3. Teams: Each department is divided into teams, led by managers or team leaders, focusing on specific
functions or projects.
As an Artificial Intelligence (AI) Training Intern at Yhills Edutech, your role and responsibilities would
involve working closely with the AI team and contributing to the development and implementation of AI-
based educational solutions. Here are some key roles and responsibilities associated with the position:
1. Research and Analysis:
- Conduct research on AI technologies, algorithms, and methodologies applicable to the education sector.
- Analyze existing AI models and algorithms to identify potential applications for improving learning
outcomes.
Yhills Edutech has demonstrated impressive performance in terms of turnover, profits, market reach, and
market value. The organization has experienced consistent growth in revenue, driven by the increasing
demand for its innovative educational technology solutions. Yhills Edutech has expanded its market reach
globally, establishing partnerships with educational institutions, businesses, and government organizations.
The company's
commitment to innovation and customer satisfaction has contributed to its positive reputation and market
value.
Yhills Edutech has ambitious future plans aimed at further expanding its impact in the educational
technology sector. Some of the key future plans of the organization include:
1. Continual Innovation: Investing in research and development to introduce new products and services that
enhance learning experiences.
2. Market Expansion: Strengthening market presence in existing regions and exploring opportunities for
expansion into new markets.
3. Partnerships and Collaborations: Forging strategic partnerships with educational institutions, content
creators, and technology providers to enhance product offerings.
4. User Personalization: Developing personalized learning solutions by leveraging artificial intelligence and
data analytics.
5. Professional Development: Investing in the growth and development of employees through training
programs and career advancement opportunities.
CHAPTER 3: INTERNSHIP PART
I have worked as an Artificial Intelligence intern at Yhills Edutech. It was a great experience. During
this internship I have learnt many new things.
ACTIVITIES IN THE INTERN ORGANIZATION:
As the internship was an online internship I have performed the following tasks according to the instructions
given by my instructor. It was a 2 month internship where I need to attend the online classes weekly twice up to one
month. Each class is of two hours duration where the instructor teaches how to deal with different types of datasets.
The instructor will be explaining different types of attributes and function and built in libraries in python I used to
work with them parallelly along with the instructor. I have learnt many new things like how to import the libraries,
loading the dataset, exploratory data analysis(EDA) which is cleaning of the dataset and creating the models. In the
second month of Internship I have submitted the two projects allotted by my instructor.
ACTIVITY LOG FOR WEEK-1
Day-2
WEEKLY REPORT
WEEK – 1 (From Date: 09-05-2023 to Date:14-04-2023)
WEEKLY REPORT
WEEK – 3 (From Date: 22-05-2023to Date: 28/05/2023)
WEEKLY REPORT
WEEK – 4 (From Date: 29/05/2023to Date: 04/06/2023)
1. Target Variable: Logistic regression is used when the target variable is binary or categorical, with two
possible outcomes (e.g., yes/no, true/false, 0/1).
2. Probability Estimation: Instead of directly predicting the class labels, logistic regression estimates the
probability of an instance belonging to a specific class using a logistic function (also known as the sigmoid
function). This function maps any real-valued number to a probability between 0 and 1.
3. Model Interpretation: Logistic regression provides interpretable results. It calculates the coefficients
(weights) associated with each feature, indicating the impact of each feature on the predicted probability.
These coefficients can be interpreted as the change in the log-odds of the target class per unit change in the
corresponding feature.
4. Assumptions: Logistic regression assumes a linear relationship between the independent variables
(features) and the log-odds of the target class. It also assumes that there is no multicollinearity among the
features, no influential outliers, and the residuals follow a logistic distribution.
5. Training the Model: The logistic regression model is trained using maximum likelihood estimation, which
involves finding the coefficients that maximize the likelihood of the observed data. This is typically done
using optimization algorithms such as gradient descent.
6. Decision Boundary: Logistic regression uses a decision boundary to separate the two classes. The
boundary is determined by a threshold probability (usually 0.5). Instances with predicted probabilities above
the threshold are classified as one class, while those below the threshold are classified as the other class.
7. Evaluation: Model performance in logistic regression is often assessed using evaluation metrics such as
accuracy, precision, recall, and F1 score. Additionally, techniques like ROC curves and AUC (Area Under
the ROC Curve) can be used to evaluate the model's discrimination power.
Logistic regression is a widely used and well-established algorithm for binary classification tasks. It is
especially useful when interpretability of the model and understanding the impact of features on the outcome
is important.
ACTIVITY LOG FOR WEEK-5
WEEKLY REPORT
WEEK – 5 (From Date: 05-06-2023 to Date: 11-06-2023)
08/06/2023:
Exploratory Data Analysis:
EDA is basically about cleaning the dataset by removing the missing values duplicate outliers from the
dataset. It is also used for converting the categorical data into numerical data.
Exploratory Data Analysis (EDA) is a crucial initial step in data analysis, where the primary goal is to
understand the data and gain insights from it. EDA involves examining and summarizing the main
characteristics of the dataset, uncovering patterns, identifying outliers, and exploring relationships between
variables.
1. Data Cleaning: This involves handling missing values, dealing with outliers, and ensuring data integrity
by removing or imputing erroneous or inconsistent values.
2. Descriptive Statistics: Computing summary statistics such as mean, median, standard deviation, and
quartiles provides an overview of the data's central tendency, dispersion, and shape.
3. Data Visualization: Creating visual representations of the data through histograms, box plots, scatter plots,
and other visualizations helps identify patterns, trends, and relationships among variables.
4. Feature Engineering: EDA can help in identifying potential features or transformations that may enhance
the predictive power of machine learning models.
5. Univariate and Bivariate Analysis: Examining individual variables (univariate analysis) and exploring
relationships between pairs of variables (bivariate analysis) helps understand the distributions, correlations,
and dependencies in the data.
6. Hypothesis Generation: EDA often involves generating hypotheses about the data, which can be tested
further using statistical methods or machine learning algorithms.
EDA plays a crucial role in guiding subsequent analysis steps, such as selecting appropriate models,
identifying relevant variables, and detecting data issues that may affect the quality of the analysis. By
exploring and understanding the data, EDA helps uncover patterns, validate assumptions, and gain insights
that contribute to informed decision-making and problem-solving.
WEEKLY REPORT
WEEK – 6 (From Date:12-06-2023 to Date:18-06-2023 )
Random Forest is a popular and powerful supervised learning algorithm used for classification and
regression tasks. It is an ensemble method that combines multiple decision trees to make predictions.
Here are the key characteristics of Random Forest:
1. Ensemble of Decision Trees: Random Forest builds an ensemble of decision trees by training each tree on
a random subset of the data and a random subset of the features. Each tree independently predicts the target
variable, and the final prediction is obtained through voting (for classification) or averaging (for regression)
of the individual tree predictions.
2. Bagging: Random Forest employs a technique called bagging (bootstrap aggregating), which creates
multiple subsets of the training data through sampling with replacement. This helps in creating diverse trees
and reducing overfitting.
3. Feature Randomness: At each split in a decision tree, Random Forest considers only a subset of features
chosen randomly. This randomness encourages tree diversity and reduces the risk of selecting only the most
important features.
4. Robustness to Outliers: Random Forest is robust to outliers and noisy data because it aggregates
predictions from multiple trees, reducing the impact of individual outliers.
5. Variable Importance: Random Forest provides a measure of feature importance by assessing the reduction
in prediction accuracy when a specific feature is randomly permuted. This can help identify the most
influential features in the dataset.
6. Handling High-Dimensional Data: Random Forest performs well even with high-dimensional data, as it
can effectively handle a large number of features without feature selection.
7. Model Interpretability: Although individual trees in a Random Forest can be complex, the overall model
provides insights into feature importance and can be visualized to understand the decision-making process.
Random Forest has wide applicability and has been successfully used in various domains, including finance,
healthcare, and image recognition. It is known for its robustness, accuracy, and ability to handle large and
complex datasets.
The decision tree algorithm is a supervised learning method that uses a tree-like structure to make decisions
or predictions. It is a popular and interpretable machine learning algorithm that can be used for both
classification and regression tasks.
• Through this internship I have learned many new skills and acquired knowledge on unknown projects.
• By this internship I was able to create projects using the datasets and was able to perform operations and fit a
model to them.
• I was able to calculate the accuracy occurred for each model and able to find which model is best suit for the given
data set
• I was able to perform operations using both regression and classification
• I also learn time management by completing all the concepts explained by the instructor and able to understand
them
Daily Activity GPS Photos
Day-11, Date:15/06/2023