0% found this document useful (0 votes)
89 views32 pages

complete IT report

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
89 views32 pages

complete IT report

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 32

A TECHNICAL REPORT ON

STUDENTS INDUSTRIAL WORK EXPERIENCE SCHEME (SIWES)

UNDERTAKEN AT

nHub NIGERIA
3RD FLOOR OLD AIRPORT, JOS, PLATEAU STATE. JOS.

BY

ADDA CHRISTY

UJ/2019/EN/0227

SUBMITTED TO THE DEPARTMENT OF ELECTRICAL/ELECTRONICS


ENGINEERING, UNIVERSITY OF JOS, PLATEAU STATE, NIGERIA.

IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE AWARD OF


BACHELOR OF ENGINEERING (B.Eng.) DEGREE IN ELECTRICAL/ELECTRONICS
ENGINEERING.

JULY, 2024.

i
CERTIFICATION
This is to certify that ADDA CHRISTY of the Department of Electrical/Electronic Engineering,
University of Jos undertook his six months at nHub Nigeria 3 rd floor old airport road Jos. The
technical report presented herein meets the SIWES requirements for the award of the Degree of
Bachelor of Engineering in Electrical/Electronic Engineering of the University of Jos, Nigeria.

__________________________ ________________ Engr.


Patrick N. Julius DATE

(SIWES Supervisor)

__________________________ ________________

Eng. Patrick N. Julius DATE


(Departmental SIWES Coordinator)

__________________________ ___________________

Dr. Sunday Iliya DATE


(Head of Department)

ii
APPROVAL
The technical report has been read and approved to have met one of the condition outlines for the
scheme. This segment gives an approval that Adda Christy with matriculation number
UJ/2019/EN/0227 went on six (6) months industrial training at nHub Nigeria 3 rd floor old airport
road Jos

_________________________ ____________________

Engr. Patrick N. Julius Date


(Supervisor)

_________________________ ____________________

Engr. Patrick N. Julius Date


(Departmental SIWES coordinator)

_________________________ ____________________

Dr. Sunday Iliya Date


(Head of Department)

iii
DEDICATION
This is dedicated to my Lord and Savior Jesus Christ. The Lord God Almighty, my source,
protector, provider and sustainer, the only independent constant, whom other variables (creation)
depends on for life and relevance. The one by who I am able to successfully complete the
industrial training for the stipulated period.

Adda Christy _______________________ _______________


(Name of Student) (Signature) (Date)

iv
v
ACKNOWLEDGEMENT
I appreciate God Almighty for His unfailing love, mercies and sufficient grace to me. God has
been so faithful to me, He has been my sustenance throughout the period of this Industrial
attachment, glory be to His Holy name. I appreciate all those who have tutored me both directly
and indirectly, and also support me in many different ways. I am grateful for your love.

My profound gratitude goes to all the members of various units that constitute the Frontend stack
of nHub Nigeria, for the vast knowledge impacted.

With a grateful heart I want to specially thank this great set of people Mr. Zakawa Ngida who has
been my direct supervisor for his understanding and tolerance. Mr. Ediongsenyene Inokotong,
Mr. Desmond Fon, they support me and have been great instrumental in the activities of my I.T
in nHub Foundation.

Furthermore, I recognize the efforts of, Mr. Bashir Sheidu, the Chief Operating officer (COO),
Mr. Muhammad Sadiq Abdulrasheed, Jeremiah Falinyi, and Mr. Charles they have been very
supportive to me, God bless you richly.

I also acknowledge and salute the team spirit exhibited by my colleagues here, some are my
course mates from UNIJOS, and coincidentally we got industrial placement in the same place. To
all those too numerous to mention and have been very supportive, I say thank you all.

vi
LIST OF FIGURES

Figure 2.1 Numpy arrays ------------------------------------------------------------------------------2

Figure 2.2 Matplotlib plot------------------------------------------------------------------------------

Figure 2.3 Pie chart------------------------------------------------------------------------------------

Figure 2.5 Seaborn pairplot------------------------------------------------------------------------------

Figure 2.6 Linear Regression------------------------------------------------------------------------------

Figure 2.7 Logistic regression------------------------------------------------------------------------------

Figure 2.8 Support Vector Machine------------------------------------------------------------------------

Figure 2.9 Random Forest------------------------------------------------------------------------------

Figure 2.10 K-means Clustering-----------------------------------------------------------------------------

Figure 2.8 Naïve Bayes ------------------------------------------------------------------------------

Figure 2.9 Block Diagram of Machine


Learning----------------------------------------------------------------------------------------------------------
-------------------------------

vii
Table of Contents
CERTIFICATION................................................................................................................

APPROVAL................................................................................................................
DEDICATION................................................................................................................
ACKNOWLEDGEMENT................................................................................................................
LIST OF TABLES................................................................................................................
LIST OF APPENDICES................................................................................................................
CHAPTER ONE..........................................................................................................................1
1.1 Introduction-.....................................................................................................................1
1.1.1 Overview of Organization Activities……………………………………………………1

1.1.2 Aims and Objectives.........................................................................................................1

1.1.3 Scope and Limitations.......................................................................................................2

1.1.4 Significance of machine learning......................................................................................2

1.2 Literature Review..............................................................................................................2

1.2.1 The Current Focus on Machine Learning..........................................................................2

1.2.2 Machine learning awareness and adoption in construction...............................................13

1.2.3 Supervised Learning...........................................................................................................3

1.2.4 Unsupervised Learning......................................................................................................3

1.2.5 Reinforcement Learning.....................................................................................................4

1.2.6 Why Python Is a Perfect Language for Machine Learning?..............................................4

2.1 Python Libraries Used……………………………………………………………………5

2.1.1 Numpy……………………………………………………………………………………5

2.1.2 Pandas.................................................................................................................................5

viii
2.1.3 Matplotlib…………………………………………………………………………………6

2.0.4 Seaborn…………………………………………………………………………………..6

2.1.5 Scikit-learn………………………………………………………………………………6

2.2 Algorithms Used................................................................................................................6

2.2.1 Linear Regression………………………………………………………………………...6

2.2.2 Logistic Regression……………………………………………………………………….7

2.2.3 Decision Tree......................................................................................................................7

2.2.4 Support Vector Machine (SVM)………………………………………………………….7

2.2.5 Random Forest……………………………………………………………………………8

2.2.6 K-fold cross-validation……………………………………………………………………9

2.2.7 K-Means Algorithm……………………………………………………………………….9

2.1.8 Naïve Bayes Algorithm…………………………………………………………………..10

2.3 Machine Learning Steps………………………………………………………………….11

2.3.1 Data Collection…………………………………………………………………………..11

2.3.2 Data Processing………………………………………………………………………….12

2.3.3 Choosing a Model……………………………………………………………………….12

2.3.4 Training the Model………………………………………………………………………13

2.3.5 Evaluating the Model……………………………………………………………………13

ix
2.3.6 Parameter Tuning………………………………………………………………………...14

2.3.7 Predictions and deployment……………………………………………………………14

2.4 Internship Projects Implemented……………………………………………………….15

2.3.1 House Price Prediction App…………………………………………………………..27

2.3.2 Diabetes Detection Model……………………………………………………………29

4.1 Conclusion....................................................................................................................30

4.2 Recommendation..........................................................................................................30

x
CHAPTER 1

INTRODUCTION AND LITERATURE REVIEW

1.2 Introduction

SIWES is an acronym for the Students Industrial Work Experience Scheme and was initiated
in 1973 by the Industrial Training Fund (ITF). SIWES is a commendable program designed
in response to the government mandate given to the ITF through Decree 47 of 1971, charging
it with the responsibility of promoting and encouraging university students to obtain industry
and commerce skills to generate a pool of Indigenous manpower sufficient for proper
inculcation into the country’s job market and meet the needs of the economy.

SIWES is the accepted skills training programme that forms part of the approved minimum
academic standard in the various degree programmes for all Nigerian universities. The main
objective of the program is to bridge the gap between theory and practice by making it
possible for students to get themselves exposed to real jobs and real job situations and
environments.

1.2.1 Overview of Organization ActivitieThis paper reports a 6-month long internship at


nHub Foundation 2nd Floor Taen Business Complex Opposite Old Nitel Office Jos
Plateau State beginning from February through to the end of July, a period of 6
months. This company is involved in different aspects of programming such as data
science, frontend, backend, UI/UX design, embedded systems, and cybersecurity. I
acquired skills and knowledge in machine learning with Python which is a subset of
data science.

1.1.2 Aim and Objectives

The project aims to demonstrate the practical application of machine learning techniques,
from data preprocessing and model selection to evaluation and deployment, in solving a
specific problem relevant to the industry.

The objective of this project is to address challenges by developing and implementing


machine learning models using Python, a widely adopted programming language in the field
This report will detail the process, methodologies, and results obtained during the internship,
providing insights into the potential and limitations of machine learning in real-world
scenarios.
1.1.3 Scope and Limitations

The scope of machine learning encompass its ability to be applied across various industries,
such as healthcare, engineering, finance, and marketing, where it can analyse large volumes
of data, uncover hidden patterns, and automate routine tasks, leading to increased efficiency.
While the limitations machine learning models have the potential to improve over time as
they are exposed to more data, making them valuable in dynamic environments. However,
the effectiveness of machine learning is heavily dependent on the availability of large, high-
quality datasets.

1.1.4 Significance of machine learning

The significance of machine learning lies in its transformative impact across various domains
by enabling systems to learn from data and make intelligent decisions without explicit
programming. It drives advancements in automation, allowing businesses to optimize
processes, reduce costs, and increase efficiency. It drives innovation in areas like autonomous
vehicles and natural language processing. Additionally, it plays a crucial role in scientific
research by uncovering patterns and insights from complex datasets, leading to new
discoveries. Overall, machine learning is a key driver of technological progress, enabling
smarter, more responsive systems that adapt to changing environments and user needs.

1.2 Literature Review

1.2.1 The Current Focus on Machine Learning

Current AI/ML research focuses on reasoning, knowledge representation, learning and


autonomy. ML algorithms enable quick, unsupervised decision-making based on data,
enhancing performance and reducing costs in future networks.

The current state of machine learning research and development in the field of artificial
intelligence is advancing rapidly. Cutting-edge applications like Convolutional Neural
Networks (CNNs), Bidirectional Long Short-Term Memory (BLSTM), and AI engines such
as DALL-E and ChatGPT are at the forefront of innovation [1]. AI and ML technologies are
being seamlessly integrated into 5G systems to enhance performance and reduce costs, with a
focus on autonomy and decision support mechanisms. The latest technologies in AI
emphasize the importance of processing unstructured data efficiently, utilizing unsupervised
12
or semi-supervised learning, and ensuring safety and explainable predictions for real-world
applications [2]. Overall, the field is progressing towards more sophisticated algorithms,
diverse applications, and a greater emphasis on practicality and interpretability.

1.2.2 Machine Learning Awareness and Adoption in Construction

The idea of developing machines exhibiting intelligence like humans (otherwise known as
Artificial Intelligence) can be traced back to fields of computer science, fiction, philosophy
and engineering. Sixty years after Alan Turing's test for machine intelligence, intelligent
machines are now outperforming humans in domains such as learning. A major capability of
machine learning is the utilization of a trained dataset in identifying a trend or pattern to
predict an outcome. Unlike other computational approaches, ML does not require outright
programming before identifying or predicting such outcomes. Such ability gives ML an edge
over other analytical tools. ML has the capability of predicting outcomes based on historical
observations, image recognition and group objects. The required algorithms to carry out this
task are provided by existing libraries of programmes. Such libraries include Numpy,
MatPlotLib, Pandas, Seaborn and Scikit [3].
Machine learning algorithms are often categorized as supervised, unsupervised and
reinforcement learning.

1.2.3 Supervised learning

This is a fundamental branch of machine learning where the objective is to learn a function
that maps input data to output labels based on example input-output pairs. This learning
process is called "supervised" because the algorithm is provided with the correct output labels
during the training process, allowing it to learn by comparing its predictions to the actual
labels and adjusting accordingly. Supervised learning is a type of machine learning where an
algorithm is trained on labelled data. This means that each training example in the dataset is
paired with an output label. The goal of the algorithm is to learn a mapping from inputs
(features) to outputs (labels) so that it can predict the label for new, unseen data.

1.2.4 Unsupervised machine learning

These algorithms are used when the information used to train is neither classified nor
labelled. Unsupervised learning studies how systems can infer a function to describe a hidden
structure from unlabelled data. The system doesn’t figure out the right output, but it explores

13
the data and can draw inferences from datasets to describe hidden structures from unlabelled
data.

1.2.5 Reinforcement machine learning

This is a learning method that interacts with its environment by producing actions and
discovers errors or rewards. Trial and error search and delayed reward are the most relevant
characteristics of reinforcement learning. This method allows machines and software agents
to automatically determine the ideal behaviour within a specific context in order to maximize
its performance.

1.2.6 Why Python Is a Perfect Language for Machine Learning?

i. A great choice of libraries is one of the main reasons Python is the most popular
programming language used for AI. Python libraries provide base level items so
developers don’t have to code them from the very beginning every time. ML requires
continuous data processing and Python’s libraries let us access, handle and transform
data. These are some of the most widespread libraries you can use for ML and AI:
Scikit-learn for handling basic ML algorithms like clustering, linear and logistic
regressions, regression, classification, and others. Pandas for high-level data
structures and analysis. It allows merging and filtering of data, as well as gathering it
from other external sources like Excel, for instance. Keras for deep learning.
TensorFlow for working with deep learning by setting up, training, and utilizing
artificial neural networks with massive datasets. Matplotlib for creating 2D plots,
histograms, charts, and other forms of visualization. PyBrain for neural networks,
unsupervised and reinforcement learning etc.
ii. Working in the ML and AI industry means dealing with a bunch of data that we need
to process in the most convenient and effective way. The low entry barrier allows
more data scientists to quickly pick up Python and start using it for AI development
without wasting too much effort into learning the language. In addition to this, there’s
a lot of documentation available, and Python’s community is always there to help out
and give advice.
iii. Python for machine learning is a great choice, as this language is very flexible: It
offers an option to choose either to use OOPs or scripting. There’s also no need to
recompile the source code, developers can implement any changes and quickly see the
results. Programmers can combine Python and other languages to reach their goals.

14
iv. For AI developers, it’s important to highlight that in artificial intelligence, deep
learning, and machine learning, it’s vital to be able to represent data in a human-
readable format. Libraries like Matplotlib allow data scientists to build charts,
histograms, and plots for better data comprehension, effective presentation, and
visualization.
v. It’s always very helpful when there’s strong community support built around the
programming language. Python is an open-source language which means that there’s
a bunch of resources open for programmers starting from beginners and ending with
pros. A lot of Python documentation is available online as well as in Python
communities and forums, where programmers and machine learning developers
discuss errors, solve problems, and help each other out.
vi. Data Preprocessing, Analysis & Visualization, machine learning algorithms don’t
work so well with processing raw data. Before we can feed such data to an ML
algorithm, we must preprocess it. We must apply some transformations on it with data
preprocessing [4].

15
CHAPTER TWO

MATERIALS AND METHOD

2.1 Python Libraries Used

2.1.1 Numpy

NumPy is a Python library used for working with arrays. It also has functions for working in
domain of linear algebra, fourier transform, and matrices. NumPy was created in 2005 by
Travis Oliphant. It is an open source project and you can use it freely. NumPy stands for
Numerical Python [5].

Figure 2.1 Numpy arrays

2.1.2 Pandas

Pandas is an open-source software library built on top of Python specifically for data
manipulation and analysis, Pandas offers data structure and operations for powerful, flexible,
and easy-to-use data analysis and manipulation [6].

16
Figure 2.2 Data Cleaning

2.1.3 Matplotlib

A picture is worth a thousand words but in my case it worths a thousand data which is true,
humans process or understand better when you use pictures. Visualization helps you to
discover insights, underlying patterns hidden in the data which cannot be seen by mere
looking at the data. matplotlib was developed similar to MATLAB and is the most popular
visualization library.

Figure 2.3 Matplotlib plot

17
Figure 2.4 Pie chart

2.0.4 Seaborn

Seaborn is a Python data visualization library based on matplotlib. It provides a high-level


interface for drawing attractive and informative statistical graphics. For a brief introduction to
the ideas behind the library, you can read the introductory notes or the paper.

Figure 2.5 python libraries imported

Figure 2.6 Seaborn pairplot

18
2.1.5 Scikit-learn

Scikit-learn is a library in Python that provides many unsupervised and supervised learning
algorithms. It's built upon some of the technology like NumPy, pandas, and Matplotlib [7].
The functionality that scukit-learn provides include: Regression like linear and logistic
regression, classification like; K-nearest neighbors, clustering like; K-means, model selection
preprocessing like; min-max normalization.

2.2 Algorithms Used

There are many types of Machine Learning Algorithms specific to different use cases. As we
work with datasets, a machine learning algorithm works in two stages. I usually split the data
around 20%-80% between testing and training stages. Under supervised learning, we split a
dataset into a training data and test data in Python ML. The followings are the Algorithms of
Python Machine Learning

2.2.1 Linear Regression

Linear regression is one of the supervised Machine learning algorithms in Python that
observes continuous features and predicts an outcome. Depending on whether it runs on a
single variable or on many features, we can call it simple linear regression or multiple linear
regression. This is one of the most popular Python ML algorithms and often under-
appreciated. It assigns optimal weights to variables to create a line ax+b to predict the output.
We often use linear regression to estimate real values like a number of calls and costs of
houses based on continuous variables. The regression line is the best line that fits Y=a*X+b
to denote a relationship between independent and dependent variables.

Figure 2.6 Linear Regression

2.2.2 Logistic Regression

19
Logistic regression is a supervised classification, is unique Machine Learning algorithms in
Python that finds its use in estimating discrete values like 0/1, yes/no, and true/false. This is
based on a given set of independent variables. We use a logistic function to predict the
probability of an event and this gives us an output between 0 and 1. Although it says
‘regression’, this is actually a classification algorithm. Logistic regression fits data into a logit
function and is also called logit regression.

Figure 2.7 Logistic regression

2.2.3 Decision Tree

A decision tree is a machine learning algorithm in python that can work on both categorical
and continuous dependent variables. Here, we split a population into two or more
homogeneous sets. Tree models where the target variable can take a discrete set of values are
called classification trees; in these tree structures, leaves represent class labels and branches
represent conjunctions of features that lead to those class labels. Decision trees where the
target variable can take continuous values (typically real numbers) are called regression trees.

2.2.4 Support Vector Machine (SVM)

SVM is a supervised classification is one of the most important Machines Learning


algorithms in Python, that plots a line that divides different categories of your data. In this
ML algorithm, we calculate the vector to optimize the line. This is to ensure that the closest
point in each group lies farthest from each other. While you will almost always find this to be
a linear vector, it can be other than that. SVM can efficiently perform a non-linear

20
classification using what is called the kernel trick, implicitly mapping their inputs into high-
dimensional feature spaces. When data are unlabelled, supervised learning is not possible,
and an unsupervised learning approach is required, which attempts to find natural clustering
of the data to groups, and then map new data to these formed groups.

Figure 2.8 Support Vector Machine

2.2.5 Random Forest

A random forest is an ensemble of decision trees. In order to classify every new object based
on its attributes, trees vote for class- each tree provides a classification. The classification
with the most votes wins in the forest. Random forests or random decision forests are an
ensemble learning method for classification, regression and other tasks that operates by
constructing a multitude of decision trees at training time and outputting the class that is the
mode of the classes (classification) or mean prediction (regression) of the individual trees.

Figure 2.9 Random Forest

2.2.6 K-fold cross-validation

21
K-fold cross-validation is a technique used to evaluate the performance and generalizability
of a machine learning model. The process involves dividing the dataset into k equal-sized
subsets, or "folds." The model is trained k times, each time using a different fold as the
validation set and the remaining k-1 folds as the training set. This ensures that every data
point is used for both training and validation, providing a more accurate measure of the
model's performance. After all iterations, the results from each fold are averaged to produce a
single performance metric, reducing the variance associated with a single train-test split and
giving a better estimate of the model’s ability to generalize to unseen data.

2.2.7 K-Means Algorithm

k-Means is an unsupervised algorithm that solves the problem of clustering. It classifies data
using a number of clusters. The data points inside a class are homogeneous and
heterogeneous to peer groups. k-means clustering is a method of vector quantization,
originally from signal processing, that is popular for cluster analysis in data mining. k-means
clustering aims to partition n observations into k clusters in which each observation belongs
to the cluster with the nearest mean, serving as a prototype of the cluster. k-means clustering
is rather easy to apply to even large data sets, particularly when using heuristics such as
Lloyd's algorithm. It often is used as a preprocessing step for other algorithms, for example to
find a starting configuration. The problem is computationally difficult (NP-hard). k-means
originates from signal processing, and still finds use in this domain.

Figure 2.10 K-means Clustering

2.1.8 Naïve Bayes Algorithm

Naive Bayes is a classification method which is based on Bayes’ theorem. This assumes
independence between predictors. A Naive Bayes classifier will assume that a feature in a
class is unrelated to any other. Consider a fruit. This is an apple if it is round, red, and 2.5

22
inches in diameter. A Naive Bayes classifier will say these characteristics independently
contribute to the probability of the fruit being an apple. This is even if features depend on
each other. For very large data sets, it is easy to build a Naive Bayesian model. Not only is
this model very simple, it performs better than many highly sophisticated classification
methods. Naïve Bayes classifiers are highly scalable, requiring a number of parameters linear
in the number of variables (features/predictors) in a learning problem. Maximum-likelihood
training can be done by evaluating a closed form expression, which takes linear time, rather
than by expensive iterative approximation as used for many other types of classifiers.

Figure 2.8 Naïve Bayes

2.3 Machine Learning Steps

The task of imparting intelligence to machines seems daunting and impossible. It can be
broken down into 7 major steps:

23
Data Collection

Data Processing

Model Selection

Training

Training Set Testing Set

Model Evaluation

Parameter Tuning

Model Prediction
and Deployment

Figure 2.9 Block Diagram of Machine Learning

2.3.1 Data Collection

Machines initially learn from data fed in them. It is of utmost importance to collect reliable
data so machine learning model can find the correct patterns. The quality of the data that is
fed to the machine will determine the accuracy of the model. It is essential to use data from a

24
reliable source, as it will directly affect the outcome of the model. Good data is relevant,
contains very few missing and repeated values, and has a good representation of the various
subcategories/classes present.

2.3.2 Data Processing

After data is secured, it has to be processed, which can be done by:

Putting together all the data and randomizing it. This helps make sure that data is evenly
distributed, and the ordering does not affect the learning process.

Cleaning the data to remove unwanted data, missing values, rows, and columns, duplicate
values, data type conversion, etc. The dataset may be restructured by changing the rows and
columns or index of rows and columns.

Visualizing the data to understand how it is structured and understand the relationship between
various variables and classes present.

Splitting the cleaned data into two sets - a training set and a testing set. The training set is the
set your model learns from. A testing set is used to check the accuracy of your model after
training.

2.3.3 Choosing a Model

A machine learning model determines the output you get after running a machine learning
algorithm on the collected data. It is important to choose a model which is relevant to the task
at hand. Over the years, scientists and engineers developed various models suited for
different tasks like speech recognition, image recognition, prediction, etc. Apart from this,
you also have to see if your model is suited for numerical or categorical data and choose
accordingly.

2.3.4 Training the Model

Training is the most important step in machine learning. In training, you pass the prepared
data to your machine learning model to find patterns and make predictions. It results in the
model learning from the data so that it can accomplish the task set. Over time, with training,
the model gets better at predicting.

2.3.5 Evaluating the Model

25
After training your model, you have to check to see how it’s performing. This is done by
testing the performance of the model on previously unseen data. The unseen data used is the
testing set that you split our data into earlier. If testing was done on the same data which is
used for training, you will not get an accurate measure, as the model is already used to the
data, and finds the same patterns in it, as it previously did. This will give you
disproportionately high accuracy.

2.3.6 Parameter Tuning

Once you have created and evaluated your model, see if its accuracy can be improved in any
way. This is done by tuning the parameters present in your model. Parameters are the
variables in the model that the programmer generally decides. At a particular value of your
parameter, the accuracy will be the maximum. Parameter tuning refers to finding these
values.

2.3.7 Predictions and deployment

Once the model is trained and optimized, it's ready to make predictions on new data. This
process involves feeding new data into the model and using the model's output for decision-
making or further analysis.

Deploying the model involves integrating it into a production environment where it can
process real-world data and provide real-time insights.

2.4 Internship Projects Implemented

During my internship, I embarked on projects that were not already built, but rather, as
interns, we built them from scratch. Under the guidance of experienced mentors, we took on
the challenge of conceptualizing and developing these projects, acquiring valuable skills and
knowledge along the way. These hands-on experiences allowed us to understand the entire
development lifecycle, explore various methodologies, utilize different machine learning
libraries and work collaboratively as a team. As a result, we not only created functional
models but also developed ourselves as capable and confident developers, ready to take on
more complex and innovative projects in the future. These experiences during my internship
were truly transformative, laying a strong foundation for my career in software development.
Just as expected, I was engaged in major and real-world projects during my internship at
nHub Foundation. Some of these Projects are as follows:

 House Price Prediction App

26
 Diabetes Detection Model

CHAPTER THREE

RESULTS AND ANALYSIS

3.1.1 House Price Prediction App

The "House Price Prediction" project focuses on predicting housing prices using machine
learning techniques. By leveraging popular Python libraries such as NumPy, Pandas, Scikit-
learn (sklearn), Matplotlib and Seaborn. this project provides an end-to-end solution for
accurate price estimation.

Project Overview
The "House Price Prediction" project aims to develop a model that can accurately predict
housing prices based on various features. This prediction task is of great significance in real
estate and finance, enabling informed decision-making for buyers, sellers, and investors. By
employing machine learning algorithms and a curated dataset, this project provides a
powerful tool for estimating house prices.

27
Figure 3.1 House Price Prediction App

Diabetes Detection Model

This project focuses on predicting weather a person has diabetes or not based on various
health-related attributes. It utilizes the Random Forest classification algorithm to make
predictions and provides a user-friendly interface for input and prediction.

Random forest is a commonly-used machine learning algorithm that combines the output of
multiple decision trees to reach a single result.

28
Figure 3.2 Diabetes Detection Model

29
CHAPTER FOUR

CONCLUSION AND RECOMMENDATION

4.2 Conclusion

Undertaking my six (6) months of industrial training at nHub Nigeria has been a
transformative and invaluable experience. Throughout this journey, I had the privilege of
immersing myself in the dynamic world of technology, learning from seasoned professionals,
and gaining hands-on experience in cutting-edge projects. This training has not only
deepened my technical expertise but has also fostered personal growth and a greater
understanding of the demands and opportunities in the technology industry.

During my internship at nHub, I had the chance to work on diverse projects such as House
Price Prediction, Face Recognition Model, each presenting unique challenges that tested my
problem-solving abilities and pushed me beyond my comfort zone. Collaborating with a
talented and supportive team exposed me to the power of teamwork and taught me the
importance of effective communication and collaboration.

I look forward to applying the lessons learned and skills acquired during my internship to
excel in my future endeavours. The foundation laid at nHub Nigeria will serve as a solid
platform for me to build a successful and fulfilling career in the world of technology.

4.2 Recommendation

i. Emphasis on Remote Work Skills: Given the prevalence of remote work in today's
world, educational institutions should focus on equipping students with remote work
skills. This includes effective communication, collaboration tools, time management,
and remote project management techniques.
ii. Focus on Data Science, ML and AI: Data science and artificial intelligence (AI) are
becoming increasingly important in various industries. Students should be
encouraged to explore data analytics, machine learning, and AI technologies, as these
skills are highly sought-after in the job market.

30
iii. Ethical Tech Development: With technology influencing various aspects of society,
students should be educated on the importance of ethical technology development.
Courses and discussions on privacy, data ethics, and responsible AI should be part of
the curriculum.
iv. Remote Internships and Virtual Opportunities: Encouraging and facilitating
remote internships and virtual opportunities allow students to gain practical
experience from the comfort of their homes. This approach also provides access to a
more diverse range of opportunities beyond their immediate geographic location.
v. Continuous Learning and Adaptability: Technology is constantly evolving, and
the ability to adapt and learn new skills is essential. Encouraging a culture of
continuous learning and adaptability will help students remain competitive and
relevant in the ever-changing tech industry.
vi. Collaboration with Industry Partners: Establishing partnerships with industry
leaders and organizations can provide students with access to real-world projects,
mentorship, and exposure to cutting-edge technologies.
vii. Soft Skills Development: In addition to technical skills, students should be
encouraged to develop soft skills such as communication, critical thinking, problem-
solving, and emotional intelligence. These skills are crucial for effective
collaboration and leadership in the workplace.

31
References

[ O. V. Tuomo Sipola, "Latest Trends in Artificial Intelligence Technology: A Scoping


1 Review," A Scoping Review of Current Developments in the Field of Machine Learning
] and Artificial Intelligence, p. 6, 7th May 2023.

[ Deepak Kataria, Anwar Walid, Mahmoud Daneshmand, Ashutosh Dutta, M.A. Enright,
2 Rentao Gu, Alex Lackpour, Prakash Ramachandran, Honggang Wang, Chi-Ming Chen,
] Baw Chng, Frederica Darema - Show less, "Artificial Intelligence And Machine
Learning," What is the current state of machine learning research and development in the
field of artificial intelligence, vol. 1, p. 47, 03 09 2022.

E. C. E. I. A. A. O. S. O. O. SamuelOsusha Loya1, "AWARENESS AND ADOPTION


READINESS OF MACHINE LEARNING TECHNOLOGY IN THE CONSTRUCTION
INDUSTRY OF A DEVELOPING COUNTRY: A CASE OF NIGERIA," Journal of
Engineering and Technology for Industrial Applications, vol. 10, pp. 50-62, 28 June 2024.

[ S. Tyagi, "INDUSTRIAL TRAINING REPORT ON MACHINE LEARNING," 2023.


4
]

[ "w3schools," [Online]. Available:


5 https://www.w3schools.com/python/numpy/numpy_intro.asp#:~:text=NumPy%20is%20a
] %20Python%20library,NumPy%20stands%20for%20Numerical%20Python.. [Accessed
07 2024].

[ "NVIDIA," [Online]. Available:


6 https://www.nvidia.com/en-us/glossary/pandas-python/#:~:text=What%20is%20Pandas
] %3F,use%20data%20analysis%20and%20manipulation.. [Accessed 07 2024].

[ "Codecademy," [Online]. Available: https://www.codecademy.com/article/scikit-


7 learn#:~:text=Scikit%2Dlearn%20is%20a%20library,NumPy%2C%20pandas%2C
] %20and%20Matplotlib!. [Accessed 2024].

32

You might also like