ML Viva Questions

Uploaded by

Shriganesh Bhandari

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF or read online on Scribd

0% found this document useful (0 votes)

327 views8 pages

ML Viva Questions

Uploaded by

Shriganesh Bhandari

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF or read online on Scribd

Ql » Applications of ML. Answer:. 1. Product recommendations: Machine learning is widely used by various e-commerce and entertainment companies such as Amazon, Netflix, etc., for product recommendation to the user. Traffic prediction: If we want to visit a new place, we take help of Google Maps, which shows us the correct path with the shortest route and predicts the traffic conditions. It predicts the traffic conditions such as whether traffic is cleared, slow-moving, or heavily congested with the help of two ways: Real Time location of the vehicle form Google Map app and sensors Average time has taken on past days at the same time. Speech Recognition While using Google, we get an option of "Search by voice,’ it comes under speech recognition, and it's a popular application of machine learning. Image Recognition: Image recognition is one of the most common applications of machine learning. It is used to identify objects, persons, places, digital images, etc. Social Media Features Social media platforms use machine learning algorithms and approaches to create some attractive and excellent features. Q2. Discuss Generalization Error. Answer: The generalization error of a machine learning model is the difference between the empirical loss of the training set and the expected loss of a test set. In practice, it is measured by the difference between the error of the training data and the one of the test data.Q3. What is Machine Learning? Answer: Machine Learning is defined as the capability of a machine to imitate intelligent human behavior. Artificial intelligence systems are used to perform complex tasks in a way that is similar to how humans solve [Link] is a subfield of artificial [Link] basically focuses on analyzing and interpreting patterns and structures in data to enable learning, reasoning, and decision making outside of human interaction. Q4. Write Types of ML. Answer:Types of Systems of Machine Learning There are different types of machine-learning systems. 1. Supervised 2. Unsupervised 3. Semi-supervised 4. Reinforcement Learning Q5. What is Linear Regression? Answer: Linear Regression is a machine learning algorithm based on supervised learning. It performs a regression task. Regression models a target prediction value based on independent variables. It is mostly used for finding out the relationship between variables and forecasting. Linear regression can be further divided into two types of the algorithm: 1. Simple Linear Regression: If a single independent variable is used to predict the value of a numerical dependent variable, then such a Linear Regression algorithm is called Simple Linear Regression. 2. Multiple Linear regression: If more than one independent variable is used to predict the value of a numerical dependent variable, then such a Linear Regression algorithm is called Multiple Linear Regression.Q 6. What is Regression? Answer: Regression is a technique for investigating the relationship between independent variables or features and a dependent variable or [Link] prices of a house given the features of house like size, price etc is one of the common examples of Regression. It is a supervised technique. Q7 what is Multivariate Linear Regression? Answer: Multivariate Regression is a supervised machine learning algorithm involving multiple data variables for analysis. Multivariate regression is an extension of multiple regression with one dependent variable and multiple independent variables. Q8. What is Logistic Regression? Answer: Logistic regression is one of the most popular Machine Learning algorithms, which comes under the Supervised Learning technique. e It is used for predicting the categorical dependent variable using a given set of independent variables. © Logistic regression predicts the output of a categorical dependent variable. Therefore the outcome must be a categorical or discrete value. Qg. what are Decision Trees? Answer: Decision trees classify instances by sorting them down the tree from the root to some leaf node, which provides the classification of the instance. Each node in the tree specifies a test of some attribute of the instance, and each branch descendingQl 0. What is the fullform of CART? Answer: CART stands for Classification And Regression [Link] algorithm is a classification algorithm for building a decision tree based on Gini's impurity index as a splitting criterion. CART is a binary tree built by splitting a node into two child nodes repeatedly. Qll. What is the ROC curve? Answer: ROC stands for receiver operating characteristic and it's a tool that is used with binary [Link] ROC curve is a graph showing the performance of a classification model at all classification thresholds. This curve plots two parameters: True Positive Rate, False Positive Rate. Ql 2. vescribe Kappa Statistics. Answer: Kappa Statistics is a measure of how closely the instances classified by the machine learning classifier matched the data labeled as ground truth, controlling for the accuracy of a random classifier as measured by the expected accuracy. Ql 3. Discuss F-measure. Answer: The F-score is a measure of a model's accuracy on a dataset. It is used to evaluate binary classification systems, which classify examples into ‘positive’ or ‘negative’ Ql 4. What is K-fold cross validation? Answer:Cross-validation is a resampling procedure used to evaluate machine learning models on a limited data sample. The procedure has a single parameter called k that refers to the number of groups that a given data sample is to be split into.Ql 5. Write definition of XGBoost. Answer: XgBoost stands for Extreme Gradient Boosting, which was proposed by the researchers at the University of Washington. It is a library written in C++ which optimizes the training for Gradient Boosting. XGBoost is an implementation of Gradient Boosted decision trees. XGBoost models majorly dominate in many Kaggle Competitions. Ql 6. piscuss Bagging. Answer: A Bagging classifier is an ensemble meta-estimator that fits base classifiers each on random subsets of the original dataset and then aggregate their individual predictions (either by voting or by averaging) to form a final [Link] reduces overfitting (variance) by averaging or voting, however, this leads to an increase in bias, which is compensated by the reduction in variance though. Ql 7. What is Random Forest? Answer: Every decision tree has high variance, but when we combine all of them together in parallel then the resultant variance is low as each decision tree gets perfectly trained on that particular sample data and hence the output doesn’t depend on one decision tree but multiple decision trees. Random Forest has multiple decision trees as base learning models. We randomly perform row sampling and feature sampling from the dataset forming sample datasets for every model. Ql 8. Describe Support Vector Machine SVM. Answer: SVM is a supervised machine learning algorithm which can be used for classification or regression problems. They are used in applications like handwriting recognition, intrusion detection, face detection, email classification, gene classification, and in web [Link] can handle both classification and regression on linear and non-linear data.Ql 9. write about Margins and support vectors. Answer: The objective of the support vector machine algorithm is to find a hyperplane in an N-dimensional space(N — the number of features) that distinctly classifies the data points. Q20. What is Quadratic Programming? Answer: Quadratic programming (QP) is the process of solving certain mathematical optimization problems involving quadratic functions. Q21. Basics of Kernel Trick. Answer: Kernel Trick is a simple method where a Non Linear data is projected onto a higher dimension space so as to make it easier to classify the data where it could be linearly divided by a plane. Q22. Write about SUpport Vector Regression. Answer: Support Vector Regression is a supervised learning algorithm that is used to predict discrete values. Support Vector Regression uses the same principle as the SVMs. The basic idea behind SVR is to find the best fit line. Q23. What is Clustering ? Answer: Clustering is the task of dividing the population or data points into a number of groups such that data points in the same groups are more similar to other data points in the same group and dissimilar to the data points in other groups. It is basically a collection of objects on the basis of similarity and dissimilarity between them.Q24. What is Multiclass Classification? Answer: multinomial classification is the problem of classifying instances into one of three or more classes Q2 5. Overview of distance metrics. Answer: Distance metrics are used in both supervised and unsupervised learning, generally to calculate the similarity between data points. Q26. What is Graph Based Clustering? Answer: Graph clustering is to group the vertices of a graph into clusters based on the graph structure and/or node [Link] clustering aims at partitioning a set of graphs into different groups that share some form of similarity. Q27. Describe DBSCAN. Answer:The DBSCAN algorithm is based on this intuitive notion of “clusters” and “noise”. The key idea is that for each point of a cluster, the neighborhood of a given radius has to contain at least a minimum number of points. DBSCAN algorithm requires two parameters: 1. eps: e It defines the neighborhood around a data point ie. if the distance between two points is lower or equal to ‘eps’ then they are considered neighbors. 2. MinPts: Minimum number of neighbors (data points) within eps radius. Larger the dataset, the larger value of MinPts must be chosen.Q28. what is Model based Clustering ? Answer:Model-based clustering is a statistical approach to data clustering. The observed (multivariate) data is assumed to have been generated from a finite mixture of component models. Each component model is a probability distribution, typically a parametric multivariate distribution. Q29. What is Dimensionality Reduction ? Answer:Dimensionality reduction is a machine learning (ML) or statistical technique of reducing the amount of random variables in a problem by obtaining a set of principal [Link] Component Analysis (PCA),Backward Feature Elimination,Forward Feature Selection Missing Value Ratio are some of the many dimension reduction techniques . Q3 0. Discuss Principal Component Analysis. Answer:Principal Component Analysis is a statistical process that converts the observations of correlated features into a set of linearly uncorrelated features with the help of orthogonal transformation. e These new transformed features are called the Principal Components. e Itis one of the popular tools that is used for exploratory data analysis and predictive modeling. Q31. What is Singular Value Decomposition ? Answer:The Singular Value Decomposition (SVD) of a matrix is a factorization of that matrix into three matrices. It has some interesting algebraic properties and conveys important geometrical and theoretical insights about linear transformations.

Data Mining Insights for Analysts
No ratings yet
Data Mining Insights for Analysts
43 pages
FDS Unit1 Part1
No ratings yet
FDS Unit1 Part1
57 pages
Data Visualization Notes
No ratings yet
Data Visualization Notes
4 pages
CISC 867: Deep Learning Assignment #1: K J Net
No ratings yet
CISC 867: Deep Learning Assignment #1: K J Net
3 pages
All Units CCS334 Big Data Analytics Q&A Unitwise
No ratings yet
All Units CCS334 Big Data Analytics Q&A Unitwise
20 pages
PHD Test Question Paper 2018 Electronics Comm Engg
No ratings yet
PHD Test Question Paper 2018 Electronics Comm Engg
24 pages
Interpolation in Digital Signal Processing
No ratings yet
Interpolation in Digital Signal Processing
7 pages
Application of 8051 Microcontroller
100% (1)
Application of 8051 Microcontroller
8 pages
PPS Course Material
100% (1)
PPS Course Material
177 pages
Week 7 Solution
100% (1)
Week 7 Solution
4 pages
21it6203-Knowledge Engineering Laboratory
No ratings yet
21it6203-Knowledge Engineering Laboratory
32 pages
UNIT 4 - Perceptron and DL
No ratings yet
UNIT 4 - Perceptron and DL
39 pages
Embedded System Design Overview
No ratings yet
Embedded System Design Overview
67 pages
AI Midterm Exam - Helwan University
No ratings yet
AI Midterm Exam - Helwan University
2 pages
JTG SQA Online Assessment Prep
No ratings yet
JTG SQA Online Assessment Prep
4 pages
SOFT COMPUTING 2 Marks With Answer
No ratings yet
SOFT COMPUTING 2 Marks With Answer
13 pages
Theory of Computation Question Bank
No ratings yet
Theory of Computation Question Bank
44 pages
HPC - Model - Paper
No ratings yet
HPC - Model - Paper
3 pages
Distributed File Systems Overview
No ratings yet
Distributed File Systems Overview
49 pages
Assignment of ML
No ratings yet
Assignment of ML
5 pages
Digital Filter Structures Guide
No ratings yet
Digital Filter Structures Guide
32 pages
BDA Syllabus - Sem VII - Mumbai University
No ratings yet
BDA Syllabus - Sem VII - Mumbai University
3 pages
Fraud Detection in Credit Card Transactions
No ratings yet
Fraud Detection in Credit Card Transactions
14 pages
Arithmetic Mean Filter Geometric Mean Filter Harmonic Mean Filter Contraharmonic Mean Filter
No ratings yet
Arithmetic Mean Filter Geometric Mean Filter Harmonic Mean Filter Contraharmonic Mean Filter
18 pages
Machine Learning PYQ 2022 Ans
No ratings yet
Machine Learning PYQ 2022 Ans
17 pages
UNIT5
No ratings yet
UNIT5
44 pages
CS771A End-Sem Exam Paper 2016
100% (1)
CS771A End-Sem Exam Paper 2016
8 pages
Artificial Intelligence Questions and Answers - Neural Networks - 1
No ratings yet
Artificial Intelligence Questions and Answers - Neural Networks - 1
4 pages
Module-02 AIML NOTES
No ratings yet
Module-02 AIML NOTES
29 pages
Probabilistic Generative Models
No ratings yet
Probabilistic Generative Models
29 pages
Concept Learning
No ratings yet
Concept Learning
85 pages
Placement Preparation Tasks For AI, ML
No ratings yet
Placement Preparation Tasks For AI, ML
4 pages
Rabin-Karp Algorithm For Pattern Searching: Examples
No ratings yet
Rabin-Karp Algorithm For Pattern Searching: Examples
5 pages
Challenges in Deep Learning Optimization
No ratings yet
Challenges in Deep Learning Optimization
26 pages
Machine Learning Lab Viva Questions
100% (1)
Machine Learning Lab Viva Questions
4 pages
Bput Coa
No ratings yet
Bput Coa
2 pages
IOT Enabling Technologies
No ratings yet
IOT Enabling Technologies
5 pages
BDA Assignments
No ratings yet
BDA Assignments
5 pages
Deep Learning NPTEL Syllabus
No ratings yet
Deep Learning NPTEL Syllabus
1 page
ML Unit-4
No ratings yet
ML Unit-4
40 pages
Data Mining Basics for Beginners
No ratings yet
Data Mining Basics for Beginners
20 pages
Concepts - of - Machine - Learning (Minor)
No ratings yet
Concepts - of - Machine - Learning (Minor)
14 pages
Soft Computing vs Hard Computing
No ratings yet
Soft Computing vs Hard Computing
60 pages
Aiml Unit 4
No ratings yet
Aiml Unit 4
26 pages
Machine Learning
No ratings yet
Machine Learning
14 pages
Hardware Security Presentation
No ratings yet
Hardware Security Presentation
46 pages
Deep Learning (BCS714A)
No ratings yet
Deep Learning (BCS714A)
29 pages
Python ML and Clustering Quiz Answers
No ratings yet
Python ML and Clustering Quiz Answers
2 pages
Unsupervised Learning Notes
No ratings yet
Unsupervised Learning Notes
4 pages
Mtech RM & IPR I YEAR I SEM I MID IMP QUS WITH ANS
No ratings yet
Mtech RM & IPR I YEAR I SEM I MID IMP QUS WITH ANS
12 pages
Sigmoid Function: Soft Computing Assignment
100% (1)
Sigmoid Function: Soft Computing Assignment
12 pages
Key Concepts in Mobile Communication
No ratings yet
Key Concepts in Mobile Communication
5 pages
BDA University Question Paper
No ratings yet
BDA University Question Paper
10 pages
Experiment-7: Implementation of K-Means Clustering Algorithm
No ratings yet
Experiment-7: Implementation of K-Means Clustering Algorithm
3 pages
Key Concepts for MU IT Exam 2023
No ratings yet
Key Concepts for MU IT Exam 2023
23 pages
IAT-I Question Paper With Solution of 18CS71 Artificial Intelligence and Machine Learning Oct-2022-Dr. Paras Nath Singh
No ratings yet
IAT-I Question Paper With Solution of 18CS71 Artificial Intelligence and Machine Learning Oct-2022-Dr. Paras Nath Singh
7 pages
Unit 2 Fod
No ratings yet
Unit 2 Fod
27 pages
Understanding Simpson's Paradox in Data Science
No ratings yet
Understanding Simpson's Paradox in Data Science
61 pages
What Are The Types of Machine Learning?
100% (1)
What Are The Types of Machine Learning?
24 pages
Mubbashir Assignment ML
No ratings yet
Mubbashir Assignment ML
10 pages

ML Viva Questions

Uploaded by

ML Viva Questions

Uploaded by

You might also like