1. What is Machine Learning?
Machine Learning is a branch of artificial intelligence (AI) that enables
systems to automatically learn from data and improve their performance over time
without being explicitly programmed. It involves algorithms that identify patterns in
data, allowing systems to make predictions or decisions based on past experiences.
2. Mention the differences between Data Mining and Machine Learning.
The key differences between Data Mining and Machine Learning are:
Purpose:
Data Mining focuses on discovering patterns, relationships, and insights from large
datasets, typically used for data analysis and decision support.
Machine Learning aims to develop algorithms that allow systems to learn from data
and make predictions or decisions, enabling automation and predictive analytics.
Process:
Data Mining is often a manual or semi-automated process that involves cleaning,
transforming, and analyzing data using statistical methods.
Machine Learning automates the learning process by training models to recognize
patterns and make decisions with minimal human intervention.
Output:
Data Mining results in insights and patterns that can be interpreted by humans for
decision-making.
Machine Learning produces models that can make predictions or decisions
autonomously based on new data.
Dependency:
Data Mining can be a stand-alone process that may or may not use machine learning.
Machine Learning often uses data mining as a preliminary step to gather and
preprocess data for model training.
3. What are ‘Overfitting’ and ‘Underfitting’ in Machine Learning?
In Machine Learning:
Overfitting occurs when a model learns the training data too well, capturing noise
and specific patterns that do not generalize to new data. This leads to high accuracy
on training data but poor performance on unseen data.
Underfitting happens when a model is too simple to capture the underlying patterns
in the data, resulting in poor performance on both the training and test data. This
indicates that the model hasn’t learned enough from the data.
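As a minimal sketch of both failure modes, the snippet below fits polynomials of different degrees to noisy synthetic data with NumPy; the degrees, noise level, and sample size are arbitrary choices for illustration. The degree-1 fit underfits (high error everywhere), while the degree-14 fit tracks the noise and overfits (low training error, high test error).

```python
import numpy as np

rng = np.random.default_rng(0)

# Noisy samples of a smooth underlying function.
x_train = np.linspace(0, 1, 15)
y_train = np.sin(2 * np.pi * x_train) + rng.normal(0, 0.2, x_train.size)
x_test = np.linspace(0, 1, 100)
y_test = np.sin(2 * np.pi * x_test)

for degree in (1, 3, 14):  # underfit, reasonable fit, overfit
    coeffs = np.polyfit(x_train, y_train, degree)
    train_err = np.mean((np.polyval(coeffs, x_train) - y_train) ** 2)
    test_err = np.mean((np.polyval(coeffs, x_test) - y_test) ** 2)
    print(f"degree {degree:2d}: train MSE {train_err:.3f}, test MSE {test_err:.3f}")
```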
4. Define Logistic regression
Logistic Regression is a statistical method used in machine learning for binary
classification tasks, where the goal is to predict the probability that a given input
belongs to one of two possible classes. It models the relationship between input
features and the probability of a specific outcome using a logistic (sigmoid) function,
which outputs values between 0 and 1. Logistic regression is particularly useful for
predicting binary outcomes, such as "yes/no," "spam/not spam," or "disease/no
disease."
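A minimal sketch with scikit-learn, using a made-up one-feature binary dataset, showing how the fitted sigmoid turns an input into a probability between 0 and 1:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Toy "spam/not spam" style data: one feature, binary label (values made up).
X = np.array([[0.5], [1.0], [1.5], [3.0], [3.5], [4.0]])
y = np.array([0, 0, 0, 1, 1, 1])

model = LogisticRegression()
model.fit(X, y)

# predict_proba returns P(class 0) and P(class 1) via the sigmoid.
print(model.predict_proba([[2.0]]))  # probabilities between 0 and 1
print(model.predict([[2.0]]))        # hard 0/1 class label
```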
5. Why does overfitting happen?
Overfitting happens when a machine learning model is too complex relative to
the amount and quality of data available, resulting in the model capturing noise and
specific patterns in the training data that don’t generalize to new data.
Key reasons for overfitting include:
Excessive Model Complexity
Insufficient Training Data
Lack of Regularization
High Variability in Data
6. How can you avoid overfitting?
To avoid overfitting in machine learning models, several strategies can be
employed:
Simplify the Model
Regularization
Cross-Validation
Train with More Data
Early Stopping
Data Augmentation
Feature Selection
Use Dropout
Implementing these strategies can help improve a model's ability to generalize to
unseen data and reduce the likelihood of overfitting.
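As one concrete illustration, the sketch below combines two of these strategies, regularization (a Ridge penalty) and cross-validation, on synthetic data; the alpha values and polynomial degree are arbitrary choices for illustration.

```python
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures

rng = np.random.default_rng(1)
X = rng.uniform(0, 1, (40, 1))
y = np.sin(2 * np.pi * X[:, 0]) + rng.normal(0, 0.2, 40)

# A high-degree polynomial fit would overfit; the Ridge penalty (alpha)
# shrinks the coefficients, and cross-validation checks generalization.
for alpha in (1e-6, 1e-2, 1e2):  # tiny alpha ~ almost no regularization
    model = make_pipeline(PolynomialFeatures(degree=10), Ridge(alpha=alpha))
    scores = cross_val_score(model, X, y, cv=5, scoring="neg_mean_squared_error")
    print(f"alpha={alpha}: mean CV MSE {-scores.mean():.3f}")
```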
7. What are five popular algorithms in Machine Learning?
Here are five popular algorithms in machine learning:
Linear Regression: models a continuous target as a linear combination of input features.
Logistic Regression: predicts the probability of a binary outcome using the sigmoid function.
Decision Trees: split the data on feature values to form an interpretable tree of decision rules.
Support Vector Machines (SVM): find the separating hyperplane that maximizes the margin between classes.
K-Nearest Neighbors (KNN): predicts a label from the majority vote (or average) of the closest training points.
These algorithms are foundational in machine learning and serve various applications
across different domains.
8. What are the different algorithm techniques in Machine Learning?
In machine learning, various algorithm techniques can be broadly categorized
based on their learning style and application. Here are the primary techniques:
Supervised Learning:
Involves training a model on labeled data, where the input-output pairs are
provided.
Examples:
Linear Regression
Logistic Regression
Decision Trees
Support Vector Machines (SVM)
Random Forest
Neural Networks
Unsupervised Learning:
Involves training a model on unlabeled data to discover underlying patterns or
groupings.
Examples:
K-Means Clustering
Hierarchical Clustering
Principal Component Analysis (PCA)
Autoencoders
t-SNE (t-distributed Stochastic Neighbor Embedding)
Semi-Supervised Learning:
Combines both labeled and unlabeled data during training, leveraging the
small amount of labeled data to improve learning from the larger unlabeled set.
Reinforcement Learning:
Involves training an agent to make decisions by taking actions in an
environment to maximize cumulative reward. The model learns through trial and
error.
Examples:
Q-Learning
Deep Q-Networks (DQN)
Proximal Policy Optimization (PPO)
Ensemble Learning:
Combines multiple models to improve performance and robustness over individual
models. It aims to leverage the strengths of different algorithms.
Examples:
Bagging (e.g., Random Forest)
Boosting (e.g., AdaBoost, Gradient Boosting)
Stacking
Deep Learning:
A subfield of machine learning that uses neural networks with many layers (deep
neural networks) to model complex patterns in large datasets.
Examples:
Convolutional Neural Networks (CNNs) for image processing
Recurrent Neural Networks (RNNs) for sequence data
Transformers for natural language processing tasks
These techniques can be applied to a wide range of problems, from image
classification and speech recognition to recommendation systems and predictive
analytics.
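To make the supervised/unsupervised distinction above concrete, here is a minimal unsupervised example: K-Means clustering on synthetic, unlabeled points. The cluster centers and sample counts are made up for illustration.

```python
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(2)
# Unlabeled points drawn around two centers; no labels are given to the model.
X = np.vstack([rng.normal(0, 0.5, (20, 2)), rng.normal(5, 0.5, (20, 2))])

kmeans = KMeans(n_clusters=2, n_init=10, random_state=0)
labels = kmeans.fit_predict(X)
print(labels)                   # cluster assignments discovered from data alone
print(kmeans.cluster_centers_)  # learned group centers
```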
9. What are the three stages to build the hypothesis or model in machine learning?
The three stages to build a hypothesis or model in machine learning typically
include:
Model Training:
In this stage, the machine learning algorithm learns from the training data. The
dataset is divided into features (inputs) and labels (outputs), and the model uses this
data to find patterns and relationships.
During training, the model's parameters are adjusted to minimize the error in
predictions using techniques like gradient descent or backpropagation (for neural
networks).
Model Validation:
After training, the model is evaluated on a separate validation dataset to assess
its performance. This step helps to ensure that the model generalizes well to unseen
data and is not overfitting to the training set.
Various metrics (like accuracy, precision, recall, F1 score, etc.) are used to evaluate
the model's performance on the validation set. Hyperparameters may also be tuned
during this stage.
Model Testing:
Finally, the model is tested on an independent test dataset to provide an
unbiased evaluation of its performance. This step assesses how well the model can
make predictions on new, unseen data.
The results from the testing phase provide insights into the model's effectiveness and
can help determine whether it is suitable for deployment in real-world applications.
These stages are crucial for developing robust and effective machine learning models
that perform well in practical scenarios.
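A minimal sketch of the three stages with scikit-learn, using the built-in iris dataset and a KNN classifier as stand-ins; the split ratios and candidate k values are arbitrary choices for illustration.

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

X, y = load_iris(return_X_y=True)

# Split once into train+validation and test, then again to carve out validation.
X_trainval, X_test, y_trainval, y_test = train_test_split(X, y, test_size=0.2, random_state=0)
X_train, X_val, y_train, y_val = train_test_split(X_trainval, y_trainval, test_size=0.25, random_state=0)

# Stage 1: training. Stage 2: validation, used here to tune a hyperparameter.
best_k, best_acc = None, 0.0
for k in (1, 3, 5, 7):
    model = KNeighborsClassifier(n_neighbors=k).fit(X_train, y_train)
    acc = model.score(X_val, y_val)
    if acc > best_acc:
        best_k, best_acc = k, acc

# Stage 3: testing, an unbiased check on data never used for fitting or tuning.
final = KNeighborsClassifier(n_neighbors=best_k).fit(X_trainval, y_trainval)
print(f"chosen k={best_k}, test accuracy={final.score(X_test, y_test):.3f}")
```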
10. What is the standard approach to supervised learning?
The standard approach to supervised learning typically involves the following
steps:
Data Collection: gather a dataset relevant to the prediction task.
Data Preprocessing: clean the data, handle missing values, and scale or encode features.
Feature Selection/Engineering: choose or construct the features most informative for the target.
Choosing a Model: select an algorithm suited to the task and the data.
Model Training: fit the model's parameters to the labeled training data.
Model Validation: tune hyperparameters and check generalization on a validation set.
Model Evaluation: measure final performance on a held-out test set.
Model Deployment: integrate the trained model into a production system.
Monitoring and Maintenance: track performance over time and retrain as the data changes.
11. What are ‘Training set’ and ‘Test set’?
In machine learning, a training set and a test set are two distinct subsets of a
dataset used for developing and evaluating models. Here’s what each term means:
Training Set
Definition: The training set is a subset of the dataset used to train a machine learning
model. It contains input data along with the corresponding labels (outputs).
Purpose: The model learns from this data by identifying patterns and relationships
between the features (inputs) and the labels (outputs). It adjusts its parameters during
this phase to minimize prediction errors.
Example: If you are building a model to classify emails as spam or not spam, the
training set would consist of a large number of emails along with their labels
indicating whether each email is spam or not.
Test Set
Definition: The test set is a separate subset of the dataset that is not used during the
training phase. It is used to evaluate the model's performance after it has been trained.
Purpose: The test set helps assess how well the trained model generalizes to new,
unseen data. It provides an unbiased evaluation of the model's predictive capability.
Example: Continuing with the email classification example, the test set would consist
of a different set of emails that the model has not seen before, along with their actual
spam/not spam labels. The model's predictions on this set are compared to the actual
labels to measure performance metrics like accuracy, precision, and recall.
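A minimal sketch with scikit-learn showing the split and the held-out evaluation; the synthetic dataset stands in for a real labeled corpus such as emails.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, precision_score, recall_score
from sklearn.model_selection import train_test_split

# Synthetic stand-in for a labeled "spam/not spam" dataset.
X, y = make_classification(n_samples=500, n_features=10, random_state=0)

# The training set fits the model; the held-out test set evaluates it.
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=0)
model = LogisticRegression(max_iter=1000).fit(X_train, y_train)

y_pred = model.predict(X_test)
print("accuracy:", accuracy_score(y_test, y_pred))
print("precision:", precision_score(y_test, y_pred))
print("recall:", recall_score(y_test, y_pred))
```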
12. What is the difference between artificial intelligence and machine learning?
The terms "artificial intelligence" (AI) and "machine learning" (ML) are often
used interchangeably, but they refer to different concepts within the field of computer
science. Here are the key differences:
Definition: AI is a broad field focused on creating intelligent systems; ML is a subset of AI focused on learning from data.
Scope: AI encompasses various techniques, including ML, NLP, and more; ML involves algorithms and statistical models.
Goal: AI aims to mimic human cognitive functions and perform intelligent tasks; ML aims to build models that learn and improve from data.
Examples: AI includes chatbots, autonomous vehicles, and recommendation systems; ML includes decision trees, neural networks, and clustering algorithms.
In summary, while machine learning is an essential component of artificial
intelligence, AI encompasses a wider range of technologies and approaches aimed at
simulating human-like intelligence.
13. What is Bias and what are the types of Bias?
Bias in machine learning refers to systematic errors that result from incorrect
assumptions in the learning algorithm. It can lead to inaccuracies in the model's
predictions and affect its performance. Bias can manifest in various forms, often
influencing how well a model generalizes to unseen data.
Types of Bias
Algorithmic Bias: systematic error introduced by the design or assumptions of the algorithm itself.
Sample Bias: the training data does not represent the population the model will be applied to.
Confirmation Bias: data is collected or interpreted in ways that favor pre-existing beliefs.
Measurement Bias: features or labels are measured or recorded with systematic error.
Selection Bias: the way data is selected for training favors certain samples over others.
Exclusion Bias: relevant samples or features are dropped during data preparation.
Label Bias: labels are assigned inconsistently or reflect annotators' subjective judgments.
14. What is the key difference between supervised and unsupervised machine
learning?
The key difference between supervised and unsupervised machine learning lies
in the presence or absence of labeled data during the training process. Here’s a
breakdown of the differences:
Data Type: supervised learning uses labeled data (input-output pairs); unsupervised learning uses unlabeled data (inputs only).
Objective: supervised learning predicts outcomes for new data; unsupervised learning discovers patterns and structures in the data.
Applications: supervised learning covers classification and regression; unsupervised learning covers clustering, anomaly detection, and association.
Example Algorithms: supervised learning uses decision trees, SVM, and neural networks; unsupervised learning uses K-means, PCA, and hierarchical clustering.
15. What is Linear Regression?
Linear Regression is a fundamental statistical method used in machine
learning and data analysis to model the relationship between a dependent variable
(also called the target variable) and one or more independent variables (also known as
features or predictors). It is primarily used for predictive modeling and understanding
relationships among variables.
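A minimal sketch with scikit-learn, fitting a line to made-up (size, price)-style pairs and reading off the learned slope and intercept:

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# One predictor (e.g., a size) and a continuous target (e.g., a price); values made up.
X = np.array([[50], [70], [90], [110], [130]])
y = np.array([150, 200, 260, 305, 360])

model = LinearRegression().fit(X, y)
print("slope:", model.coef_[0], "intercept:", model.intercept_)
print("prediction for input 100:", model.predict([[100]])[0])
```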
16. What are the disadvantages of the linear regression model?
While linear regression is a widely used and powerful statistical tool, it has
several disadvantages and limitations.
Here are some of the key disadvantages:
Assumption of Linearity: real-world relationships are often non-linear.
Sensitivity to Outliers: a few extreme points can strongly distort the fitted line.
Multicollinearity: highly correlated predictors make coefficient estimates unstable.
Assumption of Independence: errors are assumed independent, which data such as time series often violate.
Homoscedasticity Assumption: assumes constant error variance across all input values.
Limited Capacity: cannot capture complex patterns or feature interactions on its own.
Overfitting with High-Dimensional Data: many features relative to samples can cause overfitting without regularization.
Interpretation Challenges: coefficients are hard to interpret when predictors are correlated or on different scales.
No Handling of Categorical Variables: categorical inputs must be encoded numerically before use.
Lack of Flexibility: a single global linear fit cannot adapt to local structure in the data.
17. What is the difference between classification and regression?
Classification and regression are both types of supervised learning tasks in
machine learning, but they differ fundamentally in their objectives, output types, and
applications. Here are the key differences between the two:
Summary of Differences
Definition: classification predicts categorical labels; regression predicts continuous values.
Output Type: classification yields discrete class labels; regression yields continuous numerical values.
Examples: classification covers spam detection and image recognition; regression covers house price prediction and temperature forecasting.
Common Algorithms: classification uses logistic regression and decision trees; regression uses linear regression and polynomial regression.
Evaluation Metrics: classification uses accuracy, precision, and recall; regression uses MAE, MSE, and R-squared.
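A minimal sketch of the contrast using scikit-learn decision trees, one classifier and one regressor, on built-in datasets:

```python
from sklearn.datasets import load_diabetes, load_iris
from sklearn.tree import DecisionTreeClassifier, DecisionTreeRegressor

# Classification: the output is a discrete class label.
Xc, yc = load_iris(return_X_y=True)
clf = DecisionTreeClassifier(random_state=0).fit(Xc, yc)
print("class label:", clf.predict(Xc[:1]))      # a category index

# Regression: the output is a continuous numerical value.
Xr, yr = load_diabetes(return_X_y=True)
reg = DecisionTreeRegressor(random_state=0).fit(Xr, yr)
print("continuous value:", reg.predict(Xr[:1]))
```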
18. What is cross-validation?
Cross-validation is a statistical technique used to assess the performance and
generalizability of a machine learning model. It involves partitioning a dataset into
multiple subsets (folds) and using these subsets to train and validate the model. This
approach helps to ensure that the model performs well not just on the training data but
also on unseen data.
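A minimal sketch of 5-fold cross-validation with scikit-learn on the built-in iris dataset:

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

X, y = load_iris(return_X_y=True)

# 5-fold cross-validation: train on 4 folds, validate on the 5th, rotate.
scores = cross_val_score(LogisticRegression(max_iter=1000), X, y, cv=5)
print("per-fold accuracy:", scores)
print("mean accuracy:", scores.mean())
```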
19. What is Bayesian Linear Regression, and what are its advantages?
Bayesian Linear Regression is a statistical method that extends traditional linear
regression by incorporating Bayesian inference. In this approach, instead of
estimating fixed coefficients, Bayesian linear regression treats the model parameters
as random variables and uses probability distributions to represent the uncertainty
about these parameters.
Key Features of Bayesian Linear Regression:
Prior Distribution: Specifies beliefs about the parameters before observing any data.
Likelihood: Represents the probability of the observed data given the model
parameters.
Posterior Distribution: Combines the prior distribution and the likelihood using
Bayes' theorem to update beliefs about the parameters after observing the data.
Advantages of Bayesian Linear Regression:
Uncertainty Quantification: produces distributions over parameters and predictions, not just point estimates.
Incorporation of Prior Knowledge: priors allow domain knowledge to inform the model.
Regularization: the prior acts as a natural regularizer, reducing overfitting.
Flexibility: priors and likelihoods can be chosen to suit the problem.
Prediction Intervals: predictions come with credible intervals rather than single values.
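A minimal sketch using scikit-learn's BayesianRidge, whose predict(..., return_std=True) option exposes the per-prediction uncertainty described above; the synthetic data and true coefficients are made up for illustration.

```python
import numpy as np
from sklearn.linear_model import BayesianRidge

rng = np.random.default_rng(3)
X = rng.uniform(0, 10, (50, 1))
y = 2.0 * X[:, 0] + 1.0 + rng.normal(0, 1.0, 50)  # noisy line, slope 2, intercept 1

model = BayesianRidge()
model.fit(X, y)

# return_std=True yields a standard deviation alongside each prediction,
# i.e. the uncertainty quantification mentioned above.
mean, std = model.predict([[5.0]], return_std=True)
print(f"prediction: {mean[0]:.2f} +/- {std[0]:.2f}")
```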
20. Define Inductive Bias.
Inductive Bias refers to the set of assumptions that a machine learning algorithm
makes to predict outputs for inputs it has not encountered during training. It is the
preference of a learning algorithm for a particular type of hypothesis or model when
generalizing from a training dataset to unseen data. Inductive bias is crucial for the
learning process because it guides the algorithm in making predictions beyond the
observed data.
Key Points about Inductive Bias:
Role in Learning:
Inductive bias enables the algorithm to generalize from specific training examples to
broader conclusions. Without some form of bias, learning algorithms would struggle
to make predictions, as they would have no basis for inferring unseen data.
Types of Inductive Bias:
Different algorithms have different inductive biases. For example:
Linear Regression assumes a linear relationship between input and output variables.
Decision Trees assume that data can be split into subsets based on feature values.
Neural Networks assume that the underlying function can be approximated using
layers of interconnected nodes.
21. Define Linear Algebra and its applications in Machine Learning.
Linear Algebra is a branch of mathematics that deals with vectors, vector
spaces, matrices, and linear transformations. It provides the foundational concepts and
operations used to manipulate and analyze linear equations and systems. Linear
algebra is essential in many fields, including physics, engineering, computer science,
and machine learning.
Applications of Linear Algebra in Machine Learning:
Data Representation: datasets are stored and manipulated as vectors and matrices.
Linear Transformations: operations such as projection, rotation, and scaling of feature vectors.
Optimization: gradients and parameter updates are expressed as vector and matrix operations.
Solving Systems of Equations: for example, the normal equations behind linear regression.
Machine Learning Algorithms: methods such as PCA, SVMs, and neural networks are built on matrix computations.
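A minimal sketch of these ideas with NumPy; the matrices, weights, and targets are made up for illustration.

```python
import numpy as np

# Data representation: a dataset as a matrix (rows = samples, columns = features).
X = np.array([[1.0, 2.0], [2.0, 0.0], [3.0, 1.0]])
w = np.array([0.5, -1.0])  # model weights as a vector

# Linear transformation: predictions as a matrix-vector product.
y_hat = X @ w
print(y_hat)

# Solving a system of equations: the least-squares solution to X w = y
# (the normal equations behind linear regression).
y = np.array([1.0, 2.0, 2.5])
w_fit, *_ = np.linalg.lstsq(X, y, rcond=None)
print(w_fit)
```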
22. What is a Hypothesis?
In the context of machine learning and statistics, a hypothesis refers to a
proposed explanation or model that describes the relationship between input
variables (features) and output variables (targets). It represents a specific
assumption or prediction about the underlying data and is a fundamental
concept in the process of building and evaluating machine learning models.
23. What are the types of cross-validation?
Types of Cross-Validation:
k-Fold Cross-Validation: The most common method, where the dataset is divided
into k equal parts (folds). The model is trained k times, each time using a
different fold as the validation set and the remaining folds as the training set.
Stratified k-Fold Cross-Validation: Similar to k-fold, but ensures that each fold has
the same proportion of class labels as the original dataset. This is particularly useful
for imbalanced datasets.
Leave-One-Out Cross-Validation (LOOCV): A special case of k-fold where k
equals the number of data points in the dataset. Each training set is created by leaving
out one data point for validation.
Group k-Fold: Used when the dataset contains groups of related samples. It ensures
that the same group is not represented in both the training and validation sets.
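A minimal sketch of these variants using scikit-learn's splitter classes; the toy arrays, labels, and groups are made up for illustration.

```python
import numpy as np
from sklearn.model_selection import GroupKFold, KFold, LeaveOneOut, StratifiedKFold

X = np.arange(20).reshape(10, 2)
y = np.array([0, 0, 0, 0, 0, 1, 1, 1, 1, 1])
groups = np.array([0, 0, 1, 1, 2, 2, 3, 3, 4, 4])

kf = KFold(n_splits=5)             # plain k-fold
skf = StratifiedKFold(n_splits=5)  # preserves class proportions per fold
loo = LeaveOneOut()                # k = number of samples
gkf = GroupKFold(n_splits=5)       # keeps each group on a single side of the split

for name, cv in [("k-fold", kf), ("stratified", skf), ("group", gkf)]:
    train_idx, val_idx = next(iter(cv.split(X, y, groups)))
    print(name, "first validation fold:", val_idx)
print("LOOCV folds:", loo.get_n_splits(X))  # one fold per sample
```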