0% found this document useful (0 votes)

142 views

Data Mining Methods Basics

The document discusses various concepts in data mining including: - Data mining involves extracting useful unknown information from data to make knowledge-driven business decisions. - Common data mining techniques include predictive modeling, decision trees, regression analysis, and clustering. - Data preprocessing steps like detecting missing values and outliers are important for data analysis. - Unsupervised learning infers structure from unlabeled data, like clustering.

Uploaded by

Lynch George

Available Formats

Download as TXT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

142 views

Data Mining Methods Basics

Uploaded by

Lynch George

Available Formats

Download as TXT, PDF, TXT or read online on Scribd

You are on page 1/ 2

Data mining methods basics

The process of extracting valid, useful, unknown info from data and using it to
make proactive knowledge driven business is called --data mining

Which of the following is not applicable to Data Mining?-- Involves working with
known information

What is the other name for Data Preparation stage of Knowledge Discovery Process?
-- ETL

Which of the following modelling type should be used for Labelled data? --
Predictive Modelling

Which of the following activities is performed as part of data pre processing? --

detect missing values

Which of the following role is responsible for performing validation on analysis

datasets? -- Statisticians

Noisy values are the values that are valid for the dataset, but are incorrectly
recorded - true

What is the type of learning where a function is inferred to describe hidden

structure from unlabeled data - unsupervised

Probability of theft in an area is 0.03 with expected loss of 20% or 30% of things
with probabilities 0.55 and 0.45. Insurance policy from A costs $150 pa with 100%
repayment. Policy with B, costs $100 pa and first $500 of any loss has to be paid
by the owner. Which data mining technique can be used to choose the policy? ==
decision tree

Statistical technique used for investigating and modelling the relationship between
two or more variables is: -- Regression analysis

Which statistical technique deals with finding a structure in a collection of

unlabeled data? -- clustering

If time is used as an independent variable in a simple linear regression analysis,

which of the following assumptions could be violated? -- Successive observations of
the dependent variable are uncorrelated

---------

Simulations are carried out to develop a mathematical model of the process -- false

_________ are the values that mark the boundaries of the confidence interval. --
Confidence limits

Which of the following activities are performed as part of data pre processing? --
detecting outliers Data Cleansing * all options, Data Cleansing is wrong

Which of the following are Multi-class Classification problem? -- Will Indian

Cricket team win the next World Cup? Will it be a Rainy day or Sunny day
tomorrow? * should we gift, Will Indian Cricket team win the next World Cup? is
wrong answer

Which data mining method groups together objects that are similar to each other and
dissimilar to the other objects? -- clustering
Regression is typically carried out to develop a mathematical model of the process
-- true

Which is the statistical technique used for investigating and modelling the
relationship between two or more variables? -- regression

Machine learning task of inferring a function from labelled training data is known
as - supervised

Associate rule is known as _____________ -- affinity analysis

If time is used as an independent variable in a simple linear regression, which of

the following assumption could be violated? - Residual variation is same for all
fitted values of dependent variable * is wrong

Powershell
No ratings yet
Powershell
4 pages
Scala Constructs
No ratings yet
Scala Constructs
1 page
Jyotsna Final
No ratings yet
Jyotsna Final
59 pages
R Basic and Data Mining Methods Basics
No ratings yet
R Basic and Data Mining Methods Basics
2 pages
Data Mining
No ratings yet
Data Mining
3 pages
Clustering - The Data Ensemble Q&A
No ratings yet
Clustering - The Data Ensemble Q&A
2 pages
DIgital Primer
No ratings yet
DIgital Primer
2 pages
Blockchain Intermedio
No ratings yet
Blockchain Intermedio
16 pages
APIGEE - Analytics Services
100% (1)
APIGEE - Analytics Services
1 page
Continuous Integration
No ratings yet
Continuous Integration
6 pages
Bundling With Webpack
No ratings yet
Bundling With Webpack
3 pages
Statistics and Probability Katabasis 2
No ratings yet
Statistics and Probability Katabasis 2
2 pages
Continuous Integration
No ratings yet
Continuous Integration
20 pages
NoSQL Database Revolution
No ratings yet
NoSQL Database Revolution
5 pages
Caching Techniques
No ratings yet
Caching Techniques
3 pages
Intuitive Visualization Basics
0% (1)
Intuitive Visualization Basics
2 pages
Infrastructure As Code
No ratings yet
Infrastructure As Code
6 pages
Digital For Industries
No ratings yet
Digital For Industries
2 pages
Data Handling Using R
No ratings yet
Data Handling Using R
2 pages
In Cryptographic Terms
No ratings yet
In Cryptographic Terms
3 pages
e2
No ratings yet
e2
5 pages
AWS Essentials FP
No ratings yet
AWS Essentials FP
2 pages
Deep Learning - Chorale Prelude
No ratings yet
Deep Learning - Chorale Prelude
2 pages
Deep Learning-Chorale Prelude
No ratings yet
Deep Learning-Chorale Prelude
1 page
Mobile App Security Quiz
100% (1)
Mobile App Security Quiz
2 pages
Automation Anywhere
No ratings yet
Automation Anywhere
3 pages
Azure Virtual Machines
0% (1)
Azure Virtual Machines
5 pages
Hybrid Apps Introduction Resp
No ratings yet
Hybrid Apps Introduction Resp
5 pages
AUTOMATION
No ratings yet
AUTOMATION
2 pages
Data Visualization New
No ratings yet
Data Visualization New
3 pages
DC - Os
No ratings yet
DC - Os
3 pages
An Enlightment To Machine Learning
No ratings yet
An Enlightment To Machine Learning
2 pages
Statistics and Probability Katabasis
0% (4)
Statistics and Probability Katabasis
1 page
Mobile Primer Course
No ratings yet
Mobile Primer Course
4 pages
Web Services Security 1
No ratings yet
Web Services Security 1
1 page
Microservice
No ratings yet
Microservice
2 pages
AngularJS 1.x Routers and Custom Directives Q&A
No ratings yet
AngularJS 1.x Routers and Custom Directives Q&A
4 pages
Docker Magneta
No ratings yet
Docker Magneta
25 pages
Automation Anywhere
No ratings yet
Automation Anywhere
2 pages
AngularJS 1.x Internals
No ratings yet
AngularJS 1.x Internals
3 pages
This Study Resource Was: - Are A Set of Rules That Determine The Execution of A Transaction
No ratings yet
This Study Resource Was: - Are A Set of Rules That Determine The Execution of A Transaction
8 pages
APIGEE Analytics Services
No ratings yet
APIGEE Analytics Services
3 pages
New Text Document
No ratings yet
New Text Document
10 pages
Story Telling
33% (3)
Story Telling
8 pages
Q Answer
No ratings yet
Q Answer
11 pages
Continuous Integration
0% (1)
Continuous Integration
2 pages
Must Know in D3js
100% (1)
Must Know in D3js
1 page
Image Classification Handson-Image - Test
No ratings yet
Image Classification Handson-Image - Test
5 pages
NoSQL - Database Revolution
No ratings yet
NoSQL - Database Revolution
10 pages
Robotics-Automatix - Art of RPA Q & A
No ratings yet
Robotics-Automatix - Art of RPA Q & A
3 pages
DevOps Culture Q&A
No ratings yet
DevOps Culture Q&A
2 pages
Automatix - Art of RPA
No ratings yet
Automatix - Art of RPA
6 pages
Elements of User Experience
No ratings yet
Elements of User Experience
3 pages
Cybersecurity Prologue
No ratings yet
Cybersecurity Prologue
6 pages
Bundling With Webpack
No ratings yet
Bundling With Webpack
2 pages
Association Rule Mining
100% (2)
Association Rule Mining
2 pages
Subjects You Need To Know:: Programming Languages of AI
0% (1)
Subjects You Need To Know:: Programming Languages of AI
7 pages
Java8 Innards
No ratings yet
Java8 Innards
4 pages
Automatix - Art of RPA Q&A
No ratings yet
Automatix - Art of RPA Q&A
1 page
Data Mining Methods Basics Q&A
No ratings yet
Data Mining Methods Basics Q&A
2 pages
Data Mining
No ratings yet
Data Mining
1 page
Data Visualization
No ratings yet
Data Visualization
1 page
Bootstrap
No ratings yet
Bootstrap
1 page
Cassandra and Data Handling
No ratings yet
Cassandra and Data Handling
7 pages
BlockChain PotentusNexus
No ratings yet
BlockChain PotentusNexus
2 pages
Performance Management & Apraisal
0% (1)
Performance Management & Apraisal
22 pages
Company Analysis: Determining Strategic Capability: David Hussey
No ratings yet
Company Analysis: Determining Strategic Capability: David Hussey
10 pages
Revising, Editing & Proofreading: OBJECTIVE: To Convert Rough Draft Possible. Your Own Work
No ratings yet
Revising, Editing & Proofreading: OBJECTIVE: To Convert Rough Draft Possible. Your Own Work
18 pages
Certification of Expenses Not Requiring Receipts: Philippine Statistics Authority
No ratings yet
Certification of Expenses Not Requiring Receipts: Philippine Statistics Authority
11 pages
Quality Volunteering at The British Red Cross
100% (1)
Quality Volunteering at The British Red Cross
112 pages
Geological Abstract. Kilungu
No ratings yet
Geological Abstract. Kilungu
1 page
The Psychology of Romantic Relationships
No ratings yet
The Psychology of Romantic Relationships
41 pages
Lab Report
No ratings yet
Lab Report
45 pages
Hingst GCBE 2006 Conf Paper
No ratings yet
Hingst GCBE 2006 Conf Paper
8 pages
Organisational Behaviour Notes
No ratings yet
Organisational Behaviour Notes
3 pages
IOPS 311: Study Division A: Study Unit 2 Motivation, Attitudes & Job Satisfaction
No ratings yet
IOPS 311: Study Division A: Study Unit 2 Motivation, Attitudes & Job Satisfaction
38 pages
editorshmic,+07+STEFANUS (1) (1)
No ratings yet
editorshmic,+07+STEFANUS (1) (1)
14 pages
Final Synopsis
No ratings yet
Final Synopsis
41 pages
GEC 2 Readings in Philippine History Module
No ratings yet
GEC 2 Readings in Philippine History Module
110 pages
Human Resource Management Review: Sciencedirect
No ratings yet
Human Resource Management Review: Sciencedirect
15 pages
Synopsis: A Study On Customer Satisfaction AT Nerolac Paints LTD., Kadapa
No ratings yet
Synopsis: A Study On Customer Satisfaction AT Nerolac Paints LTD., Kadapa
5 pages
Poverty and The Law
No ratings yet
Poverty and The Law
216 pages
Impact of Media in Globalization
0% (1)
Impact of Media in Globalization
6 pages
Pearson Glopotb Guide
No ratings yet
Pearson Glopotb Guide
4 pages
Acknowledgement: Survey Camp Report 2015
No ratings yet
Acknowledgement: Survey Camp Report 2015
5 pages
PRMGT Assignment Question Marking Scheme
No ratings yet
PRMGT Assignment Question Marking Scheme
7 pages
Dissertation B.com 6 th semester
No ratings yet
Dissertation B.com 6 th semester
20 pages
DRDO Entry Test: Defence Research & Development Organisation (DRDO)
No ratings yet
DRDO Entry Test: Defence Research & Development Organisation (DRDO)
2 pages
Multiple Choice Questions
0% (1)
Multiple Choice Questions
3 pages
Study On Auditor S Attitude in Using Information Technology For Auditing Theory of Planned Behavior and Social Cognitive Theory Modification
No ratings yet
Study On Auditor S Attitude in Using Information Technology For Auditing Theory of Planned Behavior and Social Cognitive Theory Modification
9 pages
Legal Internship Resume - Sample 1: Sachin XXXXXX
0% (1)
Legal Internship Resume - Sample 1: Sachin XXXXXX
4 pages
The Marshmallow Challenge
No ratings yet
The Marshmallow Challenge
10 pages
Mir Faiz Dissertation Report
No ratings yet
Mir Faiz Dissertation Report
58 pages
Thesis Using One Way Anova
100% (3)
Thesis Using One Way Anova
7 pages

Data Mining Methods Basics

Uploaded by

Data Mining Methods Basics

Uploaded by

Data mining methods basics

Which of the following activities is performed as part of data pre processing? --

Which of the following role is responsible for performing validation on analysis

What is the type of learning where a function is inferred to describe hidden

Which statistical technique deals with finding a structure in a collection of

If time is used as an independent variable in a simple linear regression analysis,

Which of the following are Multi-class Classification problem? -- Will Indian

Associate rule is known as _____________ -- affinity analysis

If time is used as an independent variable in a simple linear regression, which of

You might also like