Data Mining Methods Basics
Data Mining Methods Basics
The process of extracting valid, useful, unknown info from data and using it to
make proactive knowledge driven business is called --data mining
Which of the following is not applicable to Data Mining?-- Involves working with
known information
What is the other name for Data Preparation stage of Knowledge Discovery Process?
-- ETL
Which of the following modelling type should be used for Labelled data? --
Predictive Modelling
Noisy values are the values that are valid for the dataset, but are incorrectly
recorded - true
Probability of theft in an area is 0.03 with expected loss of 20% or 30% of things
with probabilities 0.55 and 0.45. Insurance policy from A costs $150 pa with 100%
repayment. Policy with B, costs $100 pa and first $500 of any loss has to be paid
by the owner. Which data mining technique can be used to choose the policy? ==
decision tree
Statistical technique used for investigating and modelling the relationship between
two or more variables is: -- Regression analysis
---------
Simulations are carried out to develop a mathematical model of the process -- false
_________ are the values that mark the boundaries of the confidence interval. --
Confidence limits
Which of the following activities are performed as part of data pre processing? --
detecting outliers Data Cleansing * all options, Data Cleansing is wrong
Which data mining method groups together objects that are similar to each other and
dissimilar to the other objects? -- clustering
Regression is typically carried out to develop a mathematical model of the process
-- true
Which is the statistical technique used for investigating and modelling the
relationship between two or more variables? -- regression
Machine learning task of inferring a function from labelled training data is known
as - supervised