Data Mining Questions
Lecture - 1
What is classification? How is classification done?
Classification is a data mining function that assigns items in a collection to target
categories or classes. The goal of classification is to accurately predict the target class
for each case in the data. For example, a classification model could be used to identify
loan applicants as low, medium, or high credit risks.
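The loan example can be sketched in code. This is only a minimal illustration using a 1-nearest-neighbor rule, which is one of many possible classification techniques and is not prescribed by these notes. The training data, the two features (income scaled to 0..1 and debt-to-income ratio), and the function name classify are all assumptions made up for the example.

```python
import math

# Hypothetical training cases: (scaled income, debt-to-income ratio) -> risk class.
# All values are invented for illustration only.
TRAINING = [
    ((0.90, 0.10), "low"),
    ((0.85, 0.15), "low"),
    ((0.55, 0.35), "medium"),
    ((0.50, 0.40), "medium"),
    ((0.25, 0.70), "high"),
    ((0.20, 0.80), "high"),
]


def classify(applicant, training=TRAINING):
    """Predict the class of the closest training case (1-nearest neighbor)."""
    _, label = min(
        training,
        key=lambda example: math.dist(applicant, example[0]),  # Euclidean distance
    )
    return label


print(classify((0.88, 0.12)))  # prints "low"
```

The model here is simply the stored training cases; a new applicant is assigned the target class of the most similar known case.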
Applications of classification
Direct Marketing - Reduce the cost of mailing by targeting a set of customers likely to
buy a new cell-phone product.
Fraud Detection - Predict fraudulent cases in credit card transactions.
Customer Attrition - Predict whether a customer is likely to be lost to a competitor.
Sky Survey Cataloging - Predict the class (star or galaxy) of sky objects,
especially visually faint ones, based on telescopic survey images.
5. What is clustering?
Given a set of data points, each having a set of attributes, and a similarity
measure among them, find clusters such that:
Data points in one cluster are more similar to one another.
Data points in separate clusters are less similar to one another.
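One common way to find such clusters is k-means, sketched below. The notes do not name a specific algorithm, so k-means, the use of Euclidean distance as the (dis)similarity measure, the sample points, and the starting centroids are all assumptions chosen for illustration.

```python
import math


def kmeans(points, centroids, iterations=10):
    """Basic k-means: alternate an assignment step and a centroid update step."""
    clusters = []
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            idx = min(range(len(centroids)),
                      key=lambda i: math.dist(p, centroids[i]))
            clusters[idx].append(p)
        # Update step: move each centroid to the mean of its cluster.
        # An empty cluster keeps its previous centroid.
        centroids = [
            tuple(sum(coord) / len(cluster) for coord in zip(*cluster))
            if cluster else old
            for cluster, old in zip(clusters, centroids)
        ]
    return clusters, centroids


# Hypothetical 2-D data points forming two visibly separate groups.
POINTS = [(1.0, 1.0), (1.5, 2.0), (1.0, 1.5),
          (8.0, 8.0), (9.0, 9.0), (8.5, 9.5)]

clusters, centroids = kmeans(POINTS, centroids=[(0.0, 0.0), (10.0, 10.0)])
print(clusters)
```

With these inputs the first three points end up in one cluster and the last three in the other, which matches the two conditions above: points within a cluster are close together, and points in different clusters are far apart.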
Applications of clustering
Market Segmentation - Subdivide a market into distinct subsets of customers, where
any subset may conceivably be selected as a market target to be reached with a
distinct marketing mix.
Find clusters of similar customers.
9. Regression
10. Deviation/anomaly detection
11. Challenges of data mining
Traditional data analysis methods often deal with data sets containing attributes of
the same type, either continuous or categorical. As the role of data mining in
business, science, medicine, and other fields has grown, so has the need for
techniques that can handle heterogeneous attributes.
Lecture - 2
1. How does data quality fall?
Examples of data quality problems:
missing values
duplicate data
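Both problems can be detected with a simple scan over the records. The sketch below assumes each record is a dictionary, that a missing value is stored as None, and that a duplicate means an exact repeat of an earlier record; the function name quality_report and the sample records are hypothetical.

```python
def quality_report(records):
    """Count records with missing values and exact duplicate records."""
    # A record counts as "missing" if any of its values is None.
    missing = sum(1 for r in records if any(v is None for v in r.values()))

    seen = set()
    duplicates = 0
    for r in records:
        # Sort the items so that key order does not affect the comparison.
        key = tuple(sorted(r.items()))
        if key in seen:
            duplicates += 1
        else:
            seen.add(key)
    return {"missing": missing, "duplicates": duplicates}


# Hypothetical customer records, invented for illustration.
RECORDS = [
    {"id": 1, "age": 34, "city": "Pune"},
    {"id": 2, "age": None, "city": "Delhi"},   # missing value
    {"id": 1, "age": 34, "city": "Pune"},      # duplicate of the first record
]

print(quality_report(RECORDS))  # prints {'missing': 1, 'duplicates': 1}
```

Real data often contains near-duplicates (for example, the same person with two spellings of a name), which this exact-match check would not catch.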
5. Feature creation
Create new attributes that can capture the important information in a data
set much more efficiently than the original attributes
Three general methodologies:
Feature Extraction
domain-specific
Feature Construction
combining features
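As a small illustration of feature construction by combining features, the sketch below derives a density attribute from mass and volume. The attribute names, the sample objects, and the function add_density are assumptions made for the example, and the code assumes every volume is nonzero.

```python
def add_density(records):
    """Return copies of the records with a constructed 'density' feature."""
    enriched = []
    for r in records:
        new_r = dict(r)  # copy so the original record is left unchanged
        # Combine two original attributes into one new attribute.
        new_r["density"] = r["mass"] / r["volume"]
        enriched.append(new_r)
    return enriched


# Hypothetical objects, invented for illustration.
OBJECTS = [
    {"mass": 10.0, "volume": 2.0},
    {"mass": 9.0, "volume": 3.0},
]

for row in add_density(OBJECTS):
    print(row)
```

If the material an object is made of is what matters, the single density attribute can carry that information more directly than mass and volume do separately.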