Unit 1 Data Mining

The document discusses various types of data in data mining, including multimedia, web, text, spatiotemporal, and stream data, emphasizing their unique methodologies. It outlines different data mining techniques such as association rule learning, clustering, classification, anomaly detection, regression, sequential pattern mining, time series analysis, text mining, and graph mining. Data mining is highlighted as a crucial process for extracting valuable insights across various fields like business and healthcare.

Uploaded by

Bodhan Laxman
Copyright
© All Rights Reserved

Different Types of Data in Data Mining

Introduction:
In general terms, “mining” is the process of extraction. In
computer science, data mining is also referred to as
knowledge mining from data, knowledge extraction,
data/pattern analysis, data archaeology, and data dredging.
Besides structured data, there are semi-structured and
unstructured kinds of data, such as spatial data, multimedia
data, text data, and web data, each of which requires its
own methodology for data mining.
Data mining is the process of extracting valuable information
and insights from large datasets. It involves using various
techniques, such as statistical analysis, machine learning,
and database management, to discover patterns and
relationships in data that can be used to make predictions or
inform decisions.
Data mining can be applied in a wide range of fields,
including business, finance, healthcare, marketing, and more.
For example, in business, data mining can be used to analyze
customer data to identify trends and patterns that can inform
marketing strategies and improve sales. In healthcare, data
mining can be used to identify patterns in patient data that
can inform treatment decisions and improve patient
outcomes.
Data mining can also be used to extract insights from
unstructured data, such as text and images, using techniques
such as natural language processing and computer vision.
It is also important to note that data mining is a subset of
data science, and it is closely related to other fields such as
machine learning and artificial intelligence.
• Mining Multimedia Data: Multimedia data objects
include image data, video data, audio data, website
hyperlinks, and linkages. Multimedia data mining tries to
find interesting patterns in multimedia databases. It
involves processing the digital data and performing tasks
such as image processing, image classification, video and
audio data mining, and pattern recognition. Multimedia
data mining is an increasingly active research area
because content from social media platforms such as
Twitter and Facebook can be analyzed this way to derive
interesting trends and patterns.
• Mining Web Data: Web mining discovers useful patterns
and knowledge from the Web. Web content mining analyzes
the data of websites, including the web pages and the
multimedia data, such as images, embedded in them. Web
mining is used to understand the content of web pages,
the unique users of a website, unique hypertext links, web
page relevance and ranking, web page content summaries,
the time users spend on a particular website, and user
search patterns. Web mining can also be used to compare
search engines and to study how they rank results, which
helps improve search efficiency and helps users find the
search engine that suits them best.
• Mining Text Data: Text mining draws on data mining,
machine learning, natural language processing (NLP), and
statistics. Much of the information in our daily life is
stored as text, such as news articles, technical papers,
books, email messages, and blogs. Text mining helps us
retrieve high-quality information from text through tasks
such as sentiment analysis, document summarization, text
categorization, and text clustering. We apply machine
learning models and NLP techniques to derive useful
information from the text by finding hidden patterns and
trends, for example through statistical pattern learning
and statistical language modeling. Before mining, the text
is preprocessed: techniques such as stemming and
lemmatization normalize the words, and the normalized
text is then converted into numeric data vectors.
• Mining Spatiotemporal Data: Data that relates to both
space and time is spatiotemporal data. Spatiotemporal
data mining retrieves interesting patterns and knowledge
from such data. It helps us estimate the value of land,
determine the age of rocks and precious stones, and
predict weather patterns. It has many practical
applications involving GPS in mobile phones, timers,
Internet-based map services, weather services,
satellites, RFID, and sensors.
• Mining Data Streams: Stream data changes dynamically,
is often noisy and inconsistent, and contains
multidimensional features of different data types. For
this reason it is often stored in NoSQL database systems.
The volume of stream data is very high, which is the main
challenge for mining it effectively. Mining data streams
involves tasks such as clustering, outlier analysis, and
the online detection of rare events.
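The online detection of rare events in a data stream can be illustrated with a minimal sketch. It is an illustration under stated assumptions, not a method prescribed by this unit: it assumes a single numeric stream, keeps a running mean and variance with Welford's one-pass update, and flags a reading that lies more than a chosen number of standard deviations from the running mean. The class name StreamingOutlierDetector, the threshold and warmup parameters, and the sample readings are all hypothetical.

```python
import math


class StreamingOutlierDetector:
    """Flags readings far from the running mean (hypothetical helper)."""

    def __init__(self, threshold=3.0, warmup=5):
        self.threshold = threshold  # assumed cut-off, in standard deviations
        self.warmup = warmup        # readings to observe before flagging
        self.count = 0
        self.mean = 0.0
        self.m2 = 0.0               # running sum of squared deviations

    def update(self, value):
        """Return True if value looks like a rare event, then absorb it."""
        is_outlier = False
        if self.count >= self.warmup:
            std = math.sqrt(self.m2 / (self.count - 1))
            if std > 0 and abs(value - self.mean) > self.threshold * std:
                is_outlier = True
        # Welford's one-pass update: no past readings are stored.
        self.count += 1
        delta = value - self.mean
        self.mean += delta / self.count
        self.m2 += delta * (value - self.mean)
        return is_outlier


readings = [10.0, 10.2, 9.9, 10.1, 10.0, 9.8, 25.0, 10.1]  # made-up data
detector = StreamingOutlierDetector()
flags = [detector.update(r) for r in readings]
print(flags)
```

Because each reading is processed once and then discarded, memory use stays constant however large the stream grows. In this simplified sketch a flagged reading is still absorbed into the running statistics; a practical detector might exclude it or use a sliding window instead.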
There are several different types of data mining,
including:
1. Association Rule Learning: This type of data mining
identifies patterns of association between items in large
datasets. A typical example is market basket analysis,
which identifies the items that are frequently bought
together.
Three types of association rules are:
I. Multilevel Association Rule
II. Quantitative Association Rule
III. Multidimensional Association Rule
2. Clustering: This type of data mining involves grouping
similar data points together into clusters based on certain
characteristics or features. Clustering is used to identify
patterns in data and to discover hidden structures or
groups in data.
Different types of clustering methods are:
I. Density-Based Methods
II. Model-Based Methods
III. Partitioning Methods
IV. Hierarchical Methods (agglomerative or divisive)
V. Grid-Based Methods
3. Classification: This type of data mining involves using a
set of labeled data to train a model that can then be used
to classify new, unlabeled data into predefined categories
or classes.
4. Anomaly detection: This type of data mining identifies
data points that deviate significantly from the norm, for
example to detect fraud or to find outliers in a
dataset.
5. Regression: This type of data mining is used to model
and predict numerical values, such as stock prices or
weather patterns.
6. Sequential pattern mining: This type of data mining is
used to identify patterns in data that occur in a specific
order, such as identifying patterns in customer buying
behavior.
7. Time series analysis: This type of data mining is used to
analyze data that is collected over time, such as stock
prices or weather patterns, to identify trends or patterns
that change over time.
8. Text mining: This type of data mining is used to extract
meaningful information from unstructured text data, such
as customer feedback or social media posts.
9. Graph mining: This type of data mining is used to extract
insights from graph-structured data, such as social
networks or the internet.
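As an illustration of the first technique in this list, association rule learning, the following minimal sketch computes support and confidence for rules between pairs of items, in the style of market basket analysis. It is a simplified illustration rather than a full algorithm such as Apriori: it only considers rules of the form "one item implies another". The baskets, the function name pair_rules, and the min_support and min_confidence thresholds are all hypothetical.

```python
from collections import Counter
from itertools import combinations

# Made-up transactions; each set is one customer's basket.
baskets = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"bread", "butter"},
    {"milk", "eggs"},
    {"bread", "milk", "eggs"},
]


def pair_rules(baskets, min_support=0.4, min_confidence=0.7):
    """Return (antecedent, consequent, support, confidence) for item pairs."""
    n = len(baskets)
    item_counts = Counter(item for b in baskets for item in b)
    pair_counts = Counter(
        pair for b in baskets for pair in combinations(sorted(b), 2)
    )
    rules = []
    for (a, b), count in pair_counts.items():
        support = count / n  # share of baskets containing both items
        if support < min_support:
            continue
        for x, y in ((a, b), (b, a)):
            # Confidence of x -> y: share of baskets with x that also have y.
            confidence = count / item_counts[x]
            if confidence >= min_confidence:
                rules.append((x, y, support, confidence))
    return sorted(rules)


rules = pair_rules(baskets)
for x, y, s, c in rules:
    print(f"{x} -> {y}: support={s:.2f}, confidence={c:.2f}")
```

Support measures how often the two items appear together across all baskets, while confidence measures how often the second item appears given the first. Both thresholds are arbitrary here and would be tuned to the dataset in practice.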
