Topic 1a - Introduction To Data Mining
Topic 1a - Introduction To Data Mining
INTRODUCTION
TO DATA MINING
OBJECTIVES
Pattern Recognition by Human Pattern Recognition by Computer Pattern Recognition from Data
perceptual (emotions, benefit of automated pattern learn or observe from large
feelings) recognition amounts of data
specialized – decision advantage in complex study the dependencies and
making calculations extract knowledge from data
WHAT IS DATA?
Data – the basic facts such as names, numbers or characters that come in different forms
(like text or image).
Alternative names
Knowledge discovery (mining) in databases (KDD), knowledge extraction,
data/pattern analysis, data archeology, data dredging, information harvesting
Source of data ?
“There were 5 exabytes of information created between the dawn of civilization through
2003, but that much information is now created every 2 days” – Eric Schmidt, Executive Chairman of Google
“Information is the oil of 21st century, and analytics is the combustion engine.” – Peter Sondergaard, Gartner Research
FROM DATA MINING TO BIG DATA MINING
Big data mining is referred to the Goal – to discover insights from the
collective data mining or extraction social media platforms (Instagram,
techniques that is performed on large Twitter, Facebook) with thousand of
volume of data or the big data. postings.
CONCLUSION
Finds relationship
(that exist within the dataset)
and
makes prediction
Photo-credit to:
https://www.bigstockphoto.com/image-12788702/stock-
vector-a-fortune-teller-holding-her-crystal-ball-vector
REFERENCES
1. Pang-Ning Tan, Michael Steinbach & Vipin Kumar, Introduction to Data Mining, Addison Wesley, 2019.
2. Jiawei Han and Micheline Kamber, Data Mining: Concepts and Techniques, 3rd Edition, Morgan Kaufmann, 2012.
3. Che D., Safran M., Peng Z. (2013) From Big Data to Big Data Mining: Challenges, Issues, and Opportunities. In: Hong
B., Meng X., Chen L., Winiwarter W., Song W. (eds) Database Systems for Advanced Applications. DASFAA 2013.
Lecture Notes in Computer Science, vol 7827. Springer, Berlin, Heidelberg.
https://doi.org/10.1007/978-3-642-40270-8_1
4. Razak, Z. I., & Mutalib, S. (2018). Web Mining In Classifying Youth Emotions. Malaysian Journal of Computing, 3(1), 1-
11.
5. Wah, Y. B., Abdullah, N., Abdul-Rahman, S., & Tan, M. L. P. (2018). text mining and sentiment analysis on reviews of
proton cars in malaysia. Malaysian Journal of Science, 37(2), 137-153.
THANK YOU
Shuzlina Abdul Rahman | Sofianita Mutalib | Siti Nur Kamaliah Kamarudin