0% found this document useful (0 votes)
3 views4 pages

2.structure and Unstructured Data Disruptive System

Uploaded by

manikdas031973
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
Download as pdf or txt
0% found this document useful (0 votes)
3 views4 pages

2.structure and Unstructured Data Disruptive System

Uploaded by

manikdas031973
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
Download as pdf or txt
Download as pdf or txt
You are on page 1/ 4

Mr.

Arindam Ghosh

DATA ANALYTICS
Disruptive Technology Innovation
Structured Data

Definition: Structured data refers to highly organized information that adheres to a predefined
schema or model. This type of data is typically stored in fixed fields within records or files, making it
easily accessible and manageable.

Characteristics:

1. Organization:

o Structured data is arranged in rows and columns, much like a spreadsheet or database
table. Each row represents a record, while each column corresponds to a specific
attribute or field of that record.

2. Schema:

o A predefined schema dictates how data is stored, including data types (e.g., integer,
string, date) and relationships between tables. This schema ensures consistency and
integrity of the data.

3. Data Types:

o The data is often numeric, categorical, or date-based, allowing for straightforward


analysis. For example, an employee database might include fields such as Employee
ID, Name, Hire Date, and Salary.

4. Querying:

o Structured data can be easily queried using Structured Query Language (SQL),
allowing users to perform complex searches and data manipulation. This makes it
ideal for reporting and analytics.

Examples:

• Databases: Relational databases like MySQL, PostgreSQL, and Microsoft SQL Server.

• Spreadsheets: Applications like Microsoft Excel or Google Sheets, where data is organized
in tabular format.

• CRM Systems: Customer relationship management software that stores structured customer
data.

Uses:

• Business Intelligence: Facilitates reporting, dashboards, and data visualization.

• Data Analysis: Supports statistical analysis and forecasting.

• Operational Reporting: Essential for daily operations, monitoring performance, and


decision-making.

Data Analytics || Disruptive Technology Innovation 1


Unstructured Data

Definition: Unstructured data refers to information that lacks a clear format or structure, making it
challenging to collect, process, and analyse. This data does not conform to a predefined schema and
often requires advanced techniques for extraction of insights.

Characteristics:

1. Lack of Organization:

o Unstructured data can exist in various forms and does not fit neatly into tables. It may
include free text, multimedia files, or other formats that require interpretation.

2. Variety of Formats:

o This type of data encompasses a wide range of formats, including:

▪ Text documents (e.g., articles, reports)

▪ Social media content (e.g., tweets, posts)

▪ Images and videos

▪ Emails and chat messages

▪ Sensor data from IoT devices

3. Difficult to Analyse:

o Extracting meaningful insights from unstructured data often requires advanced


technologies like natural language processing (NLP), image recognition, and machine
learning. The variability in format and context adds complexity to analysis.

Examples:

• Social media: Posts, comments, and multimedia content on platforms like Facebook,
Twitter, and Instagram.

• Emails: The content and attachments in emails can provide valuable information but lack a
structured format.

• Multimedia Files: Videos, podcasts, and images that may contain rich content but require
specialized tools for analysis.

• Web Content: Blogs, articles, and forums that offer insights but do not follow a consistent
structure.

Uses:

• Sentiment Analysis: Understanding public opinion or customer sentiment through analysis


of social media or review data.

Data Analytics || Disruptive Technology Innovation 2


• Content Discovery: Searching for relevant information in large volumes of text data.

• Machine Learning: Training algorithms on diverse datasets that include text, images, and
audio for predictive analytics and pattern recognition.

Data Analytics || Disruptive Technology Innovation 3

You might also like