Data Visualization Tech.
Data Visualization Tech.
Why visualization?
Without the concept of visualization, mining and analysis doesn’t play any role of
importance as data mining is the idea of finding inferences by analyzing
the data through patterns and those patterns can only be represented by
different visualization techniques.
Techniques:
Box plots
Histograms
Heat maps
Charts
Tree maps
Box plots:
A method for graphically depicting groups of numerical data through their quartiles. It
is a standardized way of displaying the distribution of data based on five features:
Also the lower and upper quartiles are shown as horizontal lines of the rectangle. It
has vertical line inside to indicate the median value.
Histograms:
Bins are consecutive, non- overlapping intervals of a variable. As the adjacent bins
leave no gaps, the rectangles of histogram touch each other to indicate that the
original value is continuous.
Heat maps:
Note that heat maps are useful when examining a large number of
values, but they are not a replacement for more precise graphical display, such as
bar charts, because color differences cannot be perceived accurately.
Charts:
It gives a graphical representation of data has various types:
Bar charts-
It represents the categorical data with rectangular bars of heights and lengths
proportional to the values they they represent.
Line charts-
Pie charts-
Scatter charts-
It is a type of plot or mathematical diagram that display value for typically two
variables for a set of data using Cartesian coordinates.
Bubble charts-
It is a variation of scatter chart in which the data points are replaced with
bubbles, and an additional dimension of data is represented in the size
of the bubbles.
A leaf node rectangle has an area proportional to specified dimension of the data.
Depending on choice leaf node is colored, sized or both according to chosen
attributes. They make efficient use of space, thus display thousands of items on the
screen simultaneously.
These are some of the ways by which we can represent our wide data for it’s better
understanding and interpretation.