0% found this document useful (0 votes)
15 views37 pages

Module 6_Data Visualization Tools

Uploaded by

sonia
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
Download as pptx, pdf, or txt
0% found this document useful (0 votes)
15 views37 pages

Module 6_Data Visualization Tools

Uploaded by

sonia
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1/ 37

Data Visualization

Data Visualization
• Data visualization is the study of representing data or information in a visual form.
With the advancement of digital technologies, the scope of multimedia has
increased manifold. Visuals in the form of graphs, images, diagrams, or animations
have completely proliferated the media industry and the Internet. It is an
established fact that the human mind can comprehend information more easily if it
is presented in the form of visuals. Instructional designers focus on abstract and
modelbased scientific visualizations to make the learning content more interesting
and easy to understand. News channels often integrate and present visuals related
to accidents, natural disasters, weather reports, and survey results to incite a
realistic imagination in the viewers’ mind.
Data Visualization
• Visualization is an excellent medium to analyze, comprehend, and share
information. Let’s see why:
•  Visual images help to transmit a huge amount of information to the human
brain at a glance.
•  Visual images help in establishing relationships and distinction between
different patterns or processes easily.
•  Visual interpretations help in exploring data from different angles, which help
gain insights.
•  Visualization helps in identifying problems and understanding trends and
outliers.
•  Visualizations point out key or interesting breakthroughs in a large dataset.
Techniques used for visual data
representation
▶ Data can be presented in various visual forms which include simple line diagrams, bar

graphs, tables, matrices etc.


▶ Some techniques used for a visual presentation of data are as follows:
▶ Isoline
▶ Iso surface
▶ Direct volume rendering
▶ Stream line
▶ Map
▶ Parallel coordinate plot
▶ Venn diagram
▶ Time line
▶ Euler diagram
▶ Hyperbolic trees
▶ Cluster diagram
▶ Ordinogram
Isoline
It is a 2-D data representation of a curved line that
moves constantly on the surface of a graph.
The plotting of an isoline is based on data
arrangement rather than data visualization.
Isolines are lines drawn to link different
places that share a common value. The
prefix 'iso' is a greek word meaning equal,
so an isoline must be a line joining equal
points.

For example, a line drawn on a map to join up


all the places that are the same height
above sea level

Isoline: It is a line on a map, chart or a


graph
connecting points of equal value.

Eg: A line drawn on a map to join all the


places that are the same height above the
sea level.

They are graphical tools used to denote


geographic lines of equal value. When we
study weather and climate, we will use
several kinds of isolines, such as isotherms
to show temperatures and isobars to show
atmospheric pressure.
Isosurface
An isosurface is a three-dimensional analog of an
isoline. It is a surface that represents points of a
constant value (e.g. pressure, temperature,
velocity, density) within a volume of space; in other
words, it is a level set of a continuous function
whose domain is 3D-space

They are normally displayed using computer


graphics and are used as data visualization
methods in computational fluid dynamics, allowing
engineers to study features of a fluid flow around
the objects, such as aircraft wings.

An isosurface may represent a shock wave in


supersonic flight. In medical imaging isosurfaces
may be used to represent regions of a particular
density in a 3-D CT scan, allowing the visualization
of internal organ, bones etc.
DVR
In scientific visualization and computer graphics,
volume rendering is a set of techniques used to
display a 2D projection of a 3D discretely sampled
data set, typically a 3D scalar field.
... Direct volume rendering is a computationally
intensive task that may be performed in several
ways.

Volume rendering is a method for rendering light as


it passes through media, within a 3D region.

A typical 3D dataset is a group of 2D images


acquired by a CT, MRI or micro CT scanner. The
opacity is defined using colours.

Eg: Volume rendered CT scan of a fore arm with


different color schemes for muscle, fat, bone and
blood.
Stream lines
A handy way of visualizing the
flow of fluid is through stream
lines.
A stream line shows the fluid
flow. It shows the direction of
flow.
It’s a 2D flow visualization used
in fluid mechanics and
aerodynamics etc.
Map
It is a visual representationof
locations within specific area.
It is depicted on a planar surface.
A map is a visual representation of an
entire area or a part of an area, typically
represented on a flat surface.
The work of a map is to illustrate specific
and detailed features of a particular
area, most frequently used to illustrate
geography.
There are many kinds of maps; static,
two- dimensional, three-dimensional,
dynamic and even interactive.
Maps attempt to represent various
things, like political boundaries, physical
features, roads, topography, population,
climates, natural resources and
economic activities.
Parallel coordinate plot
Used for plotting multivariate numerical
data.

It is a visualization technique of
representing
multidimensional data.

Ideal for comparing many variables


together and seeing the relationships
between them.

Eg: To compare an array of products


with the same attributes such as car
specifications across different models.
Time line
Timeline diagrams present events during
specific intervals shown chronologically
along a line.
These events may be historic, related to a
specific criminal case or business
development, or they may be important
milestones in a project.
Timelines are designed to provide a broad
overview of a sequence of events in time.
They don't go into detail, but links to events,
information and images may be added as
needed.
A timeline diagram consists of a horizontal
bar or line representing time progressing
from left to right.
This bar is marked with events or steps to
indicate when they should or did happen.
In project management, timelines are most
useful for showing important milestones and
deadlines
Hyperbolic trees
Often called a hyper tree is an
information visualization and graph
drawing method inspired by
hyperbola geometry.
This method can be used on
hierarchies.
They are often used to display web
information.
Eg: Visualization of Roget Thesaurus
using XML tool kit in a 2D hyperbolic
tree visualization
Venn diagram
A Venn diagram or set diagram is a diagram that shows
all possible logical relations between a finite
collection of sets.
...
They are used to teach elementary set theory, as
well as illustrate simple set relationships in
probability, logic, statistics, linguistics and computer
science.
Contained within each set is a collection of objects
or entities that all have something in common.
It enables one to organize information visually to see
the relationships between two or three sets of items.
Benefits and Purpose:
1. To visually organize information

2. To compare 2 or more choices

3. To solve complex mathematical problems

4. To compare data sets

5. To reason through the logic


Euler diagram
An Euler diagram is a diagrammatic
means of representing sets and their
relationships.
Typically they involve overlapping
shapes, and may be scaled, such that
the area of the shape is proportional
to the number of elements it
contains.
Euler diagram only shows
relationships that exist in real world.
Circles, Ovals, or other shapes can
be
used for Euler diagram
Cluster diagram

A cluster in general is a group or a bunch of


several discrete items that are close to each
other.

A network diagram can be seen as a special


orderly arranged kind of cluster diagram.

A cluster diagram is a mesh kind of network


diagram.

In computer science
more complex diagrams of computer
networks, computer architecture, file systems
and internet can be considered cluster
diagrams.

In information visualization specific visual


representation of large-scale collections of
non-numerical information are sometimes
drawn in the shape of a cluster diagram.
Types of Data Visualization
• 1D (Linear) Data
• 2D (Planar) Data Visualization
• 3D (Volumetric) Data Visualization
• Temporal Data Visualization
• Multidimensional Data Visualization
• Tree/Hierarchical Data Visualization
• Network Data Visualization
Types of Data Visualization
• 1D (Linear) Data Visualization―In the linear data visualization, data is presented in
the form of lists. Hence, we cannot term it as visualization. It is rather a data
organization technique. Therefore, no tool is required to visualize data in a linear
manner.
• 2D (Planar) Data Visualization―This technique presents data in the form of
images, diagrams, or charts on a plane surface. Cartogram and dot distribution
map are examples of 2D data visualization. Some tools used to create 2D data
visualization patterns are GeoCommons, Google Fusion Tables, Google Maps API,
Polymaps, Tableau Public, etc.
• 3D (Volumetric) Data Visualization―In this method, data presentation involves
exactly three dimensions to show simulations, surface and volume rendering, etc.
Generally, it is used in scientific studies. Today, many organizations use 3D
computer modeling and volume rendering in advertisements to provide users a
better feel of their products. To create 3D visualizations, we use some visualization
tools that involve AC3D, AutoQ3D, TrueSpace, etc.
Types of Data Visualization
• Temporal Data Visualization―Sometimes, visualizations are time dependent. To
visualize the dependence of analyses on time, the temporal data visualization is
used, which includes Gantt chart, time series, sanky diagram, etc. TimeFlow,
Timeline JS, Excel, Timeplot, TimeSearcher, Google Charts, Tableau Public, Google
Fusion Tables, etc. are some tools used to create temporal data visualization.
• Multidimensional Data Visualization―In this type of data visualization, numerous
dimensions are used to present data. We have pie charts, histograms, bar charts,
etc. to exemplify multidimensional data visualization. Many Eyes, Google Charts,
Tableau Public, etc. are some tools used to create multidimensional data
visualization.
• Tree/Hierarchical Data Visualization―Sometimes, data relationships need to be
shown in the form of hierarchies. To represent such kind of relationships, we use
tree or hierarchical data visualizations. Examples of tree/hierarchical data
visualization include hyperbolic tree, wedgestack graph, etc. Some tools to create
hierarchical data visualization are d3, Google Charts, and Network
Workbench/Sci2.
Types of Data Visualization
• Network Data Visualization―It is used to represent data relations that are too
complex to be represented in the form of hierarchies. Some examples of network
data visualization tools include matrix, node link diagram, hive plot, Pajek, Gephi,
NodeXL, VOSviewer, UCINET, GUESS, Network Workbench/Sci2, sigma.js,
d3/Protovis, Many Eyes, Google Fusion Tables, etc.
Applications of Data Visualization
• Education―Visualization is applied to teach a topic that requires simulation or
modeling of any object or process. Have you ever wondered how difficult it would
be to explain any organ or organ system without any visuals? Organ system or
structure of an atom is best described with the help of diagrams or animations.
• Information―Visualization is applied to transform abstract data into visual forms
for easy interpretation and further exploration.
• Production―Various applications are used to create 3D models of products for
better viewing and manipulation. Real estate, communication, and automobile
industry extensively use 3D advertisements to provide a better look and feel to
their products.
• Science―Every field of science, including fluid dynamics, astrophysics, and
medicine use visual representation of information. Isosurfaces and direct volume
rendering are typically used to explain scientific concepts.
Applications of Data Visualization
• Systems visualization―Systems visualization is a relatively new concept that
integrates visual techniques to better describe complex systems.
• Visual communication―Multimedia and entertainment industry use visuals to
communicate their ideas and information.
• Visual analytics―It refers to the science of analytical reasoning supported by the
interactive visual interface. The data generated by social media interaction is
interpreted using visual analytics techniques.
Tools Used in Data Visualization
• 1. Tableau
• 2. Dundas BI
• 3. JupyteR
• 4. Zoho Reports
• 5. Google Charts
• 6. Visual.ly
• 7. RAW
• 8. IBM Watson
• 9. Sisense
• 10. Plotly and many more…..
Tableau
• Tableau is a highly popular tool for visualizing data for two main reasons: it's easy to
use and very powerful. You can connect it to lots of data sources and create all sorts
of charts and maps. Salesforce owns Tableau, and it's widely used by many people
and big companies.

• Tableau has different versions like desktop, server, and web-based options, plus
some customer relationship management (CRM) software.

• Providing integration for advanced databases, including Teradata, SAP, My SQL,


Amazon AWS, and Hadoop, Tableau efficiently creates visualizations and graphics
from large, constantly-evolving datasets used for artificial intelligence, machine
learning, and Big Data applications.
Tableau
• The Pros of Tableau:

• Excellent visualization capabilities


• Easy to use
• Top class performance
• Supports connectivity with diverse data sources
• Mobile Responsive
• Has an informative community

• The Cons of Tableau:

• The pricing is a bit on the higher side


• Auto-refresh and report scheduling options are not available
Dundas BI
Dundas BI offers highly-customizable data visualizations with interactive scorecards,
maps, gauges, and charts, optimizing the creation of ad-hoc, multi-page reports. By
providing users full control over visual elements, Dundas BI simplifies the complex
operation of cleansing, inspecting, transforming, and modeling big datasets.
The Pros of Dundas BI:
•Exceptional flexibility
•A large variety of data sources and charts
•Wide range of in-built features for extracting, displaying, and modifying data
The Cons of Dundas BI:
•No option for predictive analytics
•3D charts not supported
JupyteR
A web-based application, JupyteR, is one of the top-rated data visualization tools that
enable users to create and share documents containing visualizations, equations,
narrative text, and live code. JupyteR is ideal for data cleansing and transformation,
statistical modeling, numerical simulation, interactive computing, and machine
learning.
The Pros of JupyteR:
•Rapid prototyping
•Visually appealing results
•Facilitates easy sharing of data insights
The Cons of JupyteR:
•Tough to collaborate
•At times code reviewing becomes complicated
Zoho Reports
Zoho Reports, also known as Zoho Analytics, is a comprehensive data visualization tool
that integrates Business Intelligence and online reporting services, which allow quick
creation and sharing of extensive reports in minutes. The high-grade visualization tool
also supports the import of Big Data from major databases and applications.
The Pros of Zoho Reports:
•Effortless report creation and modification
•Includes useful functionalities such as email scheduling and report sharing
•Plenty of room for data
•Prompt customer support.
The Cons of Zoho Reports:
•User training needs to be improved
•The dashboard becomes confusing when there are large volumes of data
Google Charts
One of the major players in the data visualization market space, Google Charts, coded
with SVG and HTML5, is famed for its capability to produce graphical and pictorial data
visualizations. Google Charts offers zoom functionality, and it provides users with
unmatched cross-platform compatibility with iOS, Android, and even the earlier
versions of the Internet Explorer browser.
The Pros of Google Charts:
User-friendly platform
Easy to integrate data
Visually attractive data graphs
Compatibility with Google products.
The Cons of Google Charts:
The export feature needs fine-tuning
Inadequate demos on tools
Lacks customization abilities
Network connectivity required for visualization
Visual.ly
Visual.ly is one of the data visualization tools on the market, renowned for its
impressive distribution network that illustrates project outcomes. Employing a
dedicated creative team for data visualization services, Visual.ly streamlines the
process of data import and outsource, even to third parties.
The Pros of Visual.ly:
Top-class output quality
Easy to produce superb graphics
Several link opportunities
The Cons of Visual.ly:
Few embedding options
Showcases one point, not multiple points
Limited scope
RAW
RAW, better-known as RawGraphs, works with delimited data such as TSV file or CSV
file. It serves as a link between data visualization and spreadsheets. Featuring a range
of non-conventional and conventional layouts, RawGraphs provides robust data
security even though it is a web-based application.
The Pros of RAW:
Simple interface
Super-fast visual feedback
Offers a high-level platform for arranging, keeping, and reading user data
Easy-to-use mapping feature
Superb readability for visual graphics
Excellent scalability option
The Cons of RAW:
Non-availability of log scales
Not user intuitive
IBM Watson
Named after IBM founder Thomas J. Watson, this high-caliber data visualization tool
uses analytical components and artificial intelligence to detect insights and patterns
from both unstructured and structured data. Leveraging NLP (
Natural Language Processing), IBM Watson's intelligent, self-service visualization tool
guides users through the entire insight discovery operation.
The Pros of IBM Watson:
NLP capabilities
Offers accessibility from multiple devices
Predictive analytics
Self-service dashboards
The Cons of IBM Watson:
Customer support needs improvement
High-cost maintenance
Sisense
Regarded as one of the most agile data visualization tools, Sisense gives users access to
instant data analytics anywhere, at any time. The best-in-class visualization tool can
identify key data patterns and summarize statistics to help decision-makers make data-
driven decisions.
The Pros of Sisense:
Ideal for mission-critical projects involving massive datasets
Reliable interface
High-class customer support
Quick upgrades
Flexibility of seamless customization
The Cons of Sisense:
Developing and maintaining analytic cubes can be challenging
Does not support time formats
Limited visualization versions
Plotly
An open-source data visualization tool, Plotly offers full integration with analytics-
centric programming languages like Matlab, Python, and R, which enables complex
visualizations. Widely used for collaborative work, disseminating, modifying, creating,
and sharing interactive, graphical data, Plotly supports both on-premise installation and
cloud deployment.
The Pros of Plotly:
Allows online editing of charts
High-quality image export
Highly interactive interface
Server hosting facilitates easy sharing
The Cons of Plotly:
Speed is a concern at times
Free version has multiple limitations
Various screen-flashings create confusion and distraction
THANK YOU

15

You might also like