Data Virtualization: How To Get Your Business Intelligence Answers Today
Data Virtualization: How To Get Your Business Intelligence Answers Today
Data Virtualization
How to get your Business Intelligence answers today
Forrester Research, Agile BI: Best Practices for Breaking through the BI Backlog, 2010.
or public cloud-based applications for those building decisionmaking applications or enhancing existing OLTP applications.
result, every part of the business can now access the up-todate status of the business as a whole, instantly.
DV in action
To understand the benefits of DV better, lets consider some
practical applications in more detail.
How DV works
The DV technology includes three
major components:
Integrated development environment (IDE): User
interface for development and access.
DV server environment: Kernel of DV tools.
Management environment: Monitoring, administration
and error handling.
Business scenario:
hospitality company
A hospitality company operates multiple casinos and luxury
hotels. The casinos have both table games and slot machines.
The hotels provide additional facilities such as spas, food and
beverage (F&B), and booking for theater shows.
Metadata
Repository
DB2
Connector
Oracle
Administration
Query
Engine
CACHE
File
IDMS
Database
Error Handling
DV Logical Layer
Security
Connector
Management
Environment
Source
System
Layer
BI
Applications
Downstream
Applications
Reports &
Analytics
Target
System
Layer
Performance
Optimization
Many users fear that a new DV layer may create performance
issues, because they feel that heavy query loads may choke up
source OLTP systems. DV vendors now address this issue with
an increased level of sophistication.
Most data virtualization tools persist or cache the source data in
traditional databases or flat files. If they encounter a slow query,
they can reconfigure the semantic layer as needed.
Many tool vendors are now enhancing their products to enable
database caching to take advantage of in-memory databases
you can actually use an in-memory/columnar database as
your database cache. DV solutions which embed an in-memory
database can even be configured to replicate the source
database in full, using mechanisms like Change Data Capture,
in order to optimize both synchronization workload and data
freshness. Once data is replicated in the in-memory cache, DV
queries no longer affect the performance of source systems.
This approach also provides optimized query execution,
particularly for decision support type queries which require full
data indexing in order to deliver speed-of-thought response
time on any query. As in-memory appliances get more and
more affordable, such an approach can provide an efficient
and non-intrusive way to deliver high-performance reporting
capability on a legacy system.
In addition, to prevent any unplanned performance impact on
the source system, you can implement a resource allocation
policy based on quotas. Most of todays databases allow you
to set a quota so that the resources allocated to processing
the queries generated by the DV layer are contained within
predefined limits, which makes the impact of DV predictable.
Checklist: could DV
help your organization?
Is there mounting pressure from the business for quick,
cost-effective reporting solutions?
Is there a burning need to build real time reporting and
analytics, and self-service BI?
Do regulatory constraints stop you from replicating data, so
that you need to access it within the source databases?
Do you need to overcome departmental politics to allow
your applications to share crucial information?
Do you need to use an enterprise data bus to exchange
data between applications?
Do you need to integrate external sources (social
Conclusion
Data Virtualization is the best way to provide an interim
solution for a single version of the truth with respect to multiple
databases, until the time when an EDW is built. DV is a highly
efficient way to make data from operational systems available
for real-time needs, and it can also pull in older data from BI
databases. That means BI tools can access all the companys
business information, both current and historical, in a unified
way.
Not only that: with DV it becomes possible for multiple BI
tools to access the data in the same way, using the same
business terminology and the same KPI aggregation rules. The
organization can ensure everyone uses the same data, yet can
still deploy a mixture of tools (whether reporting, dashboarding
or predictive analysis) to suit different needs, locations and
licensing arrangements.
The corollary is that there is no direct link, and so no technical
dependency, between the databases and the BI tools. That
makes it easier to upgrade, migrate, change or decommission
a database or BI tool.
To sum up, the main benefits of DV are:
A single version of the truth, even when multiple BI solutions
are used.
A huge time-to-market improvement: a virtual data
warehouse can be available faster than any other solution.
The ability to provide information in a controlled way, while
combining historical and real time data.
Mitigation of vendor and technology dependency by putting
a neutral layer between databases and the applications that
use them.
DV should be considered by any company that wants
to address BI needs fast. Even for organizations that are
also developing an EDW, DV can be used to supplement
these developments and is a convenient way to model the
requirements and confirm with the business that those
requirements are correctly defined. That approach can
significantly shorten delivery cycles for the EDW.
About Capgemini
With more than 120,000 people in 40 countries, Capgemini is one of the worlds
foremost providers of consulting, technology and outsourcing services. The Group
reported 2011 global revenues of EUR 9.7 billion.
Together with its clients, Capgemini creates and delivers business and technology
solutions that fit their needs and drive the results they want.
A deeply multicultural organization, Capgemini has developed its own way of working,
the Collaborative Business Experience, and draws on Rightshore, its worldwide
delivery model.
www.capgemini.com/BIM
The information contained in this document is proprietary. 2013 Capgemini. All rights reserved. Rightshore is a
trademark belonging to Capgemini.