Microsoft Modern Data Estate
Microsoft Modern Data Estate
Hybrid
Reason over any data, anywhere Flexibility of choice Security and performance
Industry leader 2 years in a row Operational databases Operational databases 70% faster than Aurora
#1 TPC-H performance Data warehouses Data warehouses 2x global reach than Redshift
T-SQL query over any data Data lakes Data lakes No Limits Analytics with 99.9% SLA
Reason over any data, anywhere Flexibility of choice Security and performance
LOB
Image
Data orchestration Big data Hadoop/Spark and Data warehouse
and monitoring store machine learning
Social
Apps + insights
IoT
Devices
Batch queries
Interactive queries
Real-time analytics
Machine Learning
Data warehouse
Skype
• 10K+ Developers running diverse workloads and
Exchange
scenarios Windows
Malware Protection Microsoft Stores
Commerce Risk
Business
apps Cosmos DB
Data Lake SQL DB
Web & mobile apps
Store Data Lake Analytics
Data Factory
(Data Movement)
SQL Data
Blob Azure Databricks Warehouse
(Spark) Operational reports
Storage
Custom
apps
Azure ExpressRoute Azure Data Factory Azure Key Vault Operations Management Suite
Private Connections Orchestration Key Management Monitoring
Azure HDInsight
ANALYTICS
BIG DATA
Azure Marketplace
HDP | CDH | MapR
Any Hadoop technology, Workload optimized, Frictionless & Optimized Data Engineering in a
any distribution managed clusters Spark clusters Job-as-a-service model
BIG DATA
STORAGE
Azure Storage
Teradata
Intellicloud
Apache Predictive apps
Kafka with Spark/Hive/Pig
Open 10
HDFS on Cassandra on
Custom 01
Classified as[Link]
Microsoft Confidential
Azure Data Factory: HYBRID DATA INTEGRATION AT SCALE
Data Processing & Movement CLOUD
Relational data Any BI tool
Dashboards | Reporting
Mobile BI | Cubes
Advanced
V-NET
Analytics
Machine Learning
Non-relational data Stream analytics Cognitive | AI
Any language
Web Media Social media Devices ON-PREMISE
.NET | Java | R | Python
Ruby | PHP | Scala
AZURE DATA FACTORY ORCHESTRATES DATA PIPELINE ACTIVITY WORKFLOW & SCHEDULING
Managed Elastic
Singleton
Instance Pool
Table API
MongoDB
SQL
Turnkey global
Comprehensive
distribution
SLAs
Tools
Azure Infrastructure
Local machine
Scale up to DSVM
ML Server
Spark clusters
SQL Server
Interactive workspace that enables collaboration between data scientists, data engineers, and business analysts.
Native integration with Azure ser vices (Power BI, SQL DW, Cosmos DB, Blob Storage)
Enterprise-grade Azure security (Active Director y integration, compliance, enterprise -grade SL As)
Azure
Azure
Saas
Azure
Office 365
Public
Cloud
Other SSMS
data sources
“We want to incorporate all “We are trying to predict “We are trying to get insights
of our data including ‘big when our customers churn.” from our devices in real-time,
data” with our data etc.”
warehouse”
AZURE CLI
CUSTOM APPS
DATA FACTORY
LOGS, FILES AND MEDIA
AZURE DATA LAKE STORE
(UNSTRUCTURED)
ANALYTICAL DASHBOARDS
AZURE SQL DATA WAREHOUSE AZURE ANALYSIS SERVICES
BUSINESS / CUSTOM DATA FACTORY
APPS
(STRUCTURED)
POLYBASE
ANALYTICAL DASHBOARDS
BUSINESS / CUSTOM DATA FACTORY
APPS
(STRUCTURED)
POLYBASE
ANALYTICAL DASHBOARDS
BUSINESS / CUSTOM DATA FACTORY
APPS
(STRUCTURED)
POLYBASE
ANALYTICAL DASHBOARDS
BUSINESS / CUSTOM DATA FACTORY
APPS
(STRUCTURED)
AZURE HDINSIGHT
AZURE DATA LAKE STORE
(Kafka)
AZURE HDINSIGHT
AZURE DATA LAKE STORE
(Kafka)
DATA FACTORY
(Data movement)
CUSTOM APPS
EVENT HUBS
SENSORS AND DEVICES
KAFKA ON HDINSIGHT
STREAM ANALYTICS ANALYTICAL DASHBOARDS