Lab - Qlik Replicate Azure Databricks
Lab - Qlik Replicate Azure Databricks
Databricks as a Target
TABLE OF CONTENTS
Overview ..................................................................................................................................................... 3
Introduction ............................................................................................................................................... 3
Managing Endpoint Connections .............................................................................................................. 4
Creating Source Endpoint Connections 4
Creating Target Endpoint Connections 6
Introduction
A Replicate Task is required to manage Source data change-data/capture from a Source System, into
the target database or file storage systems. A Replicate task can only manage one source and one
target system, but within a project, multiple Replicate Tasks can be used – one for each Source
System.
In this Lab, you will create a Qlik Replicate task with corresponding Source and Target Endpoints.
This task will be used to extract data from SAP ERP System, by way of SAP Extractors, and store in
Databricks in an Azure environment.
Qlik Replicate delivers a number of Endpoints types for various Source and Target Systems.
Steps
1. In the Replication Console, Select Manage Endpoint Connections.
3. Enter a meaningful Endpoint Name and Description for the Endpoint Connector.
4. With Source button selected, select dropdown arrow to select appropriate Source Type.
Name:
Description:
Role: Source
Instance Identifier: 0
Username:
Password:
Number Format:
Look for the “Test Connection succeeded” message. Any other message means something may be
incorrect with your Server/Database definitions, or the Server/Database is unavailable.
7. Select Save.
Steps
1. Select New Endpoint Connection.
2. Enter a meaningful Endpoint Name and Description for the Endpoint Connector.
5. Enter the following Target Server credentials as provided by your System/Database Administrator.
Azure Storage
Storage Type:
Storage Account:
File System:
Target Folder:
Storage Account:
Port:
Token:
HTTP Path:
Database:
Mount Path:
Again, look for the “Test Connection succeeded” message. Any other message means something
may be incorrect with your Server/Database definitions, or the Server/Database is unavailable.
7. Select Save.
8. Select Close.
8. Select OK.
- This closes the New Task dialog box.
The Replicate Task is now ready for the creation of Endpoints.
Steps
1. On the left of the Replicate Console panel, Select Source.
2. Locate the Source Endpoint created above or one which meets your Source definitions – SAP
Source.
5. Locate the Target Endpoint created above or one which meets your Target definitions – Azure
Databricks.
7. Select Save.
9. Select Finish.
10. Select OK.
3. Select OK.
4. Select Save.
Steps
1. In the Qlik Replicate Console, Select Table Selection.
2. Select Search.
A List of available files/tables will appear.
5. Select OK.
Once data load is completed, log into the Target Microsoft Azure Databricks and validate data.