Performance of Fastload Vs Multiload
This is my understanding of the different types of checkpointing taking place and the utility
differences...
During Phase I of Fastload, the rows are deblocked and redistributed to their proper AMPs,
but they are not sorted in hashing order. In Phase II of Fastload, the rows are then sorted
and merged into the actual table. There is some kind of internal checkpointing going on in this
phase to keep track of which data blocks have been merged in and which have not. This is
different from the checkpointing that takes place in Phase I, which keeps track of which rows
have been loaded from the host file. At the end of Phase II of Fastload, the fallback copy of
the table is created (if fallback is specified).
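As a rough sketch (the names and checkpoint interval are illustrative, not from any particular system), a minimal Fastload script looks something like this. Phase I covers the data acquisition after BEGIN LOADING, and END LOADING is what triggers the Phase II sort and merge:

    LOGON tdpid/user,password;
    SET RECORD VARTEXT ",";
    BEGIN LOADING mydb.target_table
        ERRORFILES mydb.target_err1, mydb.target_err2
        CHECKPOINT 100000;  /* Phase I checkpoint every 100,000 rows */
    DEFINE col1 (VARCHAR(10)),
           col2 (VARCHAR(20))
    FILE = load_data.txt;
    INSERT INTO mydb.target_table VALUES (:col1, :col2);
    END LOADING;            /* triggers the Phase II sort and merge */
    LOGOFF;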
In Mload, during the acquisition phase, if you have fallback on the table, two copies of each row
are created and sent to the AMPs (one to the primary AMP and one to the fallback AMP). This is one
difference between Mload and Fastload. Also, during the acquisition phase, as rows are being
deblocked and sent to the appropriate AMP, they are put into the work table in hashing order, just
as they exist in the target table. So, at the end of the acquisition phase, there is no need for a
sort (another difference between Mload and Fastload).
Once all of the rows have been put into the work table, the application phase begins and
the rows are merged from the work table into the target table. There is also checkpointing in
the application phase to keep track of which rows (or data blocks) from the work
table have been applied to the target table and which have not. This is different from the
checkpointing in the acquisition phase, which, again, keeps track of which rows have
been loaded from the file.
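For comparison, a minimal Mload script (again with illustrative names) shows the same two-phase flow: .IMPORT drives the acquisition phase into the work table, and .END MLOAD triggers the application phase. The .LOGTABLE is what Mload uses for its checkpoint/restart bookkeeping:

    .LOGTABLE mydb.target_restartlog;
    .LOGON tdpid/user,password;
    .BEGIN MLOAD TABLES mydb.target_table;
    .LAYOUT file_layout;
    .FIELD col1 * VARCHAR(10);
    .FIELD col2 * VARCHAR(20);
    .DML LABEL ins_target;
    INSERT INTO mydb.target_table VALUES (:col1, :col2);
    .IMPORT INFILE load_data.txt
        FORMAT VARTEXT ','
        LAYOUT file_layout
        APPLY ins_target;   /* acquisition phase: rows land in the work table */
    .END MLOAD;             /* triggers the application phase */
    .LOGOFF;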
So, the differences between the two utilities when loading into an empty table would be:
1) Fastload sends one copy of each row regardless of fallback or non-fallback and then
creates the fallback copy in Phase II. Mload creates a second copy of each row if the target
table is defined as fallback. (Of course, you can always alter the table to add fallback at the
end of the Mload or Fastload, which would make this difference moot; see the statement after
this list.)
2) Fastload does a sort in Phase II to get the rows in hashing order. Mload puts the rows in
hashing order in the work table as it goes along. So, there is no sort that takes place in Mload
prior to merging the rows into the target table.
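The fallback workaround mentioned in point 1 is a single statement (table name illustrative):

    ALTER TABLE mydb.target_table, FALLBACK;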
If you are going into a non-fallback table, I would think that #2 would be the primary advantage
of using Fastload. Rather than going through the overhead of keeping the rows in the work table
in order as it goes along (which slows things down), Fastload simply gets the rows to the right
AMP (in Phase I) and then sorts them at the end of that process (during Phase II).
a1) BTEQ returns rows from the database to the underlying CLI in blocks as well. Depending on
your setup, 64K may be the default, but I believe this can be adjusted upward to 1MB (a CLI
parameter, not a BTEQ parameter). This should be changed if exporting medium to large result sets.
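For reference, a minimal BTEQ export script (names illustrative). Note the block size is adjusted on the CLI side, if I remember correctly via the resp_buf_len setting in clispb.dat, not by anything in the script itself:

    .LOGON tdpid/user,password
    .EXPORT DATA FILE = results.dat
    SELECT col1, col2 FROM mydb.source_table;
    .EXPORT RESET
    .LOGOFF
    .QUIT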
a2) The primary difference between Fastexport and BTEQ export is the ability to ship data over
multiple session connections simultaneously, thereby leveraging the total connectivity available
between the client platform and the database engine. To do this, Fastexport spends more
resources on executing the query, preparing the blocks in such a way that, when they are
exported over multiple sessions, they can easily be reassembled in the right order by the client
without additional sorting or processing of the rows.
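A minimal Fastexport script makes the multi-session point concrete; the session count and names are illustrative:

    .LOGTABLE mydb.fexp_restartlog;
    .LOGON tdpid/user,password;
    .BEGIN EXPORT SESSIONS 8;   /* ship blocks over 8 parallel sessions */
    .EXPORT OUTFILE results.dat;
    SELECT col1, col2 FROM mydb.source_table;
    .END EXPORT;
    .LOGOFF;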
b1) BTEQ import does process a row at a time. Thus it is generally appropriate only for very small
imports.
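Something like this (names illustrative); .REPEAT * submits the INSERT once per record in the input file, which is exactly the row-at-a-time behavior described above (newer BTEQ versions can batch rows with a PACK clause on .REPEAT, which softens this somewhat):

    .LOGON tdpid/user,password
    .IMPORT DATA FILE = small_input.dat
    .REPEAT *
    USING (col1 INTEGER, col2 CHAR(10))
    INSERT INTO mydb.small_table VALUES (:col1, :col2);
    .LOGOFF
    .QUIT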
b2) Fastload does use a buffered approach. But its real differentiation comes from its use of multiple
simultaneous sessions which allow it to leverage all of the parallelism of the parallel database engine.
Multiple AMPs are working on receiving, deblocking, data type transformation, hashing and
redistribution of the data.
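In script terms, that parallelism is requested up front with the SESSIONS command (the count here is arbitrary; as I understand it, Fastload is capped at one session per AMP):

    SESSIONS 16;
    LOGON tdpid/user,password;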
b3) Fastload is only capable of inserting data into an empty table. This obviates the need for
journaling because, if the load doesn't finish properly, the proper recovery is to simply delete the
table and start over. BTEQ import uses journaling because it can load into an existing table and
needs to be able to roll back an unfinished import.
c1) Fastload only allows insert into empty tables. Thus the thought is that it is easy to add indexes and
other table attributes after the table has been loaded with the data. No need for journals because the
data is either loaded or it is not. No need to roll forward or back.
c2) The primary reason Fastload does not allow duplicate records is that, at the time it was designed
and built, multiset tables did not exist. Teradata's early days allowed only SET tables, so Fastload
did not have to be designed to load duplicate records. It was deemed a user-friendly feature
to automatically eliminate them rather than report them as errors. When Multiload was designed, it
had a very different set of requirements, including multiset support and the ability to insert, update
and delete in existing populated tables. This in turn led to a very different design, including the
sequence numbers in the incoming rows to allow ordered apply, insert of duplicate rows, and the
checkpoint/restart case while still preserving the duplicate rows. It was decided that we would not
go back and redesign Fastload to cover all of these cases but rather leave it for the simple case of
insert into empty tables and have Multiload handle any other case not supported by Fastload.
Choosing between the bulk tools should be less about relative performance and more about matching
the required functionality to the use case. If inserting data into an empty table (without dups),
then Fastload; else Multiload. BTEQ import and TPump are chosen if the import data is small or
medium, respectively, and it is desirable to avoid the overhead of using a bulk utility slot and the
overhead of startup and shutdown of the bulk utility for small data sets. BTEQ export likewise is
appropriate for small to medium result sets. Fastexport is designed to handle the high-volume
exports in an expeditious way. And of course TPump can be used when the requirement is to load
continuously.