Data Visualization With Power Bi - Tech Leap
Data Visualization With Power Bi - Tech Leap
ANS: INFORMATION
ANS: KNOWLEDGE
[iii]THE ABILITY TO USE YOUR KNOWLEDGE AND EXPERIENCE TO MAKE GOOD DECISIONS AND
JUDGEMENTS?
ANS: WISDOM
[iv]CONTEXTUAL, ORGANIZED AND VETTED DATA THAT CONVEY SOME SORT OF TREND OR PATTERN?
ANS: INFORMATION
ANS: DATA
2]ALEX IS IN THE PROCESS OF CREATING A REPORT THAT DISPLAYS THE RESULTS OF A SURVEY. WHICH
DATA TYPE BEST DESCRIBES THE DATA THAT ALEX IS DEALING WITH?
ANS: OBSERVATIONAL
3]A UNIVERSITY PROFESSOR COLLECTS INFORMATION ABOUT THE CLASSIFICATION OF HER STUDENTS
AS FRESHMEN, SOPHOMORES, JUNIORS, SENIORS. THE DATA IS THEN DISPLAYED IN A BAR CHART.
WHAT TYPE OF DATA IS THE UNIVERSITY PROFESSOR COLLECTING?
[ii]GETS A BACKUP OF THE DATA GENERATED OR REVISED SINCE THE LAST BACKUP, REGARDLESS OF
THE TYPE OF THE LAST BACKUP?
[iv]GETS A BACKUP OF THE DATA GENERATED OR REVISED SINCE THE LAST FULL BACKUP?
ANS: THEY ALL REQUIRE PERFORMING PROPER ETL MAPPING TO ENSURE CONSISTENCY AND
COMPATIBILITY, THEY ALL BEGIN WITH DATA INGESTIONS AND CLEANSING OF THE DATA PRIOR TO
INTEGRATION, THEY ALL REQUIRE IDENTIFYING SOURCE AND TARGET SYSTEMS.
7]RAVI WANTS TO CREATE A DATA VISUALIZATION TO SHOW WHICH PARTS OF HIS COMPANY
WEBSITE ARE RECEIVING THE MOST CLICKS AND ARE BEING MOST VIEWED BY HIS VIEWERS. WHICH
DATA VISUALIZATION WILL PROVIDE DAN WITH A VISUAL THAT IS EASY TO ASSIMILATE AND MAKE
DECISIONS FROM?
ANS: FROM
ANS: GROUP BY
ANS: WHERE
ANS: ORDER BY
ANS: HAVING
1]SUPPOSE YOU HAVE A COMPANY OF 50 EMPLOYEES AND YOU ARE WRITING CODE FOR A VERY
SPECIFIC TYPE OF PROGRAM. THERE ARE FIVE VENDORS THAT PROVIDE YOU THE GRAPHICAL
SUPPORT AND YOUR TARGET CLIENT ARE VERY SMALL BUSINESS. WHICH STATEMENT WILL BE TRUE
IN THE DOMAIN OF DATA MANAGEMENT?
2]WHAT WILL BE THE BEST APPROACH OF DATA MANAGEMENT FOR A MULTI-FACETED ENTERPRISE
WITH MULTIPLE DOMAINS?
ANS: LOGICAL
4]WHAT IS THE PRIMARY REASON FOR DATA INTEGRATION ACROSS DOMAINS?
5]WHICH CORRECTLY DESCRIBES THE DIFFERENCE BETWEEN STATIC AND DYNAMIC DATA?
ANS: STATIC DATA DOESNOT REQUIRE UPDATING WHEREAS DYNAMIC DATA REQUIRES REGULAR
UPDATING
6]IN YOUR MLTI-DOMAIN ENTERPRISE WHERE THE PRIMARY FUNCTION IS OF A STOCK MARKET
BROKER AND WHERE YOU NEED REAL-TIME DATA SYNCHRONIZATON WHAT WILL BE THE REQUIRED
STYLE OF ARCHITECTURE NEEDED FOR THE DATA MANAGEMENT PROGRAM?
8]IN YOUR ENTERPRISE WHERE YOU ARE PLANNING TO DEVELOP A PERFECTLY ALIGNED SYSTEM
ACROSS ALL THE DOMAINS, WHICH FUNCTION WILL YOU DEEM AS NOT NECESSARY FOR BUILDING A
TRULY ALIGNED SYSTEM?
9]WHAT IS THE PROPER ORDER FOR THE LEVELS OF MANAGEMENT AND CONTROL FROM A
MEASUREMENT OF MATURITY POINT OF VIEW?
1]WHICH IS NOT THE KIND OFJOB THAT IS TO BE INCLUDED IN THE ROLE OF A DATA STEWARD?
3]WHICH CAN BE A MAJOR PROBLEM IN COMPANIES THAT HAVE A “BRING YOUR OWN DEVICE”
POLICY?
6]WHICH ARE THE TWO FACTORS CRITICAL TO AN ORGANIZATION WHEN CONSIDERING THE
REGULATION OF DATA PRIVACY?
ANS: APPLICATION OF BUSINESS AND DATA QUALITY RULES IS NOT A STEP IN ENTITY RESOLUTION
9]IN YOUR MULTI-DOMAIN ENTERPRISE WHERE THERE ARE MULTIPLE VENDORS AND MULTIPLE
RESOURCES, WHICH WILL BE A KEY DECIDING FACTOR IN HOW YOU WILL MANAGE THE CRUD?
10]IN BUILDIN YOUR DATA GOVERNANCE PRACTICE, WHICH WILL NOT BE AN OBJECTIVE FOR THE
GOVERNANCE PRACTICE?
3]IN YOUR DATA DRIVEN ENTERPRISE, HOW WILL YOU ENSURE THAT GOOD QUALITY DATA IS BEING
MAINTAINED AND RECONCILED ACROSS SYSTEMS?
8]WHICH FUNCTION IS A MUST FOR DATA COMPLIANCE BUT NOT REQUIRED AT ALL FOR DATA
MANAGEMENT AND DATA GOVERNANCE?
ANS: INTER-RELATIONS
9]WHICH IS A MAJOR SOLUTION TO ADDRESS THE DATA GOVERNANCE AND COMPLIANCE ISSUES?
MCQS TQ CLOUD:
6.An COMPANY client is eager to begin their cloud transformation but needs
guidance on selecting the right approach to meet their specific needs and
objectives. Which asset can help this client discover assess design and simulate
end to end cloud migration solutions at scale
Ans: myNav
10. The all India cricket council is experiencing sporadic activity on their
ticketing websites depending on the announcement of maths schedules the
required high speed crossing during peak times but otherwise website traffic is
nominal which unique cloud feature seamlessly helps manage the increase and
decrease in traffic?
Ans . Elasticity
13. The activities bikes during Black Friday sales and the holiday session put
significant stress on a major retailers website to what feature of cloud
computing would resolve this problem?
Ans: elasticity
16. Sarah's business offer a service where multiple users applications are hosted
on a server at the same time based on the scenario on which facet of cloud
computing does saras business rely
Ans. Multitenancy
17. A healthcare client approaches COMPANY for help in defining their Cloud
migration strategy. During the initial discussions, the client explains that there
are very specific regulatory and other compliance requirements within their
industry. Which Cloud deployment model would give this client complete
control over their data and security?
Ans: private cloud
18.Most companies are using Cloud. However, many companies fail to achieve
the anticipated benefits from their Cloud investments. Of the companies
COMPANY speaks with, what percentage realize the full benefits they expected
to get from the Cloud?
Ans 75%
22.COMPANY began its own Cloud journey in 2015, after our internal IT
organization recognized the significant advantages to be gained in leveraging
Cloud capabilities at scale. Today, which percentage of COMPANY’s
infrastructure and processes are in the Cloud?
Ans: 95%
27. Which benefits does the cloud provide to startup companies without access
to large funding?
Ans. The ability to pay as you go and create another ornaments with little
upfront costs.
29.To help build credibility with a potential new client, an COMPANY Sales Lead
shares the story of COMPANY’s own journey to the Cloud. Which statement is
true about COMPANY’s own Cloud transformation journey?
Ans . COMPANY has implemented our cloud solution at scale and delivered clear
business values
Ans . MyNav
33. The activity spikes during Black Friday sales and the holiday season put
significant stress on a major retailer’s website. What feature of Cloud
computing would resolve this problem?
Ans. Elasticity
37.A healthcare client approaches COMPANY for help in defining their Cloud
migration strategy. During the initial discussions, the client explains that there
are very specific regulatory and other compliance requirements within their
industry. Which Cloud deployment model would give this client complete
control over their data and security?
Ans. Private cloud
38. COMPANY began its own Cloud journey in 2015, after our internal IT
organization recognized the significant advantages to be gained in leveraging
Cloud capabilities at scale. Today, which percentage of COMPANY’s
infrastructure and processes are in the Cloud?
Ans .95%
40. In future where will Yan increasing majority of data come from?
Ans . Individual users and devices
42. What would be an ideal scenario for using age computing solutions?
Ans . A hospital department that perform high risk time sensitive surgeries.
44.A Chief Technology Officer has seen the company grow significantly in the
last three months. Historically, the application was hosted with on-premise
hardware, but recently moved some components of the application to the
Cloud. Which type of Cloud environment is being used?
Ans : Hybrid cloud
45. Which term describes a cloud provider allowing more than one company to
to share or rent the same server?
Ans. Multitenancy
46. How does a cloud first strategy approach clients migration to the cloud?
Ans. By bringing multiple services together to serve the client's business needs
47. How has the covid-19 pandemic affected business relationship to cloud
computing?
Ans. It has accelerated the urgency for a business to move to the cloud quickly
MCQS TQ DATA:
1. Which scenario best illustrates the implementation of data governance?
Ans. Drafting a press release based on that are extracted from the news of the
day
2. How does COMPANY helps companies harness the power of data to achieve
optimal business outcomes?
Ans. They develop data governance frameworks that go on to form the basis of
the company data analytics strategy.
10. How do data platforms help business with their data governance needs?
Ans . By controlling security and accessability
11. Which technology is combined to make that are critical organisational asset?
Ans . Machine learning and artificial intelligence (AI)
12. Why is it important for companies to invest in building a complete data and
analytics platform?
Ans . It produces cultural support and alignment a growth mindset and new
ideas and priorities to improve business process
13. How does the implementation of an enterprise wide data and analytics
strategy help organisation?
Ans. By housing data in oneplus and converting it into easily consumerical
information at high speed
14. A mobile application delivers market predictions based on stock data from
the stock market data platform knowing that the data platform can stream its
data how should the application interact with the data platform to deliver
predictions in real time?
Ans. Interact programmatically through the data platform API (application
programming interface)
15. Yah healthcare provider approaches COMPANY to help them increase their
efficiency through an advance data science platform what is the first step
essential could take ?
Ans . Train the client to independently manage their data
16. What is an advantage of using fully integrated cloud based data analytics
platform?
Ans . It provides the computing power needed to convert raw data into
meaningful information for decision makers
19. What happens when a data set include records with missing data?
Ans . Itna the data set which must then be discarded and recollected
20. Which comprehensive approach should companies follow to trust and use
that as a critical differentiater?
Ans . Develop clear data governance policies with a strong data strategy
21. A client approaches COMPANY for resolution to collect the large amount of
data it has collected over the years from various sources which solution would
help the client pool there that are together?
Ans. A cloud analytics solution
22. How does COMPANY help companies turnless meaningful data collection
into an effective data strategy?
Ans . By recommending the tool and platforms to drive the data analytics
governance and process
23. Where does change management play a major role in transforming client
business into a data driven intelligence enterprise?
Ans. Data governance and management
24. What is the most important action that organisations should take with the
data they captured about their customers?
Ans. process and organise it into meaningful information.
26. How does COMPANY help companies turnless meaningful data collection
into an effective data strategy?
Ans. by recommending the tools and platform to drive the data analytics
governance and process
27. What does COMPANY help business implement to ensure their data is
trustworthy and reliable?
Ans . Data governance and management policies
28. What role does the data information knowledge wisdom DIKW pyramid
plane managing a data driven project?
Ans. It serves as a linear model to show the various ways of extracting insights
and value from all types of data
29. How should a common data source like social media comments be
categorised?
Ans . Unstructured data
30. A client approach Assassin Safari solution to call at the large amount of data
was collected over the years from various sources which solution would help
the client pool their data together?
Ans. A cloud analytics solution
31. The department manager researches new data plans from the company and
request the list of essential features which essential features should the list
included?
Ans . Strong security centralisation programmetic access
32. How does a fully integrated data and analytics platform enable organisation
to converted data into consumeable information and insight?
Ans. Buy creating analytics reports and build mission learning modules to
refined predictive capabilities
39. How should a company adopt a data-driven culture that will stick?
Ans. use change management to transformer how the company things about
data.
40.COMPANY is helping a large retailer transform their online sales and services.
A data analyst audits the client’s customer journey and maps out the kind of
data they need to collect at each touch point. How does this help the client
achieve their objectives?
Ans. By increasing their customer knowledge and leveraging that information to
improve customer experience.
42. What is the next up and organisation should take after capturing and
collecting data?
Ans . Process and curate data
47. What is an advantage of using a fully integrated cloud based data analytics
platform?
Ans. It provides the computing power needed to convert raw data into
meaningful information for decision making.
48. COMPANY processes and migrates data for an E-Commerce frame to billion
advanced data platform how does this platform benefit the firm?
Ans . Increases business returns
49. The city of alums field is using electronic sensors to collect data on traffic
patterns in pollution and crime this data is used to manage assets and the
resources more efficiently and in real time which of these teams would better
define the data collected from the sensors in this scenario?
Ans. Big data
50. A large hotel chain approaches Accentur for advice the company has a
accumulated a vast amount of data during its years of business but it uncertain
how to put the data to use how can a sensor best help the client?
Ans . By helping the client development enterprise by data and analytics
strategy
54. How can the adoption of data platform simplify data governance for an
organisation?
Ans. By checking the integrity of data that is currently stored in the
organisations data lake
55. How does machine learning and artificial intelligence yeah AI technologies
help business use their enterprise data effectively?
Ans. The capture all the data in real time or near real time.
3]YOU HAVE INCORPORATED THE USAGE OF AMAZON REDSHIFT IN YOUR ORGANIZATION, AND YOU
DON’T WANT YOUR DATA TO BE CORRUPTED BY PROCESSING. THEREFORE YOU WANT THE DATA TO
BE STORED IN RAW FORMAT BEFORE THE PROCESSING IS DONE, WHICH IS A SERVICE OFFERED BY
AMAZON REDSHIFT. WHAT IS THE KEY BENEFIT OF STORING DATA IN RAW FORMAT?
5]YOU HAVE A VERY WELL-RUN MANAGEMENT PROGRAM IN YOUR ORGANIZATION, WHICH IS VERY
SECURE. YOU USE ANALYTICS FOR MAKING BIG DATA DECISIONS. IN RECENT MONTHS, YOU REALIZE
THAT WHATEVER DECISIONS YOU ARE MAKING BASED ON THE ANALYTICS ARE PROVIDING TO BE
WRONG AND HARMFUL FOR THE ORGANIZATION. AFTER CONSULTING WITH THE DATA ANALYTICS
AND DATA STEWARDSYOU CONCLUDE THAT IT IS DUE TO POOR DATA QUALITY. WHAT SHOULD BE
THE NEXT STEP OF ACTION?
6]YOUR ORGANIZATION HAS AN EFFECTIVE SALES TEAM THAT IS BACKED UP BY ANALYTICS THAT
HELP ACCLERATE THE PROCESS OF SALES FROM THE INITIAAL CONTACT. ONE OF THE PRIMARY
REASONS FOR ITS EFFECTIVENESS IS A DATA INPUT TOOL THAT HASTENS THE PROCESS OF DATA
ENTRY BY PROVIDING PRESET SUGGESTIONS. WHAT ARE THESE SUGGESTIONS COMMONLY KNOWN
AS IN DATA SCIENCE TERMINOLOGY?
7]YOU HAVE AN AUTOMOBILE COMPANY THAT HELPS SORT OUT VEHICLES AND MAKE MONTHLY
SALES VERSUS EXPENDITURE REPORTS. WHAT IS THE BEST WAY TO HANDLE THE DATA FOR
CENTRALLY STORING IT?
8]OTHER THAN GAINING REAL-TIME INSIGHTSOF THE DATA, WHAT IS ANOTHER MAJOR ADVANTAGE
OF STREAMING ANALYTICS?
ANS: REAL-TIME DASHBOARDS
9]IN A CANDLESTICK CHART, YOU SEE THE SHARE PRICE OF YOUR COMPANY FALLING. YOU HAVE
IMPLEMENTED ASTREAMING ANALYTICAL TOOL HELPS IN ANALYZING AND DASHBOARDING THE
DATA AS IT IS PRODUCED. OU REALIZE THAT THE CANDLESTICKS PATTERN HAS CONSOLIDATED AND IS
NOT RESPONDING WELL TO THE INFLUX OF DATA. WHAT IS THE SOURCE OF THIS PROBLEM?
10]YOU PLAN TO IMPLEMENT A MODERN DATA WAREHOUSE SOLUTION INTO YOUR ENTERPRISE.
YOU HAVE UNDERSTOOD THE PROPER DATA MANAGEMENT AND GOVERNANCE ISSUES. YOU HAVE
SET UP ALL YOUR DOMAINS AND DATA INGESTION. NOW YOU PLAN TO MAKE A CENTRAL
REPOSITORY OF ALL YOUR FILES. WHAT SHOULD BE THE NEXT STEP FOR THE IMPLEMENTATION OF
THE DATA WAREHOUSE SOLUTION?
1]THE REASON WHY AZURE DATABRICKS IS SO EASY TO USE IS BECAUSE IT IS UNIVERSAL AND IS
INTEGRATED WITH MICROSOFTS SERVER FOR BETTER PARSING OF INFORMATION. WHICH PLATFORM
HAS THE SAME SOURCE OF ORIGIN AS DATABRICKS, WHICH GIVES IT AN ANALYTICAL
ADVANTAGEOVER OTHER PLATFORMS?
2]PLACE THE STEPS FOR TYPICAL AZURE DATABRICKS WAREHOUSE IN THE CORRECT ORDER?
3]WHAT IS THE ONE DIFFERENCE THAT SEPARATES THE MODEL OF THE SNOWFLAKE DATA
WAREHOUSE FROM ALL THE OTHER DATA WAREHOUSE SOLUTIONS?
4]WHAT ARE THE MAJOR DISADVANTAGES OF SNOWFLAKE THAT MIGHT BE TROUBLESOME FOR A
FEW COMPANIES THAT SEEK DATA CATEGORIZATION?
ANS: FEWER OPTIONS WITH GEOSPATIAL SPACE, NO OPTION FOR UN-STRUCTURED DATA
6]WHAT ARE THE MAJOR ADVANTAGES OF A CLOUD WAREHOUSE SOLUTION OVER AN ON-PREMISES
DATA WAREHOUSE SOLUTION?
8]WHAT ARE THE TWO DIFFERENT DATA PIPELINE TOOLS THAT ADDRESS SPECIFIC JOB ROLES?
1]WHICHOF THESE STATEMENTS ACCURATELY DESCRIBES THE UNIQUE MATCH JOIN MATCH MODEL
IN THE MAP EDITOR?
ANS: LAST MATCH IS CONSIDERED AND PASSED TO THE OUTPUT, SET AS DEFAULT WHEN
CONFIGURING AN EXPLICIT JOIN
2]YOU ARE BUILDING AN EXPRESSION IN MAP EDITOR FOR A COLUMN IN WHICH YOU WANT TO PAD
THE ROW1.CUSTOMERID STRING WITH LEADING ZEROS UP TO MAXIMUM LENGTH OF 6 CHARACTERS.
WHICH CODE CAN YOU USE TO ACCOMPLISH THIS FOR YOU?
3]YOU HAVE MAP EDITOR OPEN FOR A Tmap OBJECT FOR WHICH YOU ARE MAPPING DATABASES
OBJECTS. YOU CLICK THE JOIN MODEL FOR A TABLE JOIN PROPERTY. WHICH OPTIONS APPEAR IN THE
OPTIONS DIALOG THAT APPEARS?
4]YOU ARE BUILDING A FILTER EXPRESSION IN MAP EDITOR FOR A COLUMN IN WHICH YOU WANT TO
FILTER THE PRODUCT NAME, PRD.NAME TO EQUAL “TURBO WIDGETS”, AND YOU WANT THE
TRANSACTION QUANTITY, TX.QTY TO BE GREATER THAN 100. WHICH CODE CAN YOU USE TO
ACCOMPLISH THIS FOR YOU?
6]ON THE COMPONENT TAB DISPLAYED FOR THE Tsortrow COMPONENT, WHEN YOU CLICK TO ADD
CRITERIA TO THE CRITERIA TABLE, TALEND AUTOMATICALLY POPULATES THE COLUMN VALUES WITH
DEFAULTS. IN THE “SORT NUM OR ALPHA?” COLUMN TALEND HA SCHOOSEN NUM BY DEFAULT FOR
CUSTOMER_ID AS DISPLAYED. WHICH OTHER VALUES ARE AVAILABLE FOR THE “SORT NUM OR
ALPHA?” COLUMN WHEN YOU CLICK TO OPEN THE DROPDOWN LIST FOR THAT COLUMN?
7]YOU ARE USING A tExtractDelimitedFields COMPONENT TO SPLIT THE ADDRESS2 FIELD IN THE
DELIMITED FILE AS DISPLAYED. WHAT MUST YOU SPECIFY THE FIELD SEPARATOR PROPERLY FOR THE
tExtractDelimitedFields COMPONENT TO PROPERLY SPLIT THE ADDRESS2 FIELD?
8]WHEN CONFGURING THE PROPERTIES FOR A tReplace COMPONENT, YOU CAN OPTIONALLY CLICK
THE ADVANCED MODE CHECKBOX. DOING SO ALLOWS YOU TO SPECIFY WHAT TYPE OF EXPRESSION
AS THE PATTERN TO SEARCH FOR?
9]WHEN CONFIGURING THE PROPERTIES FOR A tAggregateRow component. YOU ARE GOING TO
GROUP BY THE CUSTOMER_ID FIELD IN ORDER TO AGGREGATE THE SALES ON A CUSTOMER_ID BASIS,
SO THAT IN THE RESULTING OUTPUT FILE YOU WILL HAVE ONE ROW FOR EACH CUSTOMER_ID WITH
AGGREGATED SALES FIGURES. WHICH FUNCTION VALUE MUST YOU CHOOSE WHEN CONGIGURING
THE OPERATIONS TABLE FOR THIS tAggregateRow COMPONENT?
ANS: SUM
10]YOU ARE USING tNormalize COMPONENT TO NORMALIZE THE CATEGORY FIELD IN THE DELIMITED
FILE AS DISPLAYED. WHAT MUST YOU SPECIFY AS THE ITEM SEPARATOR PROPERTY FOR THE
tNormalize COMPONENT TO PROPERLY NORMALIZE THE CATEGORY FIELD?
1]SELECT THE MAIN DEPENDENCY THAT HAS TO BE INSTALLED FOR TALEND TO BE INSTALLED?
ANS: JAVA
5]SELECT THE FOLDER THAT CONTAINS ALL THE PROJECT INFORMATION FOR A JOB THAT IS EXPORTED
FROM TALEND STUDIO?
ANS: PROCESS
ANS: CREATE JOB IN THE REPOSITORY, ADD COMPONENTS FROM PALETTE TO THE DESIGN SPACE,
CONFIGURE THE COMPONENTS PROPERTIES, RUN THE JOB
ANS: ELEMENTS CAN HAVE MULTIPLE VALUES, ELEMENTS CAN CONTAIN TREE STRUCTURE
2]SELECT THE COMPONENT USED TO GENERATE AN XML FILE FROM A CSV FILE IN TALEND STUDIO?
ANS: tFileOutputXML
3]SELECT THE EXIT CODE VALUE THE SIGNIFIES THE SUCCESSFUL COMPLETION OF A JOB IN TALEND
STUDIO?
ANS: 0
5]SELECT THE OPTION TO BE ENABLED TO ALLOW THE SPECIFICATION OF 2 SCHEMAS FOR AN XML
INPUT FILE IN TALEND STUDIO?
ANS: ENABLE XPATH IN COLUMN “SCHEMA XPATH LOOP” BUT LOSE THE ORDER
6]IN ORDER TO GENERATE A COMPLEX XML FILE WHERE DATA IS SPECIFIED USING ATTRIBUTES OF
ELEMENTS AND ELEMENTS TREES IN TALEND STUDIO, WHICH COMPONENT ALLOWS SUCH?
ANS: tAdvancedFileOutputXML
ANS: tJoin
ANS: tMysqlinput
9]SELECT THE TOOL THAT ALLOWS SPECIFYING THE RELATION OF MULTIPLE TABLES AS DATA SOURCES
WHEN READING DATA FROM A DATABASE AS INPUT I TALEND STUDIO?
10]MATCH THE ATTRIBUTE VALUE WITH ITS ATTRIBUTE THAT WILL ONLY AD NEW RECORDS OR
MODIFY EXISTING ONES WITHOUT MODIFYING THE TABLE STRUCTURE OR OTHER RECORDS ALREADY
EXIST IN THE TABLE WHEN WRITING DATA TO A DATABASE TABLE IN TALEND STUDIO. TWO OPTIONS
ARE INVALID?
i]ACTION ON DATA?
ii]ACTION ON TABLE?
ANS: DEFAULT
11]SELECT THE COMPONENT THAT ALLOWS UPDATING DATA IN A DATABASE IN TALEND STUDIO?
ANS: tMySQLRow
ANS: tDenormalize.
MCQS Complex Data Types in Python: Working with Dictionaries & Sets in
Python:
Ans: names_ages[‘Alice’]
Code Editor:
names_ages[‘Tim’]
Code Editor:
Code Editor:
How would you update the names_ages dictionary with the values in
updated_names_ages dictionary?
Ans: names_ages.update(updated_names_ages)
Code Editor:
set_1 = {2, 4, 6, 8}
set_2 = {1, 2, 5, 6, 7, 8}
What operation would I run to get a result set with all of the elements from
both sets?
Ans: set_1.union(set_2)
Code Editor:
Ans: names_ages[2][1]
Code Editor:
How would you convert this to a dictionary with names as the keys and ages as
values?
Ans: dict(names_ages)
MCQS Complex Data Types in Python: Working with Lists & Tuples in Python:
Ans: Lists in python can have elements of different data types, Lists are ordered
collections
2] If you wanted to insert an element at index 2 in a particular list named
names_list what is the function that you would invoke?
Ans: names_list.insert(2,”john”)
3] If you want to count the number of times the name “John” appears in the
names_list what function would you invoke?
Ans: names_list.count(“John”)
4] If you wanted to sort the elements in the list names_list in alphabetical order
which of the following statements in Python are valid?
Code Editor:
How do you slice this list to access the elements ‘c’, ‘d’?
Ans: some_lst[2:4]
Code Editor:
names_list[::2]
Ans: [‘John’,’Lily’,’Nina’]
Code Editor:
some_string = “Python”
a, b, c, d = some_string
Code Editor:
city.find(‘x’)
Ans: -1
9] Which of the following lines of code will print this string in reverse i.e. print
out “olleH”
Code Editor:
some_str = “Hello”
Ans: some_str[::-1]
10] All of the following statements are ways in which lists and tuples are similar.
Which one of these is NOT true?
11] All of the following statements are ways in which lists and tuples are
different. Which one of these is true?
Ans: A list can be changed once creating, a tupe is immutable and cannot be
changed
12] Which of the following are valid complex data types in Python?
Ans: Using additional indentation from the left relative to the lines just before
and after the block
3] What is the output for this code?
print('a')
print('b')
print('c')
Ans: a b c
4] What is the output for this code?
if None:
print('Hi')
b = len(a)
if b == 4:
if b == 5:
else:
print(b)
a = “six”
b = (int(a), float(a))
a = “40.6 ”
b = “60.4 ”
c = a+b
num_one = 76
num_two = 23.4
value = 4
a = str(value)
b = a + “^” + “2”
c = a + “^” + “3”
Ans: 4 + 4 ^ 2 + 4 ^ 3
11] What do the values of d[0], d[1], d[2], d[3] evaluate to after the execution of
the Python code below?
z = sorted(new_list)
d = list(z)
Ans: “White”,”red”,”green”,”blue”
12] What is the output of the program below?
var = "hi"
if(type(var) == int):
elif(type(var) == float):
print("Type of the variable is Float")
elif(type(var) == complex):
else:
Ans: Unknown
13] What is the output of the program below?
total_classes = 100
attended_classes = 67
attendance = (attended_classes/total_classes)*100
else:
MCQS Conditional Statements & Loops: The Basics of for Loops in Python:
1] Which of these Python data types can NOT be iterated through using for
loops?
Ans: int
2] Given a variable my_dict which is a dictionary, consider you use it in a for
loop in this manner:
for x in my_dict:
print (x)
What are the contents printed out?
3] Which TWO of the following statements about for loops in Python are TRUE?
Ans: They may have an associated else block, They can iterate over the
elements in tuples, lists, and dictionaries
4] Given the following code, what is the type of x which is printed out in each
iteration?
for x in my_list:
print(x)
x = range(2, 14)
Ans: 13
6] What is the correct value of x given the assignment shown?
7] Which of the following function calls will generate the list below?
[10, 7, 4, 1, -2]
2] Which of the following terms best describes Jupyter notebooks that you can
use to write Python code?
Code Editor:
13 // 5?
Ans: 2
Code Editor:
a = True
b = True
Ans: a or b , a and b
9] Consider this bit of Python code:
Code Editor:
num_1 = 10
num_2 = 20
num_3 = num_1
num_1 = 100
Ans: 10
10] If you want to increment the value stored in the num_1 variable by 10 which
of the following Python statements are valid?
12] What is the correct syntax for specifying multi-line strings in Python?
Ans: functions cannot access variables which are declared outside the function
4] Consider a function definition which looks like this:
print(a, b, c)
x=3
y=4
result = x + y
print(result)
add(10, 20)
Ans: 7
Ans: A function can accept any number of positional arguments, They can be of
any data type- primitive or complex types
7] What is the default return value from a function when no return statement is
specified?
Ans: None
Ans: A function can have just one return statement, A function has to have a
return statement, A function with input arguments cannot have a return
statement
9] Which of the following statements(s) about the data types of return values
is/are false?
10] Which of the following are valid kinds of input arguments for Python
functions?
11] Which of the following are some of the advantages of using keyword
arguments to invoke functions?
Ans: Easier to maintain code since the value of each argument is clearly seen
during invocation, Keyword arguments can be specified out of order
12] What does this function definition indicate?
print(a, b, c)
13] Which of the following function definitions allows the function to accept
variable length arguments?
Ans: some_fn(*args)
14] Which of the following are valid types of arguments in Python?
Ans: Variable length positional arguments are passed into the function as a
tuple, Variable length keyword arguments are passed into the function as a
dictionary
Ans: Scikit-image
Ans: Statsmodel
Ans: Matplotlib
2] Let’s say you have imported the numpy package as np and you want to assign
the variable “x” with a 3 by 2 array of type integer, all of whose values are 1.
Which of these commands will you use to do so?
Ans: x=np.ones((3,2),dtype=np.int32)
3] What will be the value stored in the variable y after we have executed the
following code
import numpy np
y=np.arange(2,4,0.5)
Ans: [2, 2.5, 3, 3.5]
4] Let’s say you have imported the numpy package as np and you want to print
the first 2000 natural numbers in the form of an array and you want all 2000 of
the numbers to be visible on screen when printing (including the number 2000).
Which of these commands would you use to do so?
import numpy as np
x=np.array([[1,2] , [3,4]])
y=np.array([ [5,6],[7,8]])
z=(x*y)
Import numpy as np
x=np.array([4,6,2,8])
np.median(x)
Ans: 5
7] Which of these slicing operations can be used to quickly get the reversed
contents of a numpy array called “array”?
8] Match the following features of the numpy nditer function mentioned here
with the correct Boolean category
i] False
Ans: By default, the nditer object returns arrays that can be written on, The
nditer can accept only one dimensional arrays as input
ii]True
Ans: Using this function, we can iterate through each of the individual elements
of the array passed as an input argument
9] Which of these statements regarding the ravel() object in Python are true?
1] Let’s say you have a two dimensional numpy array called “twod” and you want
to split it row-wise into two equal halves. Then, which of these numpy functions
would you call on it to do so?
Ans: vsplit(twod,2)
2] Some of the features of digital images in Numpy are given below. Which of
these are true?
Ans: In numpy, images can be represented as a 3D matrix where the first two
dimensions represent the pixels in the image that are arranged in the form of a
grid and the third dimension specifies the number of channels for the image, A
digital image is a multidimensional array and every pixel in a digital image is
represented by a number
3] Let’s say you have an image that you have split into two equal halves along the
x axis. You have stored these two halves of the original image in the variables x1
and x2 respectively. Which numpy function would you use to combine these two
halves to reconstruct the original image?
Ans: concatenate((x1,x2),axis=1)
4] Let’s say you have a numpy array called “array_1” and you initialize “another
array called “array_2” with the help of a following command:
array_2 = array_1.view()
Match the following statements about “array_2” with the correct Boolean value
i]True
Ans: The base for array_2 points to the same object as array_1, array_1 and
array_2 contain the same elements
ii]False
Ans: array_2 points to the same object as array_1, If we re-assign array_2, then
we will end up re-assigning array_1 as well and change its contents
5] Let’s say you have a numpy array called “array_3” and you initialize “array_4”
with the help of a following command:
array_4 = array_3.copy()
Match the following statements about “array_4” with the correct Boolean value
i]False
Ans: If we re-assign array_4, then we will end up re-assigning array_3 as well and
change its contents, If we change a single element of array_4, then the
corresponding element in array_3 changes too, Changing the shape of array_4
will change the shape of array_3 as well
ii]True
6] Let’s say you have a 1-D numpy array called “cubes” consisting of the cubes of
the numbers 1,2,3 and so on till 10. What would be the value of the array:
cubes [ [ [ 4, 5 ], [ 1, 2 ] ] ]
7] Some of the features of Pandas is given below. Which of these are true?
8] Let’s say you imported numpy as np and you have initialized a 1-D array of
integers called “array”. What would np.all (x < 50) return?
Ans: This function would return a true boolean value if all the entries in your array
are less than 50 and false otherwise
9] Let’s say you have a Pandas dataframe called “phone_data” which contains the
data of various phones released in 2018 and their prices. It has the following
three columns:
You want only the names of all the phones that are priced more than 10,000.
Which of these commands can be used to print these values?
10] What are the conditions under which broadcasting can take place between
two elements in Numpy?
Ans: A smaller array can be broadcast on a larger array only when the
corresponding dimensions of the two arrays being operated upon are compatible
i.e. when the corresponding dimensions are equal or one of the two dimensions is
1, Broadcasting works when at least one of the elements is a scalar
11] Match the following statements about broadcasting with the correct Boolean
value:
i]False
Ans: The array [ [ 1, 2] , [ 3, 4] ] and the scalar 10 are incompatible with
broadcasting
ii]True
Ans: The scalar 10 and the scalar 20 are compatible with broadcasting, The array [
[ 1, 2] , [ 3, 4] ] and the array [ 1, 2 ,3 ] are incompatible with broadcasting, The
array [ [ 1, 2] , [ 3, 4] ] and the array [ [1], [2] ] are compatible with broadcasting.
Ans: Statsmodel
Ans: Bokeh
iii] Specifically meant for machine learning, data mining, and data analysis
Ans: Scikit-learn
2] Which of these statements related to the Pandas Series object are true?
Ans: Pandas Series object is similar to a Python list, Once we create a Pandas
Series object, an index representing the positions for each of the data points is
automatically created for the list
3] In the following Python code, typing which Python command will give the
user the CEO of Facebook?
import pandas as pd
companies_ceo = {
'Amazon' : 'Jeff Bezos'
companies_ceo_series= pd.Series(companies_ceo)
import pandas as pd
companies = {
'Amazon'
'Apple'
‘SpaceX’
‘Facebook’
‘Netflix’
ceo = {
'Jeff Bezos'
'Tim Cook‘,
‘Elon Musk‘
‘Mark Zuckerberg’
‘Reed Hastings’
Ans: frame.to_csv(‘datasets/data_frame.csv’)
7] Let’s say you have a pandas DataFrame called “panda” which has 8 rows in
total and you want to remove the last row from this DataFrame. Which of these
Python commands would you use to do so?
Ans: panda.drop(panda.index[7])
8] Let’s say you have saved a dataset in a pandas DataFrame called “dataset”
which has tons of records and you only want to access the details of the records
in only the 5th, 8th and 14th index. Which of these Python commands can you use
to do so?
Ans: dataset.loc[5,8,14], dataset.loc[[5,8,14],:]
9] Match the following statements related to the iloc indexer in Pandas with the
correct boolean values.
i] False
Ans: The column headers can be passed as input arguments in the form of a
string to the iloc function without any errors, When we pass 2:6 as input
argument to the iloc function, we get all details of the records located in the
second index all the way up to the 5th index of the DataFrame
ii]True
Ans: The iloc indexer is similar to the loc indexer and can be used to access
records located at a particular index in a Pandas DataFrame
i]True
Ans: MultiIndex is useful when we have large datasets where using numeric
indexes to refer to each record is unintuitive, MultiIndex lets the user effectively
store and manipulate higher dimensional data in a 2-dimensional tabular
structure
ii]False
Ans: The MultiIndex for a row is some composite key made up of exactly one
column
11] Which of these statements related to the pivot function in Pandas is true?
Ans: The combination of the row index and the column header must be unique
in order to generate a pivot table, The Pivot function summarizes the details of
each column in a DataFrame
12] What happens when we call the stack () function on a Pandas DataFrame?
Ans: It will create a new DataFrame such that a single row in the original
DataFrame is stacked into multiple rows in the new DataFrame depending on
the number of columns for each row in the original DataFrame
import pandas as pd
companies = {
companies_ceo = pd.DataFrame(companies)
2] Which of the following formats does Pandas not support natively when
exporting the contents of a Dataframe?
Ans: JPEG
3] Let’s say you have created a Pandas DataFrame called “unsorted” and you
want to sort the contents of this DataFrame column wise in alphabetical order
of the header name. Then, which function would you call on the “unsorted”
DataFrame to do so?
Ans: unsorted.sort_index(axis=1)
4] Match the following functions that you can call on a Pandas DataFrame
correctly with what they do
i] All the rows which contain a NaN value in any cell of that row are removed
Ans: .dropna()
ii] Every cell in the Dataset which has a NaN value will be replaced with 0
Ans: .fillna(0)
iii] Returns a Boolean array containing true or false values and returns the value
in a cell as true if it contains NaN
Ans: .isnull()
iv] Returns a Boolean array containing true or false values and returns the value
in a cell as true if it does not contain NaN
Ans: .notnull()
i] False
Ans: The .xs function cannot be used to return a cross section of columns
ii]True
Ans: By default, the .xs function only takes a look at values in the first level
index, The .xs function is used when our Pandas DataFrame makes use of a
MultiIndex
6] Let’s say you have imported Python as pd and have instantiated two
DataFrames called “frame_1” and “frame_2” with the exact same schema. What
command will you use to combine these two DataFrames into a single
DataFrame and make sure that the combined DataFrame has its own unique
index?
Ans: pd.concat( [frame_2, frame_1], ignore_index = True ), pd.concat( [frame_1,
frame_2], ignore_index = True )
7] The ‘how’ argument in the Pandas merge function allows us to specify what
kind of join operation we want to perform on the given Pandas DataFrames.
What are the valid values that we can give for this argument?
8] Some statements related to working with SQL Databases in Python are given
below. Match them with their correct Boolean values.
i] False
Ans: Once we have created a table, we can use sqlite3’s .execute() function to
recreate the same table with the same table name so that we have duplicates of
a table
ii]True
Ans: The sqlite3 library in Python allows us to create Databases on our local file
system, All the changes that we make to an SQL database on a Jupyter
notebook by connecting with it, will be committed to the database only after
we execute sqlite3’s .commit() function
3] Four of the seven characteristics of big data are listed. Match each
characteristic with its description. One description will not be used?
i] Volume
iii] Veracity
Ans: Making sure the data is accurate, which requires processes to keep bad
data from accumulating in your systems
iv] Variety
Ans: Unstructured data is information that does not have a predefined data
model, Common examples of unstructured data include audio, video files, or
No-SQL databases
6] What are the most important advantages of big data, according to the
International Institute for Analytics (IIA)?
Ans: Big data leads to cost reductions, Big data enables faster, better decision
making, Big data helps to identify what customers need and to introduce new
products and services accordingly
7] What are some of the main business domains that use big data tools today?
8] Which statements are correct about how Netflix utilizes big data?
Ans: Netflix has screenshots of scenes people might have viewed repeatedly,
the associated ratings, and the number of searches and the search topics,
Netflix uses what is known as the big data recommendation algorithm to
suggest TV shows and movies based on a user’s preferences
10] What are the main challenges that companies experience with big data?
Ans: Unfamiliarity with big data and confusing it with traditional methods,
Integrating data from a variety of sources, Data security issues, Unprecedented
data growth.
2] Which statements are true about resilient distributed datasets (RDDs) and
directed acyclic graphs (DAGs)?
4] What are some examples of metrics that Alibaba measures by utilizing Spark?
6] Which statement is correct about how Spark and Hadoop are different?
9] What are the three API types that are compatible with Spark?
10] What are some of the most important best practices when it comes to using
Apache Spark?
Ans: Proper tuning, Using the right level of parallelism, Joining a large and a
medium size RDD.
1] What are the biggest challenges associated with traditional data analytics?
2] Place the layers of big data analytics architecture in the correct order from
the bottom to the top.
Ans: Data monitoring, Data security, Data storage, Data processing, Data query,
Data visualization
Ans: Data may be in a raw, native format and not useful unless processed, Data
is not easily accessible using common tools, Data is stored in isolation and
cannot be combined with other sources
2] Which of the following are valid data types that can be stored in a data lake?
4] Which of the following are challenges involved in designing and building data
lakes?
Ans: Data lakes need to work with different data types and sparse and
incomplete data, Data lakes need to be able to support a huge volume of data,
Data lakes need to maintain data security and compliance
Ans: A database supports ACID properties and a data warehouse does not, A
data warehouse is optimized for read access, a database is optimized for read as
well as write access
6] Which of the following statements about data lakes and data warehouses are
true?
Ans: Data warehouses hold fairly structured data optimized for analysis, Data
lakes need to maintain security and ensure compliance of the data stored within
it, Data lakes promote shared data stewardship
Ans: A single catalog which indexes data from multiple sources to make it
searchable
10] Which of the following AWS services can be used to visualize data stored in
a data lake on AWS?
2] Arrange the following ETL processing steps in order from the top?
Ans: ingest data from source, message brokering, streaming data engine, long-
term storage and analytics
Ans: ETL
Ans: Transform
Ans: Load
Ans: Extract
6] Where does the library of job components reside in the Talend Open Studio
UI?
Ans: Palette
7] What high level model is used to get a project overview for ETL jobs in Talend
Open Studio?
8] Put the following AI hierarchy steps in pyramid order from the bottom up?
Ans: column-based
10] Match the data storage model approach with its descriptions.
i] Normalization
END