0% found this document useful (0 votes)

47 views10 pages

Exercise 7 - Using Hive To Access Hadoop-Hbase Data

This document outlines a lab exercise focused on using Hive to access Hadoop/HBase data through a command line interface. It includes step-by-step instructions for connecting to the lab environment, starting the HBase shell, creating tables, and modifying table properties. The lab aims to provide practical experience in storing and querying data using HBase commands and features.

Uploaded by

risalafr

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

47 views10 pages

Exercise 7 - Using Hive To Access Hadoop-Hbase Data

Uploaded by

risalafr

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Unit 7 Storing and querying data

Lab 1
Using Hive to access Hadoop/HBase data

Storing and querying data © Copyright IBM Corporation 2018

Lab 1: Using Hive to access Hadoop/HBase data

© Copyright IBM Corp. 2018 7-68

Course materials may not be reproduced in whole or in part without the prior written permission of IBM.
Unit 7 Storing and querying data

Lab 1:
Using Hive to access Hadoop/HBase data

Purpose:
This lab is intended to provide you with experience in accessing
Hadoop/HBase data using a command line interface (CLI).

Task 1. Storing and accessing HBase data.

The major references for the HBase can be found on the [Link] and
Apache Wiki websites:
- [Link]
- [Link]
Cognitive Class has a free course on HBase at:
- [Link]
1. Connect to and login to your lab environment with user student and password
student credentials.
2. Launch Firefox, and then if necessary, navigate to the Ambari login page,
[Link] logging in as admin/admin.
3. Verify that Hive is running by clicking on Hive in the left panel:

If Hive is not running, you will have to start it using the central panel.
When running, minimize the Ambari Web Console browser.
4. Open a new terminal window then type cd to change to your home directory.

© Copyright IBM Corp. 2018 7-69

Course materials may not be reproduced in whole or in part without the prior written permission of IBM.
Unit 7 Storing and querying data

5. To start the HBase CLI shell, type the following command:

/usr/bin/hbase shell

[student@ibmclass ~]$ /usr/bin/hbase shell

2015-06-07 [Link],247 INFO [main] [Link]: [Link] is
deprecated. Instead, use [Link]
HBase Shell; enter 'help<RETURN>' for list of supported commands.
Type "exit<RETURN>" to leave the HBase Shell
Version 0.98.8_HDP_4-hadoop2, rUnknown, Fri Mar 27 [Link] PDT 2015

hbase(main):001:0>

The last line here is the prompt for the HBase CLI Client. Note that the
interactions are numbered (001) for each CLI session.
6. To view the online CLI help manual, type help at the prompt and then press
Enter:
hbase(main):001:0> help
HBase Shell, version 0.98.8_HDP_4-hadoop2, rUnknown, Fri Mar 27 [Link] PDT 2015
Type 'help "COMMAND"', (e.g. 'help "get"' -- the quotes are necessary) for help on a
specific command.
Commands are grouped. Type 'help "COMMAND_GROUP"', (e.g. 'help "general"') for help on a
command group.

COMMAND GROUPS:
Group name: general
Commands: status, table_help, version, whoami

Group name: ddl

Commands: alter, alter_async, alter_status, create, describe, disable, disable_all,
drop, drop_all, enable, enable_all, exists, get_table, is_disabled, is_enabled, list,
show_filters

Group name: namespace

Commands: alter_namespace, create_namespace, describe_namespace, drop_namespace,
list_namespace, list_namespace_tables

Group name: dml

Commands: append, count, delete, deleteall, get, get_counter, incr, put, scan, truncate,
truncate_preserve

Group name: tools

Commands: assign, balance_switch, balancer, catalogjanitor_enabled, catalogjanitor_run,
catalogjanitor_switch, close_region, compact, compact_rs, flush, hlog_roll, major_compact,
merge_region, move, split, trace, unassign, zk_dump

Group name: replication

Commands: add_peer, disable_peer, enable_peer, list_peers, list_replicated_tables,
remove_peer, set_peer_tableCFs, show_peer_tableCFs

Group name: snapshots

Commands: clone_snapshot, delete_snapshot, list_snapshots, rename_snapshot,
restore_snapshot, snapshot

Group name: security

Commands: grant, revoke, user_permission

Group name: visibility labels

Commands: add_labels, clear_auths, get_auths, set_auths, set_visibility

SHELL USAGE:
Quote all names in HBase Shell such as table and column names. Commas delimit

© Copyright IBM Corp. 2018 7-70

Course materials may not be reproduced in whole or in part without the prior written permission of IBM.
Unit 7 Storing and querying data

command parameters. Type <RETURN> after entering a command to run it.

Dictionaries of configuration used in the creation and alteration of tables are
Ruby Hashes. They look like this:

{'key1' => 'value1', 'key2' => 'value2', ...}

and are opened and closed with curley-braces. Key/values are delimited by the
'=>' character combination. Usually keys are predefined constants such as
NAME, VERSIONS, COMPRESSION, etc. Constants do not need to be quoted. Type
'[Link]' to see a (messy) list of all constants in the environment.

If you are using binary keys or values and need to enter them in the shell, use
double-quote'd hexadecimal representation. For example:

hbase> get 't1', "key\x03\x3f\xcd"

hbase> get 't1', "key\003\023\011"
hbase> put 't1', "test\xef\xff", 'f1:', "\x01\x33\x40"

The HBase shell is the (J)Ruby IRB with the above HBase-specific commands added.
For more on the HBase Shell, see [Link]
hbase(main):002:0>

Take a minute to look through the Help to see what is available to you. Note that
in the commands, the names of the tables, rows, column families are all in
quotes. You will need to make sure that when you are referring to specific
tables, rows, column families, that they are enclosed in quotes.
Practical notes:
• Command (such as create) must be lowercase
• Table name (such as t1) must be quoted, …
• In interactive mode, you do not need a semicolon as statement terminator
or separator (unlike standard SQL)
7. Create an HBase table using the create command:
create 't1', 'cf1', 'cf2', 'cf3'
to create a table t1 with three column families (cf1, cf2, cf3).
hbase(main):002:0> create 't1', 'cf1', 'cf2', 'cf3'
0 row(s) in 5.9840 seconds

=> Hbase::Table - t1
hbase(main):003:0>

The table t1 has been created.

© Copyright IBM Corp. 2018 7-71

Course materials may not be reproduced in whole or in part without the prior written permission of IBM.
Unit 7 Storing and querying data

Note that these single quotes are inch-type quotes (') and not Microsoft smart-
quotes (‘’). Take care anytime, here and elsewhere, if you cut-and-paste code
that Microsoft or other products have not changed the original code available to
use by converting to "smart-quotes".
Other notes:
• The create command only requires the name of the table and one or more
column families. Columns can be added dynamically to the table. Also,
each row can have a different set of columns (within each column family).
However, the table may not be mappable to SQL in such cases.
• Our column family names have been deliberately kept short. This is a best
practice: keep your column family names short. For example, instead of
'col_fam1' use 'cf1'. HBase stores the entire names across all of their
nodes where the data resides. If you use a long name, it will get repeated
across all of the nodes increase the total usage. You want to avoid this by
using as short of a name as possible.
• The table name does not need to be short. The name t1 here is short, and
cryptic, merely to save your typing convenience. In reality you should use
more expressive names as that is better documentation of your data
model.
8. Type describe 't1' to verify your table creation.
hbase(main):001:0> describe 't1'
Table t1 is ENABLED
t1
COLUMN FAMILIES DESCRIPTION
{NAME => 'cf1', DATA_BLOCK_ENCODING => 'NONE', BLOOMFILTER => 'ROW', REPLICATION_SCOPE =>
'0', VERSIONS => '1', COMPRESSION => 'NONE', M
IN_VERSIONS => '0', TTL => 'FOREVER', KEEP_DELETED_CELLS => 'FALSE', BLOCKSIZE => '65536',
IN_MEMORY => 'false', BLOCKCACHE => 'true'}
{NAME => 'cf2', DATA_BLOCK_ENCODING => 'NONE', BLOOMFILTER => 'ROW', REPLICATION_SCOPE =>
'0', VERSIONS => '1', COMPRESSION => 'NONE', M
IN_VERSIONS => '0', TTL => 'FOREVER', KEEP_DELETED_CELLS => 'FALSE', BLOCKSIZE => '65536',
IN_MEMORY => 'false', BLOCKCACHE => 'true'}
{NAME => 'cf3', DATA_BLOCK_ENCODING => 'NONE', BLOOMFILTER => 'ROW', REPLICATION_SCOPE =>
'0', VERSIONS => '1', COMPRESSION => 'NONE', M
IN_VERSIONS => '0', TTL => 'FOREVER', KEEP_DELETED_CELLS => 'FALSE', BLOCKSIZE => '65536',
IN_MEMORY => 'false', BLOCKCACHE => 'true'}
3 row(s) in 1.4450 seconds

hbase(main):002:0>

© Copyright IBM Corp. 2018 7-72

Course materials may not be reproduced in whole or in part without the prior written permission of IBM.
Unit 7 Storing and querying data

Note the following metadata features in your describe response:

• COMPRESSION
• IN_MEMORY
• VERSIONS
You will be changing some of these values shortly with an alter statement:
The next question is: just where is the table t1 stored? It is stored in HDFS, but
where in particular?
9. Open another terminal window (so that you can continue later in the first
terminal window) and execute the following command:
hadoop fs -ls -R / 2>/dev/null | grep t1
where this command lists all files (recursive, -R) and passes the results to the
Linux command grep. There would be errors, but these are discarded
(2>/dev/null).
[student@ibmclass Desktop]$ hadoop fs -ls -R / 2>/dev/null | grep t1
drwxr-xr-x - hbase hdfs 0 2015-06-07 12:17
/apps/hbase/data/data/default/t1
drwxr-xr-x - hbase hdfs 0 2015-06-07 12:17
/apps/hbase/data/data/default/t1/.tabledesc
-rw-r--r-- 3 hbase hdfs 769 2015-06-07 12:17
/apps/hbase/data/data/default/t1/.tabledesc/.tableinfo.0000000001
drwxr-xr-x - hbase hdfs 0 2015-06-07 12:17
/apps/hbase/data/data/default/t1/.tmp
drwxr-xr-x - hbase hdfs 0 2015-06-07 12:17
/apps/hbase/data/data/default/t1/8a45456f26ee4569360c6af03e893ed6
-rw-r--r-- 3 hbase hdfs 35 2015-06-07 12:17
/apps/hbase/data/data/default/t1/8a45456f26ee4569360c6af03e893ed6/.regioninfo
drwxr-xr-x - hbase hdfs 0 2015-06-07 12:17
/apps/hbase/data/data/default/t1/8a45456f26ee4569360c6af03e893ed6/cf1
drwxr-xr-x - hbase hdfs 0 2015-06-07 12:17
/apps/hbase/data/data/default/t1/8a45456f26ee4569360c6af03e893ed6/cf2
drwxr-xr-x - hbase hdfs 0 2015-06-07 12:17
/apps/hbase/data/data/default/t1/8a45456f26ee4569360c6af03e893ed6/cf3
[student@ibmclass Desktop]$

Note that directories are created. Files will be put into those directories as
records are stored into the table t1.

© Copyright IBM Corp. 2018 7-73

Course materials may not be reproduced in whole or in part without the prior written permission of IBM.
Unit 7 Storing and querying data

10. In the first terminal window, specify compression for a column family in the table
using the following statement:
alter 't1', {NAME => 'cf1', COMPRESSION => 'GZ'}

hbase(main):002:0> alter 't1', {NAME => 'cf1', COMPRESSION => 'GZ'}

Updating all regions with the new schema...
0/1 regions updated.
1/1 regions updated.
Done.
0 row(s) in 2.4600 seconds

hbase(main):003:0>

The other compression algorithms that are supported but would need extra
configuration are SNAPPY and LZO. Note that gzip is slow but also has the
most efficient compression option.
Sometimes you may find that you need to disable the table prior to executing
the alter statement. This can be done using the following statement:
disable 't1'

You will now make additional changes to the metadata for the table.
11. Specify the IN_MEMORY option for a column family that will be queried
frequently. This does not ensure the data will be in memory always. It only gives
priority for the corresponding data to stay in the cache longer.
alter 't1', {NAME => 'cf1', IN_MEMORY => 'true'}
12. Specify the required number of versions for a column. By default, HBase stores
1 version of the value, but you can set to have more than 1 versions stored.
Enter the following in one continuous line:
alter 't1', {NAME => 'cf1', VERSIONS => 3},
{NAME => 'col_fam2', VERSIONS => 2}
13. Run the describe statement again, and verify that these changes were made to
the table and the column families.
describe 't1'

Course materials may not be reproduced in whole or in part without the prior written permission of IBM.
Unit 7 Storing and querying data

14. Insert dates into the table using the put command. Each row could have
different set of columns. The below set of put commands inserts two rows with a
different set of column names. Go ahead and enter each of these commands
(singly or as a group) into the HBase CLI shell, or insert something similar of
your choice.
put 't1', 'row1', 'cf1:c11', 'r1v11'
put 't1', 'row1', 'cf1:c12', 'r1v12'
put 't1', 'row1', 'cf2:c21', 'r1v21'
put 't1', 'row1', 'cf3:c31', 'r1v31'
put 't1', 'row2', 'cf1:d11', 'r2v11'
put 't1', 'row2', 'cf1:d12', 'r2v12'
put 't1', 'row2', 'cf2:d21', 'r2v21'

hbase(main):010:0> put 't1', 'row1', 'cf1:c11', 'r1v11'

put 't1', 'row1', 'cf1:c12', 'r1v12'
put 't1', 'row1', 'cf2:c21', 'r1v21'
put 't1', 'row1', 'cf3:c31', 'r1v31'
put 't1', 'row2', 'cf1:d11', 'r2v11'
put 't1', 'row2', 'cf1:d12', 'r2v12'
put 't1', 'row2', 'cf2:d21', 'r2v21'
0 row(s) in 0.3680 seconds

0 row(s) in 0.0410 seconds

0 row(s) in 0.0590 seconds

0 row(s) in 0.0890 seconds

0 row(s) in 0.0520 seconds

0 row(s) in 0.0420 seconds

0 row(s) in 0.0550 seconds

hbase(main):016:0>

15. To view the data, you may use the get command to retrieve an individual row,
or the scan command to retrieve multiple rows:
get 't1', 'row1'
hbase(main):021:0> get 't1', 'row1'
COLUMN CELL
cf1:c11 timestamp=1433702916069, value=r1v11
cf1:c12 timestamp=1433702916309, value=r1v12
cf2:c21 timestamp=1433702916379, value=r1v21
cf3:c31 timestamp=1433702916476, value=r1v31
4 row(s) in 0.0570 seconds

Curiosity point: All data is versioned either using an integer timestamp (seconds
since the epoch, 1 Jan 1970 UCT/GMT), or another integer of your choice.

Course materials may not be reproduced in whole or in part without the prior written permission of IBM.
Unit 7 Storing and querying data

16. If you run the scan command, you will see a per-row listing of values:
scan 't1'

hbase(main):022:0> scan 't1'

ROW COLUMN+CELL
row1 column=cf1:c11, timestamp=1433702916069, value=r1v11
row1 column=cf1:c12, timestamp=1433702916309, value=r1v12
row1 column=cf2:c21, timestamp=1433702916379, value=r1v21
row1 column=cf3:c31, timestamp=1433702916476, value=r1v31
row2 column=cf1:d11, timestamp=1433702916539, value=r2v11
row2 column=cf1:d12, timestamp=1433702916595, value=r2v12
row2 column=cf2:d21, timestamp=1433702916661, value=r2v21
2 row(s) in 0.1230 seconds

hbase(main):023:0>

Notes:
• The above scan results show that HBase tables do not require a set schema.
This is good for some applications that need to store arbitrary data. To put this
in other words, HBase does not store null values. If a value for a column is null
(e.g. values for d11, d12, d21 are null for row1), it is not stored. This is one
aspect that makes HBase work well with sparse data.
• In addition to the actual column value (r1v11), each result row has the row key
value (row1), column family name (col_fam1), column qualifier/column (c11)
and timestamp. These pieces of information are also stored physically for each
value. Having a large number of columns with values for all rows (in other
words, dense data) would mean this information gets repeated. Also, larger row
key values, longer column family and column names would increase the storage
space used by a table. For example use r1 instead of row1.
Good business practices:
• Try to use smaller row key values, column family and qualifier names.
• Try to use fewer columns if you have dense data

Course materials may not be reproduced in whole or in part without the prior written permission of IBM.
Unit 7 Storing and querying data

Task 2. Storing and accessing Hive data.

The major references for the Hive can be found on the [Link] and Apache
Wiki websites:
- [Link]
- [Link]
Cognitive Class has a free course on Hive:
- [Link]
You will not have time in this unit to do a full exercise on Hive, but you will start
the Hive CLI client to learn where to find it.
1. In a new terminal window, type cd to change to the home directory.
2. To start the Hive client, type hive.
[student@ibmclass ~]$ hive
15/06/07 [Link] WARN [Link]: HiveConf of name [Link]
does not exist
15/06/07 [Link] WARN [Link]: HiveConf of name [Link] does not exist
15/06/07 [Link] WARN [Link]: HiveConf of name
[Link] does not exist

Logging initialized using configuration in file:/etc/hive/conf/[Link]

hive>

Some configuration is needed for Hive. With the appropriate configuration and
setup, the HBase table that you created can be accessed with Hive and
HiveQL.
It is recommended that you take the Cognitive Class course and/or continue
your learning with one of the many tutorials available on the Internet.
3. Close all open windows.
Results:
You accessed Hadoop/Hive data using a command line interface (CLI).

Course materials may not be reproduced in whole or in part without the prior written permission of IBM.

HBASE
No ratings yet
HBASE
11 pages
HBASE
No ratings yet
HBASE
18 pages
Unit 1 P2 HBase
No ratings yet
Unit 1 P2 HBase
22 pages
HBase Shell Commands Guide
No ratings yet
HBase Shell Commands Guide
10 pages
rc159-HBase 7 PDF
No ratings yet
rc159-HBase 7 PDF
7 pages
Unit 5 Lecture No-3 (Hbase)
No ratings yet
Unit 5 Lecture No-3 (Hbase)
35 pages
Unit 5 Lecture No-3 (Hbase)
No ratings yet
Unit 5 Lecture No-3 (Hbase)
35 pages
Bda Unit 5
No ratings yet
Bda Unit 5
16 pages
HBase - Tutorial
No ratings yet
HBase - Tutorial
14 pages
HBase: Features, Operations, and Architecture
No ratings yet
HBase: Features, Operations, and Architecture
93 pages
Chapter 12 HBase
No ratings yet
Chapter 12 HBase
108 pages
Hbase
No ratings yet
Hbase
14 pages
9 HBase
No ratings yet
9 HBase
77 pages
HBase Big Data Processing Guide
No ratings yet
HBase Big Data Processing Guide
6 pages
HBase: A Key-Value NoSQL Database
100% (1)
HBase: A Key-Value NoSQL Database
47 pages
Big Data UNIT 5 Own
No ratings yet
Big Data UNIT 5 Own
18 pages
Pbds Unit-5
No ratings yet
Pbds Unit-5
60 pages
11lecture - Technology and Tools (HiveHbaseMahout)
No ratings yet
11lecture - Technology and Tools (HiveHbaseMahout)
54 pages
DDMUNIT5
No ratings yet
DDMUNIT5
11 pages
HBase
No ratings yet
HBase
39 pages
Experiment No 2
No ratings yet
Experiment No 2
9 pages
HBase & Pig Shell Commands Guide
No ratings yet
HBase & Pig Shell Commands Guide
23 pages
Hbase Installation Steps
No ratings yet
Hbase Installation Steps
13 pages
BDM Unit 5
No ratings yet
BDM Unit 5
60 pages
HBase Shell Commands
No ratings yet
HBase Shell Commands
8 pages
HBase
No ratings yet
HBase
27 pages
HBase Shell Commands Guide
No ratings yet
HBase Shell Commands Guide
7 pages
Apache HBase Tutorial & Setup Guide
No ratings yet
Apache HBase Tutorial & Setup Guide
19 pages
HBase: Data Management & Architecture
No ratings yet
HBase: Data Management & Architecture
36 pages
BDA Unit 5 HIVE HBASE
No ratings yet
BDA Unit 5 HIVE HBASE
33 pages
HBase Key Components and Configuration Guide
No ratings yet
HBase Key Components and Configuration Guide
5 pages
Unit 5 Big Data
No ratings yet
Unit 5 Big Data
34 pages
1.2. Quick Start - Standalone HBase
No ratings yet
1.2. Quick Start - Standalone HBase
7 pages
HBase Overview: Data Model & Clients
No ratings yet
HBase Overview: Data Model & Clients
34 pages
Big Data Analytics Unit-5
No ratings yet
Big Data Analytics Unit-5
28 pages
Apache Hbase ™ Reference Guide
No ratings yet
Apache Hbase ™ Reference Guide
792 pages
HBase Overview and Architecture Guide
No ratings yet
HBase Overview and Architecture Guide
37 pages
Hadoop Week 6
No ratings yet
Hadoop Week 6
38 pages
bdcc-2 5
No ratings yet
bdcc-2 5
9 pages
Shibasish Chatterjee (2153203) Big Data SME Hands-On
No ratings yet
Shibasish Chatterjee (2153203) Big Data SME Hands-On
85 pages
Hadoop HBASE
No ratings yet
Hadoop HBASE
71 pages
Apache Hbase Reference Guide
No ratings yet
Apache Hbase Reference Guide
836 pages
Columnar Databases for Data Analysts
No ratings yet
Columnar Databases for Data Analysts
18 pages
10 HBase
No ratings yet
10 HBase
13 pages
Hbase Tutorial
No ratings yet
Hbase Tutorial
22 pages
Assignment Day 10: Task 1
No ratings yet
Assignment Day 10: Task 1
8 pages
Lecture10 HBase
No ratings yet
Lecture10 HBase
70 pages
BigData Cheatsheet HBase Hive
No ratings yet
BigData Cheatsheet HBase Hive
1 page
Final Bda 1-8 Lab Aayush
No ratings yet
Final Bda 1-8 Lab Aayush
17 pages
Hbase Commands
No ratings yet
Hbase Commands
9 pages
HBase: Key Features and Architecture
No ratings yet
HBase: Key Features and Architecture
31 pages
Unit 5 Notes
100% (3)
Unit 5 Notes
66 pages
SIC Big Data Chapter 4 HBase
No ratings yet
SIC Big Data Chapter 4 HBase
14 pages
HBase Overview and Data Management
No ratings yet
HBase Overview and Data Management
35 pages
HBase NoSQL Database Overview
No ratings yet
HBase NoSQL Database Overview
9 pages
H Base Tutorial
No ratings yet
H Base Tutorial
38 pages
Hbase Lab Manual3.0-Update
No ratings yet
Hbase Lab Manual3.0-Update
8 pages
Unit 5 Hbase
No ratings yet
Unit 5 Hbase
15 pages
Hands-On Lab Step-By Step - Azure Security Privacy and Compliance - Published
100% (1)
Hands-On Lab Step-By Step - Azure Security Privacy and Compliance - Published
51 pages
Basic PostgreSQL SQL Queries Guide
No ratings yet
Basic PostgreSQL SQL Queries Guide
16 pages
ArcGIS Training NEA
100% (1)
ArcGIS Training NEA
161 pages
Uma DataEngineer Resume
No ratings yet
Uma DataEngineer Resume
6 pages
Excel
No ratings yet
Excel
21 pages
Library Management System
No ratings yet
Library Management System
8 pages
MCQs On Data Structures and Algorithms
No ratings yet
MCQs On Data Structures and Algorithms
28 pages
SQL Database Normalization Techniques
No ratings yet
SQL Database Normalization Techniques
57 pages
SQL - Drop Table - 1keydata
No ratings yet
SQL - Drop Table - 1keydata
1 page
WPF Database Setup Guide PRN212
No ratings yet
WPF Database Setup Guide PRN212
4 pages
12IP and CS BOTH - 100 - VIVA Qs - CS 12 by Lovejeet Arora
No ratings yet
12IP and CS BOTH - 100 - VIVA Qs - CS 12 by Lovejeet Arora
8 pages
Web-Based Student Result Management System: October 2018
No ratings yet
Web-Based Student Result Management System: October 2018
21 pages
Ch02 Constraints Triggers View
No ratings yet
Ch02 Constraints Triggers View
35 pages
Unit 5 VB
No ratings yet
Unit 5 VB
37 pages
PRAVEEN
No ratings yet
PRAVEEN
10 pages
Column Oriented Database
No ratings yet
Column Oriented Database
45 pages
Relational Model Lecture Notes
No ratings yet
Relational Model Lecture Notes
20 pages
Bite302l - Database-Systems - TH - 1.0 - 71 - Bite302l - 66 Acp
No ratings yet
Bite302l - Database-Systems - TH - 1.0 - 71 - Bite302l - 66 Acp
2 pages
DataBase Systems 5th Edition, Silberschatz, Korth and Sudarshan - Chapter 1
71% (7)
DataBase Systems 5th Edition, Silberschatz, Korth and Sudarshan - Chapter 1
34 pages
Amazon Relational Database Service: User Guide
No ratings yet
Amazon Relational Database Service: User Guide
2,102 pages
Semantic Web Unit - 5 Material Final
No ratings yet
Semantic Web Unit - 5 Material Final
22 pages
Course Objectives
No ratings yet
Course Objectives
2 pages
Data Mining (Module-1)
No ratings yet
Data Mining (Module-1)
14 pages
Database Normalization Guide
No ratings yet
Database Normalization Guide
46 pages
Working With SQL & Transact SQL (T-SQL) Queries
No ratings yet
Working With SQL & Transact SQL (T-SQL) Queries
7 pages
COP 5725 Database Systems Homework 2
No ratings yet
COP 5725 Database Systems Homework 2
8 pages
Kalamansig, Sultan Kudarat: Topics and Objectives
No ratings yet
Kalamansig, Sultan Kudarat: Topics and Objectives
3 pages
DBMS Question Bank
No ratings yet
DBMS Question Bank
10 pages
HW1 Solu
No ratings yet
HW1 Solu
9 pages
Writing Business Rules for Data Models
No ratings yet
Writing Business Rules for Data Models
28 pages

Exercise 7 - Using Hive To Access Hadoop-Hbase Data

Uploaded by

Exercise 7 - Using Hive To Access Hadoop-Hbase Data

Uploaded by

Unit 7 Storing and querying data

Storing and querying data © Copyright IBM Corporation 2018

Lab 1: Using Hive to access Hadoop/HBase data

© Copyright IBM Corp. 2018 7-68

Task 1. Storing and accessing HBase data.

© Copyright IBM Corp. 2018 7-69

5. To start the HBase CLI shell, type the following command:

[student@ibmclass ~]$ /usr/bin/hbase shell

Group name: ddl

Group name: namespace

Group name: dml

Group name: tools

Group name: replication

Group name: snapshots

Group name: security

Group name: visibility labels

© Copyright IBM Corp. 2018 7-70

command parameters. Type <RETURN> after entering a command to run it.

{'key1' => 'value1', 'key2' => 'value2', ...}

hbase> get 't1', "key\x03\x3f\xcd"

The table t1 has been created.

© Copyright IBM Corp. 2018 7-71

© Copyright IBM Corp. 2018 7-72

Note the following metadata features in your describe response:

© Copyright IBM Corp. 2018 7-73

hbase(main):002:0> alter 't1', {NAME => 'cf1', COMPRESSION => 'GZ'}

© Copyright IBM Corp. 2018 7-74

hbase(main):010:0> put 't1', 'row1', 'cf1:c11', 'r1v11'

0 row(s) in 0.0410 seconds

0 row(s) in 0.0590 seconds

0 row(s) in 0.0890 seconds

0 row(s) in 0.0520 seconds

0 row(s) in 0.0420 seconds

0 row(s) in 0.0550 seconds

© Copyright IBM Corp. 2018 7-75

hbase(main):022:0> scan 't1'

© Copyright IBM Corp. 2018 7-76

Task 2. Storing and accessing Hive data.

Logging initialized using configuration in file:/etc/hive/conf/[Link]

© Copyright IBM Corp. 2018 7-77

You might also like