Cloud Computing
Storage Systems
Eva Kalyvianaki
[email protected]
Contents
The Google File System, SOSP 2003
Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung
https://static.googleusercontent.com/media/research.google.com/en//archive/gfs-sosp2003.pdf
Bigtable: A Distributed Storage System for Structured Data, OSDI 2006
Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Mike Burrows, Tushar Chandra, Andrew Fikes, Robert E. Gruber
https://storage.googleapis.com/pub-tools-public-publication-data/pdf/68a74a85e1662fe02ff3967497f31fda7f32225c.pdf
Requirements of cloud applications
Most cloud applications are data-intensive and test
the limitations of the existing infrastructure.
Requirements:
Rapid application development and short time to market
Low latency
Scalability
High availability
Consistent view of the data
These requirements cannot be satisfied
simultaneously by existing database models; e.g.,
relational databases are easy to use for application
development but do not scale well
Google File System (GFS)
Motivation
GFS was developed in the late 1990s; it uses thousands of storage systems built from inexpensive commodity components to provide petabytes of storage to a large user community with diverse needs.
1. Component failures are the norm
Application/OS bugs, human errors, failures of disks, power supplies, …
2. Files are huge (multi-GB to multi-TB files)
3. The most common operation is to append to an
existing file; random write operations to a file are
extremely infrequent. Sequential read operations
are the norm
4. The consistency model should be relaxed to simplify the system implementation, but without placing an additional burden on the application developers
GFS Assumptions
The system is built from inexpensive commodity
components that often fail.
The system stores a modest number of large files.
The workload consists mostly of two kinds of reads:
large streaming reads and small random reads.
The workloads also have many large sequential writes
that append data to files.
The system must implement well-defined semantics for
many clients simultaneously appending to the same
file.
High sustained bandwidth is more important than low latency.
GFS API
It provides a familiar interface, though not POSIX.
Supports: create, delete, open, close, read and write
Plus: snapshot and record append
snapshot
creates a copy of a file or a directory tree at low cost
record append
allows multiple clients to append data to the
same file concurrently while guaranteeing atomicity.
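To make the interface concrete, here is a toy, in-memory illustration of these calls; the class and method names are hypothetical sketches, since the real GFS client library is Google-internal and unpublished:

  # Toy in-memory model of the GFS file interface (names are illustrative).
  class ToyGFS:
      def __init__(self):
          self.files = {}                          # path -> bytearray

      def create(self, path):
          self.files[path] = bytearray()

      def read(self, path, offset, length):
          return bytes(self.files[path][offset:offset + length])

      def write(self, path, offset, data):         # random write at a caller-chosen offset
          self.files[path][offset:offset + len(data)] = data

      def record_append(self, path, data):         # GFS picks the offset and returns it
          f = self.files[path]
          offset = len(f)
          f.extend(data)
          return offset

      def snapshot(self, src, dst):                # cheap copy (copy-on-write in real GFS)
          self.files[dst] = bytearray(self.files[src])

  fs = ToyGFS()
  fs.create("/logs/app")
  off = fs.record_append("/logs/app", b"event-1\n")  # offset chosen by the system, not the client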
The Architecture of a GFS Cluster
(figure: GFS cluster architecture – clients, a single master, and multiple chunkservers on Linux machines)
The Architecture of a GFS Cluster
Single master, multiple chunkservers and clients, running
on Linux machines.
Files are divided into fixed-size chunks; each chunk is identified by a unique, immutable 64-bit chunk handle.
Chunks are stored on local disks on chunkservers, with three replicas by default.
The master maintains all file system metadata: access control information, the mapping from files to chunks, chunk locations, etc.
GFS client code implements the file system API and communicates with the master and chunkservers to read/write data on behalf of applications.
No caching by the client or the chunkservers.
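A sketch of the read path this division of labour implies (the classes and RPC names below are simplified stand-ins, not the real protocol): the client computes the chunk index locally, asks the master for metadata once, then moves data directly to/from a chunkserver.

  CHUNK_SIZE = 64 * 1024 * 1024                    # 64 MB chunks (see below)

  class StubMaster:                                # serves metadata only, no file data
      def __init__(self, table):
          self.table = table                       # (path, chunk index) -> (handle, replicas)
      def find_chunk(self, path, chunk_index):
          return self.table[(path, chunk_index)]

  class StubChunkserver:                           # holds chunk data on "local disk"
      def __init__(self, chunks):
          self.chunks = chunks                     # chunk handle -> bytes
      def read_chunk(self, handle, offset, length):
          return self.chunks[handle][offset:offset + length]

  def gfs_read(master, path, offset, length):
      chunk_index = offset // CHUNK_SIZE           # computed locally by the client
      handle, replicas = master.find_chunk(path, chunk_index)  # one cacheable metadata RPC
      server = replicas[0]                         # a real client picks the closest replica
      return server.read_chunk(handle, offset % CHUNK_SIZE, length)

  cs = StubChunkserver({"h1": b"hello, gfs"})
  master = StubMaster({("/f", 0): ("h1", [cs])})
  assert gfs_read(master, "/f", 7, 3) == b"gfs"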
GFS – Design Decisions
Segment files into large chunks
Implement an atomic file append operation allowing
multiple applications operating concurrently to append to
the same file
Build the cluster around a high-bandwidth rather than
low-latency interconnection network. Separate the flow of
control from the data flow. Exploit network topology by
sending data to the closest node in the network.
Eliminate caching at the client side. Caching increases the overhead of maintaining consistency among cached copies
Ensure consistency by channeling critical file operations
through a master, a component of the cluster which
controls the entire system
Minimise the involvement of the master in file access operations to avoid hot-spot contention and to ensure scalability
GFS Chunks
GFS files are collections of fixed-size segments called
chunks
The chunk size is 64 MB; this choice is motivated by
the desire to optimise the performance for large files
and to reduce the amount of metadata maintained by
the system
A large chunk size increases the likelihood that multiple operations will be directed to the same chunk; thus, it reduces the number of requests to locate the chunk, and it allows the application to maintain a persistent TCP connection with the server where the chunk is located
Large chunk size reduces the size of metadata stored
on the master
A chunk consists of 64 KB blocks.
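A back-of-the-envelope illustration of the metadata saving from large chunks (the file size is illustrative):

  CHUNK = 64 * 2**20                 # 64 MB chunk
  file_size = 1 * 2**40              # a 1 TB file
  print(file_size // CHUNK)          # 16384 chunk records on the master
  print(file_size // (64 * 2**10))   # 16777216 records if the unit were a 64 KB block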
Consistency Model
Mutations are writes or record appends
Each mutation is performed at all of a chunk's replicas.
Use of leases for consistent mutation order:
Master grants a chunk lease to one of the replicas, the primary
The primary picks a serial order of all mutations to the chunk
All replicas follow this order when applying mutations
Global mutation order is defined by:
1. The lease grant order chosen by the master, and
2. Within a lease by the serial numbers assigned by the primary.
Leases initially last 60 seconds
If the master loses contact with the primary, it grants a new lease to another replica after the old lease expires.
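A minimal sketch of this ordering rule (simplified: real replicas apply mutations to chunk data, not to a Python list):

  class Replica:
      def __init__(self):
          self.applied = []                        # mutation log, in applied order
      def apply(self, serial, mutation):
          self.applied.append((serial, mutation))

  class Primary(Replica):                          # the replica holding the lease
      def __init__(self, secondaries):
          super().__init__()
          self.secondaries = secondaries
          self.next_serial = 0
      def mutate(self, mutation):
          serial = self.next_serial                # primary picks the serial order
          self.next_serial += 1
          self.apply(serial, mutation)
          for s in self.secondaries:               # all replicas follow the same order
              s.apply(serial, mutation)
          return serial

  secondaries = [Replica(), Replica()]
  primary = Primary(secondaries)
  primary.mutate(b"write-A")
  primary.mutate(b"append-B")
  assert secondaries[0].applied == secondaries[1].applied == primary.applied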
Write Control and Data Flow
(figure: control flows from the client to the primary and on to the secondaries, while data is pushed linearly along a chain of chunkservers)
Atomic Record Appends
Client specifies only the data
GFS appends it to the file at an offset of GFS's choosing and returns that offset to the client
The primary checks whether appending would cause the chunk to exceed the maximum size; if so, it:
1. Pads the chunk to the maximum size, and
2. Tells the client to retry on the next chunk (see the sketch below)
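A sketch of the primary's decision (simplified; the real operation also replicates the record and handles failures):

  CHUNK_SIZE = 64 * 2**20

  def record_append(chunk_used, data):
      """Return (action, offset); chunk_used = bytes already in the chunk."""
      if chunk_used + len(data) > CHUNK_SIZE:
          return ("pad_and_retry", None)           # pad chunk, client retries on next chunk
      return ("appended", chunk_used)              # offset chosen by GFS, returned to client

  assert record_append(0, b"x" * 100) == ("appended", 0)
  assert record_append(CHUNK_SIZE - 10, b"x" * 100) == ("pad_and_retry", None)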
Master Operation
Namespace and Locking
Each master operation acquires a set of locks before it
runs
Allows concurrent mutations in the same directory
Locks are acquired in a consistent total order to prevent deadlocks (see the sketch below)
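A sketch of deadlock-free acquisition over pathname prefixes (simplified: the real scheme takes read locks on ancestor directories and a read or write lock on the leaf; here every lock is exclusive):

  import threading

  locks = {}                                       # pathname -> lock, created on demand

  def acquire_path_locks(path):
      """Lock every prefix of `path` in one consistent global order, so two
      operations can never grab the same pair of locks in opposite orders."""
      parts = path.strip("/").split("/")
      prefixes = ["/" + "/".join(parts[:i + 1]) for i in range(len(parts))]
      acquired = []
      for p in sorted(prefixes):                   # consistent total order -> no deadlock
          lock = locks.setdefault(p, threading.Lock())
          lock.acquire()
          acquired.append(lock)
      return acquired

  held = acquire_path_locks("/home/user/file")
  for lock in reversed(held):
      lock.release()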
Replica Management
Chunk replicas are spread across racks
Traffic for a chunk exploits the aggregate bandwidth of multiple racks.
New chunks are placed on servers with low disk-space utilisation, with few “recent” creations, and across racks (see the sketch below)
Re-replication starts once the number of available replicas falls below a user-specified goal
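A sketch of a placement heuristic matching these criteria (the scoring is illustrative; the paper states the criteria, not a formula):

  # Pick servers for a new chunk's replicas: prefer low disk utilisation and
  # few recent creations, and spread the chosen replicas across racks.
  def place_replicas(servers, n=3):
      ranked = sorted(servers, key=lambda s: (s["disk_used"], s["recent_creates"]))
      chosen, racks = [], set()
      for s in ranked:
          if s["rack"] not in racks:               # at most one replica per rack here
              chosen.append(s)
              racks.add(s["rack"])
          if len(chosen) == n:
              break
      return chosen

  servers = [
      {"name": "cs1", "rack": "r1", "disk_used": 0.20, "recent_creates": 1},
      {"name": "cs2", "rack": "r1", "disk_used": 0.10, "recent_creates": 0},
      {"name": "cs3", "rack": "r2", "disk_used": 0.50, "recent_creates": 2},
      {"name": "cs4", "rack": "r3", "disk_used": 0.30, "recent_creates": 5},
  ]
  print([s["name"] for s in place_replicas(servers)])   # ['cs2', 'cs4', 'cs3']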
Conclusions
Component failures are the norm
System optimised for huge files that are mostly
appended and then read
Fault-tolerance is achieved by constant monitoring, replication of crucial data, automatic recovery, chunk replication, and checksumming to detect data corruption
High aggregate throughput is achieved by separating file system control from data transfer. Master involvement in common operations is minimised by a large chunk size and by chunk leases → a centralised master is not a bottleneck
Bigtable: A Distributed Storage System for Structured Data
Bigtable: a distributed storage system for structured data, designed to scale to petabytes of data and thousands of machines.
Used by many Google products:
Google Earth, Google Analytics, web indexing, …
Handles diverse workloads:
Throughput-oriented batch processing
Latency-sensitive serving of data to end users
Clients can control locality and whether to serve their data from memory or disk
Data Model
“A Bigtable is a sparse, distributed, persistent multi-dimensional sorted map.”
(row:string, column:string, time:int64) → string
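A toy version of this map (simplified: a plain dict ignores the sorted-by-row-key property, and real Bigtable also garbage-collects old cell versions; the example row and column follow the paper's web-table example):

  table = {}                                       # row -> {column -> {timestamp -> value}}

  def put(row, column, ts, value):
      table.setdefault(row, {}).setdefault(column, {})[ts] = value

  def get(row, column):
      cells = table[row][column]
      return cells[max(cells)]                     # newest timestamp wins

  put("com.cnn.www", "contents:", 3, "<html>... (version t3)")
  put("com.cnn.www", "anchor:cnnsi.com", 9, "CNN")
  print(get("com.cnn.www", "contents:"))           # prints the t3 contents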
Tablets
Data is maintained in lexicographic order by row key.
The row range of a table can be dynamically
partitioned.
Each range is called a tablet, the unit of distribution.
Nearby rows will be served by the same server
Good locality properties follow from properly selecting the row keys (see the sketch below)
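A sketch of how a row key maps to a tablet (and hence a server) via the sorted row ranges; the servers and split points here are made up:

  import bisect

  # Tablets as contiguous row ranges: (end row key, server), sorted by end key.
  tablets = [("g", "server-1"), ("p", "server-2"), ("~", "server-3")]
  end_keys = [end for end, _ in tablets]

  def server_for(row_key):
      return tablets[bisect.bisect_left(end_keys, row_key)][1]

  # Nearby rows land in the same tablet, hence on the same server:
  assert server_for("com.cnn.www/a") == server_for("com.cnn.www/b") == "server-1"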
Building Blocks
GFS stores logs and data files
Bigtable clusters run on a shared pool of machines (co-location).
It depends on a cluster management system for
scheduling jobs
The Google SSTable file format is used to store Bigtable
data
SSTable: a persistent, ordered immutable map from keys to
values
It contains a sequence of 64 KB blocks of data
A block index is used to locate blocks; a lookup needs a single disk seek: find the block via the in-memory index (loaded when the SSTable is opened), then read the block from disk (see the sketch below).
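A toy SSTable lookup matching this description (block contents and keys are made up):

  import bisect

  # Sorted blocks on "disk"; a small block index kept in memory.
  blocks = [                                       # each block: (first key, {key: value})
      ("apple", {"apple": 1, "cherry": 2}),
      ("mango", {"mango": 3, "peach": 4}),
  ]
  index = [first for first, _ in blocks]           # loaded when the SSTable is opened

  def lookup(key):
      i = bisect.bisect_right(index, key) - 1      # in-memory search for the block
      return blocks[i][1].get(key)                 # the single "disk" read

  assert lookup("peach") == 4
  assert lookup("banana") is None                  # falls in block 0, key absent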
Bigtable uses the Chubby persistent distributed lock service to:
Ensure that there is at most one active master at any time,
Store the bootstrap location of Bigtable data, and
Discover tablet servers and finalise tablet server deaths
Implementation
Three major components:
1. A library linked into every client
2. One master server
3. Multiple tablet servers
Master server: assigns tablets to tablet servers, adds and monitors tablet servers, balances tablet-server load, …
Each tablet server: manages a set of tablets, handles reads/writes to its tablets, and splits tablets that grow too large.
Clients communicate directly with tablet servers for reads/writes. Bigtable clients do not rely on the master for tablet location → lightly loaded master
A Bigtable cluster stores a number of tables; a table consists of a set of tablets; each tablet holds the data of a row range
At first a table has one tablet; as it grows, it splits into more
Tablet Location
(figure: three-level location hierarchy – a file in Chubby points to the root tablet, the root tablet points to METADATA tablets, and those point to the user tablets)
The three-level scheme addresses 2^34 tablets
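The arithmetic behind that number, using the paper's figures (~1 KB per METADATA row, METADATA tablets capped at 128 MB):

  rows_per_metadata_tablet = (128 * 2**20) // (1 * 2**10)   # 2**17 tablet pointers each
  addressable = rows_per_metadata_tablet ** 2               # root level x METADATA level
  assert addressable == 2**34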
Tablet Assignment
Each tablet is assigned to one tablet server at a time.
Master keeps track of live tablet servers, current
assignments, and unassigned tablets
When a master starts up, it:
Acquires the unique master lock in Chubby
Scans the live tablet servers
Gets the list of tablets from each tablet server, to find out which tablets are already assigned
Learns the set of existing tablets → adds unassigned tablets to the unassigned list (see the sketch below)
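A sketch of that startup scan with stubbed-out Chubby and tablet servers (all names are simplified stand-ins):

  class StubChubby:
      def acquire(self, lock_name):                # would block if another master held it
          pass

  class StubTabletServer:
      def __init__(self, tablets):
          self.tablets = tablets
      def list_tablets(self):
          return self.tablets

  def master_startup(chubby, servers, all_tablets):
      chubby.acquire("master-lock")                # ensures a single active master
      assigned = set()
      for ts in servers:                           # scan the live tablet servers
          assigned |= set(ts.list_tablets())
      return set(all_tablets) - assigned           # tablets that still need a server

  todo = master_startup(StubChubby(),
                        [StubTabletServer({"t1"}), StubTabletServer({"t2"})],
                        {"t1", "t2", "t3"})
  assert todo == {"t3"}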
Tablet Serving
(figure: tablet representation – writes go to a commit log in GFS and to an in-memory memtable; reads merge the memtable with the tablet's SSTables)