A Course in In-Memory Data Management: Prof. Hasso Plattner

This chapter discusses tuple reconstruction in row-oriented and column-oriented databases stored in main memory. Tuple reconstruction is needed when a user requests multiple columns of a tuple, such as when viewing or editing a record. In a row-oriented layout, all attributes of a tuple are stored sequentially, allowing the tuple to be reconstructed with few cache accesses. In contrast, a column-oriented layout stores attributes of different tuples together, requiring a cache access for each attribute needed, making tuple reconstruction less efficient than in a row-oriented layout. The performance difference increases as the number of attributes in each tuple grows. Selecting only necessary fields can reduce the disadvantages of column-oriented layouts for tuple reconstruction.

Uploaded by

Alexandru Moldovan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

70 views5 pages

A Course in In-Memory Data Management: Prof. Hasso Plattner

Uploaded by

Alexandru Moldovan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

Prof.

Hasso Plattner

A Course in
In-Memory Data Management
The Inner Mechanics
of In-Memory Databases

September 1, 2013

This learning material is part of the reading material for Prof.

Plattner’s online lecture "In-Memory Data Management" taking place at
www.openHPI.de. If you have any questions or remarks regarding the
online lecture or the reading material, please give us a note at openhpi-
[email protected]. We are glad to further improve the material.
Chapter 13
Tuple Reconstruction

13.1 Introduction

As mentioned earlier, data resembling a table can be stored in linear memory

either column by column (columnar layout) or row by row (row layout) . The
impacts were already discussed in Chapter 8 in more detail. The columnar
layout is optimized for analytical set-based operations that work on many
rows but for a notably smaller subset of all columns of data. The row layout
shows a better performance for select operations on few single tuples. In
this chapter, we discuss the operations needed for tuple reconstruction in
detail and explain the influence of the di↵erent layouts on the performance
of these operations. Tuple reconstruction is a typical functionality in OLTP
applications. It is executed whenever more than one column is requested
from the database, for example when the user in an ERP system calls the
"show" or "edit" transactions for the master data object or for a document.
To explain the influence of the main memory layout organization on the
performance of the tuple reconstruction operation, we have to consider the
notion of the cache access and the size of the cache line. A CPU cache is a
cache used by the central processing unit of a computer to reduce the average
time to access memory. The cache is a smaller, faster memory which stores
copies of the data from the most frequently used main memory locations.
Memory cache is organized in 32 or 64 byte long cache lines. Even when
reading just one byte from the memory, the CPU reads a complete cache
line and places it into the cache. This characteristic of a cache will help us to
estimate the response time for the tuple reconstruction operations for both
layouts.

85
86 13 Tuple Reconstruction

13.2 Tuple Reconstruction in Row-Oriented Databases

First, let us consider an example using the row layout. Let us assume, we
need to reconstruct the tuple knowing the position of the tuple. As a first
example, we take into account the following properties of the tuple:
• the size of one tuple is 200 byte;
• the number of attributes in the tuple is 6.
To estimate the result, we also need the following parameters:
• speed of the read operation from main memory: 2 MB/ms/core;
• we consider 64 byte long cache lines;
• all calculations will be done for one core per CPU. If we consider more
cores, the performance will increase appropriately.
Let us calculate how much time the read operation for the tuple recon-
struction will take in this case considering that the data is organized using
row layout. The operation is executed relatively fast, as all attributes are
stored sequentially. Considering a size of 200 bytes per tuple, we will need
4 cache accesses (d 200
64 e = 4) to read the whole tuple from main memory. The
CPU reads a bit more than the size of a tuple (200 byte) in this case, because it
will read a complete cache line for every cache access (in case of a row layout,
the CPU will load some data of the following tuple to the cache). Thus, we
read 256 byte from main memory. Considering the speed 2 MB/ms/core, we
can calculate the time as described below:
256 byte
Tuple reconstruction response time (row layout) =
2, 000, 000 byte/ms/core
= 0.128 microseconds

13.3 Tuple Reconstruction in Column-Oriented Databases

Now let us estimate the processing time for the same operation and tuples
with the same characteristics but taking into account that the data is orga-
nized in a columnar layout. The data is stored attribute-wise in this case. To
reconstruct the tuple, the CPU cannot just sequentially read data from mem-
ory. It needs to do cache accesses for every attribute of the tuple required
for the tuple reconstruction. Therefore, knowing the implicit recordID of
the tuple to be reconstructed, it will “jump” between the attributes of the
tuple to collect the values. Let us calculate how much time the read opera-
tion for the tuple reconstruction will take in this case. Considering that the
reconstructed tuple has 6 attributes and that for a complete read of every
attribute one cache access is required, we will need 6 cache accesses to read
all attributes of the tuple from main memory. Taking into account a cache
13.4 Further Examples and Discussion 87

line size of 64 byte, the CPU needs to read: 64 byte · 6 = 384 byte from main
memory. The CPU reads more than the size of a tuple (200 byte) in this case,
because it will read a complete cache line for every memory access (using
a columnar layout, the CPU will load some additional attributes’ values of
the following tuples). Considering the speed 2 MB/ms/core, we can calculate
the time as described below:
384 byte
Tuple reconstruction response time(column layout) =
2, 000, 000 byte/ms/core
= 0.192 microseconds

In this simple example, performance of the tuple reconstruction operation

using a columnar layout is 50% worse in comparison with the row layout.
The di↵erence in the response time can be even higher if we consider an
example for a tuple with a larger number of attributes.

13.4 Further Examples and Discussion

In reality, the number of attributes in the tables of business applications is

much larger. As an example, let us calculate the response time for tuple
reconstruction with the following characteristics:
• The size of one tuple is 3,200 byte. For the response time of the column
layout calculation, we also consider that for every attribute of the tuple,
one cache access is enough to read the whole attribute of the tuple.
• The number of attributes in the tuple is 100.
Let us calculate response times for the tuple reconstruction operation for
both layouts considering the same CPU characteristics that were described
in the example above.
Row layout:
50 cache accesses are required for a CPU to read the whole tuple: 50 · 64 byte
= 3,200 byte.

3, 200 byte
Tuple reconstruction response time (row layout) =
2, 000, 000 byte/ms/core
= 1.6 microseconds

Columnar layout:
100 cache accesses are required in case of the columnar layout to read the
attributes of the tuple: 100 · 64 byte = 6,400 byte.
88 13 Tuple Reconstruction

6, 400 byte
Tuple reconstruction response time (column layout) =
2, 000, 000 byte/ms/core
= 3.2 microseconds

This example shows how the number of attributes of the tuple can influence
the response time for both layouts. The performance for tuple reconstruction
of the columnar layout will become progressively worse in comparison to the
row store when we increase the number of a tuple’s attributes and request
all attributes.
We can conclude that it is important to select only the necessary fields of
a tuple. This way, the potential disadvantage of tuple reconstruction using a
columnar layout can be reduced to a minimum.

3 Storage
No ratings yet
3 Storage
34 pages
GCP Reference Architecture Humanitec
No ratings yet
GCP Reference Architecture Humanitec
36 pages
Exercise Problems
No ratings yet
Exercise Problems
14 pages
REA Model
No ratings yet
REA Model
65 pages
Collision Handling Techniques
No ratings yet
Collision Handling Techniques
4 pages
L10-Query Evaluaion
No ratings yet
L10-Query Evaluaion
50 pages
Database Architecture in Airbnb
No ratings yet
Database Architecture in Airbnb
14 pages
Event Driven 6
No ratings yet
Event Driven 6
26 pages
PIVOT Vs UNPIVOT in SQL
No ratings yet
PIVOT Vs UNPIVOT in SQL
13 pages
DMS Design Document
No ratings yet
DMS Design Document
18 pages
OWASP Top 10 Proactive Controls V3
No ratings yet
OWASP Top 10 Proactive Controls V3
40 pages
@ A Case Study On Car Evaluation and Prediction Comparative Analysis Using Data Mining Models
No ratings yet
@ A Case Study On Car Evaluation and Prediction Comparative Analysis Using Data Mining Models
5 pages
Assignment Set 1 Dbms
No ratings yet
Assignment Set 1 Dbms
10 pages
DBMS Module 2.5 Query Processing
No ratings yet
DBMS Module 2.5 Query Processing
19 pages
1 s2.0 S0925527307001892 Main PDF
No ratings yet
1 s2.0 S0925527307001892 Main PDF
17 pages
Homework 1 Solutions
No ratings yet
Homework 1 Solutions
3 pages
DBMS 10 Joins v2
No ratings yet
DBMS 10 Joins v2
38 pages
Ccure 9000 v2 - 10 Ccure ID Guide - rk0 - LT - en (311 338)
No ratings yet
Ccure 9000 v2 - 10 Ccure ID Guide - rk0 - LT - en (311 338)
28 pages
Ch-2: Abstract Data Structures
No ratings yet
Ch-2: Abstract Data Structures
8 pages
1 s2.0 S0925527311003872 Main PDF
No ratings yet
1 s2.0 S0925527311003872 Main PDF
9 pages
New Requirements For Enterprise Computing: 2.1 Processing of Event Data
No ratings yet
New Requirements For Enterprise Computing: 2.1 Processing of Event Data
8 pages
Ais Guidelines
No ratings yet
Ais Guidelines
116 pages
Enterprise Application Characteristics: 3.1 Diverse Applications
No ratings yet
Enterprise Application Characteristics: 3.1 Diverse Applications
4 pages
CS363 Spring 2021 Homework 2
No ratings yet
CS363 Spring 2021 Homework 2
9 pages
AWS Report
No ratings yet
AWS Report
31 pages
Revision 1
No ratings yet
Revision 1
33 pages
A Course in In-Memory Data Management: Prof. Hasso Plattner
No ratings yet
A Course in In-Memory Data Management: Prof. Hasso Plattner
12 pages
Sathiadas 2003
No ratings yet
Sathiadas 2003
10 pages
Esports Year Book PDF
No ratings yet
Esports Year Book PDF
147 pages
Esports Year Book PDF
No ratings yet
Esports Year Book PDF
147 pages
How To Upgrade To SAP BW4HANA and BW 7.5 On SAP HANA - Potential Pitfalls and Tried and True Instructions For Success
No ratings yet
How To Upgrade To SAP BW4HANA and BW 7.5 On SAP HANA - Potential Pitfalls and Tried and True Instructions For Success
61 pages
A Course in In-Memory Data Management: Prof. Hasso Plattner
No ratings yet
A Course in In-Memory Data Management: Prof. Hasso Plattner
9 pages
CAT1 F2 Final Key
No ratings yet
CAT1 F2 Final Key
6 pages
Swapnil Patil-1
No ratings yet
Swapnil Patil-1
3 pages
Insert PDF
No ratings yet
Insert PDF
7 pages
Machine Learning Lab: Delhi Technological University
No ratings yet
Machine Learning Lab: Delhi Technological University
6 pages
A Course in In-Memory Data Management: Prof. Hasso Plattner
No ratings yet
A Course in In-Memory Data Management: Prof. Hasso Plattner
6 pages
A Course in In-Memory Data Management: Prof. Hasso Plattner
No ratings yet
A Course in In-Memory Data Management: Prof. Hasso Plattner
6 pages
ATW115 Slides Chp02
No ratings yet
ATW115 Slides Chp02
52 pages
Chapter 13: Disk Storage, Basic File Structures, and Hashing
No ratings yet
Chapter 13: Disk Storage, Basic File Structures, and Hashing
12 pages
A Course in In-Memory Data Management: Prof. Hasso Plattner
No ratings yet
A Course in In-Memory Data Management: Prof. Hasso Plattner
4 pages
Platform Support Matrix 9.3e Ene20
No ratings yet
Platform Support Matrix 9.3e Ene20
14 pages
For 100% Result Oriented IGNOU Coaching and Project Training Call CPD: 011-65164822, 08860352748
No ratings yet
For 100% Result Oriented IGNOU Coaching and Project Training Call CPD: 011-65164822, 08860352748
9 pages
Changes in Hardware: 4.1 Memory Cells
No ratings yet
Changes in Hardware: 4.1 Memory Cells
11 pages
Tutorial 1 5
No ratings yet
Tutorial 1 5
4 pages
Measuring Marketing Productivity: Current Knowledge and Future Directions
No ratings yet
Measuring Marketing Productivity: Current Knowledge and Future Directions
14 pages
Power BI SQL
No ratings yet
Power BI SQL
8 pages
Final BI Lab Manual
No ratings yet
Final BI Lab Manual
42 pages
Update PDF
No ratings yet
Update PDF
6 pages
CS143-HW5 Disk
No ratings yet
CS143-HW5 Disk
1 page
CPEG655-High-Perf Computing Lab 0: 1. Environment
No ratings yet
CPEG655-High-Perf Computing Lab 0: 1. Environment
3 pages
hw3 Sol
100% (1)
hw3 Sol
6 pages
Features SQL Server Versions
No ratings yet
Features SQL Server Versions
6 pages
1) What Is The Read and Write Service Times TS For A Single Stripe Unit On A Single Disk?
No ratings yet
1) What Is The Read and Write Service Times TS For A Single Stripe Unit On A Single Disk?
5 pages
LSMW For OM Objects
No ratings yet
LSMW For OM Objects
18 pages
Partitioning PDF
No ratings yet
Partitioning PDF
5 pages
SYNOPSIS-Online Prediction Dia
No ratings yet
SYNOPSIS-Online Prediction Dia
3 pages
cs53 Super-Imp-Tie-23
No ratings yet
cs53 Super-Imp-Tie-23
2 pages
Information Management
No ratings yet
Information Management
18 pages
Hashing
No ratings yet
Hashing
13 pages
Computer Science Option A Database
No ratings yet
Computer Science Option A Database
9 pages
Wollega University Department of Electrical and Computer Engineering
100% (1)
Wollega University Department of Electrical and Computer Engineering
1 page
Assign Security in Fusion ERP
No ratings yet
Assign Security in Fusion ERP
18 pages
3 Must-Have Projects For Your Data Science Portfolio - by Aakash N S - Jovian - Jan, 2021 - Medium
No ratings yet
3 Must-Have Projects For Your Data Science Portfolio - by Aakash N S - Jovian - Jan, 2021 - Medium
1 page
NATO's Cyber Strategies and Wireless Warfare in The Information Age
No ratings yet
NATO's Cyber Strategies and Wireless Warfare in The Information Age
7 pages
Numerical Based On Indexing: Problem 1.2
No ratings yet
Numerical Based On Indexing: Problem 1.2
3 pages
Bash Shell from Zero to Hero: An SRE's Practical Guide to Terminal Skills, Scripting, and Automation
From Everand
Bash Shell from Zero to Hero: An SRE's Practical Guide to Terminal Skills, Scripting, and Automation
Nolan Reeves
No ratings yet
World’s First AC-Powered Multi-Parameter Processor: A Journey Beyond Limits
From Everand
World’s First AC-Powered Multi-Parameter Processor: A Journey Beyond Limits
RAJKUMAR OJHA
No ratings yet
How Linux Works, 3rd Edition: What Every Superuser Should Know
From Everand
How Linux Works, 3rd Edition: What Every Superuser Should Know
Brian Ward
4/5 (24)
Kernel Concepts and Architecture: Definitive Reference for Developers and Engineers
From Everand
Kernel Concepts and Architecture: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Efficient Algorithms and Structures with Heaps: Definitive Reference for Developers and Engineers
From Everand
Efficient Algorithms and Structures with Heaps: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Foundations of Data Intensive Applications: Large Scale Data Analytics under the Hood
From Everand
Foundations of Data Intensive Applications: Large Scale Data Analytics under the Hood
Supun Kamburugamuve
No ratings yet
Mastering the Craft of C Programming: Unraveling the Secrets of Expert-Level Programming
From Everand
Mastering the Craft of C Programming: Unraveling the Secrets of Expert-Level Programming
Steve Jones
No ratings yet
PostgreSQL Replication - Second Edition
From Everand
PostgreSQL Replication - Second Edition
Hans-Jurgen Schonig
No ratings yet
The Tech Interview Playbook: From DSA to System Design
From Everand
The Tech Interview Playbook: From DSA to System Design
Chinmoy Mukherjee
No ratings yet
Building Serverless Apps with Azure Functions and Cosmos DB: Leverage Azure functions and Cosmos DB for building serverless applications (English Edition)
From Everand
Building Serverless Apps with Azure Functions and Cosmos DB: Leverage Azure functions and Cosmos DB for building serverless applications (English Edition)
Hansamali Gamage
No ratings yet
Mastering MariaDB
From Everand
Mastering MariaDB
Razzoli Federico
No ratings yet
Ruby Gems Mastery: 100 Essential Packages for 2024
From Everand
Ruby Gems Mastery: 100 Essential Packages for 2024
Kanto
No ratings yet
Distributed Caching & Data Management: Mastering Redis, Memcached, And Apache Ignite Caching
From Everand
Distributed Caching & Data Management: Mastering Redis, Memcached, And Apache Ignite Caching
Rob Botwright
No ratings yet
Machine Learning in the AWS Cloud: Add Intelligence to Applications with Amazon SageMaker and Amazon Rekognition
From Everand
Machine Learning in the AWS Cloud: Add Intelligence to Applications with Amazon SageMaker and Amazon Rekognition
Abhishek Mishra
No ratings yet
LPIC-3 Exam 306-300 Mastery: 500 Practice Questions on High Availability & Storage Clusters
From Everand
LPIC-3 Exam 306-300 Mastery: 500 Practice Questions on High Availability & Storage Clusters
Steve Brown
No ratings yet
PostgreSQL 9.0 High Performance
From Everand
PostgreSQL 9.0 High Performance
Gregory Smith
4/5 (1)
A SECURE DATA AGGREGATION TECHNIQUE IN WIRELESS SENSOR NETWORK
From Everand
A SECURE DATA AGGREGATION TECHNIQUE IN WIRELESS SENSOR NETWORK
Dr Chaitra HV
No ratings yet
Oracle 11g Streams Implementer's Guide
From Everand
Oracle 11g Streams Implementer's Guide
Ann L. R. McKinnell
No ratings yet
Mastering Data Structures and Algorithms in Python & Java
From Everand
Mastering Data Structures and Algorithms in Python & Java
Sachin Naha
No ratings yet
MICROSOFT AZURE ADMINISTRATOR EXAM PREP(AZ-104) Part-3: AZ 104 EXAM STUDY GUIDE
From Everand
MICROSOFT AZURE ADMINISTRATOR EXAM PREP(AZ-104) Part-3: AZ 104 EXAM STUDY GUIDE
Devi Prasad
No ratings yet
Oracle Recovery Appliance Handbook: An Insider’S Insight
From Everand
Oracle Recovery Appliance Handbook: An Insider’S Insight
Ramesh Raghav
No ratings yet
Xbox Architecture: Architecture of Consoles: A Practical Analysis, #13
From Everand
Xbox Architecture: Architecture of Consoles: A Practical Analysis, #13
Rodrigo Copetti
No ratings yet
Basic Information About C language PDF
From Everand
Basic Information About C language PDF
Suraj Das
No ratings yet
C# Package Mastery: 100 Essentials in 1 Hour - 2024 Edition
From Everand
C# Package Mastery: 100 Essentials in 1 Hour - 2024 Edition
Tenko
No ratings yet
PSP Architecture: Architecture of Consoles: A Practical Analysis, #18
From Everand
PSP Architecture: Architecture of Consoles: A Practical Analysis, #18
Rodrigo Copetti
No ratings yet
SNES Architecture: Architecture of Consoles: A Practical Analysis, #4
From Everand
SNES Architecture: Architecture of Consoles: A Practical Analysis, #4
Rodrigo Copetti
No ratings yet
C & C++ Interview Questions You'll Most Likely Be Asked
From Everand
C & C++ Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet
GameCube Architecture: Architecture of Consoles: A Practical Analysis, #10
From Everand
GameCube Architecture: Architecture of Consoles: A Practical Analysis, #10
Rodrigo Copetti
No ratings yet
Administering Microsoft Azure SQL Solutions DP 300
From Everand
Administering Microsoft Azure SQL Solutions DP 300
Manish Soni
No ratings yet
AWS Certified Solutions Architect - Professional
From Everand
AWS Certified Solutions Architect - Professional
VB Dev
No ratings yet
Build Your Own Distributed Compilation Cluster - A Practical Walkthrough
From Everand
Build Your Own Distributed Compilation Cluster - A Practical Walkthrough
Hunter Davis
No ratings yet
Nintendo 64 Architecture: Architecture of Consoles: A Practical Analysis, #8
From Everand
Nintendo 64 Architecture: Architecture of Consoles: A Practical Analysis, #8
Rodrigo Copetti
No ratings yet
Computer Science II Essentials
From Everand
Computer Science II Essentials
Randall Raus
No ratings yet
PlayStation 2 Architecture: Architecture of Consoles: A Practical Analysis, #12
From Everand
PlayStation 2 Architecture: Architecture of Consoles: A Practical Analysis, #12
Rodrigo Copetti
No ratings yet
DRBD-Cookbook: How to create your own cluster solution, without SAN or NAS!
From Everand
DRBD-Cookbook: How to create your own cluster solution, without SAN or NAS!
Joerg Christian Seubert
No ratings yet
Practical Reverse Engineering: x86, x64, ARM, Windows Kernel, Reversing Tools, and Obfuscation
From Everand
Practical Reverse Engineering: x86, x64, ARM, Windows Kernel, Reversing Tools, and Obfuscation
Bruce Dang
No ratings yet
Preliminary Specifications: Programmed Data Processor Model Three (PDP-3) October, 1960
From Everand
Preliminary Specifications: Programmed Data Processor Model Three (PDP-3) October, 1960
Digital Equipment Corporation
No ratings yet
Operating Systems Interview Questions You'll Most Likely Be Asked
From Everand
Operating Systems Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet
Comptia Server+ Primer
From Everand
Comptia Server+ Primer
John Greene
5/5 (1)
SAS Programming Guidelines Interview Questions You'll Most Likely Be Asked
From Everand
SAS Programming Guidelines Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet

A Course in In-Memory Data Management: Prof. Hasso Plattner

Uploaded by

A Course in In-Memory Data Management: Prof. Hasso Plattner

Uploaded by

Prof.

This learning material is part of the reading material for Prof.

As mentioned earlier, data resembling a table can be stored in linear memory

13.2 Tuple Reconstruction in Row-Oriented Databases

13.3 Tuple Reconstruction in Column-Oriented Databases

In this simple example, performance of the tuple reconstruction operation

13.4 Further Examples and Discussion

In reality, the number of attributes in the tables of business applications is

You might also like