DBMS File
1.1 Overview
1.2 Introduction
1.3 DBMS vs. File Systems
1.4 DBMS Architecture
1.5 Database Users & Database schema
1.6 Database Language
1.7 Data Independence
1.8 Assignment
1.9 Test/Quiz
SECTION 2: KEYS
SECTION 5: NORMALIZATION:
5.1 What is Normalization
5.2 Functional Dependency
5.3 Inference Rule
5.4 Normal forms
5.5 Relational Decomposition
5.6 Multi valued Dependency
5.7 Join Dependency
5.8 Assignment
5.9 Test/Quiz
SECTION 8: RAID:
8.1 levels
8.2 Assignment
8.3 Test/Quiz
A database-management system (DBMS) is a collection of interrelated data and a set of programs to access
those data.
The ultimate purpose of a Database Management system is to store and transform data into
information to support making decisions.
DBMS provides the following functions:
Concurrency: multiple users can access data in the same database at the same time.
Security: security rules to determine access rights of users
Backup and recovery: processes to back-up the data regularly and recover data if a problem
occurs
Integrity: database structure and rules improve the integrity of the data
Data descriptions: a data dictionary provides a description of the data
Popular DBMS software:
MySQL
Microsoft Access
Oracle
PostgreSQL
dBASE
FoxPro
SQLite
IBM DB2
LibreOffice Base
MariaDB
Microsoft SQL Server etc.
History of DBMS
1960 – Charles Bachman designed the first DBMS.
1970 – Edgar F. Codd introduced the relational model of data (IBM's Information Management System, IMS, a hierarchical DBMS, had appeared a few years earlier).
1976 – Peter Chen coined and defined the Entity-Relationship model, also known as the ER model.
1980 – The relational model becomes a widely accepted database component.
1985 – Object-oriented DBMSs develop.
1990 – Incorporation of object orientation in relational DBMSs.
1991 – Microsoft ships MS Access, a personal DBMS that displaces all other personal DBMS products.
1995 – First Internet database applications.
1997 – XML applied to database processing; many vendors begin to integrate XML into DBMS products.
Types of DBMS
There are 4 major types of DBMS.
Hierarchical –
In the hierarchical database management system (hierarchical DBMS) model, data is stored in
parent-child relationship nodes. In a hierarchical database, besides actual data, records also
contain information about their groups of parent/child relationships. Data is stored as a
collection of fields in which each field contains only one value; every individual record has only
one parent, and a parent can have one or more children. To retrieve a field's data, we need to
traverse each tree until the record is found.
The hierarchical database system structure was developed by IBM in the early 1960s.
Example: The IBM Information Management System (IMS) and Windows Registry are two popular
examples of hierarchical databases.
Network DBMS –
The network database structure was invented by Charles Bachman. Network database management
systems (network DBMSs) use a network structure to create relationships between entities, and they
support many-to-many relations.
The network database is similar to the hierarchical database, but it is more flexible, since a child record can have more than one parent.
Example: Integrated Data Store (IDS), IDMS (Integrated Database Management System), Raima
Database Manager
Relational DBMS –
In relational databases, the relationship between data files is relational. Data is stored in tabular
form, in columns and rows. Each column of a table represents an attribute and each row in a table
represents a record. Each field in a table represents a data value. The relational model depicts the
relation between two or more tables. Structured Query Language (SQL) is the language used to query
an RDBMS, including inserting, updating, deleting, and searching records. RDBMSs are the most
popular type of database in the world.
Example: Oracle, SQL Server, MySQL, SQLite, and IBM DB2.
Object-Oriented DBMS –
Object-oriented databases were created in the early 1980s. An object-oriented database combines
database functionality with object-oriented programming concepts, extending languages such as C++ and Java. It
adds database functionality to object programming languages, so object developers can write
complete database applications with a modest amount of additional effort.
Example: Some Object-Oriented Databases were designed to work with OOP languages such as
Delphi, Ruby, C++, Java, and Python.
Application of DBMS –
Banking:- For customer information, payments, deposits, loans etc….
Airlines:- For reservations and schedule information.
Universities:- For students and faculty information, course registrations, colleges and grades.
Telecommunication:- It helps to keep call records, generate monthly bills, maintain balances, etc.
Finance:- For storing information about stock, sales, and purchases of financial instruments
like stocks and bonds.
Sales:- Use for storing customer, product and sales information.
Manufacturing:- It is used for supply chain management and for tracking the production of
items and inventory status in warehouses.
Characteristics of DBMS –
Advantages of DBMS –
Disadvantages of DBMS –
The centralization of resources increases the vulnerability of the system.
The cost of the hardware and software of a DBMS is quite high.
The use of the same program at the same time by many users sometimes leads to the loss of data.
Any accidental failure of a component may cause loss of valuable data.
Because a DBMS provides many functionalities, it is a large piece of software that requires a lot of space and
memory to run efficiently.
A DBMS requires regular updates; it should be kept up to date with the current
scenario.
A DBMS can perform poorly for small-scale firms, as its speed may be slow.
DBA (Database Administrator)
File System vs. DBMS:
1. File system: software that manages the data files in a computer system. DBMS: software to create and manage the database.
2. File system: helps to store a collection of raw data files on the hard disk. DBMS: helps to easily store, retrieve, and manipulate data in a database.
8. File system: backup and recovery are not possible. DBMS: has a sophisticated backup and recovery system.
9. File system: handles data on a small scale. DBMS: handles data on a large scale.
10. File system: tasks such as storing, retrieving, and searching are done manually, so it is difficult to manage data. DBMS: operations such as updating and searching are easier because it supports SQL queries.
1.4 DBMS Architecture
1. One-tier architecture –
In one-tier architecture, the database is directly available to the user, who works directly on the DBMS.
[Diagram: USER → DATABASE]
2. Two-tier architecture –
In two-tier architecture, the Database system is present at the server machine and the DBMS
application is present at the client machine, these two machines are connected with each other
through a reliable network as shown in the above diagram.
[Diagram: USER → Application (client machine) → Database System (server machine)]
3. Three-tier architecture –
3-tier schema is an extension of the 2-tier architecture. 3-tier architecture has the following layers-
This DBMS architecture contains an Application layer between the user and the DBMS, which is
responsible for communicating the user’s request to the DBMS system and send the response from
the DBMS to the user.
The application layer (business logic layer) also processes functional logic, constraint, and rules
before passing data to the user or down to the DBMS
Three-tier architecture is the most popular DBMS architecture.
[Diagram: USER → Client (Application Client) → Application Server → Database Server]
1.5 Database users & Schemas
Database Users:-
Application Programmers – They are the developers who interact with the database by means of
DML queries. These DML queries are written in application programs like C, C++, JAVA, Pascal, etc.
These programs are written to meet the users' requirements. Retrieving
information, creating new information and changing existing information is done by these application
programs.
End Users – End users are those who access the database from the terminal end. They use the
developed applications and they don’t have any knowledge about the design and working of the
database. These are the second class of users, and their main goal is just to get their task done.
The main types of end-users are discussed below.
Naïve Users – Any user who does not have any knowledge about the database can be in this
category. Their task is to just use the developed application and get the desired results. For example,
Clerical staff in any bank is a naïve user. They don’t have any DBMS knowledge but they still use the
database and perform their given task.
Stand-alone Users – These are those users whose job is basically to maintain personal databases
by using a ready-made program package that provides easy to use menu-based or graphics-based
interfaces. An example is the user of a tax package that stores a variety of personal
financial data for tax purposes. These users become very proficient in using a specific software
package.
Specialized Users – These are sophisticated users writing special database application programs.
These may be CADD systems, knowledge-based and expert systems, complex data systems
(audio/video), etc.
Sophisticated Users – These users basically include engineers, scientists, business analytics and
others who thoroughly familiarize themselves with the facilities of the DBMS in order to implement
their application to meet their complex requirements.
Database Schema:-
The data which is stored in the database at a particular moment of time is called an instance of the
database. Database systems comprise complex data structures. Thus, to make the system
efficient for retrieval of data and reduce the complexity of the users, developers use the method of
Data Abstraction. A database schema defines its entities and the relationship among them.
There are mainly three levels of data abstraction:
1. Internal Level: The internal schema defines the physical storage structure of the database. It
defines how the data will be stored in secondary storage. It is also called the Physical
Database Schema. It never deals with physical devices. Instead, internal schema views a
physical device as a collection of physical pages.
2. Conceptual or Logical Level: The conceptual schema describes the structure of
the whole database for the community of users. This logical level comes between the user
level and the physical storage view. However, there is only a single conceptual view of a single
database.
3. External or View level: An external schema describes the part of the database which a
specific user is interested in. An external view is just the content of the database as it is seen
by a particular user. For example, a user from the sales department will see only
sales-related data.
Database Instance
A database schema is the skeleton of the database. It is designed when the database doesn't yet exist. Once
the database is operational, it is very difficult to make any changes to it. A database schema does not contain
any data or information.
A database instance is a state of an operational database, with data, at a given point in time. A DBMS ensures that
every instance (state) is a valid state, by diligently following all the validations, constraints, and conditions
that the database designers have imposed.
1.6 Database Languages
DDL (Data Definition Language) – DDL statements are used to define or modify the database structure or schema.
Tasks performed by DDL:-
CREATE – It is used to create the database or its objects (like tables, functions, views)
ALTER – It is used to alter the structure of the database.
DROP – It is used to delete objects from the database.
RENAME – It is used to rename an object existing in the database.
TRUNCATE – It is used to remove all records from a table; the space allocated for
the records is also released.
COMMENT – It is used to add comments to the data dictionary.
DQL (Data Query Language) – DQL statements (SELECT) are used for performing queries on the data within database objects.
The purpose of a DQL command is to fetch relations from the database based on the query passed to it.
DML (Data Manipulation Language) – DML is an abbreviation of Data Manipulation Language. It is used to retrieve,
modify, add, and delete data in the database.
TCL (Transaction Control Language) – TCL commands deal with the transaction within the
database.
Task Perform by TCL:-
COMMIT– Commits a Transaction.
ROLLBACK– Rollbacks a transaction in case any error occurs.
SAVEPOINT– Sets a savepoint within a transaction.
SET TRANSACTION– Specify characteristics for the transaction.
DCL (Data Control Language) – DCL includes commands such as GRANT and REVOKE which
mainly deals with the rights, permissions and other controls of the database system.
Tasks performed by DCL:-
GRANT – gives users access privileges to the database.
REVOKE – withdraws the access privileges given with the GRANT command.
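A minimal sketch showing one statement from each category (table and column names are illustrative; exact syntax varies slightly across DBMS products):
-- DDL: define the schema
CREATE TABLE Country (Country_id CHAR(3) PRIMARY KEY, Country_name VARCHAR(50));
-- DML: change the data
INSERT INTO Country VALUES ('C01', 'India');
UPDATE Country SET Country_name = 'Bharat' WHERE Country_id = 'C01';
-- DQL: query the data
SELECT Country_name FROM Country WHERE Country_id = 'C01';
-- TCL: end the transaction
COMMIT;
-- DCL: manage privileges (some_user is a hypothetical account)
GRANT SELECT ON Country TO some_user;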
1.7 Data Independence
Data independence means that upper levels are unaffected by changes to lower levels. It helps you to improve the
quality of the data.
Conventional data processing does not provide data independence in application programs: any
change in the information, layouts, or arrangements requires a change in the application
programs as well.
Physical Level:
Physical data independence is the ability to change the internal (physical) schema without changing the conceptual schema.
Due to physical independence, changes at the physical level, such as using a new storage device, changing the file organization, or adding new indexes, will not affect the conceptual layer.
Logical/Conceptual Level:
Logical Data Independence is the ability to change the conceptual scheme without changing
1. External views
2. External API or programs
Logical data independence refers to the characteristic of being able to change the conceptual
schema without having to change the external schema.
Logical data independence is used to separate the external level from the conceptual view.
If we do any changes in the conceptual view of the data, then the user view of the data would
not be affected.
Logical data independence occurs at the user interface level.
Due to logical independence, changes at the conceptual level, such as adding, modifying, or deleting an attribute or relationship, will not affect the external layer.
Section 2: Keys:
2.1 Super Key
The set of attributes that can uniquely identify a tuple is known as Super Key.
Adding zero or more attributes to the candidate key generates the super key.
A candidate key is a super key but vice versa is not true.
For example, in a STUDENT relation with attributes (st_Id, st_Number, st_Name), possible super keys are:
{st_Id}
{st_Number}
{st_Id, st_Number}
{st_Id, st_Name}
{st_Number, st_Name}
2.2 Primary key:
A primary key, also called a primary keyword, is a key in a relational database that is unique for each
record.
A relation (table) can have one and only one primary key.
Primary keys typically appear as columns in relational database tables.
They allow you to find the relation between two tables.
For Example: In the above-given example, student ID is a primary key because it uniquely identifies a Student
record. In this table, no other Student can have the same Student ID, And Multiple students can enroll in the
same college but a single student cannot enroll in multiple colleges at a time.
2.3 Foreign Key:
A foreign key acts as a cross-reference between tables: it references the primary key of another table.
A correct definition of a foreign key: foreign keys are the columns of a table that point to the
candidate key of another table.
For example:
Table: Country
Country_Id Country_Name
01 India
02 China
03 Nepal
Table: State
State_Id Country_Id State_Name
1 01 Madhya Pradesh
2 01 Uttar Pradesh
3 02 Fujian
4 02 Beijing
5 03 Mahakali
In the above example, there are two tables: one for countries and the other containing a number of
states.
In the Country table, Country_id is the primary key, whereas Country_id is used as a foreign key in the State
table. With the help of the foreign key, we can easily access records of the Country table from the State table.
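As a sketch, the two tables above could be declared as follows, with Country_id as the foreign key (data types are assumptions, and syntax varies slightly by DBMS):
CREATE TABLE Country (
  Country_id   CHAR(2) PRIMARY KEY,
  Country_name VARCHAR(50)
);
CREATE TABLE State (
  State_id   INT PRIMARY KEY,
  Country_id CHAR(2),
  State_name VARCHAR(50),
  FOREIGN KEY (Country_id) REFERENCES Country (Country_id)  -- points to the candidate key of Country
);
An attempt to insert a State row with a Country_id not present in Country would then be rejected.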
2.4 Composite Key:
A primary key consisting of two or more attributes is called a composite key.
It is a combination of two or more columns.
For Example:
Here, the composite key consists of StudentID and StudentEnrollNo: the table has two attributes as its primary
key.
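A sketch of how such a composite key could be declared (the StudentName column and the data types are assumptions):
CREATE TABLE Student (
  StudentID       INT,
  StudentEnrollNo INT,
  StudentName     VARCHAR(50),
  PRIMARY KEY (StudentID, StudentEnrollNo)  -- composite key: both columns together identify a row
);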
2.5 Unique Key:
A unique key is a little like a primary key, but it can accept one null value and it cannot have duplicate values.
The unique key and primary key both provide a guarantee for uniqueness for a column or a set of
columns.
There is an automatically defined unique key constraint within a primary key constraint.
There may be many unique key constraints for one table, but only one PRIMARY KEY constraint for
one table.
The UNIQUE constraint ensures that all values in a column are different.
1. The primary key will not accept NULL values whereas the unique key can accept one NULL value.
2. A table can have only one primary key whereas there can be multiple unique keys on a table.
3. A clustered index is automatically created when a primary key is defined whereas a unique key generates
a non-clustered index.
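A sketch illustrating the difference (the Email column is a hypothetical example; how many NULLs a UNIQUE column may hold varies by DBMS):
CREATE TABLE Student (
  StudentID INT PRIMARY KEY,      -- primary key: unique, no NULLs, one per table
  Email     VARCHAR(100) UNIQUE,  -- unique key: no duplicates, but a NULL is accepted
  Name      VARCHAR(50)
);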
2.6 Alternate Key
An alternate key is a key associated with one or more columns whose values uniquely identify every row in
the table, but which is not the primary key.
As we have seen in the candidate key guide that a table can have multiple candidate keys. Among these
candidate keys, only one key gets selected as the primary key, the remaining keys are known as alternative
or secondary keys.
For Example:
Where the primary key for a table may be the student ID, the alternate key might combine the first, middle, and
last names of the student. Each alternate key can generate a unique index or a unique constraint in a target
database.
From the above example, studentID and StudentEnrollNo both can be a primary key because they both can
give a unique record separately.
We can also consider StudentID as the primary key; the other candidate columns then become alternate keys.
2.7 Candidate Key:
The value of a candidate key is unique and non-null for every tuple.
There can be more than one candidate key in a relation.
A super key with no redundant attribute is known as a candidate key.
Candidate keys are selected from the set of super keys.
The candidate key can be simple (having only one attribute) or composite as well. For Example, {country_id,
State_id} is a composite candidate key for relation Country_State
Table: Country
Country_Id Country_Name
01 India
02 China
03 Nepal
Table: Country_State
State_Id Country_Id State_Name
1 01 Madhya Pradesh
2 01 Uttar Pradesh
3 02 Fujian
4 02 Beijing
5 03 Mahakali
SECTION 3: ER MODEL:
3.1 ER Model
An Entity-relationship model (ER model) describes the structure of a database with the help of a diagram,
which is known as an Entity Relationship Diagram (ER Diagram). An ER model is a design or blueprint of a
database that can later be implemented as a database. It also develops a very simple and easy to design view
of data. It defines the conceptual view of a database.
An ER diagram shows the relationship among entity sets. An entity set is a group of similar entities and these
entities can have attributes. In terms of DBMS, an entity is a table or attribute of a table in the database, so by
showing a relationship among tables and their attributes, the ER diagram shows the complete logical structure
of a database.
[Example ER diagram omitted.]
Component of ER Diagram
3.2 Symbols Used in ER
Entity – An entity can be a real-world object, either animate or inanimate, that can be easily identifiable. An
entity set is a collection of similar types of entities. An entity set may contain entities with attributes sharing
similar values. An entity may be any object, class, person or place. In the ER diagram, an entity can be
represented as rectangles.
Weak Entity – An entity that depends on another entity is called a weak entity. The weak entity doesn't contain
any key attribute of its own, and it depends on a
strong entity to ensure its existence. Unlike a strong entity, a weak entity does not have a
primary key; it has a partial discriminator key instead. The weak entity is represented by a double rectangle.
The relation between one strong and one weak entity is represented by a double diamond.
A strong entity is not dependent on any other entity in a schema. A strong entity always has a primary key.
The strong entity is represented by a single rectangle. The relationship between two strong entities is represented by a
single diamond. Various strong entities together make up a strong entity set.
2. Attribute – Attributes are the properties of entities. Attributes are represented by means of ellipses. Every
ellipse represents one attribute and is directly connected to its entity (rectangle).
An attribute can be of many types, here are different types of attributes defined in the ER database
model:
Key Attribute – The key attribute is used to represent the main characteristics of an entity. It represents a
primary key. The key attribute is represented by an ellipse with the text underlined.
Composite Attribute – Composite attributes are made of more than one simple attribute. For example, a
student’s complete name may have first_name and last_name. The composite attribute is represented by an
ellipse, and those ellipses are connected with an ellipse.
Multivalued Attribute – Multi-value attributes may contain more than one value. For example, a person can
have more than one phone number, email_address, etc. The double oval is used to represent the multivalued
attribute.
Derived Attribute – Derived attributes are the attributes that do not exist in the physical database, but their
values are derived from other attributes present in the database. For example, age can be derived
from date_of_birth. It is represented by a dashed ellipse.
3.3 Cardinality
Mapping Constraints
Cardinality defines the number of entities in one entity set which can be associated with the number of
entities of the other set via a relationship set. In simple words, it refers to the relationship one table can have
with the other table.
Notations used for Cardinality:
Relationship –
Types of Relationships:
One to One – When only one instance of an entity is associated with the relationship, it is marked as ‘1:1’.A
one-to-one relationship can be used for security purposes, to divide a large table, and various other specific
purposes.
One to Many – When a single instance of an entity is associated with more than one instance of another entity
then it is called one to many relationships. This is the most common relationship type. One-to-Many
relationships can also be viewed as Many-to-One relationships, depending on which way you look at it.
Many to One – More than one entity from entity set A can be associated with at most one entity of entity set B,
however an entity from entity set B can be associated with more than one entity from entity set A.
Many to Many- Entity from A can be associated with more than one entity from B and vice versa. A many-to-
many relationship could be thought of as two one-to-many relationships, linked by an intermediary table.
In generalization, two or more entities combine to form a new higher-level entity. The higher-level entity can
also combine with other lower-level entities to make a further higher-level entity.
It is a Bottom up Approach.
It is a reverse process of Specialization.
It’s more like Super-class and Sub-class system.
Sub-classes are combined to form a super-class.
Examples:
DBMS Specialization:
Specialization is a designing procedure that proceeds in a top-down manner.
Examples:
Generalization helps in reducing the size of the schema whereas specialization is just the opposite: it increases
the number of entities, thereby increasing the size of the schema.
Generalization is always applied to the group of entities whereas, specialization is always applied on a
single entity.
Generalization results in a formation of a single entity whereas, Specialization results in the formation of
multiple new entities.
Generalization is a bottom-up approach, whereas Specialization is a Top-down approach.
DBMS Aggregation:
The ER model helps in database design with utmost efficiency. One of the major limitations of the ER
model is its inability to represent a relationship among relationships.
A ternary relationship can be represented using the ER model, but a lot of redundancy will
arise. Hence, the concept of aggregation is used to remove these redundancies.
In aggregation, the relation between two entities is treated as a single entity. In aggregation, relationship with
its corresponding entities is aggregated into a higher level entity.
Example:
3.5 Convert ER model into table:
The database can be represented using the notations, and these notations can be reduced to a collection of
tables.
In the database, every entity set or relationship set can be represented in tabular form.
There are some points for converting the ER diagram to the table:
o Entity type becomes a table.
In the given ER diagram, LECTURE, STUDENT, SUBJECT and COURSE forms individual tables.
In the STUDENT entity, STUDENT_NAME and STUDENT_ID form the columns of the STUDENT table. Similarly,
COURSE_NAME and COURSE_ID form the columns of the COURSE table, and so on.
In the given ER diagram, COURSE_ID, STUDENT_ID, SUBJECT_ID, and LECTURE_ID are the key attributes
of the entities.
In the student table, a hobby is a multivalued attribute. So it is not possible to represent multiple values in a
single column of STUDENT table. Hence we create a table STUD_HOBBY with column name STUDENT_ID
and HOBBY. Using both columns, we create a composite key.
In the given ER diagram, student address is a composite attribute. It contains CITY, PIN, DOOR#, STREET,
and STATE. In the STUDENT table, these attributes are merged as individual columns.
In the STUDENT table, Age is the derived attribute. It can be calculated at any point of time by calculating the
difference between current date and Date of Birth.
Using these rules, you can convert the ER diagram to tables and columns and assign the mapping between
the tables. Table structure for the given ER diagram is as below:
TABLE STRUCTURE:
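Since the table-structure figure is not reproduced here, the following is a sketch of the resulting SQL, applying the rules above (data types and the DOOR_NO spelling are assumptions):
CREATE TABLE STUDENT (
  STUDENT_ID    INT PRIMARY KEY,
  STUDENT_NAME  VARCHAR(50),
  DATE_OF_BIRTH DATE,                     -- Age is derived from this, so it is not stored
  CITY VARCHAR(30), PIN CHAR(6), DOOR_NO VARCHAR(10),
  STREET VARCHAR(30), STATE VARCHAR(30)   -- composite address merged as individual columns
);
CREATE TABLE STUD_HOBBY (
  STUDENT_ID INT,
  HOBBY      VARCHAR(30),
  PRIMARY KEY (STUDENT_ID, HOBBY)         -- multivalued attribute moved to its own table with a composite key
);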
SECTION 4: RELATIONAL MODEL:
The relational model represents a database as a collection of relations (tables): each row (tuple) is a record and each column (attribute) is a property.
[Sample relation omitted.]
Relational instance: In the relational database system, a relation instance is represented by a finite set of
tuples. Relation instances do not have duplicate tuples.
Relation schema: A relation schema describes the structure of the relation: the name of
the relation (the name of the table), its attributes, and their names and types.
Relation key: A relation key is an attribute that can uniquely identify a particular
tuple (row) in a relation (table).
Example for Student Relation:
S_Name S_Roll S_Mobile S_Address
Amit 1011 0987654327 Delhi
Sumit 1012 7896547484 Gwalior
Ram 1013 9456788343 Gurgaon
Shyam 1014 8764678934 Noida
In the given table, S_Name, S_Roll, S_Mobile, and S_Address are the attributes.
The instance of schema STUDENT has 4 tuples; for example,
t3 = <Ram, 1013, 9456788343, Gurgaon>
Properties of Relations:
The name of the relation is distinct from all other relations.
Each relation cell contains exactly one atomic (single) value
Each attribute has a distinct name
The order of attributes has no significance
There are no duplicate tuples
The order of tuples has no significance
4.2 CONSTRAINTS:
Constraints impose limits on the data, or on the type of data, that can be inserted, updated, or deleted in a table.
• This ensures the accuracy and reliability of the data in the database.
• It could be either on a column level or a table level.
• The column level constraints are applied only to one column, whereas the table level constraints are applied
to the whole table.
• Relational constraints are the restrictions imposed on the database contents and operations.
• Integrity constraints are a set of rules. It is used to maintain the quality of information.
• Thus, integrity constraint is used to guard against accidental damage to the database.
1. Domain Constraints:
Domain constraints require each attribute value to be an atomic value from the attribute's declared domain (data type).
For example, if the age attribute is declared as an integer, the value 'A' is not allowed, since only integer values can be taken by the age attribute.
2. Tuple Uniqueness Constraint:
A tuple uniqueness constraint specifies that all the tuples of a relation must be unique.
Example 1: This relation satisfies the tuple uniqueness constraint, since all the tuples here are unique.
Example 2:
Country_id Country_name Country_population
This relation does not satisfy the tuple uniqueness constraint since here all the tuples are not unique.
3. Key Constraints:
Keys are the attributes that uniquely identify an entity within its entity set.
An entity set can have multiple keys, but out of which one key will be the primary key.
All the values of the primary key must be unique.
The value of the primary key must not be null.
Example:
This relation does not satisfy the key constraint as here all the values of the primary key are not unique.
4. Entity Integrity Constraint:
The entity integrity constraint states that the primary key value can't be null.
This is because the primary key value is used to identify individual rows in relation and if the primary key has a
null value, then we can’t identify those rows.
A table can contain a null value other than the primary key field.
Example 1:
This relation does not satisfy the entity integrity constraint as here the primary key contains a NULL value.
Ex-2:
[Table omitted: a relation whose primary key column contains no NULL values.]
This relation satisfies the entity integrity constraint as here the primary key does not contain a NULL value.
5. Referential Integrity Constraint:
A referential integrity constraint is specified between two tables: a foreign key in the referencing table must either be null or match a primary key value of the referenced table. A table can contain a null value in fields other than the primary key field.
Example:
Consider the following two relations- ‘Country’ and ‘State’.
Here, relation ‘State’ references the relation ‘Country’.
Country_id Country_name
C01 India
C02 Nepal
Codd's Twelve Rules:
These rules can be applied to any database system that manages stored data using only its relational
capabilities.
1. Rule 0: The Foundation Rule. Any system claimed to be a relational database management system must be able to manage databases
entirely through its relational capabilities.
2. Rule 1: The Information Rule. All information in an RDBMS is represented logically in just one way - by values in tables. In other words, all
information (including metadata) is to be represented as stored data in cells of tables. The rows and columns have to be strictly
unordered.
3. Rule 2: The Guaranteed Access Rule. Every single data element (value) is guaranteed to be accessible logically with a
combination of table name, primary key (row value), and attribute name (column value). No other means, such as pointers, can be
used to access data.
4. Rule 3: Systematic Treatment of Nulls. Null has several meanings: it can mean missing data, not applicable, or no
value. It should be handled consistently. Also, a primary key must never be null, and an expression on null must give null.
5. Rule 4: Active Online Catalog Rule. The structure of the database must be stored in an online catalog which can be queried by
authorized users.
6. Rule 5: Powerful and Well-Structured Language Rule. The system must support at least one relational
language that:
Has a linear syntax
Can be used both interactively and within application programs
Supports data definition operations (including view definitions), data manipulation operations
(update as well as retrieval), security and integrity constraints, and transaction management
operations (begin, commit, and rollback).
7. Rule 6: View Updating Rule. Different views created for various purposes should be automatically
updatable by the system.
8. Rule 7: Relational-Level Operations Rule. A database must support high-level insertion, updating,
and deletion. These operations must not be limited to a single row; the system must also support union, intersection, and
minus operations to yield sets of data records.
9. Rule 8: Physical Data Independence Rule. Any modification in the physical location of a table
should not force modification at the application level.
10. Rule 9: Logical Data Independence Rule. Changes to the logical level (tables, columns, rows, and
so on) must not require a change to applications based on that structure. Logical data independence is more
difficult to achieve than physical data independence.
11. Rule 10: Integrity Independence Rule. The database should be able to enforce its own integrity
rather than relying on other programs. Key and check constraints, triggers, etc. should be stored in the data dictionary.
This also makes the RDBMS independent of the front end.
12. Rule 11: Distribution Independence Rule. The distribution of data over various locations
should not be visible to end users.
13. Rule 12: Nonsubversion Rule. If low-level access is allowed to the system, it should not be able to
subvert or bypass integrity rules to change the data.
Relational Algebra:
Following are the important relational algebra operators-
1. Selection Operator (σ): It selects tuples that satisfy the given predicate from a relation. In other words,
the SELECT operation is used for selecting a subset of the tuples according to a given selection
condition.
Notation: σp(r)
Example:
σCountry_name=”India”(COUNTRY)
Output: the tuple(s) of COUNTRY whose Country_name is 'India'.
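For comparison, a rough SQL equivalent of this selection (assuming the Country table used in the examples below):
SELECT * FROM Country WHERE Country_name = 'India';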
2. Projection Operator(π):
Projection Operator (π) is a unary operator in relational algebra that performs a projection operation.
It displays the columns of a relation or table based on the specified attributes.
Project operator in relational algebra is similar to the Select statement in SQL.
Syntax:
π column_name1, column_name2, ..., column_nameN (table_name)
Example:
Table: Country
Input:
π Country_name, Country_population (Country)
Output:
Country_name Country_population
India 4000
Nepal 4567
China 4324
Bhutan 5675
Pakistan 5623
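For comparison, a rough SQL equivalent (DISTINCT approximates the duplicate elimination that π performs):
SELECT DISTINCT Country_name, Country_population FROM Country;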
3. Rename Operator (ρ): The rename (ρ) operation can be used to rename a relation or an attribute of
a relation. It is denoted by rho (ρ). The results of relational algebra expressions are also relations, but without any name.
Syntax:
ρ x (E) [where the result of expression E is saved with the name x]
or
ρ (Relation_New, Relation_Old)
Example:
Table: Country
Input:
ρ (Country_pop, π Country_population (Country))
Output:
Country_population
4000
4567
4324
5675
5623
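SQL has no direct ρ operator; a rough analogue is a table alias on a derived table:
SELECT * FROM (SELECT Country_population FROM Country) AS Country_pop;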
4. Union Operator (∪): This operation is used to fetch data from two relations (tables) or temporary
relations (results of other operations).
For this operation to work, the relations (tables) specified should have the same number of attributes (columns) and the same
attribute domains. Duplicate tuples are automatically eliminated from the result.
Example:
Table 1: Course
Table 2: Student
Student_id Student_name Student_age
C01 Amit 22
C05 Neha 33
C03 Rahul 32
C04 Ravi 25
Input:
Π Student_name (Course) U π Student_name(Student)
Output:
Student_name
Amit
Rahul
Ravi
Note: As you can see there are no duplicate names present in the output even though we had few
common names in both the tables, also in the COURSE table we had the duplicate name itself.
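For comparison, a rough SQL equivalent (SQL's UNION likewise removes duplicates):
SELECT Student_name FROM Course
UNION
SELECT Student_name FROM Student;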
5. Cartesian Product(×):This is used to combine data from two different relations(tables) into one and fetch data from
the combined relation.
It is denoted by ×.
Syntax: A × B
Example:
Table 1: Student
S_id S_name S_dep
S01 Sumit A
S02 Honey C
S03 Harry B
Table 2: Department:
Dep_id Dep_name
A CS
B Mechanical
C Sales
Input:
Student × Department
Output (first three of the 3 × 3 = 9 rows):
S_id S_name S_dep Dep_id Dep_name
S01 Sumit A A CS
S02 Honey C A CS
S03 Harry B A CS
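For comparison, a rough SQL equivalent (see also the CROSS JOIN section later):
SELECT * FROM Student CROSS JOIN Department;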
6. Minus/Set-Difference (–): The result of the set-difference operation is the set of tuples that are present in one relation but
not in the other.
It is denoted by the – symbol.
Syntax: A – B
Input:
Finds all the tuples that are present in A but not in B.
π author(Books) – π author(Articles)
Output − Provides the name of authors who have written books but not articles.
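For comparison, a rough SQL equivalent (standard SQL uses EXCEPT; Oracle calls the same operator MINUS):
SELECT author FROM Books
EXCEPT
SELECT author FROM Articles;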
7. Intersection (∩): An intersection is denoted by the symbol ∩. It defines a relation consisting of the set of all tuples that are in both A
and B. However, A and B must be union-compatible.
Syntax: A ∩ B
Table 1: Depositor
Cust_name Acc_number
Lara A-01
Harry A-02
Potter A-04
Smith A-10
Table 2: Borrower
Cust_name Loan_number
John L-05
Harry L-012
Potter L-07
Shiv L-11
Input:
π Cust_name (Borrower) ∩ π Cust_name (Depositor)
Output:
Cust_name
Harry
Potter
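For comparison, a rough SQL equivalent:
SELECT Cust_name FROM Borrower
INTERSECT
SELECT Cust_name FROM Depositor;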
Relational Calculus:
In a procedural language (e.g., relational algebra), query specification involves giving a step-by-step process of obtaining the query result;
this is difficult for non-experts.
In a non-procedural language (e.g., relational calculus), query specification involves giving the logical conditions the result is required to satisfy;
this is easy for non-experts.
Tuple Relational Calculus: A query is expressed as {t | P(t)}, the set of all tuples t for which the predicate P is true.
Example:
Table-1: Student
Resulting relation: [omitted]
Domain Relational Calculus:In domain relational calculus the records are filtered based on the domain of the
attributes and not based on the tuple values.
Domain relational calculus uses the same operators as tuple calculus. It uses logical connectives ∧ (and), ∨ (or) and ┓
(not).
It uses Existential (∃) and Universal Quantifiers (∀) to bind the variable.
A query is of the form {<x1, x2, ..., xn> | P(x1, x2, ..., xn)}, where x1, x2, ..., xn represent domain variables and P represents a formula
composed of atoms, as in tuple relational calculus.
Atomic formulas:
<x1, x2, ..., xn> ∈ r, where r is a relation on n attributes and x1, x2, ..., xn are domain values or domain constants.
x Θ c, where x is a domain variable, Θ is a comparison operator, and c is a constant in the domain of the attribute
for which x is a domain variable.
Example:
Output:
Student_name Student_age
Neha 33
Rahul 32
Join Operations
Join is a binary operation which allows you to combine join product and selection in one single statement. The goal of
creating a join condition is that it helps you to combine the data from multiple join tables.
1. Inner joins:
Theta join
Equi join
Natural Join
2. Outer joins: Left Outer Join, Right Outer Join, Full Outer Join
Inner join:This is the most common type of join and is similar to AND operation. It combines the results of one or more tables and
displays the results when all the filter conditions are met.
Returns records that have matching values in both tables.
Syntax:
Select *
FROM table1
INNER JOIN table2
ON table1.column_name = table2.column_name;
Theta join:A theta join is a join that links tables based on a relationship other than equality between two columns.
A theta join could use any operator other than the “equal” operator.
Basically it is also known as Generic join.
It is same as Equi join but it allows all other operators like >, <, >= etc..
Equi join:In equi join, tables are merged on the basis of common attributes. For whatever the Join type (Inner, outer
etc..), if we use only the equality operator(=) then we can say that the join is EQUI join.
Natural join:The join involves an Equality test, and thus is often described as an Equi-join.
The natural join will remove the duplicate attributes. No comparison operator is used in Natural Join.
Syntax:
Select *
FROM table1
NATURAL JOIN table2;
Outer join: All the joins given above are inner joins; an inner join keeps only the tuples satisfying the given condition or
matching attributes. In an outer join, all the tuples of a relation are kept in the resulting relation, based
on the type of join.
An Outer Join does not require each record in the two joined tables to have a matching record.
Syntax:
Select *
FROM table1,table2
WHERE conditions [ + ];
Left Outer join: Keeps all rows from the left-hand table; if there is no matching row in the right
table, it returns NULL values.
Syntax:
Select *
FROM table1
LEFT OUTER JOIN table2
ON table1.column_name = table2.column_name;
Right Outer join: Keeps all rows from the right-hand table; if there is no matching row in the left
table, it returns NULL values.
Syntax:
Select *
FROM table1
RIGHT OUTER JOIN table2
ON table1.column_name = table2.column_name;
Full Outer join: A full outer join combines the results of both the left outer join and the right outer join. It keeps rows from
both tables, returning a combined row from either table when the conditions are met and NULL
values when there is no match.
Syntax:
Select *
FROM table1
FULL OUTER JOIN table2
ON table1.column_name = table2.column_name;
Self join: A self join is a special form of equi join or inner join in which a table is joined against itself.
Syntax:
SELECT a.column_name, b.column_name ...
FROM table1 a, table1 b
WHERE a.column_field = b.column_field;
Cross join: The CROSS JOIN produces a result set which is the number of rows in the first table multiplied by the
number of rows in the second table, if no WHERE clause is used along with CROSS JOIN. This kind of result is called
as Cartesian Product.
Syntax:
Select *
FROM table1
CROSS JOIN table2;
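As a concrete sketch, joining the Student and Department tables from the Cartesian-product example (assuming S_dep holds Dep_id values):
SELECT s.S_name, d.Dep_name
FROM Student s
INNER JOIN Department d ON s.S_dep = d.Dep_id;
This pairs each student with his or her department name: (Sumit, CS), (Honey, Sales), (Harry, Mechanical).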
Section 5: Normalization
5.1 What is Normalization
Normalization is the process of organizing data in a database to minimize redundancy and undesirable anomalies. The normal forms are:
1NF A relation will be in 1NF if every attribute contains only atomic (single) values.
2NF A relation will be in 2NF if it is in 1NF and all non-key attributes are fully functionally
dependent on the primary key.
3NF A relation will be in 3NF if it is in 2NF and no non-prime attribute is transitively dependent on the primary key.
BCNF A relation will be in BCNF if it is in 3NF and, for every functional dependency X → Y, X is a super key.
4NF A relation will be in 4NF if it is in Boyce-Codd normal form and has no multi-valued
dependency.
5NF A relation is in 5NF if it is in 4NF, does not contain any join dependency, and joining
is lossless.
5.2 Functional Dependency
Suppose there is a relation R with two attributes X and Y, where the value of X uniquely determines the value of Y.
This relationship is written as:
X → Y
Here Y is functionally dependent on X;
X is the determinant set and
Y is the dependent attribute.
Example:-
We have <Country >table with two attributes cid and cname
cid cname
C01 India
C02 Pakistan
Therefore, the functional dependency between cid and cname can be written as follows; cname is functionally dependent
on cid:
cid -> cname
Example:- We are considering the same <Country> table to understand the concept of trivial dependency.
A functional dependency A → B is trivial if B is a subset of A; for example, {cid, cname} -> cid is a trivial dependency.
5.3 Inference Rules:
5.3.1 Reflexive Rule:
If B is a subset of A, then A determines B. This property is trivial.
If A ⊇ B then A → B
5.3.2 Augmentation Rule:
If A → B holds, then augmenting both sides with a set of attributes C preserves the dependency.
If A → B then AC → BC
5.3.3 Transitive Rule:
As with the transitive rule in algebra, if A → B holds and B → C holds, then
A → C also holds (A → B is read as "A functionally determines B").
If A → B and B → C then A → C
5.3.4 Additive/Union Rule:
If A → B holds and A → C holds, then A → BC also holds.
If A → B and A → C then A → BC
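A short worked derivation using these rules: suppose a relation R(A, B, C, D) has the dependencies A → B, A → C, and B → D. Then:
A → BC (union rule applied to A → B and A → C)
A → D (transitive rule applied to A → B and B → D)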
5.4 Normal Forms
First Normal Form (1NF):
It states that an attribute of a table cannot hold multiple values; it must hold only single-valued
attributes.
First normal form disallows multi-valued attributes, composite attributes, and their combinations.
[Example omitted: the decomposition of a Student table into 1NF.]
Second Normal Form (2NF):
Example:- Suppose a school wants to store the data of teachers and the subjects they teach. They
create a table that looks like this: since a teacher can teach more than one subject, the table can have
multiple rows for the same teacher.
The table is in 1 NF because each attribute has atomic values. However, it is not in 2NF because non prime
attribute Teacher_age is dependent on Teacher_id alone which is a proper subset of candidate key. This
violates the rule for 2NF as the rule says “no non-prime attribute is dependent on the proper subset of any
candidate key of the table”.
To make the table comply with 2NF, we can break it into two tables like this:
Teacher_id Teacher_age
101 38
102 38
103 40
Teacher_id Subject
101 Maths
101 Physics
102 Biology
103 Physics
103 Chemistry
Now the tables comply with Second normal form (2NF).
Third Normal Form (3NF):
A relation is in 3NF if it is in 2NF and has no transitive functional dependency. By transitive functional dependency, we mean the following
relationship in the table: A functionally determines B, and B functionally determines C; in this case, C is transitively dependent on A via B.
Example:
In the table above, [Book ID] determines [Genre ID], and [Genre ID] determines [Genre Type]. Therefore, [Book ID] determines [Genre
Type] via [Genre ID]; we have a transitive functional dependency, and this structure does not satisfy third normal form.
To bring this table to third normal form, we split the table into two as follows:
Now all non-key attributes are fully functionally dependent only on the primary key. In [TABLE_BOOK], both [Genre ID] and [Price] are
only dependent on [Book ID]. In [TABLE_GENRE], [Genre Type] is only dependent on [Genre ID].
Fourth Normal Form (4NF):
A relation is in 4NF if it is in BCNF and has no multi-valued dependency. For a dependency A → B, if for a single value of A multiple
values of B exist, then the relation has a multi-valued dependency.
STUDENT
STU_ID COURSE HOBBY
21 Computer Dancing
21 Math Singing
34 Chemistry Dancing
74 Biology Cricket
59 Physics Hockey
The given STUDENT table is in 3NF, but COURSE and HOBBY are two independent entities. Hence, there is no relationship
between COURSE and HOBBY.
So to make the above table into 4NF, we can decompose it into two tables:
STUDENT_COURSE
STU_ID COURSE
21 Computer
21 Math
34 Chemistry
74 Biology
59 Physics
STUDENT_HOBBY
STU_ID HOBBY
21 Dancing
21 Singing
34 Dancing
74 Cricket
59 Hockey
Fifth Normal Form (5NF):
Example:
SUBJECT LECTURER SEMESTER
Computer Anshika sem1
Computer John sem1
Math John sem1
Chemistry Praveen sem1
Math Akash sem2
In the above table, John takes both Computer and Math classes for Semester 1, but he doesn't take the Math class for Semester 2. In this
case, a combination of all these fields is required to identify valid data.
Suppose we add a new Semester as sem3 but do not know about the subject and who will be taking that subject so we
leave Lecturer and Subject as NULL. But all three columns together act as the primary key, so we can't leave the other two
columns blank.
So to make the above table into 5NF, we can decompose it into three relations P1, P2 & P3:
P1
SEMESTER SUBJECT
sem1 Computer
sem1 Math
sem1 Chemistry
sem2 Math
P2
SUBJECT LECTURER
Computer Anshika
Computer John
Math John
Math Akash
Chemistry Praveen
P3
SEMESTER LECTURER
sem1 Anshika
sem1 John
sem2 Akash
sem1 Praveen
5.5 Relational Decomposition:
1. Lossless decomposition-
Lossless decomposition ensures that no information is lost: natural-joining the decomposed relations reproduces the original relation, e.g.,
Employee ⋈ Department reconstructs the original relation.
2. Dependency Preserving-
Dependency is an important constraint on the database.
Every dependency must be satisfied by at least one decomposed table.
This property allows checking updates without computing the natural join of the database structures.
If we decompose a relation R into relations R1 and R2, All dependencies of R either must be a part of R1 or R2 or
must be derivable from combination of FD’s of R1 and R2.
For Example, A relation R (A, B, C, D) with FD set{A->BC} is decomposed into R1(ABC) and R2(AD) which is
dependency preserving because FD A->BC is a part of R1(ABC).
5.6 Multivalued Dependency:
Example:-
Suppose there is a bike manufacturing company which produces two colors (red and blue) of each model every year.
Here the columns BIKE_COLOR and MANUF_YEAR are dependent on BIKE_MODEL and independent of each other.
In this case, these two columns are said to be multivalued dependent on BIKE_MODEL. These
dependencies are written as:
BIKE_MODEL →→ BIKE_COLOR
BIKE_MODEL →→ MANUF_YEAR
5.7 Join Dependency:
A join dependency exists if a relation can be recreated by joining its projections. Consider a <Country> relation with attributes Country_id, Country_name, and Country_population. It can be decomposed into the following three tables; therefore it is not in 5NF:
<Country_name>
Country_id Country_name
C01 India
C02 Pakistan
C03 Afganistan
<Country_population>
Country_id Country_population
C01 3000
C02 4000
C03 4087
<Namepopulation>
Country_name Country_population
India 3000
Pakistan 4000
Afganistan 4087
Inclusion Dependency:
An Inclusion Dependency is a statement of the form that some columns of a relation
are contained in other columns. A foreign key constraint is an example of inclusion dependency.
The foreign key column(s) of the referring relation must be contained in the primary key column(s) of the referenced relation.
In inclusion dependency, we should not split groups of attributes that participate in an inclusion
dependency.
In practice, most inclusion dependencies are key-based, that is, they involve only keys.
Example: State.Country_id ⊆ Country.Country_id (every Country_id value appearing in State must also appear in Country).
SECTION 6: TRANSACTIONS:
A transaction is a set of logically related operations executed as a single unit of work. Consider transferring 100 units from account A to account B:
Read (A)
A=A-100
Write (A)
Read(B) → If the transaction fails here, then the system will be inconsistent, as 100 units have been debited from
account A but not added to account B.
B = B + 100
Write (B)
To remove this partial-execution problem, we increase the level of atomicity and bundle all the
instructions of a logical operation into a unit called a transaction.
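As a sketch, the same transfer written as a single SQL transaction (the Account table and column names are assumptions; BEGIN/START syntax varies by DBMS):
BEGIN;
UPDATE Account SET balance = balance - 100 WHERE acc_no = 'A';
UPDATE Account SET balance = balance + 100 WHERE acc_no = 'B';
COMMIT;
If either UPDATE fails, issuing ROLLBACK undoes the partial work, so account A is never debited without account B being credited.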
Transaction Operations:
Following are the main operations of transaction:
Read(X): Read operation is used to read the value of X from the database and stores it in a buffer in
main memory.
Write(X): Write operation is used to write the value back to the database from the buffer.
1. Atomicity
The atomicity property of a transaction requires that all operations of a transaction be completed, if not, the
transaction is aborted. A transaction is treated as single, individual logical unit of work.
"A" stands for atomicity it states that either all the instructions participating in a transaction will execute
or none. Atomicity is guaranteed by transaction management component.
2. Consistency
"C" stands for consistency it states that if a database is consistent before the execution of a transaction that if
must remains consistent after execution of a transaction.
If the transaction fails, the database must be returned to the state it was in prior to the execution of the failed
transaction.
Note: If atomicity, isolation, durability holds well then consistency holds well automatically.
3. Isolation
Isolation property of a transaction means that the data used during the execution of a transaction cannot be
used by a second transaction until the first one is completed.
Isolation means that whether a transaction runs in isolation or concurrently with other transactions, the result must be the
same. The concurrency control component takes care of isolation.
4. Durability
When a transaction is completed, the database reaches a consistent state and that state cannot be lost, even in
the event of system's failure.
Durability means that the work done by a successful transaction must remain in the system. Even in case of any
hardware or software failure.
Note: Recovery management component takes care of durability.
[Diagram: transaction state transitions. Active → Partially Committed → Committed; on system failure or abort, Active/Partially Committed → Failed → Aborted, with all changes rolled back.]
Active state
o The active state is the first state of every transaction. In this state, the transaction is being
executed.
o For example: Insertion or deletion or updating a record is done here. But all the records are still
not saved to the database.
Partially committed
o In the partially committed state, a transaction executes its final operation, but the data is still not
saved to the database.
o In the total mark calculation example, a final display of the total marks step is executed in this
state.
Committed
A transaction is said to be in a committed state if it executes all its operations successfully. In this state,
all the effects are now permanently saved on the database system.
Failed state
o If any of the checks made by the database recovery system fails, then the transaction is said to be
in the failed state.
o In the example of total mark calculation, if the database is not able to fire a query to fetch the
marks, then the transaction will fail to execute.
Aborted
o If any of the checks fail and the transaction has reached a failed state then the database recovery
system will make sure that the database is in its previous consistent state. If not then it will abort
or roll back the transaction to bring the database into a consistent state.
o If the transaction fails in the middle, then all the operations executed so far are
rolled back to restore the previous consistent state.
o After aborting the transaction, the database recovery module will select one of the two operations:
1. Re-start the transaction
2. Kill the transaction
Types of Transaction:
Based on Actions
Two-step
Restricted
Action model
Based on Structure
Flat or simple transactions: A flat transaction consists of a sequence of primitive operations executed between begin
and end operations.
Nested transactions: A transaction that contains other transactions.
Workflow
Schedule:
A schedule is the chronological order in which the operations of multiple transactions are executed. Types of schedule:
1. Serial Schedule
2. Non-serial Schedule
3. Serializable Schedule
1.Serial Schedule
The serial schedule is a type of schedule where one transaction is executed completely before starting
another transaction. In the serial schedule, when the first transaction completes its cycle, then the
next transaction is executed.
For example: Suppose there are two transactions T1 and T2 which have some operations. If it has no
interleaving of operations, then there are the following two possible outcomes:
Execute all the operations of T1 followed by all the operations of T2.
Execute all the operations of T2 followed by all the operations of T1.
In the given (a) figure, Schedule A shows the serial schedule where T1 followed by T2.
In the given (b) figure, Schedule B shows the serial schedule where T2 followed by T1.
2. Non-serial Schedule
If interleaving of operations is allowed, then the schedule is non-serial.
It contains many possible orders in which the system can execute the individual operations of the
transactions.
In the given figure (c) and (d), Schedule C and Schedule D are the non-serial schedules. It has
interleaving of operations.
3. Serializable schedule
The serializability of schedules is used to find non-serial schedules that allow the transaction to
execute concurrently without interfering with one another.
It identifies which schedules are correct when executions of the transaction have interleaving of their
operations.
A non-serial schedule will be serializable if its result is equal to the result of its transactions executed
serially.
Figures to explain these schedules:
a) Schedule A (serial: T1 followed by T2)
T1: Read (A); A := A - N; Write (A); Read (B); B := B + N; Write (B);
T2: Read (A); A := A + M; Write (A);
b) Schedule B (serial: T2 followed by T1)
T2: Read (A); A := A + M; Write (A);
T1: Read (A); A := A - N; Write (A); Read (B); B := B + N; Write (B);
c) Schedule C (non-serial, interleaved)
T1: Read (A); A := A - N;
T2: Read (A); A := A + M;
T1: Write (A); Read (B);
T2: Write (A);
T1: B := B + N; Write (B);
d) Schedule D (non-serial, interleaved)
T1: Read (A); A := A - N; Write (A);
T2: Read (A); A := A + M; Write (A);
T1: Read (B); B := B + N; Write (B);
Here,
Schedule A and Schedule B are serial schedules.
Schedule C and Schedule D are non-serial schedules.
Serializability is of two types:
1. Conflict serializability
2. View serializability
When multiple transactions are running concurrently, there is a possibility that the
database may be left in an inconsistent state. Serializability is the concept that helps us check
which schedules are serializable: a serializable schedule is one that always leaves the database in a
consistent state.
Conflict Serializability
It is one of the types of serializability, used to check whether a non-serial schedule is conflict
serializable or not.
A schedule is called conflict serializable if we can convert it into a serial schedule after swapping its non-
conflicting operations.
Conflicting operations:-
Two operations are said to be in conflict if they satisfy all the following three conditions:
1. They belong to different transactions.
2. They operate on the same data item.
3. At least one of them is a write operation.
Example:
Operation W(X) of transaction T1 and operation R(X) of transaction T2 are conflicting operations,
because they satisfy all three conditions: they belong to different transactions,
they work on the same data item, and one of the operations is a write operation.
By contrast, two read operations on the same data item, such as R(X) of T1 and R(X) of T2, are non-conflicting.
Two schedules are said to be conflict Equivalent if one schedule can be converted into other schedule after
swapping non-conflicting operations.
Let's check whether a schedule is conflict serializable or not: if a schedule is conflict equivalent to a serial
schedule, then it is a conflict serializable schedule. Let's take a few examples of schedules.
View Serializability
View serializability is a process to find out whether a given schedule is view serializable or not.
o A schedule will be view serializable if it is view equivalent to a serial schedule.
o If a schedule is conflict serializable, then it will also be view serializable.
o A view serializable schedule which is not conflict serializable contains blind writes.
View Equivalent
Two schedules S1 and S2 are said to be view equivalent if they satisfy the following conditions:
1. Initial Read
An initial read of both schedules must be the same. Suppose two schedule S1 and S2. In schedule S1, if a
transaction T1 is reading the data item A, then in S2, transaction T1 should also read A.
Schedule s1:
T1: Read (A)
T2: Write (A)
Schedule s2:
T2: Write (A)
T1: Read (A)
The above two schedules are view equivalent because the initial read operation in S1 is done by T1, and in S2 it is
also done by T1.
2. Updated Read
In schedule S1, if Ti is reading A which is updated by Tj then in S2 also, Ti should read A which is updated
by Tj.
Schedule S1:
T1: Write (A)
T2: Write (A)
T3: Read (A)
Schedule S2:
T2: Write (A)
T1: Write (A)
T3: Read (A)
The above two schedules are not view equivalent because, in S1, T3 is reading A updated by T2, while in S2, T3 is
reading A updated by T1.
3. Final Write
A final write must be the same between both the schedules. In schedule S1, if a transaction T1 updates A
at last then in S2, final writes operations should also be done by T1.
Schedule S1:
T1: Write (A)
T2: Read (A)
T3: Write (A)
Schedule S2:
T2: Read (A)
T1: Write (A)
T3: Write (A)
The above two schedules are view equivalent with respect to the final write, because the final write operation in S1 is done by T3 and in S2 the final
write operation is also done by T3.
Example:
T1: Read (A)
T2: Write (A)
T1: Write (A)
T3: Write (A)
Schedule S
With 3 transactions, the total number of possible serial schedules = 3! = 6:
S1 = <T1 T2 T3>
S2 = <T1 T3 T2>
S3 = <T2 T3 T1>
S4 = <T2 T1 T3>
S5 = <T3 T1 T2>
S6 = <T3 T2 T1>
Taking the first serial schedule S1:
T1: Read (A); Write (A)
T2: Write (A)
T3: Write (A)
Schedule S1
In both schedules S and S1, there is no read except the initial read that's why we don't need to check that
condition.
The initial read operation in S is done by T1 and in S1, it is also done by T1.
The final write operation in S is done by T3 and in S1, it is also done by T3. So, S and S1 are view
Equivalent.
The first schedule S1 satisfies all three conditions, so we don't need to check another schedule.
Hence, schedule S is view serializable, and it is view equivalent to the serial schedule S1: T1 → T2 → T3.
Recoverability:
Sometimes a transaction may not execute completely due to a software
issue, system crash, or hardware failure. In that case, the failed transaction has to be rolled back. But some
other transaction may also have used a value produced by the failed transaction.
We need to address the effect of transaction failures on concurrently running transactions.
Recoverable schedule:- If a transaction Tj reads a data item previously written by a transaction Ti, then the
commit operation of Ti must appear before the commit operation of Tj.
T1: Read (A); Write (A)
T2: Read (A)
T1: Read (B)
The above schedule is not recoverable if T2 commits immediately after its read.
If T1 should abort, T2 would have read (and possibly shown to the user) an inconsistent database state. Hence
database must ensure that schedules are recoverable.
T1 T2 T3
Read (A)
Read (B)
Write (A)
Read (A)
Write (A)
Read (A)
If T1 fails, T2 and T3 must also be rolled back (a cascading rollback).
Cascadeless schedules: schedules in which cascading rollbacks cannot occur. For each pair of transactions Ti and Tj such that Tj reads a data item previously written by Ti, the commit operation of Ti appears before the read operation of Tj.
Failure Classification
To find that where the problem has occurred, we generalize a failure into the following categories:
1. Transaction failure
2. System crash
3. Disk failure
1. Transaction failure
A transaction failure occurs when a transaction fails to execute or reaches a point from which it cannot proceed any further. If only a few transactions or processes are affected, it is called a transaction failure.
1. Logical errors: a transaction cannot complete due to a code error or an internal
error condition.
2. System errors: the DBMS itself terminates an active transaction because the
database system is unable to execute it. For example, the system aborts an active
transaction in case of deadlock or resource unavailability.
2. System Crash
o System failure can occur due to power failure or other hardware or software
failure. Example: Operating system error.
3. Disk Failure
o It occurs when hard-disk drives or storage drives fail. This was a common
problem in the early days of technology evolution.
o Disk failure occurs due to the formation of bad sectors, a disk head crash, unreachability
of the disk, or any other failure that destroys all or part of disk storage.
Concurrency control
In a multiprogramming environment where multiple transactions can be executed simultaneously, it
is highly important to control the concurrency of transactions. We have concurrency control protocols
to ensure atomicity, isolation, and serializability of concurrent transactions.
Concurrency control protocols can be broadly divided into two categories −
Lock-based Protocols
Database systems equipped with lock-based protocols use a mechanism by which any transaction
cannot read or write data until it acquires an appropriate lock on it. Locks are of two kinds −
Binary Locks − A lock on a data item can be in two states; it is either locked or unlocked.
Shared/exclusive − This type of locking mechanism differentiates the locks based on their uses. If a lock is
acquired on a data item to perform a write operation, it is an exclusive lock. Allowing more than one
transaction to write on the same data item would lead the database into an inconsistent state. Read locks
are shared because no data value is being changed.
There are four types of lock protocols available −
Simplistic Lock Protocol
Simplistic lock-based protocols allow transactions to obtain a lock on every object before a 'write'
operation is performed. Transactions may unlock the data item after completing the ‘write’ operation.
Pre-claiming Lock Protocol
Pre-claiming protocols evaluate their operations and create a list of data items on which they need
locks. Before initiating an execution, the transaction requests the system for all the locks it needs
beforehand. If all the locks are granted, the transaction executes and releases all the locks when all
its operations are over. If all the locks are not granted, the transaction rolls back and waits until all
the locks are granted.
Two-Phase Locking (2PL)
Two-phase locking has two phases: a growing phase, in which the transaction acquires all its
locks, and a shrinking phase, in which the locks held by the transaction are released.
To claim an exclusive (write) lock, a transaction must first acquire a shared (read) lock and then
upgrade it to an exclusive lock.
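A minimal sketch (my own, not from the text) of the two-phase rule itself: once a transaction releases any lock, it may never acquire another. The class and exception names are illustrative.

```python
class TwoPhaseLockingError(Exception):
    pass

class Transaction2PL:
    """Toy transaction that enforces the two-phase rule: the first unlock
    starts the shrinking phase, after which no new lock may be acquired."""
    def __init__(self, name):
        self.name = name
        self.held = set()
        self.shrinking = False

    def lock(self, item):
        if self.shrinking:
            raise TwoPhaseLockingError("cannot acquire a lock in the shrinking phase")
        self.held.add(item)          # growing phase

    def unlock(self, item):
        self.shrinking = True        # shrinking phase begins
        self.held.discard(item)

t = Transaction2PL("T1")
t.lock("A"); t.lock("B")   # growing phase
t.unlock("A")              # shrinking phase begins
# t.lock("C")              # would raise TwoPhaseLockingError
```

Strict 2PL, described next, simply delays every unlock until the commit point.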
Strict Two-Phase Locking
The first phase of Strict-2PL is same as 2PL. After acquiring all the locks in the first phase, the
transaction continues to execute normally. But in contrast to 2PL, Strict-2PL does not release a lock
after using it. Strict-2PL holds all the locks until the commit point and releases all the locks at a time.
Strict-2PL does not have cascading abort as 2PL does.
Timestamp-based Protocols
The most commonly used concurrency protocol is the timestamp based protocol. This protocol uses
either system time or logical counter as a timestamp.
Lock-based protocols manage the order between the conflicting pairs among transactions at the time
of execution, whereas timestamp-based protocols start working as soon as a transaction is created.
Every transaction has a timestamp associated with it, and the ordering is determined by the age of
the transaction. A transaction created at 0002 clock time would be older than all other transactions
that come after it. For example, any transaction 'y' entering the system at 0004 is two seconds
younger and the priority would be given to the older one.
In addition, every data item is given the latest read and write-timestamp. This lets the system know
when the last ‘read and write’ operation was performed on the data item.
Timestamp ordering rules can be modified to make the schedule view serializable: instead of rolling back the transaction Ti, the obsolete 'write' operation itself is ignored (this is known as Thomas' write rule).
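A minimal sketch (my own, not from the text) of the basic timestamp-ordering checks, with a flag for Thomas' write rule. The names and the per-item timestamp table are illustrative.

```python
class Abort(Exception):
    pass

# Per-item timestamps: item -> [read_ts, write_ts], both initially 0.
ts = {"X": [0, 0]}

def read(item, tx_ts):
    r, w = ts[item]
    if tx_ts < w:                 # T would read a value written "in its future"
        raise Abort
    ts[item][0] = max(r, tx_ts)   # record the latest read timestamp

def write(item, tx_ts, thomas=True):
    r, w = ts[item]
    if tx_ts < r:                 # a younger transaction already read the item
        raise Abort
    if tx_ts < w:                 # an obsolete write
        if thomas:
            return                # Thomas' write rule: ignore instead of abort
        raise Abort
    ts[item][1] = tx_ts

write("X", tx_ts=5)
write("X", tx_ts=3)               # older write arrives late: silently ignored
print(ts["X"])                    # [0, 5]
```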
Deadlock in DBMS
A deadlock is a condition where two or more transactions are waiting indefinitely for one another to give
up locks. Deadlock is said to be one of the most feared complications in DBMS as no task ever gets
finished and is in waiting state forever.
For example: In the student table, transaction T1 holds a lock on some rows and needs to update some
rows in the grade table. Simultaneously, transaction T2 holds locks on some rows in the grade table and
needs to update the rows in the Student table held by Transaction T1.
Now, the main problem arises. Now Transaction T1 is waiting for T2 to release its lock and similarly,
transaction T2 is waiting for T1 to release its lock. All activities come to a halt state and remain at a
standstill. It will remain in a standstill until the DBMS detects the deadlock and aborts one of the
transactions.
Deadlock Avoidance
o When a database gets stuck in a deadlock state, it is better to avoid the deadlock in the first place than to abort or restart transactions afterwards, since that wastes time and resources.
o A deadlock avoidance mechanism is used to detect any deadlock situation in advance. A method like
the "wait-for graph" is used for detecting deadlock situations, but this method is suitable only for
smaller databases. For larger databases, the deadlock prevention method can be used.
Deadlock Detection
In a database, when a transaction waits indefinitely to obtain a lock, the DBMS should detect
whether the transaction is involved in a deadlock or not. The lock manager maintains a wait-for graph
to detect deadlock cycles in the database.
Wait for Graph
o This is a suitable method for deadlock detection. A graph is created based on the
transactions and their locks. If the created graph contains a cycle (closed loop), then there is a deadlock.
o The wait-for graph is maintained by the system for every transaction that is waiting for
data held by another. The system keeps checking the graph for cycles.
For the scenario above, the wait-for graph would contain an edge from T1 to T2 and an edge from T2 to T1, which forms a cycle; a detection sketch follows.
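A minimal sketch (my own, not from the text) of cycle detection on a wait-for graph, using a standard depth-first search with three node colors. The function name and graph encoding are illustrative.

```python
def has_deadlock(wait_for):
    """wait_for maps a transaction to the transactions it is waiting on.
    A cycle in this graph means a deadlock."""
    WHITE, GRAY, BLACK = 0, 1, 2
    color = {t: WHITE for t in wait_for}
    def visit(t):
        color[t] = GRAY                          # on the current DFS path
        for u in wait_for.get(t, ()):
            if color.get(u, WHITE) == GRAY:      # back edge: cycle found
                return True
            if color.get(u, WHITE) == WHITE and visit(u):
                return True
        color[t] = BLACK                         # fully explored
        return False
    return any(color[t] == WHITE and visit(t) for t in list(color))

# T1 waits for T2 and T2 waits for T1: the classic deadlock above.
print(has_deadlock({"T1": ["T2"], "T2": ["T1"]}))   # True
print(has_deadlock({"T1": ["T2"], "T2": []}))       # False
```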
Deadlock Prevention
o Deadlock prevention method is suitable for a large database. If the resources are allocated in such
a way that deadlock never occurs, then the deadlock can be prevented.
o The database management system analyzes the operations of a transaction to determine whether
they can create a deadlock situation. If they can, the DBMS never allows that transaction to be
executed.
Wait-Die scheme
In this scheme, if a transaction requests a resource that is already held with a conflicting lock by
another transaction, the DBMS simply checks the timestamps of both transactions and allows the older
transaction to wait until the resource is available.
Let's assume there are two transactions Ti and Tj, and let TS(T) be the timestamp of any transaction T. Suppose Tj holds a lock on a resource and Ti requests it. The DBMS then performs one of the following actions:
1. If TS(Ti) < TS(Tj), i.e., Ti is the older transaction, then Ti is allowed to wait until the data item is
available: an older transaction waiting for a resource locked by a younger one simply waits.
2. If TS(Ti) > TS(Tj), i.e., Ti is the younger transaction, then Ti is killed (it "dies") and is restarted
later after a random delay, but with the same timestamp.
Wound wait scheme
o In the wound-wait scheme, if an older transaction requests a resource held by a younger
transaction, the older transaction wounds the younger one: it forces the younger transaction to abort
and release the resource. After a small delay, the younger transaction is restarted with the same
timestamp.
o If a younger transaction requests a resource held by an older transaction, the younger transaction
is asked to wait until the older one releases it. A decision-function sketch for both schemes follows.
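A minimal sketch (my own, not from the text) comparing the two decisions; smaller timestamp means older. The function names are illustrative.

```python
def wait_die(requester_ts, holder_ts):
    # Older requester may wait; younger requester dies.
    return "wait" if requester_ts < holder_ts else "die (restart with same timestamp)"

def wound_wait(requester_ts, holder_ts):
    # Older requester wounds (aborts) the younger holder; younger requester waits.
    return "wound holder" if requester_ts < holder_ts else "wait"

print(wait_die(1, 2))     # older requester  -> wait
print(wait_die(2, 1))     # younger requester -> die
print(wound_wait(1, 2))   # older requester  -> wound holder
print(wound_wait(2, 1))   # younger requester -> wait
```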
Starvation:-
Starvation (or livelock) is the situation in which a transaction has to wait an indefinite period of time to acquire a
lock.
Reasons for starvation:
The waiting scheme for locked items is unfair (e.g., a priority queue).
Victim selection (the same transaction is repeatedly selected as the victim).
Resource leaks.
Denial-of-service attacks.
Problem: Starvation is best explained with an example. Suppose there are three transactions T1,
T2, and T3 in a database trying to acquire a lock on data item 'I'. The scheduler grants the lock to
T1 (perhaps due to some priority), and the other two transactions wait. As soon as the execution of T1 is
over, another transaction T4 arrives and requests a lock on data item I. This time the scheduler grants the lock to
T4, and T2 and T3 have to wait again. If new transactions keep requesting the lock in this way, T2 and T3 may have
to wait for an indefinite period of time, which leads to starvation.
Solution of Starvation:
1. Increasing priority –
Starvation occurs when a transaction has to wait an indefinite time, so we can increase the priority
of that particular transaction. The drawback of this solution is that other transactions may have to
wait even longer until the highest-priority transaction completes.
2. Modifying victim selection –
If a transaction has been a victim of repeated selections, the algorithm can be modified by raising
its priority over other transactions.
3. Fair scheduling –
A fair scheduling approach, i.e., FCFS (first come, first served), can be adopted, in which transactions
acquire a lock on an item in the order in which they requested it.
4. Timestamp-based schemes –
Wait-die and wound-wait, described above, use the timestamp ordering mechanism of transactions.
Section 7: INDEXING & HASHING
INDEXING:
We know that data is stored in the form of records. Every record has a key field, which helps it to be
recognized uniquely.
An index is a physical structure that contains pointers to the data.
Indexing is a way to optimize the performance of a database by minimizing the number of disk accesses required when a
query is processed.
It is a data structure technique which is used to quickly locate and access the data in a database.
Indexing in database systems is similar to what we see in books.
When a database is very large, even the smallest transaction takes time to perform. Indexes are used to reduce the
time spent in such transactions.
Users cannot see the indexes; they are just used to speed up queries. Effective indexes are one of the best ways to improve
performance in a database application.
Advantages of Indexing:
They make it possible to quickly retrieve (fetch) data.
They can be used for sorting.
Their use in queries usually results in much better performance.
Unique indexes guarantee uniquely identifiable records in the database.
Disadvantages of Indexing:
They decrease performance on inserts, updates, and deletes.
They take up space (this increases with the number of fields used and the length of the fields).
To perform indexing, the database management system needs a primary key on the table with a unique value.
You are not allowed to partition an index-organized table.
You can't build other indexes on already indexed data.
You can't sort data in the leaf nodes, as the value of the primary key determines the order.
Types of Indexing:
(Figure: overall structure of indexing.)
Ordered Indices:
The indices are usually sorted to make searching faster. The indices which are sorted are known as ordered
indices. These are generally fast and a more traditional type of storing mechanism.
Imagine we have a student table with thousands of records, each 10 bytes long, with IDs starting
from 1, 2, 3, and so on, and we have to search for the student with ID 572. In a normal database with no
index, the disk blocks are searched from the beginning until record 572 is reached, so the DBMS will reach this record only after
reading 571*10 = 5710 bytes. But if we have an index on the ID column, then the address of each record is
stored in the index as (1, 200), (2, 201), … (572, 771), and so on; one can imagine it as a smaller table with
an index column and an address column. Now, searching for record ID 572 through the index
traverses only 571*2 = 1142 bytes, which is far less than before, and the entry found there points directly to the record on disk. Hence
retrieving the record from the disk becomes faster. In most cases these indexes are kept sorted to
make searching faster; sorted indexes are called ordered indices.
Primary Index:
If the index is created on the primary key of the table, it is called primary indexing. Since primary
keys are unique to each record and have a 1:1 relation with the records, it is much easier to fetch a
record using them.
As primary keys are stored in sorted order, the performance of the searching operation is quite efficient.
The primary index will be categorized into 2 parts: Dense index and Sparse index.
Dense index:
The dense index contains an index record for every search-key value in the data file, which makes searching
faster.
In this approach, the number of records in the index table is the same as the number of records in the main table,
so it needs more space to store the index itself. Each index record holds the search-key value and a pointer to the
actual record on the disk.
Sparse Index:
In a sparse index, index records appear only for a few items in the data file; each entry points to a block.
Instead of pointing to every record in the main table, the index points to records in the main table with gaps.
In this method of indexing, a range of key values shares the same data-block address, and when data is to be retrieved,
the block is scanned linearly until the requested data is found.
Example:
As you can see, the data has been divided into several blocks, each containing a fixed number of records (in our case, 10).
The pointer in the index table points to the first record of each data block, which is known as the anchor record.
If you are searching for roll 14, the index is first searched to find the highest entry that is smaller than or equal to 14; we
get 11. The pointer leads us to roll 11, from where a short sequential search finds roll 14. A lookup sketch follows.
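A minimal sketch (my own, not from the text) of that anchor-record lookup, using a binary search over the sparse entries; the index contents are illustrative stand-ins for the figure.

```python
import bisect

# Sparse index: one (anchor_key, block_no) entry per block of the sorted data file.
index = [(1, 0), (11, 1), (21, 2), (31, 3)]   # anchor record of each 10-record block

def locate_block(key):
    """Find the block whose anchor key is the highest one <= key; a short
    sequential scan inside that block then finds the record itself."""
    keys = [k for k, _ in index]
    pos = bisect.bisect_right(keys, key) - 1
    return index[pos][1]

print(locate_block(14))   # block 1 (anchor 11), matching the roll-14 example
```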
Clustering Index:
In some cases, the index is created on non-primary-key columns, which may not be unique for each record. In such cases,
in order to identify the records faster, we group two or more columns together to get unique values and create an
index out of them.
Example: suppose a company has several employees in each department. If we use a clustering index,
all employees belonging to the same Dept_ID are considered to be within a single cluster, and the index pointers point to
the cluster as a whole. Here Dept_ID is a non-unique key.
The previous scheme is a little confusing because one disk block is shared by records belonging to different clusters.
Using a separate disk block for each cluster is a better technique.
Secondary Index:
In sparse indexing, as the size of the table grows, the size of the mapping also grows. These mappings are usually kept in
primary memory so that address lookup is fast; the secondary memory is then searched for the actual data based
on the address obtained from the mapping. If the mapping grows too large, fetching the address itself becomes slow,
and the sparse index is no longer efficient. To overcome this problem, secondary indexing is introduced.
A non-clustered (secondary) index just tells us where the data lies, i.e., it gives us a list of virtual pointers or references to
the locations where the data is actually stored.
Data is not physically stored in the order of the index; instead, data pointers are present in the leaf nodes.
Example:
Multilevel Indexing:
Multilevel indexing is used when a primary index does not fit in memory.
To reduce the number of disk accesses to index records, the primary index is kept on disk as a sequential file
and a sparse index is constructed on that file.
A multi-level index breaks the index down into several smaller indices, making the outermost
level so small that it can be saved in a single disk block, which can easily be accommodated anywhere in
main memory.
HASHING
In DBMS, hashing is a technique to directly search the location of desired data on the disk without using index
structure.
Data is stored in the form of data blocks whose addresses are generated by applying a hash function; the
memory location where these records are stored is known as a data block or data bucket.
Hashing uses hash functions with search keys as parameters to generate the address of a data record.
Here, are the situations in the DBMS where you need to apply the Hashing method:
For a huge database structure, it is impractical to search through all the index levels and then
reach the destination data block to get the desired data.
Hashing method is used to index and retrieve items in a database as it is faster to search that specific item
using the shorter hashed key instead of using its original value.
Hashing is an ideal method to calculate the direct location of a data record on the disk without using index
structure.
It is also a helpful technique for implementing dictionaries.
Data bucket – Data buckets are memory locations where the records are stored. It is also known as Unit Of
Storage.
Key: A DBMS key is an attribute or set of an attribute which helps you to identify a row(tuple) in a
relation(table). This allows you to find the relationship between two tables.
Hash function: a hash function h is a mapping function that maps the set of all search keys K to the addresses
where the actual records are placed.
Linear probing – linear probing uses a fixed interval between probes. In this method, the next available data
block is used to enter the new record, instead of overwriting the older record.
Quadratic probing – it helps determine the new bucket address by adding intervals between
probes, using the successive outputs of a quadratic polynomial added to the starting value given by the original
computation.
Hash index – the address of the data block. A hash function can range from a simple mathematical function to
a complex mathematical function.
Double hashing – double hashing is a computer programming method used in hash tables to resolve
hash collisions.
Bucket overflow: the condition of bucket overflow is called a collision. This is a fatal state for any static hash
function.
There are two types of hashing techniques:
1. Static hashing
2. Dynamic hashing
Static Hashing
In the static hashing, the resultant data bucket address will always remain the same.
Therefore, if you generate an address for say Student_ID = 10 using hashing function mod(3), the resultant
bucket address will always be 1. So, you will not see any change in the bucket address.
Therefore, in this static hashing method, the number of data buckets in memory always remains constant.
Inserting a record: when a new record needs to be inserted into the table, an address is generated for
the new record using its hash key, and the record is stored at that
location.
Searching: When you need to retrieve the record, the same hash function should be helpful to retrieve the
address of the bucket where data should be stored.
Delete a record: using the hash function, you can first fetch the record you want to delete, then
remove the record from that address in memory.
Update a record: to update a record, we first search for it using the hash function, and then the data record is
updated.
Operation
Insertion − When a record is required to be entered using static hash, the hash function h computes
the bucket address for search key K, where the record will be stored.
Bucket address = h (K)
Search − When a record needs to be retrieved, the same hash function can be used to retrieve the
address of the bucket where the data is stored.
Delete − This is simply a search followed by a deletion operation.
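A minimal sketch (my own, not from the text) of these operations with the mod(3) hash from the example above; the bucket layout and names are illustrative.

```python
buckets = {i: [] for i in range(3)}   # fixed number of buckets in static hashing

def h(key):
    return key % 3                    # the mod(3) hash function from the text

def insert(key, record):
    buckets[h(key)].append((key, record))

def search(key):
    return [r for k, r in buckets[h(key)] if k == key]

def delete(key):
    buckets[h(key)] = [(k, r) for k, r in buckets[h(key)] if k != key]

insert(10, "student-10")
print(h(10), search(10))   # Student_ID = 10 always hashes to bucket 1
```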
Bucket overflow in static hashing is handled in one of two ways:
1. Open hashing
2. Close hashing
Open Hashing
In the open hashing method, instead of overwriting the older record, the next available data block is used to enter the
new record. This method is also known as linear probing.
Example: suppose R3 is a new record that needs to be inserted, and the hash function generates address
112 for R3. But the generated address is already full, so the system searches the next available data bucket,
113, and assigns R3 to it.
Close Hashing
In the close hashing method, when a bucket is full, a new bucket is allocated for the same hash result and is
linked after the previous one.
Example: suppose R3 is a new record that needs to be inserted into the table, and the hash function
generates address 110 for it. But this bucket is full. In this case, a new bucket is
allocated at the end of bucket 110 and is linked to it. A sketch of both overflow strategies follows.
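A minimal sketch (my own, not from the text) contrasting the two strategies, following this text's usage of the terms: open hashing probes the next free slot, close hashing chains an overflow list. Names and the table size are illustrative.

```python
SIZE = 5
table = [None] * SIZE                # open hashing: one flat array of slots

def insert_open(key, record):
    # Linear probing: if the home slot is taken, try the next slot, and so on.
    for step in range(SIZE):
        slot = (key + step) % SIZE
        if table[slot] is None:
            table[slot] = (key, record)
            return slot
    raise RuntimeError("table full")

chains = [[] for _ in range(SIZE)]   # close hashing: overflow records are chained

def insert_closed(key, record):
    # A full home bucket simply grows a linked overflow chain.
    chains[key % SIZE].append((key, record))

print(insert_open(2, "R1"))   # slot 2
print(insert_open(7, "R2"))   # 7 also hashes to 2, so it lands in slot 3
insert_closed(2, "R1"); insert_closed(7, "R2")
print(chains[2])              # both records chained in bucket 2
```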
Dynamic Hashing
The problem with static hashing is that it does not expand or shrink dynamically as the size of the database
grows or shrinks.
Dynamic hashing offers a mechanism in which data buckets are added and removed dynamically and on
demand. In this hashing, the hash function helps you to create a large number of values.
o The dynamic hashing method is used to overcome the problems of static hashing like bucket overflow.
o In this method, data buckets grow or shrink as the records increases or decreases. This method is also known
as Extendable hashing method.
o This method makes hashing dynamic, i.e., it allows insertion or deletion without resulting in poor performance.
For example:
Consider the following grouping of keys into buckets, depending on the prefix of their hash address:
The last two bits of 2 and 4 are 00. So it will go into bucket B0. The last two bits of 5 and 6 are 01, so it will go
into bucket B1. The last two bits of 1 and 3 are 10, so it will go into bucket B2. The last two bits of 7 are 11, so
it will go into B3.
Insert key 9 with hash address 10001 into the above structure:
o Key 9 has hash address 10001, whose last two bits are 01, so it must go into bucket B1. But bucket B1 is full, so it will get split.
o The split separates 5 and 9 from 6: the last three bits of 5 and 9 are 001, so they stay in bucket B1, and the
last three bits of 6 are 101, so it goes into a new bucket B5.
o Keys 2 and 4 are still in B0. The records in B0 are pointed to by the 000 and 100 directory entries, because the last two bits of both
entries are 00.
o Keys 1 and 3 are still in B2. The records in B2 are pointed to by the 010 and 110 entries, because the last two bits of both
entries are 10.
o Key 7 is still in B3. The record in B3 is pointed to by the 111 and 011 entries, because the last two bits of both entries
are 11.
What is Collision?
A hash collision occurs when the hashes of two or more items in the data set map to the
same location in the hash table.
There are two technique which you can use to avoid a hash collision:
1. Rehashing: this method invokes a secondary hash function, which is applied repeatedly until an empty slot
is found where the record can be placed.
2. Chaining: the chaining method builds a linked list of items whose keys hash to the same value. This method
requires an extra link field in each table position.
Hashing is not favorable when the data is organized in some ordering and queries require a range of data;
when data is discrete and random, hashing performs best.
Hashing algorithms can be more complex to implement than indexing, but hash operations are done in constant time.
B+ Tree
The B+ tree is a balanced search tree in which each node can have multiple children; it follows a multi-level index format.
In the B+ tree, leaf nodes hold the actual data pointers, and the tree ensures that all leaf nodes remain at
the same height.
In the B+ tree, the leaf nodes are connected by a linked list. Therefore, a B+ tree can support random access as
well as sequential access.
Structure of B+ tree:
o In the B+ tree, every leaf node is at equal distance from the root node. The B+ tree is of the order n where n is
fixed for every B+ tree.
o It contains an internal node and leaf node.
Internal node
o An internal node of the B+ tree can contain at least n/2 child pointers, except the root node.
o At most, an internal node of the tree contains n pointers.
Leaf node
o A leaf node of the B+ tree can contain at least n/2 record pointers and n/2 key values.
o At most, a leaf node contains n record pointers and n key values.
o Every leaf node of the B+ tree contains one block pointer P to point to the next leaf node.
Searching a record in B+ Tree
Suppose we have to search for 55 in the B+ tree structure below. First we look in the intermediary node,
which directs us to the leaf node that can contain the record for 55.
In the intermediary node, we find the branch between the 50 and 75 keys, which redirects us to the
third leaf node. There the DBMS performs a sequential search to find 55. A search sketch follows.
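A minimal sketch (my own, not from the text) of that descend-then-scan search; the node shape and the tiny two-leaf tree are illustrative, not the figure's exact tree.

```python
import bisect

class Node:
    def __init__(self, keys, children=None, records=None, next_leaf=None):
        self.keys = keys
        self.children = children    # internal node: len(children) == len(keys) + 1
        self.records = records      # leaf node: one record per key
        self.next_leaf = next_leaf  # leaf chain enabling sequential access

def search(node, key):
    while node.children is not None:                  # descend internal nodes
        node = node.children[bisect.bisect_right(node.keys, key)]
    for k, r in zip(node.keys, node.records):         # sequential scan in the leaf
        if k == key:
            return r
    return None

leaf1 = Node([25, 40], records=["r25", "r40"])
leaf2 = Node([50, 55, 70], records=["r50", "r55", "r70"])
leaf1.next_leaf = leaf2
root = Node([50], children=[leaf1, leaf2])
print(search(root, 55))   # "r55": branch to the right of 50, then scan the leaf
```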
B+ Tree Insertion
Suppose we want to insert the record 60 into the structure below. It should go into the 3rd leaf node, after 55. The
tree is balanced and that leaf node is already full, so we cannot insert 60 there.
In this case, we have to split the leaf node so that 60 can be inserted into the tree without affecting the fill factor,
balance, and order.
The 3rd leaf node would hold the values (50, 55, 60, 65, 70), and its branch key in the parent is 50. We split the leaf node
in the middle so that the balance is not altered, grouping (50, 55) and (60, 65, 70) into two leaf
nodes.
If these two have to be leaf nodes, the intermediate node cannot branch from 50 alone. It must have 60 added to it,
along with a pointer to the new leaf node.
This is how we insert an entry when there is overflow. In the normal scenario, it is very easy to find the node
where the new key fits and place it in that leaf node.
B+ Tree Deletion
Suppose we want to delete 60 from the above example. In this case, we have to remove 60 from the
intermediate node as well as from the 4th leaf node. If we simply remove it, the
tree will no longer satisfy the rules of the B+ tree, so we must rearrange the nodes to keep the tree balanced.
After deleting node 60 from above B+ tree and re-arranging the nodes, it will show as follows:
Section 8: RAID
RAID (Redundant Array of Independent Disks) combines multiple physical disks into one logical unit for performance, redundancy, or both.
RAID 0:-
RAID 0 stripes data blocks evenly across the disks, with no mirroring and no parity.
Blocks striped, no mirror, no parity.
Minimum 2 disks.
Advantages:
RAID 0 offers great performance, both in read and write operations. There is no overhead caused by parity
controls.
All storage capacity is used, there is no overhead.
The technology is easy to implement.
Disadvantages:
RAID 0 is not fault-tolerant. If one drive fails, all data in the RAID 0 array are lost. It should not be used for
mission-critical systems.
RAID 1:-
RAID 1 writes and reads identical data to pairs of drives. This process is often called data mirroring,
and its primary function is to provide redundancy. If any of the disks in the array fails, the system
can still access data from the remaining disk(s). Once you replace the faulty disk with a new one, the
data is copied to it from the functioning disk(s) to rebuild the array. RAID 1 is the easiest way to
create failover storage.
Blocks Mirrored, No stripe, No parity
Minimum 2 disks.
Good performance (no striping, no parity).
Excellent redundancy ( as blocks are mirrored )
Business use: Standard application servers where data redundancy and availability is important.
Disk 1 Disk 2
1 1
4 4
7 7
Advantages
RAID 1 offers excellent read speed and a write speed comparable to that of a single drive.
In case a drive fails, data does not have to be rebuilt; it just has to be copied to the replacement drive.
RAID 1 is a very simple technology.
Disadvantages
The main disadvantage is that the effective storage capacity is only half of the total drive capacity because all
data get written twice.
Software RAID 1 solutions do not always allow a hot swap of a failed drive. That means the failed drive can
only be replaced after powering down the computer it is attached to. For servers that are used simultaneously
by many people, this may not be acceptable. Such systems typically use hardware controllers that do support
hot swapping.
RAID 5:-
RAID 5 stripes data blocks across multiple disks like RAID 0; however, it also stores parity
information (a small amount of data that can accurately describe larger amounts of data), which is used
to recover the data in case of disk failure.
This level offers both speed (data is accessed from multiple disks) and redundancy, as parity data is
distributed across all of the disks. If any of the disks in the array fails, data is recreated from the
remaining distributed data and parity blocks. It uses the equivalent of one disk's capacity for storing
parity information (approximately one-third in a three-disk array).
Blocks Striped, Distributed Parity
Minimum 3 disks.
Good performance ( as blocks are striped ).
Good redundancy ( distributed parity ).
The most cost-effective option providing both performance and redundancy; use it for databases that are heavily
read-oriented. Write operations will be slow.
Ideal use: File storage servers and application servers.
Advantages
Read data transactions are very fast while write data transactions are somewhat slower (due to the parity that
has to be calculated).
If a drive fails, you still have access to all data, even while the failed drive is being replaced and the storage
controller rebuilds the data on the new drive.
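The rebuild works because the parity block is a byte-wise XOR of the data blocks in a stripe, so XOR-ing the surviving blocks regenerates the missing one. A minimal sketch (my own, not from the text), with invented two-byte blocks:

```python
from functools import reduce

def xor_parity(blocks):
    """Parity block = byte-wise XOR of the data blocks in a stripe."""
    return bytes(reduce(lambda a, b: a ^ b, group) for group in zip(*blocks))

d1, d2 = b"\x0f\xf0", b"\x33\x55"   # two data blocks in one stripe
p = xor_parity([d1, d2])            # parity stored on the third disk

# If the disk holding d2 fails, XOR of the survivors reconstructs it.
recovered = xor_parity([d1, p])
print(recovered == d2)              # True
```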
Disadvantages
Drive failures reduce throughput while the array is degraded, and rebuilding a large array after a failure can take
a long time, during which a second failure would lose data. RAID 5 is also more complex than simple mirroring.
RAID 10:-
RAID 10 combines the mirroring of RAID 1 with the striping of RAID 0. In other words, it combines
the redundancy of RAID 1 with the increased performance of RAID 0. It is best suited for
environments where both high performance and security are required.
Blocks Mirrored, and Blocks Striped.
Minimum 4 disks.
This is also called a “stripe of mirrors”.
Excellent redundancy ( as blocks are mirrored)
Excellent performance ( as blocks are striped)
If you can afford the cost, this is the best option for any mission-critical applications (especially databases).
Ideal use: Highly utilized database servers/ servers performing a lot of write
operations.
Advantages
If something goes wrong with one of the disks in a RAID 10 configuration, the rebuild time is very fast since all
that is needed is copying all the data from the surviving mirror to a new drive. This can take as little as 30
minutes for drives of 1 TB.
Disadvantages
Half of the storage capacity goes to mirroring, so compared to large RAID 5 or RAID 6 arrays, this is an
expensive way to have redundancy.
RAID levels 2, 3, 4, and 7 are theoretically defined but rarely used in practice.
RAID 2
It is similar to RAID 5, but instead of block-level striping with parity, striping occurs at the bit level.
RAID 2 is seldom deployed because the costs to implement it are usually prohibitive (a typical setup
requires 10 disks), and it gives poor performance with some disk I/O operations.
RAID 3
It is also similar to RAID 5, except this solution requires a dedicated parity drive.
RAID 3 is seldom used except in the most specialized database or processing environments,
which can benefit from it.
RAID 4:-
It is a configuration in which disk striping happens at the block level with a dedicated parity drive, rather than at the bit level.
RAID 6:-
It is used frequently in enterprises.
It is identical to RAID 5, except that it is an even more robust solution because it uses one more parity
block than RAID 5 (double parity).
Two disks can die and the system will still be operational.
RAID 7 :-
It is a proprietary level of RAID owned by the now-defunct Storage Computer Corporation.
RAID 0+1:-
It is often confused with RAID 10 (which is RAID 1+0), but the two are not the same.
RAID 0+1 is a mirrored array whose segments are RAID 0 arrays.
It is implemented in specific infrastructures requiring high performance but not a high level of
scalability.
Advantages of RAID:-
Performance, resiliency, and cost are among the major benefits of RAID.
By putting multiple hard drives together, RAID can improve on the work of a single hard drive and,
depending on how it is configured, can increase computer speed and reliability after a crash.
RAID can also lower costs by using lower-priced disks in large numbers.
Servers commonly make use of RAID technology.
Disadvantages of RAID:-
RAID is not a substitute for backups: it cannot protect against accidental deletion, corruption, or site-wide failures.
Some levels (such as RAID 0) provide no redundancy at all, mirroring halves the usable capacity, and rebuilding a
failed drive in a large array can take a long time.
DBMS vs. RDBMS
DBMS: stores data in either a navigational or hierarchical form. RDBMS: uses a tabular structure where the headers are the column names and the rows contain the corresponding values.
DBMS: supports a single user only. RDBMS: supports multiple users.
DBMS: the data may not be stored following the ACID model, which can introduce inconsistencies in the database. RDBMS: relational databases are harder to construct, but they are consistent and well structured; they obey ACID (Atomicity, Consistency, Isolation, Durability).
DBMS: low software and hardware needs. RDBMS: higher hardware and software needs.
DBMS: no relationship between data. RDBMS: data is stored in tables that are related to each other with the help of foreign keys.
DBMS: there is no security. RDBMS: multiple levels of security; log files are created at the OS, command, and object level.
DBMS: data redundancy is common in this model. RDBMS: keys and indexes do not allow data redundancy.
DBMS: does not support client-server architecture. RDBMS: supports client-server architecture.
DBMS: deals with small quantities of data. RDBMS: deals with large amounts of data.
DBMS: does not support distributed databases. RDBMS: supports distributed databases.
DBMS: examples are a file system, XML, the Windows Registry, etc. RDBMS: examples are MySQL, Oracle, SQL Server, etc.
Magnetic disks and magnetic tapes are used to store data in an RDBMS. The disk space analyser maintains records of the
available space and used space on the disk.
Memory Hierarchy
The computer system uses various types of memory to achieve faster execution of processes.
1. Primary memory
Primary memory (cache and main memory) is directly accessible by the CPU; it is fast but volatile and limited in size.
2. Secondary memory
Secondary memory or storage is used to store data in computer system. The secondary storage is relatively
slower than cache or main memory.
Example: Magnetic tape, hard disk, CD, DVD etc.
An index is a small table having two columns: the first column contains the primary key of the table, and the
second column contains a set of pointers holding the disk addresses where each key
(value) can be found.
Indexes are very useful for improving search operations in a DBMS.
The types of indexes are discussed as follows:
1. Function-based indexes
A function-based index computes the value of an expression involving one or more columns of the table and
stores it in the index. The expression can be an arithmetic expression or a SQL function. A function-based index
cannot contain null values.
2. Bitmap indexes
Bitmap indexes work well for low-cardinality columns (columns with few unique values) in tables.
For example: Boolean data, which has only two values, true or false.
Bitmap indexes are very useful in data warehouse applications for joining large fact tables.
3. Domain indexes
Domain index is used to create index type schema object and an application specific index. It is used for
indexing data in application specific domain.
4. Clusters
Clustering in DBMS is designed for high availability of data. Clustering is applied to tables that are
repeatedly used by users.
For example: when there are many employees in a department, we can create an index on a non-unique key,
such as Dept_ID. With this, all employees belonging to the same department are considered to be within the
same cluster.
File organization is used to describe the way in which the records are stored in terms of blocks, and the blocks
are placed on the storage medium.
1. Physical backup
Physical backups are backups of the physical files used in storing and recovering the database, such as
data files, control files, archived redo logs, and log files. A physical backup is a copy of the files storing database
information to some other location, such as a disk or offline storage like magnetic tape. Physical backups are the
foundation of the recovery mechanism in the database; they provide minute details about transactions
and modifications to the database.
2. Logical backup
A logical backup contains logical data extracted from the database. It includes backups of logical objects like
views, procedures, functions, tables, etc. It is a useful supplement to physical backups in many circumstances,
but it is not sufficient protection against data loss without physical backups, because a logical backup provides
only structural information.
DATABASE RECOVERY –
What is recovery?
Recovery is the process of restoring a database to the correct state in the event of a failure. It ensures that the
database is reliable and remains in consistent state in case of a failure.
There are two main techniques of recovery:
1. Rolling forward (reapplying the changes of committed transactions)
2. Rolling back (undoing the changes of uncommitted transactions)
Log-Based Recovery
Logs are sequences of records that record the actions performed by transactions. In
log-based recovery, the log of each transaction is maintained in some stable storage; if any failure
occurs, the database can be recovered from it. The log contains information
about the transaction being executed, the values that have been modified, and the transaction state, all
stored in the order of execution.
Example:
Assume a transaction that modifies the address of an employee. The following logs are written for this
transaction:
Log 1: The transaction has started. Log: <Tn START>
Log 2: The address is changed from 'old value' to 'new value'. Log: <Tn, Address, 'old value', 'new value'>
Log 3: The transaction is completed; the log indicates the end of the transaction. Log: <Tn COMMIT>
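A minimal sketch (my own, not from the text) of how such a log drives recovery: redo the writes of committed transactions, then undo the writes of uncommitted ones in reverse order. The record format and values are illustrative.

```python
# Each log record: ("START", T) | ("WRITE", T, item, old, new) | ("COMMIT", T).
log = [
    ("START", "T1"),
    ("WRITE", "T1", "Address", "old street", "new street"),
    ("COMMIT", "T1"),
    ("START", "T2"),
    ("WRITE", "T2", "Salary", 100, 120),     # T2 never commits
]

db = {"Address": "new street", "Salary": 120}   # state on disk after the crash

committed = {rec[1] for rec in log if rec[0] == "COMMIT"}
for rec in log:                                  # redo committed transactions
    if rec[0] == "WRITE" and rec[1] in committed:
        db[rec[2]] = rec[4]
for rec in reversed(log):                        # undo uncommitted ones
    if rec[0] == "WRITE" and rec[1] not in committed:
        db[rec[2]] = rec[3]

print(db)   # {'Address': 'new street', 'Salary': 100}
```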
Recovery with Concurrent Transaction
When two transactions are executed in parallel, the logs are interleaved. It would become difficult
for the recovery system to return all logs to a previous point and then start recovering.
To overcome this situation 'Checkpoint' is used.
Checkpoint
A checkpoint declares a point before which all log records have been written to permanent storage and the
database is in a consistent state. During recovery, only the log records written after the most recent checkpoint
need to be processed; earlier records can be ignored or discarded.
SQL vs. NO-SQL
SQL: SQL databases are primarily called RDBMS or relational databases. NoSQL: NoSQL databases are primarily called non-relational or distributed databases.
SQL: fixed, static, predefined schema. NoSQL: dynamic schema.
SQL: developed in the 1970s to deal with the issues of flat-file storage. NoSQL: developed in the late 2000s to overcome the issues and limitations of SQL databases.
SQL: table-based databases. NoSQL: can be document-based, key-value pairs, or graph databases.
SQL: vertically scalable. NoSQL: horizontally scalable.
SQL: uses a powerful language, "Structured Query Language", to define and manipulate the data. NoSQL: collections of documents are used to query the data; it is also called unstructured query language and varies from database to database.
SQL: should be used when data validity is paramount. NoSQL: use it when it is more important to have fast data than strictly correct data.
SQL: not best suited for hierarchical data storage. NoSQL: best suited for hierarchical data storage.
SQL: ACID (Atomicity, Consistency, Isolation, Durability) is the standard for RDBMS. NoSQL: BASE (Basically Available, Soft state, Eventually consistent) is the model of many NoSQL systems.
SQL: MySQL, Oracle, SQLite, PostgreSQL, MS-SQL, etc. are examples of SQL databases. NoSQL: MongoDB, BigTable, Redis, RavenDB, Cassandra, HBase, Neo4j, CouchDB, etc. are examples of NoSQL databases.
Clustered vs. Non-Clustered Index
Clustered: a clustered index sorts the data rows in the table on their key values. Non-clustered: a non-clustered index stores the data at one location and the indexes at another location.
Clustered: there is only one clustered index per table. Non-clustered: a single table can have many non-clustered indexes, since the index is stored separately from the data.
Clustered: defined on the ordering field of the table. Non-clustered: defined on a non-ordering field of the table.
Clustered: actually describes the order in which records are physically stored on disk, which is why you can have only one; it is essentially a sorted copy of the data in the indexed columns. Non-clustered: defines a logical order that does not match the physical order on disk.
Clustered: faster to read than non-clustered, as data is physically stored in index order. Non-clustered: quicker for insert and update operations than a clustered index.
Clustered: usually made on the primary key. Non-clustered: can be made on any key.
Clustered: the leaf nodes of a clustered index contain the data pages. Non-clustered: the leaf nodes do not contain the data pages; instead, they contain index rows.
SECTION 9: DBMS STORAGE & FILE STRUCTURE:
9.1 DBMS Storage & File Structure
9.2 DBMS Backup & Recovery
9.3 DBMS vs. RDBMS
9.4 SQL vs. NO-SQL
9.5 Clustered vs. Non-Clustered Index
1. The storage structure which does not survive system crashes is ______
A. Volatile storage
B. Non-volatile storage
C. Stable storage
D. Dynamic storage
Answer: A
Explanation: Volatile storage is computer memory that requires power to maintain the stored information;
its contents are lost when the power is interrupted.
2. Storage devices like tertiary storage and magnetic disks come under
A. Volatile storage
B. Non-volatile storage
C. Stable storage
D. Dynamic storage
Answer: b
Explanation: Information residing in nonvolatile storage survives system crashes.
4. The unit of storage that can store one or more records in a hash file organization is
A. Buckets
B. Disk pages
C. Blocks
D. Nodes
Answer: a
Explanation: Buckets are used to store one or more records in a hash file organization.
5. A ______ file system is software that enables multiple computers to share file storage while maintaining
consistent space allocation and file content.
A. Storage
B. Tertiary
C. Secondary
D. Cluster
Answer: d
Explanation: With a cluster file system, the failure of a computer in the cluster does not make the file
system unavailable.
8. Which of the following is the process of selecting the data storage and data access characteristics of the
database?
A. Logical database design
B. Physical database design
C. Testing and performance tuning
D. Evaluation and selecting
Answer: b
Explanation: Physical database design is the process of selecting the data storage and data access
characteristics of the database.
10. The process of saving information onto secondary storage devices is referred to as
A. Backing up
B. Restoring
C. Writing
D. Reading
Answer: c
Explanation: The information is written into the secondary storage device.
11. The file organization which allows us to read records that would satisfy the join condition by using one block
read is
A. Heap file organization
B. Sequential file organization
C. Clustering file organization
D. Hash file organization
Answer: c
Explanation: In a clustering file organization, records of two or more relations that are frequently joined are
stored in the same blocks, so the join condition can be satisfied with one block read.
12. Which of the following is a correct feature of a distributed database?
A. Is always connected to the internet
B. Always requires more than three machines
C. Users see the data in one global schema.
D. Have to specify the physical location of the data when an update is done
Answer: c
Explanation: Users see the data in one global schema.
13. Each tablespace in an Oracle database consists of one or more files called
A. Files
B. name space
C. datafiles
D. PFILE
Answer: c
Explanation: A data file is a computer file which stores data for use by a computer application or system.
14. The management information system (MIS) structure with one main computer system is called a
A. Hierarchical MIS structure
B. Distributed MIS structure
C. Centralized MIS structure
D. Decentralized MIS structure
Answer: c
Explanation: Structure of MIS may be understood by looking at the physical components of the
information system in an organization.
15. A top-to-bottom relationship among the items in a database is established by a
A. Hierarchical schema
B. Network schema
C. Relational schema
D. All of the mentioned
Answer: a
Explanation: A hierarchical database model is a data model in which the data is organized into a tree-like
structure. The structure allows representing information using parent/child relationships.
16. Choose the RDBMS which supports full-fledged client server application development
A. dBase V
B. Oracle 7.1
C. FoxPro 2.1
D. Ingress
Answer: b
Explanation: RDBMS is Relational Database Management System.
17. Which of the following is an approach to standardizing the storage of data?
A. MIS
B. Structured programming
C. CODASYL specification
D. None of the mentioned
Answer: c
Explanation: CODASYL is an acronym for “Conference on Data Systems Languages”.
18. The highest level in the hierarchy of data organization is called
A. Data bank
B. Data base
C. Data file
D. Data record
Answer: b
Explanation: Database is a collection of all tables which contains the data in form of fields.
19. What is the purpose of index in sql server
A. To enhance the query performance
B. To provide an index to a record
C. To perform fast searches
D. All of the mentioned
Answer: D
Explanation: A database index is a data structure that improves the speed of data retrieval operations on a
database table at the cost of additional writes.
20. How many types of indexes are there in sql server?
A. 1
B. 2
C. 3
D. 4
Answer: B
21. How does a non-clustered index point to the data?
A. It never points to anything
B. It points to a data row
C. It is used for pointing data rows containing key values
D. None of the mentioned
Answer: C
22. Which one is true about clustered index?
A. Clustered index is not associated with table
B. Clustered index is built by default on unique key columns
C. Clustered index is not built on unique key columns
D. None of the mentioned
Answer: B
23. What is true about indexes?
A. Indexes enhance performance even if the table is updated frequently
B. It makes it harder for SQL Server engines to work on indexes which have large keys
C. It doesn't make it harder for SQL Server engines to work on indexes which have large keys
D. None of the mentioned
Answer: B
24. Does index take space in the disk?
A. It stores memory as and when required
B. Yes, Indexes are stored on disk
C. Indexes are never stored on disk
D. Indexes take no space
Answer: B
25. If an index is _________________, the metadata and statistics continue to exist
A. Disabled
B. Dropped
C. Altered
D. Both A and B
Answer: A
26. A clustering index is defined on the fields which are of type