Chapter 1 (1)
Chapter 1 (1)
CHAPTER ONE
1. Introduction to Database System
1.1 What is database?
A database is a collection of related data.
This collection of related data with an implicit meaning is a database.
The preceding definition of database is quite general, however, the common use of the term
database is usually more restricted.
A database has the following implicit properties:
A database represents some aspect of the real world, sometimes called the
miniworld or the universe of discourse (UoD).
Changes to the miniworld are reflected in the database.
A database is a logically coherent collection of data with some inherent meaning.
A random collection of data cannot correctly be referred to as a database.
A database is designed, built, and populated with data for a specific purpose.
Database systems are designed to manage large data set in an organization. The data
management involves both definition and the manipulation of the data which ranges from simple
representation of the data to considerations of structures for the storage of information. The data
management also consider the provision of mechanisms for the manipulation of information.
Today, Databases are essential to every business. They are used to maintain internal records, to
present data to customers and clients on the World-WideWeb, and to support many other
commercial processes. Databases are likewise found at the core of many modern organizations.
The power of databases comes from a body of knowledge and technology that has developed
over several decades and is embodied in specialized software called a database management
system, or DBMS. A DBMS is a powerful tool for creating and managing large amounts of data
efficiently and allowing it to persist over long periods of time, safely. These systems are among
the most complex types of software available.
Thus, for our question: What is a database? In essence a database is nothing more than a
collection of shared information that exists over a long period of time, often many years. In
common dialect, the term database refers to a collection of data that is managed by a DBMS.
1. Manual Approach
In the manual approach, data storage and retrieval follows the primitive and traditional way of
information handling where cards and paper are used for the purpose. The data storage and
retrieval will be performed using human labour.
Files for as many event and objects as the organization has are used to store information.
Each of the files containing various kinds of information is labelled and stored in one ore
more cabinets.
The cabinets could be kept in safe places for security purpose based on the sensitivity of
the information contained in it.
Insertion and retrieval is done by searching first for the right cabinet then for the right the
file then the information.
One could have an indexing system to facilitate access to the data
3. Database Approach
The database approach emphasizes the integration and sharing of data throughout the
organization.
Thus in Database Approach:
Database is just a computerized record keeping system or a kind of electronic filing
cabinet.
Database is a repository for collection of computerized data files.
Database is a shared collection of logically related data designed to meet the information
needs of an organization. Since it is a shared corporate resource, the database is
integrated with minimum amount of or no duplication.
Database is a collection of logically related data where these logically related data
comprises entities, attributes, relationships, and business rules of an organization's
information.
In addition to containing data required by an organization, database also contains a
description of the data which called as “Metadata” or “Data Dictionary” or “Systems
Catalogue” or “Data about Data”.
Since a database contains information about the data (metadata), it is called a self
descriptive collection on integrated records.
The purpose of a database is to store information and to allow users to retrieve and update
that information on demand.
• This definition is stored in the DBMS catalog, which contains information such as the
structure of each file, the type and storage format of each data item, and various
constraints on the data.
• The information stored in the catalog is called meta-data, and it describes the structure
of the primary database .
• The catalog is used by the DBMS software and also by database users who need
information about the database structure.
• A general-purpose DBMS software package is not written for a specific database
application.
• Therefore, it must refer to the catalog to know the structure of the files in a specific
database, such as the type and format of data it will access.
• DBMS catalog will store the definitions of all the files.
• These definitions are specified by the database designer prior to creating the actual
database and are stored in the catalog.
• Whenever a request is made to access, say, the Name of a STUDENT record, the DBMS
software refers to the catalog to determine the structure of the STUDENT file and the
position and size of the Name data item within a STUDENT record.
The DBMS environment has five components. To design and use a database, there will
be the interaction or integration of Hardware, Software, Data, Procedure and People.
1. Hardware: are components that one can touch and feel. These components are
comprised of various types of personal computers, mainframe or any server computers to
be used in multi-user system, network infrastructure, and other peripherals required in the
system.
2. Software: are collection of commands and programs used to manipulate the hardware to
perform a function. These include components like the DBMS software, application
programs, operating systems, network software, language software and other relevant
software.
3. Data: since the goal of any database system is to have better control of the data and
making data useful, Data is the most important component to the user of the database.
There are two categories of data in any database system: that is Operational and
Metadata. Operational data is the data actually stored in the system to be used by the
user. Metadata is the data that is used to store information about the database itself. The
structure of the data in the database is called the schema, which is composed of the
Entities, Properties of entities, and relationship between entities and business constraints.
4. Procedure: this is the rules and regulations on how to design and use a database. It
includes procedures like how to log on to the DBMS, how to use facilities, how to start
and stop transaction, how to make backup, how to treat hardware and software failure,
how to change the structure of the database.
5. People: this component is composed of the people in the organization that are
responsible or play a role in designing, implementing, managing, administering and using
the resources in the database. This component includes group of people with high level of
knowledge about the database and the design technology to other with no knowledge of
the system except using the data in the database.
1.4 Actors on the Scene:
A person who design, use, and administer a database, and whose jobs involve the day-to-day use
of a large database; we call them the "actors on the scene."
Data Administrator
Database Administrator
Database Designer
End Users
2.5 Workers behind the Scene
Persons those are associated with the design, development, and operation of the DBMS software
and system environment. These persons are typically not interested in the database itself. We call
them the "workers behind the scene," and they include the following categories.
DBMS designers and implementers: who design and implement different DBMS
software.
Tool Developers: experts who develop software packages that facilitates database
system designing and use. Prototype, simulation, code generator developers could
be an example. Independent software vendors could also be categorized in this
group.
Operators and Maintenance Personnel: system administrators who are
responsible for actually running and maintaining the hardware and software of the
database system and the information technology facilities.