Student IM 1
Student IM 1
This course introduces the differentiations of data, information and computer applications. It also
introduces the various database models as well as databases and this helps students understand and
analyze how computer applications make of data warehouses.
V. LESSON CONTENT
1. Data, Information and Computer Applications
Data – are raw facts and figures. They can be combinations, sequences, and representations of letters,
numbers, and symbols.
Information – is also called transformed or processed data. These are data which are refined using data
processing techniques.
Computer Program/Application – is a set of instructions which are followed by a computer for a specific
task. These are mainly written in programming languages which are translated and executed by
computers.
2. Types of Databases and Database Design Principles
Database Design Methods
1. Requirements Analysis – this phase or step entails the collection, classification and preparation of
data and information collected from the various entities or objects in an organization or company which
are a necessity for a company’s workflow.
2. Logical and Conceptual Design – is the creation of blueprints or plans for the database to be
implemented in organizations. Entity Relationship diagrams and relational diagrams are used in this
phase to get a better understanding of how the database should work in the context of the organization.
3. Physical Design – is the creation proper of the database in a sense. This phase uses Database
Management Software/Systems to help database programmers and database administrators create a
fully functioning database with the use of the diagrams used in the previous phase.
4. Tuning Phase – this step includes the review of the created database as well as the created diagrams.
It is reviewed if there are any more flaws or if there are anomalies or data redundancies present in the
created database in the previous phase.
Types of Databases
1. Online Transaction Processing Databases – is a transaction processing database which deals with
transactions from varying users who use the database. It should possess high amounts of processing
power because it frequently deals with high concurrency which means that lots of transactions from
users are catered by the database. Concurrency is synonymous with simultaneous.
2. Client-server Database – is a transaction processing database which deals with transactions but are
fewer as compared to OLTP databases. Client-server databases are mostly used by smaller
environments and are only limited to country-wide transactions. Client-server Databases mostly are
used by organizations, institutions, offices and small businesses.
Client-Server Databases and Online Transaction Processing Databases are almost as identical as
each other. The only differences of Online Transaction Processing Databases are the OLTP Databases
use internet connections to function, hence, it is larger than the Client-Server Database The Client
Server Databases can still use network solutions but it is much smaller than the OLTP Databases.
3. Data Warehouse Database – is a database which deals with mass data for future decision making
processes and forecasting. They are mostly used for knowledge discovery and creation. Decision
support databases should possess large amounts of storage spaces because they will be handling and
storing large amounts of data from the past years of an organization to the future years to come.
4. Data Mart – are smaller versions of Data Warehouses which are used mainly by smaller organizations
and businesses.
5. Reporting Databases – these databases are also used for decision-making purposes however,
reporting databases only show users actual data. Decision-makers from organizations use these
databases to plot reports for sales, site visits, and so on. They do not have automated decision making
support functions and processes.
Data Warehouses, Data Marts and Reporting Databases are all the same in essence. These
Databases are used to hold large amounts of data from the organization as well as from different
clienteles of organizations. Data Warehouses are much larger than Data Marts. These are mainly used
by large companies like google and facebook. Reporting Databases however, are used for creating
reports. For example, it tallies and categorizes data to let the organization know what products are
popular, how much the organization has in terms of sales, how much a product performs in a market
and so on.
6. Hybrid Databases – are combinations of transaction processing and decision support databases. They
combine high processing power of transaction processing databases and the vast amounts of storage
spaces of decision support databases which can provide a more robust and cost-efficient database
implementation.
Downsides to File Systems
1. File systems can barely use large amounts of data at a single time. Computers with 32-bit addressing
cannot refer directly to more than 4 gigabytes of data.
2. To classify and categorize data, we need to write or create special applications.
3. Because operating systems can be used by lots of users, data inconsistency becomes a problem. Data
should not be made available to unauthorized users.
Data becomes inconsistent when unauthorized users tamper with or change and modify data present in
a database.
4. Operating systems only employ and use password mechanisms for data security. More options for
security implementations should be made available.
5. File systems should employ easy to use system/data recovery procedures in case of crashes.
File sizes: a kilobyte is 1024 bytes, a megabyte is 1024 kilobytes, a gigabyte is 1024 megabytes, a
terabyte is 1024 gigabytes, and a petabyte is 1024 terabytes.
Advantages of a Database Management System
1. Data independence – A DBMS employs data independence. DBMS should hide sensitive details of
data stored in a database. The DBMS should employ an abstract view of data from a database to hide
these details.
Because database management systems use data independence, it hides sensitive data or irrelevant
data from other users of the databases. It also employs layers which hides the data from prying eyes or
from other unauthorized users.
2. Efficient Data Access – A DBMS utilizes sophisticated techniques to store and retrieve data efficiently
especially if the data are stored on external storage devices.
3. Data Integrity and Security – Changes to data should be logged and a DBMS should check/validate
incoming data, it should check if it exceeds data type constraints and etc. A DBMS also uses security
implementations and processes to preserve data integrity.
Data Integrity is the maintenance of and the assurance of the accuracy and consistency of data.
4. Data Administration – A DBMS provides users managerial processes and techniques to manage data
and users of a database.
5. Concurrent Access and Crash Recovery – A DBMS schedules concurrent or simultaneous access to
the data in a database which makes the user think that the data is only accessed by a single user. A
DBMS also utilizes crash recovery procedures in case of system failures and crashes.
6. Reduced Application Time – A DBMS has innate functions to categorize and classify data. Separate
applications can use these functions to specify the data coming from the database.
People who work with Databases
Database application programmers develop packages that facilitates data access for end users, who
are usually not computer professionals, using the host or data languages provided by DBMS vendors.
Such tools include report writers, spreadsheets, statistical packages, and the like. Application programs
made by database application programmers should typically and ideally access data through external
schemas or views. It is possible to write applications that access data at a lower level, but such
applications would compromise data independence.
Database Administrators are entrusted for maintaining, designing and creating databases. The
Database Administrator has the responsibilities:
- Design of the Conceptual and Physical Schema. The Database administrator should be responsible for
interacting with the users to understand what data is to be stored in the database. Based on the knowledge
collected by the database administrator, they should be able to design conceptual, physical and external
designs/schemas tailor fit for the users of an application program with a database or a single database.
- Security and Authorization. The Database administrator is responsible for creating and implementing
security procedures. They should ensure that unauthorized data access is not permitted. Not everyone
should be able to access data from a database. Users can be granted only views.
- Data Availability and Recovery from Failures. The Database administrators should take steps to ensure
that if there are system failures and crashes, users can continue to access as much of the uncorrupted
data as possible. The Database Administrator works to restore the data to a consistent. The Database
Administrators should also take into account the creation and maintenance of system logs, audit trails and
data backup.
- Database Tuning. User’s need to evolve with time. Database administrators are responsible for modifying
the database to ensure adequate performance as requirements and processes change.
A description of data in terms of a data model is a schema. A schema, in a relational model, has tuples
and records of instances of data as well as attributes and columns which describes the records.
The conceptual schema describes the data stored in the data model of the database management
system. What data types are used, what columns are created, as well as the instances of records or rows
are present in the conceptual schema.
The external schema describes the data showed and are seen by the users. The select statements
which specifies the instances of data in the databases or tables can be used to create external schemas.
The physical schema describes how the data is stored from a physical standpoint. The physical schema
describes what hardware is used to store the data as well as the specifications of the hardware used for
the database and database management system. The physical schema also describes how much
memory size is allocated for a table or a database and what data types (present in the database
management system) are used.
The different schemas are used so as to prevent unnecessary data manipulation/changes to the different
levels of data abstraction or schemas. The insulation of data changes from one abstraction or schema is
known as data independence.
Database – is a collection of data typically describing activities of one or more related organizations.
These are structured objects which typically are seen as physical piles of papers in the past, however, in
the modern age, they are seen inside and associated with computers. Metadata is used to define and
describe various data stored in a database.
• Databases can have entities, such as students, faculty, courses, classrooms, employees, managers
and so on. Entities are the physical entities, objects, or people that possesses data which can be
collected and stored in a database. These data which are collected may be specific for one group or for
a single entity.
• Databases can also have relationships which exists between two or more entities. Examples of these
relationships can include: students enrolling in courses, teachers teaching classes, cashiers
interacting with customers and so on. Notice that relationships often are indicated by verbs or action
words. These relationships help dictate how and why an entity has an impact on another entity.
• Entities can also have attributes or characteristics, These attributes and characteristics are data
that we collect, and manipulate from entities.
Databases include Columns, Attributes, or Fields as well as Rows, Tuples or Records. Rows, Tuples or
Records are live data or actual data which are collected from various entities in an environment or
organization. Columns, Attributes, or Fields describe and define the actual data stored in the table.
Database Management Systems – are software which are used for the effective management of
databases, data, and information. They can be used to create databases, define tables, execute code,
process data, and data storage. Database management systems typically are computer programs which
give database administrators/users control over databases and data.
With Database Management Systems, we have tables, rows and columns. A table in Database
Management Systems is only another word for entities. However, in Database Management Systems,
tables are used to classify and order different data gathered from entities. Columns or Fields are
another word for attributes and characteristics. Rows or Records are the live data or raw data
collected from entities.
An example of a table is:
Table Name: tbl_Students
In the above table, name, age, gender, and course are the Columns or Fields(When referring to
tables), attributes or characteristics(When referring to entities).
Pepito Manaloto, Elsa Manaloto, 21, Male, Female, BSCS, and BSIT are what we call the raw data,
the live data or the rows or records associated with the entity or table.
The table named tbl_Students has columns which are name, age, gender, and course and the
different instances of data in the table are Pepito Manaloto, 21, Male, BSCS and Elsa Manoloto, 21,
Female, BSIT.
VI. REFERENCES
School Year
Semester
Course Number
e.g.:
IM-COURSE NO-SEMESTER-SCHOOL YEAR
IM-MCB180-1STSEM-2020-2021