MIS UNIT-II
Data Resource Management (DRM) involves the management of files and computer data for
businesses and companies.
➢ DRM, also known as data administration, deals with computer science and
information systems.
➢ Workers in this field help design, control, protect, store, administer and organize
saved data.
➢ Normally, this information is stored in a database managed by a Database Management
System (DBMS) or similar software.
➢ DRM is a managerial activity that applies IT and software tools to the task of
managing an organization's data resources.
➢ Earlier, the traditional file processing approach was used, which was too difficult,
costly and inflexible for supplying information.
➢ Thus DRM approach was developed to solve the problems of file processing
systems.
➢ Data is an important input to an Information System (IS).
➢ DATA RESOURCE is also called the database.
➢ DATA BASE: Data is processed and converted into information to satisfy the
needs of the organization.
➢ Nowadays internal and external information is increasing rapidly, so a database
is necessary in any organization.
➢ The business environment has forced the businesses to take quick and right
decisions for which databases are required to be queried frequently.
➢ QUERIES may be varied.
EXAMPLES
1. One manager may be interested to know the names of all those products for which
sales in the current year exceed that of the previous year.
2. One may require information on the total amount outstanding.
3. One may require the list of products having a market share greater than 30%, and
so on.
To correctly process varied types of queries and to ensure a fast response time, the use of
computer based IS has become a necessity of any business.
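The three example queries above can be sketched in SQL. The following is only a minimal illustration using Python's built-in sqlite3 module; the table names (products, invoices), the column names and all sample figures are hypothetical and are not taken from these notes.

```python
import sqlite3

# Hypothetical in-memory database; all names and figures are made up.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE products (
        name TEXT PRIMARY KEY,
        sales_current_year REAL,
        sales_previous_year REAL,
        market_share_pct REAL
    );
    CREATE TABLE invoices (
        invoice_no INTEGER PRIMARY KEY,
        customer TEXT,
        amount_outstanding REAL
    );
    INSERT INTO products VALUES
        ('Soap', 120.0, 100.0, 35.0),
        ('Shampoo', 80.0, 95.0, 40.0),
        ('Toothpaste', 60.0, 50.0, 20.0);
    INSERT INTO invoices VALUES
        (1, 'A Traders', 500.0),
        (2, 'B Stores', 0.0),
        (3, 'C Mart', 250.0);
""")

# Query 1: products whose current-year sales exceed previous-year sales.
growing = [row[0] for row in conn.execute(
    "SELECT name FROM products "
    "WHERE sales_current_year > sales_previous_year ORDER BY name")]

# Query 2: total amount outstanding.
total_outstanding = conn.execute(
    "SELECT SUM(amount_outstanding) FROM invoices").fetchall()[0][0]

# Query 3: products with a market share greater than 30%.
leaders = [row[0] for row in conn.execute(
    "SELECT name FROM products WHERE market_share_pct > 30 ORDER BY name")]

print(growing, total_outstanding, leaders)
```

Each query is a single statement against the same stored data, which is why a database can answer varied questions without a new program for each one.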
OR
Anything of interest to the user about which data is to be collected or stored is called
an entity.
Examples of entities: an employee, a student, a spare part, a place, an event, a job title,
a customer account.
Attributes:
Explanation:
1. CHARACTER:
➢ It consists of a single alphabetic, numeric, or other symbol, which is represented
by bits or a byte.
➢ A character is the most BASIC ELEMENT of data.
2. FIELD :
➢ A collection of characters is called field.
➢ A field is a physical space on the storage device.
For example – the fields in an employee record may be employee name, sex, address, etc.
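[Figure: a file. The example "Course File" has the columns Name, Class and Course, with the
records "1. Vikky, MBA, MIS" and "2. Rahul, MBA, ITM"; a second file is labelled
"Administrative File".]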
3. RECORD:
A collection of related fields is called a record.
For example – a student's name, address, roll number, marks, etc. together form a record
of that student.
4. FILE:
OR
Any collection of related records in the form of rows and columns (tabular form) is
called a file.
For example – if there are many students in a class, then the group of related records
would form a student file.
5. DATA BASE
OR
It is an organized collection of data, stored and accessed electronically.
An Information System (IS) application may have several related files and all related
files would constitute a database for that application.
For example – In a salary processing system, the files may be employee-file, provident-
fund-file, income-tax-file etc.
All these files, which are related to the application, are combined in a database.
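The hierarchy described above (character, field, record, file, database) can be pictured with a small sketch. This is only an illustration: the class name StudentRecord, the addresses and the marks are hypothetical values chosen for the example.

```python
from dataclasses import dataclass


@dataclass
class StudentRecord:
    # Each attribute is a FIELD; each field is made up of CHARACTERS.
    name: str
    address: str
    roll_no: int
    marks: int


# A FILE is a collection of related records.
student_file = [
    StudentRecord("Vikky", "Hyderabad", 1, 78),
    StudentRecord("Rahul", "Pune", 2, 85),
]

# A DATABASE groups the related files of one application.
database = {
    "student_file": student_file,
    "course_file": [
        {"name": "Vikky", "course": "MIS"},
        {"name": "Rahul", "course": "ITM"},
    ],
}

first = database["student_file"][0]
characters = list(first.name)  # one field broken into its characters
print(characters)
```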
1. Data duplication
2. Data inconsistency
3. Lack of data integration
4. Data dependence
5. Program dependence
1. DATA DUPLICATION:
Because each application has its own data file, the same data may have to be recorded and
stored in several files.
Example – payroll application, and personnel application, both will have data on
employee name, designation etc. This results in unnecessary duplication/redundancy of
common data items.
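A small sketch can show how duplicated data items arise and how they can drift apart. The two dictionaries below stand in for a payroll file and a personnel file; the employee number, name and values are hypothetical.

```python
# Two hypothetical application files, each keeping its own copy of employee data.
payroll_file = {101: {"name": "Asha", "designation": "Clerk", "salary": 30000}}
personnel_file = {101: {"name": "Asha", "designation": "Clerk", "leave_days": 12}}


def duplicated_items(a, b):
    """Return the data items that are recorded in both records."""
    return sorted(set(a) & set(b))


common = duplicated_items(payroll_file[101], personnel_file[101])

# A promotion is recorded by the personnel application only.
personnel_file[101]["designation"] = "Officer"

# The two copies of the same data item now disagree.
inconsistent = [
    item for item in common
    if payroll_file[101][item] != personnel_file[101][item]
]
print(common, inconsistent)
```

The duplicated items are the ones both applications store; once only one copy is updated, the same item holds two different values.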
2. DATA INCONSISTENCY
4. DATA DEPENDENCE
5. PROGRAM DEPENDENCE
➢ The reports produced by the file processing system are program dependent. This implies
that if any change is made to the format/structure of data and records in the file, a
corresponding change has to be made in the programs.
➢ Similarly, if any new report is to be produced, new programs will have to be
developed.
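Program dependence can be illustrated with a program that reads a fixed-width record. The layout used here (name in columns 0-9, marks in the next 3 columns) and the function names are assumptions made for the example.

```python
# Hypothetical fixed-width layout: name in columns 0-9, marks in columns 10-12.
def read_marks_v1(line):
    return line[0:10].strip(), int(line[10:13])


old_record = "Vikky".ljust(10) + "078"
name, marks = read_marks_v1(old_record)

# The file format changes: a 4-character roll number is inserted before the marks.
new_record = "Vikky".ljust(10) + "0001" + "078"

# The old program now reads the wrong columns without any warning.
_, wrong_marks = read_marks_v1(new_record)


# A corresponding change in the program is therefore required.
def read_marks_v2(line):
    return line[0:10].strip(), int(line[14:17])


_, right_marks = read_marks_v2(new_record)
print(marks, wrong_marks, right_marks)
```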
All these drawbacks of the traditional file approach to organizing data led to the
development of databases.
[Figure labels: data inconsistency, data dependence, program dependence; student
administration, financial management, course administration, faculty administration.]
2.3.1 DBMS Definition:
The software that allows an organization to centralize data, manage it efficiently, and
provide access to the database by application programs is known as a DBMS.
• The DBMS thus solves the problems of the traditional file processing environment.
• The DBMS is the software that interacts with end users, applications and the
database itself to capture and analyze data.
1. Redundancy control
2. Data consistency
3. Management queries
4. Data independence
5. Enforcement of standards
1. REDUNDANCY CONTROL
✓ In a file management system, each application has its own data, which causes
duplication of common data items in more than one file.
✓ This data duplication needs more storage space as well as multiple updates for a
single transaction.
✓ This problem is overcome in the database approach, where data is stored only once.
2. DATA CONSISTENCY
In the database approach, the problem of inconsistent data is automatically solved through
the control of redundancy.
3. MANAGEMENT QUERIES
The database approach, in most Information Systems (IS), pools the organization-wide
files at one place, known as the CENTRAL DATABASE, and is thus capable of
answering management queries that relate to more than one functional area.
4. DATA INDEPENDENCE
✓ File management system: data dependent.
✓ Database approach: data independent.
✓ The database approach provides independence between file structure and program
structure.
✓ Such a system provides an interface between the programs and the database and
takes care of the storage, retrieval and update of data in the database.
✓ It allows applications to be written as general programs to operate on files whose
structures can be made available to the program.
✓ DBMS – generalized file processing system.
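Data independence can be sketched with Python's built-in sqlite3 module. The table employee and its columns are hypothetical. The point of the sketch is that the program asks for data by name, so a change to the stored structure does not force a change to the program.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employee (emp_id INTEGER PRIMARY KEY, name TEXT)")
conn.execute("INSERT INTO employee VALUES (1, 'Asha')")


def employee_names(connection):
    # The program names the data it wants; the DBMS decides how to find it.
    return [row[0] for row in connection.execute(
        "SELECT name FROM employee ORDER BY emp_id")]


before = employee_names(conn)

# The structure of the stored data changes: a new column is added.
conn.execute("ALTER TABLE employee ADD COLUMN designation TEXT")
conn.execute("INSERT INTO employee VALUES (2, 'Ravi', 'Officer')")

# The same program still works without modification.
after = employee_names(conn)
print(before, after)
```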
5. ENFORCEMENT OF STANDARDS
✓ In the database approach, data being stored at one central place, standards can
easily be enforced.
✓ This ensures standardized data formats to facilitate data transfers between
systems.
1. Centralized database
2. More disk space
3. Operationality of the system
4. Security risk
1. CENTRALIZED DATABASE
• The data structure may become quite complex because of the centralized database
supporting many applications in an organization.
• This may lead to difficulties in its management and may require a professional or
an experienced database designer and sometimes extensive training for users.
2. MORE DISK SPACE
The database approach generally requires more processing than a file management
system and thus needs more disk space for program storage.
3. OPERATIONALITY OF THE SYSTEM
Since the database is used by many users in the organization, any failure in it,
whether due to a system fault, database corruption, etc., will affect the
operationality of the system, as it would render all users unable to access the
database.
4. SECURITY RISK
Being a centralized database, it is more prone to security disasters.
2.3.6 Functions of DBMS
1. Data organization
2. Data integration
3. Physical/logical – level separation
4. Data control
5. Data protection
1. DATA ORGANIZATION
DBMS organizes data items as per the specifications of the data definition
language.
The database administrator decides on the data specifications that are most
suited to each application.
2. DATA INTEGRATION
Data items are interrelated at the element level and can be manipulated in
many combinations during execution of a particular application program.
The DBMS facilitates collection, combination and retrieval of the required data for
the user.
3. PHYSICAL/LOGICAL – LEVEL SEPARATION
It separates application programs and their associated data.
DBMS separates the logical description and relationships of data from the way
in which the data is physically stored.
4. DATA CONTROL
DBMS receives requests for storing data from different programs.
It controls how and where data is physically stored.
Similarly, it locates and returns requested data to the program.
5. DATA PROTECTION
DBMS protects the data against access by unauthorized users, physical damage,
operating system failure etc.
A DBMS is equipped with a facility to back up data and restore it automatically in
the case of any system failure.
Other security features include password protection and sophisticated encryption
schemes.
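The backup and restore facility can be sketched with the backup method of Python's built-in sqlite3 connections. This is a simplified illustration: the accounts table is hypothetical, the restore is triggered by hand rather than automatically, and password protection and encryption are not shown.

```python
import sqlite3


def count_rows(connection):
    return connection.execute("SELECT COUNT(*) FROM accounts").fetchall()[0][0]


# Hypothetical "live" database with one table.
live = sqlite3.connect(":memory:")
live.execute("CREATE TABLE accounts (acc_no INTEGER PRIMARY KEY, balance REAL)")
live.execute("INSERT INTO accounts VALUES (1, 5000.0)")
live.commit()

# Take a backup copy of the whole database.
backup = sqlite3.connect(":memory:")
live.backup(backup)

# Simulate damage to the live data.
live.execute("DELETE FROM accounts")
live.commit()
damaged_rows = count_rows(live)

# Restore by copying the backup over the damaged database.
backup.backup(live)
restored_rows = count_rows(live)
print(damaged_rows, restored_rows)
```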
1. HIERARCHICAL MODEL
❖ In the hierarchical structure, the relationships between records are stored in the
form of a hierarchy or a tree (an inverted tree, with the root at the top and branches
below).
❖ In this model, all records are dependent and arranged in a multi-level structure;
thus the root may have a number of branches, and each branch may have a number
of sub-branches, and so on.
❖ A lower-level record is known as the ‘child’ of the next higher-level record,
whereas the higher-level record is called the ‘parent’ of its child records.
❖ Thus in this approach, all the relationships among records are one-to-many.
❖ Early mainframe DBMS packages used the hierarchical model.
❖ A hierarchical approach is simple to understand and design, but it cannot represent
data items that may simultaneously appear at two different levels of the hierarchy.
[Figure: hierarchical model, with Order as the parent (root) and Items below it.]
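The parent and child relationship of the hierarchical model can be sketched as a tree in which every record has exactly one parent. The class name Node and the item names are hypothetical.

```python
class Node:
    """One record in a hierarchy; it has exactly one parent (None for the root)."""

    def __init__(self, name, parent=None):
        self.name = name
        self.parent = parent
        self.children = []
        if parent is not None:
            parent.children.append(self)

    def path(self):
        """Return the names from the root down to this record."""
        node, names = self, []
        while node is not None:
            names.append(node.name)
            node = node.parent
        return list(reversed(names))


order = Node("Order")             # parent (root)
pen = Node("Item: pen", order)    # child records
book = Node("Item: book", order)

child_names = [child.name for child in order.children]
print(child_names, book.path())
```

Because each record stores a single parent, one parent can have many children (one-to-many), but a record cannot belong to two parents at once.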
2. NETWORK MODEL
• The network model allows more complex 1:M (one-to-many) or M:M (many-to-many)
logical relationships among entities.
• The relationships are stored in the form of a linked list structure in which
subordinate records, called members, can be linked to more than one owner
(parent).
• This approach does not place any restrictions on the number of relationships.
• However, the network model is the most complicated one to design and implement,
and it is used only in special types of applications.
[Figure: network model linking Customer, Order, Warehouse and Item, with relationships
labelled one-to-many and many-to-many.]
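The owner and member links of the network model can be sketched as follows. For brevity this sketch keeps the links in Python dictionaries rather than in the linked list structure described above, and the customer, order, item and warehouse names are hypothetical.

```python
from collections import defaultdict

# Each (owner, member) pair is one link.
links = [
    ("Customer A", "Order 1"),
    ("Customer A", "Order 2"),     # one customer, many orders (1:M)
    ("Order 1", "Item: pen"),
    ("Order 2", "Item: pen"),      # one item appears in many orders (M:M)
    ("Order 2", "Item: book"),
    ("Warehouse X", "Item: pen"),  # the same item also has a warehouse as owner
]

members_of = defaultdict(list)
owners_of = defaultdict(list)
for owner, member in links:
    members_of[owner].append(member)
    owners_of[member].append(owner)

print(owners_of["Item: pen"])
print(members_of["Customer A"])
```

Unlike a tree, a member here may have several owners, which is what allows many-to-many relationships.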
In the relational model:
COLUMN – Attribute, the set of values of one data item
DOMAIN
4. OBJECT-ORIENTED MODEL
❖ The object-oriented model is an approach to data management that stores both data
and the operations that can be performed upon the data as OBJECTS.
❖ While traditional DBMS are designed for HOMOGENEOUS DATA, object-oriented
databases are capable of manipulating HETEROGENEOUS DATA that
include drawings, images, photographs, voice and full-motion video.
❖ An object-oriented database stores the data and procedures as objects that can be
automatically retrieved and shared.
❖ These days, the object-oriented model is gaining popularity and many modern
database systems support this model.
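The idea of storing data together with its operations can be sketched with two small classes. The class names Photograph and VoiceNote, and the list used as an object store, are hypothetical and only illustrate the concept; a real object-oriented database would also persist these objects.

```python
class Photograph:
    """Data (title, size, pixels) stored together with its operations."""

    def __init__(self, title, width, height, pixels):
        self.title = title
        self.width = width
        self.height = height
        self.pixels = pixels  # raw image bytes

    def describe(self):
        return f"{self.title}: {self.width}x{self.height} image, {len(self.pixels)} bytes"


class VoiceNote:
    """A different kind of data with its own version of the same operation."""

    def __init__(self, title, seconds):
        self.title = title
        self.seconds = seconds

    def describe(self):
        return f"{self.title}: {self.seconds} s of audio"


# Heterogeneous objects kept in one store and handled in a uniform way.
object_store = [Photograph("Factory", 4, 2, bytes(8)), VoiceNote("Memo", 30)]
descriptions = [obj.describe() for obj in object_store]
print(descriptions)
```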
5. MULTI-DIMENSIONAL MODEL
❖ This model is an extension of the relational model.
❖ In this model, data is organized using multi-dimensional structure.
❖ Multi-dimensional structure can be visualized as cubes of data and cubes within
cubes of data.
❖ Different sides of the cube are considered different dimensions of the data.
❖ This model enables a user to selectively extract and view data in one or more
dimensions, such as time, geographic region, product,
organizational department, customer, or other factors.
❖ This model has become the most popular data model for the analytical databases
that support OnLine Analytical Processing (OLAP) applications.
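A data cube can be sketched as a mapping from one value per dimension to a measure. The dimensions (time, region, product), the function names and the sales figures below are hypothetical.

```python
from collections import defaultdict

# Hypothetical sales facts: (quarter, region, product) -> units sold.
cube = {
    ("Q1", "North", "Soap"): 100,
    ("Q1", "South", "Soap"): 80,
    ("Q1", "North", "Shampoo"): 40,
    ("Q2", "North", "Soap"): 120,
    ("Q2", "South", "Shampoo"): 60,
}
DIMENSIONS = ("time", "region", "product")


def slice_cube(cube, dimension, value):
    """Keep only the cells where one dimension has the given value."""
    i = DIMENSIONS.index(dimension)
    return {key: units for key, units in cube.items() if key[i] == value}


def roll_up(cube, dimension):
    """Total the measure for each value of the chosen dimension."""
    i = DIMENSIONS.index(dimension)
    totals = defaultdict(int)
    for key, units in cube.items():
        totals[key[i]] += units
    return dict(totals)


north = slice_cube(cube, "region", "North")
by_product = roll_up(cube, "product")
by_time = roll_up(cube, "time")
print(by_product, by_time)
```

Slicing fixes one side of the cube, while rolling up views the same data along a single dimension.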
2.5 DATA WAREHOUSING AND DATA MINING
2.5.1 DATA WAREHOUSE
1. SUBJECT-ORIENTED
• It focuses on modeling and analysis of data relating to a specific area.
• The data warehouse is organized around subjects such as product, customer, sales,
etc.
2. INTEGRATED
It is an integration of data from various applications, such as ERP systems,
CRM systems, etc.
3. HISTORICAL PERSPECTIVE
A data warehouse is time-variant: it takes a historical perspective in its approach,
for example – the past 5-10 years.
4. NON-VOLATILE
It means data is stored permanently, i.e. data once stored cannot be updated.
Data warehouses are capable of storing vast quantities of data, but there is a
challenge in implementing data warehousing applications.
For successful implementation, organizations need to be very careful about the
data quality.
Missing and miscoded data has to be cleaned up, and variables often come in a
variety of types, such as nominal data with no numeric content, dates, counts,
averages etc.
Thus, organizations must ensure the data quality in a data warehouse.
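The clean-up of missing and miscoded data can be sketched as below. The sample rows, the REGION_CODES mapping and the rule of rejecting rows that cannot be repaired are all assumptions made for the example; real warehouse loading usually applies many more rules.

```python
# Hypothetical raw rows as they might arrive from different source systems.
raw_rows = [
    {"customer": "A Traders", "region": "north", "amount": "1200"},
    {"customer": "B Stores", "region": "N", "amount": ""},          # missing amount
    {"customer": "C Mart", "region": "Sth", "amount": "950"},       # miscoded region
    {"customer": "D Retail", "region": "south", "amount": "abc"},   # miscoded amount
]

# Assumed mapping from the codes seen in the sources to one standard code.
REGION_CODES = {"north": "North", "n": "North", "south": "South", "sth": "South"}


def clean(row):
    """Return a cleaned copy of the row, or None if it cannot be repaired."""
    region = REGION_CODES.get(row["region"].strip().lower())
    try:
        amount = float(row["amount"])
    except ValueError:
        return None  # missing or non-numeric amount
    if region is None:
        return None  # unknown region code
    return {"customer": row["customer"], "region": region, "amount": amount}


cleaned = [c for c in map(clean, raw_rows) if c is not None]
rejected = len(raw_rows) - len(cleaned)
print(cleaned, rejected)
```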
To make data warehouses useful, organizations must use BI
(business intelligence) tools to process data into meaningful information.
These databases are used for data mining and online analytical processing (OLAP).
The organizations that develop business intelligence (BI) tools create interfaces
that help the managers to quickly grasp business situations.
Such an interface is simple to understand and the interpretation by the managers
becomes easy.
Example – one such interface is called a dashboard, because it looks similar to a car
dashboard. Dashboards use visual images such as speedometer-like indicators for periodic
revenues, profits, and other financial information, plus bar charts, line graphs, and other
graphical representations.
Data mining is defined as a process used to extract usable data from a larger set of raw data.
For example – a typical traditional query may be “What is the relationship between the
amount of product A and the amount of product B that an organization sold over the past
week?”
Whereas in data mining, the manager would be interested to know the products that
would be in demand on the coming weekend, and thus the data mining query
may be “Find out the products most likely to have the maximum demand on the coming
weekend.”
The combination of data-warehousing techniques and data mining software makes it
easier to predict future outcomes based on patterns discovered within historical data.
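The weekend-demand question above can be sketched very naively by averaging past weekend sales for each product. The sales history, the product names and the averaging rule are hypothetical, and real data mining software uses far more sophisticated pattern-discovery techniques than this.

```python
from collections import defaultdict

# Hypothetical historical sales: (day_of_week, product, units_sold).
history = [
    ("Sat", "Soft drink", 90), ("Sun", "Soft drink", 110),
    ("Sat", "Umbrella", 20), ("Sun", "Umbrella", 15),
    ("Mon", "Umbrella", 200), ("Sat", "Snacks", 70),
    ("Sun", "Snacks", 60), ("Sat", "Soft drink", 100),
]
WEEKEND = {"Sat", "Sun"}


def likely_weekend_demand(history):
    """Average past weekend sales per product, highest first."""
    sales = defaultdict(list)
    for day, product, units in history:
        if day in WEEKEND:
            sales[product].append(units)
    averages = {p: sum(u) / len(u) for p, u in sales.items()}
    return sorted(averages.items(), key=lambda item: item[1], reverse=True)


ranking = likely_weekend_demand(history)
top_product = ranking[0][0]
print(ranking)
```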
1. Retail or marketing
2. Banking
3. Insurance and health care
4. Transportation
5. Medicine
DIFFERENCE BETWEEN DATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING                          | DATA MINING
• Data warehousing is the process of      | • Data mining is the process of extracting
  compiling and organizing data into a    |   meaningful data from that database.
  common database.                        |
• Helps in identifying certain data in a  | • Helps in figuring out a certain pattern in
  collection of data.                     |   the data.
• Data is stored periodically.            | • Data is analyzed regularly.
• Stores a huge amount of data.           | • Analyses a sample of data.
• Provides a mechanism to store a huge    | • Discovers patterns in data for better
  amount of data.                         |   decision making.
DIFFERENCE BETWEEN DATABASES AND DATA WAREHOUSES
DATA BASE                                 | DATA WAREHOUSE
• Collection of files.                    | • Collection of databases in a
                                          |   qualitative way.
• An organized collection of data.        | • A central repository of integrated
                                          |   data from one or more sources.
• Primarily insert/write data.            | • Primarily read/retrieve data.
• Current/point-in-time data.             | • Historical data.
• Online transactional processing.        | • Online analytical processing.
• Provides a detailed relational view.    | • Provides a summarized
                                          |   multi-dimensional view.
• For many concurrent transactions.       | • Not for a large number of
                                          |   concurrent transactions.
DIFFERENCE BETWEEN TRADITIONAL APPROACHES/FILE SYSTEM AND
MODERN APPROACHES
TRADITIONAL APPROACHES/FILE SYSTEM        | MODERN APPROACHES/DBMS
• Data redundancy/duplication.            | • Controlled data redundancy.
• Data dependency.                        | • Data independence.
• Program dependency.                     | • Program independence.
• No security.                            | • Has security.
• No access control.                      | • Access control.
• Lack of integration.                    | • Integrated system.
• It is for small systems like C++.       | • It is used in large systems like Oracle.
• These are relatively cheap.             | • These are expensive.
• Very simple structure.                  | • Very complex structure.
• Requires very little design.            | • Designing is important.
• Not secure.                             | • Secure.
• Used for a single user.                 | • Multi-user.
• Isolated data.                          | • Shared data.
• Very simple backup mechanism.           | • Complex backup mechanism.