RAC Database Monitoring and Tuning
RAC Database Monitoring and Tuning
Note: You can query the GV$CLUSTER_INTERCONNECTS view to see information about
the private interconnect:
The statistics snapshots generated by the AWR can be evaluated by producing reports
displaying summary data such as load and cluster profiles based on regular statistics
and wait events gathered on each instance.
The AWR functions in a similar way as Statspack. The difference is that the AWR
automatically collects and maintains performance statistics for problem detection and
selftuning purposes. Unlike in Statspack, in the AWR, there is only one snapshot_id
per snapshot across instances.
Active Session History (ASH) is an integral part of the Oracle Database self-
management framework and is useful for diagnosing performance problems in Oracle
RAC environments. ASH report statistics provide details about Oracle Database session
activity. Oracle Database records information about active sessions for all active Oracle
RAC instances and stores this data in the System Global Area (SGA). Any session that
is connected to the database and using CPU is considered an active session. The
exception to this is sessions that are waiting for an event that belongs to the idle wait
class.
ASH reports present a manageable set of data by capturing only information about
active sessions. The amount of the data is directly related to the work being performed,
rather than the number of sessions allowed on the system. ASH statistics that are
gathered over a specified duration can be put into ASH reports.
Each ASH report is divided into multiple sections to help you identify short-lived
performance problems that do not appear in the ADDM analysis. Two ASH report
sections that are specific to Oracle RAC are Top Cluster Events and Top Remote
Instance.
Using the Automatic Database Diagnostic Monitor (ADDM), you can analyze the
information collected by AWR for possible performance problems with your Oracle
database. ADDM presents performance data from a clusterwide perspective, thus
enabling you to analyze performance on a global basis. In an Oracle RAC environment,
ADDM can analyze performance using data collected from all instances and present it
at different levels of granularity, including:
Analysis for the entire cluster
Analysis for a specific database instance
Analysis for a subset of database instances
To perform these analyses, you can run the ADDM Advisor in Database ADDM for RAC
mode to perform an analysis of the entire cluster, in Local ADDM mode to analyze the
performance of an individual instance, or in Partial ADDM mode to analyze a subset of
instances. Database ADDM for RAC is not just a report of reports but has independent
analysis that is appropriate for RAC. You activate ADDM analysis using the advisor
framework through Advisor Central in Oracle Enterprise Manager, or through the
DBMS_ADVISOR and DBMS_ADDM PL/SQL packages.
Database ADDM has access to AWR data generated by all instances, thereby making
the analysis of global resources more accurate. Both database and instance ADDM run
on continuous time periods that can contain instance startup and shutdown. In the case
of database ADDM, there may be several instances that are shut down or started during
the analysis period. However, you must maintain the same database version throughout
the entire time period.
Database ADDM runs automatically after each snapshot is taken. You can also perform
analysis on a subset of instances in the cluster. This is called partial analysis ADDM.
I/O capacity finding (the I/O system is overused) is a global finding because it concerns
a global resource affecting multiple instances. A local finding concerns a local resource
or issue that affects a single instance. For example, a CPU-bound instance results in a
local finding about the CPU. Although ADDM can be used during application
development to test changes to either the application, the database system, or the
hosting machines, database ADDM is targeted at DBAs.
Data sources are:
Wait events (especially Cluster class and buffer busy)
Active Session History (ASH) reports
Instance cache transfer data
Interconnect statistics (throughput, usage by component, pings)
ADDM analyzes the effects of RAC for both the entire database (DATABASE analysis
mode) and for each instance (INSTANCE analysis mode).
The Database Resource Manager (also called Resource Manager) enables you to
identify work by using services. It manages the relative priority of services within an
instance by binding services directly to consumer groups. When a client connects by
using a service, the consumer group is assigned transparently at connect time. This
enables the Resource Manager to manage the work requests by service in the order of
their importance.
For example, you define the AP and BATCH services to run on the same instance, and
assign AP to a high-priority consumer group and BATCH to a low-priority consumer
group. Sessions that connect to the database with the AP service specified in their TNS
connect descriptor get priority over those that connect to the BATCH service.
With FAN, important high-availability events are pushed as soon as they are detected,
which results in a more efficient use of existing computing resources, and a better
integration with your enterprise applications, including mid-tier connection managers, or
IT management consoles, including trouble ticket loggers and email/paging servers.
FAN is, in fact, a distributed system that is enabled on each participating node. This
makes it very reliable and fault tolerant because the failure of one component is
detected by another. Therefore, event notification can be detected and pushed by any
of the participating nodes.
FAN events are tightly integrated with Oracle Data Guard Broker, Oracle JDBC implicit
connection cache, ODP.NET, TAF, and Enterprise Manager. For example, Oracle
Database 11g JDBC applications managing connection pools do not need custom code
development. They are automatically integrated with the ONS if implicit connection
cache and fast connection failover are enabled.
FAN delivers events pertaining to the list of managed cluster resources shown in the
slide.
The table describes each of the resources.
Note: SRV_PRECONNECT and SERVICEMETRICS are discussed later in this lesson.