What Is
Imagine a scenario where you have an Oracle RAC cluster with three nodes: Node A, Node B, and Node C. These nodes are interconnected through a network for communication and data sharing.
Now, let's say there's a temporary network issue that causes Node A to lose connectivity with Node B and Node C. However, Node B and Node C can still communicate with each other.
In this situation, Node A continues to operate independently and may make changes to a specific database block. At the same time, Node B and Node C, unaware of Node A's actions due to the network issue, also modify the same database block independently.
When the network issue resolves and Node A reconnects with the cluster, there's a conflict. The data block that Node A modified independently differs from the version modified by Nodes B and C. This inconsistency in data versions is the essence of a split brain scenario in Oracle RAC.
Node eviction in a cluster is like kicking out a problematic member to keep the group running smoothly. It happens when a node has issues and can't work properly, so it's removed to prevent it from causing problems for the rest of the group.
There would be many common reasons why a node might be evicted from an Oracle RAC cluster:
Network issues: If a node loses connectivity to the cluster interconnect network, it may be evicted from the cluster to prevent it from causing network instability or data corruption.
Hardware failures: If a node experiences a hardware failure, such as a power supply or disk failure, it may be evicted from the cluster to prevent it from causing data corruption or other problems.
Resource constraints: If a node runs out of resources, such as memory or CPU, it may be evicted from the cluster to prevent it from causing performance issues or instability for the other nodes in the cluster.
Software issues: If a node experiences a software issue, such as a hung process or kernel panic, it may be evicted from the cluster to prevent it from causing instability or data corruption.
Contains node-specific information which is required by OHASD.
Every node has its own dedicated OLR file.(it will not be shared between the nodes).
* The default location of OLR: $GRID_HOME/cdata/*olr
* View the available backups of OLR:$GRID_HOME/bin/ocrconfig -local -showbackup
* Take the backup of OLR manually: (Need to done from ROOT user) -- ocrconfig -local -manualbackup
* Verify integrity of the OLR: cluvfy comp olr
* Check the current OLR location: $GRID_HOME/bin/ocrcheck -local
* View OLR in text format: $GRID_HOME/bin/ocrdump -local -stdout
* Restore OLR from backup: In case OLR is corrupted, Restore it from backup.
*ONS provides event-based communication between the database instances and applications, allowing them to receive notifications about various cluster-related events.
Its act in following condition:
* When Node Status Changes: ONS enables applications to receive notifications when the status of a node changes, such as when a node joins or leaves the cluster.
* Service Availability: ONS allows applications to subscribe to service availability events, so they can receive notifications when a service becomes available or unavailable.
* This helps applications to dynamically adjust their connection endpoints based on the availability of services, improving high availability and load balancing.
* Database Failover: In the event of a database instance failure, Oracle RAC automatically performs a failover to another available instance.
ONS plays a vital role in facilitating this failover process by notifying applications about the instance failure and the new primary instance to which they should connect.
Applications can subscribe to failover events and receive notifications to adjust their connection information and continue accessing the database seamlessly.
Oracle clusterware software run by CRS (Cluster Ready Services) by using OCR and Voting disk. Anything that Oracle Clusterware manages is known as a CRS resource. Some examples of CRS resources are database,an instance,a service,a listener,a VIP address,an application process etc.
Post a Comment