Managing and patching Oracle Exadata Database Service on Dedicated Infrastructure (ExaDB-D/ExaCS) requires a combined approach where Oracle manages the physical infrastructure, and you manage the software stack from the hypervisor up. The OCI console simplifies patching via rolling, zero-downtime updates
- Oracle: Maintains physical hardware, network fabric, PDU, switches, and hypervisors.
- Customer: Responsible for patching the Guest VM OS, Grid Infrastructure (GI), Database Homes, and the databases themselv
patchmgr utility on a designated Exadata compute node to drive operating system updates across all nodes in the clusterdbaascli patch db apply to apply quarterly database patches, or use exadbcpatchmulti to handle multiple database patch operations directly from the command lindbaascli utility on the compute nodesdbaascli patch db get_listdbaascli patch db apply --patchid <patch_id>dbaascli patch db precheck --patchid <patch_id>patchmgr from a driving node to orchestrate updates across compute nodes- Always back up your databases before executing any patching operations.
- Follow the \(N - 3\) versioning rule: Always use the current major version or one of the three most recent ones (N through N - 3).
- Off-peak monthly patching is scheduled automatically by Oracle for critical infrastructure. Be sure to check the Maintenance Preferences in your Exadata Infrastructure settings to define rolling versus non-rolling schedules and reschedule if needed
Question : do we patch storage server on (ExaCS) and ExaDB-D
No, you do not manually patch the Exadata Storage Servers (Cells) on ExaCS (Exadata Cloud Service) or ExaDB-D (Exadata Database Service on Dedicated Infrastructure). Oracle manages the storage server patching and updates via cloud automation
- Oracle-Managed Components: Oracle manages everything at and below the hypervisor. This includes the Storage Servers, hardware, firmware, and network fabric. Oracle updates these components in the background using rolling updates to ensure zero database downtime.
- Customer-Managed Components: You are responsible for patching the software layers you control above the hypervisor. This includes:
- Database Homes (Oracle Database software).
- Grid Infrastructure (Clusterware and ASM).
- Guest VM Operating System (the OS on your compute nodes)
- Rolling Execution: Updates to storage servers are applied in a rolling manner. Thanks to ASM High Redundancy and Exadata software design, this happens without impacting database or application availability.
- Scheduling: While Oracle controls the underlying infrastructure patches, you can define maintenance windows and schedule your infrastructure updates using the Oracle Cloud Infrastructure Console
Question : what is general issue on patch Oracle Exadata Cloud Infrastructure (ExaCS) and ExaDB-D
Patch failures on Oracle Exadata Cloud Service (ExaCS) and ExaDB-D usually stem from outdated Cloud Tooling, lack of network connectivity to the Object Store, or improper cluster states. Addressing pre-checks and dependency conflicts early prevents most interruptions
- The Issue: Attempting to patch the Grid Infrastructure (GI) or Database Homes without first updating the
dbclianddbaasclicloud tooling. - The Fix: Ensure all Exadata database nodes run the same, most current version of cloud tooling before initiating any upgrade sequence
- The Issue: The virtual machine cannot reach the Oracle Cloud Infrastructure Object Store. This often happens if the service gateway or static route is misconfigured, causing patch downloads to stall.
- The Fix: Verify your VCN route tables and ensure a static route exists for Object Storage on each compute node.
- The Issue: Patching may fail if instances are down, ASM is not running properly, or custom wallet/listener files do not match across cluster nodes.
- The Fix: Ensure the database instance status is Open and active on all nodes before starting operations. Temporarily restore standard configuration files (like custom wallets) if the patch process trips on them.
- The Issue: Insufficient disk space on the
/u01or/u02partitions causes pre-checks to fail. - The Fix: Clear out old trace files, log files, or obsolete backups prior to patching
- The Issue: If you installed non-Exadata RPMs (extra OS packages) on the Guest VMs, the pre-check may flag conflicts with Oracle-installed RPMs.
- The Fix: Resolve the RPM dependencies or uninstall the conflicting non-Exadata packages before trying the Guest VM upgrade again
dbaascli)dbaascli patch tools list: Displays the currently installed cloud tooling version and checks if any updates are available for your system.dbaascli admin showLatestStackVersion: Returns the version number of the latest availabledbaastoolsRPM stack update.- Context note: These commands are run as the root user after connecting to a compute node as the
opcuser
- /var/opt/oracle/misc/platforminfo: A system file containing the deployment type identifier. On Exadata Cloud Service (ExaCS) or ExaDB-D, querying this file will return
EXACSorEXACC(Cloud@Customer). - /usr/local/bin/imageinfo: An Exadata utility script used to generate a summary of the release versions and statuses of software, OS, and firmware components on your Exadata compute or storage nodes
dbaascli cswLib listLocal: Lists the database software images and versions locally available in your environment for patching or provisioning
dbaascli tde status --dbname <dbname>: Checks the status of the Transparent Data Encryption (TDE) keystore (open, closed, auto-login).dbaascli database verify_wallet --dbname <dbname>: Validates the integrity and accessibility of the database wallet
dbaascli database getDetails --dbname <dbname>: Returns specific configuration and operational details for the specified database
dbaascli database backup --dbname <dbname> --getSchedules: Displays the configured automated backup schedules.dbaascli database backup --getConfig --dbName <dbname> --configFile /tmp/<dbname>_cfg.txt: Exports current backup configuration parameters to a text file for review or editing.dbaascli database backup --dbname <dbname> --list: Lists all available backups taken for the database.dbaascli database backup --dbName <dbname> --showHistory --all: Displays a comprehensive, historical log of all backup jobs.dbaascli database backup --dbname <dbname> --status --uuid <uuid>: Checks the status of a specific, previously run backup job using its UUID.dbaascli database backup --getLatestBackupJob --dbname <dbname>: Fetches the job details and status of the most recent backup execution
GetExaWatcherResults.sh command extracts ExaWatcher performance data on Oracle Exadata servers. It gathers detailed OS metrics (like CPU, memory, and network) between your specified timestamps, and compiles them into a compressed archive (e.g., .zip or .tar.gz)- Locate the Output: The generated archive is typically saved in the current directory or a designated directory (e.g.,
/opt/oracle.ExaWatcher/archive/). - Reviewing the Data: The archive contains CSV/raw data files for OS tools like
iostat,mpstat, andvmstat, along with a small subset of pre-built visual charts
tfactl diagcollect command with the -node flag to target specific nodes. By default, TFA collects data for the past 12 hours and from all nodes. To pinpoint the collection, restrict it to the exact nodes and timeframe when the issue occurred- Target Nodes: Specify
-node localfor just the server you are on, or-node node1,node2for a comma-separated list. - Time Range: Use
-fromand-tofor an exact window, or-last <n>h|dto gather logs for the past \(n\) hours or days (e.g.,-last 4h). - Specific Component: Add flags like
-crs,-asm, or-database <db_name>to restrict collection to those specific sub-systems
Question : How to administer Oracle Exadata Cloud Infrastructure (ExaCS) and ExaDB-D
srvctl, SQL*Plus, or dcli) for databases- Compute & VM Clusters: Use the OCI Console to provision and scale Exadata VM Clusters, manage database homes, and allocate CPU/RAM.
- Patching & Updates: OCI handles the lifecycle management for Grid Infrastructure and database software images. You can schedule automated updates or trigger them manually via the console.
- Storage Configuration: Manage your Exadata storage, adjust Exadata I/O Resource Management (IORM), and monitor metrics directly through the OCI dashboards.
- Backups: Configure automated OCI-managed backups (which offload directly to Object Storage) and manage retention policies at the VM cluster level
- Command Line Utilities: For Grid Infrastructure operations, connect via SSH to the compute nodes and use utilities like
srvctl(for database and service management),crsctl(for clusterware), anddcli(for executing commands across all compute nodes). - Database Operations: Continue using native Oracle tools like RMAN for backups, Data Pump for migrations, and SQL*Plus/SQLcl for typical database administration.
- Data Guard: Set up, monitor, and failover/switchover Oracle Data Guard configurations—including multi-standby deployments—either using OCI automation in the console or via traditional DGMGRL commands
- Compartments & Policies: Control who can view and modify Exadata resources by configuring specific IAM policies and organizing your infrastructure into Compartments.
- Encryption & Keystores: Manage your master encryption keys natively using OCI Vault, or integrate with Oracle Data Safe to manage user security and auditing
Question : what will you check and analyze in exadata through AWR
- Exadata Configuration Differences: Checks for mismatched hardware or software releases across your storage servers (cells). Discrepancies can lead to unpredictable I/O behavior.
- Server Health Report: Reviews disk statuses and validates that no grid disks or cell disks are unexpectedly offline, which would reduce your total available I/O bandwidth.
- Offload Efficiency Percentages: Analyzes how many I/O operations are being offloaded to the storage cells. Low offload statistics usually mean the database is pulling raw data blocks instead of leveraging Exadata Smart Scans for filtering.
- Storage Index Usage: Checks the number of "Smart IO bytes saved by storage index." Higher savings mean Exadata is successfully skipping reading data blocks that do not meet your query criteria, saving I/O resources
- Flash Cache Hit Ratios: Assesses read requests satisfied by flash rather than traditional hard disks. You can review cache usage broken down by workload type (OLTP, Scan, Keep).
- Smart Flash Log Statistics: Ensures that log file parallel write operations are being accelerated by flash. You should check for "Flash Log Skips" and redo write latency histograms to identify high-latency I/O outliers
- Top Databases by I/O Requests: Shows which databases or workloads on the Exadata machine are consuming the bulk of the I/O throughput.
- IORM Wait Time: Evaluates queue times for flash and disk devices. If queue times are high (e.g., greater than 5-10 ms), IORM plans may need tuning to prevent noisy neighbors from impacting critical databases
- Exadata Outlier Summary: Exadata typically distributes I/O requests evenly across all cells. AWR's outlier analysis pinpoints which individual cell servers, grid disks, or host HBAs are experiencing disproportionately high latency or service times compared to the rest of the storage grid.
Question : what will you check and analyze in exadata x8m through AWR
log file sync and cell single block physical read wait times drop dramatically compared to traditional all-flash architectures2. Exadata Smart Flash Cache EfficiencyCheck if your most active data is sitting in flash.Flash Cache Hit Ratios: Review the Flash Cache User Reads and User Writes sections. High percentages of unoptimized read requests (reads that hit spinning hard disks) indicate your working set outgrows the flash cache. Write-Back vs Write-Through: Look for the ratio of First Writes to Overwrites. High overwrites indicate your flash cache is absorbing I/O effectively, dramatically saving disk write operations.
3. Smart Scan & Offload EfficiencySmart Scans reduce the volume of data traveling across the Exadata network by filtering rows and columns at the storage server layer Interconnect vs Eligible I/O: Compare cell physical IO interconnect bytes returned by smart scan to cell physical IO bytes eligible for predicate offload. A large disparity proves that predicate filtering (column/row pruning) is working well. Storage Index Savings: Review the IO Saved by Storage Index metrics. If savings are consistently low, queries are not skipping unnecessary Exadata I/O regions efficiently, which points to tuning opportunities on table clustering or data types
4. Wait Events (Foreground & Background)Correlate traditional database wait times with Exadata-specific hardware eventsCell Single Block Physical Read / Multiblock Physical Read: Analyze the average wait time for these events. High times typically mean you are hitting spinning disks instead of the PMEM or Flash cache tiers. Reliable Message: In Exadata, this event indicates internal cell communications or cluster channel syncs. Spikes here may indicate network contention or RoCE adapter congestion
. IORM (I/O Resource Management)- IORM Wait Events: Verify that no specific database or pluggable database (PDB) is being excessively throttled. Look at
IORM transient bottleneck and db file sequential read waits to ensure your consumer groups are properly prioritized. [1]
6. Health & Configuration Checks- Offline Disks: The Exadata Health Report section automatically alerts you if grid disks or cell disks are offline or degraded. Even one offline disk can halve your I/O bandwidth.
- Storage Server Software Version: Ensure Exadata server versions are uniform across all storage cells to prevent mismatched offload algorithms or software limitations
Question : what will you check and analyze in exadata x8m if database is getting hanged
If an Exadata X8M database hangs, immediately identify if it is a global grid/cluster issue or an isolated database slowdown. Focus on cluster metrics, interconnect/network health (key to X8M's RoCE architecture), and storage cell bottlenecks
Review the following components systematically:1. Database & Compute Nodes- Clusterware & High Availability: Run
crsctl check cluster to see if the cluster is healthy. In X8M, hardware-based RDMA immediately catches severe node freezes; verify that the node hasn't been evicted. - Active Session History (ASH): Since the database is hung, generate a report using
oradebug setmypid or Real-Time ADDM to identify the predominant wait classes (e.g., Cluster, Concurrency, System I/O). - Crucial Wait Events: Look for
cell single block physical read (indicates storage tier bottlenecks) or log file sync (indicates log write issues).
2. Exadata X8M Network Fabric (RoCE)The X8M replaces traditional interconnects with RDMA over Converged Ethernet (RoCE), making network latency the primary suspectSwitch Health: Check the Cisco or Mellanox switch ports for errors, drops, or packet discards using esxcfg-nics or native switch commands. Interconnect Waits: Check for spikes in gcs drm freeze or gc cr block busy waits, which point to node-to-node communication stalls
3. Exadata Storage CellsIf the database isn't fully locked but queries are hanging, the issue may stem from the storage tier Smart PMEM Log: In X8M, redo logs are written via RDMA directly to Persistent Memory (PMEM). Check v$sysstat or v$log to ensure PMEM commits are completing properly. Storage CPU Utilization: Log into the cell servers via cellcli and run SCLI=list metriccurrent where objectType='CELL' and name like 'CPU_UTIL%' to ensure the cell server CPUs are not saturated. Disk Queuing: Check for high I/O latencies using SCLI=list metriccurrent where objectType='GRIDDISK' and name like 'GD_IO_RQ_TM%' to see if grid disks are causing long waits
4. Diagnostics & Logs- Exachk: Run the Oracle Exachk health check tool to identify known hardware or software configuration bugs (e.g., node evictions not properly resetting).
- ADRCI: Execute
adrci and check the alert logs to capture specific incident numbers and trace files
Question : what will you check and analyze in exadata x8m if database is getting hanged with command
When an Oracle Exadata database hangs, troubleshooting requires looking past the database layer down into the Exadata-specific hardware (Storage Servers, InfiniBand/RoCE network
A robust troubleshooting workflow requires checking the following areas using specific tools and commands:1. Database Tier ChecksIf you can still connect to the database (even via sqlplus -prelim / as sysdba), investigate where sessions are spending their time.- Identify Critical Wait Events: Check for Exadata-specific wait events (e.g.,
cell single block physical read, cell smart table scan).sqlSELECT event, SUM(wait_time), SUM(seconds_in_wait)
FROM v$session_wait
WHERE wait_class NOT IN ('Idle')
GROUP BY event;
Look for Contention: Check for enqueue or locking issues. Real-Time ADDM: If the database is completely unresponsive, use Real-Time ADDM to diagnose the hang without logging in.
2. Exadata Storage Level (PMEM & Smart Logging)- Exadata Smart PMEM Cache: Exadata X8M utilizes Persistent Memory (PMEM) and RoCE (RDMA over Converged Ethernet) to bypass traditional OS I/O stacks. Check for waits specifically related to the RDMA path:
cell single block physical read: pmem cache or cell single block physical read: xrmem cache. - Cell Server (CellSRV) Metrics: Use the
cellcli command line on the Exadata storage servers to check for any slow disk response times or interface issues on the RoCE network fabric
2. Storage Cell Tier Checks (Storage Servers)If the database waits indicate I/O or cell-related issues, log into a compute node and use dcli or cellcli to interrogate the Exadata Storage ServersCheck Storage Server Health:bashdcli -c cell01,cell02,cell03 "cellcli -e list alertcurrent"
Verify Disk/Cell Health: Look for offline or critical disks or flash drives
bashdcli -c cell01,cell02,cell03 "cellcli -e list griddisk attributes name,status"
Examine Quarantined Cells: Check if Exadata has quarantined any faulty offload operations that might be forcing the DB into slow single-block reads.bashdcli -c cell01,cell02,cell03 "cellcli -e list quarantine"
3. Exadata X8M Persistent Memory (PMEM)A key feature of the Exadata X8M is its RoCE network and NVDIMM/PMEM (Persistent Memory) write accelerators. If these hang, commit times drop to a crawlVerify PMEM State: Use the cellcli tool to ensure the NVDIMM hardware and PMEM controllers are operating normally.
First Writes to Overwrites. High overwrites indicate your flash cache is absorbing I/O effectively, dramatically saving disk write operations.cell physical IO interconnect bytes returned by smart scan to cell physical IO bytes eligible for predicate offload. A large disparity proves that predicate filtering (column/row pruning) is working well.- IORM Wait Events: Verify that no specific database or pluggable database (PDB) is being excessively throttled. Look at
IORM transient bottleneckanddb file sequential readwaits to ensure your consumer groups are properly prioritized. [1]
- Offline Disks: The Exadata Health Report section automatically alerts you if grid disks or cell disks are offline or degraded. Even one offline disk can halve your I/O bandwidth.
- Storage Server Software Version: Ensure Exadata server versions are uniform across all storage cells to prevent mismatched offload algorithms or software limitations
Question : what will you check and analyze in exadata x8m if database is getting hanged
- Clusterware & High Availability: Run
crsctl check clusterto see if the cluster is healthy. In X8M, hardware-based RDMA immediately catches severe node freezes; verify that the node hasn't been evicted. - Active Session History (ASH): Since the database is hung, generate a report using
oradebug setmypidor Real-Time ADDM to identify the predominant wait classes (e.g.,Cluster,Concurrency,System I/O). - Crucial Wait Events: Look for
cell single block physical read(indicates storage tier bottlenecks) orlog file sync(indicates log write issues).
esxcfg-nics or native switch commands.gcs drm freeze or gc cr block busy waits, which point to node-to-node communication stallsv$sysstat or v$log to ensure PMEM commits are completing properly.cellcli and run SCLI=list metriccurrent where objectType='CELL' and name like 'CPU_UTIL%' to ensure the cell server CPUs are not saturated.SCLI=list metriccurrent where objectType='GRIDDISK' and name like 'GD_IO_RQ_TM%' to see if grid disks are causing long waits- Exachk: Run the Oracle Exachk health check tool to identify known hardware or software configuration bugs (e.g., node evictions not properly resetting).
- ADRCI: Execute
adrciand check the alert logs to capture specific incident numbers and trace files
sqlplus -prelim / as sysdba), investigate where sessions are spending their time.- Identify Critical Wait Events: Check for Exadata-specific wait events (e.g.,
cell single block physical read,cell smart table scan).sqlSELECT event, SUM(wait_time), SUM(seconds_in_wait) FROM v$session_wait WHERE wait_class NOT IN ('Idle') GROUP BY event;
- Exadata Smart PMEM Cache: Exadata X8M utilizes Persistent Memory (PMEM) and RoCE (RDMA over Converged Ethernet) to bypass traditional OS I/O stacks. Check for waits specifically related to the RDMA path:
cell single block physical read: pmem cacheorcell single block physical read: xrmem cache. - Cell Server (CellSRV) Metrics: Use the
cellclicommand line on the Exadata storage servers to check for any slow disk response times or interface issues on the RoCE network fabric
dcli or cellcli to interrogate the Exadata Storage Serversdcli -c cell01,cell02,cell03 "cellcli -e list alertcurrent"
Verify Disk/Cell Health: Look for offline or critical disks or flash drives
bashdcli -c cell01,cell02,cell03 "cellcli -e list griddisk attributes name,status"
Examine Quarantined Cells: Check if Exadata has quarantined any faulty offload operations that might be forcing the DB into slow single-block reads.bashdcli -c cell01,cell02,cell03 "cellcli -e list quarantine"
cellcli tool to ensure the NVDIMM hardware and PMEM controllers are operating normally.bashdcli -c cell01,cell02,cell03 "cellcli -e list pmemdisk attributes name,status"
dcli -c cell01,cell02,cell03 "cellcli -e list pmemdisk attributes name,status"4. Fabric / Network (RoCE) ChecksExadata X8M uses RDMA over Converged Ethernet (RoCE) for cluster communication and storage access. A network degradation here will look like a database hang
- Check RoCE Switches: From the compute node, verify that there are no packet drops or latency spikes across the X8M network fabric. [1]
5. Operating System / Hardware Layer- Log to ILOM: If the storage or database server host OS completely fails to respond, query the server's Integrated Lights Out Manager (ILOM) for hardware-level faults or system freezes.
bashshow /SP/logs/event/list
- Check RoCE Switches: From the compute node, verify that there are no packet drops or latency spikes across the X8M network fabric. [1]
- Log to ILOM: If the storage or database server host OS completely fails to respond, query the server's Integrated Lights Out Manager (ILOM) for hardware-level faults or system freezes.
show /SP/logs/event/listcell single block physical read)?/opt/oracle.SupportTools/em/cell_alert_log)?- Alert Log: Located in
diag/rdbms/.../trace/alert_<SID>.log. Look for "LGWR is taking too long" warnings. - LGWR Trace Files: Check for I/O errors or timeouts in the storage layer.
- Cell Server Logs: On the storage cells, use the
cellclitool to checkLIST METRICCURRENTfor flash or PMEM health alerts
ORA-00070 (deadlock), memory leaks, or storage-offline events- ORA-Errors: Look for sequence errors such as
ORA-04031(shared pool exhaustion) or checkpoint not complete messages (ORA-00316). - Exadata Cell Alerts: Check for storage-related hardware errors, flash cache failures, or quorum disk drops.
- Trace File Analyzer (TFA): Run
tfactl diagcollect -since 1hto grab all relevant logs across the grid infrastructure
V$SESSION_WAIT and V$SYSTEM_EVENT to see where the system is blockedcellcli)dcli or cellcli) to verify hardware and cache integrityflashlog performance is healthy, as it directly impacts commit processing.- Hung Cluster: Run
crsctl check cluster -allto ensure the cluster nodes are communicating. - Interconnect: A hang is often triggered by network drops. Review the interconnect (private network) for packet drops or latency issues. Exadata X8M uses RoCE (RDMA over Converged Ethernet); check for switch port flaps.
- Check operating system stats (
top,vmstat) to see if you are facing CPU starvation (runqueuesize) or memory swapping (pi/po). - In the database, check for
latch freeorbuffer busy waits(due to unoptimized SQL or heavy concurrency).
- What specific wait events are currently showing as the highest in
V$SESSION? - Are there any ORA- errors printed in the alert log right before the freeze began?
- Is this a Single-Instance database or a RAC (Real Application Clusters) environment?
- Database Alert Log: Usually located in
$ORACLE_BASE/diag/rdbms/{DB_NAME}/{SID}/trace/alert_{SID}.log. - Hang Manager Logs: Look for messages containing
ORA-32701ordia0background process trace files in$ORACLE_BASE/diag/rdbms/{DB_NAME}/{SID}/incident/incdir_*. - Exadata Cell Alert Log: Verify Exadata storage server health by checking
/opt/oracle/cell/log/diag/asm/cell/{cell_name}/trace/alert.log.
- Log Write/Commit Bottlenecks: Look for
checkpoint not completeorLGWR wait for redo copymessages, which could point to I/O stalls. - PMEM / RoCE Issues: The Exadata X8M relies on Smart PMEM (Persistent Memory) for fast commits and RDMA over Converged Ethernet. Check for errors related to PMEM hardware faults or network fabric stalls.
- OOM (Out of Memory): Look for memory allocation failures or
ORA-04030/ORA-04031errors
- System State Dump: Execute
oradebug dump systemstate 266to get a precise snapshot of all processes and what they are waiting for. - Hang Analyzer: Run
oradebug hanganalyze 3to identify the blocking and waiting process chains. - AWR & ASH: If you can still log in, generate an AWR report or query the Active Session History (ASH) to review the top wait events
Question: How to enable Data Guard Oracle on Exadata Cloud Infrastructure (ExaCS) and ExaDB-D
- Navigate to the Primary Database:
- Open the OCI navigation menu and go to Oracle Database, then select Exadata on Oracle Public Cloud (ExaDB-D) or Exadata Cloud@Customer (ExaCS).
- Select the Compartment and the VM Cluster containing the primary database.
- Click the name of the specific Database you want to protect.
- Add a Standby Database:
- Under the Resources section on the left, click Data Guard Associations (or Data Guard Group for newer versions like 19c+).
- Click Add Standby or Enable Data Guard
- Configure the Standby Settings:
- Select Peer VM Cluster: Choose the target region, availability domain, and the destination VM Cluster where the standby will reside.
- Data Guard Type: Select either Data Guard (standard) or Active Data Guard (requires additional licensing for features like real-time query).
- Protection Mode: Choose Maximum Performance (asynchronous) or Maximum Availability (synchronous).
- Database Credentials: Enter the SYS password for the primary database to authorize the creation.
- Finalize and Monitor:
- (Optional but recommended) Click Run Precheck to ensure the environment is ready before proceeding.
- Click Add Standby or Enable Data Guard to start the provisioning process.
- Monitor the progress via the Work Requests page. Once completed, the database role will reflect its new status (Primary or Standby)
- Infrastructure: For maximum fault isolation, configure the standby on a different Exadata Infrastructure than the primary.
- Software Versions: Both the primary and standby VM Clusters must have identical DBaaS Tools and Agent versions.
- Network: Ensure proper security rules are in place to allow network communication between the primary and standby client subnets
Question : what is wait event gc cr block 2-way and gc current block 2-way and gc cr block busy
gc cr block 2-way event signifies a Consistent Read (CR) block requested by one instance being transferred directly from another instance over the cluster interconnect, involving exactly two nodes (the requestor and the holder) and a single network hop.- Identify the Object: Run an AWR (Automatic Workload Repository) report to check the "Segments by Global Cache Cr Blocks" section. Pinpointing the exact table or index causing the block transfers is step one.
- Application Partitioning: Segregate workloads so that sessions modifying data (DML) run on the same instance that queries (SELECT) that same data, localizing block access and eliminating cross-node chatter.
- Re-evaluate Index Usage: Frequent full-table scans or heavy index maintenance can trigger high block transfers. Optimize queries to use more localized or partitioned data access paths.
- Optimize Interconnect: Ensure your cluster interconnect network is fast, reliable, and not acting as a bottleneck
gc current block 2-way occurs when a session requests a data block in "Current" (DML/Exclusive) mode, and the block is transferred directly from a remote instance in 2 network hops (Requesting Instance \(\rightarrow \) Holding Instance \(\rightarrow \) Requesting Instance)- Current Mode: Indicates a request for the current block data (typically for DML like
UPDATE,INSERT,DELETE, orSELECT FOR UPDATE) rather than a Consistent Read (CR) snapshot. - 2-Way Transfer: The block is found in the cache of exactly one other remote instance and is sent directly over the interconnect. No third master instance is required for the transfer.
- Normal Operation: In an Oracle RAC environment, block transfers are standard. This wait event alone does not necessarily indicate a problem, unless the wait times or total waits are excessively high and degrading performance
V$ACTIVE_SESSION_HISTORY or the Oracle AWR Report to find the specific segments (tables/indexes) associated with the waits.INSERT operations (like appending to sequences) can cause "hot" blocks at the ends of indexes. Consider using partitioned indexes or increasing the number of sequence cache entries (e.g., CACHE 1000 NOORDER).PCTFREE on tables with high concurrency to reduce the number of rows per block. This helps minimize multiple instances hitting the exact same physical block simultaneously.gc cr block busy :
gc cr block busy is an Oracle RAC (Real Application Clusters) wait event indicating that a session requested a consistent read (CR) block, but the block transfer between instances was delayed. This means there is high contention for a "hot block" across nodes- Remote Pinning: The remote instance holding the block is actively modifying it (e.g., locking, updating) or has not yet finished writing its redo logs for that transaction.
- Log Flush Delays: The transfer is held up because the holding instance cannot write to the online redo logs quickly enough.
- Contention: Multiple instances are requesting the same block simultaneously
- Identify the Hot Block: Use the
V$SESSION_WAITview to find the file and block number causing the waits (using parametersP1andP2). - Find the Object: Map the file/block to a specific database table or index using
DBA_EXTENTS. - Tune the Application:
- Optimize SQL queries to reduce large full table or index scans that span across nodes.
- If a single block holds too many small rows (e.g., sequence generators), consider increasing the cache size or partitioning the table.
- Check I/O Performance: Review your redo log write times. If you see high
log file syncwaits alongside this event, your disk group for redo logs may be experiencing I/O bottlenecks
No comments:
Post a Comment