VM Network Port Attachment Failure and High CPU Usage on OVNDB

Problem

The Private Cloud Director (PCD) environment is experiencing a critical failure in VM network port attachment operations, which prevents virtual machines from connecting to the network. The OVNDB service is consuming 88-95% CPU, which makes it unavailable for legitimate API requests.

Ovn Logs
Ostackhost Logs

Environment

  • Private Cloud Director Virtualization - v2025.4 and Higher
  • Private Cloud Director Kubernetes - v2025.4 and Higher
  • Self-Hosted Private Cloud Director Virtualization - v2025.4 and Higher
  • Self-Hosted Private Cloud Director Kubernetes - v2025.4 and Higher

Cause

The OVN Database (OVN-DB) service has entered a non-responsive state characterised by a high-CPU spin loop.

Diagnostics

  1. VM deployment fails with the following exception in the pf9-ostackhost logs, located on the hypervisor under /var/log/pf9/.
Logs

Steps 2 and 3 are accessible and applicable only for Self-Hosted Private Cloud Director. For SaaS environments, please reach out to Platform9 support.

  2. Review the ovn-ovsdb-sb-0 pod logs from the workload region namespace.
Logs
  3. In the pod logs above, check for high CPU usage messages like those in the snippet below.
Logs
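
The check in step 3 can be sketched as a small shell filter. This is a minimal sketch, not the article's original command: it assumes the pod logs contain Open vSwitch-style poll_loop lines that end in "(NN% CPU usage)". The function name high_cpu_lines, the default threshold of 80, and the <WORKLOAD_REGION_NAMESPACE> placeholder are illustrative, not taken from the product.

```shell
#!/bin/sh
# Print only the log lines that report CPU usage at or above a threshold.
# Assumption: such lines contain "(NN% CPU usage)", as OVS poll_loop messages do.
high_cpu_lines() {
    threshold="${1:-80}"   # hypothetical default; pass another value as $1
    awk -v t="$threshold" '
        match($0, /\([0-9]+% CPU usage\)/) {
            # Take the text after "(" and strip everything from "%" onward.
            pct = substr($0, RSTART + 1, RLENGTH - 1)
            sub(/%.*/, "", pct)
            if (pct + 0 >= t) print
        }
    '
}

# Example usage (namespace is a placeholder to fill in):
#   kubectl logs ovn-ovsdb-sb-0 -n <WORKLOAD_REGION_NAMESPACE> | high_cpu_lines 85
```

Reading from standard input keeps the filter independent of how the logs are collected, so the same function works on a saved log file as well.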

Resolution

These resolution steps are applicable only for Self-Hosted Private Cloud Director. For SaaS environments, please reach out to Platform9 support.

  1. Take a backup of the OVNDB database.
Command
  2. Once the backup is taken, run the compaction command.
Command
  3. Enable memory compaction on the pods below.
Command
  4. Restart the pods below.
Bash

Validation

Validation steps 1 and 2 are applicable only for Self-Hosted Private Cloud Director. For SaaS environments, please reach out to Platform9 support.

  1. Check the pod status; all pods should be in the Running state.
Command
  2. Review the ovn-ovsdb pod logs from the workload region namespace and confirm that no high CPU usage messages are logged in these pods.
Logs
  3. Deploy a new VM and review the pf9-ostackhost logs on the hypervisor (host); they should no longer contain Port <PORT_UUID> is still in use messages.
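
The validation checks above can be sketched as two small shell helpers. This is a hedged sketch rather than the article's original commands: the function names all_pods_running and count_port_in_use are hypothetical, the first assumes the standard column layout of `kubectl get pods --no-headers` (NAME, READY, STATUS, ...), and the log file name ostackhost.log under /var/log/pf9/ is an assumption.

```shell
#!/bin/sh
# Succeeds only if every pod line read from stdin has STATUS "Running".
# Input: output of `kubectl get pods --no-headers` (NAME READY STATUS ...).
all_pods_running() {
    awk 'NF { seen = 1; if ($3 != "Running") bad = 1 }
         END { if (seen && !bad) exit 0; exit 1 }'
}

# Prints how many log lines on stdin still report a port as in use.
count_port_in_use() {
    # grep -c exits non-zero when the count is 0; "|| true" keeps that from
    # being treated as an error by the caller.
    grep -c 'is still in use' || true
}

# Example usage (namespace and log file name are placeholders/assumptions):
#   kubectl get pods -n <WORKLOAD_REGION_NAMESPACE> --no-headers \
#       | grep ovn-ovsdb | all_pods_running && echo "ovn-ovsdb pods running"
#   count_port_in_use < /var/log/pf9/ostackhost.log
```

Because the host log also holds entries from before the fix, compare the count taken before and after deploying the new VM rather than expecting it to be zero.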