Upgrading

This guide covers upgrading your RootCause.ai deployment to new versions. Proper upgrade procedures ensure zero-downtime updates and safe rollback options.


Upgrade Process Overview

  1. Review release notes – Understand what changed

  2. Backup data – Ensure recovery options

  3. Test in staging – Validate before production

  4. Upgrade platform – Deploy new version

  5. Upgrade dependencies – If required by the new version

  6. Verify – Confirm everything works

  7. Rollback if needed – Quick recovery option


Checking Current Version

# View installed chart version
helm list -n perceptura

# View running image versions
kubectl get pods -n perceptura -o jsonpath='{range .items[*]}{.metadata.name}{"\t"}{.spec.containers[*].image}{"\n"}{end}'

Checking Available Versions


Pre-Upgrade Checklist

1. Backup MongoDB

2. Backup PostgreSQL

3. Export current values

4. Check pod health


Upgrading from OCI Registry

Standard Upgrade (Latest):

Upgrade to Specific Version:

Dry Run First:


Upgrading from Local Chart


Upgrading Dependencies

When upgrading dependencies (MongoDB, PostgreSQL, etc.), follow their specific upgrade paths:

Dependencies Chart:

⚠️ Database upgrades require special care:

  • Always backup first

  • Check for schema migrations

  • Follow vendor upgrade guides

  • Test data integrity after upgrade


Monitoring the Upgrade

Watch the rollout:


Post-Upgrade Verification

1. Check pod status

2. Check logs for errors

3. Test application functionality

  • Login to the UI

  • Load a Data View

  • Open a Digital Twin

  • Run a simple simulation

4. Check health endpoints


Rollback Procedures

If something goes wrong:

Quick Rollback:

View revision history:

Example output:


Zero-Downtime Upgrades

For zero-downtime upgrades, ensure:

1. Rolling update strategy (default):

2. Sufficient replicas:

3. Pod disruption budgets:

4. Proper health checks:


Handling Breaking Changes

For major version upgrades with breaking changes:

1. Read migration guide – Check release notes for migration steps

2. Blue-green deployment – Deploy new version alongside old:

3. Database migrations – Run any required migrations:


Upgrade Schedule

Recommended approach:

Environment
Timing
Validation Period

Development

Immediately

N/A

Staging

1 day after release

1-2 days

Production

3-5 days after release

After staging validation

Maintenance windows:

  • Schedule during low-usage periods

  • Notify users in advance

  • Have support team available


Troubleshooting Upgrades

Pods stuck in Pending:

CrashLoopBackOff:

Image pull errors:

Configuration errors:


Disaster Recovery

If rollback doesn't work:

1. Restore from backup:

2. Reinstall from known-good version:


Next Steps

With upgrade procedures established:

  • Set up automated backup schedules

  • Configure monitoring alerts for deployment status

  • Document your specific upgrade runbook

  • Practice rollback procedures regularly

Last updated