Alembic Setup and Production Migration Guide¶

Date: 2025-10-15 Author: Claude Code Status: Alembic Configured - Ready for Preview Testing

Executive Summary¶

Alembic database migration management has been successfully configured for S5 Slidefactory. This document explains:

What was done in the preview environment
How to safely migrate production without downtime
Ongoing migration workflow for future schema changes

Key Achievement: Alembic is now the official migration tool, replacing manual SQL migrations while maintaining backward compatibility.

What Was Changed¶

Files Added/Modified¶

Added Files:¶

alembic.ini - Alembic configuration file
alembic/env.py - Migration environment and model imports
alembic/script.py.mako - Migration script template
alembic/README - Quick reference guide
alembic/versions/ae98aa32e0d2_baseline_existing_schema.py - Baseline migration (empty)

Modified Files:¶

requirements.txt - Added alembic>=1.13.0
CLAUDE.md - Added Database Migrations section
This document - Production migration guide

Architecture Changes¶

Before: - Manual SQL migrations in /migrations/ folder - No version tracking - No rollback support - Manual execution required

After: - Alembic-managed migrations in alembic/versions/ - Automatic version tracking via alembic_version table - Built-in rollback support - Autogenerate migrations from model changes

Legacy Support: - Old SQL migrations in /migrations/ retained for reference - init.sql still used for fresh database initialization - Existing databases can adopt Alembic without re-creation

How Alembic Was Configured¶

Step 1: Installation¶

Added to requirements.txt:

alembic>=1.13.0                 # Database migration tool for SQLAlchemy

Step 2: Initialization¶

Created Alembic directory structure:

alembic/
├── env.py              # Migration environment configuration
├── script.py.mako      # Migration template
├── README              # Quick reference
└── versions/           # Migration history
    └── ae98aa32e0d2_baseline_existing_schema.py
alembic.ini             # Main configuration file

Step 3: Configuration¶

alembic.ini: - Configured to load database URL dynamically from app.config.settings - Set script location to alembic/ - Configured logging

alembic/env.py: - Imports Base from app.util.database - Imports all SQLAlchemy models: - app.context.models: Document, Chunk - app.results.models: ResultData - app.auth.users.models: User - app.filemanager.workflow_files.models: WorkflowMetaFile - app.n8nmanager.process: Process - app.api.models: ApiKey, PresentationRecord - Sets target_metadata = Base.metadata for auto-detection - Connects to database using settings.DATABASE_URL

Step 4: Baseline Migration¶

Created empty baseline migration ae98aa32e0d2_baseline_existing_schema.py: - Revision ID: ae98aa32e0d2 - No schema changes (upgrade() and downgrade() are pass) - Purpose: Mark the current schema state as the starting point

Why empty? The database already has the schema. This migration simply establishes a baseline version for Alembic to track future changes.

SQLAlchemy Models vs. Database Tables¶

Models Currently in Alembic¶

These tables have SQLAlchemy models and are tracked by Alembic:

Table	Model Location
`context_documents`	`app/context/models.py::Document`
`context_chunks`	`app/context/models.py::Chunk`
`results_data`	`app/results/models.py::ResultData`
`api_keys`	`app/api/models.py::ApiKey`
`presentation_records`	`app/api/models.py::PresentationRecord`
`app_users`	`app/auth/users/models.py::User`
`app_workflow_meta_files`	`app/filemanager/workflow_files/models.py::WorkflowMetaFile`
`n8n_processes`	`app/n8nmanager/process.py::Process`

Legacy Tables (Not in Alembic Yet)¶

These tables exist in init.sql but don't have SQLAlchemy models:

app_processes - Legacy process tracking (being phased out)
app_workflows - Legacy workflow executions
app_workflow_meta - Legacy workflow metadata
app_prompts - Legacy prompt storage
app_sequence_order - Legacy sequence configuration
app_dimension_order - Legacy dimension configuration
app_storage_items - Legacy storage tracking
cache_scraped_raw - Web scraping cache

Strategy for Legacy Tables: - Keep them in the database (no breaking changes) - Managed via init.sql for new installations - If models are created later, Alembic will detect them - Can be migrated to Alembic incrementally as needed

N8N Separate Database¶

These models use a separate database (N8NBase) and are not tracked by Alembic: - n8n_vector_collections (in N8N database) - n8n_vectors (in N8N database)

Safe Production Migration Strategy¶

Overview¶

The goal is to add Alembic version tracking to production without changing the schema or causing downtime.

Zero-Downtime Migration Process¶

Phase 1: Preview Testing (Current)¶

Status: ✅ COMPLETE

✅ Alembic configured in preview environment
✅ Baseline migration created
⏳ Test migration workflow on preview database (when database is available)

Test Plan:

# When preview database is running:

# 1. Mark current state as baseline (safe - only adds tracking)
alembic stamp head

# 2. Verify stamping worked
alembic current
# Expected output: ae98aa32e0d2 (head)

# 3. Create a test migration
alembic revision --autogenerate -m "test migration"

# 4. Review generated migration (should detect no changes)

# 5. Test upgrade (should be no-op)
alembic upgrade head

# 6. Test downgrade
alembic downgrade -1

# 7. Re-upgrade
alembic upgrade head

# 8. Clean up test migration
rm alembic/versions/*_test_migration.py

Phase 2: Production Deployment¶

When to Deploy: After preview testing confirms everything works.

Deployment Steps:

Option A: Deploy with Code (Recommended)

Deploy new code with Alembic installed (no migrations run yet)

git push origin main  # Triggers GitHub Actions deployment

SSH/connect to production container

# Via Azure Container Apps console or SSH

Run stamp command (safe - only adds version tracking)
```
alembic stamp head
```

Verify stamping

alembic current
# Should show: ae98aa32e0d2 (head)

Check database

# Should see new table:
SELECT * FROM alembic_version;
-- Expected: version_num = 'ae98aa32e0d2'

Done! Production is now Alembic-managed.

Option B: Database Maintenance Window (if preferred)

Schedule brief maintenance window (5 minutes)

Backup production database

pg_dump $DATABASE_URL > backup_pre_alembic.sql

Deploy code with Alembic installed
Run stamp command
```
alembic stamp head
```
Verify and resume operations

Rollback Plan (if needed):

-- If something goes wrong, simply drop the version table:
DROP TABLE alembic_version CASCADE;
-- System will function normally without Alembic tracking

Safety Guarantees¶

What `alembic stamp head` Does¶

Schema Changes: NONE Data Changes: NONE Action: Creates alembic_version table and inserts one row

-- What actually happens:
CREATE TABLE alembic_version (
    version_num VARCHAR(32) NOT NULL,
    CONSTRAINT alembic_version_pkc PRIMARY KEY (version_num)
);

INSERT INTO alembic_version (version_num) VALUES ('ae98aa32e0d2');

Impact: - ✅ Existing tables untouched - ✅ Existing data untouched - ✅ Application continues working normally - ✅ Reversible (just drop alembic_version table)

Why This is Safe¶

No Schema Modifications: The baseline migration is empty (pass)
Only Tracking: Alembic just records "the database is at this version"
Non-Blocking: Operation takes <1 second
No Downtime: Application keeps running during stamping
Reversible: Can remove Alembic tracking without affecting schema

Future Migration Workflow¶

Creating New Migrations¶

When to Create: - Adding new tables - Adding/removing columns - Changing column types - Adding indexes - Any database schema change

Process:

Modify SQLAlchemy model(s)

# Example: app/context/models.py
class Document(Base):
    # ... existing fields ...
    new_field = Column(String, nullable=True)  # NEW

Generate migration (auto-detects changes)

alembic revision --autogenerate -m "add new_field to documents"

Review generated migration

cat alembic/versions/<revision>_add_new_field_to_documents.py

Test on local/preview
```
alembic upgrade head
```

Commit to git

git add alembic/versions/<revision>_*.py app/context/models.py
git commit -m "feat: add new_field to documents table"
git push

Deploy to production
CI/CD pipeline deploys code
Run alembic upgrade head on production
Or add to deployment script

Manual Migrations (when autogenerate isn't enough)¶

For complex migrations (data transformations, etc.):

# Create empty migration
alembic revision -m "complex data migration"

# Edit the generated file manually
# Add custom SQL or Python code

# Test thoroughly before deploying

Rollback Process¶

If a migration causes issues:

# Rollback one migration
alembic downgrade -1

# Rollback to specific version
alembic downgrade <revision_id>

# Rollback all migrations (danger!)
alembic downgrade base

Integration with CI/CD¶

Current Deployment Pipeline¶

GitHub Actions: - .github/workflows/preview.yml - Preview branch - .github/workflows/production.yml - Main branch

Recommendation: Add migration step to deployment workflow.

Suggested CI/CD Integration¶

Add to deployment workflow:

# .github/workflows/production.yml (example)
- name: Run Database Migrations
  run: |
    # Install dependencies
    pip install -r requirements.txt

    # Run migrations
    alembic upgrade head
  env:
    DATABASE_URL: ${{ secrets.DATABASE_URL }}

Alternative: Run migrations as init container in Azure Container Apps.

For now: Run manually after code deployment until automated.

Migration Best Practices¶

DO ✅¶

Always review autogenerated migrations - Alembic might miss edge cases
Test migrations on preview first - Never test migrations directly on production
Commit migrations to git - Migrations are code
Document complex migrations - Add comments explaining non-obvious changes
Use transactions - Migrations are wrapped in transactions by default
Add rollback logic - Ensure downgrade() works correctly
Check for breaking changes - Consider application compatibility

DON'T ❌¶

Don't edit applied migrations - Create a new migration instead
Don't skip migrations - Apply them in order
Don't delete migrations - They're part of history
Don't run autogenerate without review - It can make mistakes
Don't forget to test rollback - Ensure downgrade() works
Don't modify schema directly - Always use migrations

Troubleshooting¶

Problem: "alembic: command not found"¶

Solution:

# Install alembic in virtual environment
.venv/bin/pip install alembic>=1.13.0

# Or use via venv
.venv/bin/alembic --version

Problem: "Target database is not up to date"¶

Solution:

# Check current version
alembic current

# Check pending migrations
alembic history

# Apply pending migrations
alembic upgrade head

Problem: "Can't locate revision identified by 'ae98aa32e0d2'"¶

Solution:

# The alembic/versions/ directory is missing or not in Python path
# Verify files exist:
ls alembic/versions/

# Verify alembic.ini points to correct location:
grep script_location alembic.ini

Problem: Autogenerate detects unwanted changes¶

Causes: - Legacy tables without models - Type mismatches (e.g., String vs VARCHAR(255)) - Enum type differences

Solutions: 1. Exclude tables from autogenerate:

# In alembic/env.py
def include_object(object, name, type_, reflected, compare_to):
    if type_ == "table" and name in ["legacy_table_1", "legacy_table_2"]:
        return False
    return True

context.configure(
    # ...
    include_object=include_object
)

Manually edit generated migration to remove unwanted changes
Create models for legacy tables to bring them under Alembic management

Legacy SQL Migrations¶

Status¶

Location: /migrations/ folder Status: Retained for reference, not actively used

Files: - 002_add_results_data_table.sql - 003_add_engine_type_to_workflow_meta_files.sql - add_engine_reference_to_processes.sql - create_api_keys_table.sql - extend_workflow_meta_files.sql - remove_workflow_meta_files_fk.sql - README.md

What to do with them: - ✅ Keep for historical reference - ✅ Document that they've been applied - ❌ Don't apply them again (already in schema) - ❌ Don't delete them (useful for understanding schema evolution)

Database Schema Versions¶

Version History¶

Version	Date	Description
`ae98aa32e0d2`	2025-10-15	Baseline - existing schema before Alembic

Future migrations will be added here automatically.

Next Steps¶

Immediate (Before Production Deployment)¶

✅ Test Alembic setup on preview environment (when database available)
✅ Verify alembic stamp head works correctly
✅ Create a test migration to validate workflow
✅ Test rollback functionality

Production Deployment¶

Deploy code with Alembic to production
Run alembic stamp head on production database
Verify version tracking is working
Document actual deployment date and any issues

Ongoing¶

Use Alembic for all future schema changes
Consider automating migrations in CI/CD pipeline
Create models for legacy tables if needed
Monitor migration performance on large tables

FAQ¶

Q: Will this break my existing database?¶

A: No. The alembic stamp head command only adds a version tracking table. No schema or data changes occur.

Q: Do I need to migrate legacy tables to Alembic?¶

A: Not immediately. Legacy tables without models can coexist. Migrate them as you create models for them.

Q: What happens if I don't run `alembic stamp head` on production?¶

A: The application will work fine, but you won't be able to use Alembic for future migrations on that database.

Q: Can I still use manual SQL if needed?¶

A: Yes, but it's better to use Alembic's empty migration template and add custom SQL there. This maintains version tracking.

Q: How do I see what changes a migration will make?¶

A: Use alembic upgrade head --sql to see the SQL without applying it.

Q: What if autogenerate misses a change?¶

A: Manually edit the generated migration file or create a new manual migration with the missing changes.

Summary¶

What Was Accomplished¶

✅ Alembic installed and configured ✅ Migration environment set up with all models imported ✅ Baseline migration created ✅ Documentation added to CLAUDE.md ✅ Production migration guide created (this document)

Production Deployment Checklist¶

Test on preview environment
Review this guide with team
Schedule production deployment
Deploy code with Alembic
Run alembic stamp head on production
Verify alembic current shows correct version
Update deployment documentation

Benefits Achieved¶

✅ Version tracking - Know exactly what schema version is deployed ✅ Rollback support - Undo problematic migrations ✅ Autogenerate - Detect model changes automatically ✅ Team collaboration - No confusion about which migrations to run ✅ CI/CD ready - Can automate future migrations ✅ Industry standard - Using proven tool (Alembic)

Contact¶

For questions or issues with Alembic migrations: 1. Check this document first 2. Review Alembic docs: https://alembic.sqlalchemy.org/ 3. Check migration history: alembic history --verbose 4. Review migration files in alembic/versions/

Last Updated: 2025-10-15 Next Review: After production deployment

Alembic Setup and Production Migration Guide¶

Executive Summary¶

What Was Changed¶

Files Added/Modified¶

Added Files:¶

Modified Files:¶

Architecture Changes¶

How Alembic Was Configured¶

Step 1: Installation¶

Step 2: Initialization¶

Step 3: Configuration¶

Step 4: Baseline Migration¶

SQLAlchemy Models vs. Database Tables¶

Models Currently in Alembic¶

Legacy Tables (Not in Alembic Yet)¶

N8N Separate Database¶

Safe Production Migration Strategy¶

Overview¶

Zero-Downtime Migration Process¶

Phase 1: Preview Testing (Current)¶

Phase 2: Production Deployment¶

Safety Guarantees¶

What alembic stamp head Does¶

Why This is Safe¶

Future Migration Workflow¶

Creating New Migrations¶

Manual Migrations (when autogenerate isn't enough)¶

Rollback Process¶

Integration with CI/CD¶

Current Deployment Pipeline¶

Suggested CI/CD Integration¶

Migration Best Practices¶

DO ✅¶

DON'T ❌¶

Troubleshooting¶

Problem: "alembic: command not found"¶

Problem: "Target database is not up to date"¶

Problem: "Can't locate revision identified by 'ae98aa32e0d2'"¶

Problem: Autogenerate detects unwanted changes¶

Legacy SQL Migrations¶

Status¶

Database Schema Versions¶

Version History¶

Next Steps¶

Immediate (Before Production Deployment)¶

Production Deployment¶

Ongoing¶

FAQ¶

Q: Will this break my existing database?¶

Q: Do I need to migrate legacy tables to Alembic?¶

Q: What happens if I don't run alembic stamp head on production?¶

Q: Can I still use manual SQL if needed?¶

Q: How do I see what changes a migration will make?¶

Q: What if autogenerate misses a change?¶

Summary¶

What Was Accomplished¶

Production Deployment Checklist¶

Benefits Achieved¶

Contact¶

What `alembic stamp head` Does¶

Q: What happens if I don't run `alembic stamp head` on production?¶