local-cluster-manager

star 0

Manage local multigres cluster components (multipooler, pgctld, multiorch, multigateway) - start/stop services, view logs, connect with psql, test S3 backups locally

mkindahl By mkindahl schedule Updated 2/6/2026

name: "Local Cluster Manager" description: "Manage local multigres cluster components (multipooler, pgctld, multiorch, multigateway) - start/stop services, view logs, connect with psql, test S3 backups locally"

Local Cluster Manager

Manage local multigres cluster - both cluster-wide operations and individual components.

When to Use This Skill

Invoke this skill when the user asks to:

  • Start/stop/restart the entire cluster or individual components
  • View logs for any component
  • Connect to multipooler or multigateway with psql
  • Check status of cluster components
  • Check multipooler topology status (PRIMARY/REPLICA roles)
  • Check if PostgreSQL instances are in recovery mode
  • Test S3 backups (initialize cluster with S3, create/list/restore backups)
  • Configure or troubleshoot S3 backup settings

Performance Optimization

Parse ./multigres_local/multigres.yaml once when this skill is first invoked and cache the cluster configuration in memory for the duration of the conversation. Use the cached data for all subsequent commands. Only re-parse if the user explicitly asks to "reload config" or if a command fails due to stale config.

Cluster-Wide Operations

Start entire cluster:

./bin/multigres cluster start

Stop entire cluster:

./bin/multigres cluster stop

Stop entire cluster and delete all cluster data:

./bin/multigres cluster stop --clean

Check cluster status:

./bin/multigres cluster status

Initialize new cluster:

./bin/multigres cluster init

Get all multipoolers from topology:

./bin/multigres getpoolers

Returns JSON with all multipoolers, their cells, service IDs, ports, and pooler directories.

Get detailed status for a specific multipooler:

./bin/multigres getpoolerstatus --cell <cell-name> --service-id <service-id>

Returns detailed status including:

  • pooler_type: 1 = PRIMARY, 2 = REPLICA
  • postgres_role: "primary" or "standby"
  • postgres_running: Whether PostgreSQL is running
  • wal_position: Current WAL position
  • consensus_term: Current consensus term
  • primary_status: (for PRIMARY) connected followers and sync replication config
  • replication_status: (for REPLICA) replication lag and primary connection info

Example:

./bin/multigres getpoolerstatus --cell zone1 --service-id thhcdhbp

Check PostgreSQL recovery mode directly:

psql -h <pooler-dir>/pg_sockets -p <pg-port> -U postgres -d postgres -c "SELECT pg_is_in_recovery();"

Returns t (true) if in recovery/standby mode, f (false) if primary.

S3 Backup Testing

Test S3 backups using AWS S3. When the user wants to test S3 backups:

Configuration Caching: When S3 configuration values are first provided, cache them in memory for the duration of the conversation. Reuse these cached values for all subsequent S3 operations. Only re-prompt if:

  • The user explicitly asks to change the configuration
  • A command fails due to invalid/expired credentials
  • The values have never been provided in this conversation
  1. Prompt for S3 configuration using AskUserQuestion (only if not already cached):

    • Path to AWS credentials file (e.g., ./.staging-aws or ~/.aws/credentials)
    • S3 backup URL (e.g., s3://bucket-name/backups/)
    • AWS region (e.g., us-east-1)
  2. Check/source credentials:

# Check if AWS credentials are already set
env | grep AWS_

# If not, source the credentials file (path from user)
source <credentials-file-path>

# Verify credentials are now set
env | grep AWS_

IMPORTANT:

  • NEVER commit AWS credentials files to git
  • Avoid printing credentials to the terminal
  • Credentials file should contain: AWS_ACCESS_KEY_ID, AWS_SECRET_ACCESS_KEY, AWS_SESSION_TOKEN (if using temporary credentials)
  1. Initialize cluster with S3:
./bin/multigres cluster stop --clean
rm -rf multigres_local
./bin/multigres cluster init \
  --backup-url=<s3-url-from-user> \
  --region=<region-from-user>
  1. Start cluster (use standard cluster start command)

  2. Verify S3 configuration:

grep -r "aws_access_key_id\|aws_secret_access_key\|region\|repo1-s3" ./multigres_local/data/pooler_*/pgbackrest.conf

Should see AWS credentials and S3 configuration in all pgbackrest.conf files.

Backup Commands

Create backup:

./bin/multigres cluster backup

List all backups:

./bin/multigres cluster list-backups

Restore from backup:

./bin/multigres cluster restore --backup-label <label>

Troubleshooting S3 Issues

Missing/expired credentials:

# Re-source credentials file
source <credentials-file-path>

# Verify they're set
env | grep AWS_ | wc -l  # Should show 3+ environment variables

# Reinitialize cluster to pick up new credentials
./bin/multigres cluster stop --clean
rm -rf multigres_local
./bin/multigres cluster init --backup-url=<s3-url> --region=<region>

Check pgbackrest logs for errors:

# View recent errors
tail -100 ./multigres_local/data/pooler_*/pg_data/log/pgbackrest-*.log

# Follow logs in real-time
tail -f ./multigres_local/data/pooler_*/pg_data/log/pgbackrest-*.log

Verify S3 bucket access:

# Use AWS CLI to test bucket access (if installed)
aws s3 ls <s3-bucket-path> --region <region>

Individual Component Operations

Configuration

  1. Parse the config: Read ./multigres_local/multigres.yaml to discover available components and their IDs

  2. Component ID mapping:

    • multipooler IDs: extracted from .provisioner-config.cells.<zone>.multipooler.service-id
    • pgctld uses the same IDs as multipooler
    • multiorch has separate IDs for each zone
    • multigateway has separate IDs for each zone
  3. If no ID provided: Use AskUserQuestion to let the user select which instance to operate on

    • Show available IDs with their zone names
    • Example: "xf42rpl6 (zone1)", "hm9hmxzm (zone2)", "n6t8hvgl (zone3)"

Commands

Stop pgctld:

./bin/pgctld stop --pooler-dir <pooler-dir-from-config>

Start pgctld:

./bin/pgctld start --pooler-dir <pooler-dir-from-config>

Restart pgctld (as standby):

./bin/pgctld restart --pooler-dir <pooler-dir-from-config> --as-standby

Check pgctld status:

./bin/pgctld status --pooler-dir <pooler-dir-from-config>

View logs:

  • multipooler: ./multigres_local/logs/dbs/postgres/multipooler/[id].log
  • pgctld: ./multigres_local/logs/dbs/postgres/pgctld/[id].log
  • multiorch: ./multigres_local/logs/dbs/postgres/multiorch/[id].log
  • multigateway: ./multigres_local/logs/dbs/postgres/multigateway/[id].log
  • PostgreSQL: ./multigres_local/data/pooler_[id]/pg_data/postgresql.log

Tail logs:

tail -f <log-path>

Connect to multipooler (via Unix socket):

psql -h <pooler-dir>/pg_sockets -p <pg-port> -U postgres -d postgres

Where:

  • pooler-dir is from .provisioner-config.cells.<zone>.multipooler.pooler-dir
  • pg-port is from .provisioner-config.cells.<zone>.pgctld.pg-port
  • PostgreSQL socket is at <pooler-dir>/pg_sockets/.s.PGSQL.<pg-port>

Example:

psql -h ./multigres_local/data/pooler_xf42rpl6/pg_sockets -p 25432 -U postgres -d postgres

Connect to multigateway (via TCP):

psql -h localhost -p <pg-port> -U postgres -d postgres

Where:

  • pg-port is from .provisioner-config.cells.<zone>.multigateway.pg-port

Example:

psql -h localhost -p 15432 -U postgres -d postgres

Config Paths

Extract from YAML config at .provisioner-config.cells.<zone>.pgctld.pooler-dir

Examples

Cluster-wide:

User: "start the cluster"

  • Execute: ./bin/multigres cluster start

User: "stop cluster"

  • Execute: ./bin/multigres cluster stop

User: "cluster status"

  • Execute: ./bin/multigres cluster status

User: "show me all multipoolers" or "get poolers"

  • Execute: ./bin/multigres getpoolers

User: "check if multipoolers are in recovery" or "check multipooler status"

  • Parse config to get all zones and service IDs
  • Execute: ./bin/multigres getpoolerstatus --cell <zone> --service-id <id> for each
  • Display pooler_type (PRIMARY/REPLICA) and postgres_role (primary/standby)

User: "check zone1 multipooler status"

  • Look up service ID for zone1
  • Execute: ./bin/multigres getpoolerstatus --cell zone1 --service-id <id>

Individual components:

User: "stop pgctld"

  • Read config to find available pgctld instances
  • Ask user which one to stop (zone1, zone2, or zone3)
  • Execute stop command with selected pooler-dir

User: "restart pgctld xf42rpl6 as standby"

  • Look up pooler-dir for xf42rpl6 in config
  • Execute: ./bin/pgctld restart --pooler-dir /path/to/pooler_xf42rpl6 --as-standby

User: "logs multipooler hm9hmxzm"

  • Show: ./multigres_local/logs/dbs/postgres/multipooler/hm9hmxzm.log

User: "tail pgctld"

  • Ask which instance
  • Tail the corresponding log file

User: "connect to multipooler zone1" or "psql multipooler xf42rpl6"

  • Look up pooler-dir and pg-port from config
  • Show: psql -h <pooler-dir>/pg_sockets -p <pg-port> -U postgres -d postgres

User: "connect to multigateway" or "psql multigateway"

  • Ask which zone
  • Show: psql -h localhost -p <pg-port> -U postgres -d postgres
Install via CLI
npx skills add https://github.com/mkindahl/multigres --skill local-cluster-manager
Repository Details
star Stars 0
call_split Forks 0
navigation Branch main
article Path SKILL.md
More from Creator