name: local-mainnet-debug description: Debug Lodestar beacon node issues by running a local mainnet node with checkpoint sync and engineMock. Use for investigating networking bugs, peer discovery issues, identify failures, metrics anomalies, or any behavior that needs real-world peer interactions without a full execution client.
Local Mainnet Debugging
Run a local Lodestar beacon node against mainnet peers using checkpoint sync and engineMock to reproduce and debug networking, peer discovery, and protocol-level issues in a real-world environment.
Quick Start
cd ~/lodestar # or worktree directory
# Basic run — connects to mainnet peers, no EL needed
./lodestar beacon \
--network mainnet \
--dataDir /tmp/lodestar-debug \
--rest false \
--metrics \
--execution.engineMock \
--port 19771 \
--logLevel debug \
--checkpointSyncUrl https://beaconstate-mainnet.chainsafe.io \
--forceCheckpointSync
# Time-boxed run (e.g., 2 minutes)
timeout 120 ./lodestar beacon \
--network mainnet \
--dataDir /tmp/lodestar-debug \
--rest false \
--metrics \
--execution.engineMock \
--port 19771 \
--logLevel debug \
--checkpointSyncUrl https://beaconstate-mainnet.chainsafe.io \
--forceCheckpointSync
Key Parameters
| Parameter | Purpose | Notes |
|---|---|---|
--network mainnet |
Connect to real mainnet peers | Use holesky for testnet |
--execution.engineMock |
Skip EL requirement | Node won't validate execution payloads |
--rest false |
Disable REST API | Reduces noise, avoids port conflicts |
--metrics |
Enable Prometheus metrics | Scrape at http://localhost:8008/metrics |
--port 19771 |
Custom P2P port | Avoid conflicts with other instances |
--logLevel debug |
Verbose logging | Use trace for maximum detail |
--checkpointSyncUrl |
Checkpoint sync endpoint | https://beaconstate-mainnet.chainsafe.io for mainnet |
--forceCheckpointSync |
Force checkpoint sync even if DB exists | Clean start each run |
Checkpoint Sync Endpoints
# Mainnet
--checkpointSyncUrl https://beaconstate-mainnet.chainsafe.io
--checkpointSyncUrl https://mainnet-checkpoint-sync.stakely.io
--checkpointSyncUrl https://sync-mainnet.beaconcha.in
# Holesky
--checkpointSyncUrl https://beaconstate-holesky.chainsafe.io
--checkpointSyncUrl https://holesky.beaconstate.ethstaker.cc
Metrics Scraping
# One-shot metric grab
curl -s http://localhost:8008/metrics | grep <pattern>
# Periodic sampling (every 30s)
while true; do
echo "=== $(date -u +%H:%M:%S) ==="
curl -s http://localhost:8008/metrics | grep -E 'lodestar_peers_by_client|peer_count'
sleep 30
done
# Key metrics for peer debugging
curl -s http://localhost:8008/metrics | grep -E \
'lodestar_peers_by_client|libp2p_identify|peer_count|connected_peers'
Debugging Techniques
1. A/B Testing with Code Changes
The most powerful technique — test a hypothesis by comparing control vs modified runs.
# Control: capture baseline (2 min)
timeout 120 ./lodestar beacon [flags] 2>&1 | tee /tmp/control.log &
# Sample metrics during run
for i in 1 2 3 4; do sleep 30; curl -s http://localhost:8008/metrics > /tmp/control-$i.metrics; done
# Test: apply change, repeat
timeout 120 ./lodestar beacon [flags] 2>&1 | tee /tmp/test.log &
for i in 1 2 3 4; do sleep 30; curl -s http://localhost:8008/metrics > /tmp/test-$i.metrics; done
# Compare
diff <(grep pattern /tmp/control-4.metrics) <(grep pattern /tmp/test-4.metrics)
2. Instrument libp2p Internals (Monkeypatching)
For deep protocol debugging, add temporary instrumentation to node_modules:
# Find the file to patch
find node_modules -path '*libp2p*identify*' -name '*.js' | head -20
# Key files:
# - node_modules/@libp2p/identify/dist/src/identify.js
# - node_modules/@chainsafe/libp2p-yamux/dist/src/stream.js (yamux streams)
# - node_modules/@libp2p/mplex/dist/src/mplex.js (mplex streams)
Important: Always remove monkeypatches before committing. Use pnpm install to restore.
3. Log Analysis
# Count specific errors
grep -c "Error setting agentVersion" /tmp/run.log
# Track identify success/failure over time
grep -E "identify (success|error|timeout)" /tmp/run.log | head -50
# Extract peer connection events
grep -E "peer:(connect|disconnect|identify)" /tmp/run.log
4. Stream-Level Debugging
For protocol stream issues (identify, ping, metadata):
Trace stream lifecycle:
1. Stream opened (protocol, direction, connection ID)
2. MSS negotiation (success/failure)
3. Data read/write (first frame timing)
4. Stream close (who closed, when)
Key locations for instrumentation:
@libp2p/identify: identify.js →_identify()method- Stream open/close: yamux or mplex stream.js
- Protocol negotiation:
@libp2p/multistream-select
Common Issues & Diagnostic Approaches
Unknown Peers (identify failures)
Symptoms: High ratio of "Unknown" in lodestar_peers_by_client metric.
Diagnostic approach:
- Check
lodestar_peers_by_clientfor Unknown ratio - Enable debug logs, grep for "Error setting agentVersion"
- Instrument identify stream to check stream state before
pb.read() - A/B test: disable suspect components → if Unknown drops to 0, found the cause
Slow Peer Discovery
Symptoms: Low peer count after startup.
Diagnostic approach:
- Check
libp2p_dialer_pending_dial_countandlibp2p_connections - Verify port is accessible:
lsof -i :<port> - Check discv5 logs for bootstrap ENR issues
- Monitor
lodestar_discovery_find_node_query_*metrics
High Disconnect Rate
Symptoms: Peers connecting and quickly disconnecting.
Diagnostic approach:
- Check
libp2p_connectionsvslibp2p_protocol_streams - Grep logs for disconnect reasons
- Check if peers are being pruned by score:
lodestar_gossip_peer_score_* - Monitor
lodestar_peers_by_directionfor inbound/outbound balance
Port Conflicts
# Check if port is in use
lsof -i :19771
lsof -i :8008
# Use different ports
--port 19772 --metrics.port 8009
Cleanup
Always use --dataDir /tmp/lodestar-debug when running debug sessions to avoid touching the default mainnet database. This isolates debug data from any production node on the same machine.
# Remove isolated debug data directory after testing
rm -r /tmp/lodestar-debug
Tips
- Short runs are fine. 60-120 seconds is enough to connect to 15-30 peers and observe identify behavior.
- engineMock means no execution validation. The node will sync headers but won't validate blocks. Fine for networking/peer debugging.
- Custom port avoids conflicts with any production nodes on the same machine.
- Always use
--forceCheckpointSyncto ensure a clean start. Stale DB state can mask issues. - Metrics lag behind logs. After stopping the node, the last metrics scrape reflects final state. Periodic sampling during the run gives time-series data.
- Debug builds not needed. Standard
pnpm buildoutput is sufficient; libp2p debug info comes from log level and instrumentation. - Use
--network holeskywhen you need a smaller, faster-syncing network for testing.
Self-Maintenance
If any commands, file paths, URLs, or configurations in this skill are outdated or no longer work, update this SKILL.md with the correct information after completing your current task. Skills should stay accurate and self-healing — fix what you find broken.