name: discounted-mpc-robustness description: "Robustness analysis for MPC and infinite-horizon optimal control under plant-model mismatch with quadratic costs. Covers discounted and undiscounted scenarios, stability guarantees, and suboptimality bounds. Use when: (1) MPC robustness analysis, (2) plant-model mismatch effects, (3) discounted infinite-horizon control, (4) model uncertainty in optimal control, (5) stability under model errors, (6) data-driven surrogate models."
Discounted MPC Robustness Analysis
Comprehensive framework for analyzing stability and suboptimality of MPC and infinite-horizon optimal control under plant-model mismatch.
Core Problem
Design optimal controls using surrogate model f instead of true plant dynamics g. Characterize:
- Stability of closed-loop system
- Suboptimality of closed-loop cost
- Interaction with horizon length and discount factor
System Setup
Plant vs Surrogate
True plant dynamics:
x+ = g(x, u), g(0,0) = 0
Surrogate model (used for control design):
x+ = f(x, u), f continuous, f(0,0) = 0
Proportional Mismatch
Key measure: Proportional plant-model mismatch
|f - g|_S := inf{p ≥ 0 : |f(x,u) - g(x,u)| ≤ p(|x| + |u|) ∀x ∈ S, u ∈ U}
Properties:
- Bounds mismatch proportional to state and control magnitude
- Origin equilibrium preserved:
f(0,0) = g(0,0) = 0 - Valid in region
S(model assumptions hold / data available)
Additional Assumptions
L-Lipschitz continuity:
|f(x,u) - f(y,u)| ≤ L|x - y| ∀x,y ∈ R^n, ∀u ∈ U
Control set:
Uclosed, contains 0- Ensures existence of optimal controls
Optimal Control Problem
Cost Function
General form (finite/infinite horizon, discounted/undiscounted):
J_{γ,N}^f(x, u_N) = ∑_{k=0}^{N-1} γ^k ℓ(φ_f(k,x,u_k), u_k)
where:
N ∈ N ∪ {∞}: Horizon lengthγ ∈ (0,1]: Discount factorℓ(x,u) = x^T Q x + u^T R u: Quadratic stage cost
Quadratic stage cost:
Q ∈ R^{n×n}, symmetric positive definite
R ∈ R^{m×m}, symmetric positive definite
ℓ(x,u) = ||x||_Q^2 + ||u||_R^2
Value Function
Optimal value function:
V_{γ,N}^f(x) := min_{u_N ∈ U^N} J_{γ,N}^f(x, u_N)
Bellman equation (Proposition 1):
V_{γ,N}^f(x) = min_{u ∈ U} [ℓ(x,u) + γ V_{γ,N-1}^f(f(x,u))]
Optimal Feedback Policy
Set-valued policy:
U_{γ,N}^f(x) := {u ∈ U : ℓ(x,u) + γ V_{γ,N-1}^f(f(x,u)) = V_{γ,N}^f(x)}
Closed-Loop Dynamics
Plant under Surrogate-Based Control
Closed-loop system:
x+ = g(x, κ(x)) where κ(x) ∈ U_{γ,N}^f(x)
Key issue: Control designed for f, applied to g
Closed-loop cost:
J_{γ,N}^g(x_0) = ∑_{k=0}^{N-1} γ^k ℓ(φ_g(k,x_0,u_k), u_k)
where u_k from surrogate-based policy.
Cost Controllability
Assumption (Cost Controllability)
Key requirement:
∃α_c > 0: V_{γ,N}^f(x) ≤ α_c ℓ(x, κ(x)) ∀x ∈ R^n
Interpretation: Optimal value function bounded proportional to stage cost
Consequences:
- Exponential stability for sufficiently long horizons [18]
- Enables Lyapunov-based stability analysis
- Bounds computational complexity
Main Results
Stability Guarantee
Theorem (Stability under Mismatch):
Under:
fcontinuous, L-Lipschitz- Cost controllability (α_c)
- Horizon
Nsufficiently long (or γ sufficiently close to 1) - Mismatch
|f-g|_Ssufficiently small
Result: Closed-loop system exponentially stable about origin.
Lyapunov function: V_{γ,N}^f serves as Lyapunov function
Suboptimality Bound
Theorem (Suboptimality):
Closed-loop cost approaches infinite-horizon optimal cost of surrogate model:
J_{γ,∞}^g(x_0) - V_{γ,∞}^f(x_0) ≤ δ(α_c, L, γ, |f-g|_S)
where δ quantifies performance degradation due to mismatch.
Key Innovation
Perturbation bounds independent of horizon length:
Unlike prior work [8,19,25]:
- Previous: Longer horizon → smaller mismatch needed
- This work: Bounds uniform over horizon length
Advantage: Can use longer horizons without requiring tighter model accuracy
Tradeoffs
Stability Conditions
Stability guaranteed when:
- Horizon length:
N ≥ N*sufficiently long (undiscounted) - Discount factor:
γ ≥ γ*sufficiently close to 1 (infinite horizon) - Mismatch:
|f-g|_S ≤ εsufficiently small
Tradeoff: Can compensate larger mismatch with longer horizon or larger γ.
Suboptimality Degradation
Performance bound depends on:
α_c: Cost controllability parameterL: Lipschitz constantγ: Discount factor|f-g|_S: Mismatch magnitude
Better model (smaller mismatch) → Better performance
Applications
Data-Driven Control
Surrogate from system identification:
- Kernel EDMD: Data-driven models with arbitrarily small mismatch [8]
- Koopman operator: Linear surrogates for nonlinear systems
- More data → More accurate surrogate → Smaller mismatch
Model Simplification
Using simplified models for tractability:
- Linear approximations for nonlinear systems
- Reduced-order models
- Discretization of continuous-time systems
Reinforcement Learning
Discounted costs common in RL:
- Mitigate prediction error accumulation
- Better numerical properties
- Stability requires γ close to 1
Implementation Considerations
Horizon Selection
Finite horizon (MPC):
N_min = compute_minimum_horizon(α_c, L, ε_mismatch)
# Use N ≥ N_min for stability
Infinite horizon:
γ_min = compute_minimum_discount(α_c, L, ε_mismatch)
# Use γ ≥ γ_min for stability (close to 1)
Mismatch Estimation
For data-driven surrogates:
# Kernel EDMD provides mismatch bound
|f - g|_S ≤ p(data_quality, system_class)
# More data → smaller p
For model simplification:
# Analytical mismatch bound
|f - g|_S ≤ p(model_complexity_reduction)
Lyapunov Verification
def verify_stability(x, V_f, κ, α_c, L, ε):
"""Check Lyapunov conditions under mismatch"""
# Decrease condition
ΔV = V_f(g(x, κ(x))) - V_f(x)
# Stability if ΔV ≤ -α ℓ(x, κ(x))
return ΔV ≤ -α * (||x||_Q^2 + ||κ(x)||_R^2)
Comparison to Prior Work
Key Differences
| Aspect | This Work | Prior [8,19,25] |
|---|---|---|
| Horizon dependence | Independent | Worsens with longer N |
| Discounted case | Included | Not studied |
| Infinite horizon | Covered | Not applicable |
| Framework | Unified (all cases) | Finite horizon only |
Advantages
- Horizon-independent bounds: Use longer horizons freely
- Discounted costs: RL applications
- Infinite horizon: Direct optimal control (no MPC receding)
- Unified theory: Single framework for all scenarios
Limitations
Assumption Requirements
- Quadratic stage cost: May not apply to general costs
- Proportional mismatch: Origin must be equilibrium of both systems
- Lipschitz continuity: Required for surrogate
- Cost controllability: May not hold for all systems
Practical Challenges
- Estimating mismatch:
|f-g|_Smay be difficult to determine - Lipschitz constant: May be large for complex systems
- Minimum horizon: May be impractically long for some systems
- Region S: Valid only where model assumptions hold
Related Concepts
- Robust MPC: Terminal ingredients vs long horizon approaches
- Relaxed DP: Bellman inequality for stability
- Data-driven control: Kernel EDMD, Koopman operators
- Discounted optimal control: RL connections
- Lyapunov stability: Value function as Lyapunov function
References
- Paper: Moldenhauer et al., "Discounted MPC and infinite-horizon optimal control under plant-model mismatch" (arXiv:2604.08521v1, April 2026)
- Related: [8] Data-driven MPC with kernel EDMD
- Related: [18] MPC stability via relaxed DP
- Related: [22] Discounted optimal control stability
Open Questions
- Non-quadratic costs: Extension to general stage costs?
- Stochastic systems: Noise handling in framework?
- State constraints: Incorporate constraint satisfaction?
- Non-proportional mismatch: Other mismatch measures?
- Adaptive horizon: Horizon adjustment based on mismatch?
Tools Used
exec: Run stability verification scriptsread: Load reference materialswrite: Save analysis results
Notes
- Horizon-independent bounds major practical improvement
- Discount factor γ close to 1 critical for stability
- More data → more accurate surrogate → better performance
- Framework applicable to MPC, value iteration, direct infinite-horizon control