Min–Max Dynamic Programming Control for Systems with Uncertain Mathematical Models via Differential Neural Network Bellman’s Function Approximation

Alexander Poznyak; Sebastian Noriega-Marquez; Alejandra Hernandez-Sanchez; Mariana Ballesteros-Escamilla; Isaac Chairez

doi:10.3390/math11051211

Min–Max Dynamic Programming Control for Systems with Uncertain Mathematical Models via Differential Neural Network Bellman’s Function Approximation

Alexander Poznyak, Sebastian Noriega-Marquez, Alejandra Hernandez-Sanchez, Mariana Ballesteros-Escamilla, Isaac Chairez

Producción científica: Contribución a una revista › Artículo › revisión exhaustiva

1 Cita (Scopus)

Resumen

This research focuses on designing a min–max robust control based on a neural dynamic programming approach using a class of continuous differential neural networks (DNNs). The proposed controller solves the robust optimization of a proposed cost function that depends on the trajectories of a system with an uncertain mathematical model satisfying a class of non-linear perturbed systems. The dynamic programming min–max formulation enables robust control concerning bounded modelling uncertainties and disturbances. The Hamilton–Jacobi–Bellman (HJB) equation’s value function, approximated by a DNN, permits to estimate the closed-loop formulation of the controller. The controller design is based on an estimated state trajectory with the worst possible uncertainties/perturbations that provide the degree of robustness using the proposed controller. The class of learning laws for the time-varying weights in the DNN is produced by studying the HJB partial differential equation. The controller uses the solution of the obtained learning laws and a time-varying Riccati equation. A recurrent algorithm based on the Kiefer–Wolfowitz method leads to adjusting the initial conditions for the weights to satisfy the final condition of the given cost function. The robust control suggested in this work is evaluated using a numerical example confirming the optimizing solution based on the DNN approximate for Bellman’s value function.

Idioma original	Inglés
Número de artículo	1211
Publicación	Mathematics
Volumen	11
N.º	5
DOI	https://doi.org/10.3390/math11051211
Estado	Publicada - mar. 2023
Publicado de forma externa	Sí

Acceder al documento

10.3390/math11051211

Otros archivos y enlaces

Enlace a la publicación en Scopus

Citar esto

@article{88460e4777ec4153ae8bb79666e0f620,

title = "Min–Max Dynamic Programming Control for Systems with Uncertain Mathematical Models via Differential Neural Network Bellman{\textquoteright}s Function Approximation",

abstract = "This research focuses on designing a min–max robust control based on a neural dynamic programming approach using a class of continuous differential neural networks (DNNs). The proposed controller solves the robust optimization of a proposed cost function that depends on the trajectories of a system with an uncertain mathematical model satisfying a class of non-linear perturbed systems. The dynamic programming min–max formulation enables robust control concerning bounded modelling uncertainties and disturbances. The Hamilton–Jacobi–Bellman (HJB) equation{\textquoteright}s value function, approximated by a DNN, permits to estimate the closed-loop formulation of the controller. The controller design is based on an estimated state trajectory with the worst possible uncertainties/perturbations that provide the degree of robustness using the proposed controller. The class of learning laws for the time-varying weights in the DNN is produced by studying the HJB partial differential equation. The controller uses the solution of the obtained learning laws and a time-varying Riccati equation. A recurrent algorithm based on the Kiefer–Wolfowitz method leads to adjusting the initial conditions for the weights to satisfy the final condition of the given cost function. The robust control suggested in this work is evaluated using a numerical example confirming the optimizing solution based on the DNN approximate for Bellman{\textquoteright}s value function.",

keywords = "Kiefer–Wolfowitz method, approximate models, artificial neural networks, robust optimal control",

author = "Alexander Poznyak and Sebastian Noriega-Marquez and Alejandra Hernandez-Sanchez and Mariana Ballesteros-Escamilla and Isaac Chairez",

note = "Publisher Copyright: {\textcopyright} 2023 by the authors.",

year = "2023",

month = mar,

doi = "10.3390/math11051211",

language = "Ingl{\'e}s",

volume = "11",

journal = "Mathematics",

issn = "2227-7390",

number = "5",

}

Min–Max Dynamic Programming Control for Systems with Uncertain Mathematical Models via Differential Neural Network Bellman’s Function Approximation. / Poznyak, Alexander; Noriega-Marquez, Sebastian; Hernandez-Sanchez, Alejandra et al.
En: Mathematics, Vol. 11, N.º 5, 1211, 03.2023.

Producción científica: Contribución a una revista › Artículo › revisión exhaustiva

TY - JOUR

T1 - Min–Max Dynamic Programming Control for Systems with Uncertain Mathematical Models via Differential Neural Network Bellman’s Function Approximation

AU - Poznyak, Alexander

AU - Noriega-Marquez, Sebastian

AU - Hernandez-Sanchez, Alejandra

AU - Ballesteros-Escamilla, Mariana

AU - Chairez, Isaac

PY - 2023/3

Y1 - 2023/3

N2 - This research focuses on designing a min–max robust control based on a neural dynamic programming approach using a class of continuous differential neural networks (DNNs). The proposed controller solves the robust optimization of a proposed cost function that depends on the trajectories of a system with an uncertain mathematical model satisfying a class of non-linear perturbed systems. The dynamic programming min–max formulation enables robust control concerning bounded modelling uncertainties and disturbances. The Hamilton–Jacobi–Bellman (HJB) equation’s value function, approximated by a DNN, permits to estimate the closed-loop formulation of the controller. The controller design is based on an estimated state trajectory with the worst possible uncertainties/perturbations that provide the degree of robustness using the proposed controller. The class of learning laws for the time-varying weights in the DNN is produced by studying the HJB partial differential equation. The controller uses the solution of the obtained learning laws and a time-varying Riccati equation. A recurrent algorithm based on the Kiefer–Wolfowitz method leads to adjusting the initial conditions for the weights to satisfy the final condition of the given cost function. The robust control suggested in this work is evaluated using a numerical example confirming the optimizing solution based on the DNN approximate for Bellman’s value function.

AB - This research focuses on designing a min–max robust control based on a neural dynamic programming approach using a class of continuous differential neural networks (DNNs). The proposed controller solves the robust optimization of a proposed cost function that depends on the trajectories of a system with an uncertain mathematical model satisfying a class of non-linear perturbed systems. The dynamic programming min–max formulation enables robust control concerning bounded modelling uncertainties and disturbances. The Hamilton–Jacobi–Bellman (HJB) equation’s value function, approximated by a DNN, permits to estimate the closed-loop formulation of the controller. The controller design is based on an estimated state trajectory with the worst possible uncertainties/perturbations that provide the degree of robustness using the proposed controller. The class of learning laws for the time-varying weights in the DNN is produced by studying the HJB partial differential equation. The controller uses the solution of the obtained learning laws and a time-varying Riccati equation. A recurrent algorithm based on the Kiefer–Wolfowitz method leads to adjusting the initial conditions for the weights to satisfy the final condition of the given cost function. The robust control suggested in this work is evaluated using a numerical example confirming the optimizing solution based on the DNN approximate for Bellman’s value function.

KW - Kiefer–Wolfowitz method

KW - approximate models

KW - artificial neural networks

KW - robust optimal control

UR - http://www.scopus.com/inward/record.url?scp=85149814526&partnerID=8YFLogxK

U2 - 10.3390/math11051211

DO - 10.3390/math11051211

M3 - Artículo

AN - SCOPUS:85149814526

SN - 2227-7390

VL - 11

JO - Mathematics

JF - Mathematics

IS - 5

M1 - 1211

ER -

Min–Max Dynamic Programming Control for Systems with Uncertain Mathematical Models via Differential Neural Network Bellman’s Function Approximation

Resumen

Acceder al documento

Otros archivos y enlaces

Huella

Citar esto