Reinforcement Q-learning optimal control of 2D discrete-time systems with unknown dynamics

Wei Wu, Jin Yan, Wei Lin, Zhengjiang Zhang, Guoqiang Zeng, Shipei Huang

Transactions of the Institute of Measurement and Control

Published online on April 17, 2026

Abstract

Transactions of the Institute of Measurement and Control, Ahead of Print.
This paper proposes a Q-learning-based algorithm to solve the linear quadratic regulator (LQR) problem for unknown dynamic two-dimensional (2D) discrete-time systems. First, based on the value function formulation constructed using the Lyapunov function ...