Explorized policy iteration for continuous-time linear systems

Tae Yoon Chun, Yoon Ho Choi, Jin Bae Park

Research output: Contribution to journal › Article

Abstract

This paper addresses the problem that policy iteration (PI) for continuous-time (CT) systems requires exploration of the state space, a condition known in the adaptive control community as persistency of excitation, and proposes a PI scheme explorized by an additional probing signal to solve it. The proposed PI method efficiently finds the related CT linear quadratic (LQ) optimal control online without knowledge of the system matrix A, and its stability and convergence to the LQ optimal control are proven in the presence of the probing signal. A design method for the probing signal is also presented to balance exploration of the state space against control performance. Finally, several simulation results are provided to verify the effectiveness of the proposed explorized PI method.
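
As a point of reference for the PI setting described in the abstract, the following is a minimal sketch of classical model-based policy iteration (Kleinman's iteration) for the CT LQ problem. It is not the explorized method of the paper: it assumes the system matrix A is known and uses no probing signal. The example matrices, the cost weights Q and R, and all function names are illustrative assumptions, not taken from the paper.

```python
import numpy as np


def lyapunov_solve(Ac, Qc):
    """Solve Ac^T P + P Ac + Qc = 0 for P using Kronecker products."""
    n = Ac.shape[0]
    I = np.eye(n)
    # Column-major vec identities:
    #   vec(Ac^T P) = (I kron Ac^T) vec(P),  vec(P Ac) = (Ac^T kron I) vec(P)
    M = np.kron(I, Ac.T) + np.kron(Ac.T, I)
    p = np.linalg.solve(M, -Qc.flatten(order="F"))
    P = p.reshape((n, n), order="F")
    return 0.5 * (P + P.T)  # symmetrize against round-off


def kleinman_policy_iteration(A, B, Q, R, K0, iterations=20):
    """Model-based PI; K0 must make A - B K0 stable (assumed, not checked)."""
    K = K0
    for _ in range(iterations):
        Ac = A - B @ K
        # Policy evaluation: cost matrix P of the current gain K.
        P = lyapunov_solve(Ac, Q + K.T @ R @ K)
        # Policy improvement: K = R^{-1} B^T P.
        K = np.linalg.solve(R, B.T @ P)
    return P, K


def riccati_residual(A, B, Q, R, P):
    """Left-hand side of the algebraic Riccati equation; zero at the optimum."""
    return A.T @ P + P @ A - P @ B @ np.linalg.solve(R, B.T @ P) + Q


# Hypothetical second-order example; A is already stable, so K0 = 0 is admissible.
A = np.array([[0.0, 1.0], [-1.0, -2.0]])
B = np.array([[0.0], [1.0]])
Q = np.eye(2)
R = np.array([[1.0]])
K0 = np.zeros((1, 2))

P, K = kleinman_policy_iteration(A, B, Q, R, K0)
residual_norm = np.linalg.norm(riccati_residual(A, B, Q, R, P))
```

Each pass evaluates the current gain K by solving a Lyapunov equation and then improves it. With a stabilizing initial gain, `residual_norm` should shrink toward zero as the iteration approaches the LQ optimal control.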

Original language: English
Pages (from-to): 451-458
Number of pages: 8
Journal: Transactions of the Korean Institute of Electrical Engineers
Volume: 61
Issue number: 3
DOIs: 10.5370/KIEE.2012.61.3.451
Publication status: Published - 2012 Mar 1

Fingerprint

Linear systems
Continuous time systems

All Science Journal Classification (ASJC) codes

  • Electrical and Electronic Engineering

Cite this

@article{6ff3624546d04b48b766014664d8a12b,
title = "Explorized policy iteration for continuous-time linear systems",
author = "Chun, {Tae Yoon} and Choi, {Yoon Ho} and Park, {Jin Bae}",
year = "2012",
month = "3",
day = "1",
doi = "10.5370/KIEE.2012.61.3.451",
language = "English",
volume = "61",
pages = "451--458",
journal = "Transactions of the Korean Institute of Electrical Engineers",
issn = "1975-8359",
publisher = "Korean Institute of Electrical Engineers",
number = "3",

}

Explorized policy iteration for continuous-time linear systems. / Chun, Tae Yoon; Choi, Yoon Ho; Park, Jin Bae.

In: Transactions of the Korean Institute of Electrical Engineers, Vol. 61, No. 3, 01.03.2012, p. 451-458.

TY - JOUR

T1 - Explorized policy iteration for continuous-time linear systems

AU - Chun, Tae Yoon

AU - Choi, Yoon Ho

AU - Park, Jin Bae

PY - 2012/3/1

Y1 - 2012/3/1

UR - http://www.scopus.com/inward/record.url?scp=84859060072&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84859060072&partnerID=8YFLogxK

U2 - 10.5370/KIEE.2012.61.3.451

DO - 10.5370/KIEE.2012.61.3.451

M3 - Article

AN - SCOPUS:84859060072

VL - 61

SP - 451

EP - 458

JO - Transactions of the Korean Institute of Electrical Engineers

JF - Transactions of the Korean Institute of Electrical Engineers

SN - 1975-8359

IS - 3

ER -