Existence of Risk Sensitive Optimal Stationary Policies for Controlled Markov Processes¹
Daniel Hernandez-Hernandez² and Steven I. Marcus³
Abstract

In this paper we are concerned with the existence of optimal stationary policies for infinite horizon risk sensitive Markov control processes with denumerable state space, unbounded cost function, and long run average cost. Introducing a discounted cost dynamic game, we prove that its value function satisfies an Isaacs equation, and its relationship with the risk sensitive control problem is studied. Using the vanishing discount approach, we prove that the risk sensitive dynamic programming inequality holds, and derive an optimal stationary policy.

Key Words. Risk sensitive stochastic control, dynamic games, Isaacs equation, optimal stationary policies.

Mathematics Subject Classifications (1991). 90C40 (93E20).

Running Head. Risk Sensitive Controlled Markov Processes.

¹ Supported in part by the National Science Foundation under grant EEC [...]
² Institute for Systems Research, University of Maryland, College Park, Maryland 20742. On leave from Department of Mathematics, CINVESTAV-IPN, Mexico, dher@math.cinvestav.mx
³ Electrical Engineering Department and Institute for Systems Research, University of Maryland, College Park, Maryland 20742, marcus@src.umd.edu
1 Introduction

In this paper we are concerned with the existence of optimal stationary policies for infinite horizon risk sensitive stochastic control problems with denumerable state space, discrete time parameter, unbounded cost function, and long run average cost. For the risk neutral stochastic control problem, the same kind of problem has been addressed (see e.g. [CC, CC-S, S1, S2, HL-L1, HL-L2]) by exploiting the vanishing discount approach, in which the value function of the average cost control problem is approximated by the value function of a sequence of discounted problems. However, for the risk sensitive control problem there does not seem to be a sequence of discounted contro
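
The vanishing discount approach mentioned above for the risk neutral problem can be illustrated on a small example. The sketch below is not taken from the paper: it assumes a hypothetical two-state, two-action model with bounded costs (the arrays `COST` and `TRANS` and the function names are made up for illustration), computes the discounted value function V_beta by plain value iteration, and reports the scaled value (1 - beta) V_beta(x0) together with the relative values h_beta(x) = V_beta(x) - V_beta(x0) as beta approaches 1. It covers only the risk neutral case; the risk sensitive problem studied here is handled through a dynamic game instead.

```python
# Hypothetical toy model (illustrative numbers, not from the paper):
# COST[x][a] is the one-step cost, TRANS[x][a][y] the transition probability.
COST = [[1.0, 2.0], [3.0, 4.0]]
TRANS = [
    [[0.5, 0.5], [0.9, 0.1]],
    [[0.5, 0.5], [0.8, 0.2]],
]


def discounted_value(beta, tol=1e-9, max_iter=1_000_000):
    """Value iteration for the risk neutral discounted problem:
    V(x) = min_a [ c(x, a) + beta * sum_y P(y | x, a) V(y) ]."""
    n = len(COST)
    v = [0.0] * n
    for _ in range(max_iter):
        new_v = [
            min(
                COST[x][a]
                + beta * sum(p * v[y] for y, p in enumerate(TRANS[x][a]))
                for a in range(len(COST[x]))
            )
            for x in range(n)
        ]
        # Stop once successive iterates agree up to the tolerance.
        if max(abs(nv - ov) for nv, ov in zip(new_v, v)) < tol:
            return new_v
        v = new_v
    return v


def vanishing_discount(betas, ref_state=0):
    """For each discount factor, return (beta, (1 - beta) * V(ref_state), h),
    where h[x] = V(x) - V(ref_state) is the relative value function."""
    rows = []
    for beta in betas:
        v = discounted_value(beta)
        rho = (1.0 - beta) * v[ref_state]
        h = [vx - v[ref_state] for vx in v]
        rows.append((beta, rho, h))
    return rows


if __name__ == "__main__":
    for beta, rho, h in vanishing_discount([0.9, 0.99, 0.999]):
        print(f"beta={beta}: (1-beta)*V(0)={rho:.4f}, h={h}")
```

In this toy model the scaled values approach 2, which is the long run average cost of the stationary policy that always chooses action 0, while the relative values stay bounded. That boundedness is the property the approach relies on when passing to the limit.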