时 间:2025年10月17日(周五)9:30 – 11:30
线 上:腾讯会议ID:131-318-814 密码:705614
报告人:柏立华 南开大学教授
主持人:危佳钦 华东师范大学教授
摘 要:
This paper develops a novel entropy-regularized policy iteration algorithm (PIA)for solving the optimal dividend problem under the classical Compound-Poission risk model. Building on Howard’s classical PIA framework, we resolve longstanding barriers to policy iteration in dividend optimization: entropy regularization guarantees smooth PIA iterates, eliminating historical nonsmoothness obstacles;first-claim truncation transforms the governing integro-differential equation into an exactly solvable ODE system, overcoming spatial nonlocality; and boundedness arguments establish unique closed-form solutions without ad hoc boundary specifications. Furthermore, we prove uniform convergence of both value function sequences and associated policies – ensuring algorithmic stability under general compound Poisson dynamics. Finally, asymptotic analysis demonstrates consistency with classical theory: as λ → 0+, our regularized solutions converge to the discontinuous bang-bang strategy and its value function. Collectively, this work establishes the first provably convergent implementation of Howard’s policy iteration algorithm (PIA) for Compound-Poission dividend models, resolving the tripartite challenges of nonsmoothness, nonlocality, and nonlinearity while preserving compatibility with classical control theory through vanishing entropy regularization.
报告人简介:
柏立华,理学博士,南开大学数学科学学院教授、博士生导师。入选教育部新世纪优秀人才支持计划、天津市青年拔尖人才支持计划、天津市创新人才推进计划青年科技秀人才。获全国优秀博士学位论文提名奖、天津市数学会青年学术奖一等奖。其主要研究方向包括随机过程、随机控制、精算数学、金融数学等。目前已经在Annals of Applied Probability、SIAM J. Control Optim.、Finance Stoch.、Stoch. Proc. Appl.、Bernoulli、J optim. theory appl、App Math Optima、Quant. Finance、Scand. Actuarial J.、Insurance: Math. Econ.等主流期刊发表论文20余篇。