10月17日 | 柏立华：The Howard's Policy Iteration and Convergence for Optimal Dividend under Compound-Poission Model

时间：2025年10月17日（周五）9：30 – 11：30

线上：腾讯会议ID:131-318-814 密码:705614

报告人：柏立华南开大学教授

主持人：危佳钦华东师范大学教授

摘要：

This paper develops a novel entropy-regularized policy iteration algorithm (PIA)for solving the optimal dividend problem under the classical Compound-Poission risk model. Building on Howard’s classical PIA framework, we resolve longstanding barriers to policy iteration in dividend optimization: entropy regularization guarantees smooth PIA iterates, eliminating historical nonsmoothness obstacles;first-claim truncation transforms the governing integro-differential equation into an exactly solvable ODE system, overcoming spatial nonlocality; and boundedness arguments establish unique closed-form solutions without ad hoc boundary specifications. Furthermore, we prove uniform convergence of both value function sequences and associated policies – ensuring algorithmic stability under general compound Poisson dynamics. Finally, asymptotic analysis demonstrates consistency with classical theory: as λ → 0+, our regularized solutions converge to the discontinuous bang-bang strategy and its value function. Collectively, this work establishes the first provably convergent implementation of Howard’s policy iteration algorithm (PIA) for Compound-Poission dividend models, resolving the tripartite challenges of nonsmoothness, nonlocality, and nonlinearity while preserving compatibility with classical control theory through vanishing entropy regularization.

报告人简介：

柏立华，理学博士，南开大学数学科学学院教授、博士生导师。入选教育部新世纪优秀人才支持计划、天津市青年拔尖人才支持计划、天津市创新人才推进计划青年科技秀人才。获全国优秀博士学位论文提名奖、天津市数学会青年学术奖一等奖。其主要研究方向包括随机过程、随机控制、精算数学、金融数学等。目前已经在Annals of Applied Probability、SIAM J. Control Optim.、Finance Stoch.、Stoch. Proc. Appl.、Bernoulli、J optim. theory appl、App Math Optima、Quant. Finance、Scand. Actuarial J.、Insurance: Math. Econ.等主流期刊发表论文20余篇。

发布者：张瑛发布时间：2025-10-11浏览次数：10