A first-order condition for discrete-time distribution steering
Abstract
We study a class of distribution-steering problems from a variational point of view. Under some differentiability assumptions, we derive necessary conditions for optimal Markov policies in the spirit of the Lagrange multiplier approach. We also provide a heuristic gradient-based method derived from the variational principle.