Matrix conditioning and adaptive simultaneous perturbation stochastic approximation method

Xun Zhu
2001 Proceedings of the 2001 American Control Conference. (Cat. No.01CH37148)  
This paper proposes a modification to the simultaneous perturbation stochastic approximation (SPSA) methods based on the comparisons made between the first order and the second order SPSA (1SPSA and 2SPSA) algorithms from the perspective of loss function Hessian. At finite iterations, the convergence rate depends on the matrix conditioning of the loss function Hessian. It is shown that 2SPSA converges more slowly for a loss function with an ill-conditioned Hessian than the one with a
more » ... ioned Hessian. On the other hand, the convergence rate of 1SPSA is less sensitive to the matrix conditioning of loss function Hessians. The modified 2SPSA (M2SPSA) eliminates the error amplification caused by the inversion of an ill-conditioned Hessian at finite iterations which leads to significant improvements in its convergence rate in problems with an ill-conditioned Hessian matrix. Asymptotically, the efficiency analysis shows that M2SPSA is also superior to 2SPSA in terms of their convergence rate coefficients. It is shown that for the same asymptotic convergence rate, the ratio of the mean square errors for M2SPSA to 2SPSA is always less than one except for a perfectly conditioned Hessian.
doi:10.1109/acc.2001.945918 fatcat:fl3gzjj6qjeb3l7b2fw4svubzu