Monday, April 21, 2025 10:00AM

Master's Thesis Defense

 

Peter Lehmann

(Advisor: Prof. Theodorou)

 

Maximum Entropy Sequential Quadratic Programming

 

Monday, April 21 

10:00 a.m. 

Coda C1315 Grant Park 756 W Peachtree St NW

Abstract 

Trajectory optimization (TO) plays a critical role across a broad spectrum of scientific and engineering fields, including robotics, energy and power systems, economics, and biomechanics. Through the lens of optimal control, TO often involves solving constrained optimization problems with non-convex objective function, nonlinear state and actuation constraints, and nonlinear dynamics. A widely used technique to solve such problems is Sequential Quadratic Programming (SQP). This method iteratively solves a series of quadratic subproblems, each of which relies on first- and second-order approximations of the constraints and cost around a nominal trajectory. However, since each subproblem only captures local information, SQP can suffer from converging to local minima. To overcome this limitation, this thesis proposes Maximum Entropy Sequential Quadratic Programming (MESQP). The proposed method leverages a stochastic policy and introduces an entropy-based regularization into the objective function. This regularization encourages exploration during optimization and enables the formulation of MESQP under both unimodal and multimodal policy representations. By embedding this stochastic policy into the SQP framework, the resulting MESQP algorithm creates a scheme of alternating optimization and sampling steps, yielding the ability to overcome local minima. To evaluate the method’s efficacy, this framework is compared experimentally with regular SQP and Maximum Entropy Differential Dynamic Programming (MEDDP) across several TO tasks.

Committee

· Prof. Evangelos Theodorou – School of Aerospace Engineering (advisor)

· Prof. Kyriakos Vamvoudakis – School of Aerospace Engineering

· Prof. Lu Gan – School of Aerospace Engineering