Enrique Mallada
Johns Hopkins University
Talk On the Inductive Bias for Learning in Nonlinear Control: Trade-offs and Guarantees
Abstract
Reliable data-driven control must provide closed-loop guarantees—on stability, performance, safety—by generalizing across an entire domain from finite samples of the dynamics. In learning theory, this is usually achieved via the introduction of an inductive bias, that is, a set of structural assumptions placed on the problem to connect sampled and unsampled data. While inductive biases for classification and regression problems have been widely studied and their performance is well understood, much less is known for control tasks. This raises a central question: which inductive bias enables efficient nonlinear control with rigorous guarantees on stability, safety, and optimality?
For Lipschitz continuous vector fields, a common assumption (or inductive bias) in control, we construct behavioral guarantees by combining local improvement conditions—integral Lyapunov-like conditions or Bellman inequalities—with coverage arguments over the state space that render such behavior recurrent. This viewpoint enables data-driven verification, but also inspires a novel class of nonparametric controllers, called here chain policies, which are akin to action chunking but with variable duration, and compose a sequence of locally verified controls (a chain) into globally valid certifiable policies. We apply these ideas to data-driven stabilization and to the acceleration of model predictive control, where performance can be systematically traded for reduced data requirements.
Notably, this Lipschitz viewpoint, while flexible, is very conservative: its worst-case bounds still require dense coverage of the state space, a demand that scales poorly with state dimension. To overcome this limitation, we turn to Hamiltonian dynamics, which offer a structurally different inductive bias based on energy and volume conservation. These conservation laws imply, via the Poincaré recurrence theorem, that every region visited by a trajectory is revisited infinitely often, providing vast opportunities for generalization. This allows us to construct chain policies for target reachability from remarkably small datasets.
Bio
Enrique Mallada is an Associate Professor of Electrical and Computer Engineering at Johns Hopkins University, where he has been a faculty member since 2016. He received his Ph.D. in Electrical and Computer Engineering with a minor in Applied Mathematics from Cornell University and a Telecommunications Engineering degree from ORT University, Uruguay. Before joining Hopkins, he was a Postdoctoral Fellow at Caltech’s Center for the Mathematics of Information. His honors include the Johns Hopkins Alumni Association Teaching Award (2021), NSF CAREER Award (2018), Caltech’s CMI Fellowship (2014), and Cornell ECE Director’s Thesis Award (2014). His research spans control, dynamical systems, and optimization, with applications to safety-critical systems, networks, and power grids.