# A hybrid approach of physical laws and data-driven modeling for estimation: the example of queuing networks

### Aude Hofleitner

###
EECS Department

University of California, Berkeley

Technical Report No. UCB/EECS-2013-87

May 17, 2013

### http://www.eecs.berkeley.edu/Pubs/TechRpts/2013/EECS-2013-87.pdf

Mathematical models are a mathematical abstraction of the physical reality which is of great importance to understand the behavior of a system, make estimations and predictions and so on. They range from models based on physical laws to models learned empirically, as measurements are collected, and referred to as data-driven models. A model is based on a series of choices which influence its complexity and realism. These choices represent trade-offs between different competing objectives including interpretability, scalability, accuracy, adequation to the available data, robustness or computational complexity. The thesis investigates the advantages and disadvantages of models based on physical laws versus data-driven models through the example of signalized queuing networks such as urban transportation networks.

The dynamics of conservation flow networks are accurately represented by a first order partial differential equation. Using Hamilton-Jacobi theory, the thesis underlines the importance to leverage physical laws to reconstruct missing information (e.g. signal or bottleneck characteristics) and estimate the state of the network at any time and location. Noise and uncertainty in the measurements can be integrated in the model. When measurements are sparse, the state of the network cannot be estimated at every time and location on the network. Instead, the thesis shows how to leverage other characteristics, such as periodicity. From deterministic dynamics, the thesis derives the probability distribution functions of physical entities (e.g. waiting time, density) by marginalizing the periodic variable. Using a Dynamic Bayesian Network formulation and exploiting the convexity structure of the system, the thesis shows how this modeling leads to realistic estimations and predictions, even when little measurements are available. Finally, the thesis investigates how sparse modeling and dimensionality reduction can provide insights on the large scale behavior of the network. Large scale dynamics and patterns are hard to model accurately based on physical laws. They can be discovered through data mining algorithms and integrated into physical models.

**Advisor:** Alexandre Bayen and Pieter Abbeel

BibTeX citation:

@phdthesis{Hofleitner:EECS-2013-87, Author = {Hofleitner, Aude}, Title = {A hybrid approach of physical laws and data-driven modeling for estimation: the example of queuing networks}, School = {EECS Department, University of California, Berkeley}, Year = {2013}, Month = {May}, URL = {http://www.eecs.berkeley.edu/Pubs/TechRpts/2013/EECS-2013-87.html}, Number = {UCB/EECS-2013-87}, Abstract = {Mathematical models are a mathematical abstraction of the physical reality which is of great importance to understand the behavior of a system, make estimations and predictions and so on. They range from models based on physical laws to models learned empirically, as measurements are collected, and referred to as data-driven models. A model is based on a series of choices which influence its complexity and realism. These choices represent trade-offs between different competing objectives including interpretability, scalability, accuracy, adequation to the available data, robustness or computational complexity. The thesis investigates the advantages and disadvantages of models based on physical laws versus data-driven models through the example of signalized queuing networks such as urban transportation networks. The dynamics of conservation flow networks are accurately represented by a first order partial differential equation. Using Hamilton-Jacobi theory, the thesis underlines the importance to leverage physical laws to reconstruct missing information (e.g. signal or bottleneck characteristics) and estimate the state of the network at any time and location. Noise and uncertainty in the measurements can be integrated in the model. When measurements are sparse, the state of the network cannot be estimated at every time and location on the network. Instead, the thesis shows how to leverage other characteristics, such as periodicity. From deterministic dynamics, the thesis derives the probability distribution functions of physical entities (e.g. waiting time, density) by marginalizing the periodic variable. Using a Dynamic Bayesian Network formulation and exploiting the convexity structure of the system, the thesis shows how this modeling leads to realistic estimations and predictions, even when little measurements are available. Finally, the thesis investigates how sparse modeling and dimensionality reduction can provide insights on the large scale behavior of the network. Large scale dynamics and patterns are hard to model accurately based on physical laws. They can be discovered through data mining algorithms and integrated into physical models.} }

EndNote citation:

%0 Thesis %A Hofleitner, Aude %T A hybrid approach of physical laws and data-driven modeling for estimation: the example of queuing networks %I EECS Department, University of California, Berkeley %D 2013 %8 May 17 %@ UCB/EECS-2013-87 %U http://www.eecs.berkeley.edu/Pubs/TechRpts/2013/EECS-2013-87.html %F Hofleitner:EECS-2013-87