Skip to main content
U.S. flag

An official website of the United States government

Publication Date
27 November 2020

New Study Demonstrates a Particular Method of Coupling Data-Driven Hypothesis with Process-Based Hydrologic Modeling Capabilities

Print / PDF
Powerpoint Slide
Science

Some machine learning (ML) methods such as classification trees are useful tools to generate hypotheses about how hydrologic systems function. However, data limitations dictate that ML alone often cannot differentiate between causal and associative relationships. Typical causal analysis based on process-based models (PBMs) is inefficient and susceptible to human bias.

Impact

We demonstrate a more efficient and objective analysis procedure where ML is first applied to generate data-consistent hypotheses, and then a PBM (CLM-PAWS) is invoked to verify these hypotheses. We employed a surface-subsurface processes model and conducted perturbation experiments to implement these competing hypotheses and assess the impacts of the changes. The experimental results strongly support the soil thickness hypothesis as opposed to the terrain slope and soil texture ones, which are co-varying and coincidental factors. Thicker soil permits larger saturation excess and longer system memory that carries wet season water storage to influence dry season baseflows.

Summary

We further suggest this analysis could be formulated into a data-centric Bayesian framework. This study demonstrates that PBM presents indispensable value for problems that ML cannot solve alone, and is meant to encourage more synergies between ML and PBM in the future.

Point of Contact
Wen-Ping Tsai
Institution(s)
Pennsylvania State University
Funding Program Area(s)
Project(s)
Publication