When Machine Learning Objectives Compete for Improved Subseasonal Bias Correction, Who Wins?

PRESENTERS:
Authors

Lead Presenter

Co-Author

Abstract

In supervised machine learning (ML), a model is optimized against a predefined objective function using labeled data. For example, a model may be trained to minimize a measure of error between its predictions and the corresponding observations. In real-world applications, however, multiple objectives may be needed for an ML model to be useful or to add value to existing physics-based approaches. These objectives sometimes compete, so that better performance on one must be traded against performance on the other(s). In such multi-objective problems, the set of solutions for which no objective can be improved without degrading at least one other is referred to as the Pareto frontier.
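As a minimal sketch of the Pareto frontier idea, the snippet below filters a handful of candidate models down to the non-dominated ones. The two objectives (an error score and a sharpness-error score, both to be minimized), the candidate values, and the function names are hypothetical and chosen only for illustration; they are not results from this work.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, ...]


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """Return True if `a` is no worse than `b` in every objective and
    strictly better in at least one (all objectives are minimized)."""
    no_worse = all(x <= y for x, y in zip(a, b))
    strictly_better = any(x < y for x, y in zip(a, b))
    return no_worse and strictly_better


def pareto_front(points: List[Point]) -> List[Point]:
    """Keep only the points that no other point dominates."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


# Hypothetical (error, sharpness_error) pairs for five candidate models.
candidates = [
    (0.30, 0.90),
    (0.35, 0.60),
    (0.50, 0.40),
    (0.45, 0.70),  # dominated by (0.35, 0.60)
    (0.60, 0.45),  # dominated by (0.50, 0.40)
]

front = pareto_front(candidates)
print(front)
```

Each model left on the front is a different trade-off: moving along it lowers one score only by raising the other.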

Earth system science is a field rich with multi-objective problems. One example is the trade-off between ML model interpretability and ML model complexity, where more complex models are potentially more skillful but less interpretable. In this work, we focus on offline bias correction of subseasonal temperature and precipitation forecasts created using the Community Earth System Model version 2 (CESM2) configured for initialized prediction. We performed the bias correction under several objectives that were at times in competition with each other. These objectives included improving the skill of temperature or precipitation over land, improving the skill of temperature or precipitation globally, and improving the representation of sharpness in temperature or precipitation fields, which are at times overly smoothed in coarse-resolution Earth system model simulations and in ML-based model output. An extensive hyperparameter grid search was conducted to identify image-to-image ML models that performed skillfully across these metrics. The search used the Earth Computing Hyperparameter Optimization (ECHO) software, a distributed hyperparameter optimization package built with Optuna (a commonly used package for ML model optimization). Several models were identified along a Pareto frontier, where improved performance in one metric could be achieved only at the cost of reduced skill in others.
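To illustrate why skill and sharpness can compete, the toy example below compares a sharp but displaced forecast with a fully smoothed one against a one-dimensional "observed" field. The metric definitions (root-mean-square error for skill, mean absolute difference between neighboring points for sharpness) and all values are assumptions made for this sketch, not the metrics or data used in this work.

```python
import math
from typing import Sequence


def rmse(forecast: Sequence[float], observed: Sequence[float]) -> float:
    """Root-mean-square error between a forecast and the observations."""
    squared = [(f - o) ** 2 for f, o in zip(forecast, observed)]
    return math.sqrt(sum(squared) / len(squared))


def sharpness(field: Sequence[float]) -> float:
    """Mean absolute difference between neighboring points
    (an assumed, simple proxy for sharpness)."""
    diffs = [abs(b - a) for a, b in zip(field, field[1:])]
    return sum(diffs) / len(diffs)


# Hypothetical 1-D fields: the observations alternate sharply.
observed = [0.0, 4.0, 0.0, 4.0, 0.0, 4.0]
displaced = [4.0, 0.0, 4.0, 0.0, 4.0, 0.0]  # right sharpness, wrong location
smoothed = [2.0, 2.0, 2.0, 2.0, 2.0, 2.0]   # all spatial detail removed

for name, field in [("displaced", displaced), ("smoothed", smoothed)]:
    print(name, "rmse:", rmse(field, observed), "sharpness:", sharpness(field))
print("observed sharpness:", sharpness(observed))
```

In this toy case the smoothed forecast has the lower error but none of the observed sharpness, while the displaced forecast matches the observed sharpness at a higher error, so neither one is better on both measures.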

Category
Innovative and Emerging technologies: ML/AI, Digital Earth, Exascale and Quantum Computing, advanced software infrastructures
Funding Program Area(s)