Extended Verification and Validation for Earth System Models
evv4esm is a Python-based toolkit for extended verification and validation of Earth system models (ESMs). Currently, it provides a number of tests to determine if modifications either to the code or parameters of an ESM alter its simulated climate.
Test descriptions
- Multi-variate Kolmogorov-Smirnov, MVK: Tests the null hypothesis that the reference (n) and modified (m) model Short Independent Simulation Ensembles (SISE) represent the same climate state, based on the equality of distribution of each variable's annual global average in the standard monthly model output between the two simulations.
- Time Step Convergence, TSC: Tests the null hypothesis that the convergence of the time stepping error for a set of key atmospheric variables is the same for a reference ensemble and a test ensemble. Both the reference and test ensemble are generated with a two-second time step, and for each variable the RMSD between each ensemble and a truth ensemble, generated with a one-second time step, is calculated. RMSD is calculated globally and over two domains, the land and the ocean. The land/ocean domains contain just the atmosphere points that are over land/ocean cells.
- Perturbation Growth, PGN: Tests the null hypothesis that the reference (n) and modified (m) model ensembles represent the same atmospheric state after each physics parameterization is applied within a single time-step using the two-sample (n and m) T-test for equal averages at a 95% confidence level. Ensembles are generated by repeating the simulation for many initial conditions, with each initial condition subject to multiple perturbations.
Kolmogorov-Smirnov test
|
Test status |
Variables analyzed |
Rejecting |
Critical value |
Ensembles |
---|
mvk_bit-for-bit |
|
pass |
118 |
0 |
13 |
statistically identical |
mvk_zmconv_c0_ocn_045 |
|
fail |
118 |
94 |
13 |
statistically different |
Perturbation growth test
|
Test status |
Null hypothesis |
T test (t, p) |
Ensembles |
---|
pge_bit-for-bit |
|
pass |
accept |
(0.000, 1.000) |
statistically identical |
pge_zmconv_c0_ocn_045 |
|
fail |
reject |
(-31.721, 0.000) |
statistically different |
Time step convergence test
|
Test status |
Global |
Land |
Ocean |
Ensembles |
---|
tsc_bit-for-bit |
|
Pass |
bit-for-bit |
bit-for-bit |
bit-for-bit |
statistically identical |
tsc_zmconv_c0_ocn_045 |
|
Fail |
Fail |
Fail |
Fail |
statistically different |