Common errors and debugging
Common errors
PermissionError
: typically appears when you have two model runs in progress at the same time that are using the
same netCDF file for the input/output steps. Ensure that a) you only have one MONARCHS process running on a particular
set of files and b) that if running multiple runs at the same time your filepaths are correct. It can also occur if you
have the netCDF file open for e.g. plotting data.
You can try setting the environment variable HDF5_USE_FILE_LOCKING=FALSE
(using export
on Linux or set
on Windows)
and try again to see if this resolves the issue. This is however not recommended for doing anything other than debugging
since you may have two processes writing to the same NetCDF file in this case.
ValueError
: can happen if the initial conditions are ill-posed. Typically, this is because enough firn has melted
that the amount of melt occurring in a timestep is higher than the vertical resolution. While MONARCHS has ways of
handling this, if the firn depth is small then this can cause the column to melt very quickly.
It can also arise if the solid or liquid fraction in a cell goes above 1. It may be worth setting firn_min_height
to a higher value in the model setup script, and using min_height_handler = 'extend'
to ensure that the model
minimum firn height is not too small.
ImportError
: One of the required modules is missing. If this is NumbaMinpack
, then likely your system does not have
CMake
and a compatible Fortran compiler installed. You can still run the model however using use_numba = False
,
in which case this ImportError
should disappear. If you are running with use_numba = False
and still get
ImportError
on NumbaMinpack, please submit an Issue on GitHub..
It may also occur when trying to load in a DEM.
Debugging
When using the Python debugger, you need to ensure that you run with use_numba = False
and parallel == False
.
I like to take advantage of the fact that the model setup script is a Python script and add in an extra variable
debug_mode
, which when set to True
sets via an if
block use_numba
, parallel
, use_mpi
and
reload_from_dump
to False
/False
/False
/True
respectively , so that I can speedily load in the
state immediately before the model crashes and run it in a debugger to diagnose the problem.
If an error happens during the lateral movement step, and you don’t want to run the entire single-column phase in
serial so that you can debug the lateral bit that happens at the end, you can add the dump_data_pre_lateral_movement
flag and set it to True
in model_setup
. This will create a dump file directly after the single column physics
finishes, rather than at the end of the model day.
A possible workflow if you see a lateral movement error might be to:
- restart the run in parallel with dump_data_pre_lateral_movement = True
- wait for it to finish, then re-run in serial with a debugger loading in from this dump file, ensuring that
single_column_toggle
is set to False
so the model goes straight to the lateral bit you want to debug.
scripts/debug_lateral_flow.py
shows an example of this, where a
dump file is loaded in and the lateral movement step is run outside of MONARCHS, which allows for quick debugging.