Common errors and debugging

Common errors

PermissionError : typically appears when you have two model runs in progress at the same time that are using the same netCDF file for the input/output steps. Ensure that a) you only have one MONARCHS process running on a particular set of files and b) that if running multiple runs at the same time your filepaths are correct. It can also occur if you have the netCDF file open for e.g. plotting data.

You can try setting the environment variable HDF5_USE_FILE_LOCKING=FALSE (using export on Linux or set on Windows) and try again to see if this resolves the issue. This is however not recommended for doing anything other than debugging since you may have two processes writing to the same NetCDF file in this case.

ValueError : can happen if the initial conditions are ill-posed. Typically, this is because enough firn has melted that the amount of melt occurring in a timestep is higher than the vertical resolution. While MONARCHS has ways of handling this, if the firn depth is small then this can cause the column to melt very quickly. It can also arise if the solid or liquid fraction in a cell goes above 1. It may be worth setting firn_min_height to a higher value in the model setup script, and using min_height_handler = 'extend' to ensure that the model minimum firn height is not too small.

ImportError : One of the required modules is missing. If this is NumbaMinpack, then likely your system does not have CMake and a compatible Fortran compiler installed. You can still run the model however using use_numba = False, in which case this ImportError should disappear. If you are running with use_numba = False and still get ImportError on NumbaMinpack, please submit an Issue on GitHub..

It may also occur when trying to load in a DEM.

Debugging

When using the Python debugger, you need to ensure that you run with use_numba = False and parallel == False. I like to take advantage of the fact that the model setup script is a Python script and add in an extra variable debug_mode, which when set to True sets via an if block use_numba, parallel, use_mpi and reload_from_dump to False/False/False/True respectively , so that I can speedily load in the state immediately before the model crashes and run it in a debugger to diagnose the problem.

If an error happens during the lateral movement step, and you don’t want to run the entire single-column phase in serial so that you can debug the lateral bit that happens at the end, you can add the dump_data_pre_lateral_movement flag and set it to True in model_setup. This will create a dump file directly after the single column physics finishes, rather than at the end of the model day.

A possible workflow if you see a lateral movement error might be to: - restart the run in parallel with dump_data_pre_lateral_movement = True - wait for it to finish, then re-run in serial with a debugger loading in from this dump file, ensuring that single_column_toggle is set to False so the model goes straight to the lateral bit you want to debug. scripts/debug_lateral_flow.py shows an example of this, where a dump file is loaded in and the lateral movement step is run outside of MONARCHS, which allows for quick debugging.