[ecco-support] [EXTERNAL] Restarts not matching control simulation

Zhang, Hong (US 398K) hong.zhang at jpl.nasa.gov
Wed Jul 17 15:36:30 EDT 2024



> On Jul 16, 2024, at 9:38 AM, Matthew Mazloff <mmazloff at ucsd.edu> wrote:
> 
> Hi Max
> 
> It may be chaos originating from roundoff differences when using different sets of machines in parallel computations. 
> Do they differ on the first day of simulation or does it takes a few weeks/months before you notice it?
> 
> Matt
> 
Hi Max,
Matt's’ comments make sense.
According to our experience,
using different types of node will cause 6-th or 7-th digit difference in output, like
https://github.com/MITgcm-contrib/ecco_darwin/blob/c0767f818c7ebcea9a226d0b8d2ab412dc2b5b61/v04/llc270_JAMES_paper/input/job_ECCO_darwin#L2
vs
https://github.com/MITgcm-contrib/ecco_darwin/blob/c0767f818c7ebcea9a226d0b8d2ab412dc2b5b61/v04/llc270_JAMES_paper/input/job_ECCO_darwin#L3
due to the compilation  w/ “O2” flag
https://github.com/MITgcm/MITgcm/blob/acacc28f7fa33925e71abf11d29f2fedab031e4e/tools/build_options/linux_amd64_ifort%2Bmpi_ice_nas#L46

Curious to know which ECCO-Darwin version you used
https://github.com/MITgcm-contrib/ecco_darwin/tree/master/v04/llc270_JAMES_paper
or 
https://github.com/MITgcm-contrib/ecco_darwin/tree/master/v05/llc270
Maybe we can take a further look.

Hong



More information about the ecco-support mailing list