Is anyone using xr.open_mfdataset() on the cgd machine? It doesn't work for me, failing with an error I haven't seen before:
distributed.protocol.core - CRITICAL - Failed to deserialize
this happens whenever / however I pass more than one files, either with a wildcard or when passing a list of files.
I made sure that my dask / distributed / xarray versions are the same as the ones I use on casper when running this.
any ideas?
@Anna-Lena Deppenmeier,
What version of
- xarray
- dask
- distributed
are you using?
What is the output of client.get_versions(check=True)
?
distrubuted version: 2.7.0
dask version: 2.7.0
xarray version: 0.14.1
{'scheduler': {'host': (('python', '3.7.3.final.0'),
('python-bits', 64),
('OS', 'Linux'),
('OS-release', '3.10.0-1062.12.1.el7.x86_64'),
('machine', 'x86_64'),
('processor', 'x86_64'),
('byteorder', 'little'),
('LC_ALL', 'None'),
('LANG', 'en_US.UTF-8'),
('LOCALE', 'en_US.UTF-8')),
'packages': {'required': (('dask', '2.7.0'),
('distributed', '2.7.0'),
('msgpack', '1.0.0'),
('cloudpickle', '1.3.0'),
('tornado', '5.1.1'),
('toolz', '0.10.0')),
'optional': (('numpy', '1.16.4'),
('pandas', '0.25.0'),
('bokeh', '1.3.1'),
('lz4', '2.1.9'),
('dask_ml', None),
('blosc', '1.8.1'))}},
'workers': {'tcp://127.0.0.1:32838': {'host': (('python', '3.7.3.final.0'),
('python-bits', 64),
('OS', 'Linux'),
('OS-release', '3.10.0-1062.12.1.el7.x86_64'),
('machine', 'x86_64'),
('processor', 'x86_64'),
('byteorder', 'little'),
('LC_ALL', 'None'),
('LANG', 'en_US.UTF-8'),
('LOCALE', 'en_US.UTF-8')),
'packages': {'required': (('dask', '2.7.0'),
('distributed', '2.7.0'),
('msgpack', '1.0.0'),
('cloudpickle', '1.3.0'),
('tornado', '5.1.1'),
('toolz', '0.10.0')),
'optional': (('numpy', '1.16.4'),
('pandas', '0.25.0'),
('bokeh', '1.3.1'),
('lz4', '2.1.9'),
('dask_ml', None),
('blosc', '1.8.1'))}},
'tcp://127.0.0.1:36557': {'host': (('python', '3.7.3.final.0'),
('python-bits', 64),
('OS', 'Linux'),
('OS-release', '3.10.0-1062.12.1.el7.x86_64'),
('machine', 'x86_64'),
('processor', 'x86_64'),
('byteorder', 'little'),
('LC_ALL', 'None'),
('LANG', 'en_US.UTF-8'),
('LOCALE', 'en_US.UTF-8')),
'packages': {'required': (('dask', '2.7.0'),
('distributed', '2.7.0'),
('msgpack', '1.0.0'),
('cloudpickle', '1.3.0'),
('tornado', '5.1.1'),
('toolz', '0.10.0')),
'optional': (('numpy', '1.16.4'),
('pandas', '0.25.0'),
('bokeh', '1.3.1'),
('lz4', '2.1.9'),
('dask_ml', None),
('blosc', '1.8.1'))}},
'tcp://127.0.0.1:37239': {'host': (('python', '3.7.3.final.0'),
('python-bits', 64),
('OS', 'Linux'),
('OS-release', '3.10.0-1062.12.1.el7.x86_64'),
('machine', 'x86_64'),
('processor', 'x86_64'),
('byteorder', 'little'),
('LC_ALL', 'None'),
('LANG', 'en_US.UTF-8'),
('LOCALE', 'en_US.UTF-8')),
'packages': {'required': (('dask', '2.7.0'),
('distributed', '2.7.0'),
('msgpack', '1.0.0'),
('cloudpickle', '1.3.0'),
('tornado', '5.1.1'),
('toolz', '0.10.0')),
'optional': (('numpy', '1.16.4'),
('pandas', '0.25.0'),
('bokeh', '1.3.1'),
('lz4', '2.1.9'),
('dask_ml', None),
('blosc', '1.8.1'))}},
'tcp://127.0.0.1:37465': {'host': (('python', '3.7.3.final.0'),
('python-bits', 64),
('OS', 'Linux'),
('OS-release', '3.10.0-1062.12.1.el7.x86_64'),
('machine', 'x86_64'),
('processor', 'x86_64'),
('byteorder', 'little'),
('LC_ALL', 'None'),
('LANG', 'en_US.UTF-8'),
('LOCALE', 'en_US.UTF-8')),
'packages': {'required': (('dask', '2.7.0'),
('distributed', '2.7.0'),
('msgpack', '1.0.0'),
('cloudpickle', '1.3.0'),
('tornado', '5.1.1'),
('toolz', '0.10.0')),
'optional': (('numpy', '1.16.4'),
('pandas', '0.25.0'),
('bokeh', '1.3.1'),
('lz4', '2.1.9'),
('dask_ml', None),
('blosc', '1.8.1'))}},
'tcp://127.0.0.1:37751': {'host': (('python', '3.7.3.final.0'),
('python-bits', 64),
('OS', 'Linux'),
('OS-release', '3.10.0-1062.12.1.el7.x86_64'),
('machine', 'x86_64'),
('processor', 'x86_64'),
('byteorder', 'little'),
('LC_ALL', 'None'),
('LANG', 'en_US.UTF-8'),
('LOCALE', 'en_US.UTF-8')),
'packages': {'required': (('dask', '2.7.0'),
('distributed', '2.7.0'),
('msgpack', '1.0.0'),
('cloudpickle', '1.3.0'),
('tornado', '5.1.1'),
('toolz', '0.10.0')),
'optional': (('numpy', '1.16.4'),
('pandas', '0.25.0'),
('bokeh', '1.3.1'),
('lz4', '2.1.9'),
('dask_ml', None),
('blosc', '1.8.1'))}},
'tcp://127.0.0.1:38280': {'host': (('python', '3.7.3.final.0'),
('python-bits', 64),
('OS', 'Linux'),
('OS-release', '3.10.0-1062.12.1.el7.x86_64'),
('machine', 'x86_64'),
('processor', 'x86_64'),
('byteorder', 'little'),
('LC_ALL', 'None'),
('LANG', 'en_US.UTF-8'),
('LOCALE', 'en_US.UTF-8')),
'packages': {'required': (('dask', '2.7.0'),
('distributed', '2.7.0'),
('msgpack', '1.0.0'),
('cloudpickle', '1.3.0'),
('tornado', '5.1.1'),
('toolz', '0.10.0')),
'optional': (('numpy', '1.16.4'),
('pandas', '0.25.0'),
('bokeh', '1.3.1'),
('lz4', '2.1.9'),
('dask_ml', None),
('blosc', '1.8.1'))}}},
'client': {'host': [('python', '3.7.3.final.0'),
('python-bits', 64),
('OS', 'Linux'),
('OS-release', '3.10.0-1062.12.1.el7.x86_64'),
('machine', 'x86_64'),
('processor', 'x86_64'),
('byteorder', 'little'),
('LC_ALL', 'None'),
('LANG', 'en_US.UTF-8'),
('LOCALE', 'en_US.UTF-8')],
'packages': {'required': [('dask', '2.7.0'),
('distributed', '2.7.0'),
('msgpack', '1.0.0'),
('cloudpickle', '1.3.0'),
('tornado', '5.1.1'),
('toolz', '0.10.0')],
'optional': [('numpy', '1.16.4'),
('pandas', '0.25.0'),
('bokeh', '1.3.1'),
('lz4', '2.1.9'),
('dask_ml', None),
('blosc', '1.8.1')]}}}
Interesting... I see that you are on ('msgpack', '1.0.0')
. I suspect that this is where the issue is coming from ( a few weeks ago, this release of mspack had some incompatibility issue with dask)...
I recommend upgrading your dask/distributed (even xarray) to the latest versions
The latest versions of dask is 2.12.0
Another solution is to downgrade msgpack
to a version earlier than 1.0.0
OKay, I will try to update everything and see how it goes. The reason I had specified the versions was because there was some problem with a later dask or distributed version, I think 2.9, but I see that that is superseded by now.
Great! Let me know how it goes
Thanks @Anderson Banihirwe , it works!
Last updated: May 16 2025 at 17:14 UTC