Stream: python-questions
Topic: xarray on cgd system
Anna-Lena Deppenmeier (Mar 11 2020 at 19:31):
Is anyone using xr.open_mfdataset() on the cgd machine? It doesn't work for me, failing with an error I haven't seen before:
distributed.protocol.core - CRITICAL - Failed to deserialize
this happens whenever / however I pass more than one files, either with a wildcard or when passing a list of files.
I made sure that my dask / distributed / xarray versions are the same as the ones I use on casper when running this.
any ideas?
Anderson Banihirwe (Mar 11 2020 at 20:37):
@Anna-Lena Deppenmeier,
What version of
- xarray
- dask
- distributed
are you using?
Anderson Banihirwe (Mar 11 2020 at 20:40):
What is the output of client.get_versions(check=True)
?
Anna-Lena Deppenmeier (Mar 11 2020 at 20:43):
distrubuted version: 2.7.0
dask version: 2.7.0
xarray version: 0.14.1
Anna-Lena Deppenmeier (Mar 11 2020 at 20:43):
{'scheduler': {'host': (('python', '3.7.3.final.0'),
('python-bits', 64),
('OS', 'Linux'),
('OS-release', '3.10.0-1062.12.1.el7.x86_64'),
('machine', 'x86_64'),
('processor', 'x86_64'),
('byteorder', 'little'),
('LC_ALL', 'None'),
('LANG', 'en_US.UTF-8'),
('LOCALE', 'en_US.UTF-8')),
'packages': {'required': (('dask', '2.7.0'),
('distributed', '2.7.0'),
('msgpack', '1.0.0'),
('cloudpickle', '1.3.0'),
('tornado', '5.1.1'),
('toolz', '0.10.0')),
'optional': (('numpy', '1.16.4'),
('pandas', '0.25.0'),
('bokeh', '1.3.1'),
('lz4', '2.1.9'),
('dask_ml', None),
('blosc', '1.8.1'))}},
'workers': {'tcp://127.0.0.1:32838': {'host': (('python', '3.7.3.final.0'),
('python-bits', 64),
('OS', 'Linux'),
('OS-release', '3.10.0-1062.12.1.el7.x86_64'),
('machine', 'x86_64'),
('processor', 'x86_64'),
('byteorder', 'little'),
('LC_ALL', 'None'),
('LANG', 'en_US.UTF-8'),
('LOCALE', 'en_US.UTF-8')),
'packages': {'required': (('dask', '2.7.0'),
('distributed', '2.7.0'),
('msgpack', '1.0.0'),
('cloudpickle', '1.3.0'),
('tornado', '5.1.1'),
('toolz', '0.10.0')),
'optional': (('numpy', '1.16.4'),
('pandas', '0.25.0'),
('bokeh', '1.3.1'),
('lz4', '2.1.9'),
('dask_ml', None),
('blosc', '1.8.1'))}},
'tcp://127.0.0.1:36557': {'host': (('python', '3.7.3.final.0'),
('python-bits', 64),
('OS', 'Linux'),
('OS-release', '3.10.0-1062.12.1.el7.x86_64'),
('machine', 'x86_64'),
('processor', 'x86_64'),
('byteorder', 'little'),
('LC_ALL', 'None'),
('LANG', 'en_US.UTF-8'),
('LOCALE', 'en_US.UTF-8')),
'packages': {'required': (('dask', '2.7.0'),
('distributed', '2.7.0'),
('msgpack', '1.0.0'),
('cloudpickle', '1.3.0'),
('tornado', '5.1.1'),
('toolz', '0.10.0')),
'optional': (('numpy', '1.16.4'),
('pandas', '0.25.0'),
('bokeh', '1.3.1'),
('lz4', '2.1.9'),
('dask_ml', None),
('blosc', '1.8.1'))}},
'tcp://127.0.0.1:37239': {'host': (('python', '3.7.3.final.0'),
('python-bits', 64),
('OS', 'Linux'),
('OS-release', '3.10.0-1062.12.1.el7.x86_64'),
('machine', 'x86_64'),
('processor', 'x86_64'),
('byteorder', 'little'),
('LC_ALL', 'None'),
('LANG', 'en_US.UTF-8'),
('LOCALE', 'en_US.UTF-8')),
'packages': {'required': (('dask', '2.7.0'),
('distributed', '2.7.0'),
('msgpack', '1.0.0'),
('cloudpickle', '1.3.0'),
('tornado', '5.1.1'),
('toolz', '0.10.0')),
'optional': (('numpy', '1.16.4'),
('pandas', '0.25.0'),
('bokeh', '1.3.1'),
('lz4', '2.1.9'),
('dask_ml', None),
('blosc', '1.8.1'))}},
'tcp://127.0.0.1:37465': {'host': (('python', '3.7.3.final.0'),
('python-bits', 64),
('OS', 'Linux'),
('OS-release', '3.10.0-1062.12.1.el7.x86_64'),
('machine', 'x86_64'),
('processor', 'x86_64'),
('byteorder', 'little'),
('LC_ALL', 'None'),
('LANG', 'en_US.UTF-8'),
('LOCALE', 'en_US.UTF-8')),
'packages': {'required': (('dask', '2.7.0'),
('distributed', '2.7.0'),
('msgpack', '1.0.0'),
('cloudpickle', '1.3.0'),
('tornado', '5.1.1'),
('toolz', '0.10.0')),
'optional': (('numpy', '1.16.4'),
('pandas', '0.25.0'),
('bokeh', '1.3.1'),
('lz4', '2.1.9'),
('dask_ml', None),
('blosc', '1.8.1'))}},
'tcp://127.0.0.1:37751': {'host': (('python', '3.7.3.final.0'),
('python-bits', 64),
('OS', 'Linux'),
('OS-release', '3.10.0-1062.12.1.el7.x86_64'),
('machine', 'x86_64'),
('processor', 'x86_64'),
('byteorder', 'little'),
('LC_ALL', 'None'),
('LANG', 'en_US.UTF-8'),
('LOCALE', 'en_US.UTF-8')),
'packages': {'required': (('dask', '2.7.0'),
('distributed', '2.7.0'),
('msgpack', '1.0.0'),
('cloudpickle', '1.3.0'),
('tornado', '5.1.1'),
('toolz', '0.10.0')),
'optional': (('numpy', '1.16.4'),
('pandas', '0.25.0'),
('bokeh', '1.3.1'),
('lz4', '2.1.9'),
('dask_ml', None),
('blosc', '1.8.1'))}},
'tcp://127.0.0.1:38280': {'host': (('python', '3.7.3.final.0'),
('python-bits', 64),
('OS', 'Linux'),
('OS-release', '3.10.0-1062.12.1.el7.x86_64'),
('machine', 'x86_64'),
('processor', 'x86_64'),
('byteorder', 'little'),
('LC_ALL', 'None'),
('LANG', 'en_US.UTF-8'),
('LOCALE', 'en_US.UTF-8')),
'packages': {'required': (('dask', '2.7.0'),
('distributed', '2.7.0'),
('msgpack', '1.0.0'),
('cloudpickle', '1.3.0'),
('tornado', '5.1.1'),
('toolz', '0.10.0')),
'optional': (('numpy', '1.16.4'),
('pandas', '0.25.0'),
('bokeh', '1.3.1'),
('lz4', '2.1.9'),
('dask_ml', None),
('blosc', '1.8.1'))}}},
'client': {'host': [('python', '3.7.3.final.0'),
('python-bits', 64),
('OS', 'Linux'),
('OS-release', '3.10.0-1062.12.1.el7.x86_64'),
('machine', 'x86_64'),
('processor', 'x86_64'),
('byteorder', 'little'),
('LC_ALL', 'None'),
('LANG', 'en_US.UTF-8'),
('LOCALE', 'en_US.UTF-8')],
'packages': {'required': [('dask', '2.7.0'),
('distributed', '2.7.0'),
('msgpack', '1.0.0'),
('cloudpickle', '1.3.0'),
('tornado', '5.1.1'),
('toolz', '0.10.0')],
'optional': [('numpy', '1.16.4'),
('pandas', '0.25.0'),
('bokeh', '1.3.1'),
('lz4', '2.1.9'),
('dask_ml', None),
('blosc', '1.8.1')]}}}
Anderson Banihirwe (Mar 11 2020 at 20:51):
Interesting... I see that you are on ('msgpack', '1.0.0')
. I suspect that this is where the issue is coming from ( a few weeks ago, this release of mspack had some incompatibility issue with dask)...
Anderson Banihirwe (Mar 11 2020 at 20:52):
I recommend upgrading your dask/distributed (even xarray) to the latest versions
Anderson Banihirwe (Mar 11 2020 at 20:53):
The latest versions of dask is 2.12.0
Anderson Banihirwe (Mar 11 2020 at 20:54):
Another solution is to downgrade msgpack
to a version earlier than 1.0.0
Anna-Lena Deppenmeier (Mar 11 2020 at 20:55):
OKay, I will try to update everything and see how it goes. The reason I had specified the versions was because there was some problem with a later dask or distributed version, I think 2.9, but I see that that is superseded by now.
Anderson Banihirwe (Mar 11 2020 at 20:57):
Great! Let me know how it goes
Anna-Lena Deppenmeier (Mar 11 2020 at 21:06):
Thanks @Anderson Banihirwe , it works!
Last updated: Jan 30 2022 at 12:01 UTC