How to use the Pre-loaded datasets?

Hello, I encountered a problem when I first called the pre-loaded dataset in pyleoclim. It prompted a UnicodeDecodeError: ‘gbk’ codec can’t decode byte 0xb0 in position 3633: illegal multibyte sequence. How should I solve this?

Thanks for reporting this @zxc . Which pre-loaded dataset is triggering this error?

Hello, thank you for your reply. After importing the relevant modules according to the tutorial, I tried to run "pyleo.utils.available_dataset_names()"in Jupyter Notebook, but an error occurred during execution.

It behaves normally for me:

I’m guessing your environment may not be compatible with Pyleoclim. Can you try following these installation instructions to the letter? Also, please let us know what machine/OS you are using.

I am using the Windows operating system and followed the tutorial step by step, also installing the additional packages climlab,etc… The strange thing is that only these pre-loaded datasets fail to run, but the data from NOAA and PANGAEA in the tutorial can be imported and run without any issues.


It looks like your machine cannot parse the yaml file that describes those datasets, as it has non Unicode characters in expressions like “Foram bent δ18O [‰ PDB] (VPDB)”. I would look on StackOverflow for potential remedies. This is not an issue on other machines we know of.

If you can find that file on your computer (where the Pyleoclim folder is installed from PyPi), try to open it first with a regular text editor and then with PyYAM, it might tell you what to install on your machine to read it.

Thank you so much for your helpful suggestions! I’ll follow your advice to check the Pyleoclim folder files with a text editor first and then use PyYAML to identify any dependencies I need to install.