Publications
Here are the data sets, text corpora and publications we have produced and made available so far.
Text corpora and other data sets:
- UNESCO’s Standard-Setting Instruments
- Standard-setting instruments, 1945-2019
- This digital text corpus compiles the English-language texts of all Conventions, Declarations and Recommendations adopted by UNESCO’s General Conference (1945-2019).
- Available for download on our GitHub repository “Legal Instruments”.
- Standard-setting instruments, 1945-2019
- The UNESCO Courier:
- Data sets:
- Curated Courier article corpus, 1948-2020
- This corpus consists of the texts of all articles published in the English-language edition of The UNESCO Courier between 1948 and 2020, and includes a comprehensive curated metadata index (document_index.csv).
- Online at Zenodo.
- POS-tagged and DTM versions of the curated article corpus are available in the project GitHub release.
- Complete curated issue corpus, 1948-2020
- This corpus compiles the complete text of all Courier issues (English-language edition), 1948-2020.
- Online at Zenodo.
- Curated Courier article corpus, 1948-2020
- Analytical tools and Supplementary materials:
- Courier-Lab
- Courier-Lab allows you to explore the Courier text corpus through a variety of digital text analysis tools through a web-based Jupyter Notebook.
- Quality control data and supplementary material
- Available on project GitHub release.
- GitHub repository “Tagged Courier”
- The Tagged Courier repository contains working data from the curating process.
- Courier-Lab
- Data sets:
- Proceedings of the General Conference:
- Curated corpus of Proceedings of the General Conference, 1945-2017
- Online at Zenodo
- Curated corpus of Proceedings of the General Conference, 1945-2017
Project publications
B. Martin and F. Mohammadi Norén, “Nature and Culture in the Age of Environmental Crisis: Digital Analysis of a Global Debate in The UNESCO Courier, 1948-2020”, in A. Rockenberger, S. Gilbert and J. Tiemann, eds., DHNB2023 Conference Proceedings. Digital Humanities in the Nordic and Baltic Countries Publications 5, 1 (Oslo, 2023): 274-86. DOI: https://doi.org/10.5617/dhnbpub.10671
Benjamin G. Martin, Fredrik Mohammedi Norén, Roger Mähler, Andreas Marklund and Oriane Martin, “The Curated UNESCO Courier 1.0: Annotated Corpora for Digital Research in the Global Humanities,” Journal of Open Humanities Data, 10: 20, pp. 1–13. DOI: https://doi.org/10.5334/johd.181
Fredrik Mohammadi Norén, “Balancing contentious concepts: Ideas of ‘communication’ and ‘information’ in UNESCO’s magazine Courier 1948–2020”. Conference paper: NordMedia 2023 (16-18 August 2023), Berge, Norway. Länk: https://urn.kb.se/resolve?urn=urn:nbn:se:mau:diva-66032