categories.socialScienceWordPress

OpenCitations blog

OpenCitations blog
The blog of the OpenCitations Infrastructure
Home PageAtom FeedMastodon
language
Published
Author OpenCitations Team

As part of the Open Citations project, we have been asked to review and improve the process of importing data into the Open Citations Corpus, taking the scripts from the initial project as our starting point. The current import procedure evolved from several disconnected processes and requires running multiple command line scripts and transforming the data into different intermediate formats.

Published
Author OpenCitations Team

As part of our work on the Open Citations extensions project, I have recently been doing one of my favourite things – namely indexing large quantities of data then exploring it. On this project we are interested in the PubMed Central Open Access subset, and more specifically, we are interested in what we can do with the citation data contained within the records that are in that subset – because, as they are open access, that citation data is