Rogue Scholar

JISC Open Citations Project – Final Project Blog Post

Published July 1, 2011

**Executive summary** *Introduction* To general readers of this blog, this post will appear different from normal posts. Rather than being about a particular topic, it pulls together a summary of the work undertaken over the past year within the Open Citations Project supported by the JISC, and is primarily intended to assist JISC evaluation of the project and its outputs.

JISC, Open Citations, Citation, Citation Data, Jiscexpo, Social Science

JISC Open Citations Project web site

https://doi.org/10.59350/mca1v-w0123

Published July 1, 2011

Author David M. Shotton

The JISC Open Citations Project website at http://opencitations.net exists for several purposes: To hold introductory information about the JISC Open Citations Project. To hold contact information for the project team.

JISC, Ontologies, Open Citations, Semantic Publishing, Bibliography, Social Science

Nomenclature for citations and references

https://doi.org/10.59350/keh9k-qgh36

Published July 1, 2011

Author David M. Shotton

Reis et al. (2008) [1] cites an earlier paper from Albert Ko’s research group, Ko et al. (1999) [2]. In conventional parlance, as the following diagram shows, the word “reference” can mean either what is found in the text, what is found in the reference list, the act of citation, or the object of the citation itself, as in the sentence “All the references you will need to prepare for the journal club are on Kevin’s desk”.

JISC, Open Citations, Bibliography, Citation, Citation Data, Social Science

The citation processing pipeline and the Open Citations Corpus

https://doi.org/10.59350/jnbxe-dtq62

Published July 1, 2011

Author David M. Shotton

The input PubMed Central Open Access subset XML reference data, our starting corpus, were transformed into Open Citations RDF in multiple stages: The original XML was first transformed into an intermediate form using XSLT.

JISC, Open Citations, Bibliography, Citation, Citation Data, Social Science

Citation correction methods

https://doi.org/10.59350/yet6y-e9m21

Published July 1, 2011

Author David M. Shotton

As previously described, the PubMed Central Open Access subset of journal articles yielded 6,529,815 independent bibliographic records of both citing and cited entities, while our use of the PubMed Entrez API provided a further 2,304,143 bibliographic records for the same cited entities. Before converting these references into RDF to create the Open Citations Corpus, we attempted to remove errors in the data.

JISC, Open Citations, Bibliography, Citation, Citation Data, Social Science

Who wrote this paper? Author list problems in PubMed Central references

https://doi.org/10.59350/e4q4p-c2834

Published July 1, 2011

Author David M. Shotton

To illustrate three kinds of problems in obtaining correct author lists for Open Citation data from articles in the PubMed Central Open Access subset (OASS), I take three examples: the first is the result of a publication policy, the second is due to mishandling of an authorship attribution at the time of publication, and the third exemplifies errors introduced when handling non-English personal names.

JISC, Open Citations, Bibliography, Citation, Citation Data, Social Science

Garbage in, garbage out – problems with bibliographic references

https://doi.org/10.59350/spqa9-p1864

Published July 1, 2011

Author David M. Shotton

The Open Citations Project has aimed to liberate bibliographic references from biomedical research literature as Open Linked Data, using as its starting corpus the Open Access Subset (OASS) of articles within PubMed Central. The greatest problem faced during this project, naively unanticipated before we started, was the extent of incompleteness, noise and errors of various sorts within the reference information extracted from the OASS articles.

JISC, Open Citations, Bibliography, Citation, Citation Data, Social Science

Input data for Open Citations – the PMC Open Access Subset

https://doi.org/10.59350/refgh-34906

Published July 1, 2011

Author David M. Shotton

PubMed, created by the US National Library of Medicine in DATE, holds bibliographic records and abstracts for essentially all journal articles published in the biomedical sciences. It currently records almost a million new entries each year! PubMed Central (PMC), created as an extension of PubMed, is designed to hold full text articles from among the PubMed entries.

Data Publication, Semantic Publishing, Citation, Data, Datacite, Social Science

Pensoft Journals policy and author guidelines on data publication and citation

https://doi.org/10.59350/tz55j-2wy62

Published June 30, 2011

Author David M. Shotton

In a recent blog post, Heather Piwowar, in discussing the advantages of citing datasets in the reference list of the article, said “No journals have standardized on this approach so far”. However, Pensoft Journals, a publisher that specializes in publishing biodiversity and biological systematics papers, and that has taken the lead in promoting the publication of datasets with DOIs, has exactly such a policy.

JISC, Ontologies, Open Citations, Semantic Publishing, Citation, Social Science

How to cite data

https://doi.org/10.59350/65msb-x7f82

Published June 30, 2011

Author David M. Shotton

As an approach towards developing best practice for data citation, I recently wrote a Data Citation Best Practice Discussion Document that is available on Google Docs, and that I have now slightly revised to Version 2 [1]. In that document, I first compared what is recommended by DataCite [2] and by Altman and King [3] with what is currently practised by the Dryad Data Repository and what presently occurs ‘in the wild’ in a

OpenCitations blog
