In the previous post, I outlined reasons why researchers don’t publish data, presented as evidence to the Royal Society’s Policy Study “Science as a Public Enterprise” Call for Evidence.
In the previous post, I outlined reasons why researchers don’t publish data, presented as evidence to the Royal Society’s Policy Study “Science as a Public Enterprise” Call for Evidence.
Evidence submitted by David Shotton in response to the Royal Society’s Policy Study “Science as a Public Enterprise” Call for Evidence, addressing the following two topics raised by that call: Getting Researcher buy-in. How do we get researchers to be more willing to share data? What is there to be learned from disciplines such as genomics which have norms which favour wide sharing of data?
Alistair Miles, of SKOS fame, who formerly worked in our research group, spent yesterday afternoon catching up with us, and has written a nice blog post on the MalariaGEN Informatics Blog describing our current activities, including our work on the Open Citations Corpus, and how they might intersect with the data management activities of the MalariaGEN, the Malaria Genome Epidemiology Network for which he now works.
Data copied from JISC Expo DOAP (Description of a Project) spreadsheet at https://spreadsheets.google.com/ccc?key=0ArsNASxXZiL6dC1mWWFMMjRWSmVha0E1WmdlQ05KcEE&hl=en#gid=7. Project title : The Open Citations Project Project tag : jiscopencite Short project description We will publish reference lists from Open Access biomedical journal articles as Linked Open Citation Data at
As part of the Open Citations Project, Alex Dutton recently completed a graphing plug-in for the Open Citations web site, that permits users to generate different kinds of graphs of citation networks by querying the Open Citation Corpus for a particular article, and either display the network of papers citing that article (input citations), papers cited by that article (output citations), or both.
**Executive summary ** *Introduction * To general readers of this blog, this post will appear different from normal posts. Rather than being about a particular topic, it pulls together a summary of the work undertaken over the past year within the Open Citations Project supported by the JISC, and is primarily intended to assist JISC evaluation of the project and its outputs.
The JISC Open Citations Project website at http://opencitations.net exists for several purposes: To hold introductory information about the JISC Open Citations Project. To hold contact information for the project team.
Reis et al . (2008) [1] cites an earlier paper from Albert Ko’s research group, Ko et al . (1999) [2]. In conventional parlance, as the following diagram shows, the word “reference” can mean either what is found in the text, what is found in the reference list, the act of citation, or the object of the citation itself, as in the sentence “All the references you will need to prepare for the journal club are on Kevin’s desk”.
The input PubMed Central Open Access subset XML reference data, our starting corpus, were transformed into Open Citations RDF in multiple stages: The original XML was first transformed into an intermediate form using XSLT.
As previously described, the PubMed Central Open Access subset of journal articles yielded 6,529,815 independent bibliographic records of both citing and cited entities, while our use of the PubMed Entrez API provided a further 2,304,143 bibliographic records for the same cited entities. Before converting these references into RDF to create the Open Citations Corpust, we attempted to remove errors in the data.