Rogue Scholar

Published August 24, 2022

More arm-waving notes on taxonomic databases. I've started to add data to ChecklistBank and this has got me thinking about the issue of data quality.

Catalogue Of LifeCitationCrossrefDataCiteDOIComputer and Information Sciences

Papers citing data that cite papers: CrossRef, DataCite, and the Catalogue of Life

https://doi.org/10.59350/w18j9-v7j10

Published August 3, 2022

Author Roderic Page

Quick notes to self following on from a conversation about linking taxonomic names to the literature. There are different sorts of citation: Paper cites another paper Paper cites a dataset Dataset cites a paper Citation type (1) is largely a solved problem (although there are issues of the ownership and use of this data, see e.g. Zootaxa has no impact factor.

CitationGBIFMaterial ExaminedSpecimen CodesComputer and Information Sciences

Round trip from identifiers to citations and back again

https://doi.org/10.59350/2b1j9-qmw12

Published May 27, 2022

Author Roderic Page

Note to self (basically rewriting last year's Finding citations of specimens). Bibliographic data supports going from identifier to citation string and back again, so we can do a "round trip." 1. Given a DOI we can get structured data with a simple HTTP fetch, then use a tool such as citation.js to convert that data into a human-readable string in a variety of formats.

CitationCSLMachine LearningParsingComputer and Information Sciences

Citation parsing tool released

https://doi.org/10.59350/9416m-mzz03

Published July 22, 2021

Author Roderic Page

Quick note on a tool I've been working on to parse citations, that is to take a series of strings such as: Möllendorff O (1894) On a collection of land-shells from the Samui Islands, Gulf of Siam. Proceedings of the Zoological Society of London, 1894: 146–156. de Morgan J (1885) Mollusques terrestres & fluviatiles du royaume de Pérak et des pays voisins (Presqúile Malaise). Bulletin de la Société Zoologique de France, 10: 353–249.

CitationCRFIdentifiersMachine LearningSpecimensComputer and Information Sciences

Finding citations of specimens

https://doi.org/10.59350/gg8m4-vb985

Published May 28, 2021

Author Roderic Page

Note to self. The challenge of finding specimen citations in papers keeps coming around. It seems that this is basically the same problem as finding citations to papers, and can be approached in much the same way. If you want to build a database of reference from scratch, one way is to scrape citations from papers (e.g., from the "literature cited" section), convert those strings into structured data, and add those to your database.

CitationTalkWikidataComputer and Information Sciences

Workshop On Open Citations And Open Scholarly Metadata 2020 talk

https://doi.org/10.59350/f7bm7-cw590

Published August 25, 2020

Author Roderic Page

I'm giving a short talk at the Workshop On Open Citations And Open Scholarly Metadata 2020, which will be held online on September 9th.

BlogsCitationCrossrefDark TaxaDOIComputer and Information Sciences

Citations, Social Media & Science

https://doi.org/10.59350/epv4n-xpg23

Published July 11, 2012

Author Roderic Page

Quick note that Morgan Jackson (@BioInFocus) has written nice blog post Citations, Social Media & Science inspired by the fact that the following paper: Kwong, S., Srivathsan, A., & Meier, R. (2012). An update on DNA barcoding: low species coverage and numerous unidentified sequences. Cladistics, no–no. doi:10.1111/j.1096-0031.2012.00408.x cites my "Dark taxa" in the body of the text but not in the list of literature cited.

CitationCollectionsIdentifiersSpecimensTAXACOMComputer and Information Sciences

Yet another reason why we need specimen identifiers, now!

https://doi.org/10.59350/56hff-14t75

Published January 18, 2012

Author Roderic Page

This message appeared on the TAXACOM mailing list: Given that most specimens lack resolvable digital identifiers (a theme I've harped on about before, most recently in the context of DNA barcoding), answering this kind of query ends up being a case of searching publications for text strings that contain the acronym of the collection.

BHLCitationDOIEOLLinkingComputer and Information Sciences

BHL needs to engage with publishers (and EOL needs to link to primary literature)

https://doi.org/10.59350/846cx-3cn39

Published November 24, 2011

Author Roderic Page

Browsing EOL I stumbled upon the recently described fish Protoanguilla palau , shown below in an image by rairaiken2011: Two things struck me, the first is that the EOL page for this fish gives absolutely no clue as to where you would to find out more about this fish (apart from an unclickable link to the Wikipedia page http://en.wikipedia.org/wiki/Protoanguilla - seriously, a link that isn't clickable?), despite the fact this fish

BioStorCitationCitation MatchingMatchingMetadataComputer and Information Sciences

Rethinking citation matching

https://doi.org/10.59350/eet4x-65051

Published September 13, 2011

Author Roderic Page

Some quick half-baked thoughts on citation matching. One of the things I'd really like to add to BioStor is the ability to parse article text and extract the list of literature cited. Not only would this be another source of bibliographic data I can use to find more articles in BHL, but I could also build citation networks for articles in BioStor.

iPhylo

Can we use the citation graph to measure the quality of a taxonomic database?

Papers citing data that cite papers: CrossRef, DataCite, and the Catalogue of Life

Round trip from identifiers to citations and back again

Citation parsing tool released

Finding citations of specimens

Workshop On Open Citations And Open Scholarly Metadata 2020 talk

Citations, Social Media & Science

Yet another reason why we need specimen identifiers, now!

BHL needs to engage with publishers (and EOL needs to link to primary literature)

Rethinking citation matching