Rogue Scholar

Published June 12, 2013

One reason I built BioNames (and the related digital archive BioStor) was to create tools to help make sense of taxonomic names. In exploring databases such as GBIF and the NCBI taxonomy every so often you come across cases where things have gone horribly wrong, and to make sense of them you have to drill down into the taxonomic literature.

Data Quality, GBIF, GeoJSON, Mesibov, Millipedes, Computer and Information Sciences

GBIF data quality: visualising Mesibov's millipedes

https://doi.org/10.59350/fmnqw-b9877

Published May 2, 2013

Author Roderic Page

Bob Mesibov (who has been a guest author on this blog) recently published a paper on data quality in ZooKeys. In this paper Bob documents some significant discrepancies between data in his Millipedes of Australia (MoA) database and the equivalent data in the Atlas of Living Australia and GBIF (disclosure: I was a reviewer of the paper, and also sit on GBIF's science committee). This paper spawned a thread on TAXACOM, and also came

BioNames, Demo, EOL, GBIF, Mockup, Computer and Information Sciences

BioNames update - live mockup

https://doi.org/10.59350/s77y6-v8f10

Published April 25, 2013

Author Roderic Page

Things are finally coming together, at least enough to have a functioning demo. It looks awful, but shows the main things I want BioNames to do. One thing I'm most concerned about at this stage is the possible confusion users might experience between taxon names and concepts.

BioNames, GBIF, Taxon Concepts, Computer and Information Sciences

BioNames update - taxon concepts

https://doi.org/10.59350/a91jj-t6350

Published April 11, 2013

Author Roderic Page

Quick notes on "taxon concepts". In order to navigate through taxon names I plan to have at least one taxonomic classification in BioNames. GBIF makes the most sense at this stage. The model I'm adopting is that the classification is a graph where nodes have the id used by the external database (in this case GBIF). Each node has one or more names attached, and where possible the names are linked to the original description.
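As a rough illustration of the model described above, here is a minimal Python sketch of a classification stored as a graph whose nodes are keyed by the external database's identifier, each carrying one or more names that may point at an original description. The class names, field names, and the "gbif:1" / "gbif:2" identifiers are placeholders invented for this sketch; they are not BioNames' actual schema or real GBIF ids.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class TaxonName:
    name_string: str
    authority: Optional[str] = None
    # Identifier of the publication with the original description, if known.
    original_description: Optional[str] = None


@dataclass
class ConceptNode:
    external_id: str                  # id used by the external database
    parent_id: Optional[str] = None   # None marks a root node
    names: List[TaxonName] = field(default_factory=list)


class Classification:
    """A classification as a graph of nodes keyed by external id."""

    def __init__(self, source: str) -> None:
        self.source = source
        self.nodes: Dict[str, ConceptNode] = {}

    def add_node(self, node: ConceptNode) -> None:
        self.nodes[node.external_id] = node

    def children(self, external_id: str) -> List[ConceptNode]:
        return [n for n in self.nodes.values() if n.parent_id == external_id]

    def lineage(self, external_id: str) -> List[str]:
        """Walk from a node up to the root, returning the ids visited."""
        path: List[str] = []
        current = self.nodes.get(external_id)
        while current is not None:
            path.append(current.external_id)
            current = self.nodes.get(current.parent_id) if current.parent_id else None
        return path


# Placeholder ids; the original description is left empty because it is unknown here.
gbif = Classification(source="GBIF")
gbif.add_node(ConceptNode("gbif:1", names=[TaxonName("Rhipidura")]))
gbif.add_node(
    ConceptNode("gbif:2", parent_id="gbif:1",
                names=[TaxonName("Rhipidura fuliginosa")])
)
```

Keeping names as a list on the node, rather than making the name the key, is what lets the same node carry several names while the external id stays the stable handle.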

GBIF, INaturalist, Taxonomy, Version Control, Computer and Information Sciences

Time to put taxonomy into GitHub

https://doi.org/10.59350/9cdg6-e2r87

Published April 10, 2013

Author Roderic Page

Donald Hobern drew my attention to the nice way iNaturalist displays taxonomic splits. In this example, observations identified as Rhipidura fuliginosa are being split into Rhipidura fuliginosa and Rhipidura albiscapa. This immediately reminds me of the idea which keeps circulating around, namely using version control tools to manage taxonomic classification.

GBIF, ION, Matching, Taxonomic Concept, Taxonomic Name, Computer and Information Sciences

BioNames update - matching taxon names to classifications

https://doi.org/10.59350/s2412-5eq71

Published March 20, 2013

Author Roderic Page

One of the things BioNames will need to do is match taxon names to classifications. For example, if I want to display a taxonomic hierarchy for the user to browse through the names, then I need a map between the taxon names that I've collected and one or more classifications. The approach I'm taking is to match strings, wherever possible using both the name and taxon authority.
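One way to picture this kind of string matching is the Python sketch below: it normalises names and authorities, prefers a match on both, and falls back to the name alone when the authority is missing or does not agree. The normalisation rules, function names, and the "Aus bus" records with ids "c1" and "c2" are assumptions made up for illustration, not the matching code BioNames actually uses.

```python
import re
from typing import Dict, List, Optional, Tuple

Index = Dict[str, List[Tuple[str, str]]]


def normalise(text: Optional[str]) -> str:
    """Lowercase, turn punctuation into spaces, collapse whitespace."""
    if not text:
        return ""
    text = re.sub(r"[^\w\s]", " ", text.lower())
    return " ".join(text.split())


def build_index(records: List[Tuple[str, str, Optional[str]]]) -> Index:
    """Index (taxon_id, name, authority) records by normalised name."""
    index: Index = {}
    for taxon_id, name, authority in records:
        index.setdefault(normalise(name), []).append((taxon_id, normalise(authority)))
    return index


def match_name(name: str, authority: Optional[str], index: Index) -> Tuple[List[str], str]:
    """Return matching taxon ids and a label for how the match was made."""
    candidates = index.get(normalise(name), [])
    if not candidates:
        return [], "none"
    wanted = normalise(authority)
    if wanted:
        both = [tid for tid, auth in candidates if auth == wanted]
        if both:
            return both, "name+authority"
    # Authority missing or not matched: fall back to the name string alone.
    return [tid for tid, _ in candidates], "name only"


# Hypothetical homonyms in a classification, distinguished only by authority.
index = build_index([
    ("c1", "Aus bus", "Smith, 1900"),
    ("c2", "Aus bus", "Jones, 1950"),
])
by_both = match_name("Aus bus", "(Smith, 1900)", index)
by_name = match_name("Aus bus", None, index)
```

Returning the label alongside the ids keeps ambiguous name-only matches visible, so they can be reviewed rather than silently accepted.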

GBIF, Global Plant Initiative, GPI, JSTOR, Open Data, Computer and Information Sciences

Why are botanists locking away their data in JSTOR Plant Science?

https://doi.org/10.59350/3ztgr-q6k44

Published February 21, 2013

Author Roderic Page

Somehow I get the feeling that botanists haven't got the "open data" religion. Not only is the list of plant names behind a really bad license, but the Global Plants Initiative (GPI) hides its type images behind a JSTOR Plant Sciences paywall. Why is botany determined to keep its data under wraps?

GBIF, Identifiers, Specimen, SysTax, ZFMK, Computer and Information Sciences

More GBIF specimen identifier strangeness

https://doi.org/10.59350/hjnaq-n0r83

Published January 18, 2013

Author Roderic Page

Continuing the theme of trying to map specimens cited in the literature to the equivalent GBIF records, consider the GBIF record http://data.gbif.org/occurrences/685591320, which according to GBIF is specimen "ZFMK 188762" (a [sic] holotype of Praomys hartwigi). This is odd, because the original publication of this name (Eisentraut, M. 1968. Beitrag zur Saugetierfauna von Kamerun.

GBIF, Gondwana, Panbiogeography, Computer and Information Sciences

Towards a biogeographic search engine

https://doi.org/10.59350/q0fy3-b1825

Published September 24, 2012

Author Roderic Page

We all have a "past" that we might not advertise widely, and my past includes flirting with panbiogeography.

Asterophrys Leucopus, Atlas Of Living Australia, BioStor, GBIF, Linking, Computer and Information Sciences

Dear GBIF, please stop changing occurrenceIDs!

https://doi.org/10.59350/xke1h-qar56

Published July 24, 2012

Author Roderic Page

If we are ever going to link biodiversity data together we need to have some way of ensuring persistent links between digital records. This isn't going to happen unless people take persistent identifiers seriously. I've been trying to link specimen codes in publications to GBIF, with some success, so imagine my horror when it started to fall apart.

iPhylo

Gibbons and GBIF: good grief what a mess
