Rogue Scholar

Bob Mesibov, Dark Taxa, Dimly Lit Taxa, Guest Post, Computer and Information Sciences

Dimly lit taxa - guest post by Bob Mesibov

Published July 12, 2012

The following is a first for iPhylo, a guest post by Bob Mesibov. Rod Page introduced 'dark taxa' here on iPhylo in April 2011. He wrote: Rod suggested that 'quite a lot' of biology can be done without taxonomic names.

Blogs, Citation, Crossref, Dark Taxa, DOI, Computer and Information Sciences

Citations, Social Media & Science

https://doi.org/10.59350/epv4n-xpg23

Published July 11, 2012

Author Roderic Page

Quick note that Morgan Jackson (@BioInFocus) has written a nice blog post, Citations, Social Media & Science, inspired by the fact that the following paper: Kwong, S., Srivathsan, A., & Meier, R. (2012). An update on DNA barcoding: low species coverage and numerous unidentified sequences. Cladistics, no–no. doi:10.1111/j.1096-0031.2012.00408.x cites my "Dark taxa" in the body of the text but not in the list of literature cited.

GBIC2012, GBIF, Ontology, Computer and Information Sciences

Post GBIC2012 thoughts

https://doi.org/10.59350/yywhk-57n80

Published July 6, 2012

Author Roderic Page

I'm back from Copenhagen and GBIC2012.

Equirectangular Projection, Github, Maps, Orthographic Projection, Polar, Computer and Information Sciences

Using orthographic projections to map organism distributions

https://doi.org/10.59350/6zzw6-fx070

Published July 1, 2012

Author Roderic Page

For a project I'm currently working on, I show organism distributions using data from GBIF, and I display that data on a map that uses the equirectangular projection.

Biodiversity Informatics, Bowker, GBIC2012, GBIF, Planet Management, Computer and Information Sciences

Planet management, GBIF, and the future of biodiversity informatics

https://doi.org/10.59350/zhnm6-0vf92

Published June 29, 2012

Author Roderic Page

Next week I'm in Copenhagen for GBIC, the Global Biodiversity Informatics Conference. The goal of the conference is to: The collaboration referred to is the agreement to mobilise data and informatics capability to meet the Aichi Biodiversity Targets. I confess I have mixed feelings about the upcoming meeting. There will be something like 100 people attending the conference, with backgrounds ranging from pure science to intergovernmental policy.

Crowdsourcing, EOL, FigShare, Flickr, Github, Computer and Information Sciences

Where is the "crowd" in crowdsourcing? Mapping EOL Flickr photos

https://doi.org/10.59350/w706f-6sr12

Published June 28, 2012

Author Roderic Page

In any discussion of data gathering or data cleaning the term "crowdsourcing" inevitably comes up. An example where this approach has been successful is the Encyclopedia of Life's Flickr pool, where Flickr users upload images that are harvested by EOL. Given that many Flickr photos are taken with cameras that have built-in GPS (such as the iPhone, the most common camera on Flickr) we could potentially use the Flickr photos not only as a source of

DOI, Handle, UUID, Computer and Information Sciences

UUIDs

https://doi.org/10.59350/hjpt8-4qw10

Published June 27, 2012

Author Roderic Page

Just for future reference:

Catalogue Of Life, Chresonym, Data Cleaning, Errors, Homonym, Computer and Information Sciences

More fictional taxa and the myth of the expert taxonomic database

https://doi.org/10.59350/regph-e8w09

Published June 25, 2012

Author Roderic Page

I know I'm starting to sound like a broken record, but the more I look, the more taxonomic databases seem to be full of garbage. Databases such as the Catalogue of Life, which states that it is a "quality-assured checklist", have records that are patently wrong.

BHL, BioStor, Open Access, Computer and Information Sciences

70,000 articles extracted from the Biodiversity Heritage Library

https://doi.org/10.59350/aw0b4-8an11

Published June 19, 2012

Author Roderic Page

Just noticed that BioStor now has just over 70,000 articles extracted from the Biodiversity Heritage Library. This number is a little "soft" as there are some duplicates in the database that I need to clean out, but it's a nice-sounding number.

BHL, OCR, Text-mining, Computer and Information Sciences

BHL and text-mining: some ideas

https://doi.org/10.59350/a8jsq-7xz17

Published June 18, 2012

Author Roderic Page

Some quick notes on possibilities for text-mining BHL (in rough order of priority). Any text-mining would have to be robust to OCR errors.

BHL, Errors, Fictional Taxa, GBIF, Google, Computer and Information Sciences

Fictional taxa

https://doi.org/10.59350/qy4n8-y2770

Published June 18, 2012

Author Roderic Page

Anyone who works with taxonomic databases is aware that they have errors. Some taxonomic databases are restricted in scope to a particular taxon in which one or more people have expertise; these then get aggregated into larger databases, which may in turn be aggregated by databases whose scope is global.