Computer and Information SciencesBlogger

iPhylo

Rants, raves (and occasionally considered opinions) on phyloinformatics, taxonomy, and biodiversity informatics. For more ranty and less considered opinions, see my Twitter feed.ISSN 2051-8188. Written content on this site is licensed under a Creative Commons Attribution 4.0 International license.
Home PageAtom FeedMastodonISSN 2051-8188
language
Biodiversity InformaticsChallengeWhite PaperComputer and Information Sciences
Published

BMC Ecology has published Alex Hardisty and Dave Roberts' white paper on biodiversity informatics: Here are their 12 recommendations (with some comments of my own): Open Data, should be normal practice and should embody the principles of being accessible, assessable, intelligible and usable. Seems obvious, but data providers are often reluctant to open "their" data up for reuse. Data encoding should allow analysis across multiple

BioNamesGBIFTaxon ConceptsComputer and Information Sciences
Published

Quick notes on "taxon concepts". In order to navigate through taxon names I plan to have at least one taxonomic classification in BioNames. GBIF makes the most sense at this stage. The model I'm adopting is that the classification is a graph where nodes have the id used by the external database (in this case GBIF). Each node has one or more names attached, and where possible the names are linked to the original description.

GBIFINaturalistTaxonomyVersion ControlComputer and Information Sciences
Published

Donald Hobern drew my attention to nice the way iNaturalist displays taxonomic splits: In this example, observations identified as Rhipidura fuliginosa are being split into Rhipidura fuliginosa and Rhipidura albiscapa . This immediately reminds me of the idea which keeps circulating around, namely using version control tools to manage taxonomic classification.

BLASTDNA BarcodingQR CodesComputer and Information Sciences
Published

Came across this paper recently:Despite QR Codes being uncool, there's something appealing about the idea of compressing a DNA barcode sequence into a small image.

FontPointsSVGComputer and Information Sciences
Published

I'm working on displaying OCR text from BHL using SVG, and these are just some quick notes on font size. Specifically how SVG font size corresponds to the size of letters, and how you work out what point size was used to print text on a BHL page.SVG font-size corresponds to the EM square of the font.

GBIFIONMatchingTaxonomic ConceptTaxonomic NameComputer and Information Sciences
Published

On eof the things BioNames will need to do is match taxon names to classifications. For example, if I want to display a taxonomic hierarchy for the user to browse through the names, then I need a map between the taxon names that I've collected and one or more classifications. The approach I'm taking is to match strings, wherever possible using both the name and taxon authority.

BHLBioStorComputer and Information Sciences
Published

The new look Biodiversity Heritage Library has just launched. It's a complete refresh of the old site, based on the Biodiversity Heritage Library–Australia site. If you want an overview of what's new, BHL have published a guide to the new look site. Congrats to involved in the relaunch.One of the new features draws on the work I've been doing on BioStor.

BioNamesDatabaseEOLTaxonomyComputer and Information Sciences
Published

Yet another taxonomic database, this time I can't blame anyone else because I'm the one building it (with some help, as I'll explain below).BioNames was my entry in EOL's Computable Data Challenge (you can see the proposal here: http://dx.doi.org/10.6084/m9.figshare.92091). In that proposal I outlined my goal:The bulk of the funding from EOL is going into interface work by Ryan Schenk (@ryanschenk), author of synynyms among other cool things.