Rogue Scholar

ClassificationD3jsRDFSPARQLWikidataInformatikEnglisch

Displaying taxonomic classifications from Wikidata using d3js and SPARQL

Veröffentlicht 14. Januar 2017

Following on from previous posts The Semantic Web made fun: d3sparql and The Biodiversity Heritage Library meets Wikidata via Wikispecies: adding author identifiers to BioStor I've put together an example query that can be used to extract a taxonomic classification from Wikidata.

BHLBioStorIdentiifersISNIORCIDInformatikEnglisch

The Biodiversity Heritage Library meets Wikidata via Wikispecies: adding author identifiers to BioStor

https://doi.org/10.59350/kcbnd-17a33

Veröffentlicht 11. Januar 2017

Autor Roderic Page

I've added an experimental feature to BioStor that uses data from Wikidata and Wikispecies to augment what information BioStor displays on authors.

BHLBioStorNgramTimelineInformatikEnglisch

Taxonomic name timelines for BHL

https://doi.org/10.59350/nsxnn-b4v06

Veröffentlicht 23. Dezember 2016

Autor Roderic Page

Given a big corpus of literature one of the fun things to do is look at how the use of a term has changed over time. When did people first use a particular word? When did one word start to replace another, etc.? Google's Ngram Viewer is perhaps the best known tool for exploring these questions.

BOLDDNA BarcodingGBIFTaxonomyInformatikEnglisch

DNA barcoding taxonomy now in GBIF

https://doi.org/10.59350/m33gh-adh33

Veröffentlicht 22. Dezember 2016

Autor Roderic Page

Following on from adding DNA barcodes to GBIF I've now uploaded a taxonomic classification of DNA barcode BINs (Barcode Index Numbers). Each BIN is a cluster of similar DNA barcodes that is essentially equivalent to a species.

DNA BarcodingGBIFIBOLInformatikEnglisch

iBOL DNA barcodes in GBIF

https://doi.org/10.59350/9tafr-k2g51

Veröffentlicht 8. Dezember 2016

Autor Roderic Page

I've uploaded all the COI barcodes in the iBOL public data dumps into GBIF. This is an update of an earlier project that uploaded a small subset. Now that dataset doi:10.15468/inygc6 has been expanded to include some 2.7 million barcodes.

D3JavascriptSemantic WebSPARQLVisualisationInformatikEnglisch

The Semantic Web made fun: d3sparql

https://doi.org/10.59350/8jtg0-vtg66

Veröffentlicht 24. November 2016

Autor Roderic Page

Continuing my on-again off-again relationship with the Semantic Web, I stumbled across a cool approach to visualising the results of SPARQL queries.

InformatikEnglisch

Copyright and the Use of Images as Biodiversity Data

https://doi.org/10.59350/qt5ae-kds17

Veröffentlicht 14. November 2016

Autor Roderic Page

Willi Egloff, Donat Agosti, Puneet Kishor, David Patterson, and Jeremy A. Miller have published an interesting preprint entitled “Copyright and the Use of Images as Biodiversity Data” DOI:10.1101/087015 in which they argue that taxonomic images aren't copyrightable. I'm not convinced, and have commented on the bioRxiv site.

EOLJSON-LDPlatformTraitBankInformatikEnglisch

EOL Traitbank JSON-LD is broken

https://doi.org/10.59350/ba6y2-7yn42

Veröffentlicht 12. November 2016

Autor Roderic Page

One of the most interesting aspects of EOL is "TraitBank", which has been described in a recent paper: TraitBank is available in JSON-LD, and so is potentially part of the Semantic Web.

Bob MesibovCharacter EncodingGuest PostUFT-8InformatikEnglisch

Guest post: It's 2016 and your data aren't UTF-8 encoded?

https://doi.org/10.59350/t8j1g-8h227

Veröffentlicht 30. September 2016

Autor Roderic Page

The following is a guest post by Bob Mesibov. According to w3techs, seven out of every eight websites in the Alexa top 10 million are UTF-8 encoded. This is good news for us screenscrapers, because it means that when we scrape data into a UTF-8 encoded document, the chances are good that all the characters will be correctly encoded and displayed. It's not quite good news for two reasons.

ChallengeGBIFInformatikEnglisch

GBIF 2016 Ebbe Nielsen Challenge entries

https://doi.org/10.59350/44ber-48595

Veröffentlicht 30. September 2016

Autor Roderic Page

The GBIF 2016 Ebbe Nielsen Challenge has received 15 submissions. You can view them here: Unlike last year where the topic was completely open, for the second challenge we've narrowed the focus to "Analysing and addressing gaps and biases in primary biodiversity data". As with last year, judging is limited to the jury (of which I'm a member), however anyone interested in biodiversity informatics can browse the submissions.

Guest PostIRMNGTony ReesInformatikEnglisch

Guest post: Absorbing task or deranged quest: an attempt to track all genus names ever published

https://doi.org/10.59350/sx242-7wv17

Veröffentlicht 7. September 2016

Autor Roderic Page

This guest post by Tony Rees describes his quest to track all genus names ever published (plus a subset of the species…). A “holy grail” for biodiversity informatics is a suitably quality controlled, human- and machine-queryable list of all the world’s species, preferably arranged in a suitable taxonomic hierarchy such as kingdom-phylum-class-order-family-genus or other.

iPhylo

Displaying taxonomic classifications from Wikidata using d3js and SPARQL

The Biodiversity Heritage Library meets Wikidata via Wikispecies: adding author identifiers to BioStor

Taxonomic name timelines for BHL

DNA barcoding taxonomy now in GBIF

iBOL DNA barcodes in GBIF

The Semantic Web made fun: d3sparql

Copyright and the Use of Images as Biodiversity Data

EOL Traitbank JSON-LD is broken

Guest post: It's 2016 and your data aren't UTF-8 encoded?

GBIF 2016 Ebbe Nielsen Challenge entries

Guest post: Absorbing task or deranged quest: an attempt to track all genus names ever published