Rogue Scholar

Published April 17, 2014

Author Matt Sundquist

Topic ModelingTextminingElifeUnconfUnconf14Computer and Information Sciences

Topic modeling in R

https://doi.org/10.59350/9dvj7-n4g49

Published April 16, 2014

Author Carson Sievert

Editor’s note: This is the first in a series of posts from rOpenSci’s recent hackathon. I recently had the pleasure of participating in rOpenSci’s hackathon. To be honest, I was quite nervous to work among such notables, but I immediately felt welcome thanks to a warm and personable group. Alyssa Frazee has a great post summarizing the event, so check that out if you haven’t already.

APIComputer and Information Sciences

The ins and outs of interacting with web APIs

https://doi.org/10.59350/jvvm5-x7093

Published April 14, 2014

Authors Scott Chamberlain, Karthik Ram

We’ve received a number of questions from our users about dealing with the finer details of data sources on the web. Whether you’re reading data from local storage such as a csv file, a .Rdata store, or possibly a proprietary file format, you’ve most likely run into some issues in the past. Common problems include passing incorrect paths, files being too big for memory, or requiring several packages to read files in incompatible formats.

APIBiodiversitySpeciesOccurrenceComputer and Information Sciences

Accessing iNaturalist data

https://doi.org/10.59350/avxrj-tj774

Published March 26, 2014

Author Ted Hart

The iNaturalist project is a really cool way to both engage people in citizen science and collect species occurrence data. The premise is pretty simple, users download an app for their smartphone, and then can easily geo reference any specimen they see, uploading it to the iNaturalist website. It let’s users turn casual observations into meaningful crowdsourced species occurrence data.

APIBiodiversitySpeciesOccurrenceComputer and Information Sciences

Species occurrence data

https://doi.org/10.59350/z9cvk-99y79

Published March 17, 2014

Author Karthik Ram

UPDATE: mapping functions are in a separate package now (mapr). Examples that do mapping below have been updated. The rOpenSci projects aims to provide programmatic access to scientific data repositories on the web. A vast majority of the packages in our current suite retrieve some form of biodiversity or taxonomic data.

APIClimateComputer and Information Sciences

rnoaa - Access to NOAA National Climatic Data Center data

https://doi.org/10.59350/549ga-0vh52

Published March 13, 2014

Author Scott Chamberlain

We recently pushed the first version of rnoaa to CRAN - version 0.1. NOAA has a lot of data, some of which is provided via the National Climatic Data Center, or NCDC. NOAA has provided access to NCDC climate data via a RESTful API - which is great because people like us can create clients for different programming languages to access their data programatically.

APIReproducible ResearchDataArchivingComputer and Information Sciences

dvn - Sharing Reproducible Research from R

https://doi.org/10.59350/wweve-rt950

Published February 20, 2014

Author Thomas J. Leeper

Reproducible research involves the careful, annotated preservation of data, analysis code, and associated files, such that statistical procedures, output, and published results can be directly and fully replicated. As the push for reproducible research has grown, the R community has responded with an increasingly large set of tools for engaging in reproducible research practices (see, for example, the ReproducibleResearch Task View on CRAN).

APITaxizeTaxonomyComputer and Information Sciences

New features in the most recent taxize update, v0.2

https://doi.org/10.59350/czbbh-qz045

Published February 19, 2014

Author Scott Chamberlain

We just released a new version of taxize - version 0.2.0. This release contains a number of new features, and bug fixes. Here is a run down of some of the changes: First, install and load taxize install.packages("rgbif") library(taxize) New things New functions: class2tree Sometimes you just want to have a visual of the taxonomic relationships among taxa.

TaxonomyBiodiversityInformaticsComputer and Information Sciences

AntWeb - programmatic interface to ant biodiversity data

https://doi.org/10.59350/911zq-81w92

Published February 18, 2014

Author Karthik Ram

Data on more than 10,000 species of ants recorded worldwide are available through from California Academy of Sciences’ AntWeb, a repository that boasts a wealth of natural history data, digital images, and specimen records on ant species from a large community of museum curators.

APIGBIFComputer and Information Sciences

Changed and new things in the new version of rgbif, v0.5

https://doi.org/10.59350/9vh5m-c3e32

Published February 17, 2014

Author Scott Chamberlain

rgbif is an R package to search and retrieve data from the Global Biodiverity Information Facilty (GBIF). rgbif wraps R code around the [GBIF API][gbifapi] to allow you to talk to GBIF from R. We just pushed a new verion of rgbif to cran - v0.5.0. Source and binary files are now available on CRAN. There are a few new functions: count_facet, elevation, and installations. These are described, with examples, below.

rOpenSci - open tools for open science

Make your ggplots shareable, collaborative, and with D3

Topic modeling in R

The ins and outs of interacting with web APIs

Accessing iNaturalist data

Species occurrence data

rnoaa - Access to NOAA National Climatic Data Center data

dvn - Sharing Reproducible Research from R

New features in the most recent taxize update, v0.2

AntWeb - programmatic interface to ant biodiversity data

Changed and new things in the new version of rgbif, v0.5