Rogue Scholar

Published September 1, 2009

Author Roderic Page

Playing a bit more with the Wikipedia mammal data, there are some interesting patterns to note.

Clay ShirkyEOLGooglePower LawSearchComputer and Information Sciences

Google, Wikipedia, and EOL

https://doi.org/10.59350/qvzh4-v1988

Published September 1, 2009

Author Roderic Page

One assumption I've been making so far is that when people search for information on an organism using its scientific name, Wikipedia will dominate the search results (see my earlier post for an example of this assumption). I've decided to quantify this by doing a little experiment. I grabbed the Mammal Species of the World taxonomy and extracted the 5416 species names. I then used Google's AJAX search API to look up each name in Google.

GoogleGoogle DocsGoogle SpreadsheetsIPNIComputer and Information Sciences

Using Google Spreadsheets and RSS feeds to edit IPNI

https://doi.org/10.59350/9qhcq-7x469

Published July 14, 2009

Author Roderic Page

One thing I find myself doing a lot is creating Excel spreadsheets and filling them will lists of taxonomic names and bibliographic references, for which I then try to extract identifiers (such as DOIs). This is a tedious business, but the hope is that by doing it once I can create a useful resource. However, often I get bored and the spreadsheets lie forgotten in some deep recess of my computer's hard drive.

Bit.lyGoogleTinyURLURL ShorteningComputer and Information Sciences

Short URLs

https://doi.org/10.59350/7hgtg-k2427

Published April 16, 2009

Author Roderic Page

Short URLs have been a topic of discussion recently, perhaps sparked by the article URL Shorteners: Which Shortening Service Should You Use?. Many will have encountered short URLs in Twitter tweets. Leigh Dodds (@ldodds) asked I guess Leigh's talking about the need for short URLs in tweets, but I wonder about the more general question of why we need URL shorteners at all.

"Guy Kawasaki"BooksGeoreferencingGoogleMITComputer and Information Sciences

Reading books

https://doi.org/10.59350/bx6dd-tnt33

Published December 16, 2008

Author Roderic Page

One advantage of flying to the US is the chance to do some reading. At Newark (EWR) I picked up Guy Kawasaki's "Reality Check", which is a fun read. You can get a flavour of the book from this presentation Guy gave in 2006. While at MIT for the Elsevier Challenge I was browsing in the MIT book shop and stumbled across "Google and the Myth of Universal Knowledge" by Frenchman Jean-Noël Jeanneney. It's, um, very French.

"Social Graph API"GoogleComputer and Information Sciences

Google's Social Graph API

https://doi.org/10.59350/jgvp9-qsj67

Published March 10, 2008

Author Roderic Page

Google's Social Graph API was released earlier this year. The motivation: Apart from the obvious application to scientific databases (for example, utilising connections such as co-authorship), imagine the same idea applied to data.

"data Wars"GoogleMashupMicrosoftScrapingComputer and Information Sciences

The Data Wars

https://doi.org/10.59350/q7e07-vys87

Published February 5, 2008

Author Roderic Page

Wired 16.01 has an article entitled The Data Wars by Josh McHugh. A quote from the printed version: It's a sobering read for those of us who advocate harvesting data from as many sources as possible, more so in light of Microsoft's bid to buy Yahoo. Yahoo provides free access to many of its tools via an API (such as the image search I use in iSpecies, and in this sense is much more open than Google. Might this change under Microsoft...?

iPhylo

Wikipedia mammals and the power law

Google, Wikipedia, and EOL

Using Google Spreadsheets and RSS feeds to edit IPNI

Short URLs

Reading books

Google's Social Graph API

The Data Wars