I have a controversial and perhaps somewhat surprising (to some) view.
Classification of Academic Search Tools by Skill and Performance (summary): This post explores a framework for categorizing academic search tools based on their skill cap (the expertise needed to use them effectively) and performance cap (the potential quality of results they can yield), drawing parallels to gaming strategies. The trade-off: tools like Google Scholar and Web Scale Discovery services (e.g., Primo, Summon) are seen as …
Audio Overviews in Google NotebookLM are making waves online. When I first tried the feature, it was a "wow" moment for me. The last time I felt that way was trying Perplexity.ai in late 2022, when I realized that search engines could now return answers (with citations) instead of just potentially relevant links, and that this would be a huge paradigm shift.
Ex Libris surprised us by suddenly releasing Primo Research Assistant to production on September 9, 2024 (the earlier timeline was 4Q 2024, with some believing it might even be delayed). Even with so many RAG (retrieval augmented generation) academic search systems generating answers from search today, this is still a significant enough event to be worth covering on my blog. Why?
IP and ethical issues surrounding the use of content in Large Language Models (LLMs) have sparked significant debate, but I've mostly stayed out of it, as this isn't my area of expertise; while there's much to discuss and many legal opinions to consider, ultimately the courts will decide what's legal. For those interested in exploring this topic further, I recommend Peter Schoppert's AI & Copyright substack.
I recently watched a librarian give a talk about their experiments teaching prompt engineering. The librarian, drawing on the academic literature on the subject (there is a lot of it!), tried to apply "prompt engineering principles" from one such paper to craft a prompt, and used it in a Retrieval Augmented Generation (RAG) system, specifically Statista's brand-new "research AI" feature.
I have been writing about what I call citation-based literature mapping tools for over five years now, starting with Citation Gecko in 2018. The trend really intensified in 2020, when the overall victory of the open citation movement made citation linking data effectively "free", and tools like Connected Papers, ResearchRabbit, Litmaps and more started to gain real traction.
Can Semantic Search Be More Interpretable? ColBERT and SPLADE might be the answer, but are they enough?
Warning: I am not an information retrieval researcher, so take my blog post with a pinch of salt. In my last blog post, I gave a simplified description of a framework for information retrieval from the paper …
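To give a flavour of why ColBERT is often described as more interpretable than single-vector semantic search, here is a minimal sketch of its late-interaction (MaxSim) scoring idea in Python. The embeddings below are random stand-ins rather than real model output, and this illustrates the scoring idea only, not ColBERT's actual implementation:

```python
import numpy as np

# Hypothetical per-token embeddings (ColBERT-style late interaction sketch):
# one vector per query token, one vector per document token.
query_emb = np.random.randn(4, 128)   # 4 query tokens
doc_emb = np.random.randn(50, 128)    # 50 document tokens

# Normalize so dot products behave like cosine similarities.
query_emb /= np.linalg.norm(query_emb, axis=1, keepdims=True)
doc_emb /= np.linalg.norm(doc_emb, axis=1, keepdims=True)

# MaxSim: for each query token, take its best-matching document token,
# then sum those maxima to get the document score.
sim = query_emb @ doc_emb.T                 # (4, 50) token-level similarities
per_token_best = sim.max(axis=1)            # best match per query token
best_doc_token = sim.argmax(axis=1)         # which document token matched
score = per_token_best.sum()
```

Because the final score decomposes into per-query-token maxima, you can point to the specific document token that "answered" each query token, which is the interpretability hook that single dense-vector search lacks.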
As academic search engines and databases incorporate generative AI into their systems, an important concept that all librarians should grasp is retrieval augmented generation (RAG). You see it in use in all sorts of "AI products" today, from chatbots like Bing Copilot to Adobe's Acrobat AI Assistant, which lets you chat with your PDFs.
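Since RAG comes up repeatedly in these posts, here is a minimal sketch of the pattern in Python. The retrieve() and generate() functions are hypothetical stand-ins for a real search API and a real LLM call; the point is the shape of the pipeline, which is to retrieve passages first and then have the model answer from them with citations:

```python
# A minimal RAG sketch, not any vendor's actual pipeline.

def retrieve(query: str, k: int = 3) -> list[str]:
    """Hypothetical search step: return the top-k passages for the query."""
    index = {
        "rag": "RAG grounds an LLM's answer in retrieved documents.",
        "llm": "LLMs generate fluent text but can hallucinate facts.",
        "cite": "Citing retrieved sources lets readers verify the answer.",
    }
    return list(index.values())[:k]

def generate(prompt: str) -> str:
    """Stand-in for an LLM call (an API request in a real system)."""
    return f"[LLM answer conditioned on a prompt of {len(prompt)} chars]"

def rag_answer(question: str) -> str:
    passages = retrieve(question)
    # The key move: the model answers from the retrieved text, with
    # numbered citations, rather than from its parametric memory alone.
    context = "\n".join(f"[{i + 1}] {p}" for i, p in enumerate(passages))
    prompt = (
        "Answer the question using only the sources below, citing them "
        f"by number.\n\nSources:\n{context}\n\nQuestion: {question}"
    )
    return generate(prompt)

print(rag_answer("What is retrieval augmented generation?"))
```

This is why RAG systems can return answers with citations: the citations point back to the retrieved passages that were placed in the prompt, so readers can check the claims against the sources.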