
Research Graph

Large Language Models, Artificial Intelligence, Prompt Engineering, Computer Science, English

Author: Dhruv Gupta (ORCID: 0009-0004-7109-5403)

Introduction: Large Language Models (LLMs) have become the new face of Natural Language Processing (NLP). With their generative power and their ability to comprehend human language, human reliance on these models is increasing every day. However, LLMs are known to hallucinate and thus produce incorrect outputs.

Megalodon, Long Texts, Transformer Architecture, Computer Science, English

An improved architecture that surpasses the Transformer, proposed by Meta

Author: Qingqin Fang (ORCID: 0009-0003-5348-4264)

Introduction: Recently, researchers from Meta and the University of Southern California introduced a model called Megalodon. They claim that this model can expand the context window of language models to millions of tokens without overwhelming memory.

Large Language Models, Artificial Intelligence, Transformers, Natural Language Processing, Computer Science, English

Understanding the Evolutionary Journey of LLMs

Author: Wenyi Pi (ORCID: 0009-0002-2884-2771)

Introduction: When we talk about large language models (LLMs), we are actually referring to a type of advanced software that can communicate in a human-like manner. These models have the remarkable ability to understand complex contexts and to generate content that is coherent and feels human.

Natural Language Processing, Transformers, Artificial Intelligence, Computer Science, English

Attention mechanism not getting enough attention

Author: Dhruv Gupta (ORCID: 0009-0004-7109-5403)

Introduction: As discussed in this article, RNNs were incapable of learning long-term dependencies. To solve this issue, both LSTMs and GRUs were introduced. However, even though LSTMs and GRUs did a fairly decent job on textual data, they still did not perform well enough.
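For readers new to the topic, the following is a minimal, illustrative sketch of scaled dot-product self-attention in plain NumPy; it is not taken from the post itself, and the function and variable names are our own.

```python
# Minimal sketch of scaled dot-product self-attention (illustrative only;
# the post's own notation and implementation may differ).
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Q, K: arrays of shape (seq_len, d_k); V: (seq_len, d_v)."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                        # similarity between positions
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)         # softmax over the keys
    return weights @ V                                     # weighted sum of values

# Toy usage: 4 tokens with 8-dimensional embeddings attending to themselves.
rng = np.random.default_rng(0)
x = rng.normal(size=(4, 8))
print(scaled_dot_product_attention(x, x, x).shape)         # (4, 8)
```

Unlike an RNN, every position here attends to every other position in a single step, which is why attention handles long-range dependencies more directly.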

Natural Language Processing, LSTM, Artificial Intelligence, Recurrent Neural Network, Computer Science, English

The Three Oldest Pillars of NLP

Author: Dhruv Gupta (ORCID: 0009-0004-7109-5403)

Introduction: Natural Language Processing (NLP) has almost become synonymous with Large Language Models (LLMs), Generative AI, and fancy chatbots. With the ever-increasing amount of textual data and the exponential growth in computing power, these models are improving every day.

Large Language Models, Framework, Retrieval-Augmented, Computer Science, English

A Unified and Collaborative Framework for LLMs

Author: Qingqin Fang (ORCID: 0009-0003-5348-4264)

Introduction: In today's rapidly evolving field of artificial intelligence, large language models (LLMs) are demonstrating unprecedented potential. In particular, the Retrieval-Augmented Generation (RAG) architecture has become a hot topic in AI due to its unique technical capabilities.
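To make the RAG idea concrete, here is a minimal, hypothetical sketch of the retrieve-then-generate loop; the toy keyword retriever and the placeholder generate() function are illustrative assumptions, not the framework described in the post.

```python
# Minimal sketch of a Retrieval-Augmented Generation (RAG) loop.
# The keyword retriever and generate() placeholder are illustrative assumptions;
# real systems use a vector index and an LLM API instead.
from typing import List

DOCUMENTS = [
    "Retrieval-Augmented Generation grounds answers in retrieved documents.",
    "LLMs can hallucinate when they answer purely from parametric memory.",
    "A retriever narrows a large corpus down to a few relevant passages.",
]

def retrieve(query: str, docs: List[str], k: int = 2) -> List[str]:
    # Score documents by naive keyword overlap with the query.
    q_terms = set(query.lower().split())
    ranked = sorted(docs, key=lambda d: len(q_terms & set(d.lower().split())), reverse=True)
    return ranked[:k]

def generate(prompt: str) -> str:
    # Placeholder for an LLM call (e.g. an API request); not a real model.
    return f"[answer conditioned on a prompt of {len(prompt)} characters]"

def rag_answer(query: str) -> str:
    # Retrieve supporting passages, then condition the generator on them.
    context = "\n".join(retrieve(query, DOCUMENTS))
    prompt = f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"
    return generate(prompt)

print(rag_answer("How does retrieval-augmented generation reduce hallucination?"))
```

Grounding the prompt in retrieved passages is what lets RAG systems point back to sources instead of relying only on what the model memorized during training.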