LLM-Reading-List

LLM-Reading-List is a Research & Education tool that curates a list of LLM papers. It focuses on inference and model compression, helping researchers track relevant publications.

At a glance

Pricing: Open Source

Free tier: Yes

API

Skill level: Technical

About

What is LLM-Reading-List?

LLM-Reading-List is a curated collection of academic papers focused primarily on Large Language Model (LLM) inference and model compression. Hosted as a GitHub repository, it helps researchers and academics keep up with work in these areas. The list is organized into categories: Transformer Architectures, Foundation Models, Position Encoding, KV Cache, Activation, Pruning, Quantization, Normalization, Sparsity and rank compression, Fine-tuning, Sampling, Scaling, Mixture of Experts, and Watermarking. Each section links directly to key papers, so readers can reach both foundational and recent research without extensive searching. It is most useful for readers working on the technical side of optimizing LLMs for performance and efficiency.

Best used for

LLM-Reading-List is best suited to professors and researchers who need quick access to key academic papers, want to track advancements in LLM inference, or want to explore model compression techniques. It is especially valuable for those optimizing large language models for efficiency and performance.

Common actions

find research papers

track LLM advancements

explore model compression

study inference techniques

Capabilities

Key features

Curated LLM paper list
Categorized research topics
Focus on inference
Focus on model compression
Direct paper links

Target Audience

professor, researcher

Integrations

Not yet documented

Pricing & Plans

Open Source

Free

FAQs

What kind of LLM papers are included in the LLM-Reading-List?

The LLM-Reading-List primarily focuses on papers related to Large Language Model (LLM) inference and model compression. It covers topics such as Transformer architectures, foundation models, position encoding, KV cache, pruning, quantization, and various optimization techniques.

How is the LLM-Reading-List organized?

The list is organized into distinct categories, including Transformer Architectures, Foundation Models, Position Encoding, KV Cache, Activation, Pruning, Quantization, Normalization, Sparsity and rank compression, Fine-tuning, Sampling, Scaling, Mixture of Experts, and Watermarking, making it easy to navigate specific research areas.

Is the LLM-Reading-List actively maintained and updated?

The LLM-Reading-List is an open-source GitHub repository maintained by its creator, Evan Miller, who adds papers he is currently reading. It is not a formal publication; it reflects ongoing research interests in the LLM community.
