LLM-Reading-List
Visit ToolLLM-Reading-List is a Research & Education tool that curates a list of LLM papers. It focuses on inference and model compression, helping researchers track relevant publications.
At a glance
Trending
LLM-Reading-List is a Research & Education tool that curates a list of LLM papers. It focuses on inference and model compression, helping researchers track relevant publications.
Trending
About
LLM-Reading-List is a curated collection of academic papers primarily focused on Large Language Model (LLM) inference and model compression. This GitHub repository serves as a valuable resource for researchers and academics looking to stay updated on the latest advancements in these specialized areas of LLM development. The list is meticulously organized into categories such as Transformer Architectures, Foundation Models, Position Encoding, KV Cache, Activation, Pruning, Quantization, Normalization, Sparsity and rank compression, Fine-tuning, Sampling, Scaling, Mixture of Experts, and Watermarking. Each section provides direct links to key papers, making it an efficient way to explore foundational and cutting-edge research without extensive searching. It is particularly useful for those delving into the technical aspects of optimizing LLMs for performance and efficiency.
Capabilities
Pricing & Plans
Open Source
Free
FAQs
Trending