======
Efficient Estimation of Word Representations in Vector Space
Distributed Representations of Words and Phrases and their Compositionality
GloVe: Global Vectors for Word Representation
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
ELMo: Deep contextualized word representations
Contextual Word Representations: A Contextual Introduction
The Illustrated BERT, ELMo, and co.
Jurafsky and Martin Chapter 11 (Fine-Tuning and Masked Language Models)
GPT-2: Language Models are Unsupervised Multitask Learners
GPT-3: Language Models are Few-Shot Learners
LLaMA: Open and Efficient Foundation Language Models
InstructGPT: Aligning language models to follow instructions
Scaling Instruction-Finetuned Language Models
Self-Instruct: Aligning Language Models with Self-Generated Instructions
Alpaca: A Strong, Replicable Instruction-Following Model
Vicuna: An Open-Source Chatbot Impressing GPT-4 with 90%* ChatGPT Quality
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Parameter-Efficient Transfer Learning for NLP
LoRA: Low-Rank Adaptation of Large Language Models
QLoRA: Efficient Finetuning of Quantized LLMs
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
ZeRO & DeepSpeed: New system optimizations enable training models with over 100 billion parameters
Gradient Checkpointing: Training Deep Nets with Sublinear Memory Cost
What is Gradient Accumulation?
vLLM: Efficient Memory Management for Large Language Model Serving with PagedAttention
Fast Inference from Transformers via Speculative Decoding
======
Stanford SLP book notes on Neural Networks, Backpropagation
HKUST Prof. Kim's PyTorchZeroToAll Tutorial
Deep Learning Practical Methodology
Stanford CS224N notes on Language Models, RNN, GRU and LSTM
Stanford CS224N notes on Self-Attention & Transformers
Stanford CS224N notes on Word Vectors