pre-training
an archive of posts with this tag
| Apr 30, 2024 | Training with DeepSpeed - Basic Concepts |
|---|---|
| Apr 23, 2024 | Mixed-precision training in LLM |
| Apr 14, 2024 | Understanding tokenizer from Andrej Karpathy's tutorial |
an archive of posts with this tag
| Apr 30, 2024 | Training with DeepSpeed - Basic Concepts |
|---|---|
| Apr 23, 2024 | Mixed-precision training in LLM |
| Apr 14, 2024 | Understanding tokenizer from Andrej Karpathy's tutorial |