- math
- code
- pre-training
- finetune
- LLM
- Time Series
-
Unifying Long Context LLMs and RAG
toward advances in long-context LLMs and RAG
-
Training with DeepSpeed - Basic Concepts
basic concepts behind DeepSpeed
-
Basic and Advanced Techniques on RAG
detailed techniques on RAG, with LangChain code examples
-
Mixed-precision training in LLM
a note on mixed-precision training
-
Understanding tokenizer from Andrej Karpathy's tutorial
a detailed note on LLM tokenizers