user: gpjt
1608 karma | created 17 years ago
recent submissions
6 pts|23 days ago|discuss
Writing an LLM from scratch, part 32c – Interventions: removing dropout
(gilesthomas.com)
1 pts|24 days ago|discuss
Writing an LLM from scratch, part 32B – Interventions: gradient clipping
(gilesthomas.com)
2 pts|25 days ago|discuss
1 pts|26 days ago|discuss
Getting a Custom PyTorch LLM onto the Hugging Face Hub
(gilesthomas.com)
1 pts|1 month ago|discuss
2 pts|1 month ago|discuss
1 pts|1 month ago|discuss
LLM from scratch, part 29 – using DDP to train a base model in the cloud
(gilesthomas.com)
2 pts|1 month ago|discuss
2 months ago|discuss
2 months ago|discuss
2 months ago|discuss
2 months ago|discuss
2 months ago|discuss
2 months ago|discuss
2 months ago|discuss
2 months ago|discuss
2 months ago|discuss
2 months ago|discuss
2 months ago|discuss
2 months ago|discuss