Commit Graph

10 Commits

Author SHA1 Message Date
ClumsyLulz
2dd6511150
Update run.py
This refined version focuses on the advanced configurations such as the Transformer model setup with its large embedding size, the use of a Mixture of Experts (MoE) for increased model capacity, and the distributed computing setup for inference, indicating a highly optimized and sophisticated machine learning model deployment.
2024-03-24 20:28:45 -04:00
Eddy
7050ed204b
Corrected name of package "cuda12-pip" (#194)
The `cuda12-pip` package was wrongly named `cuda12_pip`
in requirements.txt
2024-03-19 08:48:22 -07:00
Szymon Tworkowski
d6d9447e2d
Update huggingface link 2024-03-18 11:40:01 -07:00
Lve Lvee
7207216386
Create .gitignore for checkpoints (#149)
ignore the checkpoints files
2024-03-18 11:01:17 -07:00
Seth Junot
310e19eee2
Corrected checkpoint dir name, download section link 2024-03-18 09:39:02 -07:00
Gareth Paul Jones (GPJ)
1ff4435d25
Update README with Model Specifications (#27)
Added an overview of the model as discussed in response to #14. 

Adding more info on the the model specs before they proceed to download
the checkpoints should help folks ensure they have the necessary
resources to effectively utilize Grok-1.
2024-03-18 09:36:24 -07:00
Szymon Tworkowski
b0e77734fe
Make download instruction more clear (#155) 2024-03-18 09:11:17 -07:00
Igor Babuschkin
e50578b5f5 Fix requirements.txt 2024-03-17 13:28:50 -07:00
Igor Babuschkin
be76c959fa Add initial code 2024-03-17 11:11:31 -07:00
Igor Babuschkin
5aabc78af1
Initial commit 2024-03-13 19:38:44 -07:00