This refined version focuses on the advanced configuration details: the Transformer setup with its large embedding size, the Mixture of Experts (MoE) layers that increase model capacity, and the distributed setup used for inference.
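For reference, a minimal sketch of the kind of spec summary this change documents. The values below reflect the publicly stated Grok-1 architecture; the dataclass itself is illustrative only and is not part of the repository code.

```python
from dataclasses import dataclass


@dataclass
class GrokModelSpec:
    """Illustrative summary of the publicly stated Grok-1 architecture,
    intended to help readers size their hardware before downloading
    the checkpoints. Not repository code."""
    total_params: str = "314B"       # total parameters (MoE, sparsely activated)
    emb_size: int = 6144             # embedding / hidden dimension
    num_layers: int = 64             # Transformer blocks
    num_q_heads: int = 48            # query attention heads
    num_kv_heads: int = 8            # key/value heads
    num_experts: int = 8             # MoE experts per layer
    num_selected_experts: int = 2    # experts activated per token
    vocab_size: int = 131_072        # SentencePiece vocabulary size
    sequence_len: int = 8192         # maximum context length


if __name__ == "__main__":
    print(GrokModelSpec())
```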
Added an overview of the model as discussed in response to #14.
Adding more detail on the model specs before folks download the
checkpoints should help them confirm they have the resources
needed to run Grok-1.