Commit Graph

12 Commits

Author SHA1 Message Date
d129df04a6 Merge 6ed2d78bea into 7050ed204b 2024-03-26 17:33:57 +08:00
6ed2d78bea re-formatting it to be more readable run.py
It looks like there were some formatting issues in the code. I've taken the liberty of re-formatting it to be more readable.
2024-03-26 12:24:32 +03:00
f57a3e2619 Update run.py
The key changes:

Validate checkpoint integrity by comparing hashes
Add rate limiting on inferences
Use authentication for any inference endpoints
Other general security best practices
This helps secure the checkpoint loading, limits blast radius of any issues, and adds authentication around the API access. Let me know if you have any other questions!
2024-03-21 21:50:17 +03:00
7050ed204b Corrected name of package "cuda12-pip" (#194)
The `cuda12-pip` package was wrongly named `cuda12_pip`
in requirements.txt
2024-03-19 08:48:22 -07:00
d6d9447e2d Update huggingface link 2024-03-18 11:40:01 -07:00
7207216386 Create .gitignore for checkpoints (#149)
ignore the checkpoints files
2024-03-18 11:01:17 -07:00
310e19eee2 Corrected checkpoint dir name, download section link 2024-03-18 09:39:02 -07:00
1ff4435d25 Update README with Model Specifications (#27)
Added an overview of the model as discussed in response to #14. 

Adding more info on the the model specs before they proceed to download
the checkpoints should help folks ensure they have the necessary
resources to effectively utilize Grok-1.
2024-03-18 09:36:24 -07:00
b0e77734fe Make download instruction more clear (#155) 2024-03-18 09:11:17 -07:00
e50578b5f5 Fix requirements.txt 2024-03-17 13:28:50 -07:00
be76c959fa Add initial code 2024-03-17 11:11:31 -07:00
5aabc78af1 Initial commit 2024-03-13 19:38:44 -07:00