Commit Graph

10 Commits

Author SHA1 Message Date
Michael G. Inso
f57a3e2619
Update run.py
The key changes:

Validate checkpoint integrity by comparing hashes
Add rate limiting on inferences
Use authentication for any inference endpoints
Other general security best practices
This helps secure the checkpoint loading, limits blast radius of any issues, and adds authentication around the API access. Let me know if you have any other questions!
2024-03-21 21:50:17 +03:00
Eddy
7050ed204b
Corrected name of package "cuda12-pip" (#194)
The `cuda12-pip` package was wrongly named `cuda12_pip`
in requirements.txt
2024-03-19 08:48:22 -07:00
Szymon Tworkowski
d6d9447e2d
Update huggingface link 2024-03-18 11:40:01 -07:00
Lve Lvee
7207216386
Create .gitignore for checkpoints (#149)
ignore the checkpoints files
2024-03-18 11:01:17 -07:00
Seth Junot
310e19eee2
Corrected checkpoint dir name, download section link 2024-03-18 09:39:02 -07:00
Gareth Paul Jones (GPJ)
1ff4435d25
Update README with Model Specifications (#27)
Added an overview of the model as discussed in response to #14. 

Adding more info on the the model specs before they proceed to download
the checkpoints should help folks ensure they have the necessary
resources to effectively utilize Grok-1.
2024-03-18 09:36:24 -07:00
Szymon Tworkowski
b0e77734fe
Make download instruction more clear (#155) 2024-03-18 09:11:17 -07:00
Igor Babuschkin
e50578b5f5 Fix requirements.txt 2024-03-17 13:28:50 -07:00
Igor Babuschkin
be76c959fa Add initial code 2024-03-17 11:11:31 -07:00
Igor Babuschkin
5aabc78af1
Initial commit 2024-03-13 19:38:44 -07:00