mirror of
https://github.com/xai-org/grok-1.git
synced 2025-04-03 18:00:10 +03:00
Update readme and add break downs for each class
This commit is contained in:
parent
0ecf6689d0
commit
b5b1e779d2
@ -162,7 +162,7 @@ The NamedTuple is a lightweight data structure provides a way to define immutabl
|
|||||||
|
|
||||||
the TrainingState serves as a lightweight container to hold and manage the model parameters during the training process, allowing for efficient manipulation and updating of the model's weights.
|
the TrainingState serves as a lightweight container to hold and manage the model parameters during the training process, allowing for efficient manipulation and updating of the model's weights.
|
||||||
|
|
||||||
### :page_with_curl: KVMemory
|
## :page_with_curl: KVMemory
|
||||||
|
|
||||||
`KVMemory` is a `NamedTuple` data structure used to store and manage the key-value memory state in the transformer architecture. It is defined as follows:
|
`KVMemory` is a `NamedTuple` data structure used to store and manage the key-value memory state in the transformer architecture. It is defined as follows:
|
||||||
|
|
||||||
|
Loading…
Reference in New Issue
Block a user