The latents code and the prompt creation explainer code could be significantly simplified.
Simplifying the code would facilitate prototyping and adding new features.
For example, reshaping the input tokens to sequence length window_size before caching the latents would let us remove most or all of the window-handling logic in the constructors.
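A minimal sketch of the reshaping idea, not the project's actual code. The function name `reshape_to_windows` is hypothetical, plain Python lists stand in for token tensors, and dropping a trailing remainder shorter than window_size is an assumption; the real pipeline may prefer to pad instead.

```python
from typing import List, Sequence


def reshape_to_windows(
    tokens: Sequence[Sequence[int]], window_size: int
) -> List[List[int]]:
    """Split each token sequence into rows of exactly `window_size` tokens.

    Hypothetical helper: every returned row is one window, so code that
    runs after caching would not need to compute window boundaries itself.
    """
    if window_size <= 0:
        raise ValueError("window_size must be positive")

    rows: List[List[int]] = []
    for sequence in tokens:
        # Assumption: tokens that do not fill a whole window are dropped.
        usable = len(sequence) - len(sequence) % window_size
        for start in range(0, usable, window_size):
            rows.append(list(sequence[start:start + window_size]))
    return rows
```

With window_size = 4, a single sequence of 8 tokens becomes 2 rows of 4 tokens each, and a sequence of 10 tokens also yields 2 rows because the last 2 tokens are dropped. If the tokens are held in a tensor whose sequence length is already a multiple of window_size, the same effect would likely be a single reshape call.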