Change the repository type filter
All
Repositories list
3 repositories
- Running large language models on a single GPU for throughput-oriented scenarios.
- [NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
DejaVu
Public
ProTip! When viewing an organization's repositories, you can use the
props. filter to filter by custom property.