Skip to content

Latest commit

 

History

History
7 lines (5 loc) · 514 Bytes

README.md

File metadata and controls

7 lines (5 loc) · 514 Bytes

NanoLLM

Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector DB, and RAG.

Note

See dusty-nv.github.io/NanoLLM for docs and Jetson AI Lab for tutorials.