Skip to content

Latest commit

 

History

History
15 lines (11 loc) · 559 Bytes

README.md

File metadata and controls

15 lines (11 loc) · 559 Bytes

This demo showcases how to use CLIP to compute similarity between texts and images, which can be employed for image-to-text or text-to-image retrieval tasks.

Quick Start

cargo run -r -F cuda --example clip -- --device cuda:0

Results

(99.9675%) ./examples/clip/images/carrot.jpg => Some carrots
(99.93718%) ./examples/clip/images/doll.jpg => There is a doll with red hair and a clock on a table
(100.0%) ./examples/clip/images/drink.jpg => Some people holding wine glasses in a restaurant