The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.
-
Updated
Feb 21, 2025 - Python
The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.
Inference, Ingestion and Indexing built in Rust 🦀
Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine
Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.
LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment.
A new novel multi-modality (Vision) RAG architecture
REST API for computing cross-modal similarity between images and text using the ColPaLI vision-language model
The repo provides the code for Qdrant for efficient image indexing and retrieval using models such as ColPali, ColQwen, and VDR-2B-Multi-V1, enhancing multimodal search capabilities across various applications.
ColPali is vision based RAG (Retrieval Augmented Generation) which can capture visual data
OCR and Document Search Web Application
Add a description, image, and links to the colpali topic page so that developers can more easily learn about it.
To associate your repository with the colpali topic, visit your repo's landing page and select "manage topics."