PDF import just landed in Weaviate Agent Skills! Point Claude Code (or any agent) at a PDF, and it handles everything from schema setup, collection configuration, to embedding each page using ColModernVBERT multimodal model. MUVERA then makes multi-vector retrieval efficient by reducing memory usage and computational cost while preserving retrieval quality. In short: your entire document library becomes searchable in Weaviate with a single prompt. CSV, JSON, and JSONL are already supported! Try it: https://t.co/CvQ30X2pcU
Weaviate Adds PDF Support to Agent Skills for Autonomous Document Ingestion
· Updated
Weaviate added PDF import capabilities to its Agent Skills framework, allowing AI agents to autonomously configure schemas and ingest document libraries. By combining multimodal embeddings with the MUVERA algorithm, the system enables high-accuracy multi-vector retrieval without the typical memory and cost overhead.
ColModernVBERT, a multimodal model (AI that understands text and images together) for page-level embedding.Multi-vector retrieval (a search method using multiple data points for higher accuracy) is often too resource-intensive. This update integrates MUVERA, an algorithm that compresses complex embeddings into an efficient format. This allows you to achieve high-quality retrieval while significantly reducing the computational and memory costs usually required for large-scale search.
You can now point any compatible agent at a document library to build a searchable database without writing manual ingestion logic. The skill is available via GitHub and can be installed using npx skills add or as a Claude Code plugin. CSV and JSON formats are also supported for agent-led imports.
Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →



