Claude Skills as an Ingestion Tool: When the Best Vision Model Is the One Driving Your Editor
Post 4 of the Pepper & Carrot AI flipbook series. The comic is images, not text — so before any RAG can happen, every page needs a description. This post walks through using a Claude Code skill as the vision provider for the ingestion pipeline of this portfolio-project specifically: no per-call API cost beyond the Claude Code subscription, auditable JSON artifacts on disk, same Claude model as Anthropic's hosted vision API. By the end, one full episode is ingested into Postgres + ChromaDB + local storage. The right vision provider is context-specific — local VLM, hosted API, and Claude Code each win under different constraints (budget, whether the pipeline runs unattended, throughput) — and the post includes a decision matrix mapping each constraint to the right choice.