assistant-skills/vision-analysis/SKILL.md

800 B

name description
vision-analysis Analyze images using a vision model via Python. Use when the built-in image tool fails or when you need to extract text, describe content, or analyze one or more images. Triggers on any request to analyze, read, describe, or extract information from images.

Vision Analysis

Analyze images by calling the OpenRouter API directly via Python.

Script

uv run --with requests ~/notes/skills/vision-analysis/scripts/analyze.py "<prompt>" <image1> [image2] ...
  • Prompt (required): What to extract or analyze from the images.
  • Images (required): One or more local file paths.

Notes

  • The built-in image tool does not work with the configured vision model. Always use this script instead.
  • Max ~20 images per call (API limit).