Annotate and describe images with text prompts
Extract text from images using various OCR modes
Generate text responses using different models