Upload an image to generate a detailed caption using Apple's FastVLM-0.5B model. You can use the default prompt or provide your own custom prompt.
Model: apple/FastVLM-0.5B
Note: This Space uses ZeroGPU for dynamic GPU allocation.