Unified Multimodal Editor (UME)

Upload an image, then ask for a description of it or give an instruction to edit it (e.g., 'change the red mug to blue').