<image>
where appropriate and supply the list of images as an ordered list (this is true for the Phi-3 model, but may be subject to change for future vision-language models). For example:
.png
, .jpg
, .jpeg
, .gif
, .bmp
, .tiff
and .ppm
format images.