Vision–Language Models