Visualization;Semantics;Zero-Shot Learning;Videos;Grounding;Vectors;Object Detection;Feature Extraction;Training;Iterative Methods;Food Computing;Zero-Shot Detection;Cross-Modal Fusion;Food Detection