Report Job - Research Intern – Multimodal Foundation Model for Vision at SONY global