Multimodal Learning with Vision and Language

9th International Conference on Image Processing Theory, Tools and Applications (IPTA), İstanbul, Türkiye, 6-9 November 2019 (Full-Text Paper)

This tutorial presents recent advances in multimodal learning for integrated vision and language problems and provides the necessary background.