9th International Conference on Image Processing Theory, Tools and Applications (IPTA), İstanbul, Turkey, 6 - 09 November 2019
This tutorial presents the recent advances in multi-modal learning for integrated vision and language problems and gives the necessary background.