Multimodal Learning with Vision and Language

Erdem A., Erdem E.

9th International Conference on Image Processing Theory, Tools and Applications (IPTA), İstanbul, Turkey, 6 - 09 November 2019 identifier identifier

  • Publication Type: Conference Paper / Full Text
  • Volume:
  • Doi Number: 10.1145/3343031.3350935
  • City: İstanbul
  • Country: Turkey
  • Hacettepe University Affiliated: Yes


This tutorial presents the recent advances in multi-modal learning for integrated vision and language problems and gives the necessary background.