Automatic categorization and summarization of documentaries


Creative Commons License

Demirtas K., Cicekli N. K., Cicekli I.

JOURNAL OF INFORMATION SCIENCE, vol.36, no.6, pp.671-689, 2010 (SCI-Expanded) identifier identifier

  • Publication Type: Article / Article
  • Volume: 36 Issue: 6
  • Publication Date: 2010
  • Doi Number: 10.1177/0165551510382070
  • Journal Name: JOURNAL OF INFORMATION SCIENCE
  • Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Social Sciences Citation Index (SSCI), Scopus
  • Page Numbers: pp.671-689
  • Hacettepe University Affiliated: No

Abstract

In this paper, we propose automatic categorization and summarization of documentaries using subtitles of videos. We propose two methods for video categorization. The first makes unsupervised categorization by applying natural language processing techniques on video subtitles and uses the WordNet lexical database and WordNet domains. The second has the same extraction steps but uses a learning module to categorize. Experiments with documentary videos give promising results in discovering the correct categories of videos. We also propose a video summarization method using the subtitles of videos and text summarization techniques. Significant sentences in the subtitles of a video are identified using these techniques and a video summary is then composed by finding the video parts corresponding to these summary sentences.