Video annotation is an important issue in video content management systems. Rapid growth of the digital video data has created a need for efficient and reasonable mechanisms that can ease the annotation process. In this paper, we propose a novel hierarchical clustering based system for video annotation. The proposed system generates a top-down hierarchy of the video streams using hierarchical k-means clustering. A tree-based structure is produced by dividing the video recursively into sub-groups, each of which consists of similar content. Based on the visual features, each node of the tree is partitioned into its children using k-means clustering. Each sub-group is then represented by its key frame, which is selected as the closest frame to the centroids of the corresponding cluster, and then can be displayed at the higher level of the hierarchy. The experiments show that very good hierarchical view of the video sequences can be created for annotation in terms of efficiency. © 2014 Elsevier Ltd. All rights reserved.
ISSN (dc.identifier.issn) | 00457906 (ISSN) |
Yayıncı (dc.publisher) | Elsevier Ltd |
Eser Adı (dc.title) | Hierarchical representation of video sequences for annotation |
Özet (dc.description.abstract) | Video annotation is an important issue in video content management systems. Rapid growth of the digital video data has created a need for efficient and reasonable mechanisms that can ease the annotation process. In this paper, we propose a novel hierarchical clustering based system for video annotation. The proposed system generates a top-down hierarchy of the video streams using hierarchical k-means clustering. A tree-based structure is produced by dividing the video recursively into sub-groups, each of which consists of similar content. Based on the visual features, each node of the tree is partitioned into its children using k-means clustering. Each sub-group is then represented by its key frame, which is selected as the closest frame to the centroids of the corresponding cluster, and then can be displayed at the higher level of the hierarchy. The experiments show that very good hierarchical view of the video sequences can be created for annotation in terms of efficiency. © 2014 Elsevier Ltd. All rights reserved. |
Yayın Tarihi (dc.date.issued) | 2014 |
Kayıt Giriş Tarihi (dc.date.accessioned) | 2020-08-07T13:02:16Z |
Açık Erişim tarihi (dc.date.available) | 2020-08-07T13:02:16Z |
Yayın Dili (dc.language.iso) | eng |
Yayın Türü (dc.type) | Makale |
Yazar/lar (dc.contributor.author) | MENDİ, Engin |
Tek Biçim Adres (dc.identifier.uri) | http://hdl.handle.net/20.500.12498/3134 |
DOI Numarası (dc.identifier.doi) | 10.1016/j.compeleceng.2014.03.001 |
Atıf Dizini (dc.source.database) | Scopus |