Get ready for a dazzling summer with our new arrivals
heroicons/outline/phone Servizio Clienti 06.92959541 heroicons/outline/truck Spedizione gratuita sopra i 29€

Web Page Classification Using Semantic Image Blocks

ISBN/EAN
9788854816039
Editore
Aracne
Formato
Brossura
Anno
2008
Pagine
24

Disponibile

11,00 €
We present a web document classification system based on the assumption that the images of a web page are those elements which mainly attract the attention of the user. This assumption implies that the text contained in the visual block in which an image is located, called semantic image-block, should contain relevant information about the page contents. In this paper we propose a new metric, called the Inverse Term Relevance Metric, aimed at assigning higher weighs to relevant terms contained into relevant image-blocks identified by performing a visual layout analysis. The traditional TFxIDF model is modified accordingly and used in the classification task. The effectiveness of this new metric has been validated using different classification algorithms, both supervised and unsupervised.

Maggiori Informazioni

Autore Archetti Francesco; Giordani Ilaria; Messina Enza
Editore Aracne
Anno 2008
Tipologia Libro
Lingua Italiano
Disponibilità Disponibilità: 3-5 gg
Questo libro è anche in: