A reverse engineering approach for automatic annotation of Web pages

de Virgilio, Roberto; Frasincar, Flavius; Hop, Walter; Lachner, Stephan

doi:10.1007/s11042-011-0852-8

R. de Virgilio (Roberto), F. Frasincar (Flavius), W. Hop (Walter) and S. Lachner (Stephan)

2013-05-01

A reverse engineering approach for automatic annotation of Web pages

Multimedia Tools and Applications p. 1- 22

The Semantic Web is gaining increasing interest to fulfill the need of sharing, retrieving, and reusing information. Since Web pages are designed to be read by people, not machines, searching and reusing information on the Web is a difficult task without human participation. To this aim adding semantics (i.e meaning) to a Web page would help the machines to understand Web contents and better support the Web search process. One of the latest developments in this field is Google's Rich Snippets, a service for Web site owners to add semantics to their Web pages. In this paper we provide a structured approach to automatically annotate a Web page with Rich Snippets RDFa tags. Exploiting a data reverse engineering method, combined with several heuristics, and a named entity recognition technique, our method is capable of recognizing and annotating a subset of Rich Snippets' vocabulary, i.e., all the attributes of its Review concept, and the names of the Person and Organization concepts. We implemented tools and services and evaluated the accuracy of the approach on real E-commerce Web sites.

Additional Metadata
Keywords	DRE, RDFa, Rich Snippets, Web site segmentation
Persistent URL	doi.org/10.1007/s11042-011-0852-8, hdl.handle.net/1765/31142
Series	ERIM Top-Core Articles
Journal	Multimedia Tools and Applications
Organisation	Erasmus Research Institute of Management
Citation APA APA Style APA-ALL Style AAA Style Cell Style Chicago Style Harvard Style IEEE Style MLA Style Nature Style Vancouver Style American-Institute-of-Physics Style Council-of-Science-Editors Style BibTex Format Endnote Format RIS Format CSL Format DOIs only Format	de Virgilio, R., Frasincar, F., Hop, W.& Lachner, S. (2013). A reverse engineering approach for automatic annotation of Web pages. Multimedia Tools and Applications, 1–22.https://doi.org/10.1007/s11042-011-0852-8

Free Full Text ( Final Version , 253kb )

A reverse engineering approach for automatic annotation of Web pages

Publication

Publication

About

A reverse engineering approach for automatic annotation of Web pages

Publication

Publication

Workflow

Workflow

Add Content