A linguistic graph-based approach for web news sentence searching

Schouten, Kim; Frasincar, Flavius

doi:10.1007/978-3-642-40173-2_7

K.I.M. Schouten (Kim) and F. Frasincar (Flavius)

2013-09-25

A linguistic graph-based approach for web news sentence searching

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , Volume 8056 LNCS - Issue PART 2 p. 57- 64

With an ever increasing amount of news being published every day, being able to effectively search these vast amounts of information is of primary interest to many Web ventures. As word-based approaches have their limits in that they ignore a lot of the information in texts, we present Destiny, a linguistic approach where news item sentences are represented as a graph featuring disambiguated words as nodes and grammatical relations between words as edges. Searching is then reminiscent of finding an approximate sub-graph isomorphism between the query sentence graph and the graphs representing the news item sentences, exploiting word synonymy, word hypernymy, and sentence grammar. Using a custom corpus of user-rated queries and sentences, the search algorithm is evaluated based on the Mean Average Precision, Spearman's Rho, and the normalized Discounted Cumulative Gain. Compared to the TF-IDF baseline, the Destiny algorithm performs significantly better on these metrics.

Additional Metadata
Keywords	Grammatical relations, Graph-based, Linguistic approach, Search Algorithms, Spearman's rho
Persistent URL	doi.org/10.1007/978-3-642-40173-2_7, hdl.handle.net/1765/41598
Journal	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Organisation	Erasmus MC: University Medical Center Rotterdam
Citation APA Style AAA Style APA Style Cell Style Chicago Style Harvard Style IEEE Style MLA Style Nature Style Vancouver Style American-Institute-of-Physics Style Council-of-Science-Editors Style BibTex Format Endnote Format RIS Format CSL Format DOIs only Format	Schouten, K., & Frasincar, F. (2013). A linguistic graph-based approach for web news sentence searching. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8056 LNCS, pp. 57–64). doi:10.1007/978-3-642-40173-2_7

A linguistic graph-based approach for web news sentence searching

Publication

Publication

About

A linguistic graph-based approach for web news sentence searching

Publication

Publication

Workflow

Workflow

Add Content