Using linguistic graph similarity to search for sentences in news articles

Schouten, Kim; Frasincar, Flavius

doi:10.3233/978-1-61499-714-6-255

With the volume of daily news growing to sizes too big to handle for any individual human, there is a clear need for effective search algorithms. Since traditional bag-of-words approaches are inherently limited since they ignore much of the information that is embedded in the structure of the text, we propose a linguistic approach to search called Destiny in this paper. With Destiny, sentences, both from news items and the user queries, are represented as graphs where the nodes represent the words in the sentence and the edges represent the grammatical relations between the words. The proposed algorithm is evaluated against a TF-IDF baseline using a custom corpus of user-rated sentences. Destiny significantly outperforms TF-IDF in terms of Mean Average Precision, normalized Discounted Cumulative Gain, and Spearman's Rho.

Additional Metadata
Keywords	Sub-graph isomorphism, Syntax dependencies, Text searching
Persistent URL	doi.org/10.3233/978-1-61499-714-6-255, hdl.handle.net/1765/113077
Series	Frontiers in Artificial Intelligence and Applications
Organisation	Erasmus University Rotterdam
Citation APA Style AAA Style APA Style Cell Style Chicago Style Harvard Style IEEE Style MLA Style Nature Style Vancouver Style American-Institute-of-Physics Style Council-of-Science-Editors Style BibTex Format Endnote Format RIS Format CSL Format DOIs only Format	Schouten, K., & Frasincar, F. (2016). Using linguistic graph similarity to search for sentences in news articles. Frontiers in Artificial Intelligence and Applications. doi:10.3233/978-1-61499-714-6-255

Free Full Text ( Final Version , 342kb )

Using linguistic graph similarity to search for sentences in news articles

Publication

Publication

About

Using linguistic graph similarity to search for sentences in news articles

Publication

Publication

Workflow

Workflow

Add Content