Details

Type: Publication
Code: TR-2016-2
Title: Cohesive Keyword Search on Tree Data
Authors: Αγγελική Δημητρίου, Ananya Dass, Δημήτρης Θεοδωράτος, Γιάννης Βασιλείου
Year: 2016
Keywords: Keyword search, XML, tree data, ranking
Abstract: Keyword search is the most popular querying technique on semistructured data. Keyword queries are simple and convenient. However, as a consequence of their imprecision, there is usually a huge number of candidate results, of which only very few match the user's intent. Unfortunately, the existing semantics for keyword queries are ad hoc and generally fail to "guess" the user's intent. Therefore, the quality of their answers is poor and the existing algorithms do not scale satisfactorily. In this paper, we introduce the novel concept of cohesive keyword queries for tree data. Intuitively, a cohesiveness relationship on keywords indicates that they should form a cohesive whole in a query result. Cohesive keyword queries allow term nesting and keyword repetition, and they bridge the gap between flat keyword queries and structured queries. Although more expressive, they are as simple as flat keyword queries and do not require any schema knowledge. We provide formal semantics for cohesive keyword queries and rank query results based on the proximity of the keyword instances. We design a stack-based algorithm that efficiently evaluates cohesive keyword queries. Our experiments demonstrate that our approach outperforms previous filtering semantics in quality and that our algorithm scales smoothly on queries of even 20 keywords on large datasets.
Category: Semantic Web, XML
Published in: EDBT 2016

