LONG-RANGE CORRELATIONS AS A CRITERION FOR RELEVANCE OF WORDS IN TEXTUAL DOCUMENTS
Keywords:
information retrieval, keyword detection, long-range correlations, word-token clusteringAbstract
Highly efficient detection of keywords is a basis for successful information retrieval. Here we present a new criterion of relevance of words in textual documents, which is associated with the long-range autocorrelations of word-token time series. The above approach is compared with a canonical keyword detection method based on word-token clustering in a text.
References
J. P. Herrera, P. A. Pury, Eur. Phys. J. B, 63, 135 (2008).
О. Кушнір, А. Волоско, Л. Іваніцький, С. Рихлюк, Електрон. та інф. технол., 6, 155 (2016).
K.-I. Goh, A.-L. Barabási, Europhys. Lett., 81, 48002 (2008).
E. G. Altmann, G. Cristadoro, M. D. Esposti, Proc. Natl. Acad. Sci., 109, 11582 (2012).