http://www.cnr.it/ontology/cnr/individuo/prodotto/ID287445
Automatic Detection of Words Associations in Texts based on Joint Distribution of Words Occurrences (Journal article)
- Type
- Label
- Automatic Detection of Words Associations in Texts based on Joint Distribution of Words Occurrences (Journal article) (literal)
- Year
- 2015-01-01T00:00:00+01:00 (literal)
- http://www.cnr.it/ontology/cnr/pubblicazioni.owl#doi
- 10.1111/coin.12065 (literal)
- Alternative label
- http://www.cnr.it/ontology/cnr/pubblicazioni.owl#autori
- Daniele Santoni, Elaheh Pourabbas (literal)
- Journal
- http://www.cnr.it/ontology/cnr/pubblicazioni.owl#affiliazioni
- Istituto di Analisi dei Sistemi ed Informatica "Antonio Ruberti", Consiglio Nazionale delle Ricerche (literal)
- Title
- Automatic Detection of Words Associations in Texts based on Joint Distribution of Words Occurrences (literal)
- Abstract
- In this paper, we propose a novel approach for measuring word
association based on the joint distribution of occurrences in a text. Our
approach relies on computing a sum of distances between neighboring
occurrences of a given word pair and comparing it to a vector of
randomly generated occurrences. The underlying assumption is
that if the distribution of co-occurrences is close to random, or if the words
tend to appear together less frequently than by chance, such words
are not semantically related. We devise a distance function S that
evaluates the word association rate. Using S, we build a concept-tree,
which provides a visual and comprehensive representation of keyword
associations in a text. To illustrate the effectiveness of our algorithm,
we apply it to three different texts, showing the consistency
and significance of the obtained results with respect to the semantics
of the documents. Finally, we compare the results obtained by applying
our proposed algorithm with those achieved by both human experts
and the co-occurrence correlation method. We show that our method
is consistent with the experts' evaluation and outperforms
the co-occurrence correlation method. (literal)
- Product of
- CNR author
- Keyword set
Incoming links:
- CNR author of
- Product
- http://www.cnr.it/ontology/cnr/pubblicazioni.owl#rivistaDi
- Keyword set of
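
Illustrative sketch (not part of the record): the abstract describes summing distances between neighboring occurrences of a word pair and comparing that sum with randomly generated occurrences. The Python below is a minimal sketch of that general idea only. The record does not give the definition of the distance function S, so the names `sum_nearest_distances` and `association_score`, the nearest-neighbor distance, the random baseline, and the normalization are all assumptions made for illustration, not the authors' method.

```python
import random
from bisect import bisect_left


def positions(tokens, word):
    """Return the token indices at which `word` occurs."""
    return [i for i, t in enumerate(tokens) if t == word]


def sum_nearest_distances(pos_a, pos_b):
    """Sum, over each position in pos_a, of the distance to the nearest
    position in pos_b. pos_b must be sorted and non-empty."""
    total = 0
    for p in pos_a:
        k = bisect_left(pos_b, p)
        candidates = []
        if k < len(pos_b):
            candidates.append(pos_b[k] - p)      # nearest occurrence at or after p
        if k > 0:
            candidates.append(p - pos_b[k - 1])  # nearest occurrence before p
        total += min(candidates)
    return total


def association_score(tokens, word_a, word_b, trials=200, seed=0):
    """Hypothetical score (an assumption, NOT the paper's S): compares the
    observed distance sum with the mean sum for randomly placed occurrences.
    Positive values mean the words sit closer together than random placement
    would suggest; values near zero or below suggest no association."""
    if word_a == word_b:
        raise ValueError("the two words must differ")
    pos_a = positions(tokens, word_a)
    pos_b = positions(tokens, word_b)
    if not pos_a or not pos_b:
        raise ValueError("both words must occur in the text")

    observed = sum_nearest_distances(pos_a, pos_b)

    rng = random.Random(seed)
    random_sums = []
    for _ in range(trials):
        # Draw distinct random positions, keeping the original word counts.
        sample = rng.sample(range(len(tokens)), len(pos_a) + len(pos_b))
        rand_a = sorted(sample[:len(pos_a)])
        rand_b = sorted(sample[len(pos_a):])
        random_sums.append(sum_nearest_distances(rand_a, rand_b))

    expected = sum(random_sums) / trials
    return (expected - observed) / expected


if __name__ == "__main__":
    # Toy text: "cat" and "dog" always appear side by side.
    tokens = ["x"] * 200
    for i in (10, 50, 120):
        tokens[i], tokens[i + 1] = "cat", "dog"
    print(association_score(tokens, "cat", "dog"))
```

The fixed seed only makes the random baseline reproducible; a real evaluation would need the paper's actual definition of S and its concept-tree construction, neither of which is specified in this record.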