Cultural Heritage: Knowledge Extraction from Web Documents (Conference proceedings contribution)

Type
Label
  • Cultural Heritage: Knowledge Extraction from Web Documents (Conference proceedings contribution) (literal)
Year
  • 2010-01-01T00:00:00+01:00 (literal)
Alternative label
  • Sassolini E.; Cinini A. (2010)
    Cultural Heritage: Knowledge Extraction from Web Documents
    in Seventh International Conference on Language Resources and Evaluation, Valletta, Malta
    (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#autori
  • Sassolini E.; Cinini A. (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#note
  • In: LREC'10 - Seventh International Conference on Language Resources and Evaluation (Valletta, Malta, 17-23 May 2010). Proceedings, pp. 3363-3368. Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Mike Rosner, Daniel Tapias (eds.). European Language Resources Association (ELRA), 2010. (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#descrizioneSinteticaDelProdotto
  • ABSTRACT: This article presents the use of NLP techniques (text mining, text analysis) to develop specific tools for creating linguistic resources related to the cultural heritage domain. The aim of our approach is to create tools for building an online "knowledge network", automatically extracted from text materials concerning this domain. We experimented with a methodology that divides the automatic acquisition of texts, and consequently the creation of the reference corpus, into two phases. In the first phase, online documents were extracted from lists of links provided by human experts. All documents extracted from the web by means of an automatic spider were stored in a repository of text materials. On the basis of these documents, automatic parsers create the reference corpus for the cultural heritage domain. Relevant information and semantic concepts are then extracted from this corpus. In a second phase, all these semantically relevant elements (such as p (literal)
Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#affiliazioni
  • ILC-CNR, Pisa (literal)
Title
  • Cultural Heritage: Knowledge Extraction from Web Documents (literal)
Product of
CNR author
Keyword set
