EO4GEO BoK annotation of GI resources
comunitat-uji-handle:10234/158176
comunitat-uji-handle2:10234/71345
comunitat-uji-handle3:10234/141145
comunitat-uji-handle4:
TFG-TFMMetadatos
Título
EO4GEO BoK annotation of GI resourcesTutor/Supervisor; Universidad.Departamento
Casteleyn, Sven; Universitat Jaume I. Departament de Llenguatges i Sistemes InformàticsFecha de publicación
2024-02-26Editor
Universitat Jaume IResumen
The Earth Observation for Geospatial Information (EO4GEO) Body of Knowledge (BoK) serves as a foundational framework encompassing essential geospatial concepts necessary for leveraging Geographic Information and Earth ... [+]
The Earth Observation for Geospatial Information (EO4GEO) Body of Knowledge (BoK) serves as a foundational framework encompassing essential geospatial concepts necessary for leveraging Geographic Information and Earth Observation data effectively. Based on the BoK a set of tools was developed at the University Jaume I, including the BoK Annotation Tool (BAT), which facilitates the annotation of any PDF document with EO4GEO BoK concepts, streamlining the process of knowledge association. These annotations are added manually through an easy-to-use “what you see is what you get” (WYSIWYG), which presents significant challenges, including time consumption and the need for domain expertise. To address this challenge, this master thesis studies the use of Natural Language Processing (NLP) techniques to automate the BoK annotation process. Concretely twelve NLP-based tools were applied, utilizing three key phrase extraction algorithms (YAKE, PatternRank, KeyBert) and four semantic similarity measures (Cosine, Jaro-Wrinkler, Latent semantic similarity, Word2Vec), in order to (semi)-automatically generate BoK annotation. To assess the performance, a comparative evaluation was carried out using various annotation approaches (i.e. using top-level concepts, leaf concepts and all concepts) and evaluation methods (i.e. direct matching, parent-child matching, ranking). Results revealed that YAKE_JaroW emerges as a standout performer (F1-score 28.28%), particularly in the parent-child evaluation method. This research helps to annotate existing resources with EO4GEO BoK concepts easier and more efficiently, helping to share knowledge more effectively in geospatial fields. It also emphasizes the importance of having annotation tools designed specifically for the EO4GEO BoK, which fills a crucial gap in geospatial knowledge management. [-]
Palabras clave / Materias
Descripción
Treball de Final de Màster Universitari Erasmus Mundus en Tecnologia Geoespacial (Pla de 2022). Codi: SJL042. Curs acadèmic 2023-2024
Tipo de documento
info:eu-repo/semantics/masterThesisDerechos de acceso
info:eu-repo/semantics/openAccess