Spatio-temporal pattern mining from global positioning systems (GPS) trajectories dataset
Title
Spatio-temporal pattern mining from global positioning systems (GPS) trajectories dataset
Author (s)
Tutor/Supervisor
Belmonte Fernández, Óscar; Pebesma, Edzer; Henriques, Roberto
Tutor/Supervisor; University.Department
Universitat Jaume I. Departament de Llenguatges i Sistemes Informàtics
Date
2015-08
Publisher
Universitat Jaume I
Abstract
The increasing use of location-acquisition technology such as the Global Positioning System is leading to the collection of large spatio-temporal datasets. The prospect of discovering usable knowledge about movement behavior encourages the search for interesting relationships and user characteristics that may exist implicitly in spatial databases. Spatial data mining is therefore emerging as a novel area of research.
In this study, the experiments followed the Knowledge Discovery in Databases (KDD) process model, which starts with the selection of datasets. The GPS trajectory dataset for this research was collected from the Microsoft Research Asia Geolife project. The data was then preprocessed. The major preprocessing activities include:
Filling in missing values and removing outliers;
Resolving inconsistencies and integrating data that contains both labeled and unlabeled datasets;
Dimensionality reduction, size reduction and data transformation such as discretization.
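As a rough illustration of the preprocessing activities listed above, the following Python sketch fills missing values, drops outliers and discretizes a numeric attribute. The abstract does not say how these steps were carried out, so the median fill, the 100 m/s cutoff, the bin edges, the sample speeds and all function names are assumptions made for illustration only.

```python
from statistics import median


def fill_missing(values):
    """Replace None entries with the median of the observed values (assumed strategy)."""
    observed = [v for v in values if v is not None]
    fill = median(observed)
    return [fill if v is None else v for v in values]


def remove_outliers(values, max_value):
    """Drop values above a plausibility threshold (hypothetical cutoff)."""
    return [v for v in values if v <= max_value]


def discretize(values, edges, labels):
    """Map each value to the label of the first bin whose upper edge it does not exceed.

    `labels` must hold one more entry than `edges`; the last label covers
    everything above the final edge.
    """
    result = []
    for v in values:
        for edge, label in zip(edges, labels):
            if v <= edge:
                result.append(label)
                break
        else:
            result.append(labels[-1])
    return result


# Hypothetical speeds in m/s, with one missing reading and one implausible spike.
speeds = [1.2, None, 4.8, 350.0, 15.0, 1.4]
filled = fill_missing(speeds)
cleaned = remove_outliers(filled, max_value=100.0)
bins = discretize(cleaned, edges=[2.0, 8.0], labels=["slow", "medium", "fast"])
print(bins)
```

Filling with the median before removing outliers keeps the fill value from being pulled toward the spike; a mean-based fill would not have that property.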
A total of 4,273 trajectory records were used for training the models. To validate the performance of the selected model, a separate set of 1,018 records was used as a testing set. To build the spatiotemporal model, the K-nearest neighbors (KNN), decision tree and Bayes algorithms were tested as supervised approaches.
The model created using 10-fold cross validation with a K value of 11 and default values for the other parameters showed the best classification accuracy. The model has a prediction accuracy of 98.5% on the training dataset and 93.12% on the test dataset when classifying new instances into the bike, bus, car, subway, train and walk classes. The findings of this study show that spatiotemporal data mining methods help to classify the transportation modes of users. Future research directions are proposed toward an applicable system in the area of the study.
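To make the evaluation setup concrete, the sketch below shows one way a K-nearest neighbors classifier with K = 11 could be scored with 10-fold cross validation. It is a plain-Python stand-in rather than the tooling used in the thesis (the subject keywords point to WEKA); the two-feature records (assumed to be mean and maximum speed), the two classes, the Euclidean distance and every function name are assumptions for illustration.

```python
import math
import random
from collections import Counter


def knn_predict(train, query, k=11):
    """Classify `query` by majority vote among its k nearest training records.

    `train` is a list of (feature_vector, label) pairs; distance is Euclidean.
    """
    neighbors = sorted(train, key=lambda rec: math.dist(rec[0], query))[:k]
    votes = Counter(label for _, label in neighbors)
    return votes.most_common(1)[0][0]


def cross_validate(records, n_folds=10, k=11, seed=0):
    """Return the mean accuracy of k-NN over n_folds cross-validation folds."""
    shuffled = records[:]
    random.Random(seed).shuffle(shuffled)
    # Deal the shuffled records into n_folds roughly equal folds.
    folds = [shuffled[i::n_folds] for i in range(n_folds)]
    accuracies = []
    for i, test_fold in enumerate(folds):
        # Train on every fold except the one held out for testing.
        train = [rec for j, fold in enumerate(folds) if j != i for rec in fold]
        correct = sum(knn_predict(train, x, k) == y for x, y in test_fold)
        accuracies.append(correct / len(test_fold))
    return sum(accuracies) / n_folds


# Synthetic, well-separated records: (mean speed, max speed) in m/s. Not thesis data.
walk = [((1.0 + 0.01 * i, 2.0 + 0.02 * i), "walk") for i in range(50)]
car = [((12.0 + 0.1 * i, 20.0 + 0.1 * i), "car") for i in range(50)]
records = walk + car

accuracy = cross_validate(records, n_folds=10, k=11)
print(f"mean 10-fold accuracy: {accuracy:.2%}")
```

Because the synthetic classes are far apart, this toy run says nothing about the accuracies reported above; it only shows how each fold is held out in turn while the remaining nine folds supply the neighbors.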
Subject
Màster Universitari Erasmus Mundus en Tecnologia Geoespacial | Erasmus Mundus University Master's Degree in Geospatial Technologies | Máster Universitario Erasmus Mundus en Tecnología Geoespacial | accuracy | Cross Validation | data mining | Geo-life | GPS | K-Nearest-Neighbor | trajectory | Transportation modes | WEKA
Description
Final thesis for the Erasmus Mundus University Master's Degree in Geospatial Technologies (Màster Universitari Erasmus Mundus en Tecnologia Geoespacial). Code: SIW013. Academic year 2014-2015
Type
info:eu-repo/semantics/masterThesis
Rights
info:eu-repo/semantics/openAccess