Mostrar el registro sencillo del ítem

dc.contributor.authorBELLOCH, JOSE A.
dc.contributor.authorLeón, Germán
dc.contributor.authorBadía, José
dc.contributor.authorLINDOSO, ALMUDENA
dc.contributor.authorSAN MILLAN, ENRIQUE
dc.date.accessioned2020-09-11T10:38:00Z
dc.date.available2020-09-11T10:38:00Z
dc.date.issued2020-06-06
dc.identifier.citationBelloch, J.A., León, G., Badía, J.M. et al. Evaluating the computational performance of the Xilinx Ultrascale+ EG Heterogeneous MPSoC. J Supercomput (2020). https://doi.org/10.1007/s11227-020-03342-7ca_CA
dc.identifier.urihttp://hdl.handle.net/10234/189660
dc.description.abstractThe emergent technology of Multi-Processor System-on-Chip (MPSoC), which combines heterogeneous computing with the high performance of Field Programmable Gate Arrays (FPGAs) is a very interesting platform for a huge number of applications ranging from medical imaging and augmented reality to high-performance computing in space. In this paper, we focus on the Xilinx Zynq UltraScale+ EG Heterogeneous MPSoC, which is composed of four different processing elements (PE): a dual-core Cortex-R5, a quad-core ARM Cortex-A53, a graphics processing unit (GPU) and a high end FPGA. Proper use of the heterogeneity and the different levels of parallelism of this platform becomes a challenging task. This paper evaluates this platform and each of its PEs to carry out fundamental operations in terms of computational performance. To this end, we evaluate image-based applications and a matrix multiplication kernel. On former, the image-based applications leverage the heterogeneity of the MPSoc and strategically distributes its tasks among both kinds of CPU cores and the FPGA. On the latter, we analyze separately each PE using different matrix multiplication benchmarks in order to assess and compare their performance in terms of MFlops. This kind of operations are being carried out for example in a large number of space-related applications where the MPSoCs are currently gaining momentum. Results stand out the fact that different PEs can collaborate efficiently with the aim of accelerating the computational-demanding tasks of an application. Another important aspect to highlight is that leveraging the parallel OpenBLAS library we achieve up to 12 GFlops with the four Cortex-A53 cores of the platform, which is a considerable performance for this kind of devices.ca_CA
dc.format.extent15 p.ca_CA
dc.format.mimetypeapplication/pdfca_CA
dc.language.isoengca_CA
dc.publisherSpringerca_CA
dc.rights© 2020 Springer Nature Switzerland AG. Part of Springer Nature.ca_CA
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/*
dc.subjectheterogeneous computingca_CA
dc.subjectparallel computingca_CA
dc.subjectmulti-processor system-on-chip (MPSoC)ca_CA
dc.titleEvaluating the computational performance of the Xilinx Ultrascale plus EG Heterogeneous MPSoCca_CA
dc.typeinfo:eu-repo/semantics/articleca_CA
dc.identifier.doihttps://doi.org/10.1007/s11227-020-03342-7
dc.relation.projectIDSpanish Government through TIN2017-82972-R, ESP2015-68245-C4-1-P, the Valencian Regional Government through PROMETEO/2029/109 and the Universitat Jaume I through UJI-B2019-36ca_CA
dc.rights.accessRightsinfo:eu-repo/semantics/openAccessca_CA
dc.relation.publisherVersionhttps://link.springer.com/article/10.1007/s11227-020-03342-7ca_CA
dc.type.versioninfo:eu-repo/semantics/submittedVersionca_CA


Ficheros en el ítem

Thumbnail

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem