• closedAccess   H.264/AVC inter prediction for heterogeneous computing systems 

      Rodríguez Sánchez, Rafael; Martínez, José Luis; Fernández Escribano, Gerardo; Claver, José M.; Sánchez, José L. Springer (2013)
      H.264/AVC is the latest standard for video compression and is a significant advance, but at the expense of increasing computing needs. Recently, the progress of GPUs has attracted considerable attention because they are ...
    • openAccess   H.264/AVC inter prediction on accelerator-based multi-core systems 

      Rodríguez Sánchez, Rafael; Martínez, José Luis; Fernández Escribano, Gerardo; Sánchez, José L.; Claver, José M. Springer (2013)
      The AVC video coding standard adopts variable block sizes for inter frame coding to increase compression efficiency, among other new features. As a consequence of this, an AVC encoder has to employ a complex mode decision ...
    • openAccess   Harvesting Energy in ILUPACK via Slack Elimination 

      Aliaga Estellés, José Ignacio; Barreda Vayá, Maria; Castaño Álvarez, María Asunción (2017-07-05)
      We develop a new energy-aware methodology to improve the energy consumption of a task-parallel preconditioned Conjugate Gradient iter- ative solver on a Haswell-EP Intel Xeon. This technique leverages the power-saving ...
    • openAccess   Help Hamilton: un juego para mejorar el aprendizaje en Bases de datos 

      Marqués-Andrés, Mercedes JENUI Editores (2022)
      En este artículo se describe un juego llevado a cabo en una asignatura de Bases de datos. El juego requiere aplicar conocimientos del lenguaje SQL y también de diseño de bases de datos relacionales. La experiencia ha ...
    • openAccess   Hierarchical approach for deriving a reproducible unblocked LU factorization 

      Iakymchuk, Roman; Graillat, Stef; Defour, David; Quintana-Orti, Enrique S. Sage (2019-03-17)
      We propose a reproducible variant of the unblocked LU factorization for graphics processor units (GPUs). For this purpose, we build upon Level-1/2 BLAS kernels that deliver correctly-rounded and reproducible results for ...
    • openAccess   High performance and energy efficient inference for deep learning on multicore ARM processors using general optimization techniques and BLIS 

      Castelló, Adrián; Barrachina Mir, Sergio; Dolz, Manuel F.; Quintana-Orti, Enrique S.; San Juan, Pau; Tomás Domínguez, Andrés Enrique Elsevier (2022-03-22)
      We evolve PyDTNN, a framework for distributed parallel training of Deep Neural Networks (DNNs), into an efficient inference tool for convolutional neural networks. Our optimization process on multicore ARM processors ...
    • openAccess   High Performance and Portable Convolution Operators for Multicore Processors 

      San Juan, Pablo; Castelló, Adrián; Dolz, Manuel F.; Alonso-Jordá, Pedro; Quintana-Orti, Enrique S. IEEE (2020-10)
      The considerable impact of Convolutional Neural Networks on many Artificial Intelligence tasks has led to the development of various high performance algorithms for the convolution operator present in this type of networks. ...
    • closedAccess   High performance computing tools in science and engineering 

      Quintana-Orti, Enrique S.; Vigo Aguiar, Jesús; Ranilla Pastor, José Springer Science+Business Media (2011-11)
      New large-scale problems with growing computational demands continuously arise in many scientific and engineering applications as, e.g., in bioinformatics, computational chemistry, communications or astrophysics. Effectively ...
    • closedAccess   High performance computing tools in science and engineering II 

      Quintana-Orti, Enrique S.; Vigo Aguiar, Jesús; Ranilla Pastor, José Springer Science+Business Media (2011-12)
      This special issue collects research papers selected among those presented at the second minisymposium “HPC applied to Computational Problems in Science and Engineering” which was held in June 2010, in Almeria, Spain. ...
    • openAccess   High-performance reconstruction of CT medical images by using out-of-core methods in GPU 

      Quintana-Ortí, Gregorio; Chillarón, Mónica; Vidal, Vicente; Verdu, Gumersindo Elsevier (2022-03-02)
      Background and objective:Since Computed Tomography (CT) is one of the most widely used medical imaging tests, it is essential to work on methods that reduce the radiation the patient is exposed to. Although there are several ...
    • openAccess   Highly sensitive and ultrafast read mapping for RNA-seq analysis 

      Barrachina Mir, Sergio; Castillo Catalán, María Isabel; Martínez Pérez, Héctor; Medina, Ignacio; Tárraga, Joaquín; Quintana-Orti, Enrique S.; Dopazo, Joaquín; Salavert Torres, José; Blanquer Espert, Ignacio; Paschall, J.; Hernández-García, V. Oxford University Press (2016)
      As sequencing technologies progress, the amount of data produced grows exponentially, shifting the bottleneck of discovery towards the data analysis phase. In particular, currently available mapping solutions for RNA-seq ...
    • openAccess   Householder QR Factorization With Randomization for Column Pivoting (HQRRP) 

      MARTINSSON, GUNNAR; Quintana-Ortí, Gregorio; Heavner, Nathan; Van de Geijn, Robert A. Society for Industrial and Applied Mathematics (2017)
      A fundamental problem when adding column pivoting to the Householder QR fac- torization is that only about half of the computation can be cast in terms of high performing matrix- matrix multiplications, which greatly ...
    • openAccess   How deeply do we include robotic agents in the self? 

      Stenzel, Anna; Chinellato, Eris; del Pobil, Angel P.; Lappe, Markus; Liepelt, Roman World Scientific Publishing (2013)
      In human–human interactions, a consciously perceived high degree of self–other overlap is associated with a higher degree of integration of the other person's actions into one's own cognitive representations. Here, we ...
    • closedAccess   Hybrid static–dynamic selection of implementation alternatives in heterogeneous environments 

      del Río Astorga, David; Dolz, Manuel F.; Fernández Muñoz, Javier; García Blas, Javier Springer (2019-09)
      With the emergence of heterogeneous architectures, developing parallel software has become an increasingly complex task. The ability of using multiple devices in a single application, such as CPUs, accelerators, or ...
    • openAccess   Hyperspectral Unmixing on Multicore DSPs: Trading Off Performance for Energy 

      Castillo Catalán, María Isabel; Fernández Fernández, Juan Carlos; Igual, Francisco; Plaza, Antonio; Quintana-Orti, Enrique S.; Remón Gómez, Alfredo IEEE (2014)
      Wider coverage of observation missions will increase onboard power restrictions while, at the same time, pose higher demands from the perspective of processing time, thus asking for the exploration of novel high-performance ...
    • openAccess   I-AUV Docking and Panel Intervention at Sea 

      Palomeras, Narcís; Peñalver Monfort, Antonio; Massot-Campos, Miquel; Lluís Negre, Pep; Fernández Fresneda, José Javier; Ridao, Pere; Sanz, Pedro J; Oliver-Codina, Gabriel MDPI (2016)
      The use of commercially available autonomous underwater vehicles (AUVs) has increased during the last fifteen years. While they are mainly used for routine survey missions, there is a set of applications that nowadays can ...
    • openAccess   I-AUV Mechatronics Integration for the TRIDENT FP7 Project 

      Rivas, David; Ridao, Pere; Turetta, Alessio; Melchiorri, Claudio; Palli, Gianluca; Fernández Fresneda, José Javier; Sanz, Pedro J Institute of Electrical and Electronics Engineers (IEEE) (2015-10)
      Autonomous underwater vehicles (AUVs) are routinely used to survey areas of interest in seas and oceans all over the world. However, those operations requiring intervention capabilities are still reserved to manned ...
    • openAccess   iMODS: internal coordinates normal mode analysis server 

      López Blanco, José R.; Aliaga Estellés, José Ignacio; Quintana-Orti, Enrique S.; Chacón, Pablo Oxford University Press (2014)
      Normal mode analysis (NMA) in internal (dihedral) coordinates naturally reproduces the collective functional motions of biological macromolecules. iMODS facilitates the exploration of such modes and generates feasible ...
    • openAccess   Implicit Hari–Zimmermann algorithm for the generalized SVD on the GPUs 

      Novaković, Vedran; Singer, Sanja SAGE Publications (2020-12-10)
      A parallel, blocked, one-sided Hari–Zimmermann algorithm for the generalized singular value decomposition (GSVD) of a real or a complex matrix pair (F,G) is here proposed, where F and G have the same number of columns, and ...
    • openAccess   Improved Accuracy and Parallelism for MRRR-Based Eigensolvers -- A Mixed Precision Approach 

      Petschow, Matthias; Quintana-Orti, Enrique S.; Bientinesi, Paolo Society for Industrial and Applied Mathematics (2014)
      The real symmetric tridiagonal eigenproblem is of outstanding importance in numerical computations; it arises frequently as part of eigensolvers for standard and generalized dense Hermitian eigenproblems that are based on ...