Publications: Efficient parallel implementations

Improving tasks throughput on accelerators using OpenCL command concurrency
A.J. Lázaro-Muñoz, J.M. González-Linares, J. Gómez-Luna, Nicolás Guil
Computer Research Repository, 2018
Energy-based Tuning of Convolutional Neural Networks on Multi-GPUs
F.M. Castro, N. Guil, M.J. Martín-Jiménez, J. Pérez-Serrano, M. Ujaldón
Journal of Concurrency and Computation: Practice and Experience, to appear
A.J. Lázaro-Muñoz, J.M. González-Linares, J. Gómez-Luna, Nicolás Guil
Journal of Parallel and Distributed Computing, Volume 109, November 2017, Pages 258-271
Efficient OpenCL-based concurrent tasks offloading on accelerators
A.J. Lázaro-Muñoz, J.M. González-Linares, J. Gómez-Luna, Nicolás Guil
International Conference on Computational Science (ICCS 2017), Zurich, June, 2017
Configurable XOR hash functions for banked scratchpad memories in GPUs
Gert-Jan van den Braak, Juan Gómez-Luna, José M. González-Linares, Henk Corporaal, Nicolás Guil
IEEE Transactions on Computers, vol. 65, no. 7, pp. 2045-2058, July, 2016
Paper selected (together with three others) as "Editor's pick of the year 2016" of the IEEE Transactions on Computers
In-Place Matrix Transposition on GPUs
Juan Gómez-Luna, I-Jui Sung, Li-Wen Chang, José M. González-Linares, Nicolás Guil, Wen-Mei W. Hwu
IEEE Transactions on Parallel and Distributed Systems, vol. 27, Iss. 3, March, 2016
Calculation of dense trajectory descriptors on a heterogeneous embedded architecture
Julián R. Cózar, Manuel J. Marín-Jiménez, José M. González-Linares, Juan Gómez-Luna, Nicolás Guil
Journal of Systems Architecture, in press, 2015
Demystifying the 16x16 thread-block for stencils on the GPU
Siham Tabik, Maurice Peemen, Nicolás Guil, Henk Corporaal
Concurrency and Computation: Practice and Experience, in press, 2015
In-Place Data Sliding Algorithms for Many-Core Architectures
Juan Gómez-Luna, I-Jui Sung, Li-Wen Chang, Nicolás Guil, Wen-Mei W. Hwu
The 44th International Conference on Parallel Processing (ICPP-2015), Beijing, China, 2015
In-Place Transpositions of Rectangular Matrices on Accelerators
I-Jui Sung, Juan Gómez-Luna, José M. González-Linares, Nicolás Guil, Wen-Mei W. Hwu
19th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), Orlando (Florida), 2014
Performance modeling of atomic additions on GPU scratchpad memory
J. Gómez-Luna, J. M. González-Linares, J. I. Benavides, N. Guil
IEEE Trans. on Parallel and Distributed Systems, vol. 24, no. 11, 2013
Simulation and Architecture Improvements of Atomic Operations on GPU Scratchpad Memory
Gert-Jan van den Braak, Juan Gómez-Luna, Henk Corporaal, José M. González-Linares, Nicolás Guil
The 31st IEEE International Conference on Computer Design (ICCD), 2013
An optimized approach to histogram computation on GPU
J. Gómez-Luna, J. M. González-Linares, J. I. Benavides, N. Guil
Machine Vision and Applications, Vol. 24, Iss. 5, pp. 899-908, 2013
Performance models for asynchronous data transfers on consumer Graphics Processing Units
Juan Gómez-Luna; José González-Linares; José I. Benavides; Nicolás Guil
Journal of Parallel and Distributed Computing, Volume 72, Issue 9, September 2012
Programming Issues for Video Analysis on Graphic Processing Units
Juan Gómez-Luna
Ph.D. Thesis, Córdoba, Spain, 2012
Egomotion compensation and moving objects detection algorithm on GPU
J. Gómez-Luna, H. Endt, W. Stechele, J. M. González-Linares, J. I. Benavides, N. Guil
International Conference on Parallel Computing, ParCo 2011, Ghent, Belgium, 2011
Load Balancing Versus Occupancy Maximization On Graphics Processing Units: The Generalized Hough Transform as a Case Study
Juan Gómez-Luna; José M. González-Linares; José I. Benavides, Emilio L. Zapata; Nicolás Guil
The International Journal of High Performance Computing Applications, Volume 25, Issue 2, May 2011
Mapping a Class of Applications in Heterogeneous Multithreading Architectures
Juan Lucena; Oscar Plata; Nicolás Guil
Second Workshop on Programming Models for Emerging Architectures (PMEA), co-located with PACT 2010, pp. 3-10, 2010
An implementation of the Generalized Hough Transform on heterogeneous multithreading architectures
Juan Lucena; Oscar Plata; Nicolás Guil
XXI Jornadas de Paralelismo, Valencia, Spain, 2010
Concurrent Calculations On Reconfigurable Logic Devices Applied To The Analysis Of Video Images
Sergio R. Geninatti; Jose I. Benavides Benitez; Manuel Hernandez Calvino; Nicolas Guil
International Journal of Reconfigurable Computing, pp. 1-8, 2010. ISSN 1687-7195.
Integrating OpenMP and CUDA Models
Juan Lucena Raya; Nicolas Guil; Oscar Plata
Proceedings on Advanced Computer Architecture and Compilation For Embedded Systems, pp. 39-42. 2009. ISBN 978-90-382-1467-2.
FPGA Implementation of The Generalized Hough Transform
Sergio R. Geninatti; Manuel Hernandez Calvino; Juan Gomez-Luna; Jose I. Benavides; Nicolas Guil
International Conference on Reconfigurable Computing and FPGAs, 2009. ISBN 978-0-7695-3917-1.
High Performance Circle Detection Through a GPU Rasterization Approach
Nicolas Guil; Manuel Ujaldon
Proceedings 4th I. Conference on Pattern Recognition and Image Analysis, pp. 273-281, 2009.
Parallelization Of a Video Segmentation Algorithm On Cuda-Enabled Graphics Processing Units
Juan Gomez-Luna; Jose M. Gonzalez-Linares; Jose I. Benavides; Nicolas Guil
Euro-Par 2009 Parallel Processing, LNCS 5704, 2009
Large LED Display Without Current Drivers
Alfonso Gago Calderon; Alfonso Gago Bohorquez; Nicolas Guil
XXIX International Display Research Conference, 2009.
Concurrent Calculations on Reconfigurable Logic Devices Applied to the Analysis of Video Images
Sergio R. Geninatti; M.H. Calvino; Jose I. Benavides; Nicolas Guil
V Southern Conference on Programmable Logic (SPL 2009), 2009.
On the Computation of the Circle Hough Transform by a GPU Rasterizer
Manuel Ujaldon; Antonio Ruiz; Nicolas Guil
Pattern Recognition Letters, 29 (3), pp. 309-318, 2008.
Recognition of Circular Patterns on GPUs: Performance Analysis and Contributions
Antonio Ruiz; Nicolas Guil; Manuel Ujaldon
Journal Of Parallel And Distributed Computing, 68 (10), pp. 1329-1338, 2008.
Improving Video Segmentation Performance in Multithreading and Multicore Architectures
J.I. Benavides; J.M. Gonzalez-Linares; J. Gomez-Luna; N. Guil
Jornadas de Paralelismo (CEDI2007) Actas del XVII Congreso Español de Informatica Grafica (CEIG07), pp. 11-18, 2007, ISBN 978-84-9732-595-0
Using Graphics Hardware for Enhancing Edge and Circle Detection
Antonio Ruiz; Manuel Ujaldon; Nicolas Guil
Iberian Conference on Pattern Recognition And Image Analysis, LNCS 4477, pp. 234-241, 2007. ISBN 978-3-540-72848-1
Computer Architectures for Multimedia and Video Analysis
Edmundo Saez; Jose Gonzalez-Mora; Nicolas Guil; Jose I. Benavides; Emilio L. Zapata
High Performance Computing in Remote Sensing, pp. 43-67. 2007. ISBN 978-1-58488-662-4
New Full-Colour LED Based Screen
Nicolas Guil; Alfonso Gago Bohorquez; Alfonso Gago Calderon
IEEE Mediterranean Electrotechnical Conference (MELECON), 2006. ISBN 1-4244-0088-0.
Using Graphic Processing Units For Tracking Algorithms
Jose Gonzalez-Mora; Nicolas Guil; Emilio L. Zapata
5th International Workshop on Information Optics (WIO'06), pp. 310-313, 2006.
Parallelization of an Algorithm for the Automatic Detection of Deformable Objects
Jose M. Gonzalez-Linares, Nicolas Guil; Emilio L. Zapata; Pilar Martinez Ortigosa; Inmaculada Garcia Fernandez
Acta Cybernetica, 15(2), pp. 151-162, 2001, ISSN 0324-721x
Parallelization Of An Algorithm For Non-Rigid Objects Detection With Stochastic Global Optimization
Jose M. Gonzalez-Linares; Pilar Martinez Ortigosa; Nicolas Guil; Inmaculada Garcia Fernández
Volume of Extended Abstracts of the II Conference of PhD Students in Computer Science, Szeged (Hungary), 2000.
Paralelizacion de un algoritmo para la deteccion automatica de objetos deformables
Jose M. Gonzalez-Linares; Nicolas Guil; Emilio L. Zapata; Pilar Martinez Ortigosa; Inmaculada Garcia Fernandez
Perspectivas Del Paralelismo en Computadores, pp. 327-332, 2000, ISBN 84-699-3003-6
A Probabilistic Model For Best-First Search B&B Algorithms
Francisco Arguello; Nicolas Guil; Juan Lopez Gomez; Emilio L. Zapata
Irregular'97, LNCS 1253, pp. 49-60, 1997, ISSN 0302-9743.
Fast Hough Transform On Multiprocessors: A Branch And Bound Approach
Nicolas Guil; Emilio L. Zapata
Journal Of Parallel And Distributed Computing, 45(1), pp. 82-89, 1997, ISSN 0743-7315.
A Parallel Pipelined Hough Transform
Nicolas Guil; Emilio L. Zapata
EUROPAR'96, LNCS 1124, pp. 131-138, 1996, ISSN 0302-9743
Cordic Based Parallel/Pipelined Architecture for the Hough Transform
Nicolas Guil; Emilio L. Zapata; Julio Villalba; Javier Diaz Bruguera; Tomas Lang
Journal of VLSI Signal Processing Systems for Signal Image and Video Technology, 12(3), pp. 207-221, 1996, ISSN 0922-5773.