Juan Gómez Luna
Juan Gómez Luna
Verified email at ethz.ch
Title
Cited by
Cited by
Year
Chai: Collaborative heterogeneous applications for integrated-architectures
J Gómez-Luna, I El Hajj, LW Chang, V Garcıa-Flores, SG de Gonzalo, ...
ISPASS, 2017
722017
Processing data where it makes sense: Enabling in-memory computation
O Mutlu, S Ghose, J Gómez-Luna, R Ausavarungnirun
Microprocessors and Microsystems 67, 28-41, 2019
712019
Mqsim: A framework for enabling realistic studies of modern multi-queue {SSD} devices
A Tavakkol, J Gómez-Luna, M Sadrosadati, S Ghose, O Mutlu
16th {USENIX} Conference on File and Storage Technologies ({FAST} 18), 49-66, 2018
692018
An optimized approach to histogram computation on GPU
J Gómez-Luna, JM González-Linares, JI Benavides, N Guil
Machine Vision and Applications 24 (5), 899-908, 2013
472013
FLIN: Enabling fairness and enhancing performance in modern NVMe solid state drives
A Tavakkol, M Sadrosadati, S Ghose, J Kim, Y Luo, Y Wang, NM Ghiasi, ...
2018 ACM/IEEE 45th Annual International Symposium on Computer Architecture …, 2018
442018
Processing-in-memory: A workload-driven perspective
S Ghose, A Boroumand, JS Kim, J Gómez-Luna, O Mutlu
IBM Journal of Research and Development 63 (6), 3: 1-3: 19, 2019
392019
Napel: Near-memory computing application performance prediction via ensemble learning
G Singh, J Gómez-Luna, G Mariani, GF Oliveira, S Corda, S Stuijk, ...
2019 56th ACM/IEEE Design Automation Conference (DAC), 1-6, 2019
362019
Smash: Co-designing software compression and hardware-accelerated indexing for efficient sparse matrix operations
K Kanellopoulos, N Vijaykumar, C Giannoula, R Azizi, S Koppula, ...
Proceedings of the 52nd Annual IEEE/ACM International Symposium on …, 2019
332019
Performance modeling of atomic additions on GPU scratchpad memory
J Gómez-Luna, J González-Linares, J Benavides Benítez, N Guil
IEEE Transactions on Parallel and Distributed Systems 24 (11), 2273-2282, 2013
322013
Performance models for asynchronous data transfers on consumer Graphics Processing Units
J Gómez-Luna, JM González-Linares, JI Benavides, N Guil
Journal of Parallel and Distributed Computing 72 (9), 1117-1126, 2012
322012
Evaluating the effect of last-level cache sharing on integrated GPU-CPU systems with heterogeneous applications
V Garcıa, J Gomez-Luna, T Grass, A Rico, E Ayguade, AJ Pena
2016 IEEE International Symposium on Workload Characterization (IISWC), 1-10, 2016
302016
KLAP: Kernel launch aggregation and promotion for optimizing dynamic parallelism
I El Hajj, J Gómez-Luna, C Li, LW Chang, D Milojicic, W Hwu
2016 49th Annual IEEE/ACM International Symposium on Microarchitecture …, 2016
292016
In-place transposition of rectangular matrices on accelerators
IJ Sung, J Gómez-Luna, JM González-Linares, N Guil, WMW Hwu
ACM SIGPLAN Notices 49 (8), 207-218, 2014
252014
FPGA implementation of the generalized Hough transform
SR Geninatti, JIB Benítez, MH Calviño, NG Mata, JG Luna
2009 International Conference on Reconfigurable Computing and FPGAs, 172-177, 2009
252009
Genasm: A high-performance, low-power approximate string matching acceleration framework for genome sequence analysis
DS Cali, GS Kalsi, Z Bingöl, C Firtina, L Subramanian, JS Kim, ...
2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture …, 2020
242020
A modern primer on processing in memory
O Mutlu, S Ghose, J Gómez-Luna, R Ausavarungnirun
arXiv preprint arXiv:2012.03112, 2020
232020
Enabling practical processing in and near memory for data-intensive computing
O Mutlu, S Ghose, J Gómez-Luna, R Ausavarungnirun
Proceedings of the 56th Annual Design Automation Conference 2019, 1-4, 2019
232019
Parallelization of a video segmentation algorithm on CUDA–enabled graphics processing units
J Gómez-Luna, JM González-Linares, JI Benavides, N Guil
European Conference on Parallel Processing, 924-935, 2009
222009
FIGARO: Improving system performance via fine-grained In-DRAM data relocation and caching
Y Wang, L Orosa, X Peng, Y Guo, S Ghose, M Patel, JS Kim, JG Luna, ...
2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture …, 2020
212020
Automatic Generation of Warp-Level Primitives and Atomic Instructions for Fast and Portable Parallel Reduction on GPUs
SG De Gonzalo, S Hammond, S Huang, O Mutlu, J Gómez-Luna, W Hwu
CGO, 2019
202019
The system can't perform the operation now. Try again later.
Articles 1–20