Wen-mei W. Hwu

Cited by

	All	Since 2019
Citations	30210	8027
h-index	83	42
i10-index	286	156

Citations per year

Year	Citations
1991	157
1992	152
1993	242
1994	244
1995	330
1996	296
1997	385
1998	458
1999	476
2000	500
2001	493
2002	502
2003	547
2004	578
2005	707
2006	604
2007	589
2008	619
2009	813
2010	1003
2011	1583
2012	1537
2013	1644
2014	1608
2015	1469
2016	1407
2017	1281
2018	1174
2019	1178
2020	1302
2021	1510
2022	1525
2023	1491
2024	1014

Public access

78 articles / 14 articles (available / not available), based on funding mandates

Co-authors

Name	Affiliation	Verified email
Jinjun Xiong	University at Buffalo	buffalo.edu
Scott Mahlke	Professor, Electrical Engineering and Computer Science Dept., University of Michigan	umich.edu
Deming Chen	Abel Bliss Professor. University of Illinois at Urbana-Champaign	illinois.edu
John Stratton	Whitman College	whitman.edu
Li-Wen Chang	Research Scientist, ByteDance	bytedance.com
Izzat El Hajj	American University of Beirut	aub.edu.lb
Sara Baghsorkhi	Intel Labs	intel.com
Sitao Huang	Assistant Professor of EECS, University of California Irvine	uci.edu
Abdul Dakkak	Modular	modular.com
Isaac Gelado	NVIDIA	gelado.org
Mert Hidayetoğlu	Postdoctoral Scholar, Stanford University	stanford.edu
Vikram Sharma Mailthody	NVIDIA	illinois.edu
Carl Pearson	Sandia National Labs	sandia.gov
Simon Garcia de Gonzalo	Senior Member of Technical Staff, Sandia National Laboratories	sandia.gov
Dejan Milojicic	Hewlett Packard Labs	hpe.com
Cheng Li	Databricks	databricks.com
David I. August	Computer Science, Princeton University	princeton.edu
Nacho Navarro	Associate Professor of Computer Science, Universitat Politecnica de Catalunya and BSC, Barcelona	ac.upc.edu
Tom Conte	Associate Dean for Research, College of Computing; Professor of CS and ECE, Georgia Institute of	gatech.edu
Juan Gómez Luna	NVIDIA	nvidia.com

Wen-mei W. Hwu

Senior Distinguished Research Scientist, NVIDIA; Professor and Sanders-AMD Chair of Electrical and

Verified email at illinois.edu

Computer Architecture, Compiler, Parallel Computing, Cognitive Computing Systems


Title	Authors	Venue	Cited by	Year
Programming massively parallel processors: a hands-on approach	DB Kirk, WW Hwu	Morgan Kaufmann, 2016	4158	2016
Optimization principles and application performance evaluation of a multithreaded GPU using CUDA	S Ryoo, CI Rodrigues, SS Baghsorkhi, SS Stone, DB Kirk, WW Hwu	Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of …, 2008	1319	2008
A power controlled multiple access protocol for wireless packet networks	JP Monks, V Bharghavan, WMW Hwu	Proceedings IEEE INFOCOM 2001. Conference on Computer Communications …, 2001	995	2001
Parboil: A revised benchmark suite for scientific and commercial throughput computing	JA Stratton, C Rodrigues, IJ Sung, N Obeid, LW Chang, N Anssari, GD Liu, ...	Center for Reliable and High-Performance Computing 127 (7.2), 2012	967	2012
The superblock: An effective technique for VLIW and superscalar compilation	WMW Hwu, SA Mahlke, WY Chen, PP Chang, NJ Warter, RA Bringmann, ...	Instruction-Level Parallelism: A Special Issue of The Journal of …, 2011	883	2011
GPU computing gems jade edition	W Hwu	Elsevier, 2011	506*	2011
IMPACT: An architectural framework for multiple-instruction-issue processors	PP Chang, SA Mahlke, WY Chen, NJ Warter, WW Hwu	ACM SIGARCH Computer Architecture News 19 (3), 266-275, 1991	505	1991
PUMA: A programmable ultra-efficient memristor-based accelerator for machine learning inference	A Ankit, IE Hajj, SR Chalamalasetti, G Ndu, M Foltin, RS Williams, ...	Proceedings of the twenty-fourth international conference on architectural …, 2019	459	2019
An adaptive performance modeling tool for GPU architectures	SS Baghsorkhi, M Delahaye, SJ Patel, WD Gropp, WW Hwu	Proceedings of the 15th ACM SIGPLAN symposium on Principles and practice of …, 2010	426	2010
Accelerating advanced MRI reconstructions on GPUs	SS Stone, JP Haldar, SC Tsao, WW Hwu, ZP Liang, BP Sutton	Proceedings of the 5th conference on Computing frontiers, 261-272, 2008	422	2008
DNNBuilder: An automated tool for building high-performance DNN hardware accelerators for FPGAs	X Zhang, J Wang, C Zhu, Y Lin, J Xiong, W Hwu, D Chen	2018 IEEE/ACM International Conference on Computer-Aided Design (ICCAD), 1-8, 2018	398	2018
Program optimization space pruning for a multithreaded GPU	S Ryoo, CI Rodrigues, SS Stone, SS Baghsorkhi, SZ Ueng, JA Stratton, ...	Proceedings of the 6th annual IEEE/ACM international symposium on Code …, 2008	388	2008
MCUDA: An efficient implementation of CUDA kernels for multi-core CPUs	JA Stratton, SS Stone, WMW Hwu	Languages and Compilers for Parallel Computing: 21th International Workshop …, 2008	367	2008
Checkpoint repair for out-of-order execution machines	WW Hwu, YN Patt	Proceedings of the 14th annual international symposium on Computer …, 1987	358	1987
Using profile information to assist classic code optimizations	PP Chang, SA Mahlke, WMW Hwu	Software: Practice and Experience 21 (12), 1301-1321, 1991	353	1991
An effective GPU implementation of breadth-first search	L Luo, M Wong, W Hwu	Proceedings of the 47th design automation conference, 52-55, 2010	343	2010
CUDA-lite: Reducing GPU programming complexity	SZ Ueng, M Lathara, SS Baghsorkhi, WMW Hwu	Languages and Compilers for Parallel Computing: 21th International Workshop …, 2008	333	2008
GPU clusters for high-performance computing	VV Kindratenko, JJ Enos, G Shi, MT Showerman, GW Arnold, JE Stone, ...	2009 IEEE International Conference on Cluster Computing and Workshops, 1-8, 2009	331	2009
Achieving high instruction cache performance with an optimizing compiler	WW Hwu, PP Chang	Proceedings of the 16th Annual International Symposium on Computer …, 1989	304	1989
A comparison of full and partial predicated execution support for ILP processors	SA Mahlke, RE Hank, JE McCormick, DI August, WMW Hwu	Proceedings of the 22nd annual international symposium on Computer …, 1995	272	1995


