Shaden Smith

Cited by

	All	Since 2019
Citations	3466	3100
h-index	22	20
i10-index	24	22

1600

800

400

1200

201520162017201820192020202120222023202420 46 96 175 185 156 256 434 1515 547

Public access

View all

11 articles

0 articles

available

not available

Based on funding mandates

Co-authors

George KarypisDistinguished McKnight University Professor, University of Minnesota; SPS, AWSVerified email at umn.edu
He YuxiongMicrosoft ResearchVerified email at microsoft.com
Jongsoo ParkResearch Scientist, FacebookVerified email at fb.com
Nikolaos SidiropoulosLouis T. Rader Professor, Electrical & Computer Engineering, University of VirginiaVerified email at virginia.edu
Fabrizio PetriniIntel Labs, Parallel Computing LabVerified email at intel.com
Jee W. ChoiUniversity of OregonVerified email at uoregon.edu
Jeff RasleyMicrosoftVerified email at microsoft.com
Nesreen K. AhmedPrincipal Scientist, Intel AI Research, Purdue UniversityVerified email at intel.com
Samyam RajbhandariMicrosoft Artificial Intelligence and Research, Ohio State University

Shaden Smith

Inflection AI

Verified email at inflection.ai - Homepage

Deep Learning Tensor Decomposition High Performance Computing Parallel Computing


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Bloom: A 176b-parameter open-access multilingual language model T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ...	1146	2023
Using deepspeed and megatron to train megatron-turing nlg 530b, a large-scale generative language model S Smith, M Patwary, B Norick, P LeGresley, S Rajbhandari, J Casper, ... arXiv preprint arXiv:2201.11990, 2022	599*	2022
SPLATT: Efficient and parallel sparse tensor-matrix multiplication S Smith, N Ravindran, ND Sidiropoulos, G Karypis 2015 IEEE International Parallel and Distributed Processing Symposium, 61-70, 2015	264	2015
Zero-infinity: Breaking the gpu memory wall for extreme scale deep learning S Rajbhandari, O Ruwase, J Rasley, S Smith, Y He Proceedings of the international conference for high performance computing …, 2021	205	2021
FROSTT: The Formidable Repository of Open Sparse Tensors and Tools S Smith, JW Choi, J Li, R Vuduc, J Park, X Liu, G Karypis http://frostt.io/, 2017	152	2017
Tensor-Matrix Products with a Compressed Sparse Tensor S Smith, G Karypis 5th Workshop on Irregular applications: Architectures and Algorithms (IA^3), 2015	143	2015
Deepspeed-inference: enabling efficient inference of transformer models at unprecedented scale RY Aminabadi, S Rajbhandari, AA Awan, C Li, D Li, E Zheng, O Ruwase, ... SC22: International Conference for High Performance Computing, Networking …, 2022	124	2022
A Medium-Grained Algorithm for Distributed Sparse Tensor Factorization S Smith, G Karypis Parallel and Distributed Processing Symposium (IPDPS), 2016 IEEE International, 2016	104*	2016
Tensaurus: A versatile accelerator for mixed sparse-dense tensor computations N Srivastava, H Jin, S Smith, H Rong, D Albonesi, Z Zhang 2020 IEEE International Symposium on High Performance Computer Architecture …, 2020	102	2020
Bridging the gap between HPC and big data frameworks M Anderson, S Smith, N Sundaram, M Capotă, Z Zhao, S Dulloor, ... Proceedings of the VLDB Endowment 10 (8), 901-912, 2017	77	2017
Accelerating the tucker decomposition with compressed sparse tensors S Smith, G Karypis European Conference on Parallel Processing, 653-668, 2017	65	2017
Truss Decomposition on Shared-Memory Parallel Systems S Smith, X Liu, NK Ahmed, AS Tom, F Petrini, G Karypis IEEE High Performance Extreme Computing Conference (HPEC), 2017	57	2017
Streaming tensor factorization for infinite data sources S Smith, K Huang, ND Sidiropoulos, G Karypis Proceedings of the 2018 SIAM International Conference on Data Mining, 81-89, 2018	53	2018
Sparse tensor factorization on many-core processors with high-bandwidth memory S Smith, J Park, G Karypis 2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2017	43	2017
Big data frequent pattern mining DC Anastasiu, J Iverson, S Smith, G Karypis Frequent Pattern Mining, 225-259, 2014	42	2014
An Exploration of Optimization Algorithms for High Performance Tensor Completion S Smith, J Park, G Karypis Proceedings of the 2016 ACM/IEEE Conference on Supercomputing (SC '16), 2016	39	2016
Memory-efficient parallel computation of tensor and matrix products for big tensor decomposition N Ravindran, ND Sidiropoulos, S Smith, G Karypis 2014 48th Asilomar Conference on Signals, Systems and Computers, 581-585, 2014	38	2014
Blocking optimization techniques for sparse tensor computation J Choi, X Liu, S Smith, T Simon 2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2018	36	2018
Exploring Optimizations on Shared-memory Platforms for Parallel Triangle Counting Algorithms AS Tom, N Sundaram, NK Ahmed, S Smith, S Eyerman, M Kodiyath, I Hur, ... IEEE High Performance Extreme Computing Conference (HPEC), 2017	32	2017
Constrained Tensor Factorization with Accelerated AO-ADMM S Smith, A Beri, G Karypis 46th International Conference on Parallel Processing (ICPP '17), 2017	32	2017

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors