PipeDream: generalized pipeline parallelism for DNN training D Narayanan, A Harlap, A Phanishayee, V Seshadri, NR Devanur, ... SOSP 2019: Proceedings of the 27th ACM Symposium on Operating Systems …, 2019 | 836 | 2019 |
FAWN: A Fast Array of Wimpy Nodes DG Andersen, J Franklin, M Kaminsky, A Phanishayee, L Tan, ... SOSP 2009: Proceedings of the ACM SIGOPS 22nd Symposium on Operating Systems …, 2009 | 797 | 2009 |
The non-IID data quagmire of decentralized machine learning K Hsieh, A Phanishayee, O Mutlu, PB Gibbons ICML 2020: International Conference on Machine Learning (arXiv preprint …, 2019 | 600 | 2019 |
Safe and effective fine-grained TCP retransmissions for datacenter communication V Vasudevan, A Phanishayee, H Shah, E Krevat, DG Andersen, ... SIGCOMM 2009: 39 (4), 303-314, 2009 | 588 | 2009 |
Efficient large-scale language model training on GPU clusters using Megatron-LM D Narayanan, M Shoeybi, J Casper, P LeGresley, M Patwary, ... Proceedings of the International Conference for High Performance Computing …, 2021 | 562 | 2021 |
Analysis of Large-Scale Multi-Tenant GPU Clusters for DNN Training Workloads M Jeon, S Venkataraman, A Phanishayee, J Qian, W Xiao, F Yang USENIX ATC 2019 (arXiv preprint arXiv:1901.05758), 2019 | 431* | 2019 |
Measurement and Analysis of TCP Throughput Collapse in Cluster-based Storage Systems. A Phanishayee, E Krevat, V Vasudevan, DG Andersen, GR Ganger, ... FAST 2008: 6th USENIX Conference on File and Storage Technologies 8, 1-14, 2008 | 373 | 2008 |
ProjecToR: Agile Reconfigurable Data Center Interconnect M Ghobadi, R Mahajan, A Phanishayee, N Devanur, J Kulkarni, ... SIGCOMM 2016, 216-229, 2016 | 370 | 2016 |
PipeDream: Fast and efficient pipeline parallel DNN training A Harlap, D Narayanan, A Phanishayee, V Seshadri, N Devanur, ... arXiv preprint arXiv:1806.03377, 2018 | 265 | 2018 |
TBD: Benchmarking and Analyzing Deep Neural Network Training H Zhu, M Akrout, B Zheng, A Pelegris, A Phanishayee, B Schroeder, ... IISWC 2018 - International Symposium on Workload Characterization - arXiv …, 2018 | 231 | 2018 |
Themis: Fair and Efficient GPU Cluster Scheduling K Mahajan, A Balasubramanian, A Singhvi, S Venkataraman, A Akella, ... NSDI 2020: 17th USENIX Symposium on Networked Systems Design and …, 2020 | 222 | 2020 |
Heterogeneity-Aware Cluster Scheduling Policies for Deep Learning Workloads D Narayanan, K Santhanam, F Kazhamiaka, A Phanishayee, M Zaharia OSDI 2020: 14th USENIX Symposium on Operating Systems Design and …, 2020 | 210 | 2020 |
Memory-efficient pipeline-parallel DNN training D Narayanan, A Phanishayee, K Shi, X Chen, M Zaharia International Conference on Machine Learning, 7937-7947, 2021 | 208 | 2021 |
Gist: Efficient Data Encoding for Deep Neural Network Training A Jain, A Phanishayee, J Mars, L Tang, G Pekhimenko ISCA 2018: Proceedings of The 45th International Symposium on Computer …, 2018 | 196 | 2018 |
Parameter hub: a rack-scale parameter server for distributed deep neural network training L Luo, J Nelson, L Ceze, A Phanishayee, A Krishnamurthy SoCC 2018: Proceedings of the ACM Symposium on Cloud Computing, 41-54, 2018 | 147 | 2018 |
Blink: Fast and generic collectives for distributed ML G Wang, S Venkataraman, A Phanishayee, J Thelin, N Devanur, I Stoica MLSys 2020: Third Conference on Machine Learning and Systems (arXiv preprint …, 2019 | 132 | 2019 |
FAWN: A fast array of wimpy nodes DG Andersen, J Franklin, M Kaminsky, A Phanishayee, L Tan, ... Communications of the ACM 54 (7), 101-109, 2011 | 131 | 2011 |
Atomic In-place Updates for Non-volatile Main Memories with Kamino-Tx A Memaripour, A Badam, A Phanishayee, Y Zhou, R Alagappan, ... EuroSys 2017: Twelfth European Conference on Computer Systems, 499-512, 2017 | 128 | 2017 |
Analyzing and mitigating data stalls in DNN training J Mohan, A Phanishayee, A Raniwala, V Chidambaram arXiv preprint arXiv:2007.06775, 2020 | 105 | 2020 |
On application-level approaches to avoiding TCP throughput collapse in cluster-based storage systems E Krevat, V Vasudevan, A Phanishayee, DG Andersen, GR Ganger, ... Proceedings of the 2nd international workshop on Petascale data storage …, 2007 | 98 | 2007 |