15 Commits (02b31f0427c546303e4004fd64a66e44aeb2fed4)
 

Author       SHA1        Message                                          Date
Bryce Allen  02b31f0427  hacky multi-node support                         5 years ago
Bryce Allen  c32b86422f  distribute total across ranks                    5 years ago
Bryce Allen  538c22a22f  add avg script for parsing timings in *.txt      5 years ago
Bryce Allen  37ad5e87ce  use define to switch between managed/unmanaged  5 years ago
Bryce Allen  6940ce7ceb  add mem free print, fit in 8GB gpu               5 years ago
Bryce Allen  3ebd09725e  add mpi wtime counters, fix make clean           5 years ago
Bryce Allen  3dd6045f2e  move finialize to outside profiler area          5 years ago
Bryce Allen  55af9daa9b  make: fix summit build                           5 years ago
Bryce Allen  063e592dcf  fix all* cuda malloc size                        5 years ago
Bryce Allen  134c933e86  use managed mem for allgather, cleanup           5 years ago
Bryce Allen  714a96d1ea  update cuda errors for 11                        5 years ago
Bryce Allen  3e99cf443b  fix allgather recv size                          5 years ago
Bryce Allen  4d504dd5b1  add versions with nvtx                           5 years ago
Bryce Allen  df9a3a79a8  add env var debugging                            6 years ago
Bryce Allen  74b23dff0b  initial version                                  6 years ago