Commit Graph

14 Commits

Author SHA1 Message Date
Bryce Allen
c32b86422f distribute total across ranks
useful for test < 6 ranks per node
2020-08-07 18:50:39 -04:00
Bryce Allen
538c22a22f add avg script for parsing timings in *.txt 2020-08-07 18:07:56 -04:00
Bryce Allen
37ad5e87ce use define to switch between managed/unmanaged 2020-08-07 14:14:56 -04:00
Bryce Allen
6940ce7ceb add mem free print, fit in 8GB gpu 2020-08-07 13:21:22 -04:00
Bryce Allen
3ebd09725e add mpi wtime counters, fix make clean 2020-08-07 13:05:34 -04:00
Bryce Allen
3dd6045f2e move finialize to outside profiler area 2020-08-07 13:02:07 -04:00
Bryce Allen
55af9daa9b make: fix summit build 2020-08-06 11:13:32 -04:00
Bryce Allen
063e592dcf fix all* cuda malloc size 2020-08-06 11:13:18 -04:00
Bryce Allen
134c933e86 use managed mem for allgather, cleanup 2020-08-06 10:11:58 -04:00
Bryce Allen
714a96d1ea update cuda errors for 11
deprecated API was removed
2020-08-06 07:42:59 -04:00
Bryce Allen
3e99cf443b fix allgather recv size 2020-08-06 07:42:46 -04:00
Bryce Allen
4d504dd5b1 add versions with nvtx 2020-08-05 16:45:42 -04:00
Bryce Allen
df9a3a79a8 add env var debugging 2020-03-31 14:33:06 -04:00
Bryce Allen
74b23dff0b initial version 2020-02-24 17:20:21 -05:00