Bryce Allen
|
12d76b4a42
|
update ignores
|
2020-08-11 10:23:33 -04:00 |
|
Bryce Allen
|
909f8880de
|
add mpi barrier before allgather
|
2020-08-10 11:37:45 -04:00 |
|
Bryce Allen
|
924b721ad7
|
fix summit job script run script arg order
|
2020-08-10 11:33:00 -04:00 |
|
Bryce Allen
|
02b31f0427
|
hacky multi-node support
assumes 6 procs per node
|
2020-08-07 18:50:39 -04:00 |
|
Bryce Allen
|
c32b86422f
|
distribute total across ranks
useful for test < 6 ranks per node
|
2020-08-07 18:50:39 -04:00 |
|
Bryce Allen
|
538c22a22f
|
add avg script for parsing timings in *.txt
|
2020-08-07 18:07:56 -04:00 |
|
Bryce Allen
|
37ad5e87ce
|
use define to switch between managed/unmanaged
|
2020-08-07 14:14:56 -04:00 |
|
Bryce Allen
|
6940ce7ceb
|
add mem free print, fit in 8GB gpu
|
2020-08-07 13:21:22 -04:00 |
|
Bryce Allen
|
3ebd09725e
|
add mpi wtime counters, fix make clean
|
2020-08-07 13:05:34 -04:00 |
|
Bryce Allen
|
3dd6045f2e
|
move finialize to outside profiler area
|
2020-08-07 13:02:07 -04:00 |
|
Bryce Allen
|
55af9daa9b
|
make: fix summit build
|
2020-08-06 11:13:32 -04:00 |
|
Bryce Allen
|
063e592dcf
|
fix all* cuda malloc size
|
2020-08-06 11:13:18 -04:00 |
|
Bryce Allen
|
134c933e86
|
use managed mem for allgather, cleanup
|
2020-08-06 10:11:58 -04:00 |
|
Bryce Allen
|
714a96d1ea
|
update cuda errors for 11
deprecated API was removed
|
2020-08-06 07:42:59 -04:00 |
|
Bryce Allen
|
3e99cf443b
|
fix allgather recv size
|
2020-08-06 07:42:46 -04:00 |
|
Bryce Allen
|
4d504dd5b1
|
add versions with nvtx
|
2020-08-05 16:45:42 -04:00 |
|
Bryce Allen
|
df9a3a79a8
|
add env var debugging
|
2020-03-31 14:33:06 -04:00 |
|
Bryce Allen
|
74b23dff0b
|
initial version
|
2020-02-24 17:20:21 -05:00 |
|