Funded by Samsung SDS for research on optimizing modern GPU cluster platforms.