Commits


Derek Murray authored and GitHub committed a541d28fb41
Lazily get allocator when allocating an MLValue (#4276) According to profiling in #4267, getting the allocator can account for a large fraction of overhead when accessing a kernel output, due to STL container operations. The allocator isn't used when (i) we're not creating a fence, and (ii) we have a memory pattern and a pre-allocated buffer, so we can avoid this overhead.