Commits


Chi Lo authored and GitHub committed 0292356bd7c
TensorRT EP engine cache serialization/deserialization refactor (#11045)

* Code refactor
* fix bug
* modify comment
* modify test for the new ORT TRT cache behavior
* update comment
* rename variable
* fix bug for not having trt context
* Custom parameters (#10964)
* get inputs independently for trtexec
* track one process only
* remove engine and profile files
* change time to commit time
* add runtime option for io binding
* move to commit date
* fixes
* add option for graph optimization
* cleanup docker script
* note second time creation
* allow for parameters to be configured from pipeline at runtime
* uncomment
* include optional arguments at runtime
* post second session creation
* update cmake version
* Revert "update cmake version"
  This reverts commit 09a1364eae68610724c8e90eeea777b7ee03f74b.
* Move data format import
* Perf FasterRCNN + MaskRCNN (#11102)
* add faster mask
* fix paths
* add a test scenario that - if engine cache is present, trt ep should load the engine cache and run inference
* Revert "Merge branch 'trt_cache_refactor' of https://github.com/microsoft/onnxruntime into trt_cache_refactor"
  This reverts commit 8edc574de1ea6055534f33a57b9365c721c2eb29, reversing changes made to 0c92e5b2b1d453527001fe731ed4ccfc79e6adad.

Co-authored-by: Olivia Jain <oljain@microsoft.com>