Commits


Hector Li authored and GitHub committed 4324d2173b4
[QNN EP] Enable Qnn context cache to save model initialization time (#15815)

### Description
Enable the QNN context cache feature to save model initialization time.

Provider options:
- `qnn_context_cache_enable|1` enables the cache feature.
- `qnn_context_cache_path` sets the cache path. It defaults to `model_file.onnx.bin`.

### Motivation and Context
Model initialization takes a long time because of the cost of converting the ONNX model to a QNN model. QNN can serialize the QNN context to a file, so on later runs the user can load the cached context and execute the graph, avoiding the conversion cost.

---------

Co-authored-by: Adrian Lizarraga <adlizarraga@microsoft.com>
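The two provider options above could be passed from Python roughly as follows. This is a minimal sketch, not the code from this commit: the helper names `qnn_provider_options`, `default_cache_path`, and `create_session` are hypothetical, and it reads the stated default `model_file.onnx.bin` as the model file name with `.bin` appended, which is an assumption. Creating a session needs an ONNX Runtime build that includes the QNN execution provider, and a real setup may need further provider options that the commit message does not mention.

```python
def default_cache_path(model_path):
    """Assumed default cache location: the model file name plus '.bin'."""
    return str(model_path) + ".bin"


def qnn_provider_options(cache_path=None):
    """Build the provider options named in the commit message.

    Only the two keys from the commit message are used. Values are
    passed as strings, matching the `key|value` form shown there.
    """
    options = {"qnn_context_cache_enable": "1"}
    if cache_path is not None:
        # Override the default cache location.
        options["qnn_context_cache_path"] = str(cache_path)
    return options


def create_session(model_path, cache_path=None):
    """Create an inference session on the QNN EP with the context cache enabled.

    Hypothetical helper; onnxruntime is imported lazily so the option
    helpers above can be used without it installed.
    """
    import onnxruntime as ort

    return ort.InferenceSession(
        model_path,
        providers=["QNNExecutionProvider"],
        provider_options=[qnn_provider_options(cache_path)],
    )
```

On the first run with the cache enabled, the expectation from the commit message is that the serialized context is written to the cache path; later runs load it and skip the ONNX-to-QNN conversion.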