Commits


pengwa authored and GitHub committed fb88efbe181
End to end run pass (on device training) (#11694) * lr_scheduler implementation (cherry picked from commit d9c2552b3a3b2ff38ee0a14770257aa1169f6fa9) * refactor Module/Optimizer constructor. * add intermidiate API layer bridging public interfaces with internal ones. * synthetic data loader * make end to end run pass * avoid many session input copy (CPU to GPU) some clean up * NVTX for runner * minor fix after sync * revert to let Module/Optimizer handle session creation. * fix tests & test file folder consolidation * refine based on comments & fix cpplint * typos