System and method for model training orchestration
Abstract:
A system for large-scale machine learning experiment execution, including: a platform configured to determine an experiment set from a run specification and schedule a run to one or more clusters; and a set of agents configured to receive the experiment set from the platform and facilitate individual experiment execution through a cluster orchestrator.
Public/Granted literature
Information query
Patent Agency Ranking
0/0