Distributed Training Orchestrator
Status: Polling
System Overview
Health Score: --
Active Alerts: --
Uptime: --
System Resources
CPU %: --
Memory %: --
Disk %: --
Training Progress
Loss: --
Accuracy: --
Throughput: --
Charts: Loss Curve, Accuracy Curve
Cluster Topology
Coordinator: Active
Demo Controls
Current Scenario: Baseline Training
Interactive Actions:
➕ Add Worker
➖ Remove Worker
💥 Inject Failure
🔄 Switch Strategy
🔄 Reset Training
Gradient Strategy: AllReduce
System Console
[00:00:00] INFO System initialized - polling for updates...
Recent Alerts
No alerts to display
Performance Insights
Loading insights...