AgentOps
Agent Runs / Eval
Every agent run is observable, validated against a schema and scored on a rubric — with a human in the loop where it counts.
Total runs
8
this cycle
Succeeded
6
schema-valid output
Needs review
1
awaiting a human
Avg eval score
83/100
6 evaluated
Agent Runs
Tool calls, validation, latency, tokens and cost per run
| Agent | Task | Status | Validation | Prompt | Duration | Tokens | Cost | When |
|---|---|---|---|---|---|---|---|---|
| BuyerPersonaAgent | Cluster Personas | Succeeded | Passed | BuyerPersonaAgent@v1.0 | 1.8s | 4,200 | US$0.04 | today |
| CreativeStrategyAgent | Generate Angles | Succeeded | Passed | CreativeStrategyAgent@v2.1 | 2.1s | 4,850 | US$0.05 | yesterday |
| CopywritingAgent | Write Hooks | Succeeded | Needs Review | CopywritingAgent@v3.2 | 2.4s | 5,500 | US$0.06 | 2 days ago |
| AdsManagerAgent | Draft Campaign | Needs Review | Needs Review | AdsManagerAgent@v1.3 | 2.8s | 6,150 | US$0.08 | 3 days ago |
| ComplianceAgent | Check Claims | Succeeded | Passed | ComplianceAgent@v2.4 | 3.1s | 6,800 | US$0.09 | 4 days ago |
| AnalyticsAgent | Extract Learnings | Succeeded | Passed | AnalyticsAgent@v3.5 | 3.4s | 7,450 | US$0.10 | 5 days ago |
| TrendRadarAgent | Scan Trends | Running | — | TrendRadarAgent@v1.6 | 3.7s | 8,100 | US$0.11 | today |
| WarRoomAgent | Daily Recommendations | Succeeded | Passed | WarRoomAgent@v2.7 | 4.0s | 8,750 | US$0.12 | yesterday |
Evaluations
Rubric-scored output with human-edit tracking
BuyerPersonaAgent
Human-editedScore74/100
Schema Validity9/10
Grounding7/10
Actionability7/10
Human tightened hooks, kept structure.
CreativeStrategyAgent
As-isScore79/100
Schema Validity10/10
Grounding8/10
Actionability8/10
Accepted as-is.
CopywritingAgent
Human-editedScore84/100
Schema Validity9/10
Grounding9/10
Actionability9/10
Human tightened hooks, kept structure.
ComplianceAgent
As-isScore89/100
Schema Validity10/10
Grounding7/10
Actionability7/10
Accepted as-is.
AnalyticsAgent
Human-editedScore94/100
Schema Validity9/10
Grounding8/10
Actionability8/10
Human tightened hooks, kept structure.
WarRoomAgent
As-isScore77/100
Schema Validity10/10
Grounding9/10
Actionability9/10
Accepted as-is.
Agent Roster
25 specialised agents — 8 active this cycle
MarketResearchAgent
IdleVoiceOfCustomerAgent
IdleBrandStrategyAgent
IdleProductMarketingAgent
IdleBlueOceanAgent
IdleBuyerPersonaAgent
ActiveOfferAgent
IdleExperimentationAgent
IdleCreativeStrategyAgent
ActiveCopywritingAgent
ActiveContentStrategyAgent
IdleSocialOpsAgent
IdleAIInfluencerAgent
IdleAdsManagerAgent
ActiveCommunityAgent
IdleFunnelAgent
IdleLocalizationAgent
IdleCompetitiveRadarAgent
IdleTrendRadarAgent
ActiveAnalyticsAgent
ActiveAttributionAgent
IdleCostControlAgent
IdleComplianceAgent
ActiveRightsAgent
IdleWarRoomAgent
Active