Comet
ML experiment tracking and LLM observability platform, including Opik for evaluating LLM apps.
Comet's Opik pushes deeper into agent eval and framework-portable observability.
Recent moves
- 18h ago
How Evaluation-Driven Development (EDD) Works
- 3d ago
Opik + Oracle Agent Specification: Build Once, Run Anywhere
Opik integrates with Oracle's Open Agent Specification, letting teams build agents against a portable spec and avoid framework lock-in. Extends Opik's reach from observability toward standards-based agent portability.
- 7d ago
AI Evaluation Simplified: Automate Dataset & Metric Eval Workflows with Test Suites
Introduces Test Suites to automate dataset and metric evaluation workflows, reducing the manual work of building reference datasets and judge prompts. Advances Opik's move into the full agent evaluation loop.
- 7d ago
Advanced Claude Code Cost Tracking: How to Save 30% on Token Spend
How-to on cutting Claude Code token spend. Cost-tracking marketing content riding on coding-agent adoption, not a product change.
- 15d ago
Understanding Your Claude Code Spend: What’s Actually Driving the Cost
Analysis post on what drives Claude Code spend. Thought-leadership content, not a release.
- 29d ago
Agent Tracing and Observability: Log & Debug Complex AI Systems
Explainer on agent tracing and observability for debugging multi-agent systems. Educational content reinforcing Opik's core use case, not a product change.