
Governance
Manage Agent lifecycles: create, schedule, and run Agents. Define alarms and notifications for every workload.

Operational and Cost Optimization
Access continuously streamed actionable insights describing the current health of endpoints, services, and APIs for any target Agent workload.

Observe and Act
Measure your compliance with your customer Service Level Agreements and your own Service Level Objectives by customizing your metrics over a period using actual and forecasted workloads.

Multi-Cloud Support, Native Interfaces and Customizability
Access five-day rolling composite, service, and Agent forecasts, including alarm probabilities, enabling you to preempt and proactively fix issues.
Enabling Features
Essential features for integration and customization.
Agent Lifecycle Management
Create, configure, schedule, and run agents across environments with full lifecycle control.

Decision & Execution Workflows
Define step-by-step governance workflows for approving or rejecting agent actions.

Service Settings & SLAs
Manage incident definitions, set SLA/SLO thresholds, and monitor outcomes over time.

Alarm & Notification Configuration
Customize alarm conditions and routing using agent policies and observability signals.

Scheduling Options
Set when agents run, how frequently, and under what conditions.

Group-Based Access Control
Organize agents into functional or organizational groups (e.g., Dev, Stage, Prod) with user access rules.

Agent-to-Agent Communication
Agents can share memory and trigger shared actions — enabling governance at scale.

Assistant Memory Management
Used for Agent Authentication with the Aviary platform.

Streamed Operational Insights
Continuously assess the health of endpoints, APIs, and workloads across services.

Worker Agent Configuration
Customize and deploy open-source worker agents tailored to your environment in seconds.

Worker Agent Libraries
Organize task-based workers into modular libraries powering broader service agents.

Worker Agents from API Schemas
Generate worker agents automatically using OpenAPI schemas.

Forecast Model Management
Create and tune models using tools like Databricks, Snowflake Cortex, or built-in neural networks.

Wallet
Manage credentials and access tokens for external systems (e.g., Databricks, Snowflake).

Cluster Selection
Connect agents to runtime clusters using kubeconfig for Kubernetes-based deployments.

Composite Daily Forecasts
Combine forecasts across services to surface critical trends and performance degradation.

Five-Day Rolling Forecasts
Visualize near-future performance by agent, service, or composite workload — including alarm probabilities.

Named Pattern Matching for Incidents
Identify recurring issues based on saved incident summaries and known resolution patterns.

Incident & SLA Dashboards
Conversational, dynamic views that summarize health, risk, and SLA/SLO performance.

Multi-Cloud Support
Native agent compatibility with AWS, Azure, GCP, Oracle Cloud, and IBM.

Conversational Ops/Dev Interface
Interact with agents and dashboards directly in Slack or Teams, as you would with an SRE teammate, to investigate or configure services.

Open Source Agent Sharing
Share custom-built agents with others across your org or community.

Multiple Deployment Types
Run agents as Kubernetes-managed services, Docker containers, or standalone executables.

API Key Management
Authenticate your agents securely with the AI platform.

What are Cloud Canaries AI Agents?
Independent AI agents that monitor, observe, manage, and remediate cloud environments with:
- Workload and telemetry data
- Perception, reasoning, actions, and learning
- Observability and workflow dashboards
Shared from a single platform.
Create and deploy Agents in minutes, collect data today, and manage tomorrow.
Cloud Canaries are independent agents that observe and manage cloud environments to notify, identify, quantify, predict, and remediate.