BigPanda
Aggregate data from all of your observability, monitoring, change, and topology tools. BigPanda’s Open Box Machine Learning correlates that data into a small number of actionable insights, so incidents are detected in real time, as they form, before they escalate into outages. By automatically identifying the probable root cause of a problem, whether it is a change or an infrastructure issue, BigPanda accelerates incident and outage resolution. It also automates and streamlines the incident response lifecycle across triage, ticketing, notifications, and war room creation, and it integrates with enterprise runbook automation tools to speed up remediation. Applications and cloud services are the lifeblood of every company, and an outage affects everyone. BigPanda cements its AIOps market leadership with $190M in funding and a $1.2B valuation.
StackPilot
StackPilot is an AI-powered on-call copilot that automates root cause analysis and bug fixes for software engineers. It integrates directly with observability tools like Datadog, Sentry, and PagerDuty to turn alerts into actionable fixes. The platform analyzes recent commits, logs, and stack traces to pinpoint faulty code, then generates pull requests with proposed solutions. Engineers only need to review and merge, which cuts resolution time from hours to an average of 15 minutes. StackPilot also captures investigative steps and converts them into reusable runbooks, improving incident response over time. Its privacy measures (no code or logs are stored) keep real-time analysis secure for engineering teams.
NeuBird
NeuBird’s flagship product, Hawkeye (Agentic AI SRE), is an AI-powered Site Reliability Engineering platform that continuously monitors telemetry from across your observability stack: logs, metrics, traces, alerts, and incident tickets. It detects issues, analyzes root causes, and proposes or automates practical remediation in real time, without requiring manual investigation. Built for enterprise-grade environments, Hawkeye integrates securely with existing monitoring and incident management tools (such as Datadog, Splunk, PagerDuty, Prometheus, ServiceNow, AWS CloudWatch, and Azure Monitor), correlates signals across disparate sources, and reasons contextually like a human engineer to surface actionable insights and reduce mean time to resolution (MTTR) by up to 90%. It is always on and can be deployed as SaaS or in a customer’s VPC with enterprise security controls, providing autonomous incident response and pattern recognition.
InsightFinder
The InsightFinder Unified Intelligence Engine (UIE) platform provides human-centered AI solutions for identifying incident root causes and for predicting and preventing production incidents. Powered by patented self-tuning unsupervised machine learning, InsightFinder continuously learns from metric time series, logs, traces, and triage threads from SREs and DevOps engineers to surface root causes and predict incidents at the source. Companies of all sizes have adopted the platform and seen that business-impacting incidents can be predicted hours ahead, with clearly pinpointed root causes. View a comprehensive overview of your IT Ops ecosystem, including patterns, trends, and team activities, along with calculations of overall downtime savings, labor cost savings, and the number of incidents resolved.