Services

The Services page shows performance data for every service, at both the service level and the operation level.

Service-Level Metrics

Open a single service to inspect its health, error rate, and latency (average, minimum, maximum). From there you can jump directly to the related logs, metrics, or traces for root cause analysis. Each service card displays:

  • Health Status: Real-time health indicator
  • Error Rate: Percentage of failed requests
  • Latency Distribution: Average, minimum, and maximum response times
  • Quick Navigation: Jump to related traces, logs, or detailed metrics
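As a rough illustration of how the error rate and latency figures above relate to raw request data, the following sketch derives them from a list of requests. The `Request` type and the `summarize` function are hypothetical names introduced for this example; the page itself does not describe how the product computes these values.

```python
from dataclasses import dataclass


@dataclass
class Request:
    """Hypothetical record of one request handled by a service."""
    duration_ms: float
    failed: bool


def summarize(requests):
    """Return error rate (percent) and avg/min/max latency for a list of requests."""
    if not requests:
        return None  # nothing to summarize
    durations = [r.duration_ms for r in requests]
    failed = sum(1 for r in requests if r.failed)
    return {
        "error_rate_pct": 100.0 * failed / len(requests),
        "avg_ms": sum(durations) / len(durations),
        "min_ms": min(durations),
        "max_ms": max(durations),
    }
```

For example, four requests of 100, 200, 300, and 400 ms with one failure give a 25% error rate and an average latency of 250 ms.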

Operation-Level Insights

Each operation, such as "Create User", is tracked with its own latency metrics, including P95. You can also see which source and target services are involved in every call. Use operation-level data to:

  • Monitor latency, APDEX scores, and operations-per-second
  • Detect regressions or spikes as they appear
  • Inspect the full execution, including the HTTP or gRPC method
  • Identify bottlenecks in user-facing flows
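For readers unfamiliar with APDEX, the sketch below uses the standard Apdex definition: requests at or below a threshold T count as satisfied, requests between T and 4T count as tolerating (half weight), and slower requests count as frustrated. The 500 ms default threshold and the function name are assumptions made for this example; the page does not state which threshold the product uses.

```python
def apdex(durations_ms, threshold_ms=500.0):
    """Standard Apdex score in [0, 1]; threshold_ms is an assumed value of T."""
    if not durations_ms:
        return None  # score is undefined without samples
    satisfied = sum(1 for d in durations_ms if d <= threshold_ms)
    tolerating = sum(
        1 for d in durations_ms if threshold_ms < d <= 4 * threshold_ms
    )
    # Frustrated requests (slower than 4T) contribute nothing to the score.
    return (satisfied + tolerating / 2) / len(durations_ms)
```

With T = 500 ms, the durations 100, 400, 600, 1900, and 2500 ms yield two satisfied, two tolerating, and one frustrated request, for a score of (2 + 1) / 5 = 0.6.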

Overview Cards:

  1. Total Services: Count of all services
  2. Average Latency: Overall average response time
  3. Total Calls: Sum of all requests
  4. Error Rate: Percentage of failed requests
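The four overview values can be thought of as aggregates over per-service statistics. The sketch below is one plausible way to compute them; the `ServiceStats` fields are hypothetical, and weighting the overall average latency by call count is an assumption, since the page does not say how the average is formed.

```python
from dataclasses import dataclass


@dataclass
class ServiceStats:
    """Hypothetical per-service statistics for the selected time range."""
    name: str
    calls: int
    avg_latency_ms: float
    errors: int


def overview(services):
    """Aggregate per-service stats into the four overview card values."""
    total_calls = sum(s.calls for s in services)
    total_errors = sum(s.errors for s in services)
    # Assumption: weight each service's average latency by its call count.
    weighted_latency = sum(s.avg_latency_ms * s.calls for s in services)
    return {
        "total_services": len(services),
        "avg_latency_ms": weighted_latency / total_calls if total_calls else 0.0,
        "total_calls": total_calls,
        "error_rate_pct": 100.0 * total_errors / total_calls if total_calls else 0.0,
    }
```

A call-weighted average keeps a rarely called slow service from dominating the overall figure; an unweighted mean of per-service averages would be an equally valid reading of "overall average".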

Services Grid:

Each service is displayed in a card showing:

  • Service Name with icon
  • Type: HTTP, gRPC, Database, etc.
  • Status: Health indicator
  • Metrics:
    • Avg Duration
    • Calls/sec
    • Error Count
    • Success Rate

Sorting:

  1. Status (error → warning → healthy)
  2. Latency (highest first)
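The two sort orders can be sketched as follows. This assumes each service is a dictionary with `status` and `avg_ms` keys, which are hypothetical field names, and it treats the two orders as independent options; the page does not say whether they can be combined.

```python
# Lower number sorts first: error, then warning, then healthy.
SEVERITY = {"error": 0, "warning": 1, "healthy": 2}


def sort_by_status(services):
    """Order services from most to least severe status."""
    return sorted(services, key=lambda s: SEVERITY[s["status"]])


def sort_by_latency(services):
    """Order services from highest to lowest average latency."""
    return sorted(services, key=lambda s: s["avg_ms"], reverse=True)
```

Because Python's `sorted` is stable, services that share a status keep their previous relative order, so sorting by latency first and then by status would rank the slowest services first within each status group.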

Filtering:

  • Filter by service name (contains)
  • Filter by service type
  • Filter by status
  • Filter by latency range
  • Filter by time range
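The filters above can be modelled as optional criteria that must all match. In this sketch the field names (`name`, `type`, `status`, `avg_ms`) and the case-insensitive name match are assumptions for illustration. The time range filter is left out because it would most likely select which data the metrics are computed from rather than which cards are shown.

```python
def matches(service, name=None, service_type=None, status=None,
            min_latency_ms=None, max_latency_ms=None):
    """Return True if the service satisfies every criterion that is set."""
    # "Contains" match on the service name (assumed case-insensitive).
    if name is not None and name.lower() not in service["name"].lower():
        return False
    if service_type is not None and service["type"] != service_type:
        return False
    if status is not None and service["status"] != status:
        return False
    # Latency range bounds are inclusive.
    if min_latency_ms is not None and service["avg_ms"] < min_latency_ms:
        return False
    if max_latency_ms is not None and service["avg_ms"] > max_latency_ms:
        return False
    return True


def filter_services(services, **criteria):
    """Keep only the services that match all given criteria."""
    return [s for s in services if matches(s, **criteria)]
```

For example, `filter_services(services, name="user", service_type="HTTP")` would keep only HTTP services whose name contains "user".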

Service Detail:

  1. Service Detail – Call Metrics:
    • Analyze detailed call metrics for the selected service
    • Latency percentiles (P50, P90, P99)
    • Throughput and performance anomalies over time
    • Real-time performance graphs
  2. Service Detail – Operations Insight:
    • Dive into operation-level performance
    • Monitor latency, APDEX scores, and operations-per-second
    • Detect regressions or spikes as they appear
    • Identify which operations need optimization
  3. Performance Timeline:
    • Latency over time (line chart)
    • Request rate over time (bar chart)
    • Error rate over time (area chart)
    • Interactive charts with zoom and pan
  4. Operations List:
    • All operations within the service
    • Sorted by latency (highest first)
    • Click to see operation details
    • Filter and search operations
  5. Recent Traces:
    • Latest traces for this service
    • Click to open full trace view
    • See trace duration and status
  6. Related Logs:
    • Logs from this service
    • Automatically filtered by service name
    • Correlated with traces for debugging
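The latency percentiles listed under Call Metrics (P50, P90, P99) can be illustrated with the nearest-rank method shown below. This is a minimal sketch, not the product's implementation: monitoring systems often estimate percentiles from histograms or sketches instead of sorting raw samples, and the function name is hypothetical.

```python
import math


def percentile(durations_ms, p):
    """Nearest-rank percentile: smallest sample with at least p% of samples at or below it."""
    if not durations_ms:
        raise ValueError("percentile of an empty sample is undefined")
    ordered = sorted(durations_ms)
    # Rank is 1-based; clamp to 1 so that p = 0 returns the minimum.
    rank = max(math.ceil(p * len(ordered) / 100), 1)
    return ordered[rank - 1]
```

Reading P50 alongside P99 shows how far the slowest requests sit from the typical one, which an average alone would hide.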