AWS CloudWatch is a cloud-native monitoring service that reports the health, state, and utilization of AWS infrastructure and managed services. It provides metrics, logs, and events for resources such as EC2, ECS, EKS, AWS Batch, and storage services. Tracer complements CloudWatch by observing how workloads actually execute on that infrastructure and by attributing resource usage and cost to pipelines, tasks, and runs rather than to infrastructure alone.Documentation Index
Fetch the complete documentation index at: https://opensre.com/docs/llms.txt
Use this file to discover all available pages before exploring further.
If you’re new to Tracer or want a conceptual overview, see How Tracer fits in your stack.
What cloud-native monitoring does well
Cloud-native monitoring tools such as CloudWatch are designed to report the state of cloud resources. They provide:- Metrics emitted by AWS services and infrastructure
- Instance, container, and service health signals
- Logs and events from managed resources
- Alerts based on thresholds and service state
Where cloud-native metrics stop
Cloud-native monitoring relies on service-reported metrics and resource state. It does not observe execution behavior inside workloads and does not have awareness of pipeline or task semantics. It does not show:- What individual processes or containers are doing at runtime
- Whether allocated resources are actively used or idle
- Execution stalls caused by I/O or network contention
- Short-lived jobs that complete between reporting intervals
- Why infrastructure remains allocated after workloads finish
- How cost maps to specific pipelines, tasks, or tools
Infrastructure state versus execution behavior
Cloud-native monitoring answers questions such as:- Which instances are running?
- How much capacity is allocated?
- Are services healthy?
- Which task or process consumed those resources
- Whether work was compute-bound, I/O-bound, or idle
- Why infrastructure remained allocated without active execution
What Tracer adds
Tracer observes execution directly from the operating system and container runtime. When used alongside cloud-native monitoring, it adds:- Execution-level visibility for pipelines, runs, tasks, and tools
- Observed CPU, memory, disk, and network behavior
- Insight into stalls, idle execution, and contention
- Attribution of resource usage and cost to observed execution
Infrastructure state analysis with Tracer/sweep
Tracer/sweep extends Tracer’s visibility to cloud infrastructure state using read-only access. It identifies:- Idle compute resources with no active execution
- Orphaned nodes no longer associated with schedulers or clusters
- Unattached or residual storage left behind by completed workloads
Automation and control boundaries
Tracer’s products do not perform automated actions.- They do not start, stop, or resize infrastructure
- They do not make predictions about future workload requirements
- They do not act on forecasts or heuristics
Example: service metrics versus observed execution
CloudWatch shows high instance utilization during a pipeline run. Tracer reveals that:- CPU usage remains low
- Tasks spend most of their time blocked on disk I/O
- Instances remain allocated after execution completes
Observability comparison
This comparison highlights the difference between cloud service metrics and execution-level observation.
What Tracer does not replace
Tracer is not a cloud control plane.- It does not replace CloudWatch metrics, logs, or alarms
- It does not manage AWS resources or services
- It does not replace AWS-native monitoring outside execution behavior
When to use Tracer with cloud-native monitoring
Tracer is most useful alongside CloudWatch when teams need to:- Explain slow or inconsistent pipeline runtimes
- Distinguish execution bottlenecks from infrastructure waste
- Identify idle or orphaned compute and storage resources
- Attribute cost to pipelines, tasks, or tools rather than instances
Tracer