Company
Technology
DevOps/Observability Engineer
“DevOps/Observability Engineer. Skills: Observability, DevOps, AWS, Kubernetes. Design and implement observability architectures.”
Industry & Context.
Problem-solving; Debugging
What They're Looking For.
Must Have
8+ years of experience
Hands-on experience designing unified observability pipelines
Deep expertise in AWS observability services
Proven ability to build and manage large-scale log aggregation systems
Experience with Kubernetes (EKS) or containerized environments (ECS)
Advanced proficiency with Terraform or other Infrastructure as Code tools
Experience building alerting systems, dashboards, and monitoring frameworks
Nice to Have
Understanding of cost optimization strategies
What You'll Do.
Design observability architectures
Implement observability architectures
Build observability pipelines
Maintain observability pipelines
Develop log aggregation strategies
Develop log routing strategies
Integrate with systems
Create alerting frameworks
Create high-quality dashboards
Deploy observability infrastructure
Manage observability infrastructure
Support Kubernetes observability
Support container-based observability
Optimize observability systems
Collaborate with engineering teams
Improve system reliability
Improve monitoring standards
Improve incident response capabilities
How You'll Work.
Team & Collaboration
Engineering teams
Full Job Description
## Accountabilities
- Design and implement end-to-end observability architectures using OpenTelemetry, Prometheus, Grafana, and related tools across cloud environments.
- Build and maintain centralized observability pipelines across multi-account AWS environments, including CloudWatch, CloudTrail, and VPC Flow Logs.
- Develop scalable log aggregation and routing strategies, including filtering, noise reduction, and integration with systems such as Splunk HEC.
- Create advanced alerting frameworks and high-quality dashboards using Alertmanager, CloudWatch Alarms, and Grafana with PromQL.
- Deploy and manage observability infrastructure using Infrastructure as Code tools such as Terraform.
- Support Kubernetes and container-based observability across EKS and ECS environments.
- Optimize observability systems for performance, cost efficiency, and scalability in large-scale production environments.
- Collaborate with engineering teams to improve system reliability, monitoring standards, and incident response capabilities.

## Requirements
- 8+ years of experience in DevOps, Site Reliability Engineering, or Observability Engineering roles.
- Strong hands-on experience designing unified observability pipelines using OpenTelemetry, Prometheus, and Grafana.
- Deep expertise in AWS observability services including CloudWatch, CloudTrail, and cross-account telemetry strategies.
- Proven ability to build and manage large-scale log aggregation systems and optimize high-volume data pipelines.
- Strong experience with Kubernetes (EKS) or containerized environments (ECS) in production settings.
- Advanced proficiency with Terraform or other Infrastructure as Code tools for infrastructure and observability deployments.
- Experience building alerting systems, dashboards, and monitoring frameworks for distributed systems.
- Strong understanding of cost optimization strategies for observability platforms (log filtering, metric reduction, storage tiering).
- Excellent problem-solving, debugging, and collaboration skills i
How to Apply on Lever
- Lever uses a streamlined one-page form — apply in under 5 minutes.
- LinkedIn import works well; review parsed data before submitting.
- The cover letter field is optional but visible to reviewers — use it to differentiate.
- Referral codes from employees can significantly boost visibility of your application.