Company
Technology
PrincipalEngineer-CloudObservability
Neural analysis suggests this role is
optimal for Senior candidates.
“Principal Engineer - Cloud Observability. Skills: Cloud observability, Distributed systems, System design. Lead design of cloud observability systems. Lead evolution of cloud observability systems”
Industry & Context.
Debugging complex production issues
On-call environments, High-availability environments
What They're Looking For.
Must Have
15+ years software engineering experience, Deep expertise in system design, Deep expertise in distributed architecture, Deep expertise in concurrency, Deep expertise in multi-threaded programming, Proven experience building cloud services, Proven experience operating cloud services, Proven experience scaling cloud services, Understanding of systems fundamentals, Ability to take ideas from concept to production, Experience debugging complex production issues, Experience working in on-call environments, Experience working in high-availability environments, Leadership and communication skills, Degree in Computer Science, Degree in Engineering, Equivalent practical experience, Ability to balance short-term delivery, Ability to balance long-term architectural scalability
Nice to Have
Experience with observability systems, Experience with streaming technologies, Experience with Kafka, Experience with Flink, Experience with Druid, Experience with OpenSearch, Exposure to stream processing systems, Exposure to query processing systems, Interest in technical evangelism
What You'll Do.
Lead design of cloud observability systems
Lead evolution of cloud observability systems
Provide deep insights into performance
Provide deep insights into health
Provide deep insights into reliability
Define technical strategy for observability
Drive technical strategy for observability
Define roadmap for observability
Drive roadmap for observability
Architect scalable systems
Architect resilient systems
Handle massive growth
Handle near real-time data
Review system designs
Review architecture decisions
Ensure high engineering standards
Ensure operational excellence
Mentor senior engineers
Guide senior engineers
Foster technical growth
Foster engineering culture
Champion reliability best practices
Champion performance best practices
Champion observability best practices
Contribute hands-on to system design
Contribute hands-on to debugging
Contribute hands-on to production issue resolution
Establish engineering processes
Improve engineering processes
How You'll Work.
Team & Collaboration
Collaborate with product managers; Collaborate with engineering leaders; Collaborate with cross-functional stakeholders
Communication Scope
Influence technical decisions
Process & Methodology
Roadmap planning
Full Job Description
## Accountabilities Lead the design and evolution of cloud observability systems that provide deep insights into performance, health, and reliability of distributed data platforms. Define and drive technical strategy and roadmap for observability capabilities across cloud and hybrid environments. Architect scalable, resilient systems capable of handling massive growth and near real-time data requirements. Review system designs, architecture decisions, and code to ensure high engineering standards and operational excellence. Collaborate with product managers, engineering leaders, and cross-functional stakeholders to align technical execution with business goals. Mentor and guide senior engineers and teams, fostering technical growth, ownership, and a strong engineering culture. Champion reliability, performance, and observability best practices across engineering organizations. Contribute hands-on to system design, debugging, and production issue resolution when needed. Establish and improve engineering processes that enhance execution quality, delivery speed, and system stability. Requirements 15+ years of hands-on software engineering experience with strong exposure to large-scale distributed systems. Deep expertise in system design, distributed architecture, concurrency, and multi-threaded programming. Proven experience building, operating, and scaling production-grade cloud services. Strong understanding of systems fundamentals including networking, storage, and operating systems. Ability to take ideas from concept to production with strong execution and ownership mindset. Experience debugging complex production issues and working in on-call, high-availability environments. Strong leadership and communication skills with the ability to influence technical decisions across teams and leadership levels. Degree in Computer Science, Engineering, or equivalent practical experience. Strong ability to balance short-term delivery with long-term architectural scalability.
Applying for this Principal Engineer - Cloud Observability role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Lever
- Lever uses a streamlined one-page form — apply in under 5 minutes.
- LinkedIn import works well; review parsed data before submitting.
- The cover letter field is optional but visible to reviewers — use it to differentiate.
- Referral codes from employees can significantly boost visibility of your application.
ANONYMOUS · UNFILTERED
What do employees actually say about this company?
Real rants from real employees. Read before you apply.