Grafana Labs
Technology
StaffSoftwareEngineer-Platform,SysEng
Neural analysis suggests this role is
optimal for Senior candidates.
“Staff Software Engineer - Platform, SysEng at Grafana Labs. Skills: Platform Engineering, Distributed Systems, Cloud Infrastructure, System Reliability. Build highly available, low-latency stack. Process and store data”
What You'll Achieve.
Reduce new region build timelines; Improve performance; Increase reliability; Operate efficiently and effectively
Industry & Context.
Root cause analysis; Troubleshooting
On-call rotations
What They're Looking For.
Must Have
Proven delivery of large distributed systems, Experience shipping and operating complex systems, Clear evidence of technical leadership, Demonstrable experience in system design, Deep understanding of tradeoffs, Hands-on cloud and platform experience, Solid experience with cloud-native architectures, Operational practices for cloud-native architectures, Reliability and performance ownership, Comfortable defining SLOs/SLIs, Capacity planning experience, Performance tuning experience, Driving reliability work end-to-end, Excellent coding skills, Excellent design skills, Experience with Go, Experience with Python, Experience with Shell, Experience operating your code
Nice to Have
Experience with Kubernetes, Experience with IaC, Experience with Jsonnet
What You'll Do.
Build highly available
Process and store data
Serve data to dashboards
Serve data to alerting tools
Operate efficiently and effectively
Provide application engineers tools
Provide application engineers systems
Provide application engineers Kubernetes clusters
Take projects from conception to production
Manage cloud infrastructure
Ensure US Federal compliance
Deploy production services
Reduce new region build timelines
Meet customer demands
Manage infrastructure for teams
Build cherished tools
Work with management structures
Work with distributed systems
Approach development holistically
Look at developer feedback
Perform integration testing
Look at the big picture
Build better platforms
Invest in developer productivity
Use AI coding assistants
Follow up on incidents
How You'll Work.
Team & Collaboration
Cross-functional squads; Remote-first communication; Team decision making
Communication Scope
Technical documentation
Process & Methodology
Roadmap planning
Full Job Description
Grafana Labs, the company behind the open observability cloud, is founded on the principles of open source, open standards, open ecosystems, and open culture. Grafana Cloud, our fully managed observability platform, is flexible and built for scale. With Grafana Cloud's actually useful AI, organizations can see, understand, and act on all their disparate data to move at the speed of their ambitions. Today, more than 35 million users and 7,000+ customers – including Anthropic, Bloomberg, NVIDIA, Microsoft, and Salesforce – trust Grafana Labs to ensure reliability of their applications and systems, resolve incidents quickly, and optimize their telemetry to reduce noise and cost. We are a 100% remote company with 1,600+ team members across 40+ countries, and we’re backed by leading investors including Lightspeed Venture Partners, Sequoia Capital, GIC, Coatue, J. P. Morgan, CapitalG, and Lead Edge Capital. Learn more at grafana.com and follow us on LinkedIn and X. We’re scaling fast and staying true to what makes us different: an open-source legacy, a global collaborative culture, and a passion for meaningful work. Our team thrives in an innovation-driven environment where transparency, autonomy, and trust fuel everything we do. You may not meet every requirement, and that’s okay. If this role excites you, we’d love you to raise your hand for what could be a truly career-defining opportunity. This is a remote opportunity and we would be interested in applicants located in USA time zones (EST + CST highly preferred). Staff Backend Engineer - Platform SysEng The Opportunity: Grafana Cloud moves millions of metrics, log lines, and traces per second from our customers' environments into a highly available, low-latency stack that processes and stores this data, and serves them to dashboards and alerting tools. We aim to grow this to hundreds of millions per second, and it's critical that as we grow, we improve our performance, increase our reliability, and, of course, do it e
Applying for this Staff Software Engineer - Platform, SysEng role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Greenhouse
- Create a Greenhouse profile before applying — it saves time across multiple applications.
- Upload your resume as a PDF; the parser handles it better than Word.
- Answer all knockout questions carefully — wrong answers auto-reject before a human sees you.
- Enable email notifications to track application status in real time.
ANONYMOUS · UNFILTERED
What do employees actually say about Grafana Labs?
Real rants from real employees. Read before you apply.