Grafana Labs

Technology

Staff Software Engineer - Platform, SysEng

CA$175–250k (AI estimate) · Canada · Remote Friendly

Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is optimal for Senior candidates.

The Brief

“Staff Software Engineer - Platform, SysEng at Grafana Labs. Skills: Platform Engineering, Distributed Systems, Cloud Infrastructure, System Reliability. Build a highly available, low-latency stack that processes and stores data.”

What You'll Achieve.

Reduce new region build timelines; Improve performance; Increase reliability; Operate efficiently and effectively

Industry & Context.

Technology

Problems you'll solve

Root cause analysis; Troubleshooting

Eligibility Requirements

On-call rotations

What They're Looking For.

Must Have

Proven delivery of large distributed systems, Experience shipping and operating complex systems, Clear evidence of technical leadership, Demonstrable experience in system design, Deep understanding of tradeoffs, Hands-on cloud and platform experience, Solid experience with cloud-native architectures, Operational practices for cloud-native architectures, Reliability and performance ownership, Comfortable defining SLOs/SLIs, Capacity planning experience, Performance tuning experience, Driving reliability work end-to-end, Excellent coding skills, Excellent design skills, Experience with Go, Experience with Python, Experience with Shell, Experience operating your code

Nice to Have

Experience with Kubernetes, Experience with IaC, Experience with Jsonnet

What You'll Do.

Build a highly available, low-latency stack

Process and store data

Serve data to dashboards

Serve data to alerting tools

Operate efficiently and effectively

Provide application engineers with tools

Provide application engineers with systems

Provide application engineers with Kubernetes clusters

Take projects from conception to production

Manage cloud infrastructure

Ensure US Federal compliance

Deploy production services

Reduce new region build timelines

Meet customer demands

Manage infrastructure for teams

Build cherished tools

Work with management structures

Work with distributed systems

Approach development holistically

Look at developer feedback

Perform integration testing

Look at the big picture

Build better platforms

Invest in developer productivity

Use AI coding assistants

Follow up on incidents

How You'll Work.

Team & Collaboration

Cross-functional squads; Remote-first communication; Team decision making

Communication Scope

Technical documentation

Process & Methodology

Roadmap planning

Full Job Description

Grafana Labs, the company behind the open observability cloud, is founded on the principles of open source, open standards, open ecosystems, and open culture. Grafana Cloud, our fully managed observability platform, is flexible and built for scale. With Grafana Cloud's actually useful AI, organizations can see, understand, and act on all their disparate data to move at the speed of their ambitions.

Today, more than 35 million users and 7,000+ customers – including Anthropic, Bloomberg, NVIDIA, Microsoft, and Salesforce – trust Grafana Labs to ensure reliability of their applications and systems, resolve incidents quickly, and optimize their telemetry to reduce noise and cost. We are a 100% remote company with 1,600+ team members across 40+ countries, and we’re backed by leading investors including Lightspeed Venture Partners, Sequoia Capital, GIC, Coatue, J. P. Morgan, CapitalG, and Lead Edge Capital. Learn more at grafana.com and follow us on LinkedIn and X.

We’re scaling fast and staying true to what makes us different: an open-source legacy, a global collaborative culture, and a passion for meaningful work. Our team thrives in an innovation-driven environment where transparency, autonomy, and trust fuel everything we do.

You may not meet every requirement, and that’s okay. If this role excites you, we’d love you to raise your hand for what could be a truly career-defining opportunity.

This is a remote opportunity and we would be interested in applicants located in USA time zones (EST + CST highly preferred).

Staff Backend Engineer - Platform SysEng

The Opportunity:

Grafana Cloud moves millions of metrics, log lines, and traces per second from our customers' environments into a highly available, low-latency stack that processes and stores this data, and serves them to dashboards and alerting tools. We aim to grow this to hundreds of millions per second, and it's critical that as we grow, we improve our performance, increase our reliability, and, of course, do it e

How to Apply on Greenhouse

  • Create a Greenhouse profile before applying — it saves time across multiple applications.
  • Upload your resume as a PDF; the parser handles it better than Word.
  • Answer all knockout questions carefully — wrong answers auto-reject before a human sees you.
  • Enable email notifications to track application status in real time.
