impact. com

commerce partnership marketing

SiteReliabilityEngineer

$110–130k victoria, central and western, hong kong
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is
optimal for Mid+ candidates.

The Brief

“Site Reliability Engineer at impact. com. Skills: Site Reliability Engineering, Observability, OpenTelemetry, Java, C#, Grafana ecosystem, API reliability. champion of performance and stability for our core application ecosystem. ensure that our high-frequency data ingestion pipelines and customer-facing applications meet strict performance and error rate benchmarks”

Industry & Context.

commerce partnership marketing
Problems you'll solve

root-cause analysis (RCA) for complex distributed system failures; Full-Stack Troubleshooting; identify bottlenecks in cross-service communication

Eligibility Requirements

5% variable annual bonus contingent on Company performance, eligible to receive Restricted Stock Unit (RSU) grant, Responsible PTO policy, up to 12 fully covered therapy/coaching sessions per year, monthly gym reimbursement policy, Restricted Stock Units (RSUs) as part of our total compensation, giving you a stake in the company's growth with a 3-year vesting schedule, pending Board approval, 26 weeks of fully paid leave for the primary caregiver, 13 weeks fully paid leave for the secondary caregiver, technology stipend to help you set up your home office, monthly allowance to cover your internet expenses

What They're Looking For.

Must Have

proficiency in Java or C#, comfortable reading, debugging, and instrumenting application code, Hands-on experience with OpenTelemetry, including auto-instrumentation, manual spans, and collector configuration, Deep experience with the Grafana ecosystem (Prometheus, Tempo, Loki) or similar distributed tracing platforms (Jaeger, Honeycomb, Datadog), Experience working with high-volume REST/Graph APIs and an understanding of OAuth flows, rate-limiting, and webhooks, Solid understanding of how Java/C# applications interact with the underlying infrastructure, Ability to prioritize tasks in a high-velocity environment and a focus on building "self-healing" systems rather than manual fixes

Nice to Have

Affiliate & Partnerships Industry Fundamentals Certification by PXA

What You'll Do.

champion of performance and stability for our core application ecosystem

ensure that our high-frequency data ingestion pipelines and customer-facing applications meet strict performance and error rate benchmarks

bridge the gap between code and infrastructure

implementing OpenTelemetry (OTel) standards across the stack

provide deep visibility into how we interact with external social APIs and how our internal services communicate

architect of our observability pipeline

Implement and maintain OpenTelemetry instrumentation across Java and C# services

ensure high-fidelity traces

Build integration tests with third-party social APIs

setup the appropriate monitoring and alerting systems to ensure high availability and reliability

Build and enhance Grafana dashboards and alerting systems that track the "Golden Signals" (Latency

Saturation) specifically tailored for JVM and. NET environments

Drive root-cause analysis (RCA) for complex distributed system failures

contribute to remediations through code optimizations or infrastructure adjustments

Leverage tracing data to identify bottlenecks in cross-service communication and optimize the path of data from social APIs to our internal stores

Debug issues across the entire stack

from containerized application code (Java/C#) down to network calls and cloud resource utilization

Analyze application usage patterns to inform scaling decisions

ensuring we handle social data bursts without compromising stability or overspending on cloud costs

How You'll Work.

Team & Collaboration

Working closely with our Java and C# engineering squads

Full Job Description

The Company: impact.com is the world’s leading commerce partnership marketing platform, transforming the way businesses grow by enabling them to discover, manage, and scale partnerships across the entire customer journey. From affiliates and influencers to content publishers, brand ambassadors, and customer advocates, impact.com empowers brands to drive trusted, performance-based growth through authentic relationships. Its award-winning products—Performance (affiliate), Creator (influencer), and Advocate (customer referral)—unify every type of partner into one integrated platform. As consumers increasingly rely on recommendations from people and communities they trust, impact.com helps brands show up where it matters most. Today, over 5,000 global brands, including Walmart, Uber, Shopify, Lenovo, L’Oréal, and Fanatics, rely on impact.com to power more than 225,000 partnerships that deliver measurable business results. Your Role at impact.com: As a Site Reliability Engineer, you'll be the champion of performance and stability for our core application ecosystem. Working closely with our Java and C# engineering squads, you'll ensure that our high-frequency data ingestion pipelines and customer-facing applications meet strict performance and error rate benchmarks. Your mission is to bridge the gap between code and infrastructure, implementing OpenTelemetry (OTel) standards across the stack to provide deep visibility into how we interact with external social APIs and how our internal services communicate. What You'll Do: OTel Orchestration: Become the architect of our observability pipeline. Implement and maintain OpenTelemetry instrumentation across Java and C# services to ensure high-fidelity traces, metrics, and logs. API Reliability: Build integration tests with third-party social APIs and setup the appropriate monitoring and alerting systems to ensure high availability and reliability. Health & Performance: Build and enhance Grafana dashboards and alerting systems t

Free ATS check

Applying for this Site Reliability Engineer role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

How to Apply on Greenhouse

  • Create a Greenhouse profile before applying — it saves time across multiple applications.
  • Upload your resume as a PDF; the parser handles it better than Word.
  • Answer all knockout questions carefully — wrong answers auto-reject before a human sees you.
  • Enable email notifications to track application status in real time.

ANONYMOUS · UNFILTERED

What do employees actually say about impact. com?

Real rants from real employees. Read before you apply.

Read Company Rants →