OneTrust
Technology
SeniorSoftwareEngineer-SRE
Neural analysis suggests this role is
optimal for Senior candidates.
“Senior Software Engineer - SRE at OneTrust. Skills: Site Reliability Engineering, Cloud infrastructure, AI systems deployment, Observability. Own production services end-to-end. Ensure reliability”
What You'll Achieve.
Reduce MTTR; Improve alert quality
Industry & Context.
Root cause analysis; Anomaly triage
On-call rotation
What They're Looking For.
Must Have
4+ yrs. of application development experience, Java or other equivalent language experience, Experience with Spring environment, Experience in cloud-based infrastructure, Experience with software application performance factors, Understanding of centralizing logging, metrics, dashboards, alerting, Good understanding of databases, Hands-on experience with observability tools, Familiarity with CI/CD pipelines, Familiarity with infrastructure-as-code, Build and operate AI-assisted incident response systems, Develop or integrate LLM-based tools, Apply machine learning techniques, Experience deploying AI systems in production, Familiarity with vector databases, Familiarity with embeddings, Familiarity with RAG architectures, Understanding of prompt engineering, Understanding of evaluation of LLM outputs, Kubernetes and container orchestration experience, Experience with distributed systems at scale, Familiarity with service meshes, Familiarity with microservices architecture
Nice to Have
Experience with chaos engineering tools, Background in product-facing services, Knowledge of incident management platforms
What You'll Do.
Own production services end-to-end
Ensure operational excellence
Participate in on-call rotation
Lead incident response
Engage and partner with Engineering teams
Engage and partner with Operations teams
Engage and partner with Product teams
Design application platform
Deliver application platform
Maintain application platform
Collaborate with functional groups
Enforce error budgets
Improve alert quality
Focus on actionable alerts
Embed with product teams
Catch reliability risks
Share findings with leadership
Build scripts for automation
Build scripts for incident response
Provide understanding of services
Provide solutions for monitoring
Provide solutions for automating services
Build AI-assisted incident response systems
Develop LLM-based tools
Integrate LLM-based tools
Apply machine learning techniques
Deploy AI systems in production
How You'll Work.
Team & Collaboration
Engineering teams; Operations teams; Product teams; Functional groups; Product teams; Engineering organization; Technical leadership; Senior management
Full Job Description
Strength in Trust OneTrust’s mission is to enable innovation through the responsible use of data and AI. We believe that ensuring data is trusted shouldn’t slow teams down—it should accelerate what’s possible. This led us to develop the first technology platform for responsible data use in 2016. Today, with AI representing the latest and most impactful expansion of data yet, OneTrust is once again redefining what responsible innovation looks like. OneTrust, the AI‑Ready Governance Platform™, unifies regulatory intelligence, automation, and connected governance workflows so businesses can continue to move at the speed of AI while ensuring good governance to prevent data misuse at scale. Trusted by thousands of organizations worldwide, OneTrust is shaping the future where trusted data becomes a transformative force for business and society. The Challenge Own production services end-to-end, including reliability, scalability, and operational excellence Participate in on-call rotation and lead incident response Your Mission Engage and partner with various Engineering, Operations, and Product teams to design, deliver, and maintain a highly available and performant application platform. Collaborate with different functional groups to identify gaps, prioritize, and resolve issues Defining, implementing, and maintaining SLIs and SLOs aligned with customer experience. Design and instrument SLIs such as latency, error rates, and availability across critical services Manage and enforce error budgets to balance system reliability with product feature velocity. Improving alert quality by reducing noise and focusing on actionable, high-signal alerts Embed with product teams to review architectures and catch reliability risks early Share your knowledge and experience with the Engineering organization Share your findings with technical leadership and senior management Build scripts in python/bash/java or ruby for operational automation and incident response You Are A hands-on
Applying for this Senior Software Engineer - SRE role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Greenhouse
- Create a Greenhouse profile before applying — it saves time across multiple applications.
- Upload your resume as a PDF; the parser handles it better than Word.
- Answer all knockout questions carefully — wrong answers auto-reject before a human sees you.
- Enable email notifications to track application status in real time.
ANONYMOUS · UNFILTERED
What do employees actually say about OneTrust?
Real rants from real employees. Read before you apply.