TWG Global AI
Technology
Platform/SiteReliabilityEngineer(UK)
Neural analysis suggests this role is
optimal for Mid candidates.
“Platform / Site Reliability Engineer (UK) at TWG Global AI. Skills: Site Reliability Engineering, MLOps, Cloud Platforms. Build and maintain infrastructure for ML workloads. Implement observability tools”
What You'll Achieve.
Ensure scalability; Ensure stability; Ensure performance; Reduce operational overhead
Industry & Context.
Problem-solving; Incident response; Root-cause resolution
Based in the UK, 24/7 coverage
What They're Looking For.
Must Have
3–6 years of experience in DevOps, SRE, or backend engineering, Proficient with Docker, Kubernetes, Terraform, GitLab/GitHub Actions, Airflow, Scripting in Python or Bash, Familiarity with Linux environments, Knowledge of observability stacks (Prometheus, Grafana, ELK, Datadog), Familiarity with cloud platforms (AWS, GCP, or Azure), Documentation skills, Problem-solving skills, Incident response skills
Nice to Have
Experience supporting ML/AI workflows using Palantir Foundry, Exposure to compliance frameworks (SOC 2, ISO 27001, financial regulations), Knowledge of MLOps frameworks (MLflow, Kubeflow, SageMaker Pipelines), Ability to automate deployments, testing, and monitoring at scale
What You'll Do.
Build and maintain infrastructure for ML workloads
Implement observability tools
Monitor model performance
Monitor system uptime
Design CI/CD pipelines
Manage CI/CD pipelines
Ensure high availability
Ensure disaster recovery
Ensure rollback capabilities
Manage access controls
Manage security policies
Troubleshoot incidents
Drive root-cause resolution
Provide 24/7 coverage
How You'll Work.
Team & Collaboration
Data scientists; ML engineers; Platform vendors; U.S. teams; International teams; Compliance teams; IT teams
Process & Methodology
CI/CD
Full Job Description
At TWG Group Holdings, LLC (“TWG Global”), we drive innovation and business transformation across a range of industries—including financial services, insurance, technology, media, and sports—by leveraging data and AI as core assets. Our AI-first, cloud-native approach delivers real-time intelligence and interactive business applications, empowering informed decision-making for both customers and employees. We prioritize responsible data and AI practices to ensure ethical standards and regulatory compliance. Our decentralized structure enables each business unit to operate autonomously, supported by a central AI Solutions Group, while strategic partnerships with leading data and AI vendors fuel game-changing efforts in marketing, operations, and product development. You will collaborate with management to advance our data and analytics transformation, enhance productivity, and enable agile, data-driven decisions. By leveraging relationships with top tech startups and universities, you will help create competitive advantages and drive enterprise innovation. At TWG Global, your contributions will support our goal of sustained growth and superior returns, as we deliver rare value and impact across our businesses. We’re a fast-growing AI/ML team delivering high-impact use case solutions to financial institutions, insurers, and other regulated enterprises. Backed by proven leaders in finance and national security, our team is scaling rapidly to serve clients across North America with robust, secure, and production-grade AI solutions. **Role Overview** We are seeking a **Platform / Site Reliability Engineer (SRE)** to ensure the scalability, stability, and performance of our data platforms and ML infrastructure. You’ll work closely with data scientists, ML engineers, and platform vendors to deploy and monitor production systems, automate workflows, and reduce operational overhead. **What you'll do:** * Build and maintain infrastructure to support real-time and batch ML wor
Applying for this Platform / Site Reliability Engineer (UK) role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
ANONYMOUS · UNFILTERED
What do employees actually say about TWG Global AI?
Real rants from real employees. Read before you apply.