FIS

financial services

Site Reliability Engineer Principal (Software Engineering)

Atlanta, Georgia, United States; Jacksonville, Florida, United States; Milwaukee, Wisconsin, United States
Full Time · Hybrid (3 days onsite)

The Brief

“Site Reliability Engineer Principal (Software Engineering) at FIS. Skills: Site Reliability Engineering, Cloud Platforms (AWS, Azure, Google Cloud), Infrastructure as Code (IaC), Monitoring and Logging, Automation, CI/CD, Incident Management. Design and maintain monitoring solutions for infrastructure, application performance, and user experience. Implement automation tools to streamline tasks, scale infrastructure, and ensure seamless deployments”

Industry & Context.

financial services

Problems You'll Solve

Troubleshooting complex technical issues

Eligibility Requirements

Participate in on-call rotations; provide 24/7 support for critical incidents

What They're Looking For.

Must Have

  • Proficiency in development technologies, architectures, and platforms (web, API)
  • Experience with cloud platforms (AWS, Azure, Google Cloud)
  • Experience with IaC tools (Terraform)
  • Knowledge of monitoring tools (Prometheus, Grafana, DataDog)
  • Knowledge of logging frameworks (Splunk, ELK Stack)
  • Experience in incident management and post-mortem reviews
  • Troubleshooting skills for complex technical issues
  • Proficiency in scripting languages (Python, Bash)
  • Proficiency in automation tools (Terraform, Ansible)
  • Experience with CI/CD pipelines (Harness, Jenkins, GitLab CI/CD, Azure DevOps)

Nice to Have

Kubernetes a plus

What You'll Do.

Design and maintain monitoring solutions for infrastructure, application performance, and user experience

Implement automation tools to streamline tasks, scale infrastructure, and ensure seamless deployments

Ensure application reliability, availability, and performance, minimizing downtime and optimizing response times

Lead incident response, including identification, triage, resolution, and post-incident analysis

Conduct capacity planning, performance tuning, and resource optimization

Collaborate with security teams to implement best practices and ensure compliance

Manage deployment pipelines and configuration management for consistent and reliable app deployments

Develop and test disaster recovery plans and backup strategies

Participate in on-call rotations and provide 24/7 support for critical incidents

How You'll Work.

Team & Collaboration

Collaborate with security teams to implement best practices and ensure compliance; Collaborate with development, QA, DevOps, and product teams to align on reliability goals and incident response processes

Communication Scope

Excellent interpersonal communication; Negotiation; Influencing skills

Full Job Description

**Job Description**

Are you curious, motivated, and forward-thinking? At FIS you’ll have the opportunity to work on some of the most challenging and relevant issues in financial services and technology. Our talented people empower us, and we believe in being part of a team that is open, collaborative, entrepreneurial, passionate and above all fun.

**NOTE:** This position is hybrid (3 days onsite) in our FIS Office locations in Atlanta (GA), Jacksonville (FL), Milwaukee (WI).

**About the Team:**

This position is under our CTO org to support SRE functions for innovation and growth for the Banking Solutions, Payments and Capital Markets business.

**What you will be doing:**

Site Reliability Engineer will play a critical role in driving innovation and growth for the Banking Solutions, Payments and Capital Markets business. In this role, the candidate will have the opportunity to make a lasting impact on the company's transformation journey, drive customer-centric innovation and automation, and position the organization as a leader in the competitive banking, payments and investment landscape.

Specifically, the Site Reliability Engineer will be responsible for the following:

* Design and maintain monitoring solutions for infrastructure, application performance, and user experience.
* Implement automation tools to streamline tasks, scale infrastructure, and ensure seamless deployments.
* Ensure application reliability, availability, and performance, minimizing downtime and optimizing response times.
* Lead incident response, including identification, triage, resolution, and post-incident analysis.
* Conduct capacity planning, performance tuning, and resource optimization.
* Collaborate with security teams to implement best practices and ensure compliance.
* Manage deployment pipelines and configuration management for consistent and reliable app deployments.
* Develop and test disaster recovery plans and backup strategies.
* Collaborate with development, QA, Dev

How to Apply on Workday

  • Workday has a multi-step form — save your progress after every section.
  • "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
  • Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
  • Job requisition numbers are useful when following up with HR by email.
