FIS
financial services
Site Reliability Engineer Principal (Software Engineering)
“Site Reliability Engineer Principal (Software Engineering) at FIS. Skills: Site Reliability Engineering, Cloud Platforms (AWS, Azure, Google Cloud), Infrastructure as Code (IaC), Monitoring and Logging, Automation, CI/CD, Incident Management. Design and maintain monitoring solutions for infrastructure, application performance, and user experience. Implement automation tools to streamline tasks, scale infrastructure, and ensure seamless deployments”
Industry & Context.
Troubleshooting skills for complex technical issues
Participate in on-call rotations and provide 24/7 support for critical incidents
What They're Looking For.
Must Have
Proficiency in development technologies, architectures, and platforms (web, API); Experience with cloud platforms (AWS, Azure, Google Cloud); Experience with IaC tools (Terraform); Knowledge of monitoring tools (Prometheus, Grafana, DataDog); Knowledge of logging frameworks (Splunk, ELK Stack); Experience in incident management and post-mortem reviews; Troubleshooting skills for complex technical issues; Proficiency in scripting languages (Python, Bash); Proficiency in automation tools (Terraform, Ansible); Experience with CI/CD pipelines (Harness, Jenkins, GitLab CI/CD, Azure DevOps)
Nice to Have
Kubernetes a plus
What You'll Do.
Design and maintain monitoring solutions for infrastructure, application performance, and user experience
Implement automation tools to streamline tasks, scale infrastructure, and ensure seamless deployments
Ensure application reliability, availability, and performance, minimizing downtime and optimizing response times
Lead incident response, including identification, triage, resolution, and post-incident analysis
Conduct capacity planning, performance tuning, and resource optimization
Collaborate with security teams to implement best practices and ensure compliance
Manage deployment pipelines and configuration management for consistent and reliable app deployments
Develop and test disaster recovery plans and backup strategies
Participate in on-call rotations and provide 24/7 support for critical incidents
How You'll Work.
Team & Collaboration
Collaborate with security teams to implement best practices and ensure compliance; Collaborate with development, QA, DevOps, and product teams to align on reliability goals and incident response processes
Communication Scope
Excellent interpersonal communication; Negotiation; Influencing skills
Full Job Description
**Job Description**

Are you curious, motivated, and forward-thinking? At FIS you’ll have the opportunity to work on some of the most challenging and relevant issues in financial services and technology. Our talented people empower us, and we believe in being part of a team that is open, collaborative, entrepreneurial, passionate and above all fun.

_**NOTE:**_ **_This position is hybrid (3 days onsite) in our FIS Office locations in Atlanta (GA), Jacksonville (FL), Milwaukee (WI)._**

_**About the Team:**_

This position is under our CTO org to support SRE functions for innovation and growth for the Banking Solutions, Payments and Capital Markets business.

_**What you will be doing:**_

Site Reliability Engineer will play a critical role in driving innovation and growth for the Banking Solutions, Payments and Capital Markets business. In this role, the candidate will have the opportunity to make a lasting impact on the company's transformation journey, drive customer-centric innovation and automation, and position the organization as a leader in the competitive banking, payments and investment landscape.

Specifically, the Site Reliability Engineer will be responsible for the following:

* Design and maintain monitoring solutions for infrastructure, application performance, and user experience.
* Implement automation tools to streamline tasks, scale infrastructure, and ensure seamless deployments.
* Ensure application reliability, availability, and performance, minimizing downtime and optimizing response times.
* Lead incident response, including identification, triage, resolution, and post-incident analysis.
* Conduct capacity planning, performance tuning, and resource optimization.
* Collaborate with security teams to implement best practices and ensure compliance.
* Manage deployment pipelines and configuration management for consistent and reliable app deployments.
* Develop and test disaster recovery plans and backup strategies.
* Collaborate with development, QA, Dev
How to Apply on Workday
- Workday has a multi-step form — save your progress after every section.
- "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
- Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
- Job requisition numbers are useful when following up with HR by email.