Barclays

SeniorSiteReliabilityEngineer

Pune, India FULL TIME
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is
optimal for Senior candidates.

The Brief

“Senior Site Reliability Engineer at Barclays. Skills: Site Reliability Engineering (SRE), Automation, Systems Engineering, Cloud Platforms, Incident Management, Performance Optimization, Communication. Apply software engineering techniques, automation, and best practices in incident response. Ensure the reliability, availability, and scalability of systems, platforms, and technology”

What You'll Achieve.

Ensure the reliability, availability, and scalability of the systems, platforms, and technology; Reducing manual workload; Increasing efficiency; Improving system resilience; Achieve outcomes

Industry & Context.

Problems you'll solve

Troubleshoot issues swiftly; Perform root cause analysis; Identify and address bottlenecks; Approach problems methodically and find effective solutions; Create solutions based on sophisticated analytical thought; In-depth analysis with interpretative thinking; Define problems; Develop innovative solutions

What They're Looking For.

Must Have

Expertise in languages such as Python, Powershell, or Go, Ability to manage incidents effectively, Troubleshoot issues swiftly, Perform root cause analysis, Deep understanding of systems engineering, including operating systems, networking, and cloud infrastructure, Proficiency in automation tools, Ability to communicate effectively with team members and stakeholders, Familiar with cloud platforms and services, Capability to approach problems methodically and find effective solutions

Nice to Have

Experience working with IaaS and/or PaaS products, Experience of either virtualization, containerization, orchestration of compute/network/storage, Experience working with REST API development and Integration, Database management / Query language, Proficiency in implementing monitoring and alerting systems

What You'll Do.

Apply software engineering techniques

and best practices in incident response

Ensure the reliability

and scalability of systems

analysis and response to system outages and disruptions

Implement measures to prevent similar incidents from recurring

Development of tools and scripts to automate operational processes

Monitoring and optimisation of system performance and resource usage

Identify and address bottlenecks

Implement best practices for performance tuning

Integrate best practices for reliability

and performance into the software development lifecycle

Ensure smooth and efficient operations

Contribute or set strategy

Make recommendations for change

Manage and maintain policies

Deliver continuous improvements

Escalate breaches of policies/procedures

Guide technical direction

multi-year assignments

Guide team members through structured assignments

Identify the need for the inclusion of other areas of specialisation

guide and coach less experienced specialists

Provide information affecting long term profits

organisational risks and strategic decisions

Advise key stakeholders

Manage and mitigate risks through assessment

Demonstrate leadership and accountability for managing risk and strengthening controls

Demonstrate comprehensive understanding of the organisation functions

Collaborate with other areas of work

Create solutions based on sophisticated analytical thought

In-depth analysis with interpretative thinking

Adopt and include the outcomes of extensive research in problem solving processes

build and maintain trusting relationships and partnerships

Use influencing and negotiating skills to achieve outcomes

Act as a centre of excellence providing hands on consultancy to infrastructure product teams

Work on software engineering skills to demonstrate value in SRE mindset

Influence existing colleagues

How You'll Work.

Team & Collaboration

Collaboration with development teams; Work closely with other teams; Communicate effectively with team members and stakeholders; Align across the enterprise; Collaborate with other areas of work; Build and maintain trusting relationships and partnerships with internal and external stakeholders

Communication Scope

Communicate effectively with team members and stakeholders; Inspiring and motivating them to embrace new mindsets, cultures, and SRE working practices; Advise key stakeholders, including functional leadership teams and senior management

Process & Methodology

Plan resources, Manage and maintain policies, Deliver continuous improvements, Lead collaborative, multi-year assignments, Manage and mitigate risks

Full Job Description

# **Job Description** **Purpose of the role** To apply software engineering techniques, automation, and best practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. **Accountabilities** * Availability, performance, and scalability of systems and services through proactive monitoring, maintenance, and capacity planning. * Resolution, analysis and response to system outages and disruptions, and implement measures to prevent similar incidents from recurring. * Development of tools and scripts to automate operational processes, reducing manual workload, increasing efficiency, and improving system resilience. * Monitoring and optimisation of system performance and resource usage, identify and address bottlenecks, and implement best practices for performance tuning. * Collaboration with development teams to integrate best practices for reliability, scalability, and performance into the software development lifecycle, and work closely with other teams to ensure smooth and efficient operations. * Stay informed of industry technology trends and innovations, and actively contribute to the organization's technology communities to foster a culture of technical excellence and growth. **Vice President Expectations** * To contribute or set strategy, drive requirements and make recommendations for change. Plan resources, budgets, and policies; manage and maintain policies/ processes; deliver continuous improvements and escalate breaches of policies/procedures.. * If managing a team, they define jobs and responsibilities, planning for the department’s future needs and operations, counselling employees on performance and contributing to employee pay decisions/changes. They may also lead a number of specialists to influence the operations of a department, in alignment with strategic as well as tactical priorities, while balancing short and long term goals and ensuring that budgets and schedules me

Free ATS check

Applying for this Senior Site Reliability Engineer role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

How to Apply on Workday

  • Workday has a multi-step form — save your progress after every section.
  • "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
  • Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
  • Job requisition numbers are useful when following up with HR by email.

ANONYMOUS · UNFILTERED

What do employees actually say about Barclays?

Real rants from real employees. Read before you apply.

Read Company Rants →