Barclays

SeniorSiteReliabilityEngineer

Pune, Maharashtra, India FULL TIME
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is
optimal for Senior candidates.

The Brief

“Senior Site Reliability Engineer at Barclays. Skills: Site Reliability Engineering, automation, incident response, monitoring, capacity planning, troubleshooting, root cause analysis, scripting, performance tuning, systems engineering, cloud infrastructure, automation tools, cloud platforms, cloud services. apply software engineering techniques, automation, and best practices in incident response. ensure the reliability, availability, and scalability of the systems, platforms, and technology”

What You'll Achieve.

ensure the reliability, availability, and scalability of the systems, platforms, and technology; reducing manual workload, increasing efficiency, and improving system resilience; achieving the goals of the business

Industry & Context.

Problems you'll solve

troubleshoot issues swiftly; perform root cause analysis; approach problems methodically and find effective solutions; Create solutions based on sophisticated analytical thought; In-depth analysis with interpretative thinking will be required to define problems and develop innovative solutions; Adopt and include the outcomes of extensive research in problem solving processes

Eligibility Requirements

This role is based in our Pune office.

What They're Looking For.

Must Have

expertise in languages such as Python, Powershell, or Go, ability to manage incidents effectively, troubleshoot issues swiftly, and perform root cause analysis, Deep understanding of systems engineering, including operating systems, networking, and cloud infrastructure, Proficiency in automation tools, familiar with cloud platforms and services, capability to approach problems methodically and find effective solutions

Nice to Have

Experience working with IaaS and/or PaaS products, including some experience of either virtualization, containerization, orchestration of compute/network/storage, Experience working with REST API development and Integration, Database management / Query language, Proficiency in implementing monitoring and alerting systems

What You'll Do.

apply software engineering techniques

and best practices in incident response

ensure the reliability

and scalability of the systems

and capacity planning

analysis and response to system outages and disruptions

implement measures to prevent similar incidents from recurring

Development of tools and scripts to automate operational processes

Monitoring and optimisation of system performance and resource usage

identify and address bottlenecks

implement best practices for performance tuning

integrate best practices for reliability

and performance into the software development lifecycle

ensure smooth and efficient operations

contribute or set strategy

drive requirements and make recommendations for change

and manage and maintain policies

deliver continuous improvements

escalate breaches of policies/procedures

guide technical direction

multi-year assignments

guide team members through structured assignments

identify the need for the inclusion of other areas of specialisation

guide and coach less experienced specialists

provide information affecting long term profits

organisational risks and strategic decisions

Advise key stakeholders on functional and cross functional areas of impact and alignment

Manage and mitigate risks through assessment

Demonstrate leadership and accountability for managing risk and strengthening controls

Demonstrate comprehensive understanding of the organisation functions

Collaborate with other areas of work

Create solutions based on sophisticated analytical thought

Adopt and include the outcomes of extensive research in problem solving processes

build and maintain trusting relationships and partnerships

How You'll Work.

Team & Collaboration

Collaboration with development teams to integrate best practices for reliability, scalability, and performance into the software development lifecycle; work closely with other teams to ensure smooth and efficient operations; communicate effectively with team members and stakeholders; aligning, inspiring and motivating them to embrace new mindsets, cultures, and SRE working practices; lead collaborative, multi-year assignments; guide team members through structured assignments; identify the need for the inclusion of other areas of specialisation to complete assignments; Advise key stakeholders, including functional leadership teams and senior management on functional and cross functional areas of impact and alignment; Collaborate with other areas of work, for business aligned support areas to keep up to speed with business activity and the business strategies; build and maintain trusting relationships and partnerships with internal and external stakeholders

Communication Scope

communicate effectively with team members and stakeholders; ensuring alignment, inspiring and motivating them to embrace new mindsets, cultures, and SRE working practices; influencing skills; negotiating skills

Process & Methodology

Plan resources, budgets, manage and maintain policies, deliver continuous improvements, lead collaborative, multi-year assignments, guide team members through structured assignments

Full Job Description

# **Job Description** **Purpose of the role** To apply software engineering techniques, automation, and best practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. **Accountabilities** * Availability, performance, and scalability of systems and services through proactive monitoring, maintenance, and capacity planning. * Resolution, analysis and response to system outages and disruptions, and implement measures to prevent similar incidents from recurring. * Development of tools and scripts to automate operational processes, reducing manual workload, increasing efficiency, and improving system resilience. * Monitoring and optimisation of system performance and resource usage, identify and address bottlenecks, and implement best practices for performance tuning. * Collaboration with development teams to integrate best practices for reliability, scalability, and performance into the software development lifecycle, and work closely with other teams to ensure smooth and efficient operations. * Stay informed of industry technology trends and innovations, and actively contribute to the organization's technology communities to foster a culture of technical excellence and growth. **Vice President Expectations** * To contribute or set strategy, drive requirements and make recommendations for change. Plan resources, budgets, and policies; manage and maintain policies/ processes; deliver continuous improvements and escalate breaches of policies/procedures.. * If managing a team, they define jobs and responsibilities, planning for the department’s future needs and operations, counselling employees on performance and contributing to employee pay decisions/changes. They may also lead a number of specialists to influence the operations of a department, in alignment with strategic as well as tactical priorities, while balancing short and long term goals and ensuring that budgets and schedules me

Free ATS check

Applying for this Senior Site Reliability Engineer role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

How to Apply on Workday

  • Workday has a multi-step form — save your progress after every section.
  • "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
  • Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
  • Job requisition numbers are useful when following up with HR by email.

ANONYMOUS · UNFILTERED

What do employees actually say about Barclays?

Real rants from real employees. Read before you apply.

Read Company Rants →