Forcepoint
SiteReliabilityEngineerII
Neural analysis suggests this role is
optimal for Mid+ candidates.
“Site Reliability Engineer II at Forcepoint. Skills: Site Reliability Engineering principles, automation, infrastructure as code, observability tooling, service restoration methodologies, coding, DevOps practices. Monitor, measure and improve the reliability, availability and scalability of Forcepoint products and infrastructure. Engage in Incident response and participate in post-mortem analysis”
What You'll Achieve.
maintain world-class reliability of our services for our customers; actively targets risk to service availability for customers; pro-actively identifying performance constraints and bottlenecks; reduce manual/re-active workload
Industry & Context.
pro-active approach to problem solving; develop automated solutions to prevent recurring problems; Analytical and logical approach to problem-solving
Participate in 24*7 rotational shifts & On-Call for handling production operation issues, Applicants must have the right to work in the location to which you have applied.
What They're Looking For.
Must Have
understanding of cloud-based architecture and operations, Experience in administrationuild/management of Linux systems, Foundational understanding of Infrastructure and Platform Technology stacks, understanding of Networking concepts and theories, such as different protocols (TCP/IP, UDP, routing protocols, etc), VLAN configuration, DNS, OSI layers, and load balancing, Understanding of security architecture and certificate management, Working knowledge of Infrastructure and Application monitoring platforms such as Grafana Cloud, Xymon, LibreNMS etc., Working knowledge of Incident Response and Alerting platforms such as PagerDuty, Opsgenie, XMatters etc., Understanding of the core DevOps practices (CI/CD pipeline, release management etc. ), Ability to write code using any one modern programming language (Python, JavaScript, Ruby etc. ), Configuration management platform understanding and experience (Chef/Puppet/Ansible), Prior experience in Cloud management automation tools (Terraform/CloudFormation etc. ), Experience with source code management software and API automation is crucial, Service availability oriented mindset with a pro-active approach to problem solving, An ideal candidate should be able to develop automated solutions to prevent recurring problems, Possesses the ability and willingness to challenge the status-quo and optimize current procedures and processes, sense of ownership and an ability to drive cross-functional process improvement, Possesses excellent inter-personal, written and verbal communications skills, Analytical and logical approach to problem-solving and a willingness to automate repetitive tasks and reduce manual/re-active workload, Ability and willingness to coach and mentor Team members and colleagues
Nice to Have
Hands-on experience with Amazon Web Services is preferred, Additional scripting skills are preferred, Cloud certifications or equivalent experience is highly regarded
What You'll Do.
measure and improve the reliability
availability and scalability of Forcepoint products and infrastructure
Engage in Incident response and participate in post-mortem analysis
Perform analytics on previous incidents and trend/usage patterns
Design and build custom tools as needed to support process optimization
Identify manual routine operational practices and build robust automation capabilities
Review and create dashboards/reports for application telemetry and infrastructure health
Monitor product performance and availability
provide feedback to develop
and implement robust monitoring
and logging solutions
Work collaboratively with software developers to promote best practices in reliability and operability
Participate with stakeholders to monitor our products
ensuring that the products meet architecture & observability design requirements
How You'll Work.
Team & Collaboration
partnering with Engineering and Operations teams; Work collaboratively with software developers; Participate with stakeholders
Communication Scope
excellent inter-personal, written and verbal communications skills
Process & Methodology
drive cross-functional process improvement
Full Job Description
**Who is Forcepoint?** Forcepoint simplifies security for global businesses and governments. Forcepoint’s all-in-one, truly cloud-native platform makes it easy to adopt Zero Trust and prevent the theft or loss of sensitive data and intellectual property no matter where people are working. 20+ years in business. 2.7k employees. 150 countries. 11k+ customers. 300+ patents. If our mission excites you, you’re in the right place; we want you to bring your own energy to help us create a safer world. All we’re missing is you! **Site Reliability Engineer** Forcepoint is seeking a Site Reliability Engineer to join our Site Reliability Engineering Team. The SRE role will focus standardising key Site Reliability Engineering principles across Forcepoint products and help maintain world-class reliability of our services for our customers. The SRE role actively targets risk to service availability for customers by partnering with Engineering and Operations teams leveraging modern observability tooling and service restoration methodologies focused on automation and infrastructure as code where possible. The ideal candidate will have a broad background spanning both applications and infrastructure. They will have direct experience with multiple coding language, core SRE practices & design methodologies. **Job Description:** * Monitor, measure and improve the reliability, availability and scalability of Forcepoint products and infrastructure * Engage in Incident response and participate in post-mortem analysis to investigate root cause and capture contributing factors for remediation * Perform analytics on previous incidents and trend/usage patterns to better predict issues and take proactive actions * Design and build custom tools as needed to support process optimization, challenging the status-quo and improving operational efficiency * Participate in 24*7 rotational shifts & On-Call for handling production operation issues * Identify manual routine operational practices and build
Applying for this Site Reliability Engineer II role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Workday
- Workday has a multi-step form — save your progress after every section.
- "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
- Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
- Job requisition numbers are useful when following up with HR by email.
ANONYMOUS · UNFILTERED
What do employees actually say about Forcepoint?
Real rants from real employees. Read before you apply.