Explore careers with our portfolio companies

Site Reliability Engineer III

Forcepoint

Forcepoint

Software Engineering, IT
Campbell, CA, USA
Posted on Dec 6, 2023

Who is Forcepoint?

Forcepoint simplifies security for global businesses and governments. Forcepoint’s all-in-one, truly cloud-native platform makes it easy to adopt Zero Trust and prevent the theft or loss of sensitive data and intellectual property no matter where people are working. 20+ years in business. 2.7k employees. 150 countries. 11k+ customers. 300+ patents. If our mission excites you, you’re in the right place; we want you to bring your own energy to help us create a safer world. All we’re missing is you!

Forcepoint is seeking a Site Reliability Engineer to join our Site Reliability Engineering Team in our Campbell, California office. The SRE role will focus on elevating application and service performance and availability in support of our organization’s fast-evolving enterprise technology needs.

The SRE role actively targets risk to service availability for employees and customers by partnering with Engineering and Operations teams leveraging modern observability tooling and service restoration methodologies focused on automation and infrastructure as code where possible.

The ideal candidate will have a broad background spanning both applications and infrastructure. They will have direct experience with multiple coding language, core SRE practices & methodologies.

*****Candidate needs to reside in the Campbell, California area as you will be commuting into the office*******

Job Description:

  • Monitor, measure and improve the reliability, availability and scalability of IT Infrastructure, applications and services

  • Identify manual routine operational practices and build robust automation capabilities using code and modern tools

  • Collaborate with Product Developers and business stakeholders to gather requirements for enabling and improving performance monitoring for applications and services

  • Engage in Incident response and participate in post-mortem analysis to investigate root cause and capture contributing factors for remediation

  • Perform analytics on previous incidents and trend/usage patterns to better predict issues and take proactive actions

  • Design and build custom tools as needed to support process optimization, challenging the status-quo and improving operational efficiency

  • Participate in 24*7 rotational shifts & On-Call for handling production operation issues

  • Engage in service capacity planning and demand forecasting, software performance analysis and system tuning

  • Create meaningful dashboards/reports for application telemetry and infrastructure health for pro-actively identifying performance constraints and bottlenecks

Requirements:

  • University degree and 4-6 years of related experience, or equivalent work experience.

  • Strong understanding of cloud-based architecture and cloud operations. Hands-on experience with Amazon Web Services and/or equivalent public cloud technology

  • Experience in administration/build/management of Linux systems

  • Foundational understanding of Infrastructure and Platform Technology stacks

  • Strong understanding of Networking concepts and theories, such as different protocols (TCP/IP, UDP, routing protocols, etc), VLAN configuration, DNS, OSI layers, and load balancing

  • Understanding of security architecture and certificate management

  • Working knowledge of Infrastructure and Application monitoring platforms such as Grafana Cloud, Solarwinds, NewRelic, DataDog etc.

  • Working knowledge of Incident Response and Alerting platforms such as PagerDuty, Opsgenie, XMatters etc.

  • Understanding of the core DevOps practices (CI/CD pipeline, release management etc.)

  • Ability to write code using any one modern programming language (Phython, JavaScript, Ruby etc.). Additional scripting skills are preferred

  • Configuration management platform understanding and experience (Chef/Puppet/Ansible)

  • Prior experience in Cloud management automation tools (Terraform/CloudFormation etc.) is crucial

  • Experience with source code management software and API automation is crucial

  • Cloud certifications or equivalent experience is highly regarded

  • Service availability oriented mindset with a pro-active approach to problem solving. An ideal candidate should be able to develop automated solutions to prevent recurring problems

  • Possesses the ability and willingness to challenge the status-quo and optimize current procedures and processes

  • Strong sense of ownership and an ability to drive cross-functional process improvement

  • Possesses excellent inter-personal, written and verbal communications skills

  • Analytical and logical approach to problem-solving and a willingness to automate repetitive tasks and reduce manual/re-active workload

  • Ability and willingness to coach and mentor Team members and colleagues

Don’t meet every single qualification? Studies show people are hesitant to apply if they don’t meet all requirements listed in a job posting. Forcepoint is focused on building an inclusive and diverse workplace – so if there is something slightly different about your previous experience, but it otherwise aligns and you’re excited about this role, we encourage you to apply. You could be a great candidate for this or other roles on our team.

The policy of Forcepoint is to provide equal employment opportunities to all applicants and employees without regard to race, color, creed, religion, sex, sexual orientation, gender identity, marital status, citizenship status, age, national origin, ancestry, disability, veteran status, or any other legally protected status and to affirmatively seek to advance the principles of equal employment opportunity.

Forcepoint is a Federal Contractor. Certain positions with Forcepoint require access to controlled goods and technologies subject to the International Traffic in Arms Regulations or the Export Administration Regulations. Applicants for these positions may need to be "U.S. Persons," as defined in these regulations. Generally, a "U.S. Person" is a U.S. citizen, lawful permanent resident, or an individual who has been admitted as a refugee or granted asylum.

Applicants must have the right to work in the location to which you have applied.