Site Reliability and DevOps Engineering Lead
Software Engineering
Ireland
EUR 71,273.53-106,910.29 / year
Micromedex is seeking a highly skilled Platform Reliability & DevOps Engineering Lead who combines deep hands-on expertise in cloud services, infrastructure, and automation with a strong architectural understanding of distributed, high-availability systems.
You will lead the platform team, ensuring our mission-critical clinical platform is highly available (24×7), performant, scalable, and secure.
This role is both strategic and hands-on: you will define and drive the platform reliability and DevOps strategy, continuously improving system resilience and CI/CD capability, while partnering closely with engineering teams and vendors to embed operational excellence across the software lifecycle.
You will be accountable for the end-to-end reliability, operability, and delivery capability of the Micromedex platform, unifying Site Reliability Engineering, DevOps, and CI/CD ownership into a single platform function. This includes owning platform reliability outcomes, DevOps enablement, and delivery pipelines to support scalable, high-availability systems and faster, safer releases.
You are passionate about automation, proactive in addressing reliability and performance challenges, and committed to maintaining the trust of clinicians worldwide through resilient system design, strong operational discipline, and rapid incident response.
Responsibilities:
People & Team Leadership
Lead, mentor, and grow Platform / DevOps engineers
Build a high-performing Platform team
Drive accountability for platform reliability and delivery outcomes
Lead vendors to deliver capabilities in production
Production Engineering & Platform Operations
Ensure platform capabilities accelerate product delivery and remove bottlenecks
Define and enforce platform engineering standards and DevOps practices across all teams and vendors
Lead capacity planning, performance optimization, and cost efficiency
Define operational standards, runbooks, and reliability practices
Be accountable for platform reliability outcomes at enterprise/product level
Platform Strategy and Leadership
Act as technical authority across platform, reliability, and delivery
Define platform strategy and roadmap
Govern delivery across internal teams and vendors
Platform Reliability Ownership
Own SLIs, SLOs, and error budgets
Lead resilience engineering, observability, and failure design
Drive proactive risk reduction and continuous improvement
Own incident management frameworks and continuous improvement
CI/CD and Release Engineering
Own end-to-end pipeline architecture and release automation
Standardize, secure, and fully automate pipelines
Drive continuous integration, delivery, and validation practices
Incident Leadership
Lead Sev1 response, escalation, and recovery
Own RCA and drive systemic fixes (not point fixes)
AI and Automation
Introduce AI-enabled pipeline optimization and quality gates
Embed AI into monitoring, risk prediction, and CI/CD optimization
Drive automation to reduce operational toil and improve decision-making
Required Skills:
Bachelor’s degree in Computer Science, Engineering, or a related field.
8+ years of hands-on experience in software operations, DevOps, and Site Reliability Engineering, including managing large-scale, mission-critical systems.
Strong proficiency in at least one programming or scripting language (e.g., Python, Bash, or Java) for automation and tool integration.
Excellent understanding of modern software delivery pipelines and DevOps practices, including CI/CD, configuration management, and version control (Git).
Demonstrated experience in releasing to and operating mission-critical, high-availability SaaS platforms, while providing technical leadership to Platform teams and effectively influencing stakeholders and vendors across Product, Architecture, and Operations.
Expertise in Site Reliability Engineering, DevOps, and Platform Engineering, driving reliability through SLIs/SLOs, incident management, CI/CD architecture, and release automation.
Experience with cloud, systems, and infrastructure (DB2, Oracle, Infinispan, OpenLiberty), and with automation-first engineering, including proven use of AI (self-healing, triage).
Expertise with Java application platforms and runtimes (performance tuning, troubleshooting, production operations).
Expertise in designing and scaling distributed, fault-tolerant systems, with strong database optimization capabilities across DB2, Oracle, and PostgreSQL environments.
Experience with multi-region/active-active environments, observability frameworks (monitoring, logging, tracing), and embedding reliability engineering practices throughout the SDLC.
Clear and confident communication skills, with the ability to lead teams and collaborate effectively across engineering, product, and architecture.
Proven track record ensuring high availability and performance in production environments, with expertise in fault-tolerant, distributed system design.
Exceptional problem-solving skills, with experience diagnosing complex system issues under pressure and driving them to resolution.
Self-driven and proactive, with a passion for automating manual processes and continuously improving systems to enhance reliability and team productivity.
Preferred Skills:
DB2, Oracle, Infinispan, OpenLiberty, Azure
Infrastructure as Code (Terraform or similar)
Containerisation and orchestration (Docker/Kubernetes)
Compensation
The salary range provided in this job posting is intended to reflect the general market value for the position. The actual salary offered may vary based on factors such as the candidate’s experience, qualifications, skills, and the specific requirements of the role. This range may also be subject to change as market conditions evolve. We encourage open communication throughout the interview process to discuss compensation expectations. For base-salary + commission sales roles, the range represents On-Target Earnings.
Min – Max: €71,273.53 - €106,910.29 (EUR)
Benefits
The benefits described represent the current offerings at our organization; however, benefits are subject to change and may vary by location and employment status. We strive to provide a comprehensive benefits package that supports our employees’ health, wellness, and financial goals. Please note that benefits may be discussed in more detail during the hiring process.
Vacation to help you rest, recharge, and connect with loved ones
Paid leave benefits
Health plan cover through Irish Life Health
Employer and employee pension contributions
Bike2work program/benefits
Tuition reimbursement, life insurance, EAP – and more!
It is the policy of Merative to provide equal employment opportunity (EEO) to all persons regardless of age, color, national origin, citizenship status, physical or mental disability, race, religion, creed, gender, sex, sexual orientation, gender identity and/or expression, genetic information, marital status, status with regard to public assistance, veteran status, HIV status, or any other characteristic protected by federal, state or local law. In addition, Merative will provide reasonable accommodations for qualified individuals with disabilities.