Job Description
About the Company:
Our client is a leading technology provider for the hospitality sector in the UK. We offer a range of services designed to help businesses thrive, including mobile ordering apps and web-based platforms for customer engagement through loyalty programs and reservations.
About the Role:
We are seeking a Production Operations Engineer to join our Prodops team and contribute to the operational excellence and stability of our services. The ideal candidate will have a strong background in application or web services, with experience in support, delivery, or operations roles. They will be a self-starter with a passion for technology and problem-solving, possessing excellent analytical skills and the ability to thrive in an autonomous environment.
Responsibilities:
- Building and maintaining collaborative relationships with Development, Platform, and Devops teams to ensure a responsive and knowledgeable service that maintains high performance and service levels, identifying and addressing issues and trends early.
- Owning and continuously enhancing incident triage processes, logging, monitoring, and alerting frameworks, dashboards, status pages, and reporting.
- Reducing toil through automation, tooling, and documentation.
- Managing capacity analytics and demand.
- Bringing expertise and a practical approach to problem-solving, simplifying complex issues.
- Participating in On-Call cover and Incident Response.
- Upholding key Service Level Objectives (SLOs) for availability, performance, and early detection.
- Playing a key role in reducing technical debt.
Qualifications:
A minimum of 2+ years of experience in supporting or operating production environments.
Required Skills:
- Experience with modern monitoring and logging tools for data gathering, retrieval, and event correlation.
- Proficiency in scripting and data access using PowerShell, T-SQL, and AtlasDB Compass / MQL.
- Comfort with complex provisioning and deployment scenarios.
- Strong collaboration skills, organizational abilities, and reliability.
- The ability to understand the bigger picture, representing operational and customer impacts in technical debt prioritization.
Preferred Skills:
- Experience working for a Managed Services Provider (MSP) or with multi-tenanted technology stacks.
- Experience with cloud and on-premises data-centric web infrastructure.
- Hands-on experience with Azure, MongoDB / AtlasDB, and SQL Server.