Senior Site Reliability Engineer (SRE)
We are looking for a Site Reliability Engineer (SRE) with a strong blend of development, DevOps, and performance engineering to make a large portfolio of applications of one of our clients more robust and stable. Someone who has actual development experience but understands infrastructure management as well, able to operate in both sides of the DevOps field and tackle the reliability and system-wide issues holistically.
Core Responsibilities:
Design and maintain standard CI/CD processes to deliver microservices-based applications Optimize deployment processes and reduce release cycle times. Code deployments, releases, monitoring, alerting, solving incidents Analysis and integration of modern technologies as opportunities to improve Instilling DevOps culture within the engineering function Conduct performance analysis and optimization, assist with performance troubleshooting efforts and communicate findings to various teams. Collaborate with teams to identify and resolve performance bottlenecks in C# code and SQL queries Address system-wide issues by viewing the infrastructure and applications holistically. Engage in failover testing, infrastructure hardening, and performance troubleshooting. Operate independently and confidently, building relationships and providing innovative solutions. Transition current application services to App Service plans, legacy apps to Azure Cloud, IaaS to PaaSKnowledge sharing:
Educate team members on best practices, architectural patterns, and performance engineering. Share lessons learned, frequent issues and how to overcome them Working in agile and cross-functional teams Development Experience: Strong background in C# and SQL development. DevOps Expertise: Comprehensive knowledge of Azure services and tools, including AppInsights, Azure CLI, PowerShell, and bash scripting. Performance Engineering: Ability to identify performance issues, analyse hotspots in code, and suggest effective solutions. Problem-Solving: Strong analytical skills to diagnose and resolve outages and performance bottlenecks. Communication: Excellent ability to talk to various teams, understand their concerns, and provide actionable insights. Independence: Capable of operating independently with loosely defined tasks, confident in executing tasks, and comfortable in building relationships.Nice-to-have Skills:
Experience with Python scripting. Knowledge of Databricks. Familiarity with DataDog and SonarQube. High level of confidence and independence in daily operations. Ability to execute tasks with minimal guidance. Strong relationship-building skills with team members and stakeholders. Proactive in sharing ideas and innovative solutions. Hands-on experience and practical knowledge in development and DevOps.Additional Expectations:
Provide practical hands-on experience and context to team members. Share knowledge on patterns, architecture, and best practices. Not expected to solve everything but to help educate and guide the team. The role is not just about running performance troubleshooting but also about problem-solving and understanding the overall system. The candidate should be able to find hotspots in the code and SQL queries and suggest practical solutions.Required:
C#, .NET Core, ASP.NET WebApi, Entity Framework SQL, SQL Profiler Azure DevOps CI/CD pipelines Azure tooling - Azure CLI, Azure KeyVault PowerShell, bash AppInsights Docker, Containers k8s, Azure Kubernetes Service Terraform ARM gitNice to have:
Python scripting, DataDog, Sonarqube
If you are interested, please contact our consultant for a confidential discussion using the contact details provided or click apply to send an updated copy of your CV.
Cavendish (Recruitment) Professionals Ltd are proud to be an equal opportunity employer and we believe that inclusivity begins with the candidate experience. All qualified applicants will receive consideration for employment regardless of, gender, race, age, sexual orientation, religion, or belief.