Azure Cloud Engineer - Remote

  • Location: Salt Lake City, UT, 84101
  • Salary: 65.0
  • Job Type: Contract

Posted 15 days ago

  • Function: DevOps
  • Job Ref: 195930
MATRIX has partnered with a premier client to fill a unique contract-to-hire position based in Salt Lake City! This is a great opportunity to work with a well-known company and grow your career.

Are you a site reliability engineer focused on the Azure space? Do you get excited thinking about DevOps and the opportunity to grow and foster a culture that bridges the divide between the software and infrastructure engineering departments? This position may be for you!

TITLE:  Site Reliability Engineer - AZURE Cloud Migration

Our engineering team has built the largest private Medicare marketplace in the country. We passionately focus on the continuous improvement of the systems we build and the culture we promote. We build a platform that provides the best possible support to our customers who are shopping for insurance, and where our insurance carriers can be confident that their products are accurately and impartially represented.

We have spent many years growing and fostering a DevOps culture by bridging the divide between our Software and Infrastructure Engineering departments. We want the cross-functional teams that we are building to include Site Reliability Engineers. We operate in a complex, multi-tenant, hybrid cloud and on-premises infrastructure that spans both the Windows and Linux OS. We strive for security, reliability, and automation in line with DevOps and Site Reliability Engineering principles. If you are passionate about learning and improvement through metrics and automation, and passionate about engendering that mindset in others, we want to hear from you.

Hands-on Engineering
5+ years of total hands-on experience with a majority of the following technologies, along with a willingness to become proficient in the remaining areas:
• Windows and Linux Servers
• VMware
• Cloud platforms, preferably with Azure
• Active Directory
• Secrets management with Consul and Vault or similar systems
• Configuration management tools like Salt and Terraform
• Firewalls and load balancers such as F5
• Web servers, including IIS and NGINX
• Database Server Infrastructure like Microsoft SQL Server and PostgreSQL
• Application Performance Monitoring with tools like New Relic
• Infrastructure monitoring with tools like Sensu, SolarWinds, or Nagios
• Continuous Integration and Continuous Delivery with tools like TeamCity, Octopus Deploy, Concourse, or Azure DevOps
• Log Aggregation tools like SumoLogic or Splunk
• Network theory and protocols such as DNS, DHCP, proxy servers, and firewalls
• Security operations with tools for SAST, DAST, RAST, and WAF

Proficiency, high comfort, and familiarity with:
• One or more programming languages, such as C#, JavaScript, Python or Go
• One or more scripting languages, such as PowerShell and Bash
• Command-line tools such as git, netcat, npm, and terraform

• Explore new ways of improving communication between other Site Reliability Engineers and with other teams
• Promote inclusion and collaboration between various functional disciplines
• Write and maintain architectural, stakeholder, and policy documentation

• Encourage and inspire others to innovate
• Make improvements to internal processes to reduce lead time and increase deployment frequency
• Identify improvements to the quality, security, and performance of our infrastructure
• Increase the velocity with which teams deliver, leveraging expertise from various functional disciplines
• Identify how to remediate production incidents more quickly and safely while reducing the frequency of outages
• Participate in department Communities of Practice, actively engaging with other teams and departments to collaborate on best practices and implementation strategy
• Adhere to and advocate for best practices, including Infrastructure as Code, monitoring, high availability, disaster recovery, security, and DevOps methodologies
• Use intuition, experience and understanding to create SLIs, SLOs, and SLAs
• Contribute to capacity planning, and advise and consult with teams that will be load/stress testing
• Keep up with industry innovations, recommending new tools or practices when appropriate

Team Culture
• Guide the culture and attitude of the team in an optimistic, proactive, and encouraging direction
• Foster an environment where it is safe to fail and to learn from failure
• Actively mentor peers, developing their expertise and inspiring others to innovate
• Promote inclusion and collaboration between various functional disciplines

Initiative
• Take an active role in estimating, planning, and managing your own tasks while aligning your efforts with your team
• Provide timely assistance and remediation solutions during critical situations and production incidents
• Document and share "lessons learned" from production, including blameless postmortems and root cause analysis