American Homes 4 Rent - Las Vegas, NV

posted 3 months ago

Full-time - Mid Level
Remote - Las Vegas, NV
Real Estate

About the position

The Site Reliability Engineer (SRE) at AMH will play a crucial role in enhancing the reliability and performance of the company's systems by integrating practices from SecOps, DevOps, Quality Assurance, and IT operations. This position focuses on designing, building, and maintaining scalable systems while ensuring a balance between development speed and system reliability. The SRE will be responsible for automating IT infrastructure tasks, monitoring system performance, and leading incident response efforts, ultimately contributing to a seamless user experience.

Responsibilities

  • Design, develop, streamline, and deploy automation tools and frameworks to enhance the velocity, reliability, and efficiency of Azure-hosted services.
  • Implement and maintain monitoring, alerting, and incident response processes to ensure timely detection, resolution, and proactive detection of issues before impacting users.
  • Collaborate with software development teams to design and implement applications with a strong focus on reliability, scalability, security, and performance.
  • Perform root cause analysis of incidents and implement preventive measures to avoid similar issues in the future.
  • Work on capacity planning and scaling strategies to accommodate growing user bases and increasing workloads.
  • Define service level indicators, objectives, and agreements to continuously measure and manage system performance to ensure service quality meets business needs.
  • Continuously improve deployment pipelines and implement best practices for continuous integration and continuous deployment (CI/CD).
  • Stay current with industry trends and emerging technologies, integrating relevant ones into the organization's practices.
  • Provide mentorship and guidance to junior engineers and actively share knowledge within the team.

Requirements

  • Bachelor's degree in Computer Science, Information Technology, or a related field.
  • Minimum of five (5) years of experience in a Site Reliability Engineer, DevOps, or similar role is a plus.
  • Proficiency in at least one programming language (e.g., Python, Go, Java, C#) for scripting and automation tasks.
  • Strong understanding of system design, networking, and distributed systems principles.
  • Familiarity with cloud platforms (e.g., AWS, Azure, Google Cloud).
  • Hands-on experience administering Azure, along with strong understanding of core Azure services, workloads, subscriptions, and security.
  • Experience with containerization and orchestration technologies (e.g., Docker, Kubernetes).
  • Experience with Infrastructure as Code automation technologies (e.g., Terraform, Ansible, Puppet, Chef).
  • Experience with scripting tools (e.g., PowerShell, CLI, Bash).
  • Experience with developing and implementing disaster recovery and high-availability solutions and processes.
  • Certifications related to cloud platforms and DevOps practices are advantageous.
  • Azure DevOps Engineer, Solution Architect, and/or Support Engineer certification is highly desired.
  • Knowledge of monitoring and logging tools for observability and performance analysis (e.g., Azure Monitor, Log Analytics, Azure Data Explorer, Splunk, Grafana, Opsgenie).
  • Excellent problem-solving and troubleshooting skills, with a proactive and solution-oriented mindset.
  • Ability to work effectively in cross-functional teams and communicate technical concepts to both technical and non-technical stakeholders.
  • Strong collaboration and communication skills (both written and verbal), able to work effectively with cross-functional teams.
  • Excellent planning and organizational skills.
  • Entrepreneurial spirit and willingness to take prudent risks.
  • Ability to interact effectively at all levels.
  • Strong customer, quality, and results orientation.
  • Ability to be an effective member of project teams.

Nice-to-haves

  • Experience with Azure DevOps tools and practices.
  • Familiarity with Agile methodologies.
  • Experience in a fast-paced startup environment.

Benefits

  • Medical, dental, and vision insurance.
  • Flexible spending accounts and/or health savings accounts.
  • Dependent savings accounts.
  • 401(k) with company matching contributions.
  • Employee stock purchase plan.
  • Tuition reimbursement program.
  • 9 paid holidays per year.
  • Accrual of paid time off (PTO) at a rate of 0.0577 hours of PTO per hour worked, up to a maximum of 120 hours per year.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service