Site Reliability Engineer Job at EdHike LLC, Texas

T3RDeENhcUNheDVUL3VzaEpueFU1cnk1UGc9PQ==
  • EdHike LLC
  • Texas

Job Description

Job Title: Site Reliability Engineer (SRE)

Location: Austin, TX

Job Summary

We are seeking a Site Reliability Engineer (SRE) to join our team and ensure the reliability, availability, and performance of our production systems. You will bridge the gap between development and operations, applying software engineering principles to system administration and infrastructure management.

Responsibilities

  • Design, build, and maintain scalable and reliable infrastructure.
  • Develop and maintain automation tools for deployment, monitoring, and site reliability.
  • Monitor system performance and troubleshoot issues to ensure high availability.
  • Collaborate with development and DevOps teams to improve system reliability and scalability.
  • Conduct root cause analysis of production errors and implement sustainable solutions.
  • Define and measure Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets.
  • Participate in on-call rotations to support system uptime and respond to incidents.
  • Continuously improve CI/CD pipelines and operational processes.
  • Document systems, processes, and playbooks to facilitate knowledge sharing.

Requirements

Required:

  • Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience).
  • 3+ years of experience in SRE, DevOps, or related fields.
  • Proficiency with cloud platforms (e.g., AWS, GCP, Azure).
  • Strong skills in scripting or programming (e.g., Python, Go, Bash).
  • Experience with infrastructure as code tools (e.g., Terraform, Ansible).
  • Proficiency with containerization and orchestration (e.g., Docker, Kubernetes).
  • Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana, ELK, Datadog).
  • Strong understanding of networking, system internals, and distributed systems.

Preferred:

  • Experience with incident response and postmortem culture.
  • Knowledge of security best practices in cloud and infrastructure.
  • Certification in cloud technologies (e.g., AWS Certified DevOps Engineer).

Job Tags

Similar Jobs

Greenlife Healthcare Staffing

Job # T10016 - Patient Care Technician/Nursing Assistant/Travel - Detox Job at Greenlife Healthcare Staffing

Patient Care Technician/Nursing Assistant/Travel - Detox - Greenport, NY (#T10016) Previous Nursing Assistant experience in a hospital, nursing home, or ambulatory setting preferred. Greenlife Healthcare Staffing is seeking a Patient Care Technician/Nursing Assistant... 

LaSalle Network

Inpatient Coder Job at LaSalle Network

Are you an experienced Inpatient Coder looking for a remote role where your expertise truly matters? Join a collaborative and forward-thinking healthcare organization dedicated to accuracy, compliance, and excellent patient care. Inpatient Coder Responsibilities:... 

InProduction

Scenic Installer Job at InProduction

 ...InProduction is the leading provider of temporary seating, staging, structures, and scenic production for the U.S. live events industry. The Company is a valuable partner to event organizers throughout the entire venue transformation process, with core services including... 

HelpFlow

Virtual Assistant (Bookkeeping/Accounting Focus) - Remote Job at HelpFlow

 ...Position : Virtual Assistant (Bookkeeping/Accounting Focus) Working Hours: US Business Hours Hiring Company: We are a 10 year old remote staffing business with a fully remote team of 100+ employees. We started as a customer service agency, but have leveraged our... 

Phillips, Richard & Rind, P.A.

Associate Attorney (union-side labor law) Job at Phillips, Richard & Rind, P.A.

 ...Phillips, Richard & Rind, P.A. Associate Attorney (union-side labor law) Based in Miami, FL Leading Miami-based union-side labor law firm, Phillips, Richard & Rind, PA., seeks a dedicated Associate Attorney with 3-5 years experience to assist in the...