Site Reliability Engineer Job at EdHike LLC, Texas

  • EdHike LLC
  • Texas

Job Description

Job Title: Site Reliability Engineer (SRE)

Location: Austin, TX

Job Summary

We are seeking a Site Reliability Engineer (SRE) to join our team and ensure the reliability, availability, and performance of our production systems. You will bridge the gap between development and operations, applying software engineering principles to system administration and infrastructure management.

Responsibilities

  • Design, build, and maintain scalable and reliable infrastructure.
  • Develop and maintain automation tools for deployment, monitoring, and site reliability.
  • Monitor system performance and troubleshoot issues to ensure high availability.
  • Collaborate with development and DevOps teams to improve system reliability and scalability.
  • Conduct root cause analysis of production errors and implement sustainable solutions.
  • Define and measure Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets.
  • Participate in on-call rotations to support system uptime and respond to incidents.
  • Continuously improve CI/CD pipelines and operational processes.
  • Document systems, processes, and playbooks to facilitate knowledge sharing.

Requirements

Required:

  • Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience).
  • 3+ years of experience in SRE, DevOps, or a related field.
  • Proficiency with cloud platforms (e.g., AWS, GCP, Azure).
  • Strong skills in scripting or programming (e.g., Python, Go, Bash).
  • Experience with infrastructure as code tools (e.g., Terraform, Ansible).
  • Proficiency with containerization and orchestration (e.g., Docker, Kubernetes).
  • Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana, ELK, Datadog).
  • Strong understanding of networking, system internals, and distributed systems.

Preferred:

  • Experience with incident response and postmortem culture.
  • Knowledge of security best practices in cloud and infrastructure.
  • Certification in cloud technologies (e.g., AWS Certified DevOps Engineer).
