Site Reliability Engineer (SRE) Job at Openkyber, Georgia

UmZzRlZrSDl6WTRYajZzWVlhdUZIRWlMNEE9PQ==
  • Openkyber
  • Georgia

Job Description

Job Title: Site Reliability Engineer (AWS Cloud Infrastructure)
Hire Type: Contract
Compensation: $72 to $90 per hour
Job Reference: #J-18808-Ljbffr

Job Description:

We are seeking a Site Reliability Engineer (SRE) to join a dynamic team responsible for ensuring the reliability, scalability, and performance of AWS cloud infrastructure . This mid-senior level contractor role focuses on automating operational tasks, monitoring system health, and maintaining compliance with industry standards, particularly in the healthcare sector . Ideal candidates will have extensive experience in AWS , DevOps environments , and possess strong skills in scripting and networking .

Key Responsibilities:
  • Ensure the reliability and scalability of cloud-based systems hosted on AWS .

  • Automate operational tasks using tools and technologies like Terraform , Ansible , CloudFormation , or custom scripts.

  • Monitor system health , identify performance bottlenecks, and ensure high availability and resilience of cloud infrastructure.

  • Work closely with development and operations teams to implement continuous integration and continuous deployment (CI/CD) pipelines.

  • Implement and manage monitoring, alerting , and logging solutions (e.g., CloudWatch , Prometheus , Grafana ).

  • Support incident management, troubleshoot issues, and resolve problems quickly and efficiently to minimize downtime.

  • Maintain and ensure compliance with healthcare standards (e.g., HIPAA ) and best practices for security, performance, and data protection.

  • Document system architectures, troubleshooting processes, and runbooks for effective knowledge sharing.

  • Identify areas for improvement in infrastructure performance and cost efficiency, providing recommendations for optimizations.

  • Collaborate with DevOps teams to enhance automation, performance, and efficiency across systems and platforms.

Required Skills & Experience:
  • 3+ years of experience as a Site Reliability Engineer or in a similar role, with a strong focus on AWS cloud infrastructure .

  • Extensive experience working with AWS services such as EC2 , S3 , RDS , VPC , CloudFormation , EKS , and IAM .

  • Strong proficiency in scripting languages (e.g., Python , Bash , Shell , or PowerShell ) for automating tasks and creating custom tools.

  • Experience with monitoring and alerting systems (e.g., CloudWatch , Prometheus , Grafana ).

  • Familiarity with DevOps principles and tools like Jenkins , Docker , Kubernetes , and CI/CD pipelines.

  • Strong knowledge of networking concepts, such as TCP/IP , DNS , load balancing , firewalls , and VPNs .

  • Solid understanding of cloud security best practices and maintaining compliance (particularly in healthcare environments).

  • Experience working in high-availability and resilient cloud architectures.

  • Problem-solving and troubleshooting skills, with the ability to quickly diagnose and resolve production issues.

  • Collaboration and communication skills , with the ability to work effectively in cross-functional teams.

Preferred Skills:
  • Healthcare industry experience , especially in maintaining HIPAA compliance.

  • Familiarity with containerization technologies (e.g., Docker , Kubernetes ) and serverless architectures.

  • Experience with configuration management tools like Ansible , Chef , or Puppet .

  • AWS Certification (e.g., AWS Certified Solutions Architect or AWS Certified DevOps Engineer ).

Qualifications:
  • Bachelor's degree in Computer Science , Engineering , Information Technology , or a related field, or equivalent professional experience.

  • Ability to work independently and take ownership of reliability-related initiatives.

Job Tags

Hourly pay, Contract work, For contractors,

Similar Jobs

American IT Systems

Site Reliability Engineer (SRE) Job at American IT Systems

 ...Site Reliability Engineer (SRE) San Jose, CA, 95117 Experience of maintaining production systems on AWS and/or GCP. Experience of Kubernetes clusters maintenance, managing and debugging containerized applications (Golang, Java, Python). Understanding... 

Mount Sinai Health System

Medical Assistant I-Pediatrics Job at Mount Sinai Health System

 ...Description The Medical Assistant I provides clinical office support to physicians and surgeons and performs patient care and administrative...  ...Issuing Authority: AHA Non-Bargaining Unit, IBL - Pediatrics FPA Staff Ops - ISM, Icahn School of Medicine Employer... 

General Dynamics Information Technology

Incident and Intrusion Sr Manager Job at General Dynamics Information Technology

 ...Public Trust: SSBI (T5) Requisition Type: Pipeline Job Description INCIDENT AND INTRUSION SR MANAGER MEANINGFUL WORK AND PERSONAL IMPACT As an Incent and Intrusion Sr Manager, you will be part of a program that provides ongoing support for Custom and... 

King's Seafood Distribution

Meat Butcher Job at King's Seafood Distribution

 ...We keep it reel! Premium benefits, an amazing culinary family, growth opportunities, and more! Are you hooked yet? The butcher/meat cutter position is key in executing accurate prep with consistency and quality control to ensure that every guest has a memorable meal. As... 

Easterseals PORT Health

Board Certified Behavior Analyst / BCBA Job at Easterseals PORT Health

 ...Easterseals PORT Health, we believe in more than just behavior planswe believe in transforming lives. As a BCBA with our EMPOWER program, youll have the...  ...Qualifications: ~ Certification: Board Certified Behavior Analyst (BCBA) in good standing ~ Licensure: Licensed...