Senior Site Reliability Engineer (SRE) - (Dublin, CA) Job at Articul8 AI, Dublin, CA

UlA4TVZrdjF3bzBSaWFRZVpLcUNIa3FNNWc9PQ==
  • Articul8 AI
  • Dublin, CA

Job Description

About Us

Articul8 AI is at the forefront of Generative AI innovation, delivering cutting-edge SaaS products that transform how businesses operate. Our platform empowers organizations to leverage the power of artificial intelligence in a reliable, scalable, and secure environment.

Position Overview

We are seeking an experienced Site Reliability Engineer (SRE) to join our team and help ensure the reliability, performance, and scalability of our GenAI SaaS platform. As an SRE, you will bridge the gap between development and operations, implementing automation and best practices to maintain our service reliability objectives while supporting rapid innovation.

Key Responsibilities

  • Architect and maintain scalable, highly available infrastructure for our GenAI platform.
  • Design and implement robust monitoring, alerting, and observability solutions to proactively ensure system health and performance.
  • Automate deployment, scaling, and management of our cloud-native infrastructure, reducing toil and improving efficiency.
  • Define, measure, and improve Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to deliver outstanding service quality.
  • Participate in on-call rotations and provide rapid response to production incidents, minimizing downtime and user impact.
  • Collaborate closely with development teams to build reliable, scalable, and efficient systems for complex AI workloads.
  • Lead incident response efforts, conduct thorough post-mortems, and champion continuous improvement initiatives.
  • Optimize infrastructure for performance, scalability, and cost-effectiveness—especially for high-demand AI workloads.
  • Implement and enforce security best practices across all systems and environments.
  • Create and maintain comprehensive documentation, including runbooks and knowledge base articles, to foster a culture of shared knowledge.

Qualifications

Required

  • Bachelor's degree in Computer Science, Engineering, or related field, or equivalent practical experience
  • 8+ years of experience in DevOps, SRE, or similar roles
  • Strong experience with cloud platforms (AWS, GCP, or Azure)
  • Proficiency in at least one programming/scripting language (Python, Go, Bash, etc.)
  • Hands-on experience with infrastructure as code tools (Terraform, CloudFormation, etc.)
  • Solid background in containerization technologies (Docker, Kubernetes)
  • Proven experience with monitoring and observability tools (Prometheus, Grafana, ELK stack, etc.)
  • Strong understanding of CI/CD pipelines and automation
  • Exceptional troubleshooting and problem-solving skills and ability to troubleshoot complex systems

Preferred

  • Experience supporting AI/ML systems in production
  • Knowledge of GPU infrastructure management and optimization
  • Familiarity with distributed systems and high-performance computing
  • Experience with database systems (SQL and NoSQL)
  • Certifications in cloud platforms (AWS, GCP, Azure)
  • Experience with chaos engineering and resilience testing
  • Knowledge of security best practices and compliance requirements

Ready to shape the future of resilient software systems? Apply now and help drive the reliability of tomorrow’s AI at Articul8 AI!

Job Tags

Similar Jobs

CornerStone Staffing

Customer Service Representative Job at CornerStone Staffing

 ...Staffing is hiring for a company dedicated to delivering top-notch support for customers garage door systems and wireless devices. If you...  ...that promotes from within, you could be our next Customer Service Representative! Customer Service Representative Location:... 

HydroGeoLogic, Inc.

Civil/Environmental Engineer III Job at HydroGeoLogic, Inc.

 ...Job Title: Engineer III City & State Location: St. Louis, MO Office (Hazelwood, MO) Civil/Environmental Engineer III Description/Job Summary HGL - WHO WE ARE At HGL, we value our employees as individuals and as important members of our team. We offer a work... 

Wetzel's Pretzels

Roller, Baker - Cold Stone Creamery & Wetzel's Pretzels - New Albany Job at Wetzel's Pretzels

 ...Decorate and present baked goods in an appealing manner - Maintain cleanliness and organization in the kitchen area - Adhere to all food safety and sanitation regulations - Collaborate with other kitchen staff to ensure smooth operations - Coordinate and expedite... 

3 Lions Logistics

CDL Class A Local Driver - Home Daily Job at 3 Lions Logistics

 ...Inc (3LL) is an established trucking company based out of...  ...seeking experienced and reliable Class A CDL Truck Driver to join its workforce. 3LL...  ...this position will be driving locally throughout MA, NH, ME, and...  ...are expected to conduct daily pre/post-trips and maintain... 

Gotham Podiatry PC

Surgical Medical Assistant Job at Gotham Podiatry PC

**Job Title: Surgical Medical Assistant****Job Description:** We are seeking a dedicated and skilled Surgical Medical Assistant to join our dynamic medical team. The successful candidate will assist healthcare professionals in carrying out various surgical and medical...