Senior Site Reliability Engineer (SRE Job at GovServicesHub, New York, NY

VFBZSlZVbjB4WWtWamF3WVpLaUdGRUNK
  • GovServicesHub
  • New York, NY

Job Description

Title: Senior Site Reliability Engineer (SRE)
Location: Remote

About January

At January, we’re transforming the lives of borrowers by bringing humanity to consumer finance. Our data-driven products empower financial institutions to streamline collections and help borrowers regain financial stability and control over their lives. We’re not just expanding access to credit — we’re restoring dignity and paving the way for millions to achieve financial freedom.

About the Role

As a Senior Site Reliability Engineer (SRE) , you will establish SRE practices from the ground up — ensuring reliability, scalability, and performance as January scales from thousands to millions of borrowers. You’ll architect resilient infrastructure, design modern observability solutions, and build sustainable on-call processes that evolve with our rapid growth.

Your work will directly address scaling challenges including database optimization, async workflow infrastructure, and data pipeline reliability — enabling the engineering team to ship confidently and efficiently.

Key Responsibilities

  • Lead incident response and develop sustainable on-call practices, including runbooks, blameless postmortems, and continuous improvement to reduce MTTR.
  • Build and maintain self-service observability tools (Datadog, Prometheus, ELK) for proactive monitoring and troubleshooting.
  • Create and maintain Infrastructure as Code (IaC)using Terraform or CloudFormation for consistent, secure AWS environments.
  • Partner with development teams to architect resilient, scalable infrastructure for critical components like databases, networking, async workflows, and data pipelines.
  • Design and implement robust CI/CD pipelines (GitHub Actions) with advanced deployment strategies (blue/green, canary).
  • Drive best practices in reliability and performance early in the design phase to future-proof January’s systems.

Required Skills & Experience

  • Proven experience leading incident response and postmortem processes for high-availability production systems.
  • Deep expertise in designing highly available architectures (EC2, Fargate, auto-scaling, health checks, graceful degradation).
  • Strong experience with AWS cloud infrastructure and IaC tools (Terraform, CloudFormation).
  • Hands-on experience with CI/CD automation using GitHub Actions or equivalent tools.
  • Proficiency in observability and monitoring stacks ( Datadog, Prometheus, ELK ).
  • Solid scripting/programming skills in Python (for automation, tooling, and debugging).
  • Excellent communication and documentation skills, with the ability to collaborate across engineering and platform teams.

Requirements

Tools & Technologies

  • Cloud: AWS
  • IaC: Terraform, CloudFormation
  • CI/CD: GitHub Actions
  • Monitoring: Datadog, Prometheus, ELK
  • Languages: Python
  • Infrastructure: EC2, Fargate

Additional Details

  • Remote role (NYC-based preferred for hybrid collaboration).
  • Opportunity to build and own the entire SRE practice for a growing FinTech startup.
  • Fast-paced, innovative environment working on AI-forward consumer finance products.

Job Tags

Contract work, Remote work,

Similar Jobs

Nanny Poppins Agency

Part Time Nanny Job at Nanny Poppins Agency

 ...Full-Time Nanny A family in Bronxville, NY is seeking a reliable and nurturing part-time nanny to provide dedicated care for their...  .... Compensation: ~$30$35 per hour Schedule: ~ MondayFriday ~7:00 AM 3:00 PM ~ Guaranteed 40 hours Responsibilities... 

Toshiba Global Commerce Solutions

Senior software architect Job at Toshiba Global Commerce Solutions

 ...use of the self-checkout at Lowe's Foods, earned fuel rewards at Kroger, or just paid for purchases at retailers such as Walmart, Michaels, Carrefour, The Gap, Calvin Klein, Boots, Cencosud, BJ's, or Costco? These are just a few examples of our in-store solutions and impressive... 

freetobeemyself

Remote Business & Leadership Mentor Coach Job at freetobeemyself

 ...real impact, and enjoy true flexibility? Free To Bee Myself is seeking remote project managers, leadership mentors, career coaches, and self-motivated professionals ready to build a flexible online business with global impact. Whats on Offer: Remote... 

Prime Therapeutics

Data & Reporting Analyst - Remote Job at Prime Therapeutics

 ...we serve. Looking for a purpose-driven career? Come build the future of pharmacy with us. Job Posting Title Data & Reporting Analyst - Remote Job Description The Data & Reporting Analyst is responsible for supporting the business in decision making and... 

Orin Swift Orin Swift

Intern-Hr OD - 300980 Job at Orin Swift Orin Swift

 ...-Gallo offers meaningful work, competitive compensation, benefits, and a culture that supports your well-being and growth. As our HR Intern for Organizational Development, you'll gain cross-functional exposure, hands-on project experience, and opportunities to build skills...