[Remote] Senior Site Reliability Engineer

Work from home, full-time role

Note: This is a remote position open to candidates in the USA. Ellucian powers innovation for higher education, serving over 21 million students globally. The company is seeking a Senior Site Reliability Engineer to ensure the reliability, performance, and cost-efficiency of its production systems, with a focus on DevOps practices and incident management.

Responsibilities

  • Own and improve system reliability, availability, and performance for production environments
  • Design, implement, and manage monitoring, alerting, and observability using DataDog (required)
  • Lead incident response efforts, including troubleshooting, mitigation, and post-incident reviews
  • Perform detailed root cause analysis (RCA) and drive permanent resolutions
  • Partner with engineering and DevOps teams to build scalable, resilient infrastructure
  • Automate operational processes to improve efficiency and reduce risk
  • Analyze and optimize infrastructure and application costs
  • Define and manage SLIs/SLOs to meet reliability targets
  • Continuously improve deployment, monitoring, and operational practices

Skills

  • 5+ years of experience in Site Reliability Engineering, DevOps, or similar roles
  • Strong, hands-on expertise with DataDog (APM, logs, metrics, dashboards, alerting)
  • Experience with cloud platforms (AWS, Azure, or GCP)
  • Proficiency in DevOps practices and tools (CI/CD, Infrastructure as Code such as Terraform)
  • Strong troubleshooting skills and experience conducting root cause analysis in distributed systems
  • Experience with containers and orchestration (Docker, Kubernetes)
  • Scripting or programming experience (Python, Bash, or similar)
  • Proven ability to analyze and optimize cloud costs
  • Experience with cost management tools (e.g., AWS Cost Explorer, Azure Cost Management)
  • Familiarity with cloud security and compliance best practices
  • Experience supporting high-availability, customer-facing systems
  • Strong collaboration and communication skills

Benefits

  • Comprehensive health coverage: medical, dental, and vision
  • Flexible time off
  • Thrive Flex Lifestyle Account (LSA) that allows you to contribute towards your health, financial or learning interests
  • 401(k) with match and BrightPlan to help you save for the future
  • Parental Leave
  • 5 charitable days to support the community that supports us
  • Telemedicine
  • Wellness
  • Headspace Care (mental health)
  • Wellbeats (virtual fitness classes)
  • RethinkCare & Wellthy (caregiver support)
  • Diversity and inclusion programs which provide access to internal employee resource groups
  • Employee referral bonuses to encourage the addition of great new people to the team
  • A learning culture with an Education Assistance Program and professional development opportunities

Company Overview

  • Ellucian delivers the software, services, and insights that help your institution thrive. It was founded in 1968 and is headquartered in Fairfax, Virginia, USA, with a workforce of 1001-5000 employees. Its website is http://www.ellucian.com.

Company H1B Sponsorship

  • Ellucian has a track record of offering H1B sponsorships, with 2 in 2026, 31 in 2025, 27 in 2024, 28 in 2023, 31 in 2022, 33 in 2021, 30 in 2020. Please note that this does not guarantee sponsorship for this specific role.