
Site Reliability Engineer (USA Only - 100% Remote)

Work from home Full-time role Hiring

This is a fully remote job; the offer is available from North America, Europe, and the United States.

About Us

Close is a bootstrapped, profitable, 100% remote, ~100 person team of thoughtful individuals who prioritize taking ownership and making a meaningful impact. We’re eager to make a product our customers fall in love with over and over again. We love small scaling businesses. Since 2013, we’ve been building a CRM that focuses on better communication, without the hassle of manual data entry or a complex UI. We are out to supercharge sales productivity with the most modern, thoughtfully designed, all-in-one, communication-focused CRM.

Our backend tech stack consists primarily of Python Flask web apps, with our TaskTiger scheduler handling much of the backend asynchronous task processing. Our data stores include MongoDB, PostgreSQL, Elasticsearch, and Redis. The underlying infrastructure runs on AWS using a combination of managed services like EKS, MSK, RDS, and ElastiCache, and non-managed services running on EC2 instances. We have CI/CD pipelines that build Docker images, run automated tests, and deploy to Kubernetes clusters. We also use these images in our local development environment, which lets us code locally against all of our services. We have a well-documented public API that is consumed by our front-end JavaScript app as well as numerous integrations. Our infrastructure is heavily automated using Terraform, Ansible, and other AWS tools.

We love open sourcing our code and ideas on our GitHub and on The Making of Close, our behind-the-scenes Product & Engineering blog. Check out our open source projects like close-mongo-ops-manager, SocketShark, TaskTiger, LimitLion, and ciso8601.

About the Role

  • You will be joining the Infrastructure Team at Close. This team builds and maintains the platform that runs all Close systems (and do we have a lot of those). Work with us and you’ll be working with:
  • Multi-terabyte MongoDB, PostgreSQL, and Elasticsearch clusters
  • Telemetry systems built on Grafana’s LGTM stack and ClickHouse processing over 130 TB per month
  • Multiple Kubernetes clusters running tens of thousands of pods
  • GitHub Actions & ArgoCD powered CI/CD that can go from merged, to production, to rolled back in 10 minutes
  • A system that is stable, up to date, and hasn’t needed scheduled downtime in 4 years

About You

  • You are a rock in the storm. With your hard won expertise, gained through battles won and lost, you consistently build robust systems from quality components fit to underpin mission critical applications. You value simplicity over familiarity. You value resilience over speed. You take pride in building composable and maintainable tools.
  • You’ve worked with a diverse array of infrastructure tools and systems, including:
  • CI/CD (CircleCI, GitHub Actions, ArgoCD)
  • Configuration Management (Ansible, Terraform)
  • Databases (Elasticsearch, MongoDB, PostgreSQL, ClickHouse)
  • Cloud Computing (Kubernetes, AWS)
  • Telemetry (Loki, Tempo, Grafana, Mimir/Prometheus)
  • You're comfortable working in a fast-paced environment with a small and talented team where you're supported in your efforts to grow professionally. You're able to manage time well, communicate effectively, and collaborate in a fully distributed team.
  • Come help us with projects like...
  • Fully automating our databases’ lifecycles with Argo Workflows
  • Eliminating all static credentials, wherever they may be
  • Reducing downtime and disruption due to maintenance or disaster to new lows
  • Improving our multi-region disaster recovery system

Requirements

  • Senior 1 & 2 level candidates should have 5+ years of experience building modern infrastructure systems.
  • Staff level candidates should have 8+ years of experience.
  • The buck stops with you! You are the kind of person who is respected as an expert on the systems you run.
  • You have been the final point of escalation in the support of mission critical production systems
  • You are familiar with some of the following technologies: AWS, Terraform, Kubernetes, Ansible, MongoDB, PostgreSQL, Elasticsearch
  • You have a strong grasp of common networking and data transfer protocols such as DNS, HTTP, TCP
  • You are able to speak and write in English
  • You are located in the USA (ET, CT, MT, PT)
  • Bonus points if you have…
  • Contributed open source code related to our tech stack
  • Experience maintaining very large databases
  • Been through a successful disaster response
  • Experience with multi-region architectures
  • Run MLOps systems
  • Experience scaling Temporal

Benefits

  • Competitive compensation including an organization-wide goal-b
