Join Esker as a Site Reliability Engineer

Esker is a global leader in AI-powered business solutions for the Office of the CFO. Our Source-to-Pay and Order-to-Cash platform serves 3,000+ customers (including NVIDIA, Whirlpool, Sony, and Trek), supporting 1.2M users and generating $200M+ in annual revenue. Founded in 1985, we've grown to 1,200+ employees worldwide with a culture focused on improving working capital, human relationships, and environmental sustainability. Esker operates on five continents with global headquarters in Lyon, France, and two U.S. locations in Madison, WI, and Denver, CO.

The Role

We’re seeking a Site Reliability Engineer to join our US-based SRE team of 10 engineers, part of a global SRE organization of 35+ professionals responsible for the reliability, scalability, and performance of our multi-tenant SaaS platform. In this role, you’ll work hands-on with large-scale production systems, collaborating with experienced engineers while helping drive automation, observability, and reliability improvements across the platform.

This is an exciting opportunity for someone moving from systems administration or DevOps into SRE, or for an early-career SRE looking to deepen their expertise across a broad and modern technology stack, all while supporting a platform used by more than a million users worldwide.

What You'll Do

Improve platform reliability and availability by participating in our on-call rotation, learning from real production incidents, and helping evolve our incident response and post‑incident practices
Increase delivery speed and consistency through infrastructure and deployment automation using Terraform, Ansible, and Azure DevOps
Reduce time to detect and resolve issues by building actionable monitoring, alerting, and dashboards that improve visibility into system health and performance
Enable scalable growth by designing and operating infrastructure that supports a rapidly expanding global customer base across Azure and on‑premises environments
Empower engineering and support teams by delivering self-service tools, automation, and guardrails that allow safe and efficient production operations
Strengthen a culture of reliability and continuous improvement through agile collaboration, knowledge sharing, blameless postmortems, and cross‑team initiatives

Our Technology

You'll work in a hybrid cloud environment with infrastructure spanning Azure and on-premises data centers. Our stack includes Kubernetes, PostgreSQL, Elasticsearch/OpenSearch, Kafka, Redis, and modern observability tools (Prometheus, Grafana, OpenTelemetry). Automation is central to how we work: Terraform and Ansible for infrastructure, Azure DevOps for CI/CD.

What We're Looking For

3+ years in a Site Reliability, DevOps, or related role supporting production systems
2+ years of hands-on experience operating and maintaining Linux systems in live production environments (Windows Server experience is a plus)
Bachelor's degree in Computer Science or a related field preferred (equivalent experience considered)
Practical experience with Infrastructure as Code tools such as Terraform, Ansible, Chef, Puppet, or similar
A solid foundation in networking concepts, including TCP/IP, DNS, firewalls, and proxying
Experience working with relational and/or NoSQL databases in production environments
Scripting proficiency in one or more languages such as Python, Bash, or PowerShell

Schedule & On-Call

This is a hybrid role requiring at least two days onsite in Middleton, WI per week. The team primarily works standard business hours (Monday–Friday), with flexibility to support collaboration and focused work.

The role includes participation in a rotating, well‑balanced on‑call schedule designed to be sustainable and predictable. On‑call responsibilities are compensated, including additional pay and per‑incident compensation, and are supported by strong documentation, escalation paths, and a collaborative team approach. On‑call coverage consists of:

One evening approximately every two weeks (5–9pm CT)
One weekend approximately every 6–8 weeks (Friday 5pm – Sunday 7pm CT)

Why Join Us

Gain experience across a diverse, modern tech stack with mentorship from a global team
Clear career development path with promotion opportunities and access to roles across Esker
Work on systems at meaningful scale supporting major enterprise customers
Collaborative agile environment with emphasis on knowledge sharing and psychological safety

Salary Range: $95,000-$119,000 + on-call pay

This posting will close Tuesday, February 10th at 12 PM CT.

Employee Benefits

Student loan repayment assistance
Flexible work schedule, summer hours, and work from home options
Profit sharing options
Paid time off for community outreach and volunteer opportunities
Yearly stipend for employee wellness, hobbies, or educational activities
Dog-friendly work environment
Competitive salary and benefits package

Come as You Are

Esker is an equal opportunity employer that celebrates diversity and is committed to maintaining an inclusive and mutually respectful work environment. We do not discriminate against any qualified applicants on the basis of: race, color, national origin, religion, age, sex, sexual orientation, gender identity or expression, pregnancy, disability, medical history, military or veteran status, and any other characteristic protected by applicable law.
