SRE Engineer

We are seeking a Site Reliability Engineer with a strong background in C#/.NET and modern JavaScript frameworks like Vue.js; hands-on experience with Docker and Kubernetes is a plus. This role focuses on building and supporting resilient, observable, and performant systems that power our applications. You’ll work across infrastructure, development, and operations to ensure our services are reliable, scalable, and continuously improving.


What will be your key responsibilities:

  • Ensure Service Reliability: Design and implement highly available and resilient systems using .NET.
  • Incident Response & Resolution: Lead root cause analysis and resolution efforts for production issues, minimizing downtime.
  • Performance Optimization: Continuously profile and optimize backend services and frontend performance.
  • Observability: Improve monitoring and alerting coverage using Application Insights, and strengthen metrics and logging strategies via OpenTelemetry.
  • Collaboration: Partner closely with development teams to design scalable applications and eliminate toil through automation.
  • Operational Excellence: Build and maintain automated deployment pipelines and manage container infrastructure.
  • Documentation & Runbooks: Maintain detailed operational documentation and incident response procedures.

What experience should you have:

Technical Skills

  • Programming: Proficiency in C#/.NET; experience with JavaScript/TypeScript and component-based frameworks like Vue.js, React, or Angular.
  • Containers & Orchestration: Strong experience with Docker and Kubernetes for deploying and managing applications.
  • Monitoring & Observability: Hands-on experience with Application Insights, Azure Monitor, Grafana, Prometheus, or similar.
  • Cloud Platforms: Familiarity with Azure, AWS, or Google Cloud Platform (GCP).
  • Automation & Scripting: Solid skills in scripting (e.g., PowerShell, Bash) and automating operational tasks.
  • Networking & Security: Understanding of TCP/IP, DNS, SSL, firewalls, and application-level security best practices.
  • CI/CD: Experience with pipelines and tooling such as GitHub Actions, Azure DevOps, or Jenkins.

Soft Skills

  • Problem-Solving: Strong analytical skills to troubleshoot complex, cross-layer issues in distributed systems.
  • Communication: Clear and proactive communicator who collaborates effectively with cross-functional teams.
  • Adaptability: Eagerness to learn and work across new technologies, tools, and workflows in a fast-paced environment.

What do you get in return:

Our team is composed of experts in their fields who are passionate about delivering high-quality work and maintaining a positive work culture. We value innovation, teamwork, and personal growth. As an experienced Site Reliability Engineer, you will have the opportunity to make a significant impact on our projects and contribute to the success of our organization. If you are ready to embrace exciting challenges and foster a culture of excellence, we encourage you to apply.

What do we offer:

  • Work remotely from anywhere in the world with a fully remote team, on a mutually agreed schedule that fits your needs (within core US working hours).
  • Work primarily with US-based colleagues, providing you with the opportunity to collaborate with people from diverse backgrounds and skill sets.
  • Use your skills and expertise to make a significant impact on the delivery of projects in our company.
  • Work in a supportive environment that values your contribution and provides you with the resources and training you need to grow in your career.
  • Enjoy a 40-hour workweek that provides you with a healthy work-life balance, and the time to pursue your personal and professional goals outside of work.

If you are a dedicated Site Reliability Engineer looking for an opportunity to work with a dynamic team, we would love to hear from you. Apply today!
