Lead Site Reliability Engineer

Posted: Thursday, 09 October 2025
Valid Thru: Saturday, 08 November 2025
Index Requested on: 10/09/2025 14:35:45
Indexed on: 10/09/2025 14:35:45

Location: Praha, 20, 186 00, CZ

Industry: Technology
Occupational Category: 17-0000.00 - Architecture and Engineering
Type of Employment: FULL_TIME

Tricentis CZ s.r.o. is hiring!

Description:

Location: Cork, Ireland OR Prague, Czech Republic

Hybrid: 3 days in the office/week

As a Lead Site Reliability Engineer, you’ll be at the forefront of building scalable, resilient, and observable systems that power Tricentis SaaS products globally. This is a hands-on engineering leadership role—balancing technical delivery, process ownership, and team mentorship.

You will drive initiatives across multiple products, shape SRE standards, and serve as a trusted partner to both engineering and product leaders. You will be responsible for elevating engineering quality and reliability while enabling scale and speed.

Your Impact as an 🚀

  • Lead and deliver cross-cutting initiatives to improve platform scalability, resilience, and cost efficiency.

  • Architect and implement cloud-native infrastructure that supports multi-region, multi-tenant deployments.

  • Improve observability strategy across systems and teams—including SLOs, error budgets, and alerting standards.

  • Coach and mentor engineers, guiding technical design reviews and promoting engineering excellence.

  • Own post-incident analysis and ensure learning loops are completed with preventive action.

  • Influence product reliability from early-stage design to production readiness reviews.

  • Establish and evolve standards for deployments, operational readiness, and incident response.

  • Serve as a technical advisor for engineering and product managers across the org.

As a valuable member of our SRE team, you' ll have the opportunity to 💪

  • Drive architectural discussions and make decisions that influence the SRE org and wider engineering teams.

  • Define and evolve technical roadmaps and execution plans aligned with company goals.

  • Partner with peers in security, infrastructure, and product to drive platform-wide improvements.

  • Lead incident response for high-impact outages and continuously reduce incident recurrence.

  • Contribute to SRE hiring through interviews, onboarding, and process refinement.

  • Guide the adoption of modern tooling and practices across teams (e.g., GitOps, self-service platforms, chaos engineering).

  • Represent SRE in leadership forums, bringing insights, trade-offs, and forward-looking strategies.

About You 🎯

  • 6+ years of experience in SRE, Infrastructure, or DevOps roles, including technical leadership.

  • Expertise in building and operating production systems in public cloud (Azure).

  • Deep understanding of observability principles (SLOs, SLIs, metrics, traces, logs).

  • Strong experience with infrastructure-as-code, container orchestration, and CI/CD (Terraform, K8s, GitHub Actions).

  • Proven track record in leading technical projects, influencing architecture, and mentoring engineers.

  • Excellent communication and cross-functional collaboration skills.

  • Proactive, ownership-driven mindset with a passion for reliability and continuous improvement.

Our Tech Stack 🌐

AZURE, AWS, Terraform, GitHub Actions, Kubernetes, DataDog, Prometheus, Grafana, Betterstack, All-in-one incident management platform | incident.io, Jira and more

Our Culture 🦄

We don' t just preach our values; we embody them in everything we do. We are committed to creating an environment that empowers, supports, and includes individuals, where trust, transparency, creativity, curiosity, and continuous improvement thrive on a daily basis.

Tricentis Core Values:
Knowing what we need to achieve and how to achieve it is important. Tricentis' core values define our ways of working and the behaviors we model that create an enjoyable and successful Tricentis life.

  • Demonstrate Self-Awareness: Own your strengths and limitations.
  • Finish What We Start: Do what we say we are going to do.
  • Move Fast: Create momentum and efficiency.
  • Run Towards Change: Challenge the status quo.
  • Serve Our Customers & Communities: Create a positive experience with each interaction.
  • Solve Problems Together: We win or lose as one team.
  • Think Big & Believe: Set extraordinary goals and believe you can achieve them.

Responsibilities:

Please review the job description.

Educational requirements:

  • high school

Desired Skills:

Please see the job description for required or recommended skills.

Benefits:

Please see the job description for benefits.

Apply Now