Staff Site Reliability Engineer, Cloud Observability

Research and Development
200000SO Requisition #

Who is Genesys?

Every year, Genesys® delivers more than 70 billion remarkable customer experiences for organizations in over 100 countries. Through the power of the cloud and AI, our technology connects every customer moment across marketing, sales, and service on any channel, while also improving employee experiences. Genesys pioneered Experience as a ServiceSM so organizations of any size can provide true personalization at scale, interact with empathy, and foster customer trust and loyalty.


Why Genesys needs you:

We are investing 1 billion dollars in R&D over the next 4 years and need the right individuals to turn that investment into innovation. Genesys is bringing that innovation to customers through multi-cloud deployments in AWS, Azure, and Google Cloud. 

Connections matter, at certain times with greater urgency. Whenever the moment, our technology facilitates those connections creating an experience as a service. Our team members own their critical services and words like scalability, resiliency, and automation are at the heart of every line of code we write. 


What you’ll do: 

Your responsibility will include system design, configuration, deployment, and operations of Observability systems and tools. These systems include monitoring of services and infrastructure, log collection and analytics, and application performance monitoring (APM). Together these systems and tools serve as a critical part of Genesys Cloud infrastructure services. 

Your initial focus will be bringing the monitoring infrastructure that consists of open-source products such as Prometheus, Grafana, VictoriaMetrics, Zabbix, and other tools to the next level of availability and scale. Activities include:

  • Creating build and deployment pipelines for monitoring tools
  • Deployment of monitoring solutions into AWS and Azure regions, development and production environments
  • Developing a set of alerts and metrics to keep your own services alive and performing well
  • Collaborating with other SRE team members, working on improving efficiency and reliability of monitoring solutions
  • Advising multiple service teams on best practices for monitoring their services using your tools

Once you’ve done this (and it should not take much time for a person like you), in the next phase you’ll expand to:

  • Contributing to the entire Observability stack
  • Kubernetes-based systems
  • Evaluating, choosing, and implementing the next generation of tools 


What you should bring to the table:

You build it, you run it. You’ll play a role of the Observability expert and a go-to person for members of SRE and service teams that depend on your solutions to enable best-in-class monitoring, logging, and APM. You’re passionate about what you do and want to make it better, faster, more reliable. Think big scale and automation. Don’t be afraid to innovate and disrupt. Continuously educate yourself and learn new skills and technologies.


Who you are:

  • 7+ years of experience in software engineering
  • Operations and administration of open-source monitoring tools: Prometheus, Grafana, Zabbix are highly desirable
  • Experience in operations and administration of Elasticsearch, Kibana, and log collection tools such as logstash, beats, fluent-bit, fluentd is a big plus (or if you know just this part, we want to talk to you anyway)
  • Developing pipelines using CICD tools: Github Actions and/or Jenkins is a plus
  • Linux administration
  • Public cloud providers: AWS, Azure
  • Docker
  • Kubernetes


Genesys is an equal opportunity employer committed to diversity in the workplace. We evaluate qualified applicants without regard to race, color, religion, sex, sexual orientation, disability, veteran status, and other protected characteristics. #LI-DM2

Previous Job Searches

Activity Feed

Job shares through Genesys
Someone applied to the Manager, Cloud Network Operations position. 12 minutes ago
Someone applied to the Associate Sales Account Executive position. 12 minutes ago
Someone applied to the Sr. Human Resources Partner position. 12 minutes ago
Someone applied to the Reporting and Analytics Specialist, People Analytics position. 12 minutes ago
Riya Kariath referred the Sr. Cloud Security Engineer position. 14 minutes ago

Similar Listings

Ireland, Ireland, Galway, Galway

📁 Research and Development

Requisition #: 200000EE

Ireland, Ireland, Galway, Galway

📁 Research and Development

Requisition #: 200000KL

Ireland, Ireland, Galway, Galway

📁 Research and Development

Requisition #: 200000SM