May 2-4, 2018 - Copenhagen, Denmark
Thursday, May 3 • 11:55 - 12:30
Pod Anomaly Detection and Eviction using Prometheus Metrics - David Benque & Cedric Lamoriniere, Amadeus (Beginner Skill Level) (Slides Attached)

Maintaining system stability in a distributed, changing environment is a challenge: a single failing pod can affect the majority of your system's responses. From Kubernetes probes to the Istio circuit breaker, CNCF projects provide multiple means of containing this kind of problem.

After a quick review of these mechanisms, the cases in which they apply, and their limitations, we will see how to react to problems that can only be revealed by internal application KPIs.

Maybe you would have liked to use a service mesh circuit breaker feature, but your traffic is not HTTP-based; or one pod continues to reply with HTTP code 200 alongside functionally incorrect content. And yet, you have functional indicators that could help trigger an immediate and orchestrated operational response.

We will see how to cover such cases thanks to dedicated controllers and Prometheus.
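
As a rough illustration of the idea only (not the speakers' implementation), the sketch below flags pods whose functional error ratio stands well above the fleet median. It assumes the per-pod ratios have already been fetched, for example from a per-pod Prometheus query; the function name, pod names, and thresholds are all hypothetical, and the eviction step itself is left out.

```python
from statistics import median


def find_anomalous_pods(error_ratio_by_pod, min_ratio=0.05, factor=3.0):
    """Return the pods whose error ratio is well above the fleet median.

    error_ratio_by_pod maps a pod name to a functional error ratio in [0, 1].
    In practice these values might come from a per-pod Prometheus query;
    fetching them is outside this sketch.
    min_ratio and factor are illustrative thresholds, not recommended values.
    """
    if len(error_ratio_by_pod) < 3:
        # Too few pods to tell an outlier from normal behaviour.
        return []
    fleet_median = median(error_ratio_by_pod.values())
    # A pod is flagged only if it exceeds both an absolute floor and a
    # multiple of what the rest of the fleet is doing.
    threshold = max(min_ratio, factor * fleet_median)
    return sorted(
        pod for pod, ratio in error_ratio_by_pod.items() if ratio > threshold
    )


# Hypothetical sample data: one pod returns mostly incorrect content.
ratios = {"booking-0": 0.01, "booking-1": 0.02, "booking-2": 0.40, "booking-3": 0.01}
print(find_anomalous_pods(ratios))  # prints ['booking-2']
```

Comparing each pod against the fleet median rather than a fixed limit means that a degradation shared by every pod flags nothing, which keeps a controller built on this check from evicting the whole deployment during a system-wide incident.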

Speakers
David Benque

Senior Software Engineer, Amadeus
Senior software engineer at Amadeus. Area of expertise: PaaS. Working on: application development, automation, system operability at scale, distributed systems. Languages: mainly Go and C++. Likes: sharing experience, learning and teaching. My mission at Amadeus is to implement the transition...
Cedric Lamoriniere

Software Engineer, Amadeus
Cedric Lamoriniere is a software developer at Amadeus working on the integration of OpenShift into the new internal PaaS "Amadeus Cloud Services". Cedric specializes in distributed systems development. His first public speaking experience was last year at RedisConf (SF) where...



Thursday May 3, 2018 11:55 - 12:30 CEST
Auditorium 11+12
  Observability, Beginner