Christopher Batey is a Software Engineer by trade and is currently employed by DataStax as a Technical Evangelist for Apache Cassandra, previously he was Senior Software Engineer at BSkyB where he spent his time designing and developing their next generation platform that backs Sky Go, Now TV etc. He is a keen blogger, tweeter and open source advocate.
Are you developing applications that communicate over a network? Of course you are! This talk will take you through all the ways you can build fault-tolerant applications and how, once you get your team in the mindset that everything will eventually fail, dealing with the failures gracefully is no more work than building fragile applications
The topics covered will be:
What to do when one of the dependencies fails to respond in time
When to use network level time outs vs application level timeouts
What to monitor and how to monitor it, e.g connection pools, thread pools, queue sizes, latency
How to test for when the network is slow or saturated
How to test for when traffic is lost in transit
How to train your stakeholders to expect failure and get them to agree to fallbacks meaning they can choose availability over other requirements
When to use automated circuit breakers vs manual kill switches
Tips, hints and tricks for doing all of the above in Java
The topics covered are especially relevant if your application has a lot of dependencies that it communicates with over a network i.e. microservices.