Summary of "LISA17 - Queueing Theory in Practice: Performance Modeling for the Working Engineer"

Overview

Use small experiments and simple queueing models to reason about latency, capacity, and scaling without huge-scale testing.

Serial (single-server) modeling

Key lessons from serial modeling

Parallel systems and scaling

Design patterns to improve scalability

Practical guidance / workflow

  1. State goals and assumptions explicitly.
  2. Run small tests and microbenchmarks; collect instrumentation data needed to fit models.
  3. Validate simple models against production-like data before trusting extrapolations.
  4. If unsure, draw the system timeline/queueing picture, write a small simulation, or consult standard queueing results/textbooks.
  5. Watch for unbounded queues (which lead to unbounded latency) and minimize variance. The simplest capacity improvement is reducing work per request.

References / concepts / systems mentioned

Main speaker / sources

Category ?

Technology


Share this summary


Is the summary off?

If you think the summary is inaccurate, you can reprocess it with the latest model.

Video