The time delay between sending an input to an AI system and receiving a response. For user-facing applications, latency matters enormously: users expect responses within milliseconds, not seconds. This drives data center placement decisions, because the network distance between a user and the data center adds directly to latency.
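As a rough illustration of why distance matters, the sketch below estimates the minimum round-trip time imposed by signal propagation alone. It assumes light in optical fiber covers about 200 km per millisecond (roughly two-thirds of its speed in a vacuum); the function name and the example distances are hypothetical, and real latency will be higher once routing, queuing, and the AI system's own processing time are added.

```python
# Assumption: signals in optical fiber travel roughly 200 km per millisecond.
FIBER_KM_PER_MS = 200.0


def min_round_trip_ms(distance_km: float) -> float:
    """Lower bound on round-trip time from propagation delay alone."""
    if distance_km < 0:
        raise ValueError("distance_km must be non-negative")
    # The request travels to the data center and the response travels back.
    return 2 * distance_km / FIBER_KM_PER_MS


# Hypothetical user-to-data-center distances, chosen only for illustration.
for label, km in [("same metro area", 50),
                  ("cross-country", 4000),
                  ("intercontinental", 10000)]:
    print(f"{label}: at least {min_round_trip_ms(km):.1f} ms")
```

Under these assumptions, a data center 4,000 km away adds at least 40 ms to every exchange before any computation happens, which is one reason operators place facilities closer to users.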
Discussed in Chapter 1 of This Is Server Country