We gratefully acknowledge support from
the Simons Foundation and member institutions.

Grant Wilkins is qualified to endorse.

Offline Energy-Optimal LLM Serving: Workload-Based Energy Models for LLM Inference on Heterogeneous Systems

Grant Wilkins: Is registered as an author of this paper.
Can endorse for cs.AI, cs.DC. (why?)

Srinivasan Keshav and Richard Mortier are not registered as owners of this paper. (why?)