
The Inference Engine - Bringing Models to Life
What we learned from "AI/ML Models Are Not Libraries" is that models are essentially collections of numbers (weights) and, optionally, mathematical formulas. The "optionally" part is key, as we saw in "A Trip in the AI/ML Model Formats Jungle" that not all model files store the formulas themselves. Some formats are "Mostly Self-Confined," while others are "Weights-Only," expecting the application using them to "know" the underlying math. In "Anatomy of a Model - the Developer Perspective", we explored different architectures and their inputs and outputs. With that groundwork laid, we can now consider the inference process as a whole.






















