BACK TO GLOSSARY
GLOSSARY TERM
What is Inference Engine?
The runtime environment where a trained machine learning model executes and generates predictions or text.
In Private AI deployments, the inference engine is hosted on secure, internal compute infrastructure (like private GPUs). This guarantees that the prompts and generated outputs never leave the enterprise network.
Host Secure Inference Engines
Run advanced AI models locally on secure infrastructure to maintain zero data leakage.