BACK TO GLOSSARY

GLOSSARY TERM

What is Inference Engine?

The runtime environment where a trained machine learning model executes and generates predictions or text.
In Private AI deployments, the inference engine is hosted on secure, internal compute infrastructure (like private GPUs). This guarantees that the prompts and generated outputs never leave the enterprise network.

Host Secure Inference Engines

Run advanced AI models locally on secure infrastructure to maintain zero data leakage.