About

Inferencer allows you to download, run and deeply control the latest SOTA AI models (GPT-OSS, DeepSeek, Qwen and more) on your own computer.

No data is sent to the cloud for processing - maintaining your complete privacy.
Advanced inferencing controls give you complete control on their accuracy and outputs.

Understand what the AI is thinking



Inferencer respects your privacy.
All AI processing happens on your device.
No telemetry, no background "update" checks.

Models

Start in the models section where you can select the location of existing models or download new ones directly from Hugging Face.

Chats

Select the model to interact with on the top menu bar and write a prompt to begin. At any point you can switch between models and continue the chat to see what else they can uncover. You can also selectively delete past messages to keep the model focused and less scatterbrain.

Chat Controls

Control the inferencing parameters including intensity of processing - which allows you to multi-task with other applications better.

Token Inspection

Select the inspector to peek into the inner-workings of each word outputted and see the model's confidence levels and alternative choices.

Prompt Framing

Expanding the prompt section to utilise the framing feature which allows you to control the output the model generates.

Settings

Including parental controls, setting up an automatic deletion policy and configuring font sizes.

Privacy

All AI processing happens offline and on your device. No data is sent to online servers for maximum privacy.

Subscribe for updates

With more features coming soon, you can be the first to know.