Up and running in a few minutes
CereVault Setup Guide
CereVault is the interface. You choose where the models run. Pick one of the three paths below, download a model, and start chatting. Everything stays on your device or your trusted local network.
Choose how your models run.
CereVault connects to a local model provider. Each option below is private and runs on your own hardware, so pick whichever fits you.
Run models on your Mac, share to your devices.
-
Install CereVault Serve
Get CereVault Serve from the Mac App Store and open it. It lives in your menu bar, with no Dock icon or windows in the way.
-
Pick a model from the catalog
Choose from the curated catalog of Llama 3.2, Gemma 2, Qwen 2.5, and Phi 3.5. It downloads directly to your Mac (about 1–3 GB each).
-
Turn on sharing (optional)
Want to use CereVault on your iPhone or iPad? Flip the device-sharing toggle in CereVault Serve. Keep your devices on the same Wi-Fi network.
-
Open CereVault
CereVault automatically discovers CereVault Serve on your machine and your local network, and prefers it as the default. Pick your model and start chatting.
Run models with Ollama.
-
Install Ollama
Download Ollama for your platform and install it.
-
Download a model
Pull a model from the Ollama library. For a small, fast starting point, run:
$ ollama run llama3.2Browse more options in the Ollama model library.
-
Keep Ollama running
Leave Ollama running in the background. It serves models locally at
http://localhost:11434. -
Open CereVault
CereVault finds Ollama on your machine automatically. Choose a downloaded model from the picker and start chatting.
Run models with LM Studio.
-
Install LM Studio
Download LM Studio for your platform and install it.
-
Download a model
In LM Studio, search the built-in catalog and download a model to your machine.
-
Start the local server
Open LM Studio's Developer (Local Server) tab and start the server. To reach it from your other devices, turn on serving over your local network.
-
Open CereVault
CereVault automatically detects LM Studio servers on your machine and across your local network. Pick a model from the picker and start chatting.
Pick a model and start chatting.
CereVault automatically finds AI servers on your network, including CereVault Serve, Ollama, and LM Studio, and connects in seconds. Switch models at any time, even mid-conversation, without losing your flow. Attach screenshots or photos and a vision-capable model can read them.
Your chats stay on your device. No accounts, no tracking, no cloud required.
Troubleshooting
- No models listed? Make sure your server (CereVault Serve, Ollama, or LM Studio) is running and at least one model is downloaded.
- Using another device? Keep it on the same Wi-Fi and turn on sharing in CereVault Serve.
- Slow first reply? Models can take a moment to load on first use, then it speeds up after that.
- Still stuck? Get in touch.