Expect to see in 2025: that is literally a single 2GB file you run on a Steam Deck to get a real-time DM assistant.
: It was based on a LLaMA-7B foundation model, fine-tuned with approximately 800k GPT-3.5 Turbo generations. gpt4allloraquantizedbin+repack