Personal Intelligence — a tiny RL-first LLM trained on your own knowledge base.
A ~10M-parameter language model trained from scratch using reinforcement learning (GRPO). Piro learns from your personal knowledge base (corrections, discoveries, preferences) and distills them into a model that actually knows you. It is the intelligence layer of the thesis: personalized, cheap, yours.
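For readers unfamiliar with GRPO, the core idea is that several completions are sampled for the same prompt and each one is scored relative to the others in its group, so no separate value network is needed. The sketch below shows only that group-relative advantage step. It is not Piro's training code: the function name `group_relative_advantages`, the `eps` parameter, the example rewards, and the choice of population standard deviation are all assumptions made for illustration.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize rewards within one group of completions for the same prompt.

    Hypothetical helper: each advantage is (reward - group mean) / group std.
    eps avoids division by zero when every completion gets the same reward.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; an assumption for this sketch
    return [(r - mu) / (sigma + eps) for r in rewards]


# Made-up rewards for four completions sampled from one prompt.
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)

# Completions that beat the group average get a positive advantage,
# the rest get a negative one; a uniform group yields all zeros.
uniform = group_relative_advantages([0.5, 0.5, 0.5])
```

In a full GRPO update these advantages would weight the policy-gradient term for every token of the corresponding completion. That part is omitted here because the source does not describe the reward function or the training loop.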