🧬

Piro

In Progress

Personal Intelligence — a tiny RL-first LLM trained on your own knowledge base.

About

A ~10M parameter language model trained from scratch using reinforcement learning (GRPO). Piro learns from your personal knowledge base — corrections, discoveries, preferences — and distills them into a model that actually knows you. The intelligence layer of the thesis: personalized, cheap, yours.

Tech Stack

PythonReinforcement LearningLLMGRPOPKM

📦 Source Code

← All Projects