Research
How we build and evaluate.
Technical notes on the work behind our models: how we assemble training data, how we measure whether a model is actually useful, and what we learn along the way.
-
Building an Indonesian regulatory corpus
A specialist model is only as good as its corpus. How we assembled a large, clean body of Indonesia's public regulations, from gathering to deduplication, and what surprised us along the way.
-
Evaluating AI for Indonesian government work
Popular benchmarks say little about whether a model can trace a legal basis or draft an official letter. The evaluation we built instead, and how we grade it.