kipi.ai interview question

How do you evaluate large language models?