Elevance Health interview question

How to optimize API call usage to AI models under different constraints?