From Code to Confidence%3A Evaluating AI Functions for Production