Beyond-Unit-Tests%3A-Why-Your-AI-Needs-a-Dedicated-Evaluation-Framework