Automating AI Workflow Testing with CI/CD and Evals.do