How to Score Complex AI Workflows and Chains of Thought