Meridian Eval: a public benchmark for tool-routing failures in LLM agents | Manifund