H-Test: identifying a set of "blindspot" tasks for LLMs that doesn't scale (not inverse, close to no effect) with language training | Manifund