We have geometry generators. We have simulation tools. We have optimization algorithms. None of them talk to each other. This is what needs to change.
The mechanical engineering loop hasn't fundamentally changed in 40 years. An expert sits at every handoff.
Each layer is getting AI tooling independently. Nothing connects top to bottom.
The text-to-CAD field has 13+ competing models, four different output formats, and no shared benchmark. Papers can't be compared. Progress is invisible.
Estimates based on current SOTA across 173 papers reviewed. L1 validity = best models achieve ~80–93% on simple shapes.
The history of ML is clear: ImageNet didn't just measure vision, it created it. SWE-bench didn't just measure coding agents, it shaped their development. A good benchmark is a forcing function.
A mechanical engineer describes what they need in plain language. The system generates geometry, checks it against manufacturing constraints, runs simulation, optimizes the design, and outputs a production-ready file.
Not a CAD copilot. A CAD engineer.
We're at step one: getting geometry generation right and measurable. But step one has to be done properly for the rest to follow.