LLM BASELINEOpenAI · 2024

GPT-4o (zero-shot)

GPT-4o prompted zero-shot to generate CadQuery or OpenSCAD code. The Text2CAD paper reports a 93% invalidity rate, establishing it as a weak baseline. Used here as a reference point for general-purpose LLM capability without CAD-specific training.

Input
Text
Output
OpenSCAD / CadQuery
Venue
OpenAI
Year
2024
CAD Arena Status
Scheduled for evaluation on the full 200-prompt benchmark. Results will appear on the leaderboard at launch.