The first framework to generate industry-standard NURBS surfaces directly from text prompts, producing editable, parametric CAD models convertible to STEP format.
A two-stage pipeline: data preparation with automated NURBS extraction and captioning, followed by LLM fine-tuning for structured CAD generation.
BRep models are normalized and converted to untrimmed NURBS via pythonOCC. Control points, knot vectors, degrees, and weights are extracted for each face.
Multi-view renders (6 viewpoints) fed into InternVL3-13B with geometric metadata guidance: dimensions, holes, volume, surface area — producing shape-centric captions.
Qwen3-4B fine-tuned with LoRA (rank 64, α=128) on 180k steps across 4×H200 GPUs. Maps text captions to structured JSON NURBS representations.
Not all surfaces can be robustly represented by untrimmed NURBS. Thin regions around holes or fillets often introduce geometric artifacts. We detect degenerate faces using Chamfer Distance comparison (CD ≤ ε, where ε = 6×10⁻⁴) and fall back to analytic primitives.
NURBGen significantly outperforms all baselines in both geometric fidelity and visual alignment with text prompts.
5 CAD designers chose the best reconstruction per prompt. Higher is better.
| Method | User (1k) ↑ | GPT-4o ↑ |
|---|---|---|
| Undecided | 2.7 | 3.2 |
| GPT-4o | 1.5 | 1.9 |
| DeepCAD | 5.6 | 6.1 |
| Text2CAD | 26.1 | 27.2 |
| NURBGen (Ours) SOTA | 64.1 | 61.6 |
| Method | IR ↓ | CD ↓ | HD ↓ | JSD ↓ | MMD ↓ |
|---|---|---|---|---|---|
| GPT-4o | 0.17 | 7.20 | 0.36 | 72.87 | 4.17 |
| DeepCAD | 0.32 | 10.28 | 0.45 | 89.77 | 4.43 |
| Text2CAD | 0.05 | 9.66 | 0.42 | 85.27 | 4.54 |
| NURBGen (Ours) SOTA | 0.018 | 4.43 | 0.25 | 57.94 | 2.14 |
A large-scale curated dataset of 300k part-level CAD models with NURBS annotations and high-quality automatically generated captions.
Derived from the ABC dataset (1M CAD models), we decompose assembly-level designs into individual part-level components using PythonOCC, generating 3M instances. A complexity-aware filtering strategy retains 300k geometrically diverse parts.
High-fidelity 3D CAD models generated by NURBGen from text descriptions, converted to industry-standard BRep format.
NURBGen consistently produces more detailed, structurally coherent results with fewer geometric artifacts than all baselines.
Qualitative Comparison with Baselines.