Papers
arxiv:2602.16317

CADEvolve: Creating Realistic CAD via Program Evolution

Published on Feb 18
· Submitted by
DMITRII ZHEMCHUZHNIKOV
on Feb 19
Authors:
,
,
,
,
,
,

Abstract

CADEvolve presents an evolution-based approach using VLM-guided edits to generate complex CAD programs from simple primitives, creating a large dataset for improved Image2CAD performance.

AI-generated summary

Computer-Aided Design (CAD) delivers rapid, editable modeling for engineering and manufacturing. Recent AI progress now makes full automation feasible for various CAD tasks. However, progress is bottlenecked by data: public corpora mostly contain sketch-extrude sequences, lack complex operations, multi-operation composition and design intent, and thus hinder effective fine-tuning. Attempts to bypass this with frozen VLMs often yield simple or invalid programs due to limited 3D grounding in current foundation models. We present CADEvolve, an evolution-based pipeline and dataset that starts from simple primitives and, via VLM-guided edits and validations, incrementally grows CAD programs toward industrial-grade complexity. The result is 8k complex parts expressed as executable CadQuery parametric generators. After multi-stage post-processing and augmentation, we obtain a unified dataset of 1.3m scripts paired with rendered geometry and exercising the full CadQuery operation set. A VLM fine-tuned on CADEvolve achieves state-of-the-art results on the Image2CAD task across the DeepCAD, Fusion 360, and MCB benchmarks.

Community

Paper author Paper submitter
edited about 8 hours ago

Title: CADEvolve: Creating Realistic CAD via Program Evolution

Paper: https://arxiv.org/abs/2602.16317
Code: https://github.com/zhemdi/CADEvolve
Dataset: https://huggingface.co/datasets/kulibinai/cadevolve
Models: https://huggingface.co/kulibinai/cadevolve-rl1

TL;DR: We generate realistic and complex CAD dataset that spans all the operation by evolving from simplest shapes, then expand to a large-scale corpus for various CAD tasks. We prove the advantages of the dataset by training an Image2CAD model on this dataset

Gallery of the dataset:
image

Highlights:

  • Evolutionary pipeline.
  • Dataset of ~1.3M executable CAD programs
  • SOTA Image2CAD model.

Authors: Maksim Elistratov, Marina Barannikov, Gregory Ivanov, Valentin Khrulkov, Anton Konushin, Andrey Kuznetsov, Dmitrii Zhemchuzhnikov

Authors, thank you for your work.
I'm building something in the space. Would be interesting to see the same dataset rebuilt for build123d!

Sign up or log in to comment

Models citing this paper 1

Datasets citing this paper 1

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2602.16317 in a Space README.md to link it from this page.

Collections including this paper 2