We propose CAD-SIGNet, an end-to-end trainable and auto-regressive architecture to recover the design history of a CAD model represented as a sequence of sketch-and-extrusion from an input point cloud. Our model learns visual-language representations by layer-wise cross-attention between point cloud and CAD language embedding.
An end-to-end trainable auto-regressive network that infers CAD language given an input point cloud.
ArchitectureMulti-modal transformer blocks with a mechanism of layer-wise cross-attention between point cloud and CAD language embedding.
TransformerAn SGA module which guides the layer-wise cross-attention mechanism to attend on relevant regions of the point cloud for predicting sketch parameters.
SGA Module
Figure: Method Overview. CAD-SIGNet (left) is composed of \(\mathbf{B}\) Multi-Modal Transformer blocks, each consisting of an \(\operatorname{LFA}\) module to extract point features \(\mathbf{F}_{b}^v\), and an \(\operatorname{MSA}\) module for token features \(\mathbf{F}_{b}^c\). An SGA module (top right) combines \(\mathbf{F}_{b}^v\) and \(\mathbf{F}_{b}^c\) for CAD visual-language learning. A sketch instance (bottom right), \(\mathbf{I}\), obtained from the predicted extrusion tokens is used to apply a mask \(\mathbf{M}_{\text{sga}}\) during the cross-attention in SGA module to predict sketch tokens.
We evaluated CAD-SIGNet on two reverse engineering scenarios:
For scenario (1), DeepCAD is used as baseline. For scenario (2), SkexGen and HNC have been used.
The present project is supported by the National Research Fund, Luxembourg under the
BRIDGES2021/IS/16849599/FREE-3D,
IF/17052459/CASCADES and Artec3D.
If you find this work useful, please consider citing:
@InProceedings{Khan_2024_CVPR,
author = {Khan, Mohammad Sadil and Dupont, Elona and Ali, Sk Aziz and Cherenkova, Kseniya and Kacem, Anis and Aouada, Djamila},
title = {CAD-SIGNet: CAD Language Inference from Point Clouds using Layer-wise Sketch Instance Guided Attention},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2024},
pages = {4713-4722}
}