Table of Contents

  1. Contribution
  2. Architecture
  3. CAD Visualizer
  4. Visual Results
  5. Video
  6. Acknowledgement
  7. Citation

CAD-SIGNet: CAD Language Inference from Point Clouds using
Layer-wise Sketch Instance Guided Attention

Mohammad Sadil Khan1· Elona Dupont1 · Sk Aziz Ali1,2 · Kseniya Cherenkova1,3
Anis Kacem1 · Djamila Aouada1

1SnT, University of Luxembourg · 2German Research Center for Artificial Intelligence (DFKI, AV Group) · 3Artec3D

CVPR 2024 (Highlight)

Paper
Results

Figure: Full design history recovery from an input point cloud (top-left) and CAD-SIGNet user interaction (bottom-left and right).

Contribution

We propose CAD-SIGNet, an end-to-end trainable, auto-regressive architecture that recovers the design history of a CAD model, represented as a sequence of sketch-and-extrude operations, from an input point cloud. Our model learns visual-language representations through layer-wise cross-attention between point cloud and CAD language embeddings. In particular, our main contributions are:

  1. An end-to-end trainable auto-regressive network that infers CAD language given an input point cloud.
  2. Multi-modal transformer blocks with a mechanism of layer-wise cross-attention between point cloud and CAD language embeddings (a minimal sketch of this mechanism is given after this list).
  3. A Sketch instance Guided Attention (SGA) module which guides the layer-wise cross-attention mechanism to attend to relevant regions of the point cloud when predicting sketch parameters.
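
To make contribution (2) concrete, here is a minimal PyTorch sketch of one multi-modal block in which the CAD token features attend to the point-cloud features at every layer. The names (MultiModalBlock, d_model, sga_mask, ...) are illustrative rather than taken from the released implementation, and the point-feature extraction (LFA) is assumed to happen outside the block.

    import torch
    import torch.nn as nn

    class MultiModalBlock(nn.Module):
        """Illustrative multi-modal block: CAD-token features gather
        information from point-cloud features through cross-attention."""
        def __init__(self, d_model=256, n_heads=8):
            super().__init__()
            self.self_attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
            self.cross_attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
            self.norm1 = nn.LayerNorm(d_model)
            self.norm2 = nn.LayerNorm(d_model)
            self.norm3 = nn.LayerNorm(d_model)
            self.ffn = nn.Sequential(nn.Linear(d_model, 4 * d_model), nn.GELU(),
                                     nn.Linear(4 * d_model, d_model))

        def forward(self, tok, pts, causal_mask=None, sga_mask=None):
            # Masked self-attention over the CAD token sequence (auto-regressive).
            t, _ = self.self_attn(tok, tok, tok, attn_mask=causal_mask)
            tok = self.norm1(tok + t)
            # Cross-attention: CAD tokens query the point-cloud features;
            # an optional sga_mask restricts attention to a sketch instance.
            t, _ = self.cross_attn(tok, pts, pts, attn_mask=sga_mask)
            tok = self.norm2(tok + t)
            return self.norm3(tok + self.ffn(tok))

Stacking B such blocks gives the per-layer exchange between the two modalities described above; the exact attention layout in CAD-SIGNet may differ from this sketch.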

Architecture


Figure: Method Overview. CAD-SIGNet (left) is composed of \(\mathbf B\) Multi-Modal Transformer blocks, each consisting of an \(\operatorname{LFA}\) module to extract point features, \(\mathbf F_{b}^v\), and an \(\operatorname{MSA}\) module for token features, \(\mathbf F_{b}^c\). An SGA module (top right) combines \(\mathbf F_{b}^v\) and \(\mathbf F_{b}^c\) for CAD visual-language learning. A sketch instance (bottom right), \(\mathbf I\), obtained from the predicted extrusion tokens is used to apply a mask, \(\mathbf M_{\text{sga}}\), during the cross-attention in the SGA module (top right) in order to predict sketch tokens.
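
As a rough illustration of the masking step described in the caption, the snippet below builds an attention mask from a sketch instance \(\mathbf I\), assumed here to be a boolean per-point membership vector, and shows how it could be passed to the cross-attention of the block sketched in the Contribution section. All names are hypothetical.

    import torch

    def build_sga_mask(instance, n_tokens):
        """instance: (N,) boolean tensor marking the points of the current
        sketch instance I (e.g. the points swept by the predicted extrusion).
        Returns an (n_tokens, N) float mask in which -inf entries block
        attention to points outside the instance and 0 entries keep it."""
        assert instance.any(), "the instance must contain at least one point"
        mask = torch.zeros(n_tokens, instance.shape[0])
        mask[:, ~instance] = float("-inf")
        return mask

    # Hypothetical usage with MultiModalBlock from the Contribution section:
    # tok : (1, T, d) sketch-token features, pts : (1, N, d) point features
    # instance : (N,) bool derived from the predicted extrusion tokens
    # tok = MultiModalBlock()(tok, pts, sga_mask=build_sga_mask(instance, tok.shape[1]))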

CAD Visualizer

Currently under construction!

Visual Results

We evaluated CAD-SIGNet on two reverse engineering scenarios:

  1. Design History Recovery
  2. Conditional Auto-Completion from User Input

For scenario (1), DeepCAD is used as the baseline. For scenario (2), SkexGen and HNC are used. Click on the buttons below for visual results.

Design History Recovery
Task Description: Given an input point cloud, the task is to infer the CAD design sequence.
Note: All models are trained on the DeepCAD dataset.
Design History for DeepCAD
Design History for CC3D
Design History for Fusion360

Conditional Auto-Completion from User Input
Task Description: This task is considered from a reverse engineering perspective and consists of recovering the ground-truth CAD construction history given a complete point cloud and a partial CAD sequence. All models are trained on DeepCAD and tested on the same dataset.
Design History
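
Both scenarios reduce to the same auto-regressive decoding loop; the hedged sketch below shows greedy decoding under the assumption that the model takes the point cloud and the tokens generated so far and returns next-token logits (an assumed interface, not the released API).

    import torch

    @torch.no_grad()
    def generate_cad_sequence(model, points, prefix, start_token, end_token, max_len=200):
        """Greedy auto-regressive decoding (illustrative only).
        points : (1, N, 3) input point cloud
        prefix : [] for full design-history recovery, or a partial CAD
                 token sequence for conditional auto-completion.
        model(points, tokens) is assumed to return logits of shape
        (1, T, vocab_size)."""
        tokens = [start_token] + list(prefix)
        while len(tokens) < max_len:
            inp = torch.tensor([tokens], dtype=torch.long)  # (1, T)
            logits = model(points, inp)                     # assumed interface
            next_tok = int(logits[0, -1].argmax())
            tokens.append(next_tok)
            if next_tok == end_token:
                break
        return tokens

Scenario (1) corresponds to an empty prefix, while scenario (2) passes the user-provided partial sequence as the prefix.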

Video

Coming Soon!

Acknowledgement

The present project is supported by the National Research Fund, Luxembourg, under the BRIDGES2021/IS/16849599/FREE-3D and IF/17052459/CASCADES projects, and by Artec3D.

Citation

If you find our work useful, please cite:

@misc{khan2024cadsignet,
  title={CAD-SIGNet: CAD Language Inference from Point Clouds using Layer-wise Sketch Instance Guided Attention},
  author={Mohammad Sadil Khan and Elona Dupont and Sk Aziz Ali and Kseniya Cherenkova and Anis Kacem and Djamila Aouada},
  year={2024},
  eprint={2402.17678},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}