A1 Refereed original research article in a scientific journal

Bridging Convolutional Neural Networks and Transformers for Efficient Crack Detection in Concrete Building Structures




AuthorsYadav, Dhirendra P.; Sharma, Bhisham; Chauhan, Shivank; Dhaou, Imed B.

PublisherMPDI

Publication year2024

JournalSensors

Journal name in sourceSensors

Article number4257

Volume24

Issue13

eISSN1424-8220

DOIhttps://doi.org/10.3390/s24134257

Web address https://www.mdpi.com/1424-8220/24/13/4257

Self-archived copy’s web addresshttps://research.utu.fi/converis/portal/detail/Publication/457412969


Abstract
Detecting cracks in building structures is an essential practice that ensures safety, promotes longevity, and maintains the economic value of the built environment. In the past, machine learning (ML) and deep learning (DL) techniques have been used to enhance classification accuracy. However, the conventional CNN (convolutional neural network) methods incur high computational costs owing to their extensive number of trainable parameters and tend to extract only high-dimensional shallow features that may not comprehensively represent crack characteristics. We proposed a novel convolution and composite attention transformer network (CCTNet) model to address these issues. CCTNet enhances crack identification by processing more input pixels and combining convolution channel attention with window-based self-attention mechanisms. This dual approach aims to leverage the localized feature extraction capabilities of CNNs with the global contextual understanding afforded by self-attention mechanisms. Additionally, we applied an improved cross-attention module within CCTNet to increase the interaction and integration of features across adjacent windows. The performance of CCTNet on the Historical Building Crack2019, SDTNET2018, and proposed DS3 has a precision of 98.60%, 98.93%, and 99.33%, respectively. Furthermore, the training validation loss of the proposed model is close to zero. In addition, the AUC (area under the curve) is 0.99 and 0.98 for the Historical Building Crack2019 and SDTNET2018, respectively. CCTNet not only outperforms existing methodologies but also sets a new standard for the accurate, efficient, and reliable detection of cracks in building structures.

Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.




Funding information in the publication
This research received no external funding.


Last updated on 2025-27-01 at 19:46