Hi-labspermmorpho: a novel expert-labeled dataset with extensive abnormality classes for deep learning-based sperm morphology analysis
Göster/ Aç
Erişim
info:eu-repo/semantics/closedAccessTarih
2024Yazar
Aktaş, AbdulsametSerbes, Görkem
Hüner Yiğit, Merve
Aydın, Nizamettin
Uzun, Hakkı
İlhan, Hamza Osman
Üst veri
Tüm öğe kaydını gösterKünye
Aktas, A., Serbes, G., Yigit, M. H., Aydin, N., Uzun, H., & Ilhan, H. O. (2024). Hi-LabSpermMorpho: A Novel Expert-Labeled Dataset with Extensive Abnormality Classes for Deep Learning-Based Sperm Morphology Analysis. IEEE Access, 1. https://doi.org/10.1109/access.2024.3521643Özet
Sperm morphology is crucial in semen analysis for diagnosing male infertility. To reduce limitations in visual assessment, such as variability in biological conditions and the biologist's experience, developing computer-based sperm analysis techniques is imperative. In this study, a total of 49345 RGB sperm morphology patches were obtained using the proposed image acquisition technique and three different Diff-Quick staining methods: BesLab, Histoplus, and GBL. The images were labeled by experts under 18 classes, including sperm head, neck, and tail abnormality types, along with a normal class. The head category includes amorphous, tapered, double, pyriform, pin, vacuolated, narrow acrosome, and round. The neck category encompasses thin, thick, twisted, and asymmetrical. The tail category includes double, curly, long, short, and twisted. The Efficient-V2-Medium achieved accuracy rates of 65.05% and 67.42% on the BesLab and Histoplus datasets, respectively, while the GBL dataset yielded an accuracy of 63.58% using the Efficient-V2-Small. This study experimentally demonstrates that the Histoplus staining method is more suitable for deep learning-based automated analysis systems. As a reference for future studies, 35 different deep learning architectures were trained on the proposed dataset, establishing a classification baseline. The results show that the dataset can be successfully applied to complex deep learning models. Additionally, it addresses the absence of a large-scale sperm morphology analysis public datasets and can serve as a standard benchmark for future studies.