Identification and development of pathogen- and pest-specific defense–resistance-associated ssr marker candidates assisted by machine learning and discovery of putative QTL hotspots in camellia sinensis
Dosyalar
Tarih
Yazarlar
Dergi Başlığı
Dergi ISSN
Cilt Başlığı
Yayıncı
Erişim Hakkı
Özet
In this study, a targeted SSR (Simple Sequence Repeat) marker resource was developed based on genes and protein families associated with pathogen- and pest-related defense–resistance mechanisms in Camellia sinensis. Forty-one genes and protein families reported to show upregulation, increased expression, or functional validation under disease and pest stress were selected, and the corresponding 195 loci were mapped onto the Camellia sinensis cv. Shuchazao genome. SSR screening within gene bodies and gene-flanking regions (±5 kb) identified 5197 SSR loci. Putative QTL hotspot regions were defined using locus-based sliding-window analysis, Z-score calculations, and permutation tests, yielding 633 SSRs filtered at the 99% and 95% significance thresholds. Proteome-wide scans based on conserved amino acid motifs identified multiple loci within the WRKY, NAC, LRR, PRX, and CHI families, and Random Forest analysis was used to prioritize SSRs within these families. Finally, 386 SSR primer sets were designed and evaluated by in silico PCR across six tea genomes. Of these, 245 primers produced amplicons in more than one genome, and 124 exhibited polymorphic information content values greater than 0.500. Overall, the developed SSR panels represent a biologically contextualized and experimentally transferable marker resource targeting defense–resistance-associated genic and gene-proximal regions.











