Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
Filter by Categories
Book Review
Brief Report
Case Letter
Case Report
Case Series
Commentary
Current Issue
Editorial
Erratum
Guest Editorial
Images
Images in Neurology
Images in Neuroscience
Images in Neurosciences
Letter to Editor
Letter to the Editor
Letters to Editor
Letters to the Editor
Media and News
None
Notice of Retraction
Obituary
Original Article
Point of View
Position Paper
Review Article
Short Communication
Short Communications
Systematic Review
Systematic Review Article
Technical Note
Techniques in Neurosurgery
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
Filter by Categories
Book Review
Brief Report
Case Letter
Case Report
Case Series
Commentary
Current Issue
Editorial
Erratum
Guest Editorial
Images
Images in Neurology
Images in Neuroscience
Images in Neurosciences
Letter to Editor
Letter to the Editor
Letters to Editor
Letters to the Editor
Media and News
None
Notice of Retraction
Obituary
Original Article
Point of View
Position Paper
Review Article
Short Communication
Short Communications
Systematic Review
Systematic Review Article
Technical Note
Techniques in Neurosurgery
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
Filter by Categories
Book Review
Brief Report
Case Letter
Case Report
Case Series
Commentary
Current Issue
Editorial
Erratum
Guest Editorial
Images
Images in Neurology
Images in Neuroscience
Images in Neurosciences
Letter to Editor
Letter to the Editor
Letters to Editor
Letters to the Editor
Media and News
None
Notice of Retraction
Obituary
Original Article
Point of View
Position Paper
Review Article
Short Communication
Short Communications
Systematic Review
Systematic Review Article
Technical Note
Techniques in Neurosurgery
View/Download PDF

Translate this page into:

Original Article
ARTICLE IN PRESS
doi:
10.25259/JNRP_103_2025

Unraveling the association between COL4A2 polymorphisms and intracerebral hemorrhage

Department of Neurology, Psychiatry and Rehabilitation, Karaganda Medical University, Karaganda, Kazakhstan
Department of Biomedicine, Research Laboratory, Karaganda Medical University, Karaganda, Kazakhstan.

*Corresponding author: Shynar Muratbekova, Department of Neurology, Psychiatry and Rehabilitation, Karaganda Medical University, Karaganda, Kazakhstan. muratbekovas@qmu.kz

Licence
This is an open-access article distributed under the terms of the Creative Commons Attribution-Non Commercial-Share Alike 4.0 License, which allows others to remix, transform, and build upon the work non-commercially, as long as the author is credited and the new creations are licensed under the identical terms.

How to cite this article: Muratbekova S, Grigolashvili M, Kadyrova I, Beisembayeva M, Belyayev R, Shayakhmetova Y, et al. Unraveling the association between COL4A2 polymorphisms and intracerebral hemorrhage. J Neurosci Rural Pract. doi: 10.25259/JNRP_103_2025

Abstract

Objectives:

Intracerebral hemorrhage (ICH) is a common disease worldwide, known for its high mortality and disability rates. Due to significant population and ethnic differences, the aim of this study was to identify the risk factors for the development of ICH in the Kazakh population.

Materials and Methods:

The case–control study included 162 patients with verified ICH and 165 people in the control group. To assess the risk of developing ICH, medical and social risk factors, as well as genetic risk factors (polymorphisms in the COL4A2 gene: rs9521732, rs9521733, rs9515199), were studied.

Results:

The study revealed a statistically significant effect of the following medical, social, and genetic factors on the risk of ICH in the main ethnic population of Kazakhstan: diabetes mellitus, arterial hypertension, moderate-to-heavy alcohol consumption, and the rs9515199 polymorphism. Specifically, under the co-dominant model, the C/T and C/C genotypes were significant; under the dominant model, the C/T-C/C genotype combination was significant; and under the recessive model, the C/C homozygote was significant.

Conclusion:

The findings indicate that the rs9515199 polymorphism in the COL4A2 gene significantly decreases the risk of ICH in the Kazakh population, which contrasts with findings in other populations. Further research is needed to confirm these results and explore their practical applications.

Keywords

COL4A2 gene
Intracerebral hemorrhage
Kazakh population
Risk factors
Single-nucleotide polymorphism

INTRODUCTION

Spontaneous intracerebral hemorrhage (ICH) is an emergency pathology associated with intracranial vessel rupture. ICH is the second most prevalent type of stroke, with over 3 million new cases registered worldwide in 2019.[1,2] Despite the extensive history of research on this condition, to date, there has been no conclusive evidence to support the efficacy of surgical or medical treatments.

Due to the high levels of mortality and disability, it is a priority to prevent ICH. The established risk factors for ICH are arterial hypertension, obesity, diabetes mellitus, smoking, and alcohol abuse. There are also gender, racial, and ethnic characteristics.[1-4] Due to the significant differences in the risk of occurrence and progression of ICH among individuals from European, Asian, and African populations, conducting genetic research is considered an upcoming trend. The greatest importance is given to specific single-nucleotide polymorphisms of the genes associated with the Renin-Angiotensin system, the integrity of cerebral vessels’ walls, lipid homeostasis, inflammatory processes in the endothelium, and vascular-platelet hemostasis.[5-8]

COL4A1 (the protein accession ID P02462) and COL4A2 (the protein accession ID P08572) are inextricably bound. They encode the synthesis of Collagen Type IV Alpha 1 and Collagen Type IV Alpha 2. Forming heterotrimers, they are the main structural components of basement membranes. COL4A1 (NM_001845) and COL4A2 (NM_001846) have 52 and 48 exons, respectively, and are located on opposite strands on human chromosome 13, specifically at position 13q34. These two genes are separated by a 127-nucleotide region that contains a common bidirectional promoter. This promoter requires additional regulatory elements in order to control the tissue specificity, level, and ratio of gene expression.[9]

Mutations in the COL4A1 and COL4A2 genes cause the development of highly penetrant multisystem anomalies, as well as damage to the central nervous system, vision, heart, kidneys, and musculoskeletal systems in both young and older individuals. The most pronounced brain damage associated with these mutations is seen in conditions such as Type I porencephaly, pre- and perinatal hemorrhages, and sporadic and recurrent ICHs in both young and aged patients. Subclinical manifestations of the mutation are also possible, where diffuse or periventricular leukoencephalopathy and calcification, intracranial aneurysms, and cerebral microbleeds are detected in the absence of an obvious neurological deficit. This suggests a potentially hidden and underestimated role in a wide range of cerebrovascular diseases.[9-15]

Mutations in the COL4A2 gene can lead to disruption of the structure of Collagen Type IV, causing vessel fragility and susceptibility to rupture. Mutations in this gene are directly associated with an increased risk of ICH and other cerebrovascular diseases.[9-15]

The investigation of polymorphisms in different ethnic groups is of critical importance due to population-specific patterns in the frequency and influence of COL4A2 alleles in both Asian and European populations.[9-19] This implies that genetic risk factors may differ between ethnic groups, justifying the need for population-specific studies.[20]

The aim of this study was to identify risk factors for the development of ICH in the Kazakh population.

MATERIALS AND METHODS

The main group included 162 patients diagnosed with ICH, who underwent inpatient treatment in the medical centers of the Karaganda region, Kazakhstan. Their diagnosis was verified using neuroimaging data. The control group consisted of 165 people, matched to the main group by sex and age, with no history of strokes and brain tumors, autoimmune diseases, and congenital malformations. All participants were ethnic Kazakhs, with at least two generations of their direct ancestors (parents and grandparents) being of Kazakh ethnicity. The recruitment period for this study was from October 01, 2020, to October 01, 2021. Informed consent to participate in the research was obtained from each patient. The DNA isolation process was conducted using the GeneJET Genomic DNA Purification Kit (Thermo Scientific), following the manufacturer’s protocol to ensure high-quality genomic DNA suitable for downstream applications. The isolated DNA was quantified using a spectrophotometer to assess its purity and concentration, with A260/A280 ratios maintained within the range of 1.8–2.0.

Genotyping was performed using real-time polymerase chain reaction (PCR) on the QuantStudio 5 System, which ensures precise and reliable detection of single-nucleotide polymorphisms (SNPs). The analysis employed TaqMan reagents, including the SNP Genotyping Assay and TaqMan Genotyping Master Mix (Applied Biosystems), which were selected for their high sensitivity and specificity. The SNP Genotyping Assay was customized to target the polymorphisms of interest (rs9521732, rs9521733, and rs9515199 in the COL4A2 gene).

The PCR protocol included an initial denaturation step, followed by 40 amplification cycles consisting of denaturation, annealing, and extension. After PCR, the genotyping data were analyzed using QuantStudio Design and Analysis Software, ensuring robust allele discrimination and compliance with Hardy–Weinberg equilibrium. These methodological steps were optimized to minimize variability and ensure accurate identification of genotypes.

Statistical analysis

All data were analyzed using the Statistical Package for the Social Sciences (SPSS) software for Windows (SPSS Inc., Chicago, Illinois, USA) and SNPstats (https://www.snpstats.net/). The analysis of the differences between the main and control groups, as well as the check for correspondence of the genotype frequency distributions to the Hardy–Weinberg equilibrium in the control group, were carried out using the c2 test. To assess the association of risk factors with the occurrence of ICH, an odds ratio with a 95% confidence interval was used. The differences were considered statistically significant at a value of p < 0.05. Missing data for medical and social risk factors were minimal (<0.5% for any variable) and were considered to be entirely due to chance. Therefore, these missing values were accounted for using complete case analysis. Missing genotypes in the genetic data were due to insufficient DNA samples and technical issues.

The sample size was calculated based on statistical analysis using the following parameters: a significance level (a) of 0.05, a statistical power (1-b) of 0.80, and a case-to-control ratio of 1:1. The calculations utilized minor allele frequency data for East Asian population populations available from the Allele frequency aggregator (ALFA) database. To detect a moderate association with an odds ratio of 1.5, the total sample sizes required were 300 (rs9515199), 210 (rs9521732), and 220 (rs9521733) individuals.

RESULTS

Patient characteristics

The study involved 327 participants, whose general characteristics are presented in Table 1. There were no significant gender or age differences between the groups. The average age was 56.6 ± 11.35 years.

Table 1: Age and gender characteristics of the participants.
Characteristics Cases, n=162 Controls, n=165 p-value
Male Female Male Female
Gender, n(%) 87 (53.7) 75 (46.3) 89 (53.94) 76 (46.06) 0.9659
Age, years (Mean±SD) 55.91±10.43 57.95±11.36 54.40±11.07 58.62±12.31 0.4688

SD: Standard deviation, p< 0.05 was considered statistically significant

In 79% (128 patients) of the case group, a deep ICH was detected. The lobar location accounted for 11% of cases (18 patients), and the infratentorial location occurred in 10% of patients. In 88.9% of cases, the middle cerebral artery territory was affected (144 patients), while in 11.1%, the vertebrobasilar territory was affected (18 patients).

Influence of the controlled risk factors

The influence of modifiable risk factors is presented in Table 2 and Supplementary Files 1 and 2. The statistically significant effect was found for diabetes mellitus (odds ratio [OR] = 3.11, 95% confidence interval [CI] = 1.19–8.09, p =0.015), arterial hypertension (p < 0.001), and moderate-to-heavy alcohol consumption (OR = 4.43, 95% CI = 2.12–9.27, p < 0.001).

Table 2: Influence of the controlled risk factors on the development of intracerebral hemorrhage.
Indicators Cases (%) n=162 Controls (%) n=165 OR Upper 95% CI Low 95% CI p-value
Obesity 25 (15.43) 35 (21.21) 0.69 0.39 1.19 0.1770
Diabetes mellitus 17 (10.49) 6 (3.64) 3.11 1.19 8.09 0.0153
Arterial hypertension 162 (100) 41 (24.85) - - - <0.0001
Smoking 60 (37.04) 46 (27.88) 1.52 0.96 2.43 0.0769
Moderate-to-heavy alcohol consumption 36 (22.22) 10 (6.06) 4.43 2.12 9.27 <0.0001

OR: Odds ratio, CI: Confidence interval, p< 0.05 was considered statistically significant

Supplementary File 1

Supplementary File 2

Genotyping results

For genotyping the COL4A2 polymorphisms, 327 individuals were tested. Complete results were obtained for 300 individuals for SNP rs9515199 (Hardy–Weinberg equilibrium: (HWE) in the control group, p = 0.054), 306 individuals for SNP rs9521733 (p = 0.069), and 286 individuals for SNP rs9521732 (p = 0.079).

After genotyping, we analyzed the association between the risk of developing ICH and polymorphisms in the COL4A2 gene, under different inheritance models. The analysis was carried out without adjustment for medical and social risk factors. For SNP rs9521733 and SNP rs9521732, p-value was above 0.05, whereas for SNP rs9515199, a significant association was found (p < 0.05). Therefore, only data for rs9515199 were included in the subsequent analysis.

The following associations were found for the rs9515199 polymorphism: Under the co-dominant model for the C/T genotype (OR = 0.60, 95% CI = 0.36–0.99, p-level=1e-04) and the C/C genotype (OR = 0.22, 95% CI = 0.11–0.46, p = 1e-04); under the dominant model for the C/T-C/C genotype combination (OR = 0.45, 95% CI = 0.28–0.73, p = 9e-04); and under the recessive model for the C/C homozygote (OR = 0.29, 95% CI = 0.15–0.56, p = 1e-04). The genotyping results are presented in Table 3.

Table 3: Distribution of genotypes and alleles of polymorphisms of the COL4A2 gene.
Genotype Control (%) Cases (%) OR (95% CI) p-level OR (95% CI) p-level
rs9515199
  Codominant
    T/T 48 (31.2) 73 (50) 1.00 1e-04 1.00 4e-04
    C/T 65 (42.2) 59 (40.4) 0.60 (0.36–0.99) 0.59 (0.35–1.00)
    C/C 41 (26.6) 14 (9.6) 0.22 (0.11–0.46) 0.25 (0.12–0.52)
  Dominant
    T/T 48 (31.2) 73 (50) 1.00 9e-04 1.00 0.0017
    C/T-C/C 106 (68.8) 73 (50) 0.45 (0.28–0.73) 0.46 (0.28–0.75)
  Recessive
    T/T-C/T 113 (73.4) 132 (90.4) 1.00 1e-04 1.00 6e-04
    C/C 41 (26.6) 14 (9.6) 0.29 (0.15–0.56) 0.33 (0.17–0.64)
  Over-dominant
    T/T-C/C 89 (57.8) 87 (59.6) 1.00 0.75 1.00 0.64
    C/T 65 (42.2) 59 (40.4) 0.93 (0.59–1.47) 0.89 (0.55–1.45)
  Log-additive 0.50 (0.36–0.69) <0.0001 0.52 (0.37–0.73) 1e-04

OR: Odds ratio, CI: Confidence interval, p< 0.05 was considered statistically significant

For the subgroup with deep ICH (128 patients, 79%), a statistical analysis was performed to identify associations with the polymorphisms using an uncorrected odds ratio. These findings should be considered preliminary and require further validation. Genotyping of the COL4A2 polymorphisms was performed on 293 individuals. Complete results were obtained for 270 individuals for SNP rs9515199 (HWE p = 0.078), 272 for SNP rs9521733 (HWE p = 0.046), and 253 for SNP rs9521732 (HWE p = 0.057). In the analysis of association with the deep form of ICH, a significant result (p < 0.05) was found for SNP rs9515199, but not for SNP rs9521732 (p > 0.05). A significant association was identified for the rs9515199 polymorphism across several genetic models: Under the co-dominant model for the C/T heterozygote (OR = 0.55, 95% CI = 0.32–0.93) and the C/C homozygote (OR = 0.22, 95% CI = 0.10–0.47; p = 2e-04); under the dominant model for the C/T-C/C genotype combination (OR = 0.42, 95% CI = 0.26–0.70; p = 6e-04); and under the recessive model for the C/C homozygote (OR = 0.30, 95% CI = 0.15–0.61; p = 4e-04).

DISCUSSION

Our study identified a significant association between the rs9515199 polymorphism in the COL4A2 gene and a reduced risk of ICH within the Kazakh population, suggesting a protective effect of the C allele. This finding stands in direct contrast to prior meta-analyses conducted exclusively in European cohorts, which reported the same allele to be a risk factor for deep ICH (rs9515199: OR = 1.28, 95% CI = 1.14–1.44;[11] OR = 1.24, 95% CI = 1.11–1.4[16]). This discrepancy highlights the critical influence of ethnic background on genetic risk assessments for ICH.

The observed protective effect in our cohort is consistent with the higher frequency of the protective C allele in Asian populations (e.g., 0.47 in East Asians) compared to European populations (0.4), as recorded in the ALFA database. The Kazakh population, with its unique admixed ancestry combining East and West Eurasian heritage,[17] provides a valuable model for exploring these ethnic disparities. Our data show that the allele frequencies of rs9515199 and rs9521732 align with East Asian patterns, while rs9521733 aligns with European population, further support this admixed status, and suggest that population-specific genetic architecture can profoundly modify the effect of a variant.

The complex role of COL4A2 across different ethnicities is further illustrated by conflicting data from other East Asian studies. A large exome-wide study in Japan found no association between COL4A2 mutations and ICH,[18] while research in the Chinese Han population implicated different polymorphisms within the same gene (rs1049931 and rs1049906) in increasing risk.[19] This suggests that the genetic etiology of ICH is not only population-specific but may also involve different causal variants within the same gene or interactions with differing genetic backgrounds and environmental factors.

Our study has several limitations. The sample size, while sufficient to detect an association, is relatively modest compared to large international consortia. This limits the power for more sophisticated subgroup analyses (e.g., by ICH location) or for investigating rare variants. The findings require validation in an independent, larger Kazakh cohort to confirm the effect size and direction.

Future research directions are promising. The planned multicenter ERICH-GENE study,[20] which will perform whole-genome sequencing on over 10,000 patients from diverse ethnicities, is poised to definitively address these questions of ethnic heterogeneity and discover novel variants. In conclusion, our investigation identifies rs9515199 as a potential protective genetic marker for ICH in the Kazakh population, underscoring the necessity of including diverse ethnic groups in genetic research. While this marker is not yet ready for clinical application, it represents a step toward more precise risk stratification and underscores the need for developing personalized medicine approaches that account for genetic ancestry.

CONCLUSION

Our study identified several risk factors for ICH in the Kazakh population. A history of arterial hypertension, diabetes mellitus (OR = 3.11; 95% CI: 1.19–8.09), and moderate-to-heavy alcohol consumption (OR = 4.43; 95% CI: 2.12–9.27) were significant medical and social predictors. Genetic analysis revealed a strong association between the rs9515199 polymorphism in the COL4A2 gene and a reduced risk of ICH. A protective effect was observed under the co-dominant (CT genotype: OR = 0.60; 95% CI: 0.36–0.99; p = 1e-04; CC genotype: OR = 0.22; 95% CI: 0.11–0.46; p = 1e-04), dominant (CT/CC vs. TT: OR = 0.45; 95% CI: 0.28–0.73; p = 9e-04), and recessive (CC vs. CT/TT: OR = 0.29; 95% CI: 0.15–0.56; p = 1e-04) models. The identified protective effect of the C allele contrasts with previously reported associations in European populations, where the same allele was linked to an increased risk. This discrepancy underscores the critical role of ethnic genetic background in modifying disease risk and highlights the importance of conducting genetic association studies in diverse populations. Further validation in larger, independent cohorts is essential to confirm these associations. Ultimately, integrating genetic markers like rs9515199 with established clinical risk factors could improve risk stratification and pave the way for personalized prevention strategies for ICH.

Ethical approval:

The research/study was approved by the Institutional Review Board at Karaganda Medical University, number 57, dated June 19, 2020.

Declaration of patient consent:

The authors certify that they have obtained all appropriate patient consent.

Conflicts of interest:

There are no conflicts of interest.

Use of artificial intelligence (AI)-assisted technology for manuscript preparation:

The authors confirm that there was no use of artificial intelligence (AI)-assisted technology for assisting in the writing or editing of the manuscript and no images were manipulated using AI.

Financial support and sponsorship: Ministry of Education and Science of the Republic Kazakhstan (Grant No. AP AP08957527)

References

  1. , , . Epidemiology, risk factors, and clinical features of intracerebral hemorrhage: An update. J Stroke. 2017;19:3-10.
    [CrossRef] [PubMed] [Google Scholar]
  2. . Global, regional, and national burden of stroke and its risk factors, 1990-2019: A systematic analysis for the global burden of disease study 2019. Lancet Neurol. 2021;20:795-820.
    [CrossRef] [PubMed] [Google Scholar]
  3. , , , , , , et al. The story of intracerebral hemorrhage: From recalcitrant to treatable disease. Stroke. 2021;52:1905-14.
    [CrossRef] [PubMed] [Google Scholar]
  4. , , , , , . Location-specific risk factors for intracerebral hemorrhage: Systematic review and meta-analysis. Neurology. 2020;95:e1807-18.
    [CrossRef] [PubMed] [Google Scholar]
  5. , , , . Genetic risk factors for spontaneous intracerebral haemorrhage. Nat Rev Neurol. 2016;12:40-9.
    [CrossRef] [PubMed] [Google Scholar]
  6. , . Genetics of spontaneous intracerebral hemorrhage. Stroke. 2017;48:3420-4.
    [CrossRef] [PubMed] [Google Scholar]
  7. , , . Genetic polymorphisms associated with spontaneous intracerebral hemorrhage. Int J Mol Sci. 2018;19:3879.
    [CrossRef] [PubMed] [Google Scholar]
  8. , , , , , , et al. Genetic risk of Spontaneous intracerebral hemorrhage: Systematic review and future directions. J Neurol Sci. 2019;407:116526.
    [CrossRef] [PubMed] [Google Scholar]
  9. , , . COL4A1 and COL4A2 mutations and disease: Insights into pathogenic mechanisms and potential therapeutic targets. Hum Mol Genet. 2012;21:R97-110.
    [CrossRef] [PubMed] [Google Scholar]
  10. , . Main features of COL4A1-COL4A2 related cerebral microangiopathies. Cereb Circ Cogn Behav. 2022;3:100140.
    [CrossRef] [PubMed] [Google Scholar]
  11. , , , , , , et al. Common variation in COL4A1/COL4A2 is associated with sporadic cerebral small vessel disease. Neurology. 2015;84:918-26.
    [CrossRef] [PubMed] [Google Scholar]
  12. , , , , , , et al. COL4A2 is associated with lacunar ischemic stroke and deep ICH: Meta-analyses among 21,500 cases and 40,600 controls. Neurology. 2017;89:1829-39.
    [CrossRef] [PubMed] [Google Scholar]
  13. , , , , , , et al. Prevalence of COL4A1 and COL4A2 mutations in severe fetal multifocal hemorrhagic and/or ischemic cerebral lesions. Ultrasound Obstet Gynecol. 2021;57:783-9.
    [CrossRef] [PubMed] [Google Scholar]
  14. , , , , , , et al. COL4A1/COL4A2 and inherited platelet disorder gene variants in fetuses showing intracranial hemorrhage. Prenat Diagn. 2022;42:601-10.
    [CrossRef] [PubMed] [Google Scholar]
  15. , , , , , , et al. The expanding phenotype of COL4A1 and COL4A2 mutations: Clinical data on 13 newly identified families and a review of the literature. Genet Med. 2015;17:843-53.
    [CrossRef] [PubMed] [Google Scholar]
  16. , , , , , , et al. COL4A2 mutations impair COL4A1 and COL4A2 secretion and cause hemorrhagic stroke. Am J Hum Genet. 2012;90:91-101.
    [CrossRef] [PubMed] [Google Scholar]
  17. , , , , , , et al. Ancestral origins and admixture history of Kazakhs. Mol Biol Evol. 2024;41:msae144.
    [CrossRef] [PubMed] [Google Scholar]
  18. , , , , , , et al. Identification of nine genes as novel susceptibility loci for early-onset ischemic stroke, intracerebral hemorrhage, or subarachnoid hemorrhage. Biomed Rep. 2018;9:8-20.
    [CrossRef] [PubMed] [Google Scholar]
  19. , , , , , , et al. Association of the COL4A2 gene polymorphisms with primary intracerebral hemorrhage risk and outcome in Chinese Han population. Mol Neurobiol. 2024;61:8787-96.
    [CrossRef] [PubMed] [Google Scholar]
  20. , , , , , , et al. The ethnic/racial variations of intracerebral hemorrhage genetics (ERICH-GENE) study protocol. [Preprint medRxiv]; 2025
    [CrossRef] [Google Scholar]
Show Sections