Mirnas signature as potential biomarkers for cervical precancerous lesions in human papillomavirus positive women

Mirnas signature as potential biomarkers for cervical precancerous lesions in human papillomavirus positive women

Play all audios:

Loading...

ABSTRACT Biomarkers to identify women at risk of cervical cancer among those with high-risk HPV infection (hrHPV+) are needed. Deregulated expression of microRNAs (miRNAs) contributes to


hrHPV-induced cervical carcinogenesis. We aimed at identifying miRNAs with the capacity to distinguish high (CIN2+) and low (≤ CIN1) grade cervical lesions. We sequenced miRNA libraries from


Formalin-Fixed Paraffin-Embedded (FFPE) tissues from women with CIN2+ (n = 10) and age-matched women with ≤ CIN1 (n = 10), randomly and retrospectively selected from a trial that followed


women for 24 months after a hrHPV+ test at the screening visit. Five miRNAs differentially expressed were validated by RT-qPCR in an independent set of FFPE tissues with a reviewed diagnosis


of CIN2+ (n = 105) and ≤ CIN1 (n = 105). The Ingenuity Pathway Analysis (IPA) was conducted to identify mRNAs inversely correlated with the top 25 differentially expressed miRNAs. Inverse


correlations with 401 unique mRNA targets were identified for fourteen of the top 25 differentially expressed miRNAs. Eleven of these miRNAs targeted 26 proteins of pathways deregulated by


HPV E6 and E7 oncoproteins and two of them, miR-143-5p and miR-29a-3p, predicted CIN2+ and CIN3+ in the independent validation by RT-qPCR of FFPE tissues from hrHPV-positive women. SIMILAR


CONTENT BEING VIEWED BY OTHERS ANALYSIS OF PRECANCEROUS LESION-RELATED MICRORNAS FOR EARLY DIAGNOSIS OF CERVICAL CANCER IN THE THAI POPULATION Article Open access 02 January 2025


IDENTIFICATION AND VALIDATION OF A MIRNA-BASED PROGNOSTIC SIGNATURE FOR CERVICAL CANCER THROUGH AN INTEGRATED BIOINFORMATICS APPROACH Article Open access 17 December 2020 TRANSCRIPTOME


ANALYSIS UNCOVERS THE DIAGNOSTIC VALUE OF MIR-192-5P/HNF1A-AS1/VIL1 PANEL IN CERVICAL ADENOCARCINOMA Article Open access 06 October 2020 INTRODUCTION Cervical cancer is the fourth most


frequently diagnosed cancer and the fourth leading cause of cancer death in women, with an estimated 604,000 new cases and 342,000 deaths worldwide in 2020. Around 90% of these cases and


deaths occur in Asia, Africa, Latin America, and the Caribbean regions1. Although cervical cancer mortality rates have declined in most areas of the world for the past few decades, they


remain disproportionately high in underserved populations mostly living in low-middle income countries (LMICs), but also in low resource settings of high-income countries (HICs) as the


United States and Europe, where the cervical cancer death rate is twofold higher among women residing in high-poverty versus low-poverty areas2,3. High-risk Human Papillomavirus (hrHPV), a


common sexually transmitted infection worldwide, is the cause of almost all cervical cancer. Nearly 90% of hrHPV infections are transient and only a small proportion of hrHPV-positive women


develop persistent infection and cervical intraepithelial neoplasia (CIN) grade 2 or 3 (CIN2 and CIN3, respectively) which may if left untreated, progress to cancer4. However, identifying


and treating properly the low proportion of hrHPV+ women at risk of high-grade disease (CIN2, CIN3) or cancer requires resource-consuming screening programs, which have not been possible to


implement in low-resource settings. hrHPV DNA testing is the best alternative for screening because it has more than 95% both sensitivity and negative predictive value for detecting CIN2+ 


and CIN3+5, which, permits longer screening intervals and more cost-effective screening programs6. Even further, the logistics attributes and robustness of the HPV test offer the opportunity


for self-sampling, a modality that improves the reach of women in low resource settings7. However, hrHPV testing has low specificity, increasing the number of women for follow-up and


leading to overtreatment of women without risk of progressive disease8. If miRNAs are proven to have comparable performance to other molecular biomarkers with published research such as


methylation9, HPV genotyping10 and P16/Ki67 dual-staining11, these biomarkers may have the potential to be part of new triage methods for HPV-positive women in the future. miRNAs are highly


stable molecules and easily detected by PCR techniques, methods that could be implemented in first points of care in the near future with facilities such as isothermal amplification12. In


addition, miRNAs as an emerging molecular test could be incorporated in techniques of reflex testing on self-sampling modality to increase the global uptake for triaging hrHPV positive women


after primary cervical cancer screening13. The evidence of the role of miRNAs in the development of persistent hrHPV infection and cervical precancerous lesions is emerging. The deregulated


expression of hrHPV E6 and E7 genes are the major drivers of uncontrolled cell cycle progression and cervical cancer development14. The expression of miRNAs processing proteins, DROSHA and


DICER, is increased by hrHPV E6 and E7 proteins15 and altered in hrHPV-induced cancer16. Furthermore, the modulation of the expression of human miRNAs and the consequent impact on cellular


gene expression by hrHPV E6 and E7 contributes to cervical cancer17,18. In this exploratory discovery study, we used the MiSeq platform to sequence QIAseq® miRNA libraries to identify miRNAs


differentially expressed between hrHPV-positive-high- (CIN2, CIN3 and SCC; combined as CIN2+) and -low-grade (CIN1, Negative; combined as ≤ CIN1) cervical lesions. Validation of


differentially expressed miRNAs was conducted in an independent set of samples by RT-qPCR. Finally, we conducted an analysis to comprehensively capture the potential interaction between


differentially expressed miRNAs with mRNA transcripts of proteins of pathways deregulated by HPV E6 and E7 oncoproteins. Our study contributes to elucidating the utility of miRNAs as


biomarkers to detect cervical cancer precursor lesions. METHODS SELECTION OF STUDY PARTICIPANTS We studied women selected from the participants of a pragmatic trial (ASCUS-COL trial


conducted from 2011 to 2016)19 that compared the performance for detection of CIN2+ of strategies that included colposcopy and/or biopsy, repeat cytology, or hrHPV testing for the optimal


management of ASC-US (Atypical Squamous Cells of Undetermined Significance) cytology. Another group of participants had been recruited through a screening study20, (BIOMARKERS) conducted


from 2013 to 2015, that compared the performance for detection of CIN2+ of hrHPV testing, HPV 16/18 genotyping, p16/Ki67 dual-staining, and conventional cytology. Participants (20–69 years


old) of these studies were residents of the metropolitan area of Medellin, Colombia, recruited in routine screening (n = 2661) with first-time ASC-US cytology during the last 3 months before


recruitment (ASCUS-COL trial) or at colposcopy services (n = 749) with first-time referral to colposcopy because ≥ ASC-US (LSIL, HSIL, ASCUS, ASC-H, AGC) cytology (BIOMARKERS study).


Specimens of all women participating in these studies were HPV tested at recruitment by the Hybrid Capture II (ASCUS-COL) or COBAS test (BIOMARKERS study). All participants were followed for


at least 24 months or until referred for evaluation by a well-trained colposcopist and biopsied based on the results of the tests (threshold hrHPV-positive and/or ≥ ASC-US cytology) using a


standardized and controlled protocol of biopsy sampling. All clinical procedures were conducted under routine care by the local healthcare providers and at the end of the study, accredited


experts confirmed histopathological diagnoses. Women with histological diagnoses of CIN2+ by local or expert health providers received treatment. Samples were retrospectively selected, and


the selection and laboratory analysis were conducted independently and blindly after the end of the studies. This study was conducted according to the Colombian regulation for research in


humans (Resolution 8430 of 1993, Ministry of Health of Colombia), and all methods were performed in accordance with the relevant guidelines and regulations by the institutions. SELECTION OF


THE CLINICAL SPECIMENS The clinical specimens used in this study were 10% buffered formalin-fixed and paraffin-embedded (FFPE) tissues identified after the end of the ASCUS-COL or BIOMARKERS


studies from women who had hrHPV-positive test results at recruitment visit and with a confirmed colposcopy-directed biopsy with either negative, CIN1, CIN2, CIN3 or carcinoma in situ,


adenocarcinoma (ADC) or squamous cell carcinoma (SCC) as the worst diagnosis at any time during the study but before treatment. For the discovery phase, a convenience set of 10


hrHPV-positive FFPE tissue specimens with confirmed histopathology diagnosis of CIN2+ (7 CIN2 and 3 CIN3; median age 32 years; range 22–48) and 10 age-matched, hrHPV-positive FFPE tissues


with confirmed histopathology diagnosis of ≤ CIN1 (9 CIN1 and 1 Negative for lesion; median age 29 years; range 22–58), were randomly selected. An independent set of samples was chosen from


the ASCUS-COL study to validate by quantitative RT-PCR (qPCR) 5 miRNAs, 2 with the lowest coefficient of variation and 3 with the highest fold change. As shown in the flowchart (Fig. 1), 756


of the 1122 hrHPV positive women had a biopsy, of which 185 corresponded to high-grade preneoplastic cervical lesions (CIN2+), which were selected and paired with women with low-grade


lesions (≤ CIN1) by age and time of follow up to diagnosis. Samples from 366 women that did not meet matched criteria or were not randomly selected, 144 women without FFPE tissues in the


pathology labs and 16 that did not meet quality criteria for the RT-qPCR were excluded. The independent set included 105 of the 185 randomly selected FFPE tissues of women with the worst


biopsy-confirmed diagnosis of CIN2+ (71 CIN2, 32 CIN3 and 2 SCC, median age 29 years; range: 20–60), paired by age (± 6 years) with 105 of the 185 FFPE tissues of women with the worst


biopsy-confirmed diagnosis of ≤ CIN1 (26 CIN1, 79 Negative, median age 28 years; range 20–62). HPV genotyping of the samples by CLART HPV4 (Clinical Array Technology, Genomica, Madrid,


Spain) conducted at the Queen's Medical Research Institute of The University of Edinburgh (Edinburgh, United Kingdom) is already described21,22. Supplementary Tables 1 and 2 show the


sociodemographic description of patients of the discovery and validation sets, respectively. RNA EXTRACTION Total RNA was isolated using the miRNeasy FFPE Kit (cat 217504, QIAGEN®) according


to the manufacturer’s instructions. Briefly, the paraffin was removed with Xylene and the samples were incubated at 56 °C for 15 min, then at 80 °C for 15 min in an optimized lysis buffer,


which contains Proteinase K for RNA release from tissue sections. The sample was then treated with DNase I to eliminate genomic DNA. The lysate was then mixed with a guanidine salt-based


buffer to provide appropriate RNA binding conditions and then ethanol was added, and the samples were transferred to a RNeasy MinElute spin column, where the total RNA binds to the membrane,


and contaminants are washed. Finally, the sample containing the total RNA, including miRNAs, was eluted in 20 μL of RNase-free water and stored at -80° C, until its use. The total RNA


isolated was quantified using the Qubit™ RNA Assay kit (Invitrogen™), on the Qubit 3.0 Fluorometer (Thermo). MIRNA LIBRARY PREPARATION AND SEQUENCING Library preparation was performed from


100 ng of total RNA using the QIAseq® miRNA Library Kit according to the manufacturer’s instructions (cat 331502, QIAGEN®). Briefly, a pre-adenylated adapter was ligated to the 3′ end of the


miRNAs while an RNA adapter was ligated to the 5′ end of all mature miRNAs. cDNA synthesis was carried out with a reverse transcription primer that binds at the 3′ adapter and incorporates


a universal molecular index (UMI). The cDNA was cleaned up using magnetic beads and the library was amplified using a universal forward primer and indexing reverse primers. The libraries


were cleaned up with magnetic beads and qualified on an Agilent 2100 Bioanalyzer (Agilent Technologies) using a High Sensitivity DNA 1000 chip, according to the manufacturer’s instructions.


The miRNA Library concentration was determined using the Qubit™ dsDNA HS Assay kit (Invitrogen™), on the Qubit 3.0 Fluorometer (Thermo). The libraries were adjusted to 4 nM, pooled (2


10-sample pools) and sequenced at 1 × 75 bp on the MiSeq platform (Illumina, San Diego, CA) following the instructions of the MiSeq System Denature and Dilute Libraries Guide insert. MIRNA


SEQUENCING DATA PROCESSING FASTQ files generated by the MiSeq instrument were uploaded to GeneGlobe Data Analysis Center of Qiagen® (https://geneglobe.qiagen.com/analyze/) and analyzed under


two steps. In brief, the primary quantification consisted in the trimming of the 3′ adapter and low-quality bases using cut-adapt (cutadapt.readthedocs.io/en/stable/guide.html), discarding


reads with less than 16 bp insert sequences or less than 10 bp sequences (defective reads), aligning the reads to the reference transcriptome tolerating up to two mismatches (miRBase V.21


http://www.mirbase.org/) using Bowtie (bowtie-bio.sourceforge.net/index.shtml). On average 2.8 million reads per sample were obtained in the sequencing and 1.3 million of these (47% of the


reads total) were mapped to miRBase V.21. A total of 2,186 sequences of mature miRNAs were mapped. The secondary analysis consisted of the miRNA differential expression between CIN2+ and ≤ 


CIN1 groups. The counts were normalized using the DESeq2 scaling factor by the Geometric Median strategy23. The differential gene expression analysis was carried out by comparing the miRNAs


expression levels between CIN2+ and ≤ CIN1 groups, using Wald statistic test for multiple comparisons, and the p-values adjusted by the Benjamini and Hochberg method (FDR ≤ 0.05). The


differential expression analysis was confirmed in R (v3.6.3) using the same DESeq2 workflow. R (v3.6.3) was also used for post-analysis, heatmaps and plots. Raw sequencing reads and


normalized reads counts are available from the NCBI Gene Expression Omnibus (GEO) through the series accession number GSE167858. QUANTITATIVE RT-PCR (QPCR) The miRCURY Locked Nucleic Acid


(LNA) RT Kit (cat 339340; QIAGEN®) was used for the miRNA reverse transcription reaction. In brief, total RNA was used as starting material and the procedure involved the miRNA


polyadenylation and reverse transcription in a single reaction step using a Poly-T primer bound to a universal tag. Using one universal RT reaction and the cDNA template for subsequent PCR


amplifications, the reaction was incubated at 40–42 °C for 1 h, followed by an inactivation step through a brief incubation at 95 °C. The reaction volume for the cDNA synthesis was 10 µL,


consisting of 2 µL of total RNA at 5 ng/µL, 2 µL of 5× miRCURY RT Reaction Buffer, 1 µL of 10× miRCURY RT Enzyme mix and 5 µL RNAse-free water. miRCURY LNA SYBER Green PCR Kit (cat 339346;


QIAGEN®) and miRCURY LNA PCR Assays (YP00204788, YP00205992, YP00204570, YP00204698, YP00204765; QIAGEN®) were used for quantitative PCR duplicate reactions per sample. The reference gene


used was the small nucleolar RNA (snoRNA) SNORD44 (YP00203902; QIAGEN®), widely used as a housekeeping gene in miRNAs studies24. Quantitative PCR reactions were performed in 10 μL,


consisting of 3 µL of 1:60 diluted cDNA, 5 µL of 2× miRCURY SYBR® Green Master Mix, 1 µL of PCR Primer and 1 µL of RNase free water. Cycle conditions used for cDNA synthesis and PCR were


done according to the manufacturer’s protocol. The relative expression was determined by the 2−ΔCT method25. The expression of the five miRNAs relative to the internal control gene (SNORD44)


in each of the lesion grades (79 NEG, 26 CIN1, 71 CIN2 and 32 CIN3 plus 2 cancers) or grouped as high- (CIN2+) or low-grade (≤ CIN1) lesions were estimated as _∆Ct_ = _Ct (miR‑of‑interest)


– Ct (SNORD44)_ and the median of the Log2(2−∆CT) of these values plotted in the figures. STATISTICAL ANALYSIS OF MIRNA RT-QPCR DATA The analysis was based on a prespecified analytical plan.


The primary hypothesis was that a sample size of 106 pairs will be needed to estimate, with an 80% power and significance level of 95%, statistical differences between CIN2+ and ≤ CIN1 of a


miRNA relative change with a moderate effect (0.3). Based on the distribution of the Ct values for miR-133a-3p, we excluded 2 samples with extreme outliers (Ct values of 14.26 and 8.84).


The Cuzick trend test was used to investigate changes in each miRNA with increasing lesion severity. We used the U-Mann Whitney statistic test for the comparison of relative expression


medians between CIN2+ and ≤ CIN1 groups. We also developed a miRNA signature, using logistic regression models with CIN2+ or CIN3+ as outcomes and ≤ CIN1 as reference group. Models were run


entering all the negative of normalized expression levels relative to SNORD44 (dCt) of the 5 miRNAs and by backward stepwise selection including or not HPV 16 and/or HPV 18 genotypes as a


single dichotomous variable. The performance of the most optimal model to predict CIN2+ or CIN3+ was that with the highest AUC and corresponding significant 95% confidence intervals (CI).


The value of the 5 miRNAs to triage hrHPV positive women was also evaluated by comparing the performance to detect CIN2+ and CIN3+ with HPV 16/18 genotyping, a marker that has validated


clinical utility for the management of hrHPV positive women. A Receiving Operating Characteristics (ROC) analysis including the normalized Ct values of the 5 miRNAs validated by RT-qPCR was


conducted to identify the cut-off corresponding to a specificity of 70%. Differences in sensitivities and specificities of the 5 miRNAs signature with HPV16/18 genotypes or by combining both


tests were examined by McNemar’s test with continuity correction. Age is one of the most important risk factors for cancer overall26; therefore, if the miRNAs expression is associated with


preneoplastic high-grade lesion and cancer, it can also be associated with older age. We observed that regardless of the histological diagnosis, the relative expression of miR-133a-3p was


different in women ≤ 30 as compared to women > 30 years old (p = 0.009) (Supplementary Fig. 1). Therefore, we only included in the final RT-qPCR validation analysis samples from women


with their respective paired control (+ /− 6 years, n = 210) (see flowchart, Fig. 1). For all tests, a p-value ≤ 0.05 was considered statistically significant. The software R (v3.6.3) was


used to conduct the analysis. CELLULAR MIRNAS INTERACTION WITH PROTEINS OF THE PATHWAYS ALTERED BY HPV E6/E7 ONCOPROTEINS (IN-SILICO APPROACH) To explore the potential for functional


importance of the miRNAs in the context of HPV infection, we searched for experimentally proven inverse correlations between the miRNAs and their corresponding target genes products using


the Ingenuity Pathway Analysis (IPA, QIAGEN Inc., https://digitalinsights.qiagen.com/IPA)27. IPA integrates the data from the databases TargetScan, TarBase, miRecords and Ingenuity Expert


Findings. The names of the top 25 differentially expressed miRNAs between CIN2+ and < CIN1 (p value < 0.05), were uploaded to IPA. This list of mRNAs, was subsequently filtered using


the following criteria: selecting only “Experimentally observed” in the column “miRNA confidence”, “Cancer” in the column “Disease”, “Human” in the column “Species” , and finally filtering


out genes without biological relevance in the column “Pathway”. Thereafter, the information from the KEGG PATHWAY ko05165 for Human Papillomavirus infection


(https://www.genome.jp/pathway/map05165+K04693) was used to predict miRNAs with potential interactions with gene target products of pathways that are altered by HPV E6 and/or E7


oncoproteins28,29,30. The interactions between miRNAs and mRNAs related to HPV infection was built using Cytoscape V3.7.1, (URL: https://cytoscape.org/)31. ETHICAL CONSIDERATIONS All


participants included in the ASCUS-COL and BIOMARKERS studies signed informed consent and authorization to use samples and data for future research. The ethics committees for human


experimentation of Sede de Investigación Universitaria (SIU) (Resolutions 08-036-171 and 18-40-535) and School of Medicine (Resolution 004/2008), of the Universidad de Antioquia approved the


studies. RESULTS MIRNA SIGNATURES ON HIGH- VS. LOW-GRADE CERVICAL LESIONS There were no differentially expressed miRNAs between the samples from the ASCUS-COL and the BIOMARKERS studies


(data not shown). The analysis of the sequencing data (discovery set) showed 162 miRNAs differentially expressed between CIN2+ and ≤ CIN1 lesions (p ≤ 0.05). The top 25 miRNAs differentially


expressed are shown in Table 1, and the complete list of differentially expressed miRNAs is shown in Supplementary Table 3. Unsupervised hierarchical clustering was performed for the 25


most differentially expressed miRNAs between the 10 CIN2+ and 10 ≤ CIN1 lesions, using the z-score transformation on the normalized read counts. The heat map shows that, compared to the ≤ 


CIN1 lesions which tend to group in the center of the heat map, the CIN2+ lesions are distributed in three different clusters, suggesting a higher miRNA expression variability in CIN2+ as


compared to the ≤ CIN1 lesions (Fig. 2). The higher variability in high-grade, as compared to low-grade cervical lesions was observed for all the 162 differentially expressed miRNAs across


the 20 samples (Supplementary Fig. 2). We selected three miRNAs with the highest FC (miR-133a-3p, miR-143-5p, miR-143-3p) and two miRNAs with the lowest %CV (miR- 29a-3p and miR-30b-5p) from


the top 25 of the 162 miRNAs differentially expressed for validation on an independent group of FFPE tissues (n = 210, 105 CIN2+ lesions matched by age and time to diagnosis to 105 ≤ CIN1


lesions). Undetermined Ct values occurred in 21 samples for miR-133a-3p, in 6 samples for miR-143-5p, and in 1 sample for miR-29a-3p, miR-30b-5p, and miR-143-3p. Therefore, their age-paired


samples were also excluded from this analysis (see flowchart, Fig. 1). The summary of RT-qPCR data is shown in Supplementary Table 4. The variability of the five miRNAs used in qPCR


validations was also observed in the RNAseq data. We conducted linear regression and observed a linear relationship (Coefficient of determination R2) between the median (R2 = 0.90) or mean


(R2 = 0.88) of normalized read-counts of RNAseq and normalized Ct values of RT-qPCR (Supplementary Fig. 3). We observed a significant trend for the medians (Cuzick test for medians trend, p 


< 0.05) of the relative expression of miR-143-3p (p = 0.029), miR-143-5p (p = 0.020) and miR-30b-5p (p = 0.033) with increasing severity of the lesions (Fig. 3a). When we compared the


relative expression levels between high-grade lesions (CIN2+) and low-grade lesions (≤ CIN1), we observed a significant increased relative expression of miR-143-5p in the CIN2+ lesions (Fig.


 3b, p = 0.021), the descriptive statistics across the cervical lesions, and between CIN2+ and ≤ CIN1 for each miRNA are shown in (Fig. 3c) and (Fig. 3d), respectively. We did not observe


any additional change of the relative expression levels in the other miRNAs. COMPARISON OF THE PERFORMANCE OF THE 5 MIRNAS SIGNATURE WITH HPV16/18 GENOTYPING TO DETECT CIN2+ OR CIN3+ IN HPV


POSITIVE WOMEN The AUCs of the models for CIN2+ or CIN3+ that included all five miRNAs were 0.645 (95% CI 0.565–0.719, p < 0.001) and 0.649 (95% CI 0.551–0.739, p < 0.011)


respectively. The best performer predictor signature was observed when HPV 16 and/or 18 genotypes were added as a single dichotomous variable to the five miRNAs classification with AUCs of


0.678 (95% CI 0.596–0.751, p < 0.001) for CIN2+ and 0.726 (95% CI 0.628–0.811, p < 0.001) for CIN3+ (Table 2). Supplementary Table 5 shows the number and percentages of  ≤ CIN1


controls and CIN2+ and CIN3+ cases with negative or positive tests results. Supplementary Table 6 shows the comparison of the performance of the five miRNAs signature, HPV 16/18 genotyping


or the combination of both to detect CIN2+ and CIN3+ at a desired specificity of 70.88% (cut-off for the 5 miRNAs signature ≥ 0.54). The specificity of HPV 16/18 genotyping (65.28%, 95%CI


53.14–76.12) was not statistically different to that of the 5 miRNAs signature at a cut-off of ≥ 0.54 for a set specificity of 70.88% (95%CI 59.58–80.57). Likewise, there were not


differences in the performance for the detection of CIN2+ or CIN3+ between the five miRNAs signature and HPV 16/18 genotyping. However, compared to the five miRNAs signature, the specificity


for ≤ CIN1 significantly decreased (p = 0.0001) from 70.88 to 49.73% (95% CI 42.48–56.97) and the sensitivity significantly (p < 0.0001) increased from 51.90% (95% CI 40.36–63.30) to


77.22% (95% CI 66.40–85.90) for detection of CIN2+ and from 51.85% (95% CI 31.95–71.33) to 85.19% (95% CI 66.27–95.81) for detection of CIN3+ (p = 0.0076) after combining the positivity for


any of the two, the five miRNAs signature plus HPV 16/18 genotyping. ROLE OF MIRNAS DIFFERENTIALLY EXPRESSED IN THE CONTEXT OF HPV INFECTION Table supplementary 7 shows miRNAs-target gene


products pairs with validated inverse correlations in the IPA. Fourteen out of the top 25 differentially expressed miRNAs have experimental evidence of inverse correlation with 401 unique


different target mRNAs. Eleven of these 14, have experimental evidence of targeting 26 proteins of pathways altered by HPV E6 and E7 oncoproteins (Fig. 4) and two of these 11 miRNAs


(miR-143-3p and miR-29a-3p) were the best predictors of CIN2+ and CIN3+ when validated in the independent set of samples by RT-qPCR (Fig. 4 and Table 2). DISCUSSION It has been shown that


miRNAs modulate the expression of tumor suppressor genes related to the hrHPV carcinogenic processes such as _TP53_, _CDKN1A_ (p21) and _CDKN1B_ (p27)32,33. The levels of these miRNAs are


modulated by hrHPV E6/E7 expression in-vitro, suggesting that miRNAs mediate hrHPV-induced cervical carcinogenesis17,34. In this work, we sequenced miRNAs from 20 FFPE tissues from hrHPV


positive women with high- or low-grade cervical lesions and identified a panel of 162 miRNAs differentially expressed between those groups using DESeq223. Our screening approach was


validated by RT-qPCR in an independent set of 210 FFPE tissue samples (79 NEG, 26 CIN1, 71 CIN2, 32 CIN3, and 2 SCC). Finally, the in-silico exploration showed that the miRNAs validated in


this study interact with genes coding for proteins of pathways altered by HPV E6/E7 oncoproteins. The Ingenuity Pathway Analysis identified 401 unique different inverse correlated target


mRNAs for 14 out of the top 25 differentially expressed miRNAs (Supplementary Table 7). Eleven of these 14 miRNAs were predicted to interact with 26 proteins of pathways altered by HPV E6


and E7 oncoproteins (Fig. 4). Among these 11, miR-29a-3p and miR-143-3p, which respectively interact with the important tumor suppressor genes PTEN, and KRAS, were the best predictors of


CIN2+ or CIN3+ in the regression models that included the five miRNAs validated by RT-qPCR in the independent set of samples (Table 2). At a desired specificity of 70.88% (cut-off for the 


five miRNAs signature ≥ 0.54 normalized counts) the performance for the detection of CIN2+ or CIN3+ of the 5 miRNAs signature was similar to that of HPV 16/18 genotyping. However, compared


to the five miRNAs signature, a significant decrease in specificity for ≤ CIN1 and a significant (p < 0.0001) increased in the sensitivity for CIN2+ or CIN3+ was observed after combining


the positivity for any of the two, the five miRNAs signature plus HPV 16/18 genotyping. This prompted us to enquire about the proportion of samples positive for the five miRNAs signature


among those positive for HPV16/18 or for the other high-risk genotypes. The proportions among the positive for genotypes other than HPV16/18, were 30%, 56% and 64% in ≤ CIN1, CIN2+ or CIN3+ 


respectively, meanwhile the respective percentages were 32%, 50% and 44% in those HPV16/18 positive. This indicates that miRNAs detected an additional 20% of CIN3+ cases not detected by


HPV16/18 genotyping, an attribute important for the design of algorithms including reflex management of other than hrHPV 16/18 positive women. Deep sequencing analysis of small RNA molecules


is a powerful technique to obtain detailed information of the miRNAs profiles of a biological sample35. To the best of our knowledge, ours is the first study to determine genome-wide miRNA


profiles from hrHPV-positive FFPE cervical tissue with preneoplastic lesions using deep sequencing analysis. One of the strengths of our study is the inclusion of patients with


histologically confirmed FFPE tissues. By inclusion criteria only women 20–69 years old women (birth years 1942–1993) were included in the cohort where this study was nested. Women were


recruited between January 2011 and January 2014. HPV vaccine was introduced for 2003 birth cohorts (9 years old girls only) in Colombia in 2012. Therefore, participants in this study were


not eligible for vaccination. Indirect evidence of low probability of vaccination of this cohort is the fact that HPV infection rates were 40%, which is the expected rate for women with


ASC-US cytology at this age range19. Our study had the representation of the complete spectrum of cervical lesions, which are necessary for a complete understanding of the miRNAs profiling


on each stage. Another strength of our study is that it uses a protocol for small RNA library construction, optimized with Unique Molecular Indexes (UMIs) (a twelve-bases random sequence)


enabling unbiased and accurate miRNome-wide quantification of mature miRNAs by NGS. The UMIs allow the tagging of each mature miRNA molecule, removing any sequencing bias introduced during


library construction due to different amplification efficiencies36. One interesting observation was that the age of the participants influenced the relative expression of miRNAs based on the


aggressiveness of the preneoplastic lesions of the cervix (Supplementary Fig. 1). We found that age could be a confounding variable in the expression of miR-133a-3p and miR-30b-5p. The


relative change of miR-133a-3p was found to be higher in older women (> 30 years old) when compared to younger women ≤ 30 years old (p = 0.009). The relative change of miR-30b-5p was


higher in older (> 30) than in younger women with CIN2 (p = 0.039). These observations guided our decision to pair age-matched participants to compare high- (CIN2+) vs. low-grade lesions


(≤ CIN1) for our validation set. Snoek et al. conducted a discovery analysis in 56 hrHPV-positive cervical self-collected samples (32 ≤ CIN1 and 24 CIN3) by small RNAseq (Illumina, Inc, San


Diego, USA) and logistic regression for identification of differentially expressed miRNAs with validation by RT-qPCR in an independent set of 190 samples (101 ≤ CIN1, 48 CIN3, and 41 SCC)37.


We did not find commonly differentially expressed miRNAs, probably because of differences in the platforms used, the sample type, and analysis strategy. However, when we analyzed Snoek _et.


al._ data available in the Gene Expression Omnibus (GEO number GSE104758) using our strategy, seven of the miRNAs were shared between the two studies (hsa-mir-9-5p, hsa-mir-195-5p,


hsa-mir-196b-5p, hsa-mir-504-5p, hsa-mir-30e-3p, hsa-mir-184, hsa-mir-126-5p). We acknowledge that it would be desirable to confirm in the same set of samples the differential expression of


the five miRNAs included in the RT-qPCR. However, in spite of the high variability and small sample size of the RT-qPCR data, there was significant trend of higher expression with increasing


lesion severity, in three of the five miRNAs, although this difference was significant for miR-143-5p only. A priori sample size of 106 pairs was estimated to achieve 80% power to compare


medians between ≤ CIN1 and CIN2 + lesions (U-Mann–Whitney test), with 1 tail, an effect size of 0.3, at 95% significance level. However, in the post-hoc analysis with an observed effect size


of 0.2 for miR-143-5p, a 54% power is reached with the 99 sample pairs analyzed. Our _in-silico_ approach suggests that the miRNAs differentially expressed potentially regulate the


expression of genes encoding for proteins of pathways altered by hrHPV E6/E7 oncoproteins. This observation strengths the biological relevance of the miRNAs for distinguish high- from


low-grade cervical precancerous lesions. However, it is important to clarify that in-vitro assays are necessary to confirm the role of these miRNAs in hrHPV-induced carcinogenesis. Studies


with larger number of samples independently validated by RT-qPCR and correlation analysis between differentially expressed miRNAs and their target pairs using cervical precancerous and


cancer lesions and comparison with other biomarkers such as HPV genotyping, methylation and p16/Ki67 dual-staining are also needed to further improve the knowledge about their biological


role and clinical utility of these biomarkers to triage hrHPV-positive women. The observation that high variability of miRNA expression is higher in high-grade cervical lesions, suggests


that several combined miRNAs may be more suitable as biomarkers for pre-cancerous cervical lesions. The use of cervical exfoliates, in addition to FFPE samples, can improve the performance


of the RT-qPCR and therefore capture larger differences in the fold change when the effect size is small. CONCLUSIONS In conclusion, a comprehensive analysis of (i) the RNA-Seq data of 10


CIN2+ and 10 ≤ CIN1 precancerous lesions identified 162 differentially expressed miRNAs, (ii) the Ingenuity Pathway Analysis of the top 25 differentially expressed miRNAs identified inverse


correlations with 401 unique coded target genes for 14 miRNAs, of which 11 interact with proteins regulated by HPV E6 and E7 oncoproteins and (iii) independent validation of five of these


miRNAs by RT-qPCR in 79 CIN2+ and 79 ≤ CIN1 precancerous lesions identified two miRNAs predictors of CIN2+ and CIN3+ precancerous cervical lesions. Our results suggested that these two


miRNAs are potential biomarkers for predicting cervical precancerous lesions. However, larger prospective studies are necessary to determine the clinical value of these miRNAs and further


experimental validation is required to demonstrate the biological role of them in HPV-related cervical carcinogenesis. DATA AVAILABILITY The dataset excluding personal identifiers will be


available to proper academic parties on request from the corresponding author in accordance with the data sharing policies of Universidad de Antioquia. Raw sequencing reads and normalized


reads counts are available from the NCBI Gene Expression Omnibus (GEO) through the series accession number GSE167858. REFERENCES * Ferlay, J., Ervik, M., Lam, F., Colombet, M., Mery, L.,


Piñeros, M., Znaor, A., & Soerjomataram, I.B.F. _Global Cancer Observatory: Cancer Today_. https://gco.iarc.fr/today (2020). * Siegel, R. L., Miller, K. D. & Jemal, A. Cancer


statistics, 2019. _CA. Cancer J. Clin._ 69, 7–34 (2019). Article  PubMed  Google Scholar  * Sung, H. _et al._ Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality


worldwide for 36 cancers in 185 countries. _CA. Cancer J. Clin._ https://doi.org/10.3322/caac.21660 (2021). Article  PubMed  Google Scholar  * Schiffman, M., Castle, P., Jeronimo, J.,


Rodriguez, A. C. & Wacholder, S. Human papillomavirus and cervical cancer. _Lancet_ 370, 890–907 (2007). Article  CAS  PubMed  Google Scholar  * Koliopoulos, G. _et al._ Cytology versus


HPV testing for cervical cancer screening in the general population. _Cochrane Database Syst. Rev._ https://doi.org/10.1002/14651858.CD008587.pub2 (2017). Article  PubMed  PubMed Central 


Google Scholar  * Mezei, A. K. _et al._ Cost-effectiveness of cervical cancer screening methods in low- and middle-income countries: A systematic review. _Int. J. Cancer_ 141, 437–446


(2017). Article  CAS  PubMed  Google Scholar  * Kamath Mulki, A. & Withers, M. Human Papilloma Virus self-sampling performance in low- and middle-income countries. _BMC Womens Health_


21, 1–11 (2021). Article  Google Scholar  * Rijkaart, D. C. _et al._ Evaluation of 14 triage strategies for HPV DNA-positive women in population-based cervical screening. _Int. J. Cancer_


130, 602–610 (2012). Article  CAS  PubMed  Google Scholar  * Adcock, R. _et al._ DNA methylation testing with S5 for triage of high-risk HPV positive women. _Int. J. Cancer_ 151(7),


993–1004. https://doi.org/10.1002/ijc.v151.710.1002/ijc.34050 (2022). Article  CAS  PubMed  PubMed Central  Google Scholar  * Del Mistro, A. _et al._ Human papilloma virus genotyping for the


cross-sectional and longitudinal probability of developing cervical intraepithelial neoplasia grade 2 or more. _Int. J. Cancer_ 143(2), 333–342.


https://doi.org/10.1002/ijc.v143.210.1002/ijc.31326 (2018). Article  Google Scholar  * Cuzick, J. _et al._ Combined use of cytology, p16 immunostaining and genotyping for triage of women


positive for high-risk human papillomavirus at primary screening. _Int. J. Cancer_ 147(7), 1864–1873. https://doi.org/10.1002/ijc.v147.710.1002/ijc.32973 (2020). Article  CAS  PubMed  Google


Scholar  * Priy Dave, V. _et al._ MicroRNA amplification and detection technologies: opportunities and challenges for point of care diagnostics. _Lab. Investig._ 99, 452–469 (2019). Article


  Google Scholar  * Lozar, T. _et al._ Cervical cancer screening postpandemic: Self-sampling opportunities to accelerate the elimination of cervical cancer. _Int. J. Womens Health_ 13, 841


(2021). Article  PubMed  PubMed Central  Google Scholar  * Yeo-Teh, N. S. L., Ito, Y. & Jha, S. High-risk human papillomaviral oncogenes E6 and E7 target key cellular pathways to achieve


oncogenesis. _Int. J. Mol. Sci._ 19, 1706 (2018). Article  PubMed  PubMed Central  Google Scholar  * Harden, M. E. & Munger, K. Perturbation of DROSHA and DICER expression by human


papillomavirus 16 oncoproteins. _Virology_ 507, 192–198 (2017). Article  CAS  PubMed  Google Scholar  * Snoek, B. C., Babion, I., Koppers-Lalic, D., Pegtel, D. M. & Steenbergen, R. D.


Altered microRNA processing proteins in HPV-induced cancers. _Curr. Opin. Virol._ 39, 23–32 (2019). Article  CAS  PubMed  Google Scholar  * Harden, M. E., Prasad, N., Griffiths, A. &


Munger, K. Modulation of microRNA-mRNA target pairs by human papillomavirus 16 oncoproteins. _MBio_ 8, 2170 (2017). Article  Google Scholar  * Babion, I. _et al._ Identification of


deregulated pathways, key regulators, and novel miRNA-mRNA interactions in HPV-mediated transformation. _Cancers (Basel)_ 12, 700 (2020). Article  CAS  PubMed  Google Scholar  * Baena, A.


_et al._ Comparison of immediate colposcopy, repeat conventional cytology and high-risk human papillomavirus testing for the clinical management of atypical squamous cells of undetermined


significance cytology in routine health services of Medellin, Colombia. _Int. J. Cancer_ 148, 1394–1407 (2021). Article  CAS  Google Scholar  * Ramírez, A. T., López-Urán, C., Posada, G.,


Buitrago, C. A. & Sanchez, G. I. _P16/Ki-67 Staining of Conventional Cytology for Reflex Testing of HPV Positive Women Referred to Colposcopy in Medellin, Colombia_ (2015). * Ramírez, A.


T. _et al._ Effective methylation triage of HPV positive women with abnormal cytology in a middle-income country. _Int. J. cancer_ 148, 1383–1393 (2021). Article  PubMed  Google Scholar  *


Bonde, J. _et al._ The Valgent4 protocol: Robust analytical and clinical validation of 11 HPV assays with genotyping on cervical samples collected in SurePath medium. _J. Clin. Virol._ 108,


64–71 (2018). Article  CAS  PubMed  Google Scholar  * Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. _Genome Biol._


15, 550 (2014). Article  PubMed  PubMed Central  Google Scholar  * Masè, M. _et al._ Selection of reference genes is critical for miRNA expression analysis in human cardiac tissue. A focus


on atrial fibrillation. _Sci. Rep._ 7, 41127 (2017). Article  PubMed  PubMed Central  ADS  Google Scholar  * Livak, K. J. & Schmittgen, T. D. Analysis of relative gene expression data


using real-time quantitative PCR and the 2 CT method. _Methods_ 25, 402–408 (2001). Article  CAS  PubMed  Google Scholar  * Risk Factors: Age-National Cancer Institute.


https://www.cancer.gov/about-cancer/causes-prevention/risk/age. Accessed 9 Feb 2022 (2022) * Krämer, A., Green, J., Pollard, J. & Tugendreich, S. Causal analysis approaches in ingenuity


pathway analysis. _Bioinformatics_ 30, 523–530 (2014). Article  PubMed  Google Scholar  * Kanehisa, M., Furumichi, M., Sato, Y., Kawashima, M. & Ishiguro-Watanabe, M. KEGG for


taxonomy-based analysis of pathways and genomes. _Nucleic Acids Res._ 51, D587 (2023). Article  PubMed  Google Scholar  * Kanehisa, M. Toward understanding the origin and evolution of


cellular organisms. _Protein Sci._ 28, 1947–1951 (2019). Article  CAS  PubMed  PubMed Central  Google Scholar  * Kanehisa, M. & Goto, S. KEGG: Kyoto encyclopedia of genes and genomes.


_Nucleic Acids Res._ 28, 27–30 (2000). Article  CAS  PubMed  PubMed Central  Google Scholar  * Shannon, P. _et al._ Cytoscape: A software environment for integrated models of biomolecular


interaction networks. _Genome Res._ 13, 2498–2504 (2003). Article  CAS  PubMed  PubMed Central  Google Scholar  * Beaudenon, S. & Huibregtse, J. M. HPV E6, E6AP and cervical cancer. _BMC


Biochem._ 9, S4 (2008). Article  PubMed  PubMed Central  Google Scholar  * Milde-Langosch, K., Riethdorf, S., Kraus-Pöppinghaus, A., Riethdorf, L. & Löning, T. Expression of


cyclin-dependent kinase inhibitors p16MTS1, p21WAF1, and p27KIP1 in HPV-positive and HPV-negative cervical adenocarcinomas. _Virchows Arch._ 439, 55–61 (2001). Article  CAS  PubMed  Google


Scholar  * Harden, M. E. & Munger, K. Human papillomavirus 16 E6 and E7 oncoprotein expression alters microRNA expression in extracellular vesicles. _Virology_ 508, 63–69 (2017). Article


  CAS  PubMed  Google Scholar  * Witten, D., Tibshirani, R., Gu, S. G., Fire, A. & Lui, W.-O. Ultra-high throughput sequencing-based small RNA discovery and discrete statistical


biomarker analysis in a collection of cervical tumours and matched controls. _BMC Biol._ 8, 1–14 (2010). Article  Google Scholar  * Islam, S. _et al._ Quantitative single-cell RNA-seq with


unique molecular identifiers. _Nat. Methods_ 11, 163–166 (2013). Article  PubMed  Google Scholar  * Snoek, B. C. _et al._ Genome-wide microRNA analysis of HPV-positive self-samples yields


novel triage markers for early detection of cervical cancer. _Int. J. Cancer_ 144, 372–379 (2019). Article  CAS  PubMed  Google Scholar  Download references ACKNOWLEDGEMENTS We are most


grateful to the women who participated in the study. We are very grateful for the inputs of Dr Silvio Urcuqui and Dr Luis Fernando Barrera during the tutorial committees of MIG for the


master thesis, to The of ASCUS-COL study members of allied Institutions, and clinicians contributions in the collection of the samples, to Arianis Tatiana Ramirez and Kelly Melissa Castañeda


for the collection of the samples of the BIOMARKERS study and to the Translational Genomic Core of Louisiana State University Health Sciences Center, where the laboratory work was


conducted. We are grateful to Dr Kate Cuschieri who performed the HPV genotyping of the samples by CLART HPV4 (Clinical Array Technology, Genomica, Madrid, Spain) at the Queen's Medical


Research Institute of The University of Edinburgh (Edinburgh, United Kingdom). FUNDING G.S. received funds from Colombian government Science and Technology Agency-MinCiencias (grant number


111584467661) and Universidad de Antioquia (Estrategia de Sostenibilidad 2013–2014). J.Z. received funds from National Cancer Institute- NCI (grant numbers P30GM114732, 1P20GM121288-01,


1P20CA202922-01A1). M.G. and Y.C. received fellowships from Science and Technology Agency-MinCiencias (Fortalecimiento de Programas y Proyectos en Ciencias Médicas y de la Salud con Talento


Joven e Impacto Regional 752-2018). AUTHOR INFORMATION AUTHORS AND AFFILIATIONS * Infection and Cancer Group, School of Medicine, Universidad de Antioquia, Cra 51D No 62-29 Lab 219,


Medellín, 050010, Antioquia, Colombia Martha I. González-Ramírez, Yurley T. Cardona, María C. Agudelo, Juan J. Florez-Acosta, Samuel Agudelo-Gamboa, Carlos A. Orozco-Castaño & Gloria I.


Sánchez * Department of Pathology, Universidad de Antioquia, Medellín, 050010, Antioquia, Colombia Carolina López * Stanley S. Scott Cancer Center, Louisiana State University Health Sciences


Center, New Orleans, LA, 70112, USA Jone Garai, Li Li & Jovanny Zabaleta * School of Health and Sport Sciences, Fundación Universitaria del Área Andina, Bogotá, 110111, Cundinamarca,


Colombia Carlos A. Orozco-Castaño * Department of Interdisciplinary Oncology, Louisiana State University Health Sciences Center, New Orleans, LA, 70112, USA Jovanny Zabaleta Authors * Martha


I. González-Ramírez View author publications You can also search for this author inPubMed Google Scholar * Yurley T. Cardona View author publications You can also search for this author


inPubMed Google Scholar * María C. Agudelo View author publications You can also search for this author inPubMed Google Scholar * Carolina López View author publications You can also search


for this author inPubMed Google Scholar * Juan J. Florez-Acosta View author publications You can also search for this author inPubMed Google Scholar * Samuel Agudelo-Gamboa View author


publications You can also search for this author inPubMed Google Scholar * Jone Garai View author publications You can also search for this author inPubMed Google Scholar * Li Li View author


publications You can also search for this author inPubMed Google Scholar * Carlos A. Orozco-Castaño View author publications You can also search for this author inPubMed Google Scholar *


Jovanny Zabaleta View author publications You can also search for this author inPubMed Google Scholar * Gloria I. Sánchez View author publications You can also search for this author


inPubMed Google Scholar CONTRIBUTIONS G.S. and M.G. conceived the study and experiments, which were designed with assistance from J.Z., M.A. and C.A., J.G. and L.L. conducted the isolation


and sequencing of miRNAs from the discovery set samples, M.G. and Y.C. conducted the isolation and the RT-qPCR of miRNAs from validation set samples. C.L. conducted the histopathological


diagnosis confirmation. M.G., Y.C., J.F., S.A., and G.S. analyzed the results. M.C. reported the data of clinical management, and hrHPV testing and did the pairing of samples. M.G., J.F.,


S.A., prepared the last version of figures and tables with the advisory of G.S. and J.Z. All authors contributed to the writing of the manuscript, and read, reviewed, and approved the final


version. CORRESPONDING AUTHOR Correspondence to Gloria I. Sánchez. ETHICS DECLARATIONS COMPETING INTERESTS The authors declare no competing interests. ADDITIONAL INFORMATION PUBLISHER'S


NOTE Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations. SUPPLEMENTARY INFORMATION SUPPLEMENTARY FIGURE 1. SUPPLEMENTARY


FIGURE 2. SUPPLEMENTARY FIGURE 3. SUPPLEMENTARY TABLE 1. SUPPLEMENTARY TABLE 2. SUPPLEMENTARY TABLE 3. SUPPLEMENTARY TABLE 4. SUPPLEMENTARY TABLE 5. SUPPLEMENTARY TABLE 6. SUPPLEMENTARY


TABLE 7. RIGHTS AND PERMISSIONS OPEN ACCESS This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and


reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes


were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material.


If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to


obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. Reprints and permissions ABOUT THIS ARTICLE CITE THIS


ARTICLE González-Ramírez, M.I., Cardona, Y.T., Agudelo, M.C. _et al._ miRNAs signature as potential biomarkers for cervical precancerous lesions in human papillomavirus positive women. _Sci


Rep_ 13, 9822 (2023). https://doi.org/10.1038/s41598-023-36421-9 Download citation * Received: 29 November 2022 * Accepted: 03 June 2023 * Published: 17 June 2023 * DOI:


https://doi.org/10.1038/s41598-023-36421-9 SHARE THIS ARTICLE Anyone you share the following link with will be able to read this content: Get shareable link Sorry, a shareable link is not


currently available for this article. Copy to clipboard Provided by the Springer Nature SharedIt content-sharing initiative