Document Type


Publication Date



This article is the author’s final published version in International Journal of Molecular Sciences, Volume 23, Issue 23, December 2022, Article number 14814.

The published version is available at Copyright © Gonzalez-Bosquet et al.


The preoperative diagnosis of pelvic masses has been elusive to date. Methods for characterization such as CA-125 have had limited specificity. We hypothesize that genomic variation can be used to create prediction models which accurately distinguish high grade serous ovarian cancer (HGSC) from benign tissue.

Methods: In this retrospective, pilot study, we extracted DNA and RNA from HGSC specimens and from benign fallopian tubes. Then, we performed whole exome sequencing and RNA sequencing, and identified single nucleotide variants (SNV), copy number variants (CNV) and structural variants (SV). We used these variants to create prediction models to distinguish cancer from benign tissue. The models were then validated in independent datasets and with a machine learning platform.

Results: The prediction model with SNV had an AUC of 1.00 (95% CI 1.00-1.00). The models with CNV and SV had AUC of 0.87 and 0.73, respectively. Validated models also had excellent performances.

Conclusions: Genomic variation of HGSC can be used to create prediction models which accurately discriminate cancer from benign tissue. Further refining of these models (early-stage samples, other tumor types) has the potential to lead to detection of ovarian cancer in blood with cell free DNA, even in early stage.

Creative Commons License

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.

PubMed ID