Background Recent advances in mass spectrometric instrumentation and bioinformatics have critically contributed to the field of proteogenomics. Nonetheless, whether that integrative approach has reached the point of maturity to effectively reveal the flow of genetic variants from DNA to proteins still remains elusive. The objective of this study was to detect somatically acquired protein variants in breast cancer specimens for which full genome and transcriptome data was already available (BASIS cohort).

Methods LC-MS/MS shotgun proteomic results of 21 breast cancer tissues were coupled to DNA sequencing data to identify variants at the protein level and finally were used to associate protein expression with gene expression levels.

Results Here we report the observation of three sequencing-predicted single amino acid somatic variants. The sensitivity of single amino acid variant (SAAV) detection based on DNA sequencing-predicted single nucleotide variants was 0.4%. This sensitivity was increased to 0.6% when all the predicted variants were filtered for MS “compatibility” and was further increased to 2.9% when only proteins with at least one wild type peptide detected were taken into account. A correlation of mRNA abundance and variant peptide detection revealed that transcripts for which variant proteins were detected ranked among the top 6.3% most abundant transcripts. The variants were detected in highly abundant proteins as well, thus establishing transcript and protein abundance and MS “compatibility” as the main factors affecting variant onco-proteogenomic identification.

Conclusions While proteomics fails to identify the vast majority of exome DNA variants in the resulting proteome, its ability to detect a small subset of SAAVs could prove valuable for precision medicine applications.

doi.org/10.1016/j.clinbiochem.2019.01.005, hdl.handle.net/1765/116323
Clinical Biochemistry
Department of Medical Oncology

Dimitrakopoulos, L., Prassas, I., Sieuwerts, A., Diamandis, E., Martens, J., & Charames, G.S. (2019). Proteome-wide onco-proteogenomic somatic variant identification in ER-positive breast cancer. Clinical Biochemistry, 66, 63–75. doi:10.1016/j.clinbiochem.2019.01.005