Introduction
A human genome stores the genetic information encoded as DNA sequences and all cells in multicellular organisms harbour identical genotype. The sequence arrangements contain basic information controlling and directing gene expression, which are further upgraded with different mechanisms, governing fine tuning of these processes and adding additional levels of regulation. Epigenetic modifications are one example of these mechanisms, exerting functionally relevant changes in the genome. Those modifications collectively modify gene expression patterns and control different biological processes such as cell differentiation, proliferation, pre-mRNA processing, survival, genomic imprinting, and X chromosome inactivation (1-3). Epigenetic mechanisms, including DNA methylation, DNA hydroxymethylation, post-translational modifications (PTMs) of histone proteins, nucleosome remodelling, and regulation by non-coding RNAs (ncRNAs) work in concert with cis and trans acting elements to drive appropriate gene expression (4-6). Cis acting elements denote DNA sequences, which are part of gene promoters or other structural portions of a coding sequence that is required for expression. Trans acting elements or factors are proteins that bind to cis acting sequences commanding gene or ncRNAs expression. In this manner, cells control DNA transcription by turning on or off coding sequences as well as non-coding functional RNAs, depending on the type of epigenetic modifications (2).
Epigenetic modifications are written, erased, read, and regulated by a set of specific proteins (1). Although many of these changes are coded within the genome, accumulating evidence showed that environmental influences, such as smoking, starvation, lifestyle, drugs, disease, and aging, can induce long-term changes in epigenetic memory (1-3). It appears that these changes could also be heritable as well as sustained and transmitted through several generations (3, 7, 8). Hence, the dynamic epigenetic landscape is a bridge between environmental exposures and more stable genetic information, stored in the genome, allowing multicellular organisms’ quick adaptation mode, thus abrogating the necessity for time-consuming process of changing the DNA sequence through mutations followed by natural selection (2, 7).
Dysregulation of epigenetic memory has been attributed as pathogenic mechanism driving the initiation and progression of several complex diseases. A vast amount of research has been performed linking aberrant DNA methylation profiles and histone modifications to developmental defects, obesity, asthma, cancers and neurodegenerative diseases (9). However, given the complexity of epigenetic mechanisms, which are further influenced by aging, human genetic variations, such as polymorphisms, and environmental factors, there is still a long way towards collecting, researching, and deciphering of epigenetic information (7, 10). Translation of all these mechanisms into relevant biological information will require combined efforts of scientists from different fields.
In this review we attempted to give the medical readers an overview of the epigenetic field by discussing DNA methylation in the context of its dysregulation in selected epithelial cancers. More specifically, we discussed potential significant aberrant methylation patterns in breast, pancreatic, colorectal, and gastric cancer. These cancer types represent informative cancer models where heterogeneous epigenetic nature hampers the search for specific and sensitive biomarkers; although enormous research efforts had already produced successful diagnostic aids for some of them. We focused on DNA methylation markers, which could be most relevant for translation into clinical setting and could be detected in easily accessible diagnostic specimens.
DNA methylation
Feinberg and Vogelstein were the first to associate differences in DNA methylation status to cancer in the early 80’ of the previous century (11). Since then, there has been an explosion of research regarding aberrant DNA methylation in complex diseases (Figure 1). Simplicity of techniques employed for DNA methylation research and recently developed genome-scale methods for mapping DNA methylation across whole genomes have already resulted in translation of basic discoveries. Therefore, aberrant methylation of genes were associated with diseases, such as colorectal cancer, Prader-Willi, Angelman, Beckwith-Wiedmann syndromes and introduced into routine clinical diagnostics. DNA methylation patterns work in close association with histone modifications and chromatin structure, building transcriptionally active or repressed chromatin (2, 7, 12). It is important to recognize that crosstalk between these three mechanisms is complex and that aberrant DNA methylation can impact histone modifications and chromatin structure. The opposite mechanism is also possible, so histone modifications and dysregulation of histone modifying proteins can also influence DNA methylation patterns.
Mechanisms of DNA methylation
In mammalian cells, DNA methylation consists of covalent attachment of a methyl group to the 5’ position of cytosine residues in CG dinucleotides (2, 3). CG dinucleotides are not randomly distributed throughout the genome, but tend to cluster in regions called CpG islands, mainly present in the gene promoters (2, 5, 6). An accepted definition of CpG islands describes them as DNA sequences, more than 200 base pairs long, with CG content greater than 50% and an observed / expected CpG ratio of more than 60% (5, 13). Methylation can also occur at non-promoter CpG islands, defined as CpG shores, located in the vicinity of CpG islands up to 2 kilobases long (5, 14, 15). Methylation of CpG islands is typically associated with gene silencing, while demethylation of these sites enables transcription (1, 5). Methylation physically prevents the binding of regulatory proteins, which promote transcription, and through binding of methylation-sensitive proteins, which mask regulatory sequences or by interacting with histone modifications that modulate the structure of chromatin, enabling or disabling access of transcriptional machinery to promoters (1, 3). All these epigenetic marks work together to compress nucleosomes into transcriptionally unfavourable conformation, heterochromatin (3).
Methylation patterns in differentiated cells are mostly stable and inheritable (13). However, specific proteins, DNA methyl-transferases (DNMTs), can establish novel methylation marks in response to environmental triggers (3). DNMT3a and DNMT3b are responsible for the de novo methylation, while DNMT1 maintains silencing inheritance patterns through mitosis (1, 3). Some DNMTs remain tightly associated with nucleosomes; their binding assisted through interactions with methyl binding domains (MBDs), and may recruit other proteins containing MBDs to reinforce the silencing. These complexes further interact and recruit proteins involved in histone modifications (3).
Demethylation process is mainly active in early embryo development and can occur via passive demethylation during cell division or via active demethylation using TET1 and TET2 mediated hydroxylation of 5-methylcytosine to 5-hydroxymethylcytosine, followed by deamination and base excision repair (1).
Methods for DNA methylation discovery
DNA methylation represents a new opportunity in cancer genetics to provide biomarkers for diagnostic and prognostic use in clinical environment (16). In the past decade, due to significant technical developments, the research of DNA methylation has evolved from locus specific approach to genome-wide determination of methylome with base pair resolution (17).
Many different techniques for DNA methylation determination exist, and choosing the most appropriate one largely depends on the nature and number of the samples, information required, and costs. The three main approaches are: methylation-specific restriction enzyme digestion, affinity purification of methylated DNA and bisulfite conversion of DNA (18). The DNA obtained is further subjected to molecular-genetic approaches, which are for single locus analysis based on PCR, whereas in the case of genome-wide interrogation they are based either on microarray technology, mass spectroscopy or next generation sequencing analysis.
In the beginning the DNA methylation studies were performed with methylation-sensitive restriction enzymes and Southern blotting (19). However, today the majority of the methods rely on bisulfite conversion of DNA and subsequent PCR-based method (Table 1). The application of bisulfite treatment demands laboratory experience, since it can lead to DNA degradation and unwanted conversion of methylated cytosines to thymines when the incubation times are prolonged or incomplete conversion if incubation is too short. In order to overcome these problems numerous commercially available kits for bisulfite conversion of DNA isolated from different sample types were developed (20).
Table 1
When performing the analysis of the specific locus, the investigated region should ideally be unmethylated in normal tissue and methylated in cancerous tissue or vice versa. In addition, the methylation levels should enable differentiation between the two statuses of the samples (21).
In the laboratory setting one of the mostly used methods for locus specific methylation biomarkers is pyrosequencing, which is very appropriate for degraded formalin-fixed, paraffin-embedded (FFPE) samples that are an important part of tissue bio-banks. The technique enables quantitative analysis of each CpG position (22). Another method, which is fast and also robust, is high resolution melting (HRM) curve analysis. Two typical methods in wider use are methylation-sensitive HRM (MS-HRM) curve analysis and quantitative real time PCR, such as SMART-MSP (23). Both techniques are relatively cheap and sensitive, allow relatively good throughput and quantification, and are closed tubes techniques. The latter minimizes the risk for sample confusion and cross-contamination which is of vital importance in clinical laboratory (16, 24). The MS-HRM method has already been tested on samples with small amounts of DNA, like stool, and has proven to be sensitive and reliable enough to be used for screening purposes (25). Both techniques have demonstrated to be successful also on old FFPE tissues (26). However, these techniques require well designed primer pairs and stringent annealing temperatures in order to overcome the problem with false positive results. They are also not completely reliable when analysing heterogeneous DNA methylation patterns.
Another useful technique, still considered as a single locus analysis, is the matrix-assisted laser desorption ionization - time of flight (MALDI-TOF). The Sequenom Inc. has developed a sensitive and high throughput assay MassARRAY® EpiTYPER®, which enables quantitative screening and differential methylation analysis in cancer samples (27).
Roche 454 Genome Sequencer and Illumina Genome analyser are the most used next generation sequencing platforms in research (28) and therefore likely to be validated for clinical use. They are becoming the key players in cancer genome-wide methylome determination, which could result in determination of an array of biomarkers. These arrays should subsequently be subjected to thorough testing on larger sample cohorts using more-cost effective methods. The costs of next-generation sequencing are currently still too high to allow larger sample testing. So far research performed on these platforms proves to be useful to provide fingerprints of cancer methylomes, which could help in cancer subtypes classification. However, due to the limited knowledge of functional consequences of methylation aberrations, enormous number of discovered changes and overlapping changes between different cancers, establishment of cancer specific methylation signatures is still far away.
Roche 454 system was one of the first platforms that enabled a comprehensive multi-sample, multi-gene, and ultra-deep sequencing of cancer DNA generating specific methylation patterns. Beside the high number of reads, and therefore detailed sequence coverage, an important advantage of this technology was simultaneous exploration of genetic and epigenetic data (29). Another platform that offers genome-wide methylome studies is the Infinium HumanMethylation BeadChip microarray platform (Illumina). One of the most useful arrays seems to be the one with 27,578 highly informative CpG islands located within the proximal promoter regions (30). However, this technology needs high-quality DNA, which is not the most optimal for clinical setting, since the samples are mostly stored as FFPE. The comparison of fresh-frozen samples with FFPE showed that the correlation of results between them was not optimal (31).
As we have seen, the methods for DNA methylation analysis are abundant, however, when considering their application for use in clinical diagnostics the main drawbacks are the standardization of methods between laboratories, determination of reference standards, and costs, associated with training of personnel and obtaining new equipment (16). DNA methylation techniques that could be used in clinical setting should be easy to use, high throughput, preferably automatable, applicable on degraded DNA, cost-effective, and should provide quantitative methylation data (16, 21). On the other hand, DNA methylation is a stable covalent modification, present at single or multiple CpG sites, and as such may be easily translated into robust and high performance laboratory tests (32). Furthermore, biomarker evaluation should be performed in readily accessible diagnostic specimens, such as blood, urine, faeces or saliva in order to detect early stages of the disease.
Methylation profiles of selected cancer types – current knowledge
DNA methylation biomarkers could be used in two different ways: as confirmatory diagnostic, prognostic and predictive markers in an already diagnosed tumour or as markers for early disease and/or residual disease screening (21, 33).
An important feature of cancers is global hypomethylation across the genome, whereas hypermethylation is mostly observed locally (21). It is assumed that the consequence of hypomethylation of single-copy genes is activation of expression, leading to enhanced oncogenic potential. The role of hypomethylation of repeated sequences, which comprise 45% of human genome, is less understood, although it has been hypothesized to facilitate genomic instability (34). Hypermethylation, on the other hand is associated with inactivation of tumour suppressor genes and genes, implicated in normal homeostasis of tissues (35).
When the methylation status of whole genome is determined it is referred to as a cancer methylome fingerprint, while designation of methylation biomarker applies to methylation status of particular sites in the genome. Both are important in the development of clinical protocol that could enable the clinician to better diagnose and stage the disease, and to predict the prognosis and monitor the response to therapy (36).
The number of studies analysing the methylome and focal hypermethylation is enormous. However, so far there are only a few CE-IVD approved DNA methylation biomarkers, like MGMT in gliomas (37) or SEPT9 for colorectal cancer and SHOX2 for lung cancer in clinical use. This is mainly due to big inter-study differences, which makes it hard to compare the results in an unambiguous way. This has been clearly demonstrated by Li and co-workers who performed meta-analysis to evaluate the prognostic value of global DNA hypomethylation in cancer (38). Their initial search retrieved 146 citations, but after checking them for a few criteria, like assessment of global DNA hypomethylation level, the survival, the hazard ratio, and the sample size of more than thirty subjects, only 20 studies remained eligible to be included in the meta-analysis (38). Although the initial differences were pronounced, the pooled analysis of all the included studies demonstrated that global DNA hypomethylation could be associated with worse survival in cancer patients (38).
Studies showed that it is challenging to find biomarkers with high informativeness for clinical use, despite the fact that some results seem to replicate across studies. However, more defined protocols for study executions and more consensuses regarding the methods used should be established to gain applicable knowledge in the near future.
Breast cancer
Genome-wide expression analyses have defined five biologically and clinically distinct cancer subtypes of breast cancer: luminal-A, luminal-B, HER-2, normal-like, and basal-like (39). Timely detection of breast cancer is crucial for better prognosis and treatment, since the survival drops from 98% to 20% when tumour metastases appear (40). Among the first findings in breast cancer genetics were BRCA1 mutations, which importantly elevate the risk for the disease. Furthermore, it has been shown that not only genetic mutations, but also promoter methylation patterns represent a marker, particularly for early detection of malignant growth. Namely, Wong and co-workers determined that BRCA1 promoter methylation was associated with 3.5-fold increased risk for early onset of breast cancer. Importantly, they used peripheral blood as diagnostic specimen for the determination of methylation status of BRCA1 promoter, which could be therefore used as a screening tool (41). Promoter hypermethylation of BRCA1 could serve also as predictive biomarker for PARP1 inhibitor therapy, which has been currently used only for treatment of patients with BRCA1 and BRCA2 mutations to decrease the growth and vascularisation of tumour (42).
Since breast cancer has many faces, so far no gene that would apply to all the subtypes has been determined. Studies, analysing smaller panels of genes seem to be the most promising way towards a clinically relevant assay for breast cancer diagnostics. In the following section we will review the most recent studies of assays with clinical relevance.
The most recent study on advanced breast cancer, which could be used in monitoring and treatment response, was performed by Fackler and co-workers who tested a panel of 10 markers on cell-free circulating DNA (cfDNA) (43). The proposed assay was based on their previous results from Infinium Human Methylation27K Beadchip on tumour and serum samples from patients, and on normal samples of breast tissue and serum. The markers included in the test had low methylation levels in normal samples and high levels in cancer ones. The selected markers were AKR2B2, COL6A2, GPX7, HIST1H3C, HOXB4, RASGRF2, and TM6SF1, and additionally three markers, ARHGEF7, TMEFF2, and RASSF1A, previously associated with breast cancer, were chosen. The specificity and sensitivity of the assay were 96.4% and 91.7%, respectively. The best individual performance was obtained for the genes RASSF1A, HIDR1H3C, RASGRF2, COL6A2, HOXB4, and AKR1B1, which combined in a panel, performed almost as good as the 10 - gene panel (Table 2). Furthermore, the results showed applicability for measuring the treatment response, which could be of great benefit for physician (43). Kloten and co-workers studied promoter methylation of seven genes SFRP1, SFRP2, SFRP5, WIF1, DKK3, ITIH5, and RASSF1A. They performed the analysis on test set of samples, and did the validation of the best markers DKK3, ITIH5, and RASSF1A on additional independent samples. They were able to achieve sensitivity of 51.4% and a specificity of 80.5% to distinguish breast cancer patients from both healthy and benign controls. The sensitivity and specificity of DKK3 and ITIH5 in premenopausal women was 52% and 100%, respectively, making them promising candidates for early cancer detection (44). Radpour and co-workers tested different set of ten tumour suppressor genes (APC, BIN1, BMP6, BRCA1, CST6, ESR-b, GSTP1, P16, P21 and TIMP3) for which they determined in their previous study to be hypermethylated in breast cancer tumours compared to controls (45). Eight of the studied genes showed the methylation level statistically significantly higher in cfDNA of patients compared to controls. The panel of these eight genes reached the specificity of 91.7% for plasma samples, and 95% for serum samples, and more than 90% specificity for both specimens. The higher methylation of cfDNA for seven genes was associated with higher methylation in solid tumours, compared to normal control tissues, which proves the cancerous tissue as the source of the cfDNA (46). Another study on cfDNA that was associated with early diagnosis of breast cancer was performed by Agostini and co-workers, who determined that amount of cfDNA and methylation of RASSF1A, MAL, and SFRP1, could be used as predictive biomarker (47).
Table 2
Cancer type | DNA methylation marker | Main results | Reference |
---|---|---|---|
Breast | RASSF1A, HIDR1H3C, RASGRF2, COL6A2, HOXB4, AKR1B1 |
High methylation levels in cancer samples. Applicable for measuring treatment response. |
(43) |
DKK3, ITIH5, RASSF1A | Possible to discriminate breast cancer patients from benign cases and healthy controls with sensitivity of 51.4% and a specificity of 80.5%. | (44) | |
DKK3, ITIH5 | Candidate genes for early cancer detection with 52% and 100% sensitivity and specificity, respectively. | (44) | |
APC, BIN1, BRCA1, CST6, GSTP1, P16, P21, TIMP3 | Higher methylation of cfDNA in cancer patients compared to controls with over 90% for either plasma or serum samples or both. Results were confirmed also on triple matched samples from tumour, normal and serum sample. | (45, 46) | |
MAL, RASSF1A, SFRP1 | 69% specificity and 100% sensitivity of the test was achieved when using the combination of cfDNA ALU247 quantification and methylation status. | (47) | |
|
BCAP31, FZD9, HDAC1, HOXA11, LYN, PTPRH, ST6GAL1, TFF1 |
Strong associations between methylation and ER status, PR status, TP53 mutation status, histological grade and molecular subtype. BCAP31 hypomethylation showed worse survival and OGG1 hypermethylation showed that patients with normal - like levels of methylation. |
(48) |
Pancreatic | Test of 1505 CpG sites | Blood-based test. With a model with five CpG sites (IL10_P348, LCN2_P86, ZAP70_P220, AIM2_P624, TAL1_P817) could differentiate between patients and controls. | (58) |
ADAMTS1, BNC1 | Early stage cancer diagnostics. | (55) | |
|
|
Could be used as a screening tool for high-risk individuals. |
|
Cancer type |
DNA methylation marker |
Main results |
Reference |
NPTX2 | Potential biomarker for differential diagnostics of pancreatic cancer and chronic pancreatitis, and early disease detection with sensitivity and specificity of 80% and 76%, respectively. | (56, 57) | |
|
ACIN1, TNFRSF10C |
Blood-based test. Hypermethylation of TNFRSF10C and ACIN1 associated with shorter survival. Higher levels of TNFRSF10C methylation were associated with perineural spread of cancer cells. |
(59) |
Colorectal | Genome-wide differential methylation patterns | Identification of differential methylation patterns that could be used for distinguishing cancer tissues from normal ones. | (15, 61, 65-67) |
MGMT | MGMT hypermethylation detection in blood or tissues is used as an aid in diagnostic detection of CRC cells, clinical research, and pharma/biopharma testing. | (71-73) | |
SEPT9 | SEPT9 hypermethylation detection in blood plasma is used as an aid in diagnostic evaluation of CRC. | (74, 76, 79-81) | |
VIM | Vimentin hypermethylation detection in stool samples (and blood) is used as an aid in diagnostic detection of CRC cells. | (77, 78) | |
ALX4, SEPT9, TMEFF2 | The sensitivities and specificities of the combined biomarkers for the detection of CRC in primary tissues and peripheral blood samples were 84% and 81% and 87% and 90%, respectively. | (74) | |
SOX21, SLC6A15, NPY, GRASP, ST8SIA1, ZSCAN18) | Hypermethylation of these genes was detected in at least 50% of CRC tissues and low levels of methylation in non-neoplastic colorectal tissues showing a potential to be developed into stool-based assays. | (61) | |
BCAT1, COL4A2, DLX5, FGF5, FOXF1, FOXI2, GRASP, IKZF1, IRF4, SDC2, SOX21 | Hypermethylation of these genes was detected in CRC tissue and low methylation levels were characterized in blood samples from healthy subjects rendering them suitable for evaluation as blood-based CRC biomarkers. | (61) | |
BMP3, CACNA1G, CDKNA2, CDH1, Corf50, IGF2, NEUROG1, MGMT, MLH1, RUNX3, SFRP2, SOCS1, THBD, VHL | Additional candidate biomarkers for detection of CRC cells that are often hypermethylated in malignant cells |
(12,61,62, 71-78,80,81) |
|
|
IGF2, LINE-1 |
Candidate biomarkers for CRC showing frequent hypomethylation in pre-malignant lesions, possibly suitable for early detection of CRC. May play role in progression of distinct subtypes of CRC. |
(63, 64) |
Gastric | Genome-wide differential methylation patterns | Variable hypermethylation and hypomethylation landscape. | (93, 94) |
CRK, MOS | Hypermethylated in tissues from patients positive for Helicobacter pylori infection and hypomethylated in tissues from patients with advanced gastric cancer. May be suitable for development of methylation assays for patients with H. pylori infection showing no pre-malignant lesions. | (99) | |
ADAM23, APC, BMP-2, BX161496, CACNA2D3, CDH1, CDX2, CHFR, CHRNA3, DAPK, DCC, DOK1, GDNF, GNMT, HOXA5, ITGA4, ITGAM, KCNA4, MGMT, MINT25, MLF1, MLH1, p16, p14ARF, PRDM5, RASSF1A, RELN, RPRM, RUNX3, SOCS1, SULF1, TFPI, TIMP3, TRIM15, VAV1, WNT5A, WT1, ZIC1 | Candidate genes showing frequent hypermethylation in gastric cancer tissues. | (92, 93, 95-99) | |
ALDH2, FAM38A, LINE-1, MSX2, MTHFR, PTPN6, STAT5A | Candidate genes showing frequent hypomethylation in gastric cancer tissues. | (92, 93, 99) |
Fleischer and co-workers performed an integrated analysis, where they tested blood sample methylation status of 27 gene promoters which were further associated with gene expression, germline genotypes, and clinical parameters. They determined that methylation was more strongly associated with clinical parameters than gene expression, supporting the idea of methylation being a robust biomarker. On individual level, the methylations of BCAP31 and OGG1 were associated with survival and better prognosis, if the methylation levels of those two genes were similar to levels in normal controls (48).
As seen in the latest studies of biomarker panels, the results still highly depend on the study design, where majority of the proposed assays stem from previous whole-genome results of research groups proposing the ‘new’ potential tests. Therefore, majority of the genes studied are diverse between the studies, and thorough validation studies are needed to achieve the desired sensitivity and specificity.
Pancreatic cancer
So far, the gold standard for pancreatic cancer serum biomarker in clinical practice has been the carbohydrate antigen (CA) 19-9 (49). Recent meta-analysis has shown that it has the sensitivity and specificity of 0.8, which places it as an important diagnostic biomarker in clinical environment (50). However, it is not applicable for early-stage cancer diagnostic. As the patients with developing pancreatic cancer do not have clear clinical manifestations, early stages of the disease are difficult to diagnose, resulting in poor prognosis and high mortality (49). The search for new biomarkers that could be used for screening and prognostic purposes has been lately oriented towards epigenetics as aberrant DNA methylation has been observed already during the earliest stages of disease development (51).
Excellent recent review papers have compiled extensive data on aberrantly methylated genes in pancreatic cancer (49, 51-53). The other valuable source of information is Pancreatic Cancer Methylation Database which covers the five major subtypes of pancreatic cancer and has currently over 65,000 entries on methylation status of more than 4000 genes (54).
DNA methylation biomarkers that could indicate the existence of the precursor lesions are of great importance, since well-timed detection is one of the key issues in pancreatic cancer diagnosis. Yi and co-workers focused on two genes for early-stage cancer detection, BNC1 and ADAMTS1 that were detected as aberrantly methylated using genome-wide study. With the methylation analysis of these two genes on FFPE and serum samples they were able to detect early-stage cancers at a much higher frequency than with the current biomarker CA19-9. The use of these two genes for screening high-risk individuals in populations (e.g. the ones with family history of pancreatic cancer, hereditary pancreatitis, familial non-polyposis colorectal cancer or mutations in BRCA1/2) could therefore be justifiable (55). A study using cfDNA from plasma of patients with pancreatic cancer, chronic pancreatitis and benign biliary stone disease showed that the methylation level of NPTX2 promoter, which has been previously detected as aberrantly methylated in pancreatic cancer tissue, was significantly higher in cancer patients compared to the other two tested groups (56, 57). The sensitivity and specificity were 80% and 76%, respectively, making the NPTX2 a good potential biomarker for differential diagnostics and early disease detection (57).
In two other studies the DNA for methylation analysis was extracted from whole blood. Pedersen and co-workers performed a rather large study in two phases. Out of 1,505 CpG sites in phase I, 88 were selected and tested in phase II. They were able to differentiate between pancreatic adenocarcinoma and control samples; however, no relation of the obtained biomarkers with disease staging or other clinical parameters was available (58). Dauksa and co-workers designed a more clinically oriented study, although on a smaller sample and on promoter regions of nine tumour suppressor genes. They showed that hypermethylation of TNFRSF10C and ACIN1 was associated with shorter survival, and that higher levels of TNFRSF10C methylation were associated with perineural spread of cancer cells (59).
The so far determined biomarkers need further validation, but they have the potential to become clinically useful for early and differential diagnostics.
Colorectal cancer
Despite enormous efforts invested in the search for biomarkers that would allow early diagnosis, colorectal cancer (CRC) still remains the second leading cause of cancer-related deaths. Although hereditary types of CRC have been well characterized and main pathobiological mechanisms implicated in the onset of the disease have been revealed, the underlying causes for a large proportion of sporadically occurring CRC and a smaller subset of CRC with hereditary origins remain elusive (12, 60).
Several decades of epigenetic research have shown that aberrant DNA methylation is characteristic for sporadic CRC and that it is probably predominantly involved in the early events during malignant phenotype progression (12, 34, 61). These changes involve global hypomethylation events as well as focal hypermethylation of CpG islands and shores in the promoter regions of specific genes (12, 15, 34, 61, 62).
Initially, several well-designed studies have identified loss of methylation in CpG islands located in repetitive sequences of DNA, such as LINE-1 and Alu (63, 64). These discoveries were subsequently followed by novel findings supported with the advances in methods, capable of interrogating whole-genome methylation status. Several groups demonstrated hypomethylation of approximately one-third of genes, which were normally hypermethylated in normal colorectal tissues, and hypomethylation of long blocks of DNA, which were in normal cells packed within transcriptionally inactive heterochromatin (15, 34, 65). These long blocks were mostly associated with regions that corresponded to large organized chromatin lysine modifications (LOCKs), enriched in histone modifications characteristic for heterochromatin, or portions of chromatin associated with nuclear lamina, lamina-associated domains (LADs) (15, 34, 65-67). It is important to note, that the main feature of all these findings is extreme hypervariability in the methylation patterns and it seems that not only changes in gene expression and increased genome instability through enhanced recombination events and double strand breaks but also general variability of hypomethylation landscape is equally important in driving the neoplastic transformation (34).
In parallel with hypomethylation research, a great effort has been invested in discovering and elucidation of hypermethylation patterns in CRC. Hypermethylation of MLH1 remains one the few biomarkers that were translated into clinical utility for distinguishing sporadic CRC from hereditary Lynch syndrome, which is mostly characterized by mutations in mismatch repair genes, particularly MLH1, MSH2, and MSH6, and EPCAM. However, hundreds of mutations have been discovered in these genes, and for many of those, the pathogenicity and their functional effect on the protein product is still difficult to interpret, indicating that other pathogenic mechanisms, including epigenetic, might underlie the tumorigenesis (68-70).
MGMT hypermethylation is novel hypermethylated biomarker used in clinical diagnostics, associated with distinct mutator pathway, leading to CRC (71-73). Interestingly, it was demonstrated that polymorphism rs16906252 is closely associated with the onset of MGMT hypermethylation, thus confirming the complex nature and crosstalk between cis-acting DNA sequence and epigenetic modifications (72).
Many laboratories further identified panels of genes methylated with high frequency in CRC, such as ALX4, BMP3, CACNA1G, CDKNA2, CDH1, Corf50, IGF2, NEUROG1, RUNX3, SEPT9, SFRP2, SOCS1, THBD, TMEFF2, VHL, and VIM (12, 61, 62, 74, 75). Notably, selected panels of methylated biomarkers have been interrogated using blood specimens from patients, reaching over 80% sensitivities and specificities (62, 74). Despite extensive confirmation and validation of these epimarkers in large cohorts of patients, their clinical utility has not been reliably confirmed. However, first generations of commercially available methylation test assays, such as Epi proColon Assay (Epigenomics AG), ColoVantage® (Quest Diagnostics) and Abbott RealTime mS9 (Abbott Laboratories) for detecting mSEPT9 and ColoSure™ (LabCorp) for detecting mVIM have been validated as pre-screening biomarkers in asymptotic individuals with varying results (76-81).
In conclusion, epigenetic modifications seem to play an important role in CRC tumorigenesis; however, screening for specific methylation changes in genes is not yet justified. Deciphering the functional impact of aberrant CpG island methylation in the complex context of all the intrinsic and environmental factors, which influence epigenome, will require further studies and integration of genomic, epigenomic and proteomic data. More important, as Timp and Feinberg suggested in their recent review, research showed that far-reaching goal might be detecting the departure of methylation signatures from normal ones using expensive whole-genome technologies (34).
Gastric cancer
After decades of research it is clear that gastric cancer is a multifactorial disease involving environmental factors, such as Helicobacter pylori and Epstein-Barr virus infection, obesity, salt intake, tobacco use, alcohol consumption, and genetic susceptibility (35, 82-85). H. pylori has been recognized as class I pathogen affecting the methylation status of CDH1 and several other genes, transposable elements, and repetitive DNA sequences (83, 86-90). In some instances it seems that persistent H. pylori infection might influence specific hypermethylation of some genes, which could be eventually used as diagnostic biomarkers; however, the universal biomarkers have not yet been clearly identified. Interestingly, it has also been demonstrated that physical activity and nutrition could significantly affect methylation status of genes (91). The overall picture is further complicated by the different ethnical backgrounds of the subjects and the fact that methylation is dynamic and greatly affected by lifestyle, nutrition, drugs and chemotherapy.
Methylation events are much less characterized in gastric adenocarcinomas compared to CRC. Several studies have shown that, in general, gastric cancer cells and cancer-associated stroma exhibit global hypomethylation of genomic DNA and focal hypermethylation of several transcription sites (92-95). More than 500 publications have reported on the presence of specific promoter hypermethylation in gastric cancer (96). Due to limited space, it is impossible to present all relevant findings; however, interested readers are referred to excellent reviews, compiling relevant methylation changes in association with gastric cancer for further information (9, 89, 97).
Furthermore, meta-analyses of methylation patterns in gastric cancer tissues highlight the controversy of CpG island methylator phenotype (CIMP) in this type of cancer (38, 96, 98). Next to large variability in assays, sample collection, sample storage, and examined gene panels, many issues relevant to clinical implementation remain unaddressed (96). Likewise, lack of validation in large independent cohorts, taking into account the method of treatment, gender, smoking, age, which also affect methylation patterns, and ethnical background further challenge identification of optimal methylation signature for detecting early stomach malignant events.
Ideally, good screening biomarkers should be associated with gastric cancer, regardless of H. pylori status (99), detectable in easily accessible body fluids, such as blood or urine, unrelated to the gender and age, and specific for detecting early stages of the disease (35, 100). Although several epimarkers, such as CDH1, CHFR, DAPK, GSTP1, MLH1, p15, p16, RUNX3, RARβ, RASSF1A, and TFFPI2 have emerged as potential screening biomarkers that are detectable in serum or plasma of patients; none has yet reached a sufficiently high level of specificity and reliability for early detection of gastric cancer (89, 96).
Conclusions
The synthesis of available knowledge regarding DNA methylation markers relevant for cancers is extremely complex and here we attempted to depict the intricacy and heterogeneity of this field. Certainly, additional large mining studies could reveal novel and relevant biomarkers that could easily be overlooked in smaller literature reviews such as ours. Though aberrations of DNA methylation patterns appear to be relevant carcinogenic mechanisms, the translation of these findings into sensitive and specific diagnostic tests is hindered by the heterogeneity of epigenetic events in individuals and overlapping methylation changes that occur in different cancers (28, 32). Detection of global hypomethylation and focal hypermethylation patterns is technically demanding and currently not cost-effective for clinical laboratories. Even with modern accurate technologies available for detecting epimarkers in body fluids, the main disadvantage of these markers, namely specificity, still persists. Anti-profiling, looking for departure of methylation patterns from normal ones, is currently not applicable for the detection of the disease in whole blood (101).
One of the most promising aspects in this field is that despite complex and expensive technologies used for research, the output panels of biomarkers, specific for detecting methylation aberrations, could be interrogated with relatively simple PCR-bisulphite methods, easily applicable to the most clinical laboratories. Moreover, several commercial enterprises are offering professional diagnostic services using advanced methods such as next generation sequencing etc. for the detection of common methylation aberrations. On the other hand, the interpretation of these results is often confusing and complicated, thus requiring establishment of multidisciplinary genetic counselling teams within healthcare institutions, which will be able to deliver meaningful explanations of the analyses.
The main headway in the cancer DNA methylation research would be to replicate the findings of aberrant methylation patterns present in tissues to more accessible diagnostic samples like stool, sputum, urine, blood or serum. However, clinically applicable methylation assays still await more replication studies, testing on large cohorts of patients and healthy controls, validations with different methods, and functional analyses of methylation patterns in order to determine, if the specific methylation patterns are exclusively associated with the disease or are by-products of immune response to tumour development, stress or treatments.