Title:Exploring Patterns of Epigenetic Information with Data Mining Techniques
Volume: 19
Issue: 4
Author(s): Vanessa Aguiar-Pulido, Jose A. Seoane, Marcos Gestal, Julian Dorado
Affiliation:
Keywords:
Epigenetics, data mining, knowledge discovery, bioinformatics
Abstract: Data mining, a part of the Knowledge Discovery in Databases process (KDD), is the process of extracting patterns from large data sets by combining methods from statistics and artificial intelligence with database management. Analyses of epigenetic data have evolved towards genome-wide and high-throughput approaches, thus generating great amounts of data for which data mining is essential. Part of these data may contain patterns of epigenetic information which are mitotically and/or meiotically heritable determining gene expression and cellular differentiation, as well as cellular fate. Epigenetic lesions and genetic mutations are acquired by individuals during their life and accumulate with ageing. Both defects, either together or individually, can result in losing control over cell growth and, thus, causing cancer development. Data mining techniques could be then used to extract the previous patterns. This work reviews some of the most important applications of data mining to epigenetics.