Mining LINE-1 Characteristics That Mediate Gene Expression

  • Naruemon Pratanwanich
  • Apiwat Mutirangura
  • Chatchawit Aporntewan
Part of the Communications in Computer and Information Science book series (CCIS, volume 115)


We proposed to use data mining to identify LINE-1 (L1) characteristics that were associated with gene expression in bladder cancer. The data were collected from L1Base and GSE3167. The memory-efficient data structure called FP-Tree was employed to enumerate all frequent item sets. The frequent item sets were then used to produce rules for predicting “down regulation” and “not down.” Each rule was assigned a p-value by means of Chi-square test. No statistically significant rules for “down” had been found, in contrast 692 rules for “not down” were significant with odd ratios ranging from 1.68 to 1.98. All the significant rules were concentrated only in 20 characteristics. We were able to infer the L1 characteristics that down-regulated genes. Those characteristics were number of L1 elements in host genes, full-length intactness, number of CpG islands, conserved 5’UTR and mutated ORF2.


LINE-1 hypomethylation gene expression bladder cancer data mining 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Robertson, K.D.: DNA methylation and human disease. Nature Reviews Genetics 6, 597–610 (2005)CrossRefPubMedGoogle Scholar
  2. 2.
    Chalitchagorn, K., Shuangshoti, S., Hourpai, N., Kongruttanachok, N., Tangkijvanich, P., et al.: Distinctive pattern of LINE-1 methylation level in normal tissues and the association with carcinogenesis. Oncogene 23, 8841–8846 (2004)CrossRefPubMedGoogle Scholar
  3. 3.
    Phokaew, C., Kowudtitham, S., Subbalekha, K., Shuangshoti, S., Mutirangura, A.: LINE-1 methylation patterns of different loci in normal and cancerous cells. Nucl. Acids Res. 36, 5704–5712 (2008)CrossRefPubMedPubMedCentralGoogle Scholar
  4. 4.
    Subbalekha, K., Pimkhaokham, A., Pavasant, P., Chindavijak, S., Phokaew, C., et al.: Detection of LINE-1s hypomethylation in oral rinses of oral squamous cell carcinoma patients. Oral Oncology 45, 184–191 (2009)CrossRefPubMedGoogle Scholar
  5. 5.
    Han, J.S., Szak, S.T., Boeke, J.D.: Transcriptional disruption by the L1 retrotransposon and implications for mammalian transcriptomes. Nature 429, 268–274 (2004)CrossRefPubMedGoogle Scholar
  6. 6.
    Clark, D.: Molecular Biology. Elsevier Academic Press, Amsterdam (2005)Google Scholar
  7. 7.
    Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Morgan Kaufmann, San Francisco (2005)Google Scholar
  8. 8.
    Penzkofer, T., Dandekar, T., Zemojtel, T.: L1Base: from Functional Annotation to Prediction of Active LINE-1 Elements. Nucl. Acids Res. 33(Database issue), D498–D500 (2005)CrossRefGoogle Scholar
  9. 9.
    WEKA Project, The University of Waikato,
  10. 10.
    Han, J., Pei, J., Yin, Y., Mao, R.: Mining Frequent Patterns without Candidate Generation: A Frequent-Pattern Tree Approach. Data Mining and Knowledge Discovery 8(1), 53–87 (2001)CrossRefGoogle Scholar
  11. 11.

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Naruemon Pratanwanich
    • 1
  • Apiwat Mutirangura
    • 2
  • Chatchawit Aporntewan
    • 3
  1. 1.Graduate SchoolChulalongkorn UniversityBangkokThailand
  2. 2.Department of Anatomy, Faculty of MedicineChulalongkorn UniversityBangkokThailand
  3. 3.Department of Mathematics, Faculty of ScienceChulalongkorn UniversityBangkokThailand

Personalised recommendations