Cataloguing functionally relevant polymorphisms in gene DNA ligase I: a computational approach

Singh, Abhishek A.; Sivakumar, Dakshinamurthy; Somvanshi, Pallavi

doi:10.1007/s13205-011-0006-8

Cataloguing functionally relevant polymorphisms in gene DNA ligase I: a computational approach

Original Article
Open access
Published: 27 April 2011

Volume 1, pages 47–56, (2011)
Cite this article

Download PDF

You have full access to this open access article

3 Biotech Aims and scope Submit manuscript

Cataloguing functionally relevant polymorphisms in gene DNA ligase I: a computational approach

Download PDF

Abhishek A. Singh¹,
Dakshinamurthy Sivakumar² &
Pallavi Somvanshi³

1829 Accesses
6 Citations
Explore all metrics

Abstract

A computational approach for identifying functionally relevant SNPs in gene LIG1 has been proposed. LIG1 is a crucial gene which is involved in excision repair pathways and mutations in this gene may lead to increase sensitivity towards DNA damaging agents. A total of 792 SNPs were reported to be associated with gene LIG1 in dbSNP. Different web server namely SIFT, PolyPhen, CUPSAT, FASTSNP, MAPPER and dbSMR were used to identify potentially functional SNPs in gene LIG1. SIFT, PolyPhen and CUPSAT servers predicted eleven nsSNPs to be intolerant, thirteen nsSNP to be damaging and two nsSNPs have the potential to destabilize protein structure. The nsSNP rs11666150 was predicted to be damaging by all three servers and its mutant structure showed significant increase in overall energy. FASTSNP predicted twenty SNPs to be present in splicing modifier binding sites while rSNP module from MAPPER server predicted nine SNPs to influence the binding of transcription factors. The results from the study may provide vital clues in establishing affect of polymorphism on phenotype and in elucidating drug response.

A comprehensive in silico analysis of non-synonymous and regulatory SNPs of human MBL2 gene

Article Open access 21 June 2016

An in silico approach to characterize nonsynonymous SNPs and regulatory SNPs in human TOX3 gene

Article 14 November 2019

Prediction of the most deleterious non-synonymous SNPs in the human IL1B gene: evidence from bioinformatics analyses

Article Open access 10 June 2024

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Single nucleotide polymorphisms, often referred as SNP, are the most common DNA variations present throughout human genome with a frequency of one in thousand base pairs (Brookes 1999). SNPs present in coding region are either synonymous SNP (sSNP) in which any alteration in the codon does not result in coding of different amino acid or nonsynonymous SNP (nsSNP) where a change in codon results in coding of different amino acid. The missense mutations (a category of nsSNP) are of importance because of their ability to influence protein functions and many of them are linked to human inheritable diseases (krawczak et al. 2000; Tokuriki et al. 2008; Wang and Moult 2001). While SNPs present in other genomic regions, viz untranslated regions (UTR), intron and promoter regions have potential to influence gene regulation (Mooney 2005). SNPs in transcription factor binding site (TFBS) may disrupt the site (Boccia et al. 1996; Vasiliev et al. 1999) or may form a novel binding site (Knight et al. 1999; Piedrafita et al. 1996). Similarly, a SNP in micro RNA binding site may lead to repression of protein coding genes or activators of RNA degradation (Mishra et al. 2008). Furthermore, SNPs in splicing modifiers binding site (enhancers or silencers) may generate an unstable mRNA resulting in a defective or truncated protein (ElSharawy et al. 2006). Some SNPs are functional (Hardison 2003) and thus, their identification is crucial to understand molecular basis of complex traits and diseases in human (Shastry 2002).

The experimental techniques are most comprehensive and precise ones in distinguishing functional SNPs from neutral ones (Chen and Sullivan 2003). It is not feasible in terms of time and cost to perform laboratory experiments for all SNPs in human genome (or in single gene) and elucidate their functional importance while theoretical or computational methods aid in narrowing down the number of potentially functional SNPs present in a human gene (Ramensky et al. 2002). In this study, the authors have applied web-based computational tools to identify potentially functional SNPs influencing protein stability, binding of splicing modifiers, binding of transcription factors and binding of micro RNA in gene DNA Ligase I (LIG1, ATP-dependent). The two most important processes in which gene LIG1 participates are joining of Okazaki fragments during eukaryotic DNA replication and ligation of synthesized patch during base excision repair (BER) (Pascal et al. 2004; Vago et al. 2009; Goetz et al. 2005; Lee et al. 2008; Timson et al. 2000). DNA replication gene LIG1 also interacts with proliferation cell nuclear antigen (PCNA) (Levin et al. 1997; Montecucco et al. 1998; Liang et al. 2008) and loss in its ability to interact with PCNA jeopardises its normal functionality to join Okazaki fragments and to ligate synthesized patch during BER (Liang et al. 2008; Levin et al. 2000). SNPs in gene LIG1 may cause DNA Ligase I deficiency which results in immunodeficiency and increased sensitivity to DNA-damaging agents (Barnes et al. 1992). In this study, mutant protein structures were modelled and compared with native structure of gene product LIG1, for changes in energy and Root Mean Square Deviation (RMSD) values.

The present in silico study focuses on identification of functional SNPs in most of genomic regions of human gene LIG1 as compared to the recent in silico studies which were more focussed on identification of deleterious nsSNPs (Doss et al. 2008a, b; Rajasekaran and Sethumadhavan 2010; Kanthappan and Sethumadhavan 2010).

Materials and methods

Dataset

The single nucleotide polymorphism database (dbSNP) (Sherry et al. 2001) cited at http://www.ncbi.nlm.nih.gov/SNP was used to retrieve SNPs and their related protein sequences for the gene LIG1.

Identification of deleterious nonsynonymous single nucleotide polymorphism by sequence homology based method

Sorting Intolerant from Tolerant (SIFT) tool accessible at http://sift.jcvi.org/ was applied to detect deleterious nonsynonymous SNPs (Ng and Henikoff 2001, 2002, 2003; Kumar et al. 2009). SIFT compiles a dataset of functionally linked protein sequences by searching protein database using PSI-BLAST algorithm. Then, it builds an alignment from the homologous sequences with the query sequence and scans all positions in the alignment and calculates the probabilities for amino acids at that position. The substitution at each position with normalized probabilities less than a tolerance index or SIFT score of 0.05 are predicted to be deleterious or intolerant while those equivalent or greater than 0.05 are predicted to be tolerant (Ng and Henikoff 2001). In this study RefSeq ID or GI number and substitution(s) was given as input to SIFT blink program (Kumar et al. 2009). The program was executed on default settings i.e., best BLAST hits for each organism were included and sequences greater than 90% identity to query were removed. A total of thirty-one nsSNPs in protein transcript (NP_000225.1) of gene LIG1 (NM_000234.1) were analysed for identification of deleterious variant(s).

Identification of damaging nonsynonymous single nucleotide polymorphism by structural-homology based method

Polymorphism Phenotyping tool (PolyPhen) available at http://coot.embl.de/PolyPhen/ uses structural and evolutionary characteristics to identify deleterious nsSNPs (Sunyaev et al. 2000; Ramensky et al. 2002). PolyPhen uses either amino acid sequence or SWall protein database ID (SPTR) or accession number with the two amino acid variants along with their position as inputs. The algorithm performs sequence-based characterization of the mutation site using a blend of various algorithms, followed by the identification and alignment of homologs to the query sequence and generating profile score. The amino acid residue substitution is then mapped to the known protein 3D structures and position-specific independent counts (PSIC) scores are calculated for each of the two amino acids. Finally, PSIC score difference is computed. A PSIC score difference more than or equal to 1.5 is considered to be damaging. Based on PSIC score difference, PolyPhen ranks nsSNP into one of the following three categories: (a) Benign (b) Possibly damaging and (c) Probably damaging. A total of thirty-one nsSNPs in protein transcripts (NP_000225.1) of gene LIG1 (NM_000234.1) were analysed for identification of deleterious variant(s).

Identification of nonsynonymous single nucleotide polymorphism influencing protein stability

Cologne University Protein Stability Analysis Tool (CUPSAT) (Parthiban et al. 2006, 2007a, b) available at http://cupsat.tu-bs.de/ was applied to analyse changes in protein stability upon point mutation. The computational method makes use of amino acid-atom potentials and torsion angle distribution to assess amino acid environment of the mutation site (Parthiban et al. 2007a, b). The overall stability is calculated from atom and torsion angle potentials. In case of unfavourable torsion angles, atom potentials may have higher impact on stability which results in stabilising mutation (Parthiban et al. 2007). The output comprises of information about mutational site, its structural features, and information regarding changes in protein stability for 19 possible substitutions at the give position. The structure of gene product LIG1 was acquired from Protein Data Bank (PDB) (Berman et al. 2000), having PDB id 1x9n (A chain). The protein structure, native amino acid residue and its position was given as an input to the tool. A total of sixteen nsSNPs were evaluated for their influence on protein stability.

Identification of single nucleotide polymorphism in splicing modifier binding site

FASTSNP (Yuan et al. 2006) a web-based tool, available at http://FASTSNP.ibms.sinica.edu.tw was used to determine polymorphism(s) in coding (nsSNP and sSNP) and in UTR regions of gene LIG1 influencing splicing regulation. FASTSNP is based on a decision tree principle and uses three web services: (i) ESEfinder (Cartegni et al. 2003; Smith et al. 2006) (ii) ESE-RESCUE (Fairbrother et al. 2002), and (iii) FAS-ESS (Wang et al. 2004) to predict impact of SNPs present in splicing modifier binding sites. SNPs present in Exonic Splicing Enhancer (ESE) sites are identified by ESEfinder and ESE-RESCUE tools. ESEfinder aids in identification of sSNPs located in ESE sites that will potentially weaken the binding site and ESE-RESCUE provides cross reference to the results from ESEfinder. While SNPs present in Exonic Splicing Silencer (ESS) site are identified by FAS-ESS tool. It also aids in identification of coding SNPs that will potentially abolish ESS sites. FASTSNP also computes a score based on the level of risk i.e., 0, 1, 2, 3, 4 and 5 indicating No, Very Low, Low, Medium, High and Very High risk.

Identification of single nucleotide polymorphism in transcription factor binding site and in micro-RNA binding site

The authors used rSNP module from MAPPER web server available at http://genome.ufl.edu/mapper/mapper-main to identify SNPs present in binding site of one or more transcription factors in gene LIG1. The tool identifies TFBS in multiple genomes, by combining TRANSFAC (Matys et al. 2003, 2006) and JASPAR (Sandelin et al. 2004; Bryne et al. 2008; Portales-Casamar et al. 2010) data with profile hidden Markov model (HMMs) (Marinescu et al. 2005a, b) The gene LIG1 was given as an input to rSNP module and models from all available three libraries i.e., TRANSFAC matrices, TRANSFAC factors and JASPAR matrices were selected. The result comprises of a list of SNPs in TFBSs along with computed scores, these scores indicate changes in binding affinity of transcription factors. Furthermore, the tool does not limit its prediction to 5′UTR and promoter region but also extends it to intron region (Jun and Jing 2010).

Database of all miRNA binding sites within 200 nucleotides of a SNP (dbSMR) which may influence binding of miRNA, available at http://miracle.igib.res.in/polyreg/ was used to detect these SNPs (Hariharan et al. 2009). Both options present in database i.e., polymorphisms around predicted miRNA binding sites and polymorphisms around validated miRNA binding sites, were executed to identify SNPs influencing binding of miRNA to its target sites in gene LIG1.

Modelling nsSNPs on protein structure and determining alterations in energy and RMSD

The structure of the gene product LIG1 was acquired from PDB, having PDB id 1x9n (A chain). The Swiss-PDB Viewer (Kaplan and Littlejohn 2001) was used for mapping mutations on structure. Selenomethionine residues present in the protein structures (native and mutant) were modified as Methionine using protein preparation wizard, Schrodinger, maestro (Schrodinger Inc. USA). The native and mutated structures were parameterized with amber03 force field and energy minimization was performed using GROMACS (Hess et al. 2008) (version 4.5.1) employing steepest descent algorithm. The RMSD values were computed using structural superimposition program from the Schrodinger suite. A total of seven nsSNPs were mapped onto the protein structure and analysed for change in energy and RMSD values from native structure.

Results

Dataset

The SNPs related to LIG1 gene were acquired from dbSNP database, build 132 (Sherry et al. 2001). Out of 792 SNPs, 52 (6.56%) SNPs were present in coding region of the gene (31 nsSNP and 21 sSNP), 3 SNPs (0.37%) in 3′UTR, 736 SNPs (92.92%) in intron region, 1 SNP (0.12%) in 5′UTR.

Deleterious nonsynonymous single nucleotide polymorphisms predicted by SIFT server

Eleven nsSNPs were predicted to be deleterious with a tolerance index below 0.05. Lower the tolerance index or SIFT score, greater functional consequence an amino acid residue substitution is expected to have (Ng and Henikoff 2001). Four nsSNPs (rs111507847, rs3730947, rs34087182, rs11666150) had a tolerance index of 0.00, four nsSNPs (rs113944619, rs55686525, rs117019444, rs55950593) had a tolerance index of 0.01, two nsSNPs (rs3730863, rs3731003) had a tolerance index of 0.02, and the remaining one nsSNP (rs4987181) in the deleterious category had a tolerance index of 0.03. Seven nsSNPs (rs113944619, rs4987181, rs3730863, rs3730947, rs117019444, rs3731003, rs11666150) out of eleven nsSNPs predicted to be deleterious had a validated status (Table 1).

Table 1 Evaluation of nsSNPs from SIFT and PolyPhen servers

Full size table

Damaging nonsynonymous single nucleotide polymorphism predicted by PolyPhen

Thirteen nsSNPs out of thirty-one nsSNPs were predicted to be either possibly damaging or probably damaging and had PSIC score difference in the range of 1.548 and 2.840 (Table 1). Out of these thirteen nsSNPs, eight nsSNPs (rs113944619, rs4987181, rs12981963, rs11879148, rs55686525, rs111507847, rs34087182, and rs11666150) were put into the category of probably damaging and the remaining five nsSNPs (rs41555118, rs3730863, rs4987070, rs79897727, rs55950593) were put into the category of possibly damaging by the program. Eight nsSNPs (rs41555118, rs3730863, rs4987070, rs79897727, rs113944619, rs4987181, rs12981963, rs11879148) out of thirteen nsSNPs predicted to be in the category of either possibly damaging or probably damaging had validated status. It was observed that six nsSNPs (rs113944619, rs4987181, rs55686525, rs111507847, rs34087182, rs11666150) predicted to be probably damaging by PolyPhen server were also predicted deleterious by SIFT server. While two nsSNPs (rs3730863, rs55950593) predicted to be possibly damaging by PolyPhen server were also predicted to be deleterious by SIFT server. This shows a significant level of correlation between the results from evolutionary-based approach (SIFT) and structural-based approach (PolyPhen). The highly damaging nsSNP (rs34087182) had a PSIC score difference of 2.840 and SIFT score 0.00.

Nonsynonymous single nucleotide polymorphism responsible for destabilising protein structure

CUPSAT identified two nsSNPs (rs3731003 and rs11666150) out of sixteen nsSNPs to be influencing over all stability of the protein structure. Ten nsSNPs (rs3730933, rs111846131, rs111507847, rs3730947, rs3730966, rs4987068, rs112555243, rs74929288, rs55950593, rs11668325) only exhibited unfavourable changes in torsion angles with no influence on overall stability of protein (Table 2). The nsSNP rs11666150 predicted to be destabilising protein structure was also predicted damaging by SIFT server (SIFT score 0.00) and PolyPhen server (PSIC score difference 2.307).

Table 2 Change in protein structure and DDG upon point mutation

Full size table

Functional single nucleotide polymorphism in splicing modifiers binding site

FASTSNP predicted twenty SNPs to be influencing splicing regulation by their presence in splicing modifiers (enhancers and silencers) binding site (Table 3) (krawczak et al. 2000). Sixteen SNPs predicted to be influencing splicing regulation had a risk in range of 2–3 (low to medium) and remaining four SNPs with a risk in range of 3–4 (medium to high). Interestingly, two SNPs rs20581 and rs20580 were also highlighted in recent studies for their functional importance (Chang et al. 2008; Lee et al. 2008; Liu et al. 2009). None of the SNPs in UTR were reported to be present in splicing modifier binding sites.

Table 3 SNPs present in splicing modifier binding sites

Full size table

Functional single nucleotide polymorphism in transcription factor binding site, micro RNA binding site, and in promoter region

Gene LIG1 contains binding sites for a number of transcription factors which may mediate increased expression in dormant cells in response to growth factors (Noguiez et al. 1992). The presence of transcription factor binding site is not limited to 5′UTR or to promoter region but it also extends to intronic region (Jun and Jing 2010). Nine SNPs were predicted to be present in transcription factor binding site. Five SNPs (rs3730842, rs75696040, rs74747924, rs7246696 and rs3730840) in intron and four SNPs (rs3730838, rs752084, rs3730836 and rs79501686) in promoter region were predicted to be present in TFBS. Two SNPs (rs75696040 and rs74747924) were predicted to be present in the binding site of MZF1 transcription factor in chromosomal region between 48,673,165 to 48,673,177 on chromosome 19. Other than SNP rs79501686, all SNPs gave a score difference of more than 2, indicating the presence of SNP substantially influences binding affinity of transcription factors (Table 4). None of the SNPs by dbSMR were reported to be influencing binding of micro RNA in gene LIG1.

Table 4 SNPs present in transcription factor binding site

Full size table

Mapping and analysis of mutants on protein structure

Seven nsSNPs (rs11666150, rs55950593, rs34087182, rs3731003, rs117019444, rs3730947 and rs111507847) predicted to be deleterious by SIFT or PolyPhen server and present between the residue number 262 and 901 were mapped on the protein structure (PDB id: 1x9n, A chain) of gene product LIG1. The amino acid residue substitution was performed using Swiss-PDB Viewer to get seven mutant modelled protein structures for SNPs rs117019444, rs111507847, rs3730947, rs3731003, rs34087182, rs55950593, and rs11666150. The total energy of the native structure (1x9n, A chain, Fig. 1) and the seven mutant modelled protein structures for SNPs rs117019444, rs111507847, rs3730947, rs3731003, rs34087182, rs55950593, and rs11666150 was −52745.32, −62163.56, −40160.3672, −59187.7773, −57279.74, −59290.78, −53570.56 and −41863.19 kJ/mol, respectively (Table 5). It can be observed from Table 5 that the RMSD values fall in range of 0.00522673–0.0361993 and do not suggest much deviation while significant changes in energy of mutant structures can be observed. The mutant protein models for SNPs rs11666150 (Fig. 2) and rs111507847 showed an increase in energy compared to the energy of native structure. The result for nsSNP rs11666150 correlates with results given by SIFT, PolyPhen and CUPSAT servers. The native and mutant protein molecule structures were visualised using Visual Molecular Dynamics (VMD) program (Humphrey et al. 1996).

Table 5 RMSD value and overall energy of mutant protein structures

Full size table

Discussion

Laboratory-based techniques are most accurate and conclusive in distinguishing functional SNPs from non-functional SNPs (Chen and Sullivan 2003). But large number of SNPs present in human genome makes execution of laboratory techniques very demanding in terms of time, cost, and labour. On contrary, in silico methods can help in distinguishing potentially functional SNPs from neutral SNPs present in a gene.

The computational pipeline (Fig. 3) was applied to all SNPs linked to gene LIG1as cited in dbSNP. Eleven and thirteen nsSNPs were predicted to be deleterious by SIFT and PolyPhen server, respectively. Eight nsSNPs were predicted to be deleterious by both SIFT and PolyPhen server. Evaluation of protein stability upon point mutation by CUPSAT server showed two nsSNPs (rs11666150 and rs3731003) to be able to destabilize protein structure. Out of seven mutant models of nsSNPs only two nsSNPs (rs11666150 and rs111507847) mutant models demonstrated significant change in energy compared to native structure of protein. Interestingly, one nsSNP (rs11666150) was predicted to be intolerant, probably damaging and destabilizing by SIFT, PolyPhen and CUPSAT servers, respectively, and also its mutant structure showed a significant change in energy level. FASTSNP web server predicted twenty SNPs to be influencing splicing regulation and four were predicted with a risk in range of 3–4 (medium to high). Nine SNPs from intron and promoter region were predicted by rSNP module from MAPPER to be influencing binding of transcription factor. The in silico study was well-focussed on SNPs present in all regions of gene LIG1 as regulatory region SNPs may also be disease causatives (Hudson 2003; Yan et al. 2002). Furthermore, results of the study were in concordance with the results from recent studies (Chang et al. 2008; Lee et al. 2008; Liu et al. 2009; Ryu et al. 2009).

A large variety of tools are freely available for identification of potentially functional SNPs in a gene and each tool has different perspective for same biological problem (Thusberg and Vihinen 2009). The choice of computational tools to be used in an analysis is made on the nature of functional SNP to be identified and the amount of data and information being available for a given gene.

Conclusion

In this study nsSNP rs11666150 was found damaging by all the functional nsSNP prediction servers used. Further, its mutant structure demonstrated significant overall energy change as compared to the native structure. In this analysis, SNPs influencing binding of transcription factor and splicing modifier binding site are also predicted. However, studies will be required for in vitro validation of potentially functional SNPs in LIG1 and eventually will lead to development of better drugs against DNA ligase I deficiency (MIM: 126391). The authors suppose that the computational pipeline used in this study may also apply to any other human gene to identify potentially functional SNPs in it.

References

Barnes DE, Tomkinson AE, Lehmann AR, Webster AD, Lindahl T (1992) Mutations in the DNA ligase I gene of an individual with immunodeficiencies and cellular hypersensitivity to DNA-damaging agents. Cell 69(3):495–503
Article CAS Google Scholar
Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE (2000) The protein data bank. Nucleic Acids Res 28(1):235–242
Google Scholar
Boccia LM, Lillicrap D, Newcombe K, Mueller CR (1996) Binding of the Ets factor GA-binding protein to an upstream site in the factor IX promoter is a critical event in transactivation. Mol Cell Biol 16(5):1929–1935
Article CAS Google Scholar
Brookes AJ (1999) The essence of SNPs. Gene 234(2):177–186
Article CAS Google Scholar
Bryne JC, Valen E, Tang MH, Marstrand T, Winther O, da Piedade I, Krogh A, Lenhard B, Sandelin A (2008) JASPAR, the open access database of transcription factor binding profiles: new content and tools in 2008 update. Nucleic Acids Res 36:D102–D106
Article CAS Google Scholar
Cartegni L, Wang J, Zhu Z, Zhang MQ, Krainer AR (2003) ESEfinder: a web resource to identify exonic splicing enhancers. Nucleic Acids Res 31(13):3568–3571
Article CAS Google Scholar
Chang JS, Wrensch MR, Hansen HM, Sison JD, Aldrich MC, Quesenberry CP Jr, Seldin MF, Kelsey KT, Kittles RA, Silva G, Wiencke JK (2008) Nucleotide excision repair genes and risk of lung cancer among San Francisco Bay Area Latinos and African Americans. Int J Cancer 129(9):2095–2104
Article Google Scholar
Chen X, Sullivan PF (2003) Single nucleotide polymorphism genotyping: biochemistry, protocol, cost and throughput. Pharmacogenomics J 3(2):77–96
Article Google Scholar
Doss CGP, Rajasekaran R, Sudandiradoss C, Ramanathan K, Purohit R, Sethumadhavan R (2008a) A novel computational and structural analysis of nsSNPs in CFTR gene. Genomic Med 2(1–2):23–32
Article Google Scholar
Doss CGP, Sudandiradoss C, Rajasekaran R, Purohit R, Ramanathan K, Sethumadhavan R (2008b) Identification and structural comparison of deleterious mutations in nsSNPs of ABL1 gene in chronic myeloid leukemia: a bio-informatics study. J Biomed Inform 41(4):607–612
Article Google Scholar
ElSharawy A, Manaster C, Teuber M, Rosenstiel P, Kwiatkowski R, Huse K, Platzer M, Becker A, Nürnberg P, Schreiber S, Hampe J (2006) SNPSplicer: systematic analysis of SNP-dependent splicing in genotyped cDNAs. Hum Mutat 27(11):1129–1134
Article CAS Google Scholar
Fairbrother WG, Yeh RF, Sharp PA, Burge CB (2002) Predictive identification of exonic splicing enhancers in human genes. Science 297(5583):1007–1013
Article CAS Google Scholar
Goetz JD, Motycka TA, Han M, Jasin M, Tomkinson AE (2005) Reduced repair of DNA double-strand breaks by homologous recombination in a DNA ligase I-deficient human cell line. DNA Repair (Amst) 4(6):649–654
Article CAS Google Scholar
Hardison RC (2003) Comparative genomics. PLoS Biol 1(2):156–160
Article CAS Google Scholar
Hariharan M, Scaria V, Brahmachari SK (2009) dbSMR: a novel resource of genome-wide SNPs affecting microRNA mediated regulation. BMC Bioinform 10(1):108
Article Google Scholar
Hess B, Kutzner C, Van Der Spoel D, Lindahl E (2008) GROMACS 4: algorithms for highly efficient, load-balanced, and scalable molecular simulation. J Chem Theory Comput 4:435–447
Article CAS Google Scholar
Hudson TJ (2003) Wanted: regulatory SNPs. Nat Genet 33:439–440
Article CAS Google Scholar
Humphrey W, Dalke A, Schulten K (1996) VMD-visual molecular dynamics. J Molec Graphics 14:33–38
Google Scholar
Jun H, Jing Z (2010) Co-occurrence of core of binding sites for transcription factors in intronic region of Saccharomyces cerevisiae ribosomal protein genes. International Conference on Bioinformatics and on Biomedical Technology, Chengdu, pp 88–91. doi:10.1109/ICBBT.2010.5479005
Kanthappan R, Sethumadhavan R (2010) Identification of structurally and functionally significant deleterious nsSNPs of GSS gene: in silico analysis. Adv Biosci Biotechnol 1:361–366
Article Google Scholar
Kaplan W, Littlejohn TG (2001) Swiss-PDB viewer (Deep View). Brief Bioinform 2:195–197
Article CAS Google Scholar
Knight JC, Udalova I, Hill AV, Greenwood BM, Peshu N, Marsh K, Kwiatkowski D (1999) A polymorphism that affects OCT-1 binding to the TNF promoter region is associated with severe malaria. Nat Genet 22(2):145–150
Article CAS Google Scholar
Krawczak M, Ball EV, Fenton I, Stenson PD, Abeysinghe S, Thomas N, Cooper DN (2000) Human gene mutation database-a biomedical information and research resource. Hum Mutat 15(1):45–51
Article CAS Google Scholar
Kumar P, Henikoff S, Ng PC (2009) Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm. Nat Protoc 4(8):1073–1082
Article CAS Google Scholar
Lee YC, Morgenstern H, Greenland S, Tashkin DP, Papp J, Sinsheimer J, Cao W, Hashibe M, You NC, Mao JT, Cozen W, Mack TM, Zhang ZF (2008) A case-control study of the association of the polymorphisms and haplotypes of DNA ligase I with lung and upper-aerodigestive-tract cancers. Int J Cancer 122(7):1630–1638
Article CAS Google Scholar
Levin DS, Bai W, Yao N, O’Donnell M, Tomkinson AE (1997) An interaction between DNA ligase I and proliferating cell nuclear antigen: implications for Okazaki fragment synthesis and joining. Proc Natl Acad Sci USA 94(24):12863–12868
Article CAS Google Scholar
Levin DS, McKenna AE, Motycka TA, Matsumoto Y, Tomkinson AE (2000) Interaction between PCNA and DNA ligase I is critical for joining of Okazaki fragments and long-patch base-excision repair. Curr Biol 10(15):919–922
Article CAS Google Scholar
Liang L, Deng L, Nguyen SC, Zhao X, Maulion CD, Shao C, Tischfield JA (2008) Human DNA ligases I and III, but not ligase IV, are required for microhomology-mediated end joining of DNA double-strand breaks. Nucleic Acids Res 36(10):3297–3310
Article CAS Google Scholar
Liu Y, Scheurer ME, El-Zein R, Cao Y, Do KA, Gilbert M, Aldape KD, Wei Q, Etzel C, Bondy ML (2009) Association and interactions between DNA repair gene polymorphisms and adult glioma. Cancer Epidemiol Biomarkers Prev 18(1):204–214
Article CAS Google Scholar
Marinescu VD, Kohane IS, Riva A (2005) The MAPPER database: a multi-genome catalog of putative transcription factor binding sites. Nucleic Acids Res 33(Database Issue): D91–D97
Google Scholar
Marinescu VD, Kohane IS, Riva A (2005b) MAPPER: a search engine for the computational identification of putative transcription factor binding sites in multiple genomes. BMC Bioinform 6(1):79
Article Google Scholar
Matys V, Fricke E, Geffers R, Gössling E, Haubrock M, Hehl R, Hornischer K, Karas D, Kel AE, Kel-Margoulis OV, Kloos DU, Land S, Lewicki-Potapov B, Michael H, Münch R, Reuter I, Rotert S, Saxel H, Scheer M, Thiele S, Wingender E (2003) TRANSFAC: transcriptional regulation, from patterns to profiles. Nucleic Acids Res 31(1):374–378
Article CAS Google Scholar
Matys V, Kel-Margoulis OV, Fricke E, Liebich I, Land S, Barre-Dirrie A, Reuter I, Chekmenev D, Krull M, Hornischer K, Voss N, Stegmaier P, Lewicki-Potapov B, Saxel H, Kel AE, Wingender E (2006) TRANSFAC and its module TRANSCompel: transcriptional gene regulation in eukaryotes. Nucleic Acids Res 34:D108–D110
Article CAS Google Scholar
Mishra PJ, Banerjee D, Bertino JR (2008) MiRSNPs or MiR-polymorphisms, new players in microRNA mediated regulation of the cell: introducing microRNA pharmacogenomics. Cell Cycle 7(7):853–858
Article CAS Google Scholar
Montecucco A, Rossi R, Levin DS, Gary R, Park MS, Motycka TA, Ciarrocchi G, Villa A, Biamonti G, Tomkinson AE (1998) DNA ligase I is recruited to sites of DNA replication by an interaction with proliferating cell nuclear antigen: identification of a common targeting mechanism for the assembly of replication factories. EMBO J 17:3786–3795
Article CAS Google Scholar
Mooney S (2005) Bioinformatics approaches and resources for single nucleotide polymorphism functional analysis. Brief Bioinform 6(1):44–56
Article CAS Google Scholar
Ng PC, Henikoff S (2001) Predicting deleterious amino acid substitutions. Genome Res 11:863–874
Article CAS Google Scholar
Ng PC, Henikoff S (2002) Accounting for human polymorphisms predicted to affect protein function. Genome Res 12(3):436–446
Article CAS Google Scholar
Ng PC, Henikoff S (2003) SIFT: predicting amino acid changes that affect protein function. Nucleic Acids Res 31(13):3812–3814
Article CAS Google Scholar
Noguiez P, Barnes DE, Mohrenweiser HW, Lindahl T (1992) Structure of the human DNA ligase I gene. Nucleic Acids Res 20(15):3845–3850
Article CAS Google Scholar
Prathiban V, Gromiha MM, Schomburg D (2006) CUPSAT: prediction of protein stability upon point mutations. Nucleic Acids Res 34(2):W239–W242
Article Google Scholar
Parthiban V, Gromiha MM, Abhinandan M, Schomburg D (2007a) Computational modeling of protein mutant stability: analysis and optimization of statistical potentials and structural features reveal insights into prediction model development. BMC Struct Biol 7:54
Article Google Scholar
Parthiban V, Gromiha MM, Hoppe C, Schomburg D (2007b) Structural analysis and prediction of protein mutant stability using distance and torsion potentials: role of secondary structure and solvent accessibility. Proteins 66(1):41–52
Article CAS Google Scholar
Pascal JM, O’Brien PJ, Tomkinson AE, Ellenberger T (2004) Human DNA ligase I completely encircles and partially unwinds nicked DNA. Nature 432(7016):473–478
Article CAS Google Scholar
Piedrafita FJ, Molander RB, Vansant G, Orlova EA, Pfahl M, Reynolds WF (1996) An Alu element in the myeloperoxidase promoter contains a composite SP1-thyroid hormone-retinoic acid response element. J Biol Chem 271(24):14412–14420
Article CAS Google Scholar
Portales-Casamar E, Thongjuea S, Kwon AT, Arenillas D, Zhao X, Valen E, Yusuf D, Lenhard B, Wasserman WW, Sandelin A (2010) JASPAR 2010: the greatly expanded open-access database of transcription factor binding profiles. Nucleic Acids Res 38:D105–D110
Article CAS Google Scholar
Rajasekaran R, Sethumadhavan R (2010) In Silico identification of significant detrimental missense mutations of EGFR and their effect with 4-anilinoquinazoline-based drugs. Appl Biochem Biotechnol 160(6):1723–1733
Article CAS Google Scholar
Ramensky V, Pork P, Sunyaev S (2002) Human non-synonymous SNPs: server and survey. Nucleic Acids Res 30(17):3894–3900
Article CAS Google Scholar
Ryu GM, Song P, Kim KW, Oh KS, Park KJ, Kim JH (2009) Genome-wide analysis to predict protein sequence variations that change phosphorylation sites or their corresponding kinases. Nucleic Acids Res 37(4):1297–1307
Article CAS Google Scholar
Sandelin A, Alkema W, Engström P, Wasserman WW, Lenhard B (2004) JASPAR: an open-access database for eukaryotic transcription factor binding profiles. Nucleic Acids Res 32:D91–D94
Article CAS Google Scholar
Shastry BS (2002) SNP alleles in human disease and evolution. J Hum Genet 47(11):561–566
Article CAS Google Scholar
Sherry ST, Ward MH, Kholodov M, Baker J, Phan L, Smigielski EM, Sirotkin K (2001) The NCBI database of genetic variation. Nucleic Acids Res 29(1):308–311
Article CAS Google Scholar
Smith PJ, Zhang C, Wang J, Chew SL, Zhang MQ, Krainer AR (2006) An increased specificity score matrix for the prediction of SF2/ASF-specific exonic splicing enhancers. Hum Mol Genet 15(16):2490–2508
Article CAS Google Scholar
Sunyaev S, Ramensky V, Bork P (2000) Towards a structural basis of human non-synonymous single nucleotide polymorphisms. Trends Genet 16(4):198–200
Article CAS Google Scholar
Thusberg J, Vihinen M (2009) Pathogenic or not? And if so, then how? Studying the effect of missense mutations using Bioinformatics methods. Hum Mutat 130(5):703–714
Article Google Scholar
Timson DJ, Singleton MR, Wigley DB (2000) DNA ligases in the repair and replication of DNA. Mutat Res 460(3–4):301–318
Article CAS Google Scholar
Tokuriki N, Stricher F, Serrano L, Tawfik DS (2008) How protein stability and new functions trade off. PLos Comput Biol 4(2):e1000002
Article Google Scholar
Vago R, Leva V, Biamonti G, Montecucco A (2009) DNA ligase I and Nbs1 proteins associate in a complex and colocalize at replication factories. Cell Cycle 8(16):2600–2607
Article CAS Google Scholar
Vasiliev GV, Merkulov VM, Kobzev VF, Merkulova TI, Ponomarenko MP, Kolchanov NA (1999) Point mutations within 663–666 bp of intron 6 of the human TDO2 gene, associated with a number of psychiatric disorders, damage the YY-1 transcription factor binding site. Fed Eur Biochem Soc Lett 462(1–2):85–88
Article CAS Google Scholar
Wang Z, Moult J (2001) SNPs, protein structure, and disease. Hum Mutat 17(4):263–270
Article Google Scholar
Wang Z, Rolish ME, Yeo G, Tung V, Mawson M, Burge CB (2004) Systematic identification and analysis of exonic splicing silencers. Cell 119:831–845
Article CAS Google Scholar
Yan H, Yuan W, Velculescu VE, Vogelstein B, Kinzler KW (2002) Allelic variation in human gene expression. Science 297:1143
Article CAS Google Scholar
Yuan HY, Chiou JJ, Tseng WH, Liu CH, Liu CK, Lin YJ, Wang HH, Yao A, Chen YT, Hsu CN (2006) FASTSNP: an always up-to-date and extendable service for SNP function analysis and prioritization. Nucleic Acids Res 34(2):W635–W641
Article CAS Google Scholar

Download references

Author information

Authors and Affiliations

Department of Bioinformatics-BiGCaT, Maastricht University, Maastricht, The Netherlands
Abhishek A. Singh
Department of Bioinformatics, School of Chemical and Biotechnology, SASTRA University, Thanjavur, Tamil Nadu, India
Dakshinamurthy Sivakumar
Bioinformatics Centre, Biotech Park, Sector-G, Jankipuram, Lucknow, Uttar Pradesh, India
Pallavi Somvanshi

Authors

Abhishek A. Singh
View author publications
You can also search for this author in PubMed Google Scholar
Dakshinamurthy Sivakumar
View author publications
You can also search for this author in PubMed Google Scholar
Pallavi Somvanshi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Pallavi Somvanshi.

Rights and permissions

Open Access This is an open access article distributed under the terms of the Creative Commons Attribution Noncommercial License (https://creativecommons.org/licenses/by-nc/2.0), which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

Reprints and permissions

About this article

Cite this article

Singh, A.A., Sivakumar, D. & Somvanshi, P. Cataloguing functionally relevant polymorphisms in gene DNA ligase I: a computational approach. 3 Biotech 1, 47–56 (2011). https://doi.org/10.1007/s13205-011-0006-8

Download citation

Received: 24 February 2011
Accepted: 01 April 2011
Published: 27 April 2011
Issue Date: July 2011
DOI: https://doi.org/10.1007/s13205-011-0006-8

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Cataloguing functionally relevant polymorphisms in gene DNA ligase I: a computational approach

Abstract

Similar content being viewed by others

A comprehensive in silico analysis of non-synonymous and regulatory SNPs of human MBL2 gene

An in silico approach to characterize nonsynonymous SNPs and regulatory SNPs in human TOX3 gene

Prediction of the most deleterious non-synonymous SNPs in the human IL1B gene: evidence from bioinformatics analyses

Introduction

Materials and methods

Dataset

Identification of deleterious nonsynonymous single nucleotide polymorphism by sequence homology based method

Identification of damaging nonsynonymous single nucleotide polymorphism by structural-homology based method

Identification of nonsynonymous single nucleotide polymorphism influencing protein stability

Identification of single nucleotide polymorphism in splicing modifier binding site

Identification of single nucleotide polymorphism in transcription factor binding site and in micro-RNA binding site

Modelling nsSNPs on protein structure and determining alterations in energy and RMSD

Results

Dataset

Deleterious nonsynonymous single nucleotide polymorphisms predicted by SIFT server

Damaging nonsynonymous single nucleotide polymorphism predicted by PolyPhen

Nonsynonymous single nucleotide polymorphism responsible for destabilising protein structure

Functional single nucleotide polymorphism in splicing modifiers binding site

Functional single nucleotide polymorphism in transcription factor binding site, micro RNA binding site, and in promoter region

Mapping and analysis of mutants on protein structure

Discussion

Conclusion

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Cataloguing functionally relevant polymorphisms in gene DNA ligase I: a computational approach

Abstract

Similar content being viewed by others

A comprehensive in silico analysis of non-synonymous and regulatory SNPs of human MBL2 gene

An in silico approach to characterize nonsynonymous SNPs and regulatory SNPs in human TOX3 gene

Prediction of the most deleterious non-synonymous SNPs in the human IL1B gene: evidence from bioinformatics analyses

Introduction

Materials and methods

Dataset

Identification of deleterious nonsynonymous single nucleotide polymorphism by sequence homology based method

Identification of damaging nonsynonymous single nucleotide polymorphism by structural-homology based method

Identification of nonsynonymous single nucleotide polymorphism influencing protein stability

Identification of single nucleotide polymorphism in splicing modifier binding site

Identification of single nucleotide polymorphism in transcription factor binding site and in micro-RNA binding site

Modelling nsSNPs on protein structure and determining alterations in energy and RMSD

Results

Dataset

Deleterious nonsynonymous single nucleotide polymorphisms predicted by SIFT server

Damaging nonsynonymous single nucleotide polymorphism predicted by PolyPhen

Nonsynonymous single nucleotide polymorphism responsible for destabilising protein structure

Functional single nucleotide polymorphism in splicing modifiers binding site

Functional single nucleotide polymorphism in transcription factor binding site, micro RNA binding site, and in promoter region

Mapping and analysis of mutants on protein structure

Discussion

Conclusion

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation