Dynamic Programming for Single Nucleotide Polymorphism ID Identification in Systematic Association Studies
Article Outline
Single nucleotide polymorphisms (SNPs) play an important role in personalized medicine. However, the SNP data reported in many association studies provide only the SNP nucleotide/amino acid position, without providing the SNP ID recorded in National Center for Biotechnology Information databases. A tool with the ability to provide SNP ID identification, with a user-friendly interface, is needed. In this paper, a dynamic programming algorithm was used to compare homologs when the processed input sequence is aligned with the SNP FASTA database. Our novel system provides a web-based tool that uses the National Center for Biotechnology Information dbSNP database, which provides SNP sequence identification and SNP FASTA formats. Freely selectable sequence formats for alignment can be used, including general sequence formats (ACGT, [dNTP1/dNTP2] or IUPAC formats) and orientation with bidirectional sequence matching. In contrast to the National Center for Biotechnology Information SNP-BLAST, the proposed system always provides the correct targeted SNP ID (SNP hit), as well as nearby SNPs (flanking hits), arranged in their chromosomal order and contig positions. The system also solves problems inherent in SNP-BLAST, which cannot always provide the correct SNP ID for a given input sequence. Therefore, this system constitutes a novel application which uses dynamic programming to identify SNP IDs from the literature and keyed-in sequences for systematic association studies. It is freely available at http://bio.kuas.edu.tw/SNPosition/.
Key Words: association studies , BLAST , BLAT , dynamic programming , single nucleotide polymorphisms
No full text is available. To read the body of this article, please view the PDF online.
References
- . SNP discovery in associating genetic variation with human disease phenotypes . Mutat Res . 2005;573:41–53
- . SNPs in cancer research and treatment . Br J Cancer . 2004;90:747–751
- . SNPs and haplotypes: genetic markers for disease and drug response (review) . Int J Mol Med . 2003;11:379–382
- . SNPper: retrieval and analysis of human SNPs . Bioinformatics . 2002;18:1681–1685
- FESD: a Functional Element SNPs Database in humans . Nucleic Acids Res . 2005;33:D518–D522
- SNPHunter: a bioinformatic software for single nucleotide polymorphism data acquisition and management . BMC Bioinformatics . 2005;6:60
- Genewindow: an interactive tool for visualization of genomic variation . Nat Genet . 2005;37:109–110
- SNPServer: a realtime SNP discovery tool . Nucleic Acids Res . 2005;33:W493–W495
- . GeneSNPs-blast function . Available at: http://www.genome.utah.edu/genesnps/cgi-bin/blast.cgi [Date accessed: August 23, 2006]
- . BLAST: at the core of a powerful and diverse set of sequence analysis tools . Nucleic Acids Res . 2004;32:W20–W25
- dbSNP: the NCBI database of genetic variation . Nucleic Acids Res . 2001;29:308–311
- . SNP-BLAST . Available at: http://www.ncbi.nlm.nih.gov/SNP/snp_blastByOrg.cgi [Date accessed: January 9, 2009]
- . What is dynamic programming? . Nat Biotechnol . 2004;22:909–910
- . A systematic approach to dynamic programming in bioinformatics . Bioinformatics . 2000;16:665–677
- . A general method applicable to the search for similarities in the amino acid sequence of two proteins . J Mol Biol . 1970;48:443–453
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programs . Nucleic Acids Res . 1997;25:3389–3402
- SNP ID-info: SNP ID searching and visualization platform . OMICS . 2008;12:217–226
- The UCSC Genome Browser Database: update 2006 . Nucleic Acids Res . 2006;34:D590–D598
- . BLAT—the BLAST-like alignment tool . Genome Res . 2002;12:656–664
- HapBlock: haplotype block partitioning and tag SNP selection software using a set of dynamic programming algorithms . Bioinformatics . 2005;21:131–134
- . Inference of missing SNPs and information quantity measurements for haplotype blocks . Bioinformatics . 2005;21:2001–2007
- SNP-RFLPing: restriction enzyme mining for SNPs in genomes . BMC Genomics . 2006;7:30
PII: S1607-551X(09)70057-9
doi:10.1016/S1607-551X(09)70057-9
© 2009 Elsevier. Published by Elsevier Inc. All rights reserved.
