Bioinformatics (2009)
Supplementary Information
An integrative scoring system for ranking SNPs by their potential
deleterious effects
Phil Hyoun Lee and Hagit Shatkay
Computational Biology and Machine Learning Lab
School of Computing, Queen's University, Kingston, Ontario, Canada, K7L 3N6
TABLE 1. Disease-causing or Disease-associated SNPs from OMIM and their assessed Functional Significance score
We have obtained the list of 1,399 disease-causing or disease-associated SNPs
from the FTP site: ftp://ftp.ncbi.nih.gov/snp/database/organism_data/human_9606/OmimVarLocusIdSNP.bcp.gz.
The downloaded list contains the dbSNP ids of the SNPs that are linked to the ALLELIC VARIANT entry
in the OMIM (Online Mendelian Inheritance in Man) database, which is described as either disease-producing mutations or
polymorphisms with a positive statistical correlation with particular common disorders based on the bio-medical literature information.
There are some examples of known disease-related SNPs with a high functional significance score.
For example, three SNPs on NAT2, namely, rs1799930, rs1801280,
and rs1208 are included in the list of 1,399 disease-causing or disease-associated SNPs.
The assessed FS scores for the three SNPs are 0.866, 0.584, and 0.858, respectively,
designating high functional significance of the SNPs.
In OMIM, all the three SNPs are described to be relevant to slow acetylation activity (Vatsis et al. 1991),
and rs1801280 has been reported to be strongly associated with susceptibility to bladder cancer and adverse drug reactions (Hein et al., 2000; Hein, 2002; Patin et al. 2006; OMIM ID: 243400).
Other examples of high scoring, known disease-causing or disease-associated SNPs are rs7775 and rs288326, which have been reported as a strong risk factor for primary osteoarthritis of the hip in females (Loughlin et al. 2004; OMIM ID: 605083).