Identification of candidate genomic regions by integrating cluster analysis and genome-wide association studies
Abstract
By identifying genomic variants responsible for life-threatening complex disorders, genome-wide association studies (GWAS) have gained great potential for improving precision medicine. However, because of the large number of association tests they employ, they are susceptible to error rate inflation and reduced statistical power. This study aims to address these problems by integrating cluster analysis and GWAS to identify candidate genomic regions of possible relevance to a phenotype. This integrative approach reduces the number of tests by focusing on significant genetic loci and the variants residing therein. Using Hamming distance as the similarity measure, we conducted cluster analysis on SNPs associated with HBsAg seroclearance and subjected all the resulting SNP-sets to a Hamming distance-based association test. Results showed that all SNP-sets are significantly associated with HBsAg seroclearance. Furthermore, the set with the highest degree of association contains SNPs belonging to the locus 11p, which has been previously linked with HBsAg positivity.
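As a rough illustration of the clustering step described above, the following minimal sketch groups SNPs into SNP-sets using Hamming distance. It is not the authors' implementation: the abstract does not state the clustering algorithm, the genotype encoding, or any threshold, so the 0/1/2 genotype coding, the single-linkage grouping rule, the `max_distance` parameter, the function names, and the toy data are all assumptions made here for illustration.

```python
from itertools import combinations


def hamming(a, b):
    """Count the positions at which two equal-length genotype vectors differ."""
    if len(a) != len(b):
        raise ValueError("genotype vectors must have equal length")
    return sum(x != y for x, y in zip(a, b))


def cluster_snps(genotypes, max_distance):
    """Group SNPs into sets by single linkage (an assumed rule, not from the paper).

    Two SNPs are linked when their Hamming distance is at most max_distance;
    each connected group of linked SNPs forms one SNP-set.
    """
    # Union-find structure: every SNP starts as its own group.
    parent = {name: name for name in genotypes}

    def find(name):
        # Follow parent links to the group representative, shortening the path.
        while parent[name] != name:
            parent[name] = parent[parent[name]]
            name = parent[name]
        return name

    # Merge the groups of every pair of SNPs that is close enough.
    for a, b in combinations(genotypes, 2):
        if hamming(genotypes[a], genotypes[b]) <= max_distance:
            parent[find(a)] = find(b)

    # Collect SNP names by group representative.
    clusters = {}
    for name in genotypes:
        clusters.setdefault(find(name), []).append(name)
    return sorted(sorted(members) for members in clusters.values())


# Hypothetical toy data: each SNP is a vector of genotypes (0, 1, or 2 copies
# of the minor allele) across the same six individuals.
toy = {
    "snp_a": [0, 1, 2, 0, 1, 1],
    "snp_b": [0, 1, 2, 0, 1, 2],
    "snp_c": [2, 0, 0, 2, 0, 0],
    "snp_d": [2, 0, 0, 2, 1, 0],
}

snp_sets = cluster_snps(toy, max_distance=1)
print(snp_sets)
```

With this toy input, SNPs that differ in at most one individual end up in the same set. Each resulting SNP-set would then be tested once for association with the phenotype, which is how grouping can reduce the number of tests relative to testing every SNP separately.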
License
By submitting their manuscript to the Samahang Pisika ng Pilipinas (SPP) for consideration, the Authors warrant that their work is original, does not infringe on existing copyrights, and is not under active consideration for publication elsewhere.
Upon acceptance of their manuscript, the Authors further agree to grant SPP the non-exclusive, worldwide, and royalty-free rights to record, edit, copy, reproduce, publish, distribute, and use all or part of the manuscript for any purpose, in any media now existing or developed in the future, either individually or as part of a collection.
All other associated economic and moral rights as granted by the Intellectual Property Code of the Philippines are maintained by the Authors.








