Optimal genotype determination in highly multiplexed SNP data

M Moorhead, P Hardenbol, F Siddiqui… - European journal of …, 2006 - nature.com
M Moorhead, P Hardenbol, F Siddiqui, M Falkowski, C Bruckner, J Ireland, HB Jones, M Jain…
European journal of human genetics, 2006nature.com
High-throughput genotyping technologies that enable large association studies are already
available. Tools for genotype determination starting from raw signal intensities need to be
automated, robust, and flexible to provide optimal genotype determination given the specific
requirements of a study. The key metrics describing the performance of a custom genotyping
study are assay conversion, call rate, and genotype accuracy. These three metrics can be
traded off against each other. Using the highly multiplexed Molecular Inversion Probe …
Abstract
High-throughput genotyping technologies that enable large association studies are already available. Tools for genotype determination starting from raw signal intensities need to be automated, robust, and flexible to provide optimal genotype determination given the specific requirements of a study. The key metrics describing the performance of a custom genotyping study are assay conversion, call rate, and genotype accuracy. These three metrics can be traded off against each other. Using the highly multiplexed Molecular Inversion Probe technology as an example, we describe a methodology for identifying the optimal trade-off. The methodology comprises: a robust clustering algorithm and assessment of a large number of data filter sets. The clustering algorithm allows for automatic genotype determination. Many different sets of filters are then applied to the clustered data, and performance metrics resulting from each filter set are calculated. These performance metrics relate to the power of a study and provide a framework to choose the most suitable filter set to the particular study.
nature.com