|
Human human reference species mouse, chimp, arabidopsis…
|
tarix | 06.09.2018 | ölçüsü | 469 b. | | #78002 |
|
100 million
human human reference species mouse, chimp, arabidopsis… agricultural species cattle, sheep, pig, … rice, wheat, grape … bacterial disease, human “ecosystem”
Individuals Populations disease and “quantitative traits” Transcriptome of child and parents
Indexing Alignment SNP/MNP/Indel/SV calling
Human genome (3 billion nt) Human genome (3 billion nt) 1 billion reads of 100 nt coverage of 30 Indexing + Aligning in 27 minutes
Indexing
Varying levels of extraction of reads across genome (use differences) Locate boundaries (as accurately as possible) Extract number of variants Use SNPs
Mapping reads back onto a database of known bacteria/viruses Mapping reads back onto a database of known bacteria/viruses Many don’t map at all Remove human “contamination”
Map reads to database Estimate most likely frequencies a hill climbing estimation problem Can anything be done about unmapped reads?
Dostları ilə paylaş: |
|
|