InvestorsHub Logo
Followers 3
Posts 607
Boards Moderated 0
Alias Born 06/07/2010

Re: Paulieme post# 131

Thursday, 07/04/2013 4:57:19 PM

Thursday, July 04, 2013 4:57:19 PM

Post# of 1062
Open this link and click on Figure 1 & 2. http://genomebiology.com/2013/14/6/405 (Figure 1. Idealized assembly graphs [18] of the 5.2 megabase-pair B.anthracis Ames Ancestor main chromosome using (a) 100 bp,
(b) 1,000 bp and (c) 5,000 bp reads. The graphs encode the compressed de Bruijn graph derived from infi nite coverage error-free reads,
eff ectively representing the repeats in the genome and the upper bound of what could be achieved in a real assembly. Increasing the read length
decreases the number of contigs because the longer reads will span more of the repeats. Note the assembly with 5,000 bp reads has a self-edge
because the chromosome is circular). (figure 2. A sequencing context breakdown of the empirical insertion error rate of the two platforms on NA12878 whole genome data. In this figure we show all contexts of size 8 that start with AAAAA. The empirical insertion quality score (y-axis) is PHRED scaled. Despite the higher error rate (approximately Q12) of the PacBio RS instrument, the error is independent of the sequencing context. Other platforms are known to have different error rates for different sequencing contexts. Illumina's HiSeq platform, shown here, has a lower error rate (approximately Q45 across eight independent runs), but contexts such as AAAAAAAA and AAAAACAG have extremely different error rates (Q30 versus Q55). This context-specific error rate creates bias that is not easily clarified by greater sequencing depth. Empirical insertion error rates were measured using the Genome Analysis Toolkit (GATK) - Base Quality Score Recalibration tool.
Volume:
Day Range:
Bid:
Ask:
Last Trade Time:
Total Trades:
  • 1D
  • 1M
  • 3M
  • 6M
  • 1Y
  • 5Y
Recent PACB News