InvestorsHub Logo
Followers 3
Posts 607
Boards Moderated 0
Alias Born 06/07/2010

Re: None

Tuesday, 06/24/2014 12:46:46 AM

Tuesday, June 24, 2014 12:46:46 AM

Post# of 1060
Homolog.us – Bioinformatics
Frontier in Bioinformatics
June 23rd, 2014 ///-- Another PacBio Development – Adam Phillippy’s New MHAP Module
Share..... 1 0 0 0 0
Homolog.us blog is written by professional janitors dedicated to clean up US science. During lunch breaks and other time off from the job, we discuss bioinformatics. The name 'homolog.us' is not a spelling mistake, but is derived by taking Arabic translation of the 'O' in the original word.
Please follow us on twitter – @homolog_us.

http://wgs-assembler.sourceforge.net/wiki/index.php?title=PBcR -------------------------------------


Alignment speed is the biggest bottleneck in PacBio assembly. Therefore, those working on PacBio reads will find the following release helpful.

June 14, 2014 – PBcR and CA 8.2 alpha as source or pre-compiled for Linux is now available. PBcR now incorporates a novel probabilistic overlapper for self-correction of sequences named MHAP. This allows assembly of prokaryotic genomes in < 30 minutes on a typical desktop and assembly of small eukaryotic genomes in < 2 days. If you use MHAP, please cite the Biology of Genomes poster (Berlin K., Koren, S. et. al. Reducing assembly complexity of genomes with single-molecule sequencing. Biology of Genomes, 2014). For best results, java 1.7r51 or newer is recommended to use MHAP.
How good is it? Here is the plain language description -



———————————————-

Given that alignment is a big bottleneck, we previously checked whether BWA-mem could improve the execution time. Heng Li came up with a set of optimal parameters to get the best alignment. Readers may find the following comment from Irek in that thread helpful -

Hey, I just finished comparison for new PacBio RSII data (CCS and CLR), used: bwa-sw,mem,blasr,ssaha2,smalt,last and agile.
Checked speed, memory, mapping status of reads, then went for precision-recall assessment, and finished with the analysis of error model recognition.

Actually new version of SMALT looks like a winner and it’s really fast and memory efficient.

As for mem – blasr. For CCS they are comparable in terms of precision-recall, but for CLR, mem definitely looses.



-------------------------------------------------------------------------------------------------------------
Heroes and Heroines of New Media--2014
I am strongly influenced by Charles Hugh Smith, who runs his insightful social blog of Two Minds. I hope he will not mind, if I copy his style of acknowledgement to the supporters of our blog.

Our blog is deeply honored by the generous contribution of the following readers. Without their patronage, this site would go away.

Outstandingly Generous:
Amemiya C. Schnable J. Bowman B.

We are also looking for subscribers to get help to finish the tutorials. Please see this post for details.

June 23rd, 2014 | Category: pacbio-- http://www.homolog.us/blogs/blog/2014/06/23/another-pacbio-development-adam-phillippys-new-pacbio-module/
Volume:
Day Range:
Bid:
Ask:
Last Trade Time:
Total Trades:
  • 1D
  • 1M
  • 3M
  • 6M
  • 1Y
  • 5Y
Recent PACB News