Monday, August 28, 2017 5:31:21 PM
Background: Reference quality genomes are expected to provide a resource for studying gene structure, function, and evolution. However, often genes of interest are not completely or accurately assembled, leading to unknown errors in analyses or additional cloning efforts for the correct sequences. A promising solution is long-read sequencing. Here we tested PacBio-based long-read sequencing and diploid assembly for potential improvements to the Sanger-based intermediate-read zebra finch reference and Illumina-based short-read Anna's hummingbird reference, two vocal learning avian species widely studied in neuroscience and genomics.
Results: With DNA of the same individuals used to generate the reference genomes, we generated diploid assemblies with the FALCON-Unzip assembler, resulting in contigs with no gaps in the megabase range, representing 150-fold and 200-fold improvements over the current zebra finch and hummingbird references, respectively. These long-read and phased assemblies corrected and resolved what we discovered to be numerous misassemblies in the references, including missing sequences in gaps, erroneous sequences flanking gaps, base call errors in difficult to sequence regions, complex repeat structure errors, and allelic differences between the two haplotypes. These improvements were validated by single long genome and transcriptome reads, and resulted for the first time in completely resolved protein-coding genes widely studied in neuroscience and specialized in vocal learning species.
Conclusions: These findings demonstrate the impact of long reads, sequencing of previously difficult-to-sequence regions, and phasing of haplotypes on generating high quality assemblies necessary for understanding gene structure, function, and evolution.
De novo genome assembly, long reads, SMRT Sequencing, brain, language
Issue Section: Research
© The Authors 2017. Published by Oxford University Press.
Recent PACB News
- PacBio Announces the HiFi Solves Sub-fertility Consortium in Asia Pacific • GlobeNewswire Inc. • 09/11/2024 01:05:00 PM
- Revio to Power Research in Male Infertility and Rare Disease at Münster University Hospital • GlobeNewswire Inc. • 09/04/2024 01:05:00 PM
- PacBio to Present at the Morgan Stanley 22nd Annual Global Healthcare Conference • GlobeNewswire Inc. • 08/26/2024 08:03:00 PM
- Form 4 - Statement of changes in beneficial ownership of securities • Edgar (US Regulatory) • 08/20/2024 09:27:01 PM
- Form 4 - Statement of changes in beneficial ownership of securities • Edgar (US Regulatory) • 08/20/2024 09:25:47 PM
- Form 144 - Report of proposed sale of securities • Edgar (US Regulatory) • 08/19/2024 08:24:44 PM
- Form 144 - Report of proposed sale of securities • Edgar (US Regulatory) • 08/16/2024 08:29:03 PM
- Form 10-Q - Quarterly report [Sections 13 or 15(d)] • Edgar (US Regulatory) • 08/09/2024 12:46:25 AM
- Form 8-K/A - Current report: [Amend] • Edgar (US Regulatory) • 08/07/2024 08:09:35 PM
- Form 8-K - Current report • Edgar (US Regulatory) • 08/07/2024 08:08:10 PM
- PacBio Announces Second Quarter 2024 Financial Results • GlobeNewswire Inc. • 08/07/2024 08:05:00 PM
- Singapore’s National Precision Medicine (NPM) programme will partner with PacBio to propel HiFi Sequencing in Southeast Asia • GlobeNewswire Inc. • 08/05/2024 08:05:00 PM
- Novogene to Power Cancer, Rare Disease, and Environmental Research with PacBio Sequencing • GlobeNewswire Inc. • 07/23/2024 01:05:00 PM
- PacBio to Report Second Quarter 2024 Financial Results on August 7, 2024 • GlobeNewswire Inc. • 07/15/2024 08:03:00 PM
- PacBio and International Research Consortium CoLoRS Announce Release of First-Ever HiFi Long-Read Variant Database • PR Newswire (US) • 06/10/2024 01:05:00 PM
- Form DEFA14A - Additional definitive proxy soliciting materials and Rule 14(a)(12) material • Edgar (US Regulatory) • 05/30/2024 01:52:04 AM
- Form 4 - Statement of changes in beneficial ownership of securities • Edgar (US Regulatory) • 05/22/2024 09:25:43 PM
- Ambry Genetics and PacBio Announce Collaboration to Sequence Up to 7,000 Human Genomes Aimed at Providing Answers for Families Battling Rare Diseases • PR Newswire (US) • 05/15/2024 01:45:00 PM
- Form S-3ASR - Automatic shelf registration statement of securities of well-known seasoned issuers • Edgar (US Regulatory) • 05/09/2024 08:33:12 PM
- Form 10-Q - Quarterly report [Sections 13 or 15(d)] • Edgar (US Regulatory) • 05/09/2024 08:21:46 PM
- Form 8-K - Current report • Edgar (US Regulatory) • 05/09/2024 08:12:15 PM
- PacBio Announces First Quarter 2024 Financial Results • PR Newswire (US) • 05/09/2024 08:05:00 PM
- PacBio Announces Preliminary First Quarter 2024 Revenue and Updates 2024 Revenue Guidance • PR Newswire (US) • 04/16/2024 12:05:00 PM
- Estonia National Biobank Selects PacBio to Sequence 10,000 Whole Genomes • PR Newswire (US) • 03/27/2024 12:00:00 PM
- PacBio Grants Equity Incentive Award to New Employee • PR Newswire (US) • 03/22/2024 08:30:00 PM
VHAI - Vocodia Partners with Leading Political Super PACs to Revolutionize Fundraising Efforts • VHAI • Sep 19, 2024 11:48 AM
Dear Cashmere Group Holding Co. AKA Swifty Global Signs Binding Letter of Intent to be Acquired by Signing Day Sports • DRCR • Sep 19, 2024 10:26 AM
HealthLynked Launches Virtual Urgent Care Through Partnership with Lyric Health. • HLYK • Sep 19, 2024 8:00 AM
Element79 Gold Corp. Appoints Kevin Arias as Advisor to the Board of Directors, Strengthening Strategic Leadership • ELMGF • Sep 18, 2024 10:29 AM
Mawson Finland Limited Further Expands the Known Mineralized Zones at Rajapalot: Palokas step-out drills 7 metres @ 9.1 g/t gold & 706 ppm cobalt • MFL • Sep 17, 2024 9:02 AM
PickleJar Announces Integration With OptCulture to Deliver Holistic Fan Experiences at Venue Point of Sale • PKLE • Sep 17, 2024 8:00 AM