Wednesday, April 13, 2016 1:39:35 PM
When Mike Schatz realized a few years ago that his PacBio® System had reached the throughput needed to process human genomes, he decided to give it a real challenge: the incredibly complicated, massively rearranged SK-BR-3 breast cancer cell line. The genome consists of 80 chromosomes, and that’s just the tip of the complexity iceberg.
“We were really interested in sequencing a human genome that would be maximally impactful and that was aligned with our research interest in cancer genomes, where it’s been well documented that structural variations play a major role,” says Schatz, now an associate research professor of computer science at Johns Hopkins University and an adjunct associate professor of quantitative biology at Cold Spring Harbor Laboratory, where the analysis took place. He notes that despite its importance, structural variation has not been thoroughly studied because short-read sequencers cannot reliably identify these large genomic elements. “One of the really special properties about the PacBio Sequencer is, in addition to being able to call SNPs or small variants, we also get to look for large variants such as structural variation,” he says.
But as Schatz and his collaborators at Cold Spring Harbor Laboratory and the Ontario Institute for Cancer Research delved into this work, they realized that existing variant callers were tailored to short-read data. To make the most of the large amount of long-read information they were generating, the team wrote a suite of new analysis tools optimized for SMRT Sequencing data. “The tools catering to short-read data just aren’t made to capture the awesome information that we can now take advantage of,” says Maria Nattestad, a graduate student in Schatz’s lab who wrote several of the new algorithms. “Building our own tools was really the only way to go here.”
Those tools, which are especially important for understanding structural variation, are now being publicly released to fuel further SMRT Sequencing studies of human genomes. Also coming out soon is the team’s detailed analysis of the SK-BR-3 genome and transcriptome, which includes a high-quality assembly as well as a new understanding of gene fusions, the evolutionary history of this cell line, and more.
De novo sequencing and assembly were the first steps in making sense of the SK-BR-3 genome. With 72-fold SMRT Sequencing coverage, “we got an outstanding assembly of this genome even though it’s so complicated,” Schatz says, citing a contig N50 size of 2.5 Mb compared to a state-of-the-art short-read assembly with a contig N50 of just 3 kb. “That’s nearly a thousand-fold more contiguous going from short-read to long-read assemblies, and it’s through that improved assembly that the majority of structural variants were detected.”
Using custom-built analysis tools, including variant callers Sniffles, by Schatz lab member Fritz Sedlazeck, and Assemblytics, by Nattestad, the scientists found more than 10,000 structural variants in the SK-BR-3 genome ranging in size from 50 bases to millions of base pairs long. Another major discovery involved meticulously characterizing the complicated process that led to the cell line’s Her2 oncogene amplification.
The team also used the Iso-Seq™ method to analyze the full transcriptome of SK-BR-3, finding as much complexity at the RNA level as they saw in the DNA. “In the Iso-Seq analysis, we see many tens of thousands of novel isoforms,” Schatz says. “That’s a really strong testament to the long reads, which fully capture an isoform in one sequence — unlike short reads, where you have to infer isoform structure.”
To learn more about the project, which included novel findings about gene fusions in cancer, check out the full case study. http://www.pacb.com/blog/genome-and-transcriptome-analysis-help-scientists-deconstruct-cancer-complexity/
Recent PACB News
- PacBio Announces the HiFi Solves Sub-fertility Consortium in Asia Pacific • GlobeNewswire Inc. • 09/11/2024 01:05:00 PM
- Revio to Power Research in Male Infertility and Rare Disease at Münster University Hospital • GlobeNewswire Inc. • 09/04/2024 01:05:00 PM
- PacBio to Present at the Morgan Stanley 22nd Annual Global Healthcare Conference • GlobeNewswire Inc. • 08/26/2024 08:03:00 PM
- Form 4 - Statement of changes in beneficial ownership of securities • Edgar (US Regulatory) • 08/20/2024 09:27:01 PM
- Form 4 - Statement of changes in beneficial ownership of securities • Edgar (US Regulatory) • 08/20/2024 09:25:47 PM
- Form 144 - Report of proposed sale of securities • Edgar (US Regulatory) • 08/19/2024 08:24:44 PM
- Form 144 - Report of proposed sale of securities • Edgar (US Regulatory) • 08/16/2024 08:29:03 PM
- Form 10-Q - Quarterly report [Sections 13 or 15(d)] • Edgar (US Regulatory) • 08/09/2024 12:46:25 AM
- Form 8-K/A - Current report: [Amend] • Edgar (US Regulatory) • 08/07/2024 08:09:35 PM
- Form 8-K - Current report • Edgar (US Regulatory) • 08/07/2024 08:08:10 PM
- PacBio Announces Second Quarter 2024 Financial Results • GlobeNewswire Inc. • 08/07/2024 08:05:00 PM
- Singapore’s National Precision Medicine (NPM) programme will partner with PacBio to propel HiFi Sequencing in Southeast Asia • GlobeNewswire Inc. • 08/05/2024 08:05:00 PM
- Novogene to Power Cancer, Rare Disease, and Environmental Research with PacBio Sequencing • GlobeNewswire Inc. • 07/23/2024 01:05:00 PM
- PacBio to Report Second Quarter 2024 Financial Results on August 7, 2024 • GlobeNewswire Inc. • 07/15/2024 08:03:00 PM
- PacBio and International Research Consortium CoLoRS Announce Release of First-Ever HiFi Long-Read Variant Database • PR Newswire (US) • 06/10/2024 01:05:00 PM
- Form DEFA14A - Additional definitive proxy soliciting materials and Rule 14(a)(12) material • Edgar (US Regulatory) • 05/30/2024 01:52:04 AM
- Form 4 - Statement of changes in beneficial ownership of securities • Edgar (US Regulatory) • 05/22/2024 09:25:43 PM
- Ambry Genetics and PacBio Announce Collaboration to Sequence Up to 7,000 Human Genomes Aimed at Providing Answers for Families Battling Rare Diseases • PR Newswire (US) • 05/15/2024 01:45:00 PM
- Form S-3ASR - Automatic shelf registration statement of securities of well-known seasoned issuers • Edgar (US Regulatory) • 05/09/2024 08:33:12 PM
- Form 10-Q - Quarterly report [Sections 13 or 15(d)] • Edgar (US Regulatory) • 05/09/2024 08:21:46 PM
- Form 8-K - Current report • Edgar (US Regulatory) • 05/09/2024 08:12:15 PM
- PacBio Announces First Quarter 2024 Financial Results • PR Newswire (US) • 05/09/2024 08:05:00 PM
- PacBio Announces Preliminary First Quarter 2024 Revenue and Updates 2024 Revenue Guidance • PR Newswire (US) • 04/16/2024 12:05:00 PM
- Estonia National Biobank Selects PacBio to Sequence 10,000 Whole Genomes • PR Newswire (US) • 03/27/2024 12:00:00 PM
- PacBio Grants Equity Incentive Award to New Employee • PR Newswire (US) • 03/22/2024 08:30:00 PM
North Bay Resources Commences Operations at Bishop Gold Mill, Inyo County, California; Engages Sabean Group Management Consulting • NBRI • Sep 25, 2024 9:15 AM
CEO David B. Dorwart Anticipates a Bright Future at Good Gaming Inc. Through His Most Recent Shareholder Update • GMER • Sep 25, 2024 8:30 AM
Cannabix Technologies and Omega Laboratories Inc. Advance Marijuana Breathalyzer Technology - Dr. Bruce Goldberger to Present at Society of Forensic Toxicologists Conference • BLOZF • Sep 24, 2024 8:50 AM
Integrated Ventures, Inc Announces Strategic Partnership For GLP-1 (Semaglutide) Procurement Through MedWell USA, LLC. • INTV • Sep 24, 2024 8:45 AM
Avant Technologies Accelerates Creation of AI-Powered Platform to Revolutionize Patient Care • AVAI • Sep 24, 2024 8:00 AM
VHAI - Vocodia Partners with Leading Political Super PACs to Revolutionize Fundraising Efforts • VHAI • Sep 19, 2024 11:48 AM