Assembling Genomes and Finding Disease-Causing Mutations

Description

Carsonella ruddii is a bacterium that lives symbiotically inside some insects. Its sheltered life has allowed it to reduce its genome to only about 160,000 base pairs. With only about 200 genes, it lacks some genes necessary for survival, but these genes are supplied by its insect host. In fact, Carsonella has such a small genome that biologists have conjectured that it is losing its “bacterial” identity and turning into an organelle, which is part of the host’s genome. This transition from bacterium to organelle has happened many times during evolutionary history; in fact, the mitochondrion responsible for energy production in human cells was once a free-roaming bacterium that we assimilated in the distant past. Given a collection of simulated error-free read-pairs, use the paired de Bruijn graph to reconstruct the Carsonella ruddii genome. Compare this assembly to the assembly obtained from the classic de Bruijn graph (i.e., when all we know is the reads themselves and do not know the distance between paired reads) in order to better appreciate the benefits of read-pairs. For each k, what is the minimum value of d needed to enable reconstruction of the entire Carsonella ruddii genome from its (k, d)-mer composition?



Have you tried this resource? Help someone out by sharing your thoughts!

Write a review

More Ways to Learn Genomic Data Science

Introduction to Genomic Technologies
Johns Hopkins University
Introduction to Genomic Technologies
College | Free
Genomic Data Science Capstone
Johns Hopkins University
Genomic Data Science Capstone
College | Free
Bioconductor for Genomic Data Science
Johns Hopkins University
Bioconductor for Genomic Data Science
College | Free
Statistics for Genomic Data Science
Johns Hopkins University
Statistics for Genomic Data Science
College | Free