Third-generation sequencing techniques are used for high-quality sequencing. These are better in many aspects than the first-generation and second-generation sequencing techniques. However, third-generation sequencing and mapping technology face high error rates. They are being worked on and expected to improve over time. Third-generation sequencing proves useful for applications with tolerance to error rates.
Third-generation sequencing is also known as long-read sequencing. We define it as a generation of DNA sequencing methods that are currently under active development.
There are many algorithms available for third-generation sequencing. A few of these are mentioned ahead.
Genome assembly is defined as the reconstruction of the whole genome DNA sequences. Vast quantities of DNA fragments are aligned in this type of assembly.
When a reference genome is given, the newly sequenced reads can be aligned to the reference genome to characterize their characteristics. This type of assembly is easy to implement and quick. The drawback of this method is that it hides the novel sequences and significant
De novo assembly is another reference alignment technology. This method reconstructs the whole genome sequence from raw sequence reads. This method is chosen when no reference genome is available, when the species of the given organism is unknown or when specific genetic variants of interest can not be detected by reference genome alignment.
De novo assembly is a computational problem due to the short reads produced due to the present generation of sequencing. This is resolved by the iterative process of finding and connecting sequences with suitable overlaps. Pair end reads are a possible solution to this problem, but it also has drawbacks.
Longer reads offered by the third-generation technologies may ease the problems faced by de novo assembly. For instance, if an entire repetitive region is sequenced ambiguously as a single read, no computational inference would be required.
Third-generation sequencing and second-generation sequencing together are used to lessen the error rates. This works as long reads from third-generation sequencing may be used to resolve the ambiguities present in the second-generation assembled sequences. While the short second-generation reads can be used to resolve the errors present in the long third-generation sequence reads.
This sequencing, along with long-range mapping data, improves the assemblies and is cost-effective. The drawback of this method is that scaffolding chromosomes have less information than fully sequenced chromosomes. This can cause overlooking necessary sequences while reading or the obscured gaps that occur in between the gaps.
Finding structural variations is difficult as short reads tend to fail to map to the breakpoints of structural variants. Third-generation mapping and sequencing allow improved split-read analysis. Repetitive elements flank variations, and longer reads allow a confident detection and alignment of the sequences.
This method is used for phasing
This analysis is complicated due to sequencing errors and the uneven coverage that gives additional false variants to be introduced and missing the true heterozygous variants in the sequence.
Optimization frameworks are used to enhance the robustness and to decrease the disagreements between the assignment and the reading process.
Free Resources