TD2: finding protein coding regions in transcripts
Abstract
The transcriptome encompasses all RNA transcripts in eukaryotic cells, orchestrating gene expression and regulating cellular function, development, and adaptation. Identifying open reading frames (ORFs) in transcripts is a critical step in transcriptome analysis. We introduce TD2, a new tool forab initioannotation of protein-coding ORFs in transcripts. We find TD2 to be sensitive and precise when compared to other state-of-the-art tools in reference transcripts and transcriptome assemblies from a diverse array of eukaryotes.
TD2 is available at<ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="https://github.com/Markusjsommer/TD2">https://github.com/Markusjsommer/TD2</ext-link>. The project is open-source, developed in Python with PyTorch, and is freely available to all academic, government, and commercial users under the MIT license.
Related articles
Related articles are currently not available for this article.