gff格式是Sanger研究所定义,是一种简单的、方便的对于DNA、RNA以及蛋白质序列的特征进行描述的一种数据格式,比如序列的那里到那里是基因,已经成为序列注释的通用格式,比如基因组的基因预测,许多...
MEGA构建系统进化树的步骤
MEGA 是一个非常优秀的序列比对和系统进化树构建软件,使用方便,完全免费,可从其网站下载, 最新版本是4.0。下面是用MEGA构建系统进化树的简明操作流程(具体细节请参考相关MEGA教程及使用说明)...
De Bruijn Graphs for Alternative Splicing and Repetitive Regions
Today we shall examine de Bruijn graphs for two structures that occur frequently in genomes or trans...
推荐一篇关于真核基因组注释方法与流程的文章
测一个未知基因组(de nove sequence),要进行测序、拼接及注释。关于测序仪和拼接软件已经讲的很多了,很少有关于基因组注释的文章。一篇最近在Nature Review Genetics上的...
K-mer应用实例分析
We started with 600 million 100mer reads from a genomic library and plotted its K-mer distribution. ...
Maximizing Utility of Available RAMs in K-mer World
Bioinformaticians trying to assemble genomes or transcriptomes from large NGS libraries usually grap...
perl对中文的处理(encode,decode)
Perl从5.6开始已经开始在内部使用utf8编码来表示字符,也就是说对中文以及其他语言字符的处理应该是完全没有问题的。我们只需要利用好Encode这个模块便能充分发挥Perl的utf8字符的优势了。...
How do sequencing errors affect de Bruijn graphs?
Today’s commentary is the fourth in our de Bruijn graph series, but I did not like Roman characters ...
De Bruijn graphs(3)
In earlier commentaries, we introduced the concept of de Bruijn graphs and showed how they were used...
De Bruijn graphs(2)
In the previous post, we discussed how de Bruijn graphs can be constructed for a genome or a large s...