Image ID: c0e3c9e5e0e8. Ruiyuan's (睿远) container ID on server 92: 0b5438424915
Run command:
python /home/pipeline.py -q query.fa -t target/ -o /gengxin/NC/output_dir -s /home/scripts/Align2CDS.py -p 8 -n 8 -f fail.txt
# -p and -n set the number of threads; -q is the input CDS sequence file.
# The output directory will contain the result files negative_results.txt and positive_results.txt.
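After the run, a quick check along the following lines can confirm that both result files were written. This is only a sketch: the function name check_outputs is made up for illustration and is not part of pipeline.py; the two file names are the ones listed above.

```shell
# Hypothetical helper (not part of pipeline.py): confirm that both result
# files exist in the output directory and report their line counts.
check_outputs() {
    outdir="$1"
    status=0
    for f in positive_results.txt negative_results.txt; do
        if [ -f "$outdir/$f" ]; then
            echo "$f: $(wc -l < "$outdir/$f") lines"
        else
            echo "$f: missing" >&2
            status=1
        fi
    done
    return "$status"
}
```

Usage: check_outputs /gengxin/NC/output_dir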
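Note: the analysis converts uppercase letters in gene IDs to lowercase. For example, the original geneID TG5G00748 becomes tg5g00748 after analysis.
Run the following script to restore the original case: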
grep "positive" positive_results.txt > positive.gene.txt
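awk '{
    split($1, path, "/");          # field 1 is a path; split it on "/"
    gene = path[6];                # the 6th component holds the gene ID; adjust the index if the path depth differs
    gsub("tg", "TG", gene);
    gsub("g", "G", gene);
    print gene "\t" $2 "\t" $3     # concatenate so that no stray spaces surround the tabs
}' positive.gene.txt > positive.gene.final.txt # convert the lowercase letters in the gene ID back to uppercase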
grep "negative" negative_results.txt > negative.gene.txt
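awk '{
    split($1, path, "/");          # field 1 is a path; split it on "/"
    gene = path[6];                # the 6th component holds the gene ID; adjust the index if the path depth differs
    gsub("tg", "TG", gene);
    gsub("g", "G", gene);
    print gene "\t" $2 "\t" $3     # concatenate so that no stray spaces surround the tabs
}' negative.gene.txt > negative.gene.final.txt # same conversion for the negative results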
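Because the gsub calls only restore the letters t and g, it may be worth checking the restored IDs against the original headers in query.fa. The sketch below is one way to do that. It assumes the gene ID is the first whitespace-delimited token after ">" in each FASTA header, which may not match your file; the function name unmatched_ids is made up for illustration.

```shell
# Hypothetical helper: print restored gene IDs that do not appear in the
# original FASTA headers.
# Assumption: the gene ID is the first whitespace-delimited token after ">".
unmatched_ids() {
    fasta="$1"      # e.g. query.fa
    final="$2"      # e.g. positive.gene.final.txt
    awk '
        NR == FNR {                          # first file: collect header IDs
            if (substr($0, 1, 1) == ">") {
                id = substr($1, 2)           # drop the leading ">"
                known[id] = 1
            }
            next
        }
        !($1 in known) { print $1 }          # second file: IDs not seen in the FASTA
    ' "$fasta" "$final"
}
```

Usage: unmatched_ids query.fa positive.gene.final.txt
If nothing is printed, every restored ID matched a header in query.fa.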