By wuzhenzhen, 31 October, 2025

数据比对前先对参考基因组序列建立index。

(1)利用bwa-mem2建立索引文件:

# bwa-mem2 index genome.fasta
# (1)解压基因组文件
(base) root@961a4377e759:/home/Rmolle_calllsnp_work/Rmolle_genome_GCA025413875# gunzip -k Rmolle_genomic_GCA_025413875.1.fna.gz
# (2)将fna后缀改为fasta后缀,否则程序没办法运行
(base) root@961a4377e759:/home/Rmolle_calllsnp_work/Rmolle_genome_GCA025413875# cp Rmolle_genomic_GCA_025413875.1.fna  Rmolle_genomic_GCA_025413875.1.fasta
# (3)利用bwa-mem2建立索引
(base) root@961a4377e759:/home/Rmolle_calllsnp_work/Rmolle_results/bam_output_file# /home/software/bwa-mem2/bwa-mem2 index /home/Rmolle_calllsnp_work/Rmolle_genome_GCA025413875/Rmolle_genomic_GCA_025413875.1.fasta 

(2)利用gatk建立索引:

 # gatk CreateSequenceDictionary -R genome.fasta -O genome.dict
(base)root@961a4377e759:/home/Rmolle_calllsnp_work/Rmolle_results/bam_sort_bygatk# /home/software/gatk-4.1.8.1/gatk CreateSequenceDictionary -R /home/Rmolle_calllsnp_work/Rmolle_genome_GCA025413875/Rmolle_genomic_GCA_025413875.1.fna.gz -O Rmolle_genomic_GCA_025413875.1.dict #fig3

(3)利用samtools建立索引:

# samtools faidx genome.fasta
(base) root@961a4377e759:/home/Rmolle_calllsnp_work/samtools_results# samtools faidx ../Rmolle_genome_GCA025413875/Rmolle_genomic_GCA_025413875.1.fasta