“genome chunk count: 0” in human genome #19

YKasama · 2023-11-16T06:40:09Z

Hello,

I'm trying to use analysis using tebreak, but it doesn't work.
The sample data has been processed successfully.

The command executed is as follows.

tebreak -b test.bam
-r /data2/pub_data/hg19/chromosomes/hg19.fa
-p 30
-d /home/gene/work/hg19.te.Alu_disctgt.txt
-m /home/gene/TE/TEBreak/tebreak/lib/hg19.chr.centromere_telomere.bed
--max_ins_reads 500
-i /home/gene/work/test.Alu_RepBase2.fa

The execution log is as follows.

loading bwa index /data2/pub_data/mm10/mm10_all.fa into shared memory ...
loaded.
discordant targets in: /home/gene/work/hg19.te.Alu_disctgt.txt
genome chunk count: 0 <--　
：

About the specified file:
BED file of masked regions : hg19.chr.centromere_telomere.bed
We used files with "chr" at the beginning of the chromosome name.
hg19.centromere_telomere.bed file had a mix of lines with "chr"
in some entries.

discordant mate-linked targets：hg19.te.Alu_disctgt.txt
I want to target "Alu" comprehensively, so I created it
with the following command.

$ cat /tmp/rmsk//.out |
grep 'Alu' | awk '{print $5"\t"$6"\t"$7"\t"$11"\t"$10"\t"$9}' |
sed -e 's/C$/-/' > hg19.te.Alu_disctgt.txt

The contents of this file are as follows.
chr10 61181 61345 SINE/Alu FRAM -
chr10 67261 67438 SINE/Alu AluSp +
chr10 71658 71935 SINE/Alu AluJr +
chr10 72171 72462 SINE/Alu AluSg4 -
chr10 77316 77627 SINE/Alu AluSp +
chr10 97072 97325 SINE/Alu AluSx1 -
chr10 99058 99367 SINE/Alu AluSz -
　　：

I have a question.
(1)The centromere_telomere.bed and disctgt.txt files are
If both files are human, does it matter whether chr is added or not?
The result is the same whether or not the entry has chr.

(2)Why is "genome chunk count: 0" and processing is not possible?
Please advise where the problem is.

Regards,

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

“genome chunk count: 0” in human genome #19

“genome chunk count: 0” in human genome #19

YKasama commented Nov 16, 2023

“genome chunk count: 0” in human genome #19

“genome chunk count: 0” in human genome #19

Comments

YKasama commented Nov 16, 2023

The command executed is as follows.

tebreak -b test.bam -r /data2/pub_data/hg19/chromosomes/hg19.fa -p 30 -d /home/gene/work/hg19.te.Alu_disctgt.txt -m /home/gene/TE/TEBreak/tebreak/lib/hg19.chr.centromere_telomere.bed --max_ins_reads 500 -i /home/gene/work/test.Alu_RepBase2.fa

The execution log is as follows.

loading bwa index /data2/pub_data/mm10/mm10_all.fa into shared memory ... loaded. discordant targets in: /home/gene/work/hg19.te.Alu_disctgt.txt genome chunk count: 0 <-- ：

tebreak -b test.bam
-r /data2/pub_data/hg19/chromosomes/hg19.fa
-p 30
-d /home/gene/work/hg19.te.Alu_disctgt.txt
-m /home/gene/TE/TEBreak/tebreak/lib/hg19.chr.centromere_telomere.bed
--max_ins_reads 500
-i /home/gene/work/test.Alu_RepBase2.fa

loading bwa index /data2/pub_data/mm10/mm10_all.fa into shared memory ...
loaded.
discordant targets in: /home/gene/work/hg19.te.Alu_disctgt.txt
genome chunk count: 0 <--　
：