Skip to content

Chain after RepeatFiller with wrong cordinates #22

@ebioman

Description

@ebioman

Hi
I am trying RepeatFiller on multiple chain files for hg38->T2T and I am encountering some inconsistent errors.
The official chain file from T2T runs through without anything being reported - seems no repeats are anymore present.
Using self generated ones (from minimap2 and GSAlign) I encounter a weird situation that chain files are generated but then are faulty.

python3 /usr/local/GenomeAlignmentTools/src/RepeatFiller.py  \
  --chain Minimap2_liftover.chain --T2bit hg38_p8_primaryContigs.2bit \
  --Q2bit chm13v2.0.2bit -o Minimap2_liftover.repeatFiltered.chain 

If I try then any kind of command afterwards, e.g.

chainPreNet Minimap2_liftover.repeatFiltered.chain  hg38_p8_primaryContigs.sizes chm13v2.0.sizes stdout 

q end mismatch 242669717 vs 242693499 line 54824 of Minimap2_liftover.repeatFiltered.chain

It fails with that error, other tools such as chainSorter as well. If I use though instead my file Minimap2_liftover.chain then everything goes smoothly. I tried as well flipping target and query in the RepeatFiller command as I was not sure about the definition, and astonishing (and worringly) it actually went through ....
But I get then similarly an incompatible chain file at the end.

I used the latest release from your tools

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions