-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Suspicious "Ran out of path" Error #2
Comments
Some other corg questions:
|
OK, I have looked into this. Here's what's going on. snp1000g has no paths in common with any of the other graphs (not even a path called "ref"). It appears to contain a single path named "17". Thus, it can merge with any graph without error, since no path merging actually happens; the graphs just get placed side by side in one file. To fix this, I've added a warning that will print if there aren't any common paths to merge on. To debug the rest of it, I've added a pre-check of the path lengths to be merged, to make sure they are actually the same length before doing any merging. It will be slower, but probably safer, as it's now impossible to merge two graphs that disagree about a path's length and not have an error. With this change, I can successfully merge: Cactus and Camel The length of GI262359905 doesn't match for: Cactus and Trivial This is weird, but it seems sane and consistent. Cactus and Camel agree with each other about the path length, and both disagree with trivial. It seems like Cactus and Camel put the path at 81,191 bp and Trivial puts it at 81,189 bp, or 2 bases shorter. Going back in with vg, it seems like this is actually the case:
Looking at the original FASTA, it seems as though the shorter length, which trivial is using, is correct:
There's an extra length-2 node tacked on (I think with a reversing join) in both the Cactus and Camel graphs. Interestingly, it looks like the lengths are correct in the original HAL files:
It does show up, however, after sg2vg:
It could be that vg is messing this up, even with the simple commands I'm using here, but I don't think it's likely. As for your other questions:
|
Hey Adam, Thanks for this! I'm going to look into sg2vg right now to see what's up -Glenn On Fri, Dec 4, 2015 at 5:29 PM, adamnovak [email protected] wrote:
|
I have 4 brca1 graphs, (extracted from the high-coverage-reads/indexes):
/cluster/home/hickey/ga4gh/hgvm-graph-bakeoff-evalutations/high_coverage_graphs/
cactus-brca1.vg
refonly-brca1.vg
snp1000g-brca1.vg
trivial-brca1.vg
I can successfully run corg on all pairs of these graphs except cactus and trivial, which gives me
Processing path: GI262359905
We ran out of path in one graph and not in the other!
We have a mapping
Our mapping: {"position": {"node_id": 856, "offset": 95}, "is_reverse": true, "rank": 842}
terminate called after throwing an instance of 'std::runtime_error'
what(): Ran out of mappings on one path before the other!
Aborted (core dumped)
If these two graphs have paths of different length, which seems to be what this error is saying, then I would not expect them both to compare successfully to refonly and snp1000g.
The text was updated successfully, but these errors were encountered: