failed to load tblout table #8

Open
Valentin-Bio opened this issue Dec 28, 2023 · 3 comments

Comments

@Valentin-Bio

Hello, I have previously used rhmmer to load hmmscan results with the read_tblout() function. Now I want to load an hmmsearch table with nearly 26 thousand hits, but I'm getting the following error:

Error in `$<-`:
! Assigned data `descriptions[!grepl(" *#", descriptions, perl = TRUE)]` must be compatible
  with existing data.
✖ Existing data has 25673 rows.
✖ Assigned data has 0 rows.
ℹ Only vectors of size 1 are recycled.
Caused by error in `vectbl_recycle_rhs_rows()`:
! Can't recycle input of size 0 to size 25673.

Could this be due to the large number of hits in the table I'm trying to load?

Best,

Valentín.

@arendsee
Owner

Thanks for the report, Valentin! rhmmer doesn't support the output of hmmsearch. It should, though. If your data isn't private, could you send me the first 20 lines or so? Then I can use that as a test case and write a new parser function.

Alternatively, if you like writing R code yourself, I would be happy to accept a pull request.

@glitchheadgit

Same problem here. I can't parse either the tblout or the domtblout file. Files attached below. Hope this helps!
phytases_tblout.txt
phytase_domtblout.txt

arendsee added a commit that referenced this issue Oct 3, 2024
Previously the description column filtered out comments (everything
following a '#' sign). This caused all entries in the search domtblout
description column to be empty and caused the error seen in #8.

The solution is not entirely satisfactory. There may be cases where we
actually do want to ignore comments. Also, the search domtblout
description comment contains information on the hit that should possibly
be extracted and stored in dedicated columns.
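
The failure mode described in the commit message can be reproduced in a few lines of base R. This is only a sketch, not rhmmer's actual parser code: the description strings and the `hits` table are hypothetical, and a plain `data.frame` stands in for the tibble named in the original error. The filter expression is the one quoted in the error message.

```r
# Hypothetical description values in which every entry contains a '#',
# as the commit message says happens for hmmsearch output.
descriptions <- c(
  "# 3 # 1205 # 1 # ID=1_1;partial=00",
  "# 1300 # 2100 # -1 # ID=1_2;partial=00",
  "# 2200 # 3100 # 1 # ID=1_3;partial=00"
)

# One row per hit, standing in for the table the parser has built so far.
hits <- data.frame(target = c("seq1", "seq2", "seq3"))

# The filter quoted in the error message drops whole entries that contain
# a '#', so here nothing survives and the result has length 0.
kept <- descriptions[!grepl(" *#", descriptions, perl = TRUE)]

# Assigning a zero-length vector as a column of a 3-row table fails,
# because only length-1 (or full-length) vectors can be assigned.
result <- tryCatch(
  {
    hits$description <- kept
    "assigned"
  },
  error = function(e) conditionMessage(e)
)
print(result)

# A length-preserving alternative (an assumption, not what the commit does):
# strip the comment inside each entry instead of dropping the entry.
stripped <- sub(" *#.*$", "", descriptions, perl = TRUE)
length(stripped) == nrow(hits)
```

This suggests the error depends on every description containing a '#', not on the number of hits in the table.
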
@arendsee
Owner

arendsee commented Oct 3, 2024

@glitchheadgit I have added a partial fix; see the commit message above. Feel free to try it out and let me know if it solves the problem. If so, I will work on resubmitting to CRAN.
