Node of clusters and ranking database scheduler #365
Unanswered
Su870
asked this question in
* General SCENIC questions
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I am at this step in the pyscenic process on Jupyter notebook, but I am not sure what 'address' to input in client_or_address=SCHEDULER.
I have tried SCHEDULER="123.122.8.24:8786" but it does not connect. The IP address (if that is what it is) needs to have access to a shared network drive on which the ranking databases are stored but I am not sure where that is.
I have also tried my own IP address and get this error: "my IP address numbers" is not valid for parameter client_or_address.
What have others tried and worked?
Thanks!
Phase II: Prune modules for targets with cis regulatory footprints (aka RcisTarget)
--Calculate a list of enriched motifs and the corresponding target genes for all modules.
with ProgressBar():
df = prune2df(dbs, modules, MOTIF_ANNOTATIONS_FNAME)
--Create regulons from this table of enriched motifs.
regulons = df2regulons(df)
--Save the enriched motifs and the discovered regulons to disk.
df.to_csv(MOTIFS_FNAME)
with open(REGULONS_FNAME, "wb") as f:
pickle.dump(regulons, f)
Clusters can be leveraged in the following way:
--The clusters can be leveraged via the dask framework:
df = prune2df(dbs, modules, MOTIF_ANNOTATIONS_FNAME, client_or_address=SCHEDULER)
Caution!
The nodes of the clusters need to have access to a shared network drive on which the ranking databases are stored.
Reloading the enriched motifs and regulons from file should be done as follows:
df = load_motifs(MOTIFS_FNAME)
with open(REGULONS_FNAME, "rb") as f:
regulons = pickle.load(f)
Beta Was this translation helpful? Give feedback.
All reactions