Node of clusters and ranking database scheduler #365

Su870 · 2022-12-02T01:04:40Z

Su870
Dec 2, 2022

I am at this step in the pyscenic process on Jupyter notebook, but I am not sure what 'address' to input in client_or_address=SCHEDULER.

I have tried SCHEDULER="123.122.8.24:8786" but it does not connect. The IP address (if that is what it is) needs to have access to a shared network drive on which the ranking databases are stored but I am not sure where that is.

I have also tried my own IP address and get this error: "my IP address numbers" is not valid for parameter client_or_address.

What have others tried and worked?

Thanks!

Phase II: Prune modules for targets with cis regulatory footprints (aka RcisTarget)
--Calculate a list of enriched motifs and the corresponding target genes for all modules.
with ProgressBar():
df = prune2df(dbs, modules, MOTIF_ANNOTATIONS_FNAME)

--Create regulons from this table of enriched motifs.
regulons = df2regulons(df)

--Save the enriched motifs and the discovered regulons to disk.
df.to_csv(MOTIFS_FNAME)
with open(REGULONS_FNAME, "wb") as f:
pickle.dump(regulons, f)
Clusters can be leveraged in the following way:

--The clusters can be leveraged via the dask framework:
df = prune2df(dbs, modules, MOTIF_ANNOTATIONS_FNAME, client_or_address=SCHEDULER)
Caution!

The nodes of the clusters need to have access to a shared network drive on which the ranking databases are stored.

Reloading the enriched motifs and regulons from file should be done as follows:

df = load_motifs(MOTIFS_FNAME)
with open(REGULONS_FNAME, "rb") as f:
regulons = pickle.load(f)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Node of clusters and ranking database scheduler #365

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 0 comments

Select a reply

Node of clusters and ranking database scheduler #365

Su870 Dec 2, 2022

Thanks!

Replies: 0 comments

Su870
Dec 2, 2022