Tempered mcmc #180

JesusTorrado · 2021-05-20T21:16:12Z

[WIP]

Missing testing and getdist side of things.

cobaya/samplers/mcmc/mcmc.py

cmbant · 2021-05-20T22:06:35Z

cobaya/samplers/mcmc/mcmc.py

+            if self.temperature:
+                raise LoggedError(
+                    self.log,
+                    "Temperature != 1 and dragging are not compatible at the moment.")


At a quick glance, I think dragging is correct with just the current PR changes (since temperature just scales all the logposts by the same linear factor)?

It should, I guess, for any sampling algorithm, since it's just importance reweighting? Anyway, will test before merging with baseline Planck.

codecov-commenter · 2021-09-08T16:14:25Z

Codecov Report

Merging #180 (8e8a5f4) into master (ac3ed01) will decrease coverage by 0.04%.
The diff coverage is 75.40%.

@@            Coverage Diff             @@
##           master     #180      +/-   ##
==========================================
- Coverage   87.59%   87.55%   -0.05%     
==========================================
  Files          91       91              
  Lines        8345     8390      +45     
==========================================
+ Hits         7310     7346      +36     
- Misses       1035     1044       +9

Impacted Files	Coverage Δ
cobaya/theories/camb/camb.py	`92.00% <ø> (ø)`
cobaya/collection.py	`85.39% <68.29%> (-1.54%)`	⬇️
cobaya/samplers/mcmc/mcmc.py	`91.16% <87.50%> (-0.31%)`	⬇️
cobaya/model.py	`93.92% <100.00%> (+0.03%)`	⬆️
cobaya/sampler.py	`89.36% <0.00%> (+0.42%)`	⬆️
cobaya/prior.py	`97.87% <0.00%> (+1.41%)`	⬆️

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

JesusTorrado · 2021-09-08T16:40:12Z

@cmbant Mostly done. As expected by getdist, now storing tempered weights (integer) and logpost in the collection. Statistical functions such as mean, cov take into account the temperature with which the sample was generated and return the statistics corresponding to the original pdf (unless otherwise requested with ignore_temperature=True, as convergence checks in tempered MCMC do).

A couple of missing things:

Collection.thin_samples: should it care about temperature? I reckon it should not at this point in the code, right? (that's the point of integer weights?) But not sure.
Handling of covmats I/O: I would say we would load/save the temperature=1 covmats, and modify them internally, since we may want to reuse them in sampling processes with different temperatures?
What to do exactly about getdist: now one can load a tempered sample, call MCSamples.cool() with the temperature, and recover the original pdf. But since the temperature is known at the time of loading the sample (stored in the updated.yaml), wouldn't it make sense that getdist is smater about it, e.g. having a default value for the cool argument in MCSamples.cool(), or even calling it automatically before plotting, even if weights are still stored tempered/integer for chain operations purposes?

cmbant · 2021-09-17T07:31:52Z

back from hols now. I agree thinning should just use the integer weights, and saving covmats with T=1 makes sense.
Could certainly add some temperature meta data tag to .properties.ini and yaml equivalent, though people may also want to look at the high-temperature results (e.g. to visually check tail convergence, or see what results look like with effectively larger errors).

JesusTorrado · 2021-09-22T13:49:11Z

Thanks!

Then I don't think there is anything to be done then on the thinning side, since the relevant auxiliary GetDist chains are already initialised with weights and logposterior of the tempered posterior (so integer weights and exponentiated minuslogposterior).
CovMat I/O is now done for the T=1 pdf.
About GetDist:
- The question is about default behaviour: when we plot a tempered chain, do we want to get plots/statistics for the original pdf (maybe together with a warning) if no kwarg is overriden, or the other way around? I think it should be the first one, but your call. Of course, ideally in both cases one should be able to get plots/statistics of both the tempered and original pdf.
- No need for now for metadata: the temperature is in the updated.yaml file, and I have already wrote a function for GetDist to extract it (just not sure where to put it, because GetDist has a few different ways/places for loading chains). As stated above, once set as an attribute of the chain, the temperature could be the default value for the arg of MCSamples.cool().

I'll open a GetDist issue about this.

JesusTorrado · 2021-09-22T15:51:43Z

@cmbant do you agree with the changes in the docs here, or have anything to add? 89b3188

cmbant · 2021-09-22T16:55:48Z

Looks good to me thanks, though not very sure what you mean by "by weighting with samples with their probability". Can be useful for evaluating small tail probabilities. In high dimensions I'm using a single higher temperature usually probably doesn't help much for importance sampling (you'd really want to flatten just in a small number of relevant directions).

Did you fix the issue with the covariance scaling?

I don't have a very strong opinion about default GetDist behaviour. Ideally temperature, cooling, thinning , and burn-in removal metadata should probably be propagated consistently (including when saving MCSamples.saveAsText, and in Cobaya importance-sampled outputs).

JesusTorrado · 2021-09-22T19:27:54Z

Thanks a lot for the review!

Looks good to me thanks, though not very sure what you mean by "by weighting with samples with their probability".

The first "with" should not be there, sorry. I mean when computing any quantity for which one weights samples with their posterior/prior/likelihood, e.g. in the GP sampler we try to do quick estimates of the covariance matrix from small high-temperature samples, too small to be fair, but sparse enough to get a decent estimation (hopefully).

Can be useful for evaluating small tail probabilities.

I'll add that.

In high dimensions I'm using a single higher temperature usually probably doesn't help much for importance sampling (you'd really want to flatten just in a small number of relevant directions).

Ok, I guess I was thinking too naively about this. Do you think it's still worth mentioning that it may be useful in low dimensions, or should we remove any mention of re-weighting there?

Did you fix the issue with the covariance scaling?

Yes, it was indeed the prior, thanks!

I don't have a very strong opinion about default GetDist behaviour.

My strongest opinion is that, if the default is plotting/using the high-temperature chain, users should get at least a warning when loading the chain with an explanation of how to cool it down. Another strong opinion: some hint/checkbox in the GUI for cooling down on the spot (even it the cooled-down chain only stays in RAM). Otherwise users would have to load the chains by hand in a Python shell/notebook and cool them to be able to plot them? Not sure how you did that before with CosmoMC.

In general, I think GetDist should work with it more like Cobaya's collections: stored as high-temperature, but cooled-down weights and posterior can be requested via methods without modifying the internal high-temperature data. This way you don't need to cool down the sample to get "cool" statistics and plots.

Ideally temperature, cooling, thinning , and burn-in removal metadata should probably be propagated consistently (including when saving MCSamples.saveAsText, and in Cobaya importance-sampled outputs).

I'll turn this into another issue and get to work on it over the following weeks.

As for post processing, the easiest way to go at the moment is to automatically cool on load, and thus not preserve temperature. Unless you strongly disagree, I'd leave it like that for now and open another issue to be worked on in the following weeks. (I need this one merged soon).

cmbant · 2021-09-23T07:28:18Z

I started support for auto-cool in getdist at https://github.com/cmbant/getdist/tree/autocool.
Cobaya temperature setting could be propagated to properties as other metadata settings like sampler. Does this work?

GetDist gui currently doesn't have a good way to change settings per chain (since you often operate on many at once), but this I think works analogously to ignore_rows/burn_removed, in that set globally, but chain metadata specified whether should be applied to each specific chain.

JesusTorrado · 2021-09-25T20:25:17Z

Thanks! Could work. Since this is new, we can test it ourselves and see whether it feels convenient. I am going to be using it this coming week myself.

To get the temperature from a yaml, assuming you have got the sampler name with the get_sampler function that is already there, you can use:

def get_sample_temperature(filename_or_info, sampler):
    return yaml_file_or_dict(filename_or_info).get(_sampler, {}).get(sampler, {})\
        .get("temperature", 1)

So if I got it correctly, GetDistGUI will cool down on load unless the metadata requests otherwise. That would look good to me then.

Any comment on my answer above re documentation?

cmbant · 2021-09-27T07:30:40Z

Doc changes sound OK, can certainly mention could be useful for importance sampling in some cases.

cmbant · 2022-02-24T11:05:32Z

TODO (at least): post processing, how to store temperature/cool status (#202), getdist setting of temperature property on cobaya load.

I pushed a few of the tempering-independent fixes in this PR to master.

cmbant · 2022-10-06T17:40:41Z

@JesusTorrado: @AndreasNygaard is interested in using temperatures for training Connect - how far off do you think this is from merge?
(I re-activated travis account again..)

mcmc: tempereture: basic implementation [skip travis]

aca9102

cmbant reviewed May 20, 2021

View reviewed changes

cobaya/samplers/mcmc/mcmc.py Show resolved Hide resolved

cmbant reviewed May 20, 2021

View reviewed changes

JesusTorrado added 7 commits August 26, 2021 22:26

Merge branch 'master' into mcmc_tempered [skip travis]

579269a

more on temperatures [skip travis]

937c19d

collection: some type hinting

1606031

temperature: fixes

5a12f05

collection: bugfixes and reworking [skip travis]

4e80362

collection: removed old slicing made it unnecessary bc caching

d524abf

Merge branch 'master' into mcmc_tempered

3094a3d

mcmc: takes temperature into account for bound computation

3484e27

mcmc_tempered: bugfix in Collection, and some refactoring

0580398

JesusTorrado added 2 commits September 22, 2021 17:47

collection: bugfix (new pandas behaviour) and temp in MAP

0f99386

mcmc_tempered: documentation

89b3188

mcmc_tempered: added test

ad74149

JesusTorrado added 5 commits November 23, 2021 00:09

typo

81f75be

docs typo

5dd8be8

typos and clarifications

6dc60cd

model and collection: more typing

4597f50

collection: covmat computation robustness

b169ec6

JesusTorrado and others added 2 commits December 5, 2021 07:27

prior: added method set_reference to update reference pdf, MPI-aware

48fc2e6

Merge branch 'master' into mcmc_tempered

d929322

cmbant and others added 3 commits February 24, 2022 11:08

Merge branch 'master' into mcmc_tempered

1b42142

Merge branch 'master' into mcmc_tempered

6a89357

trivial

8e8a5f4

JesusTorrado closed this Mar 29, 2023

JesusTorrado deleted the mcmc_tempered branch March 29, 2023 18:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tempered mcmc #180

Tempered mcmc #180

JesusTorrado commented May 20, 2021

cmbant May 20, 2021

JesusTorrado Sep 7, 2021

codecov-commenter commented Sep 8, 2021 •

edited

Loading

JesusTorrado commented Sep 8, 2021 •

edited

Loading

cmbant commented Sep 17, 2021

JesusTorrado commented Sep 22, 2021

JesusTorrado commented Sep 22, 2021

cmbant commented Sep 22, 2021

JesusTorrado commented Sep 22, 2021

cmbant commented Sep 23, 2021

JesusTorrado commented Sep 25, 2021

cmbant commented Sep 27, 2021

cmbant commented Feb 24, 2022 •

edited

Loading

cmbant commented Oct 6, 2022

Tempered mcmc #180

Tempered mcmc #180

Conversation

JesusTorrado commented May 20, 2021

cmbant May 20, 2021

Choose a reason for hiding this comment

JesusTorrado Sep 7, 2021

Choose a reason for hiding this comment

codecov-commenter commented Sep 8, 2021 • edited Loading

Codecov Report

JesusTorrado commented Sep 8, 2021 • edited Loading

cmbant commented Sep 17, 2021

JesusTorrado commented Sep 22, 2021

JesusTorrado commented Sep 22, 2021

cmbant commented Sep 22, 2021

JesusTorrado commented Sep 22, 2021

cmbant commented Sep 23, 2021

JesusTorrado commented Sep 25, 2021

cmbant commented Sep 27, 2021

cmbant commented Feb 24, 2022 • edited Loading

cmbant commented Oct 6, 2022

codecov-commenter commented Sep 8, 2021 •

edited

Loading

JesusTorrado commented Sep 8, 2021 •

edited

Loading

cmbant commented Feb 24, 2022 •

edited

Loading