
Remove spectral index and allow input of stokes varying by source, time and channel. #244

Merged · 46 commits into master from ddfacet · Apr 21, 2020

Conversation

@sjperkins (Member)

Previously, the spectral index was limited to (freq/ref_freq)**alpha, which is a fairly simplistic model. Remove this and simply allow the user to specify Stokes parameters varying by source, time and channel.

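A minimal NumPy sketch of the new input, with the old spectral model folded into the Stokes parameters by hand. All sizes, frequencies and fluxes here are hypothetical, chosen only for illustration:

```python
import numpy as np

# Hypothetical problem sizes
n_src, n_time, n_chan = 10, 5, 64

freq = np.linspace(0.856e9, 1.712e9, n_chan)  # channel frequencies [Hz]
ref_freq = 1.4e9                              # reference frequency [Hz]
I = np.ones(n_src)                            # 1 Jy Stokes I per source
alpha = np.full(n_src, -0.7)                  # spectral index per source

# The old (freq/ref_freq)**alpha model, now applied by the user
# rather than inside montblanc
spectrum = (freq[None, :] / ref_freq) ** alpha[:, None]   # (n_src, n_chan)

# New input: Stokes parameters varying by source, time and channel
stokes = np.zeros((n_src, n_time, n_chan, 4))
stokes[..., 0] = (I[:, None] * spectrum)[:, None, :]  # constant in time here
```

Any other spectral (or temporal) behaviour is expressed the same way, by filling the array differently.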
@sjperkins (Member Author)

I'm unlikely to merge this into master given:

  1. development on the dask version
  2. #168 (Support plugging in various implementations for different sections of the RIME pipeline) would be a fair amount of dev on the master branch.

However, given the need for arbitrary spectral index expressions in https://github.com/cyriltasse/DDFacet/pull/452 and #171 I can keep parallel development on this branch going.

@sjperkins (Member Author)

/cc @bennahugo @landmanbester

    def point_ref_freq(self, context):
        (lp, up) = context.dim_extents('npsrc')
        return pt_ref_freq[lp:up]

    return s*(f/rf)**a
@sjperkins (Member Author)

@bennahugo See above for how to input stokes parameters along with the default spectral index into montblanc

@sjperkins (Member Author)

@JSKenyon @SaiyanPrince After discussions with @gijzelaerr I think we're going to end up merging this into the master branch. Essentially, Stokes parameters now vary by channel too, and one has to manually work the spectral index into the Stokes terms (see test_meq_tf.py). In practice, the *_stokes data sources in your SourceProviders will need to change, as they have in test_meq_tf.py on this branch.

You have 2 hours to lodge objections!
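For illustration, a changed *_stokes data source might look something like the sketch below. The class name and the flat 1 Jy spectrum are invented for this example; only the provider-method pattern and the new (npsrc, ntime, nchan, 4) shape come from this PR:

```python
import numpy as np

class ExampleSourceProvider(object):
    """Hypothetical data source under the new interface, where
    point_stokes fills a (npsrc, ntime, nchan, 4) array instead
    of the old (npsrc, ntime, 4)."""

    def point_stokes(self, context):
        # montblanc asks for exactly the shape/dtype it needs
        stokes = np.zeros(context.shape, context.dtype)
        # Any spectral model is worked in manually here; this
        # sketch just uses flat 1 Jy Stokes I sources
        stokes[..., 0] = 1.0
        return stokes

# A stand-in for the context montblanc would pass in
class _FakeContext(object):
    shape = (3, 2, 4, 4)   # (npsrc, ntime, nchan, npol)
    dtype = np.float64

stokes = ExampleSourceProvider().point_stokes(_FakeContext())
```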

@JSKenyon
I don't object per se, but this volatile interface is a little bit frustrating. To any CubiCal user reading this, it may take a day or two for this to be fixed on our side.

@gijzelaerr (Member)

Maybe make a release of Montblanc before the change, and make the current CubiCal depend on that release for now?

@sjperkins (Member Author)

This is on hold for a bit while @gijzelaerr and I consider packaging implications...

@IanHeywood
So what montblanc will ingest is an array of dimensions n_pol x n_chan for every component?

I guess in the case of interfacing with CubiCal this array will be calculated from the Tigger models using I,Q,U,V,f0,alpha,[beta,...] and then passed to montblanc?

Cheers.

@sjperkins (Member Author)

So what montblanc will ingest is an array of dimensions n_pol x n_chan for every component?

In fact, the array rank is growing fairly large (n_src, n_time, n_chan, n_pol) (we have n_time in order to represent scintillation)

I guess in the case of interfacing with CubiCal this array will be calculated from the Tigger models using I,Q,U,V,f0,alpha,[beta,...] and then passed to montblanc?

Correct

@o-smirnov (Contributor)

(we have n_time in order to represent scintillation)

Do you then use n_time=1 if no sources are variable (i.e. is the time axis broadcast, numpy-style?)

@sjperkins (Member Author)

Do you then use n_time=1 if no sources are variable (i.e. is the time axis broadcast, numpy-style?)

Not at present, mostly because GPU broadcasting isn't available yet. There is a broadcasting PR that would make the pipeline more flexible in this regard.
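What NumPy-style broadcasting would buy here can be sketched directly (sizes hypothetical): a singleton time axis lets time-invariant spectra combine with per-time factors without ever storing the full array.

```python
import numpy as np

n_src, n_time, n_chan = 4, 10, 8

# Time-invariant spectra stored once, with a singleton time axis
stokes = np.ones((n_src, 1, n_chan, 4))

# A per-timeslot scale factor (e.g. scintillation) broadcasts against it;
# the full (n_src, n_time, n_chan, 4) array only materialises here
scale = np.linspace(1.0, 2.0, n_time).reshape(1, n_time, 1, 1)
result = stokes * scale

# The stored input is n_time times smaller than the broadcast result
assert stokes.nbytes * n_time == result.nbytes
```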

@o-smirnov (Contributor)

Aren't we then substantially increasing data volumes to cater for what is only a marginal use case?

@sjperkins (Member Author)

Aren't we then substantially increasing data volumes to cater for what is only a marginal use case?

Indeed, although it "supports the general case" in that the user can work whatever spectral models they want into the Stokes parameters.

I think this highlights the need for flexibility in inputs and RIME terms (related to #245, #168), balanced against the hand-coded and relatively inflexible nature of CUDA kernels. @JSKenyon and I started brainstorming ways to support this configurability yesterday.

Things like tensorflow/tensorflow#15243 would make this much easier (probably at the expense of extra memory copies).

@o-smirnov (Contributor)

Indeed, although it "supports the general case" in that the user can work whatever spectral models they want into the Stokes parameters.

That I don't question. 99% of your use cases need a spectral axis.

However, I wager less than 5% (if that) of the use cases need both spectral and time axes. If this is so, doesn't having an "always on" time axis induce an unnecessary penalty 94% of the time?

@sjperkins (Member Author) commented Mar 15, 2018

However, I wager less than 5% (if that) of the use cases need both spectral and time axes. If this is so, doesn't having an "always on" time axis induce an unnecessary penalty 94% of the time?

Yes, and really the issue here becomes maintaining permutations of the C++/CUDA code (most likely in the multiplication of terms rather than their calculation). Off the top of my head the permutations are device ["CPU", "GPU"] by shape [("time", "chan"), ("chan",), ("time",), ()] for a total of eight combinations. The nice thing about NumPy is that it makes this kind of thing trivial, but a couple of lines in NumPy blows up to hand-crafting each case in CUDA. That's where a GPU broadcasting operation would make things simpler.

There is the option of something like tf.matmul but then instead of a single operator doing specialised multiplies of four 2x2 Jones matrices we have four calls to tf.matmul and allocation of space for four result arrays. To me it seems like a snakes and ladders type scenario -- memory budgets on a GPU are tighter than on a CPU.

My gut is against premature optimisation here -- it's been a while since I've been able to do serious profiling, but things like the beam, Gaussian shape parameters and multiplication to produce source coherencies were the big time sinks when I last looked.

Additionally, I was only intending to merge this to make #250 simpler and the cuDNN dependency is proving a hassle.
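The "couple of lines in NumPy" point can be made concrete: a broadcast, batched 2x2 Jones multiply is a single einsum call whatever the leading axes, whereas each present/absent-axis permutation needs its own hand-written CUDA kernel. Shapes here are hypothetical:

```python
import numpy as np

# Hypothetical leading dims (source, time, chan), trailing 2x2 Jones terms
rng = np.random.default_rng(42)
A = rng.standard_normal((3, 5, 7, 2, 2))
B = rng.standard_normal((3, 1, 7, 2, 2))  # singleton time axis broadcasts

# One line covers every shape permutation, including broadcasting
C = np.einsum('...ij,...jk->...ik', A, B)
```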

@o-smirnov (Contributor)

I would reduce the shape permutations to just [("time,chan"), ("chan")]. These are the two realistic use cases.

My gut is against premature optimization here

I'm on record as being militantly against premature optimization. But is this premature, or even optimization?

Consider a simple MeerKAT use case. 1000 sources, 4K channels, 4 polarisations, 1000 timeslots. That's 16G entries in your source array -- you can't even get that onto the GPU!

Eliminate the time axis, it's down to 16M entries, small change.

So having a permanently unrolled time axis effectively makes Montblanc incompatible with MeerKAT, our meat-and-potatoes application. All for the sake of supporting a rather exotic use case (which scintillation and/or transients are) that is only likely to be run in a regime where you have small numbers of sources, channels and antennas.
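The arithmetic behind this sizing is worth spelling out (64-bit entries assumed):

```python
# Back-of-envelope check of the MeerKAT sizing above
n_src, n_chan, n_pol, n_time = 1000, 4096, 4, 1000

with_time = n_src * n_time * n_chan * n_pol   # ~16.4e9 entries
without_time = n_src * n_chan * n_pol         # ~16.4e6 entries

# At 8 bytes per entry, the time axis takes the source array from
# roughly 125 MiB to roughly 122 GiB -- beyond any single GPU's memory
gib = lambda n: n * 8 / 2**30
```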

@sjperkins (Member Author) commented Mar 15, 2018

Consider a simple MeerKAT use case. 1000 sources, 4K channels, 4 polarisations, 1000 timeslots. That's 16G entries in your source array -- you can't even get that onto the GPU!

True! That is probably too large for the general case, and is a good argument for why this PR probably shouldn't be merged as standard functionality -- I should have pencilled and papered the problem sizes because as you point out, the size of the input is huge.

But also remember that the problem also gets subdivided into chunks by time and source (think 100 times and 50 sources) for transfer to the GPU. The dask version will also allow easy chunking by channel. So the problem is less about fitting it onto the GPU than about the I/O transfer of parts of the problem to/from the GPU (16GB/s full duplex on PCI-E 3.0). I always think of RIME I/O costs as O(V + S) vs compute costs of O(V x S), where V is the number of visibilities and S the number of sources. I think in this case, due to the large number of sources, the compute will still outstrip the very large I/O transfer.

In summary, I agree we shouldn't make this standard. I'll liaise with @gijzelaerr tomorrow about packaging again.
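The O(V + S) vs O(V x S) framing can be sketched numerically, with MeerKAT-ish sizes (all values hypothetical):

```python
# I/O moves visibilities and source parameters once: O(V + S).
# Compute touches every (visibility, source) pair: O(V * S).
n_ant, n_time, n_chan, n_src = 64, 1000, 4096, 1000

V = (n_ant * (n_ant - 1) // 2) * n_time * n_chan  # visibilities
S = n_src

io_units = V + S
compute_units = V * S

# With many sources, compute dwarfs the transfer cost, which is
# why chunked transfer to the GPU can still keep it busy
ratio = compute_units / io_units
```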

@sjperkins (Member Author)

I think this is also why configurability is going to be a large consideration from my side in the new version.

@IanHeywood
1000 sources

Can you really see a need for this many sources? Won't most use cases involve a handful of montblanc components and a MODEL_DATA column? Not that I'm arguing against making things leaner.

I think the time axis would be a nice thing to keep as an option. I was chatting briefly to @twillis449 and Bruce B. about simulating RFI and this seems like a really nice way to chuck in arbitrary time and frequency behaviour.

@o-smirnov (Contributor)

Well, we went up to 300 sources for the 3C147 VLA reduction. Arguably, this can be reduced by better use of the MODEL_DATA column. So let's say we need 50, or 100. That's still a big ole hunk of data with a time axis in place -- but quite lightweight without one.

I'm not arguing against a time axis -- just against a non-optional one.

@bennahugo (Contributor)

I'm happy that this branch works (and is fully py3 compatible); see my latest test run below:
[image: deep2]

This addresses the py3 changes needed for CubiCal in ratt-ru/CubiCal/pull/270

@bennahugo (Contributor)

@sjperkins can we sit together tomorrow morning and handle the rest of the merge conflicts?

@sjperkins (Member Author)

Sure

@bennahugo (Contributor)

Alright, my subtraction test passes again, as above. I need to install NVIDIA drivers to run your tests though, so fingers crossed this doesn't break my system.

@bennahugo (Contributor)

This is a Python 3 construct:

  File "/usr/local/lib/python2.7/dist-packages/montblanc/impl/rime/tensorflow/RimeSolver.py", line 1145, in _get_data
    raise ex.with_traceback(sys.exc_info()[2])
AttributeError: 'exceptions.ValueError' object has no attribute 'with_traceback'

@o-smirnov (Contributor)

Any reason not to merge this still?

@sjperkins (Member Author)

Any reason not to merge this still?

I can't remember any particular reason. Maybe we should just merge and deal with consequences?

@sjperkins sjperkins merged commit e8b2e71 into master Apr 21, 2020
@sjperkins sjperkins deleted the ddfacet branch April 21, 2020 07:23