Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CDMS data on OSN and descriptors #1

Open
zonca opened this issue Oct 10, 2023 · 8 comments
Open

CDMS data on OSN and descriptors #1

zonca opened this issue Oct 10, 2023 · 8 comments

Comments

@zonca
Copy link
Collaborator

zonca commented Oct 10, 2023

We want to investigate how to use data descriptors of SuperCDMS data to document their structure, this could be used by automated systems like EventIO to interface with the data

@zonca
Copy link
Collaborator Author

zonca commented Oct 10, 2023

Event IO

Karl Kosack <[email protected]> and Maximilian Nöthe <[email protected]> are ctapipe developers (https://ctapipe.readthedocs.io/).
The file format is called EventIO, documented here. They have a C++ library for it with a Python interface, but wanted to get less implementation-specific. They looked at Kaitai and had misgivings about the lack of built-in support for variable-width integers. I showed them that we can represent variable-width integers in Kaitai, but that was the last I heard from them.

@zonca
Copy link
Collaborator Author

zonca commented Oct 10, 2023

@zonca
Copy link
Collaborator Author

zonca commented Oct 11, 2023

confirm I can access the data:

aws s3 --profile osn ls
 s3://supercdms-data/CDMS/UMN/R68/Raw/07180808_1558/07180808_1558_F0001.mid.gz
2023-06-16 03:19:52   48189875 07180808_1558_F0001.mid.gz

@pibion
Copy link
Member

pibion commented Oct 20, 2023

@zonca here's the descriptor for v8 of the supercdms format: https://github.com/det-lab/dataReaderWriter/blob/master/kaitai/ksy/scdms_v8.ksy. An example of data that this matches is at https://github.com/det-lab/dataReaderWriter/blob/master/data/51230216_125838_F0001.mid.gz.

The UMN data above is v1, I'm working on a data description for that now.

@zonca
Copy link
Collaborator Author

zonca commented Oct 20, 2023

@pibion do you have docs on how to create a Python interface for this descriptor? do these use awkward arrays?

@pibion
Copy link
Member

pibion commented Oct 20, 2023

@pibion
Copy link
Member

pibion commented Oct 20, 2023

Her talk on how to use these is at https://github.com/ManasviGoyal/PyHEP-2023-Awkward-Target-for-Kaitai-Struct

@zonca
Copy link
Collaborator Author

zonca commented Oct 21, 2023

ok, I managed to install https://github.com/ManasviGoyal/kaitai_awkward_runtime, and run the example:

awkward_array = kaitai_awkward_runtime.load("data/animal.raw")

@ManasviGoyal how do I load https://github.com/det-lab/dataReaderWriter/blob/master/kaitai/ksy/scdms_v8.ksy or the dataset instead?

@zonca zonca changed the title CDMS data on OSN and description CDMS data on OSN and descriptors Oct 21, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants