DOE OSTI DOIs for input4MIPs #177
Comments
FYI to self: 210 DOIs were issued by the CMIP6-era citation service for input4MIPs, so maybe we need to bump up the ~100 number we've discussed. See https://www.wdc-climate.de/ui/statistics?type=cmip6_doi_registration. Also relevant is the CMIP6 Data Citation and Long-Term Archival wiki: https://redmine.dkrz.de/projects/cmip6-lta-and-data-citation/wiki
Hi @jitendra-kumar. Just circling back on this task: is there any progress to report? We have a project meeting tomorrow, so I am keen to update the data providers on the status and timings.
Here's a summary of the fields we need information for to register with OSTI. Much (but not all) of this information already exists within the JSONs in this repo, so we can pull it together from the existing JSONs and create a new JSON with all the information needed to register a DOI for each dataset.
Product Description:
Dataset Location:
@jitendra-kumar, that's great. What is the best/easiest format for this info to be collated, considering this first pass is going to be manual copy-and-paste: text files or another format?
We should put the information together in a JSON, which would allow us to automate the process at a later date. Even in the short term, I can extract everything needed from it for manual entry.
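As a rough sketch of that approach, the snippet below merges an existing registry entry with the extra DOI fields into one JSON record and lists whatever is still missing. The field names, the `REQUIRED_FIELDS` list, the helper names, and the example values are all placeholders for illustration; they are not OSTI's actual requirements or this repo's real entries.

```python
import json

# Hypothetical list of fields a DOI registration needs; not OSTI's real list.
REQUIRED_FIELDS = ["dataset_title", "authors", "description", "publication_date"]


def build_record(existing: dict, extra: dict) -> dict:
    """Merge an existing registry entry with extra DOI fields (extra wins on clashes)."""
    return {**existing, **extra}


def missing_fields(record: dict) -> list[str]:
    """Return the required fields that are absent or empty in the record."""
    return [name for name in REQUIRED_FIELDS if not record.get(name)]


# Made-up example data standing in for an entry already held in the repo.
existing_entry = {"source_id": "EXAMPLE-1-0", "dataset_title": "Example forcing dataset"}
extra_fields = {"description": "Placeholder description.", "authors": []}

record = build_record(existing_entry, extra_fields)
record_json = json.dumps(record, indent=2, sort_keys=True)
still_missing = missing_fields(record)
```

A list like `still_missing` could be sent back to data providers during the manual first pass, while `record_json` is what later automation would consume.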
We will need to create .html landing pages. The .json could be used to render those. We would then need to put together a template and push the rendered pages to gh-pages. This could be done with GitHub Actions.
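One possible way to render a landing page from such a JSON record, using only the Python standard library, is sketched here. The template layout, the `render_landing_page` helper, and the `dataset_title` and `description` keys are assumptions for illustration; the real template would need whatever fields OSTI expects on a landing page.

```python
import html
import json
from string import Template

# Hypothetical minimal template; a real landing page would carry more fields.
LANDING_PAGE = Template(
    "<!DOCTYPE html>\n"
    "<html>\n"
    "<head><title>$title</title></head>\n"
    "<body>\n"
    "<h1>$title</h1>\n"
    "<p>$description</p>\n"
    "</body>\n"
    "</html>\n"
)


def render_landing_page(record_json: str) -> str:
    """Render one dataset record (a JSON string) to an HTML page."""
    record = json.loads(record_json)
    # Escape values so that characters such as "<" cannot break the markup.
    return LANDING_PAGE.substitute(
        title=html.escape(record["dataset_title"]),
        description=html.escape(record["description"]),
    )


# Made-up record used only to exercise the function.
example = json.dumps({"dataset_title": "Example <v1>", "description": "Placeholder text."})
page = render_landing_page(example)
```

A workflow step could run a script like this over every record and commit the output to the gh-pages branch; that part is not shown here.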
Do you have any ideas for the schema, @jitendra-kumar? E.g. do certain fields need to be strings/boolean/lists etc.? I think that is the key. Once we have the schema, writing data to match it is relatively trivial. Even just something like the below.

Schema proposal

    from attrs import define


    @define
    class Author:
        first_name: str
        last_name: str
        orcid: str  # would we also validate this, probably a good idea if easy
        affiliation: str
        affiliation_ror: str | None  # optional for anyone whose institute isn't registered


    @define
    class Product:
        dataset_title: str
        authors: list[Author]
        related_dois: list[str]  # should validate that these are DOIs
        # I find this field a bit weird, given most things have multiple authors
        # therefore source organisations and the authors have affiliations anyway
        originating_research_organisation: str
        publication_date: str  # YYYY-MM-DD I guess?
        # as above re needing multiple and info already being in author info.
        # Also unclear to me what the difference from the other orgs is,
        # so I would suggest making this optional if we can
        sponsoring_organisation: str
        keywords: list[str]
        # what do we put here? Lat/lon co-ords?
        # Would suggest making this optional or dropping if we can
        geolocation: tuple[float, float]
        description: str
        # what do we put here? Lat/lon co-ords? Would suggest making this optional
        # or dropping if we can. Or do I misunderstand this field?
        dataset_location: tuple[float, float]


    @define
    class Dataset:
        url: str  # validate this is a URL
        extension: str
        size: float  # in bytes I guess?
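The validation mentioned in the comments above (ORCID, DOIs, URL) could look roughly like the sketch below. The function names are made up for illustration. The ORCID check uses the ISO 7064 MOD 11-2 check digit that ORCID iDs carry, while the DOI and URL checks only test the general shape of the string and do not confirm that the identifier resolves.

```python
import re
from urllib.parse import urlparse

ORCID_PATTERN = re.compile(r"\d{4}-\d{4}-\d{4}-\d{3}[\dX]")
# Loose shape check only: "10.", a registrant code, "/", then a non-empty suffix.
DOI_PATTERN = re.compile(r"10\.\d{4,9}/\S+")


def is_valid_orcid(orcid: str) -> bool:
    """Check the format and the ISO 7064 MOD 11-2 check digit of an ORCID iD."""
    if not ORCID_PATTERN.fullmatch(orcid):
        return False
    digits = orcid.replace("-", "")
    total = 0
    for char in digits[:-1]:
        total = (total + int(char)) * 2
    check = (12 - total % 11) % 11
    expected = "X" if check == 10 else str(check)
    return digits[-1] == expected


def looks_like_doi(value: str) -> bool:
    """Return True if the string has the general shape of a bare DOI."""
    return DOI_PATTERN.fullmatch(value) is not None


def is_http_url(value: str) -> bool:
    """Return True if the string parses as an http(s) URL with a host."""
    parsed = urlparse(value)
    return parsed.scheme in ("http", "https") and bool(parsed.netloc)
```

If the schema stays in attrs, checks like these could be attached to the relevant fields as validators so that a bad value fails when the object is created.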
@znichollscr, I'm working on this schema to make it consistent with what OSTI wants. I will have something to share soon.
Just adding a placeholder issue, so we can centralize information about what the DOI OSTI service requires from authors to get a DOI issued.
We can then update the source_id and institution_id registration info with the additional fields.
ping @jitendra-kumar @sashakames