Skip to content

nsdf-fabric/nsdf-catalog

Repository files navigation

NSDF-Catalog

NSDF-catalog

NSDF-Catalog is a lightweight indexing service with minimal metadata that complements existing domain-specific and rich-metadata collections.

NSDF-Catalog is part of the National Science Data Fabric (NSDF) initiative, funded by the National Science Fundation (NSF).

NSDF-Catalog is designed to facilitate multiple related objectives within a flexible microservice to: (i) coordinate data movements and replication of data from origin repositories within the NSDF federation (ii) build an inventory of existing scientific data to inform the design of next-generation cyberinfrastructure; and (iii) provide a suite of tools for discovery of datasets for cross-disciplinary research.

NSDF-Catalog indexes scientific data at a fine-granularity at the file or object level to inform data distribution strategies and to improve the experience for users from the consumer perspective, with the goal of allowing end-toend dataflow optimizations. Index Terms—national science data fabric, scientific data, cloud, high performance computing

Quickstart

The fastest way to get started is to use docker-compose:

$ docker-compose up

This spawns various containerized microserves which together form a NSDF-compatible indexing service. An graphical overview is documented in (./docs/figures/overview.drawio).

It should now be possible to access a web frontend at:

http://localhost:80/

And a REST API at:

http://localhost:80/api/v1

Production Deployment

It is highly recommended to run NSDF-catalog instances through a domain name to allow for HTTPs.

  • (A domain to allow HTTPs via LetsEncrypt/Certbot)
  • A publically available IP
  • A machine (physical or virtual) to spawn multiple containers

Community

NSDF-catalog is an open source project. Questions, discussion, and contributions are welcome. Contributions can be anything from new packages to bugfixes, documentation, or even new core features.

Resources:

Contributing

Contributing to NSDF-catalog is easy. Just send us a pull request.

When you send your request, make develop the destination branch on the NSDF-catalog-repository.

NSDF-catalog'sdevelop branch has the latest contributions. Pull requests should target develop, and users who want the latest package versions, features, etc. can use develop.

Authors

NSDF-catalog was created by the NSDF team . To reach out email us at [email protected]

Citing NSDF-Catalog

If you are referencing NSDF-catalog in a publication, please cite the following paper:

  • Jakob Luettgau, Giorgio Scorzelli, Glenn Tarcea, Christine R. Kirkpatrick, Valerio Pascucci and Michela Taufer. NSDF-Catalog: Lightweight Indexing Service for Democratizing Data Delivery. In Proceedings of the 15th IEEE/ACM International Conference on Utility and Cloud Computing (UCC), pages 1–10, Seattle, WA, USA, December 2022. IEEE Computer Society.

On GitHub, you can copy this citation in APA or BibTeX format via the "Cite this repository" button.

License

NSDF-Catalog is distributed under the terms of the BSD-3-Clause license. All new contributions must be made under the BSD-3-Clause license.

See LICENSE for details.

SPDX-License-Identifier: (BSD-3-Clause)

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •