Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[RFE] New corpus schema #1

Open
ingvagabund opened this issue Apr 15, 2018 · 0 comments
Open

[RFE] New corpus schema #1

ingvagabund opened this issue Apr 15, 2018 · 0 comments

Comments

@ingvagabund
Copy link
Contributor

ingvagabund commented Apr 15, 2018

  • Store artefacts as tarballs. Compressed json files are a lot smaller then uncompressed ones
  • Consolidate artefacts generated by the same extractor. E.g. symbols extractor extracts three pieces of data: api, static allocations and contracts. They should be stored under the same tarball.
  • Store relevant artefacts per package, not per project where it is sane. E.g. API of some projects can be quite huge. They should be stored by a project package so one does not have to read entire json to get a subset of data contained inside.

Some available artefacts:

  • golang-distribution-snapshot
  • golang-ipprefix-to-rpm
  • golang-project-content-metadata
  • golang-project-distribution-exported-api
  • golang-project-distribution-packages
  • golang-project-exported-api
  • golang-project-packages
  • golang-project-repository-commit
  • golang-project-repository-info

The artefacts can be broken down into the following categories:

  • project meta data:
    • golang-project-content-metadata
    • golang-project-repository-commit
    • golang-project-repository-info
  • snapshots:
    • golang-distribution-snapshot
    • golang-upstream-snapshot (TBD)
  • project data:
    • golang-project-distribution-exported-api
    • golang-project-distribution-packages
    • golang-project-exported-api
    • golang-project-packages
  • distribution:
    • golang-ipprefix-to-rpm
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant