Skip to content

Releases: Clay-foundation/model

v1.0

06 Jun 10:55
85e821c
Compare
Choose a tag to compare

Release v1.0 (2024/06/06)

💫 Highlights

  • 🎉 Release of Clay Foundation Model v1.0 🎉

🚀 Model

🗃️ Data Pipeline

  • Moved data pipeline code into stacchip a dedicated library for the data wrangling

📖 Documentation

🧰 Maintenance

🛠️ v0.2 updates

🧑‍🤝‍🧑 New Contributors

Full Changelog: v0.0.1...v1.0

v0.0.1

12 Jan 06:21
0145e55
Compare
Choose a tag to compare
v0.0.1 Pre-release
Pre-release

Release v0.0.1 (2024/01/12)

💫 Highlights

  • 🎉 First release of Clay Foundation Model 🎉
  • Implement MAE with support for position, time, latlon & channel embeddings (#47)

🚀 Model

  • Generate embeddings from CLAYModule trained with latlon/time encodings (#96)
  • Add multigpu support & UI to test embeddings (#109)
  • Refactor model for multi-device usage and easier disabling of masking (#95)
  • Rename embeddings file to include MGRS code and store GeoTIFF source_url (#86)
  • Callback function to log Masked Autoencoder reconstructions to WandB (#88)
  • Adapt model to load 512x512 images from s3 bucket (#85)
  • Save embeddings with spatiotemporal metadata to GeoParquet (#73)
  • Generate embeddings via prediction loop (#56)
  • Initial Vision Transformer architecture with MAE decoder (#37)

🗃️ Data Pipeline

  • Adapted sampling strategy (#81)
  • Allow ClayDataModule to load GeoTIFF files directly from s3 (#92)
  • Let ClayDataModule return same spatiotemporal fields as GeoTIFFDataModule (#91)
  • Improve date handling for data pipeline (#76)
  • Let LightningDataModule return spatiotemporal metadata (#66)
  • check for no data on a tile level in sentinel 1 vv and vh, sentinel 2 and DEM (#60)
  • Batch setup (#54)
  • LightningDataModule to load GeoTIFF files (#52)
  • Ready for batch (#44)
  • Tiler module (#41)
  • Landcover based sampling strategy (#29)
  • Datacube (#27)

📖 Documentation

  • Document how the benchmark dataset labels were prepared (#100)
  • Document how to finetune pretrained model on downstream task (#99)
  • Document how to generate vector embeddings (#98)
  • Document how to run the datacube pipeline with a batch job (#97)
  • Initialize Jupyter Book documentation (#89)
  • Setting the model license to OpenRail-M (#63)
  • Create CODE_OF_CONDUCT.md (#53)

🧰 Maintenance

  • Bump pytorch from 2.0.0 to 2.1.0, CUDA from 11.8 to 12.0 (#51)
  • Add pre-commit hooks with ruff formatter/linter rules (#26)
  • Setup GitHub Actions Continuous Integration tests (#25)
  • Setup LightningCLI trainer script (#24)
  • Initial conda environment and binder links (#15)

🧑‍🤝‍🧑 Contributors

Full Changelog: v0.0.0...v0.0.1