Skip to content

A sophisticated data pipeline that creates meaningful collections of ENS names.

Notifications You must be signed in to change notification settings

namehash/collection-templates

Repository files navigation

Collections Templates Pipeline

A sophisticated data pipeline that creates meaningful collections of ENS domain names by analyzing Wikipedia and Wikidata knowledge graphs. The pipeline discovers relationships between names based on their conceptual connections - whether they belong to the same category, share similar attributes, or are commonly associated together.

Overview

The pipeline processes Wikipedia and Wikidata dumps through multiple stages:

  1. Extracts relevant data about categories, lists and their members
  2. Maps relationships between entities across Wikipedia and Wikidata
  3. Merges related collections and enriches them with metadata like:
    • Popularity scores
    • Domain registration status
    • Collection avatars and banners
  4. Computes recommendations between related collections
  5. Makes collections searchable through Elasticsearch in a schema used by NameGraph

The system also handles user-created collections, enriching them with the same metadata and making them discoverable alongside Wikipedia-derived collections.

This creates an intuitive way for users to explore and find meaningful domain names based on their interests and use cases, leveraging the vast knowledge encoded in Wikipedia and Wikidata.

Example

Wikipedia Category of Pink Floyd Albums gets processed into a collection named Pink Floyd Albums. Some of the members of this collection are:

NameGraph [Github]

Building on this collections templates pipeline, NameGraph empowers ENS registrar apps to build new name discovery user experiences. Surf more than 21 million name ideas across more than 400,000 name collections, or generate infinite related name suggestions.

Visit NameGraph at namegraph.dev and shoot for the moon!

alt text

Documentation

Take a closer look at the way collections templates pipeline works in docs.