Skip to content

Obsidian vault for Database and Analytics

Notifications You must be signed in to change notification settings

kubow/datastore

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

General repository about data related technologies

Covered data storing concepts

  • Higher architecture and concepts
  • SQL
    • Relational DB (OLTP)
    • Analytical DB (OLAP)
  • NoSQL
    • Key-Value DB
    • Graph DB
    • Document DB
  • On-Premise / Cloud / Hybrid

Covered data engines *

* Following product types are included:


Data Storing Engines Specific Data Tools
Amazon stores Data Discovery
Apache (multiple products) - SchemaCrawler (DB schema discovery & comprehension tool - github)
ClickHouse - SodaSQL (data testing and monitoring - documentation)
CockroachDB Data Processing
CrateDB - Apache Airflow (orchestration)
Databricks - Talend DataCleaner (Profiling & Cleansing) https://sourceforge.net/projects/datacleaner/
DataWatch - OpenRefine
DuckDB - Meltano (data extracting)
Elasticsearch - dbt (data transformation)
Google - dlt
IBM - Kestra
InfluxDB
JSON (standalone / JSON native db) - y42
MariaDB
MongoDB Data Analysis & Reporting
Microstrategy
Microsoft
Minio
Neo4J
Oracle
Pentaho
PostgreSQL
Qlik
Redis
Salesforce
SAP Data Monitoring
SingleStore
Snowflake - HP OpenView (Rep Agent compatible)
SQLite - IBM Tivoli
Teradata - Ignite
TDEngine - BMC
Vertica - Bradmark http://www.bradmark.com/
XML (standalone / XML native db)

Universal Data Clients

  • Data Grip
  • DBeaver
  • SquirelSQL

Universal Database tweakers

Model structure for any product

  • System
    • Architecture
    • Product fundamentals
  • Install
    • Preparation, installation
    • Editions
    • Licensing
    • Versions
    • Upgrading
  • Maintenance
    • Operational Management
    • Monitoring
    • Security
    • Backup / Recovery

Engines overview database description (sqlite3 database source)

view "engine_overview" with columns for

  • engine name
  • developer (language used + website)
  • engine category (and storage type)
    • basic categories (SQL, NoSQL, graph, key-value, document, time series)
    • storage types (trasnactional, analytical, integration, data warehousing)
  • supported operating systems (win, unix, linux, mac)
  • security rating
  • deployment model
  • sourcing model (open-source?, free version available?)

Useful resources

Embedded databases (1): The harmony of DuckDB, KùzuDB and LanceDB | The Data Quarry Embedded databases (2): KùzuDB, an extremely fast OLAP graph database | The Data Quarry

Books

Principles of Database Management: The Practical Guide to Storing, Managing and Analyzing Big and Small Data Database in Depth: Relational Theory for Practitioners What is High Availability? The Ultimate Guide | Percona

Seven Databases in Seven Weeks Segmentation Fault - A DBA Perspective

Free sources

Index of /~database/documents @ University of Oklahoma

UI bakery sample databases Datasets - Data | World Resources Institute GitHub - jOOQ/sakila: The Sakila Database

Datové sady - Národní katalog otevřených dat (NKOD) The MONDIAL Database

Web sites

Knowledge Base of Relational and NoSQL DBMS