Skip to content

Commit

Permalink
Merge pull request #2257 from moj-analytical-services/fix_bullets
Browse files Browse the repository at this point in the history
Fix bullets in readme.md
  • Loading branch information
RobinL authored Jul 16, 2024
2 parents 370c9cd + fea6118 commit 330520f
Showing 1 changed file with 5 additions and 5 deletions.
10 changes: 5 additions & 5 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -16,11 +16,11 @@ Splink is a Python package for probabilistic record linkage (entity resolution)

## Key Features

**Speed:** Capable of linking a million records on a laptop in around a minute.
🎯 **Accuracy:** Support for term frequency adjustments and user-defined fuzzy matching logic.
🌐 **Scalability:** Execute linkage in Python (using DuckDB) or big-data backends like AWS Athena or Spark for 100+ million records.
🎓 **Unsupervised Learning:** No training data is required for model training.
📊 **Interactive Outputs:** A suite of interactive visualisations help users understand their model and diagnose problems.
**Speed:** Capable of linking a million records on a laptop in around a minute.<br>
🎯 **Accuracy:** Support for term frequency adjustments and user-defined fuzzy matching logic.<br>
🌐 **Scalability:** Execute linkage in Python (using DuckDB) or big-data backends like AWS Athena or Spark for 100+ million records.<br>
🎓 **Unsupervised Learning:** No training data is required for model training.<br>
📊 **Interactive Outputs:** A suite of interactive visualisations help users understand their model and diagnose problems.<br>

Splink's linkage algorithm is based on Fellegi-Sunter's model of record linkage, with various customisations to improve accuracy.

Expand Down

0 comments on commit 330520f

Please sign in to comment.