forked from URI-ABD/clam
Added MSA and separated Metric from Dataset (URI-ABD#226)
* feat: added msa and separated metric from dataset
* docs: updated main README
* fmt: python formatting
* fix: corrected overlapp_with method for SquishyBall
* feat: added min and max methods for Number trait
* wip: disconnecting msa and pancakes
Showing 217 changed files with 17,384 additions and 8,121 deletions.
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,16 @@ | ||
[package] | ||
name = "bench-cakes" | ||
version = "0.1.0" | ||
edition = "2021" | ||
|
||
[dependencies] | ||
clap = { version = "4.5.16", features = ["derive"] } | ||
bench-utils = { path = "../utils" } | ||
ftlog = { workspace = true } | ||
bitcode = { workspace = true } | ||
abd-clam = { workspace = true, features = ["disk-io"] } | ||
distances = { workspace = true } | ||
rand = { workspace = true } | ||
rayon = { workspace = true } | ||
stringzilla = "3.9.5" | ||
augurs-dtw = { version = "0.8.1", features = ["parallel"] } |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,40 @@ | ||
# Benchmarks for CAKES Search Algorithms | ||
|
||
This crate provides a CLI to run benchmarks for the CAKES search algorithms and reproduce the results from our paper. | ||
|
||
## Reproducing the Results | ||
|
||
Let's say you have data from the [ANN-Benchmarks suite](https://github.com/erikbern/ann-benchmarks?tab=readme-ov-file#data-sets) in a directory `../data/input` and you want to run the benchmarks for the CAKES search algorithms on the `sift` dataset. | ||
You can run the following command: | ||
|
||
```bash | ||
cargo run --release --package bench-cakes -- \ | ||
--inp-dir ../data/input/ \ | ||
--dataset sift \ | ||
--out-dir ../data/output/ \ | ||
--seed 42 \ | ||
--num-queries 10000 \ | ||
--max-power 7 \ | ||
--max-time 300 \ | ||
--balanced-data \ | ||
--permuted-trees | ||
``` | ||
|
||
This will run the CAKES search algorithms on the `sift` dataset with 10000 search queries. | ||
The results will be saved in the directory `../data/output/`. | ||
The dataset will be augmented by powers of 2 up to 2^7. | ||
Each algorithm will be run for at least 300 seconds. | ||
The `--balanced-data` flag will build trees with balanced partitions. | ||
The `--permuted-trees` flag will permute the dataset into depth-first order after building the tree. | ||
|
||
There are several other available options. | ||
Running the following command will provide documentation on how to use the CLI: | ||
|
||
```bash | ||
cargo run --release --package bench-cakes -- --help | ||
``` | ||
|
||
## Plotting the Results | ||
|
||
The outputs from the benchmarks can be plotted using the Python package we provide at `../py-cakes`. | ||
See the associated README for more information. |
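
The README above says the dataset is augmented by powers of 2 up to 2^7 when `--max-power 7` is passed. As a rough illustration of what that range of scale factors looks like, here is a minimal Rust sketch. The helper name `augmentation_factors`, the choice to start at 2^0, and the base size used in `main` are assumptions made for this example; they are not taken from the `bench-cakes` source, which may enumerate or apply the factors differently.

```rust
/// Hypothetical helper (not part of bench-cakes): lists the multipliers
/// 2^0, 2^1, ..., 2^max_power that a `--max-power` setting would imply,
/// assuming the unaugmented dataset (factor 1) is included.
/// `max_power` must be below 64 so the shift fits in a `u64`.
fn augmentation_factors(max_power: u32) -> Vec<u64> {
    (0..=max_power).map(|p| 1_u64 << p).collect()
}

fn main() {
    // Illustrative base size only; substitute the cardinality of your dataset.
    let base_cardinality: u64 = 1_000_000;

    for factor in augmentation_factors(7) {
        // Each augmented copy of the dataset would hold `base * factor` items.
        println!("x{factor}: {} items", base_cardinality * factor);
    }
}
```

Under these assumptions, `--max-power 7` corresponds to eight dataset sizes, the largest being 128 times the original, which is worth keeping in mind when estimating memory and run time.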