Skip to content

Commit

Permalink
WIP: refactoring of cache system, renaming and docs
Browse files Browse the repository at this point in the history
  • Loading branch information
ddnexus committed Dec 11, 2024
1 parent 9100233 commit b9bb703
Show file tree
Hide file tree
Showing 12 changed files with 187 additions and 217 deletions.
10 changes: 5 additions & 5 deletions docs/api/keyset.md
Original file line number Diff line number Diff line change
Expand Up @@ -52,7 +52,7 @@ standard `pagy_*nav` helpers
| `cutoff` | The point beyond the last record of a `page`. <br/>(It's the encoded string of the `keyset attributes` of the last record in a page) |
| `page` | The current `page`, i.e. the page of records fetched beyond the `cutoff` of the **previous page**. |
| `next` | The next `page`, i.e. the page of records fetched beyond the `cutoff` of the **current page**. |
| `filter_params` | The hash of `filter_params` used to filter the page records beyond the `cutoff` |
| `filter_args` | The hash of `filter_args` used to filter the page records beyond the `cutoff` |

### Keyset or Offset pagination?

Expand Down Expand Up @@ -144,7 +144,7 @@ If you need a specific order:

- You pass an `uniquely ordered` `set` and `Pagy::Keyset` pulls the `:limit` of records of the first page.
- It requests the `next` URL by setting its `page` query string param to the `cutoff` of the current page.
- At each request, the new `:page` (i.e. the `cutoff` of the previous page) is decoded into DB `filter_params` that are coupled
- At each request, the new `:page` (i.e. the `cutoff` of the previous page) is decoded into DB `filter_args` that are coupled
with a `where` filter query, and the `:limit` of new records is pulled.
- You know that you reached the end of the collection when `pagy.next.nil?`.

Expand Down Expand Up @@ -198,12 +198,12 @@ Default `nil`.

**Use this for DB-specific extra optimizations, if you know what you are doing.**

If the `:filter_records` variable is set to a lambda, pagy will call it with the `set`, `filter_params` and `keyset` arguments
If the `:filter_records` variable is set to a lambda, pagy will call it with the `set`, `filter_args` and `keyset` arguments
instead of using its auto-generated query to filter the records. It must return the filtered set. For example:

```ruby
filter_records = lambda do |set, filter_params, keyset|
set.where(my_optimized_query(keyset), **filter_params)
filter_records = lambda do |set, filter_args, keyset|
set.where(my_optimized_query(keyset), **filter_args)
end

Pagy::Keyset(set, filter_records:)
Expand Down
147 changes: 26 additions & 121 deletions docs/api/keyset_numeric.md
Original file line number Diff line number Diff line change
Expand Up @@ -7,24 +7,20 @@ category:

# Pagy::Keyset::Numeric

A [Pagy::Keyset](keyset.md) subclass with numeric `page`s supporting `pagy_*nav` and other Frontend helpers.
A [Pagy::Keyset](keyset.md) subclass with numeric pages supporting `pagy_*nav` and the other Frontend helpers.

!!!warning Experimental: the API might change in minor versions
!!!

## Overview

The keyset/cursor pagination technique is the fastest technique able to pull a page of records from a relational DB, but it cannot
refer to pages as numbers.

All the Frontend helpers, like the `pagy_nav`, work only with page number, so... `Pagy::Keyset` cannot use them.

The `Pagy::Keyset::Numeric` subclass uses the fastest technique for the DB and works with the usual user-friendly helpers. It may
require a slightly more involved setup, but it offers the best performance AND functionality together.
`Pagy::Keyset` uses the fastest technique for pagination, but it cannot work with any Frontend helper because they require numeric
pages. That's why we created `Pagy::Keyset::Numeric`: it uses keyset pagination AND supports `pagy_*navs` and the other Frontend
helpers.

!!!
The API is documented here, however you may prefer to use the [keyset_numeric extra](/docs/extras/keyset_numeric)
wrapper to easily integrate it with your app.
The API is documented here, however you should use the [keyset_numeric extra](/docs/extras/keyset_numeric)
wrapper to handle the cache and easily integrate it with your app.

You should also familiarize with the [Pagy::Keyset](keyset.md) class.
!!!
Expand All @@ -35,62 +31,32 @@ You should also familiarize with the [Pagy::Keyset](keyset.md) class.

Integrates the [Keyset Glossary](keyset_numeric.md#glossary)

| Term | Description |
|-----------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| `keyset numeric pagination` | The pagy exclusive technique to use `keyset pagination` with numeric pages, supporting `pagy_*navs` and other Frontend helpers. The best technique for performance AND functionality! |
| `page` | The current page **number** |
| `cutoffs snapshot` | The conceptual snapshot of all the `cutoff`s that split the `set` into pages. Each one identifies a point beyond a specific record in the `set`. |
| `cutoffs` | The cached known `cutoff` variables used for the visited (numeric) pages. |
| `cache` | Hash-like object (responding to `:[]=` and `[]`) that should persist between requests.The [keyset_numeric extra](/docs/extras/keyset_numeric) sets it to the `session` object by defult. Pagy uses it to map `cutoffs` to page numbers. |
| `cache_key` | The string (or `Proc` returning it), deterministically related with a `cutoff snapshot`. |
| Term | Description |
|-----------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| `keyset numeric pagination` | The pagy exclusive technique to use `keyset pagination` with numeric pages, supporting `pagy_*navs` and the other Frontend helpers.<br/>The best technique for performance AND functionality! |
| `page` | The current page **number** |
| `cutoffs` | The known `cutoff` variables used to keep track of the visited pages. |

### How Pagy::Keyset::Numeric works

This `Pagy::Keyset` subclass adds a cached layer of `cutoffs` that are mapped to page number as they get visited/discovered.

Here is how it works:

- You pass an `uniquely ordered` `set` and `Pagy::Keyset` pulls the `:limit` of records of the first page (exacly like its super
class does), however it stores the `cutoff` of the current page in the cache.
- It requests the `next` URL by setting its `page` query string param to the current page + 1 (as it happens in offset pagination)
- At each request, the new `:page` number is used to pull its cached `cutoff` (i.e. the `cutoff` of the previous page) from the
cache, and it's used exactly like its superclass does.
- As the visited page numers are adding up, and become available to the `series` method as in offset pagination, so the related
`cutoffs` get stored in the cache, offering the user the possibility to jump to any of the visited page.
- When the user jump to a visited page, the query to pull the visited page is slightly different to ensure accuracy and
consistency. See the `Pagy::Keyset::Numeric#filter_records_sql` code and comments for details.
- You know that you reached the end of the collection when `pagy.next.nil?`.

## Setup

This section integrates the [Keyset Setup](keyset.md#setup).

### Set up the `:cache` variable

This class requires a `:cache` variable set to a hash-like object (responding to `:[]=` and `[]`) that should persist between
requests. If you use the [keyset_numeric extra](/docs/extras/keyset_numeric) wrapper, it will set it to the `session` object by
default, however you can pass any persisted hash-like object of your preference.
This `Pagy::Keyset` subclass adds a layer of `cutoffs` that are mapped to page numbers when they get visited/discovered.

Notice that besides writing and reading from it, Pagy does not expire nor handle the cache in any way. Your app should manage it
like it does with the `session` object.
The `cutoffs` must be cached between requests for this class to work, however the cache is not handled by this class at all: that is a concern of the [keyset_numeric extra](/docs/extras/keyset_numeric). Here is a simplified example of what happens with the cutoff at each request:

!!!danger Do not use the cookie based session as cache

With the default cookie-based `session`, depending on the size of your cutoffs, your session cookie will likely overflow the 4k
max size. You should probably use some other storage for the session (e.g. `ActiveRecord::SessionStore`).

!!!primary Notice
```ruby
cutoffs = get_cutoffs_from_cache(key)
pagy = Keyset::Numeric.new(set, cutoffs:, **vars)
write_cutoffs_to_cache(key, pagy.cutoffs)
```

We plan to implement a client-side cache support using the `sessionStorage`.
!!! Notice

Its implementation will require some time, so please, hang tight and cheer for us!
The cache handling is external to the `Pagy::Keyset::Numeric` class for easy overriding. See the [Overriding the cache](/docs/extras/keyset_numeric#overriding-the-cache) for more details.
!!!

!!! success
## Setup

If you would like Pagy to support an object that write/read from the cache in any way other than `:[]=` and `[]`, please open
a [feature request](https://github.com/ddnexus/pagy/discussions/categories/feature-requests).
!!!
See the [Keyset Setup](keyset.md#setup).

## ORMs

Expand Down Expand Up @@ -123,70 +89,9 @@ The `Array` of fetched records for the current page.

See the [Pagy::Keyset variables](keyset.md#variables), plus the following:

==- `:cache`

Mandatory hash-like object (responding to `:[]=` and `[]`) that should persist between requests.
The [keyset_numeric extra](/docs/extras/keyset_numeric) sets it to the `session` object by defult.

==- `:cache_key`

!!!warning
==- `:cutoffs`

Set this only if the default `cache_key` generated and used by pagy causes the pagination to skip pages, raises
`Pagy::OverflowError` or resets to page 1
!!!

### Understanding the cache_key

<br/>The `cache_key` is a string that must uniquely identify a specific `cutoff snapshot`. Pagy uses it to determine which
paginated set each `page` request belongs to, in order to access the cached data.

Since the `cutoff snapshot` is the product of a certain DB query influenced by different factors, we can simply concatenate ALL
the factors that actually AFFECT the `cutoff snapshot` and we will produce the `cache_key` that uniquely identifies it.

If we miss any factor that AFFECT the `cutoff snapshot` the resulting `cached_key` may stay the same even when the
`cutoff snapshot` has changed between requests, which is a fatal inconsistency for accessing cached data: _this would cause the
pagination to repeat or skip records/pages_.

Notice however that we must INCLUDE ONLY the factors that AFFECT the `cutoff snapshot`. If we include any factor that may change
between requests WITHOUT AFFECTING the `cutoff snapshot`, the resulting `cached_key` may change even when the `cutoff snapshot` is
the same, which is another fatal inconsistency for accessing cached data: _this would cause the cache to switch to a new key
without `cutoffs`, likely generating a Pagy::OverflowError_

### Identifying the factors to include/exclude in the `cache_key`

<br/>The factors that change the `cutoff snapshot`, that must be included in the generation of the `cache_key`, are those
affecting one or more of the following:

- the number of total results/records (i.e. `where`, `:group`, `:having` clauses)
- the order of the records (i.e. the `order` clause)
- the page length (i.e. the `vars[:limit]` set by pagy)

Not every factor that changes the query has an impact on the `cutoff snapshot`. For example: if you change the `select` clause,
there will be no change in the `cutoff snapshot` so you must NOT include it as a factor. On the other hand, a `search` field that
the user can fill IS a factor, because the resulting records may be different, and by consequence the`cutoff snapshot`.

#### How to create your own `:cache_key`

<br/>Pagy create a default `cache_key` for you by generating a `SHA2` hexdigest out of the concatenation of:

- the selective sql of your `set` (i.e. only the `where`, `order`, `group` and `having`)
- the `vars[:limit]` variable

If that doesn't work for you, you should create your own. For example:

```ruby
# If you have just a "search" field, the :cache_key may be assigned like:
cache_key = ->(_vars) { params(:search) }

# If you have also other fields
# and a possibly changing :limit and order (:keyset) variables:
cache_key = lambda do |vars|
[params.slice(:search, :category, :year, ...), # search params
vars.slice(:limit, :keyset), # vars that change the page records
].to_json
end
```
Mandatory variable that should persist between requests.

==- `:max_pages`

Expand All @@ -196,8 +101,8 @@ Paginate only `:max_pages` ignoring the rest.

## Attribute Readers

`cache_key`, `in`, `last`, `limit`, `next`, `page`, `prev`, `vars`
`cutoffs`, `in`, `last`, `limit`, `next`, `page`, `prev`, `vars`

## Troubleshooting

See the [Pagy::Keyset Troubleshooting](keyset.md#variables)
See the [Pagy::Keyset Troubleshooting](keyset.md#troubleshooting)
2 changes: 1 addition & 1 deletion docs/extras/keyset.md
Original file line number Diff line number Diff line change
Expand Up @@ -13,7 +13,7 @@ Paginate with the Pagy keyset pagination technique.

## Overview

This is a tiny wrapper around the [Pagy::Keyset API](/docs/api/keyset.md). Please refer to the class documentation for a
This is a wrapper around the [Pagy::Keyset API](/docs/api/keyset.md) that handles the required cache. Please refer to the class documentation for a
fuller understanding of keyset pagination:

[!ref Keyset Pagination: Concepts and Overview](/docs/api/keyset.md)
Expand Down
57 changes: 42 additions & 15 deletions docs/extras/keyset_numeric.md
Original file line number Diff line number Diff line change
Expand Up @@ -13,6 +13,8 @@ Paginate with the Pagy Keyset Numeric pagination technique.

## Overview

Not just a wrapper!

This is a tiny wrapper around the [Pagy::Keyset::Numeric API](/docs/api/keyset_numeric). Please refer to the class documentation
for a fuller understanding of keyset pagination:

Expand All @@ -28,24 +30,37 @@ This section integrates the [Keyset Extra Synopsis](/docs/extras/keyset.md)
```ruby Controller (action)
# Basic defaults (uses the session object as the cache)
@pagy, @records = pagy_keyset_numeric(set)
```
```ruby ApplicationController
# Optional custom cache read/write
def pagy_cache_read(key) = my_custom_cache(key)
def pagy_cache_write(key, value) = my_custom_cache(key, value)

# Optional custom session :cache hash
@pagy, @records = pagy_keyset_numeric(set, cache: my_persistent_hash)

# CUstom cache_key
# If you have just a "search" field, the :cache_key may be assigned like:
cache_key = ->(_vars) { params(:search) }

# If you have also other fields
# and a possibly changing :limit and order (:keyset) variables:
cache_key = lambda do |vars|
[ params.slice(:search, :category, :year, ...), # search params
vars.slice(:limit, :keyset), # vars that change the page records
].to_json
end
@pagy, @records = pagy_keyset_numeric(set, cache_key:)
# optional custom cache_key
def pagy_cache_new_key = my_custom_cache.generate_key
```

### Overriding the cache


!!!warning

Besides writing and reading from it, Pagy does not expire nor handle the cache in any way. Your app should manage it like it does
with the `session` object.
!!!

!!!danger Do not use the cookie based session as the cache

Your session cookie will likely overflow the 4k max size. You should probably use some other storage for the session (e.g.
`ActiveRecord::SessionStore`).

!!!primary Notice

We plan to implement a client-side cache support using the `sessionStorage`.

Its implementation will require some time, so please, hang tight and cheer for us!
!!!

## Variables

See the [Pagy::Keyset::Numeric variables](/docs/api/keyset_numeric#variables)
Expand All @@ -56,4 +71,16 @@ See the [Pagy::Keyset::Numeric variables](/docs/api/keyset_numeric#variables)

This method is similar to the `pagy` (for offset pagination) method. It returns the `pagy` object and the array of `records` pulled from the DB.

==- `pagy_cache_read(key)`

This method is similar to the `pagy` (for offset pagination)

==- `pagy_cache_write(key, value)`

This method is similar to the `pagy` (for offset pagination)

==- `pagy_cache_new_key`

This method is similar to the `pagy` (for offset pagination)

===
2 changes: 1 addition & 1 deletion gem/apps/keyset_numeric.ru
Original file line number Diff line number Diff line change
Expand Up @@ -34,7 +34,7 @@ end
require 'pagy/extras/limit'
require 'pagy/extras/keyset_numeric'
require 'pagy/extras/pagy'
Pagy::DEFAULT[:limit] = 5
Pagy::DEFAULT[:limit] = 4
Pagy::DEFAULT.freeze

# Sinatra setup
Expand Down
7 changes: 7 additions & 0 deletions gem/lib/pagy/b64.rb
Original file line number Diff line number Diff line change
Expand Up @@ -5,6 +5,13 @@ class Pagy # :nodoc:
module B64
module_function

CHARS = [*'0'..'9', *'a'..'z', *'A'..'Z', '_', '-'].freeze

def convert(int)
digits = int.digits(64).reverse
digits.map { |i| CHARS[i] }.join
end

def encode(bin)
[bin].pack('m0')
end
Expand Down
Loading

0 comments on commit b9bb703

Please sign in to comment.