Add iterable concept #219

tcbrindle · 2024-11-26T17:17:43Z

This PR adds a new iterable concept and updates various algorithms and adaptors to use it.

An iterable is, as the name suggests, something we can iterate over -- specifically, by using internal iteration. The required interface is:

struct sequence_traits<T> {
    auto element_type(T& iterable) -> E;

    template <typename T, typename Pred>
        requires std::predicate<Pred&, E>
    auto iterate(T& iterable, Pred&& pred) -> bool;
};

This is basically the existing for_each_while abstracted out into its own concept, returning a bool to indicate whether iteration was completed successfully rather than a cursor.

Once iterate() has returned, it is unspecified whether a second call to iterate() will start again from the beginning, carry on where it left off, do nothing, or do something else. Iterables are strictly weaker than sequences: that is, every sequence is an iterable, but not vice versa. Nonetheless, a large number of algorithms work on iterables (just about anything that can be written as a short-circuiting fold), and most of the existing sequence adaptors can work on iterables as well.

Since iterables only use internal iteration, there is no external state (i.e. a cursor) which could be invalidated. Not only does this make the iterable interface easier to implement for some types, but most importantly it means that we can provide a safe default implementation of the iterable protocol for all C++20 ranges.

This dramatically improves our interoperability with ranges, while the use of internal iteration means that converting ranges pipelines to the equivalent Flux code may yield some nice performance benefits.

codecov · 2024-11-26T17:45:10Z

Codecov Report

Attention: Patch coverage is 97.89790% with 7 lines in your changes missing coverage. Please review.

Project coverage is 98.40%. Comparing base (f539c11) to head (b5587e2).

Files with missing lines	Patch %	Lines
include/flux/algorithm/starts_with.hpp	75.00%	4 Missing ⚠️
include/flux/adaptor/scan.hpp	98.03%	1 Missing ⚠️
include/flux/core/concepts.hpp	91.66%	1 Missing ⚠️
include/flux/core/sequence_iterator.hpp	66.66%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #219      +/-   ##
==========================================
- Coverage   98.60%   98.40%   -0.21%     
==========================================
  Files          69       70       +1     
  Lines        2578     2629      +51     
==========================================
+ Hits         2542     2587      +45     
- Misses         36       42       +6

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

An iterable is an object which can perform internal iteration, accepting a predicate which indicates if the iteration should terminate early. Iterables need to supply an `iterate(it, pred)` function, and an `element_type(it)` function whose return type indicates the element type of the sequence. (For C++-y reasons this is a function rather than a plain alias like `using element_type = int`). Every sequence is iterable, but we can also have iterables which are not sequences. In particular, every range can safely be iterable, because iterators are never exposed and so we never need to worry about invalidation, i.e. ```cpp auto iterate(std::ranges::input_range auto&& rng, auto Pred) -> bool { for (auto&& elem : rng) { if (!pred(FWD(elem)) { return false; } } return true; } ```

Contiguous + sized ranges are still contiguous, sized, bounded sequences

Seems like a good place to start

...rather than sequences. These are the algorithms which perform a single pass from the start of a sequence up to some end point, without needing to restart again. Which is actually a suprising number of algorithms. (Although not flux::to yet, because that's more complicated.)

Also const_iterable_sequence -> const_iterable And make `flux::ref` work for iterables rather than just sequences

The current std::invocable and std::regular_invocable concepts are a bit of a pain to use. This commit adds three new callable concepts instead, each of which is specified with a signature argument: * `func_once` is a callable that will be invoked at most once * `func_mut` may be called more than once, and may hold and modify internal state * `func` may be called more than once, but may not modify state: it must be equality preserving, like `std::regular_invocable` Rather than variadic template arguments, these concepts use function signatures instead, so you say something like template <typename F> requires func<F, void(int, float)> auto g(F f); or just auto g(func<void(int, float)> auto f)

We already use this in the documentation, it's probably a good idea to make it exist in reality too

Like `adaptable_sequence`, this accepts either rvalue iterables or trivially copyable lvalue iterables, and (as the name suggests) is intended for use in sink functions, allowing the user to pass (presumably) cheap-to-copy types effectively by value, while requiring an explicit copy or move for things like std::vector.

This is a lot of changes

...and also filter_deref()

That was actually easier than expected

That was more work than expected

The final boss... In theory, we could make `cartesian_product` and `cartesian_product_map` iterable when the first argument is a non-sequence iterable, but that doesn't seem worth the effort.

Somehow the changes in this PR triggered an ICE in GCC 12 while compiling `reverse_adaptor`... even though we haven't actually made any changes to it at all (other than changing the name of a function). It's a mystery, but hopefully no longer a problematic one.

For some reason MSVC really doesn't like the static consteval auto element_type(auto& self) -> element_t<decltype((self.base_))>; formulation that we've been using in various adaptors. Fortunately, the very first thing I tried, namely using a template parameter rather than an auto param, seems to fix it, i.e. template <typename Self> static consteval auto element_type(Self& self) -> element_t<decltype((self.base_))>; even though the two formulations should be completely identical in meaning.

Ooops

The new name is even worse than the old one. I really need to think of something better. Naming is hard...

Sequence flatten requires both outer and inner to be sequences

tcbrindle added 29 commits November 26, 2024 19:25

Make all input ranges iterable

7bd0dd0

Contiguous + sized ranges are still contiguous, sized, bounded sequences

Make for_each operate on iterables

b8f12f5

Seems like a good place to start

Modify flux::to to use iterables

975797d

read_only_sequence -> read_only_iterable

5970c5c

Also const_iterable_sequence -> const_iterable And make `flux::ref` work for iterables rather than just sequences

Add predicate_for<Iterable> concept

66733ab

We already use this in the documentation, it's probably a good idea to make it exist in reality too

Use new iterate() in TCO example

09bfa83

Allow flux::from() to accept iterables

1419a30

Allow filter adaptor to operate on iterables

c688d31

sized_sequence => sized_iterable

c0fb519

This is a lot of changes

Make non-contiguous ranges sized_iterables

a21b95f

Make flux::map() work on iterables

847f668

Make passthrough_traits_base support iterables

3067fbe

Make drop() work on iterables

d7b227e

Make take() work on iterables

ab78282

Make flatten() work on iterables

36442ad

Make chain() work on iterables

b0bbfb4

Make drop_while() work with iterables

80dd9e1

Make filter_map() work with iterables

667c9f6

...and also filter_deref()

Make flatten_with() work with iterables

4deb5bd

That was actually easier than expected

Add missing <ranges> includes in tests

2bda737

Make read_only() work with iterables

6628427

Make scan() and prescan() work with iterables

6a2b2ed

Make scan_first() work with iterables

53da08b

Make stride() work with iterables

5efec93

That was more work than expected

Make take_while() work with iterables

d1a1bf4

tcbrindle added 3 commits November 26, 2024 19:25

Add iterate() impl to cartesian_base

8616f55

The final boss... In theory, we could make `cartesian_product` and `cartesian_product_map` iterable when the first argument is a non-sequence iterable, but that doesn't seem worth the effort.

Update .clang-format

b0531a7

Fix scan test with Clang 18

872fd8d

tcbrindle force-pushed the pr/iterable branch from 88ee775 to 872fd8d Compare November 26, 2024 19:25

tcbrindle added 13 commits November 26, 2024 20:26

Work around GCC 12 ICE

fb4bce7

Somehow the changes in this PR triggered an ICE in GCC 12 while compiling `reverse_adaptor`... even though we haven't actually made any changes to it at all (other than changing the name of a function). It's a mystery, but hopefully no longer a problematic one.

Re-enable constexpr starts_with() tests

9883fa6

Ooops

Rename sequence_traits -> iter_traits

be4186c

Rename flux_sequence_traits -> flux_iter_traits

f45c4f7

Rename inline_sequence_base -> inline_iter_base

1822f09

The new name is even worse than the old one. I really need to think of something better. Naming is hard...

Update .gitignore

2496980

Flatten: avoid hard error with non-sequence inner

588ef04

Sequence flatten requires both outer and inner to be sequences

Correctly constrain inline_iter_base member fns

a3f7a4e

Merge branch 'main' into pr/iterable

d74b787

Remove erroneous #include

8fe8c9d

Fix take::iterate() for RA + Bounded

788dd5e

Use read_at_unchecked() in default iterate() impl

b5587e2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add iterable concept #219

Add iterable concept #219

tcbrindle commented Nov 26, 2024

codecov bot commented Nov 26, 2024 •

edited

Loading

Add iterable concept #219

Are you sure you want to change the base?

Add iterable concept #219

Conversation

tcbrindle commented Nov 26, 2024

codecov bot commented Nov 26, 2024 • edited Loading

Codecov Report

codecov bot commented Nov 26, 2024 •

edited

Loading