Automa.CodeGenContextType

Create a CodeGenContext (ctx), a struct that stores options for Automa code generation. Ctxs are used for Automa's various code generator functions. They currently take the following options (more may be added in future versions).

Example

julia> ctx = CodeGenContext(generator=:goto, vars=Variables(buffer=:tbuffer));
 
 julia> generate_code(ctx, compile(re"a+")) isa Expr
true
source
Automa.VariablesType

Struct used to store variable names used in generated code. Contained in a CodeGenContext. Create a custom Variables for your CodeGenContext if you want to customize the variables used in Automa codegen, typically if you have conflicting variables with the same name.

Automa generates code with the following variables, shown below with their default names:

  • p::Int: current position of data
  • p_end::Int: end position of data
  • is_eof::Bool: whether p_end marks the end of the file stream
  • cs::Int: current state
  • data::Any: input data
  • mem::SizedMemory: Memory wrapping data
  • byte::UInt8: current byte being read from data
  • buffer::TranscodingStreams.Buffer: (generate_reader only)

Example

julia> ctx = CodeGenContext(vars=Variables(byte=:u8));
 
 julia> ctx.vars.byte
:u8
source

Finally, when we reach the newline p = 13, the whole header is in the buffer, and so data[@markpos():p-1] will correctly refer to the header (now, 1:12).

content: abcdefghijkl\nA
 mark:    ^
p = 13               ^

Remember to update the mark, or to clear it with @unmark() in order to be able to flush data from the buffer afterwards.
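For example, action code for a streaming reader that extracts a header could combine the pseudomacros like this (a minimal sketch: the action names :mark_header and :emit_header and the surrounding machine are assumptions for illustration):

actions = Dict{Symbol, Expr}(
    # On entering the header, mark its first byte so it stays in the buffer
    :mark_header => :(@mark()),
    # On exiting the header, bytes markpos():p-1 are guaranteed to be buffered
    :emit_header => quote
        header = String(data[@markpos():p-1])
        @unmark()  # allow the buffer to be flushed again
    end,
)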

Reference

Automa.generate_readerFunction
generate_reader(funcname::Symbol, machine::Automa.Machine; kwargs...)

Generate a streaming reader function of the name funcname from machine.

The generated function consumes data from a stream passed as the first argument and executes the machine while filling the data buffer.

This function returns an expression object of the generated function. The user needs to evaluate it in the module where the generated function is needed.

Keyword Arguments

  • arguments: Additional arguments funcname will take (default: ()). The default signature of the generated function is (stream::TranscodingStream,), but it is possible to supply more arguments to the signature with this keyword argument.
  • context: Automa's codegenerator (default: Automa.CodeGenContext()).
  • actions: A dictionary of action code (default: Dict{Symbol,Expr}()).
  • initcode: Initialization code (default: :()).
  • loopcode: Loop code (default: :()).
  • returncode: Return code (default: :(return cs)).
  • errorcode: Executed if cs < 0 after loopcode (default error message)

See the source code of this function to see what the generated code looks like.

source
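To illustrate, here is a minimal sketch that counts newlines in a stream; the regex, the action name :check_newline, and the use of NoopStream are choices made for this example, not requirements of the API:

using Automa
using TranscodingStreams

regex = onall!(re"([a-z]*\n)*", :check_newline)
machine = compile(regex)

eval(generate_reader(
    :count_lines,
    machine;
    initcode = :(lines = 0),
    actions = Dict{Symbol, Expr}(:check_newline => :(byte == UInt8('\n') && (lines += 1))),
    returncode = :(return lines),
))

count_lines(NoopStream(IOBuffer("abc\nde\n")))  # expected: 2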
Automa.@escapeMacro
@escape()

Pseudomacro. When encountered during Machine execution, the machine will stop executing. This is useful to interrupt the parsing process, for example to emit a record during parsing of a larger file. p will be advanced as normal, so if @escape is hit on B during parsing of "ABC", the next byte will be C.

source
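A sketch of a typical use: an action that records a hit, then stops the machine early (the action name :stop and the variable found are illustrative, not part of Automa's API):

actions = Dict{Symbol, Expr}(
    :stop => quote
        found = true
        @escape()  # stop the machine here; p still advances past the current byte
    end,
)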
Automa.@markMacro
@mark()

Pseudomacro, to be used with IO-parsing Automa functions. This macro will "mark" the position of p in the current buffer. The marked position will not be flushed from the buffer after being consumed. For example, Automa code can call @mark() at the beginning of a large string, then when the string is exited at position p, it is guaranteed that the whole string resides in the buffer at positions markpos():p-1.

source
Automa.@unmarkMacro
unmark()

Pseudomacro. Removes the mark from the buffer. This allows all previous data to be cleared from the buffer.

See also: @mark, @markpos

source
Automa.@markposMacro
markpos()

Pseudomacro. Get the position of the mark in the buffer.

See also: @mark, @markpos

source
Automa.@bufferposMacro
bufferpos()

Pseudomacro. Returns the integer position of the current TranscodingStreams buffer (only used with the generate_reader function).

Example

# Inside some Automa action code
 @setbuffer()
 description = sub_parser(stream)
p = @bufferpos()

See also: @setbuffer

source
Automa.@relposMacro
relpos(p)

Automa pseudomacro. Return the position of p relative to @markpos(). Equivalent to p - @markpos() + 1. This can be used to mark additional points in the stream when the mark is set, after which their absolute position can be retrieved using abspos(x).

Behaviour is undefined if mark has not yet been set.

Example usage:

# In one action
 identifier_pos = @relpos(p)
 
 # Later, in a different action
identifier = data[@abspos(identifier_pos):p]

See also: @abspos

source
Automa.@absposMacro
abspos(p)

Automa pseudomacro. Used to obtain the actual position of a relative position obtained from @relpos. See @relpos for more details.

Behaviour is undefined if mark has not yet been set.

source
Automa.@setbufferMacro
setbuffer()

Updates the buffer position to match p. The buffer position is synchronized with p before and after calls to functions generated by generate_reader. @setbuffer() can be used to synchronize the buffer before calling another parser.

Example

# Inside some Automa action code
 @setbuffer()
 description = sub_parser(stream)
p = @bufferpos()

See also: @bufferpos

source
Automa.RegExp.onexit!Function
onexit!(re::RE, a::Union{Symbol, Vector{Symbol}}) -> re

Set action(s) a to occur when reading the first byte no longer part of regex re, or if experiencing an expected end-of-file. If multiple actions are set by passing a vector, execute the actions in order.

See also: onenter!, onall!, onfinal!

Example

julia> regex = re"ab?c*";
 
 julia> regex2 = onexit!(regex, :exiting_regex);
 
 julia> regex === regex2
true
source
Automa.RegExp.onall!Function
onall!(re::RE, a::Union{Symbol, Vector{Symbol}}) -> re

Set action(s) a to occur when reading any byte part of the regex re. If multiple actions are set by passing a vector, execute the actions in order.

See also: onenter!, onexit!, onfinal!

Example

julia> regex = re"ab?c*";
 
 julia> regex2 = onall!(regex, :reading_re_byte);
 
 julia> regex === regex2
true
source
Automa.RegExp.onfinal!Function
onfinal!(re::RE, a::Union{Symbol, Vector{Symbol}}) -> re

Set action(s) a to occur when the last byte of regex re is read. If re does not have a definite final byte, e.g. re"a(bc)*", where more "bc" can always be added, compiling the regex will error after setting a final action. If multiple actions are set by passing a vector, execute the actions in order.

See also: onenter!, onall!, onexit!

Example

julia> regex = re"ab?c";
 
 julia> regex2 = onfinal!(regex, :entering_last_byte);
 
 julia> regex === regex2
 true
 
 julia> compile(onfinal!(re"ab?c*", :does_not_work))
ERROR: [...]
source
Automa.RegExp.precond!Function
precond!(re::RE, s::Symbol; [when=:enter], [bool=true]) -> re

Set re's precondition to s. Before any state transitions to re, or inside re, the precondition code s is checked to be bool before the transition is taken.

when controls if the condition is checked when the regex is entered (if :enter), or at every state transition inside the regex (if :all)

Example

julia> regex = re"ab?c*";
 
 julia> regex2 = precond!(regex, :some_condition);
 
 julia> regex === regex2
true
source
Automa.generate_codeFunction
generate_code([::CodeGenContext], machine::Machine, actions=nothing)::Expr

Generate init and exec code for machine. This is the default code generator function for creating functions; prefer it over generating init and exec code directly, as it is more convenient. Shorthand for producing the concatenated code of:

  • generate_init_code(ctx, machine)
  • generate_action_code(ctx, machine, actions)
  • generate_input_error_code(ctx, machine) [elided if actions == :debug]

Examples

@eval function foo(data)
     # Initialize variables used in actions
     data_buffer = UInt8[]
     $(generate_code(machine, actions))
     return data_buffer
end

See also: generate_init_code, generate_exec_code

source
Automa.generate_init_codeFunction
generate_init_code([::CodeGenContext], machine::Machine)::Expr

Generate variable initialization code, initializing variables such as p and p_end. The names of these variables are set by the CodeGenContext. If not passed, the context defaults to DefaultCodeGenContext.

Prefer using the more generic generate_code over this function where possible. This function should be used if the initialized data should be modified before the execution code.

Example

@eval function foo(data)
     $(generate_init_code(machine))
     p = 2 # maybe I want to start from position 2, not 1
     $(generate_exec_code(machine, actions))
     return cs
end

See also: generate_code, generate_exec_code

source
Automa.generate_exec_codeFunction
generate_exec_code([::CodeGenContext], machine::Machine, actions=nothing)::Expr

Generate machine execution code with actions. This code should be run after the machine has been initialized with generate_init_code. If not passed, the context defaults to DefaultCodeGenContext.

Prefer using the more generic generate_code over this function where possible. This function should be used if the initialized data should be modified before the execution code.

Examples

@eval function foo(data)
     $(generate_init_code(machine))
     p = 2 # maybe I want to start from position 2, not 1
     $(generate_exec_code(machine, actions))
     return cs
end

See also: generate_init_code, generate_code

source
julia> compile(regex) isa Automa.Machine
true

See also: @re_str, compile

source
Automa.RegExp.@re_strMacro
@re_str -> RE

Construct an Automa regex of type RE from a string. Note that due to Julia's raw string escaping rules, re"\\" means a single backslash, and so does re"\\\\", while re"\\\\\"" means a backslash, then a quote character.

Examples:

julia> re"ab?c*[def][^ghi]+" isa RE
true

See also: RE

source

Theory of regular expressions

Most programmers are familiar with regular expressions, or regex for short. What many programmers don't know is that regex have a deep theoretical underpinning, which regex engines lean on to produce highly efficient code.

Informally, a regular expression can be thought of as any pattern that can be constructed from the following atoms:

  • The empty string is a valid regular expression, i.e. re""
  • Literal matching of a single symbol from a finite alphabet, such as a character, i.e. re"p"

Atoms can be combined with the following operations, if R and P are two regular expressions:

  • Alternation, i.e. R | P, meaning either match R or P.
  • Concatenation, i.e. R * P, meaning match first R, then P.
  • Repetition, i.e. R*, meaning match R zero or more times consecutively.
Note

In Automa, the alphabet is bytes, i.e. 0x00:0xff, and so each symbol is a single byte. A multi-byte character such as Æ is interpreted as the concatenation of two symbols, re"\xc3" * re"\x86". The fact that Automa considers one input to be one byte, not one character, can become relevant if you instruct Automa to complete an action "on every input".

Popular regex libraries include more operations like ? and +. These can trivially be constructed from the above mentioned primitives, i.e. R? is "" | R, and R+ is RR*.
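In Automa's regex API, this reduction can be written out directly. A small sketch (rep is a combinator in the Automa.RegExp submodule):

using Automa

R = re"a"
maybe_R = re"" | R                  # R? is "" | R
plus_R  = R * Automa.RegExp.rep(R)  # R+ is RR*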

Some implementations of regular expression engines, such as PCRE which is the default in Julia as of Julia 1.8, also support operations like backreferences and lookbehind. These operations can NOT be constructed from the above atoms and axioms, meaning that PCRE expressions are not regular expressions in the theoretical sense.

The practical importance of theoretically sound regular expressions is that there exist algorithms that can match regular expressions in O(N) time and O(1) space, whereas this is not true for PCRE expressions, which are therefore significantly slower.

Note

Automa.jl only supports real regex, and as such does not support e.g. backreferences, in order to guarantee fast runtime performance.

To match regex to strings, the regex are transformed to finite automata, which are then implemented in code.

Nondeterministic finite automata

The programmer Ken Thompson, of Unix fame, devised Thompson's construction, an algorithm to construct a nondeterministic finite automaton (NFA) from a regex. An NFA can be thought of as a flowchart (or a directed graph), where one can move from node to node on directed edges. Edges are either labeled ϵ, in which the machine can freely move through the edge to its destination node, or labeled with one or more input symbols, in which the machine may traverse the edge upon consuming said input.

To illustrate, let's look at one of the simplest regex: re"a", matching the letter a:

State diagram showing state 1, edge transition consuming input 'a', leading to "accept state" 2

You begin at the small dot on the right, then immediately go to state 1, the circle marked by a 1. By moving to the next state, state 2, you consume the next symbol from the input string, which must be the symbol marked on the edge from state 1 to state 2 (in this case, an a). Some states are "accept states", illustrated by a double circle. If you are at an accept state when you've consumed all symbols of the input string, the string matches the regex.

Each of the operations that combine regex can also combine NFAs. For example, given the two regex a and b, which correspond to the NFAs A and B, the regex a * b can be expressed with the following NFA:

State diagram showing ϵ transition from state A to accept state B

Note the ϵ symbol on the edge - this signifies an "epsilon transition", meaning you move directly from A to B without consuming any symbols.

Similarly, a | b corresponds to this NFA structure...

State diagram of the NFA for `a | b`

...and a* to this:

State diagram of the NFA for `a*`

For a larger example, re"(\+|-)?(0|1)*" combines alternation, concatenation and repetition and so looks like this:

State diagram of the NFA for `re"(\\+|-)?(0|1)*"`

ϵ-transitions mean that there are states from which there are multiple possible next states, e.g. in the larger example above, state 1 can lead to state 2 or state 8. That's what makes NFAs nondeterministic.

In order to match a regex to a string then, the movement through the NFA must be emulated. You begin at state 1. When a non-ϵ edge is encountered, you consume a byte of the input data if it matches. If there are no edges that match your input, the string does not match. If an ϵ-edge is encountered from state A that leads to states B and C, the machine goes from state A to state {B, C}, i.e. in both states at once.

For example, if the regex re"(\+|-)?(0|1)*" visualized above is matched to the string -11, this is what happens:

  • NFA starts in state 1
  • NFA immediately moves to all states reachable via ϵ transition. It is now in state {2, 3, 5, 7, 8, 9, 10}.
  • NFA sees input -. States {2, 3, 5, 7, 8, 10} do not have an edge with - leading out, so these states die. Therefore, the machine is in state 9, consumes the input, and moves to state 2.
  • NFA immediately moves to all states reachable from state 2 via ϵ transitions, so goes to {3, 4, 5, 7}
  • NFA sees input 1, must be in state 5, moves to state 6, then through ϵ transitions to state {3, 4, 5, 7}
  • The above point repeats, NFA is still in state {3, 4, 5, 7}
  • Input ends. Since state 3 is an accept state, the string matches.

Using only a regex-to-NFA converter, you could create a simple regex engine simply by emulating the NFA as above. The existence of ϵ transitions means the NFA can be in multiple states at once, which adds unwelcome complexity to the emulation and makes it slower. Luckily, every NFA has an equivalent deterministic finite automaton, which can be constructed from the NFA using the so-called powerset construction.
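To make the idea concrete, here is a small self-contained sketch of the powerset construction on a toy NFA representation; it is purely illustrative and unrelated to Automa's internal types:

struct ToyNFA
    eps::Dict{Int, Vector{Int}}                  # ϵ-edges: state => states
    edges::Dict{Tuple{Int, UInt8}, Vector{Int}}  # byte-labeled edges
    accept::Set{Int}
end

# All states reachable from `states` through ϵ-edges alone
function eps_closure(nfa::ToyNFA, states::Set{Int})
    closure = copy(states)
    stack = collect(states)
    while !isempty(stack)
        s = pop!(stack)
        for t in get(nfa.eps, s, Int[])
            t in closure || (push!(closure, t); push!(stack, t))
        end
    end
    return closure
end

# Each DFA state is a set of NFA states; the NFA starts in state 1
function powerset_construction(nfa::ToyNFA)
    start = eps_closure(nfa, Set(1))
    ids = Dict(start => 1)
    transitions = Dict{Tuple{Int, UInt8}, Int}()
    queue = [start]
    while !isempty(queue)
        set = popfirst!(queue)
        for byte in 0x00:0xff
            nxt = Set{Int}()
            for s in set
                union!(nxt, get(nfa.edges, (s, byte), Int[]))
            end
            isempty(nxt) && continue
            nxt = eps_closure(nfa, nxt)
            id = get!(ids, nxt) do
                push!(queue, nxt)  # newly discovered DFA state
                length(ids) + 1
            end
            transitions[(ids[set], byte)] = id
        end
    end
    accepting = Set(id for (set, id) in ids if !isdisjoint(set, nfa.accept))
    return transitions, accepting
end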

Deterministic finite automata

Or DFAs, as they are called, are similar to NFAs, but do not contain ϵ-edges. This means that a given input string has either zero paths through the DFA (if it does not match the regex), or exactly one unambiguous path. In other words, every input symbol must trigger one unambiguous state transition from one state to one other state.

Let's visualize the DFA equivalent to the larger NFA above:

State diagram of the DFA for `re"(\\+|-)?(0|1)*"`

It might not be obvious, but the DFA above accepts exactly the same inputs as the previous NFA. DFAs are way simpler to simulate in code than NFAs, precisely because at every state, for every input, there is exactly one action. DFAs can be simulated either using a lookup table of possible state transitions, or by hardcoding GOTO-statements from node to node when the correct input is matched. Code simulating DFAs can be ridiculously fast, with each state transition taking less than 1 nanosecond, if implemented well.

Furthermore, DFAs can be optimised. Two edges between the same nodes with labels A and B can be collapsed to a single edge with labels [AB], and redundant nodes can be collapsed. The optimised DFA equivalent to the one above is simply:

State diagram of the simpler DFA for `re"(\\+|-)?(0|1)*"`

Unfortunately, as the name "powerset construction" hints, converting an NFA with N nodes may result in a DFA with up to 2^N nodes. This inconvenient fact drives important design decisions in regex implementations. There are basically two approaches:

Automa.jl will just construct the DFA directly, and accept a worst-case complexity of O(2^N). This is acceptable (I think) for Automa, because this construction happens in Julia's package precompilation stage (not on package loading or usage), and because the DFAs are assumed to be constants within a package. So, if a developer accidentally writes an NFA which is unacceptably slow to convert to a DFA, it will be caught in development. Luckily, it's pretty rare to have NFAs that result in truly abysmally slow conversions to DFAs: While bad corner cases exist, they are rarely as catastrophic as the O(2^N) bound would suggest. Currently, Automa's regex/NFA/DFA compilation pipeline is very slow and unoptimized, but, since it happens during precompile time, it is insignificant compared to LLVM compile times.

Other implementations, like the popular ripgrep command line tool, use an adaptive approach: the DFA is constructed on the fly as each symbol is matched, and then cached. If the DFA size grows too large, the cache is flushed. If the cache is flushed too often, the engine falls back to simulating the NFA directly. Such an approach is necessary for ripgrep, because the regex -> NFA -> DFA compilation happens at runtime and must be near-instantaneous, unlike Automa, where it happens during package precompilation and can afford to be slow.

Automa in a nutshell

Automa simulates the DFA by having the DFA create a Julia Expr, which is then used to generate a Julia function using metaprogramming. Like all other Julia code, this function is then optimized by Julia and then LLVM, making the DFA simulations very fast.

Because Automa just constructs Julia functions, we can do extra tricks that ordinary regex engines cannot: We can splice arbitrary Julia code into the DFA simulation. Currently, Automa supports two such kinds of code: actions, and preconditions.

Actions are Julia code that is executed during certain state transitions. Preconditions are Julia code, that evaluates to a Bool value, and which are checked before a state transition. If a precondition evaluates to false, the transition is not taken.
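As a minimal sketch of both features (the names :letter and :allowed are chosen for this example):

using Automa

regex = re"[a-z]+"
onall!(regex, :letter)     # action: run Julia code on every byte of the regex
precond!(regex, :allowed)  # precondition: only enter the regex if `allowed` is true
machine = compile(regex)

@eval function count_letters(data, allowed::Bool)
    n = 0
    $(generate_code(machine, Dict{Symbol, Expr}(:letter => :(n += 1))))
    return n
end

count_letters("hello", true)   # expected: 5
count_letters("hello", false)  # expected: throws, since no transition may be taken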


Token disambiguation

It's possible to create a tokenizer where the different token regexes overlap:

julia> make_tokenizer([re"[ab]+", re"ab*", re"ab"]) |> eval

In this case, an input like ab will match all three regex. Which tokens are emitted is determined by two rules:

First, the emitted tokens will be as long as possible. So, the input aa could be emitted as one token of the regex re"[ab]+", two tokens of the same regex, or two tokens of the regex re"ab*". In this case, it will be emitted as a single token of re"[ab]+", since that makes the first token as long as possible (2 bytes), whereas the other options would only make it 1 byte long.

Second, tokens with a higher index in the input array beat previous tokens. So, a will be emitted as re"ab*", since its index of 2 beats the previous regex re"[ab]+" with index 1, and ab will match the third regex.

If you don't want emitted tokens to depend on these priority rules, you can set the optional keyword unambiguous=true in the make_tokenizer function, in which case make_tokenizer will error if any input text could be broken down into different tokens. However, note that this may cause most tokenizers to error when being built, as most tokenization processes are ambiguous.
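To see the rules in action, the tokenizer defined above can be run on a few inputs. A sketch, assuming the default integer tokens (UInt32, with 0 as the error token):

make_tokenizer([re"[ab]+", re"ab*", re"ab"]) |> eval

# Each element is a (start, length, token) triple
collect(tokenize(UInt32, "aa"))  # one 2-byte token of regex 1 (longest match)
collect(tokenize(UInt32, "a"))   # one 1-byte token of regex 2 (highest index wins the tie)
collect(tokenize(UInt32, "ab"))  # one 2-byte token of regex 3 (highest index wins the tie)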

Reference

Automa.TokenizerType
Tokenizer{E, D, C}

Lazy iterator of tokens of type E over data of type D. Tokenizers are usually created with the tokenize function, and their iterator behaviour is defined by make_tokenizer.

Tokenizer works on any buffer-like object that defines pointer and sizeof. When iterated, it will return a Tuple{Integer, Integer, E}:

  • The first value in the tuple is the 1-based starting index of the token in the buffer
  • The second is the length of the token in bytes
  • The third is the token.

Un-tokenizable data will be emitted as the "error token" which must also be of type E.

The Int parameter C allows multiple tokenizers to be created with the otherwise same type parameters.

See also: make_tokenizer

source
Automa.tokenizeFunction
tokenize(::Type{E}, data, version=1) -> Tokenizer

Create a Tokenizer{E, typeof(data), version}, iterating tokens of type E over data.

See also: Tokenizer, make_tokenizer, compile

Examples

julia> tokenize(UInt32, "hello")
 Tokenizer{UInt32, String, 1}("hello")
 
 julia> tokenize(Int8, [1, 2, 3], 3)
Tokenizer{Int8, Vector{Int64}, 3}([1, 2, 3])
source
Automa.make_tokenizerFunction
make_tokenizer(
     machine::TokenizerMachine;
     tokens::Tuple{E, AbstractVector{E}}= [ integers ],
     goto=true, version=1
)
  (2, 1, 0x02)
  (3, 3, 0x00)
  (6, 1, 0x02)
 (7, 1, 0x01)

Any actions inside the input regexes will be ignored.

If goto (default), use the faster, but more complex goto code generator.
The version number will set the last parameter of the Tokenizer, which allows you to create different tokenizers for the same element type.

See also: Tokenizer, tokenize, compile

source
make_tokenizer(
     tokens::Union{
         AbstractVector{RE},
         Tuple{E, AbstractVector{Pair{E, RE}}}
    };
    goto=true, version=1
)
  (1, 1, 1)
  (2, 1, 2)
  (3, 2, 0)
 (5, 1, 2)
source
julia> validate_io(IOBuffer(">hello\nAC"))
(nothing, (2, 2))

Reference

Automa.generate_buffer_validatorFunction
generate_buffer_validator(name::Symbol, regexp::RE; goto=true, docstring=true)

Generate code that, when evaluated, defines a function named name, which takes a single argument data, interpreted as a sequence of bytes. The function returns nothing if data matches Machine, else the index of the first invalid byte. If the machine reached unexpected EOF, returns 0.

If goto, the function uses the faster but more complicated :goto code.
If docstring, automatically create a docstring for the generated function.

See also: generate_io_validator

source
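A usage sketch; the function name is_header and the regex are arbitrary choices for this example:

using Automa

eval(generate_buffer_validator(:is_header, re">[a-z]+\n"))

is_header(">abc\n")  # nothing: the data matches
is_header(">abc")    # 0: unexpected EOF
is_header("abc\n")   # 1: index of the first invalid byte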
Automa.generate_io_validatorFunction
generate_io_validator(funcname::Symbol, regex::RE; goto::Bool=false)

NOTE: This method requires TranscodingStreams to be loaded

Create code that, when evaluated, defines a function named funcname. This function takes an IO, and checks if the data in the input conforms to the regex, without executing any actions. If the input conforms, return nothing. Else, return (byte, (line, col)), where byte is the first invalid byte, and (line, col) the 1-indexed position of that byte. If the invalid byte is a \n byte, col is 0 and the line number is incremented. If the input errors due to unexpected EOF, byte is nothing, and the line and column given are those of the last byte in the file.

If goto, the function uses the faster but more complicated :goto code.

See also: generate_buffer_validator

source
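A matching sketch for the IO validator, again with arbitrary names (note that TranscodingStreams must be loaded):

using Automa, TranscodingStreams

eval(generate_io_validator(:validate_header, re">[a-z]+\n"))

validate_header(IOBuffer(">abc\n"))   # nothing: the input conforms
validate_header(IOBuffer(">abc!\n"))  # (0x21, (1, 5)): first invalid byte and its position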
Automa.compileFunction
compile(re::RE; optimize::Bool=true, unambiguous::Bool=true)::Machine

Compile a finite state machine (FSM) from re. If optimize, attempt to minimize the number of states in the FSM. If unambiguous, disallow creation of FSM where the actions are not deterministic.

Examples

machine = let
     name = re"[A-Z][a-z]+"
     first_last = name * re" " * name
     last_first = name * re", " * name
     compile(first_last | last_first)
-end
source
compile(tokens::Vector{RE}; unambiguous=false)::TokenizerMachine

Compile the regex tokens to a tokenizer machine. The machine can be passed to make_tokenizer.

The keyword unambiguous decides which of multiple matching tokens is emitted: If false (default), the longest token is emitted. If multiple tokens have the same length, the one with the highest index is returned. If true, make_tokenizer will error if any possible input text can be ambiguously broken down into tokens.

See also: Tokenizer, make_tokenizer, tokenize

source