Skip to content

Commit

Permalink
Improve use of Unicode terminology in definition of string
Browse files Browse the repository at this point in the history
  • Loading branch information
jclark committed Jun 30, 2019
1 parent 630c086 commit 2c6c73d
Showing 1 changed file with 6 additions and 4 deletions.
10 changes: 6 additions & 4 deletions lang/spec.html
Original file line number Diff line number Diff line change
Expand Up @@ -455,7 +455,7 @@ <h2 id="values_types">5. Values, types and variables</h2>
</tr>
<tr>
<td>string</td>
<td>sequences of Unicode code points</td>
<td>sequences of Unicode scalar values</td>
</tr>
<tr>
<td rowspan="7" >basic, structured</td>
Expand Down Expand Up @@ -730,9 +730,11 @@ <h4>Strings</h4>
CodePoint := HexDigit+
</pre>
<p>
A string is an immutable sequence of zero or more Unicode code points. Any code
point in the Unicode range of 0x0 to 0x10FFFF inclusive is allowed other than
surrogates (0xD800 to 0xDFFF inclusive).
A string is an immutable sequence of zero or more Unicode scalar values, where a
Unicode scalar value is any code point in the Unicode range of 0x0 to 0x10FFFF
inclusive, other than surrogate code points, which are 0xD800 to 0xDFFF
inclusive. Note that a string may include Unicode noncharacters, such as 0xFFFE
and 0xFFFF.
</p>
<p>
In a <code>StringNumericEscape</code>, <code>CodePoint</code> must valid Unicode
Expand Down

0 comments on commit 2c6c73d

Please sign in to comment.