This is an unofficial snapshot of the ISO/IEC JTC1 SC22 WG21 Core Issues List revision 114a. See http://www.open-std.org/jtc1/sc22/wg21/ for the official list.

2024-04-18


2636. Update Annex E based on Unicode 15.0 UAX #31

Section: Clause Annex E  [uaxid]     Status: C++23     Submitter: Steve Downey     Date: 2022-10-20

P2720R0 comment US 64-132

[Accepted as a DR at the November, 2022 meeting.]

Unicode 15.0 UAX #31 clarified that rule R3 was, in fact, intended to apply to programming languages. WG21's prior understanding was that programming languages are not in scope of that rule. The proposed resolution updates E.4 [uaxid.pattern] to the revised understanding. See paper P2653R1 (Update Annex E based on Unicode 15.0 UAX 31) for more details.

Proposed resolution (approved by CWG 2022-10-21):

Change in E.4 [uaxid.pattern] as follows:

UAX #31 describes how formal languages that use or interpret patterns of characters, such as regular expressions or number formats, may describe that syntax with Unicode properties such as computer languages should describe and implement their use of whitespace and syntactically significant characters during the processes of lexing and parsing.

C++ does not do this as part of the language, deferring to library components for such usage of patterns. This requirement does not apply to C++ claim conformance with this requirement.