#tc39-delegates on 2020-04-20

06:17	<mathiasbynens>	Bakkot: wdyt about allowing `[\p{Seq}]`? the only remaining surprising cases are then negation (either through `[^...]` or `\P{...}`)
06:28	<Bakkot>	mathiasbynens that feels... pretty weird at first glance? but it is possible that is just the novelty of it.
06:34	<Bakkot>	mathiasbynens I guess that seems like it makes things worse, not better; my major problem with `\p{Seq}` in general is that a thing I am expecting to match one character is suddenly matching maybe more than one character and to figure out which it's going to do I need to go track down some table in unicode
06:35	<Bakkot>	and expanding that so that `[]` now also maybe matches more than one character does not seem like an improvement
06:49	<mathiasbynens>	Bakkot: yeah, I similarly hadn't seriously considered it before. I get it from the UTC perspective though: UTS18 has always allowed strings in character classes
06:50	<mathiasbynens>	Bakkot: anyway, it's one of those things where if we throw now (like in the current proposal), we can always decide to loosen that up later
06:51	<mathiasbynens>	OTOH, the discussion on whether character classes can match multiple code points influences the syntax discussion
06:53	<mathiasbynens>	Bakkot: out of curiosity, why do you need to know if the thing that's matched is just one code point or more?
06:53	<mathiasbynens>	with /u you already don't know if it's 1 UTF-16 code unit or 1 code point (which could be 2 such units), and that's a Good Thing(tm)
06:54	<mathiasbynens>	this is the next step
06:59	<Bakkot>	mathiasbynens the way I reason about regular expressions is by walking them over strings, one character at a time
06:59	<Bakkot>	if we could redefine 'character' to 'glyph' everywhere then I could walk one glyph at a time
07:00	<Bakkot>	but as long as `.` means one code point I am stuck with thinking about code points