2009-04-01
[18:03:00.0000]
philip: i like the subtle change at the end
[18:06:00.0000]
/me couldn't think of anything more interesting to do
[22:30:00.0000]
nico1: for more on whatwg, see http://www.whatwg.org/
[22:31:00.0000]
note also that this channel and #html-wg and #webapps are logged, and you can read the logs at http://krijnhoetmer.nl/irc-logs/
[22:31:01.0000]
mostly we talk about HTML5 and browser stuff
[22:32:00.0000]
thanks a lot MikeSmith !
[23:03:00.0000]
Hrm
[23:04:00.0000]
/me contemplates reading up on the content model
[23:25:00.0000]
nico1: http://validator.nu/
[23:26:00.0000]
http://about.validator.nu/
[23:27:00.0000]
http://hsivonen.iki.fi/thesis/html5-conformance-checker.xhtml
[23:30:00.0000]
http://www.whatwg.org/specs/web-apps/current-work/multipage/
[23:31:00.0000]
nico1: ↑
[23:31:01.0000]
http://www.whatwg.org/specs/web-apps/current-work/multipage/syntax.html#parsing
[23:34:00.0000]
nico1: http://blog.whatwg.org/validatornu-html-parser-120
[23:34:01.0000]
http://about.validator.nu/htmlparser/
[23:35:00.0000]
http://code.google.com/p/html5lib/
[23:36:00.0000]
http://code.google.com/p/html5lib/issues/list
[23:53:00.0000]
sigh at the adactio post. some really should explain why we don't like landmarks, but I guess blogging on April 1 is a bad idea
[23:54:00.0000]
(or, rather, why we don't like architectural forms as the means of expressing landmarks)
[00:32:00.0000]
hsivonen: so I'm discovering a bit about schematron
[00:32:01.0000]
e.g., it has a element to report the name of the context node
[00:33:00.0000]
so one can do, e.g.:
[00:33:01.0000]
[[
[00:33:02.0000]
[00:33:03.0000]
[00:33:04.0000]
The element must not
[00:33:05.0000]
appear as a descendant of the
[00:33:06.0000]
“footer” element.
[00:33:07.0000]
[00:33:08.0000]
]]
[00:35:00.0000]
MikeSmith: does that work in Jing?
[00:36:00.0000]
yeah, it does
[00:36:01.0000]
I just tested it
[00:36:02.0000]
it's in Schematron 1.5, and Jing supports that
[00:36:03.0000]
or does now at least
[00:36:04.0000]
MikeSmith: interesting. I knew Rick Jelliffe's XSLT implementation had something like that but I always thought it was a later addition
[00:36:05.0000]
I'd guess that it might now have previously
[00:37:00.0000]
that's possible
[00:37:01.0000]
anyway, if I use , it could eliminate a whole lotta redunancy that's in assertions.sch now
[00:38:00.0000]
seems like a good idea
[00:42:00.0000]
MikeSmith: http://bugzilla.validator.nu/attachment.cgi?id=70 looks good, except I don't see you using the label locators at all for location reporting.
[00:42:01.0000]
/me looks back at patch
[00:42:02.0000]
MikeSmith: what's the purpose of collecting forVals into a set first instead of doing the id matching as part of the stack walk?
[00:45:00.0000]
hsivonen: purpose is mainly that's just the simplest way it initially occurred to me to do it.
[00:45:01.0000]
hsivonen: I should just set a boolean instead?
[00:45:02.0000]
MikeSmith: preferably, yes
[00:45:03.0000]
when it finds a match for the id value?
[00:45:04.0000]
hsivonen: OK
[00:46:00.0000]
hsivonen: about your other question, I'm still looking
[00:47:00.0000]
hsivonen: so you saying that it's better not to use the locator mechanism at all for this label case?
[00:48:00.0000]
MikeSmith: or you could also emit a warning that gives the location of the label
[00:48:01.0000]
MikeSmith: unless that turns out to suck as UI. I'm not sure if giving the location of the label would be helpful or annoying from a user POV
[00:49:00.0000]
anyone got a nice simple demo page, showing the use of the HTML5 SQL API?
[00:50:00.0000]
/me hoping i can pre-populate a database with sqllite elsewhere...
[00:50:01.0000]
hsivonen: do you think it would be useful to emit those warnings? or overkill?
[01:07:00.0000]
I wish people would at least try to make april fools jokes sound plausible http://tech.slashdot.org/article.pl?sid=09/03/31/1950221&from=rss
[01:11:00.0000]
damn, the quality of postings from Slashdot writers is otherwise so exceptional
[01:11:01.0000]
Lachy: you've burst my bubble of deep appreciation for Slashdot
[01:12:00.0000]
Lachy: That sounds a lot like... http://www.lunascape.tv/
[01:12:01.0000]
/me goes to replace his sarcasm detector
[01:13:00.0000]
/me checks the whatwg blog
[01:13:01.0000]
blimey, the aria spec is over 80 pages long
[01:14:00.0000]
MikeSmith: I put in warnings like that for some table integrity checking stuff. I don't have user feedback complaining about those.
[01:14:01.0000]
hsivonen: OK, I will add it then
[01:15:00.0000]
http://labs.opera.com/news/2009/04/01/
[01:15:01.0000]
This would be awesome
[01:22:00.0000]
hsivonen: so... aria. what should i say in the spec?
[01:23:00.0000]
hsivonen: should i just defer to the wai-aria spec and say that the attributes it defines can be used in html5 for accessibility purposes?
[01:24:00.0000]
or should i list something explicitly? or what?
[01:37:00.0000]
Hixie: I had expected the ongoing task force to figure out what makes sense
[01:38:00.0000]
Hixie: on the topic of UA conformance:
[01:38:01.0000]
defer to the ARIA implementation guide and a (AFAIK) so far non-existent document on resolving native semantic and ARIA semantic conflicts in client implementation
[01:40:00.0000]
Hixie: on the topic of authoring conformance: defer states and properties authoring to ARIA and role applicability to given HTML elements to so far non-existent document hopefully flowing out of the work of the task force
[01:41:00.0000]
Hixie: if you want to address FUD along the lines of HTML5 hating ARIA, you could already put in notes that defer to the parts of the ARIA spec family that exist
[01:44:00.0000]
Hixie: so to elaborate
[01:45:00.0000]
Hixie: the task force documents accessibility API mappings for old HTML and ARIA (and perhaps new HTML5 stuff)
[01:45:01.0000]
Hixie: from those mappings, it's possible to see what HTML bits and ARIA bits overlap
[01:46:00.0000]
Hixie: then it's necessary to define what UAs should do when they get mixed signals (HTML semantics and ARIA role set inconsistently)
[01:47:00.0000]
Hixie: or what should happen if an element has a native accessibiilty API mapping and has states and properties as aria-* attributes with no role
[01:47:01.0000]
Hixie: hopefully in due course this will live in a W3C document somewhere
[01:48:00.0000]
Hixie: so reference that for UA conformance
[01:48:01.0000]
Hixie: then, *given* the behaviors documented in that document, some element/role combinations will be non-sensical. I think those should be flaggable as non-conforming by validators.
[01:49:00.0000]
Hixie: but defining those cases needs to wait until the UA conformance reqs show what combinations are non-sensical to author
[01:50:00.0000]
Hixie: if it were up to me, I'd put a statement of intent to this direction as a note in the HTML 5 spec
[01:51:00.0000]
which reminds me that I should review the latest draft of ARIA
[02:19:00.0000]
/me agrees ith pretty much everything hsivoen said, fwiw
[02:33:00.0000]
/me finally starts dogfooding HTML5 parsing builds (not an April fools joke)
[02:36:00.0000]
/me wonders if hsivonen will append (not an aprils fools joke to all statements made today)
[02:36:01.0000]
s/(// s//(/
[02:36:02.0000]
actually I didn't need any brackets
[02:36:03.0000]
sigh
[02:37:00.0000]
jgraham: hopefully only the ones that seem potentially unbelievable
[02:38:00.0000]
Actually, as far as a parser goes, I've been ready for dogfood for a while now. I was waiting for an image cache crasher to go away
[02:38:01.0000]
hm, that's like xkcd's (no pun intended)
[02:45:00.0000]
takkaria: Really? How would you use it? Like I hear they're making an HTML 5 (not an april fools joke)
[02:54:00.0000]
Hixie: "
There can only be one character encoding declaration in the document.
"
[03:25:00.0000]
hmm. identi.ca looks wrong with the HTML5 parser
[03:25:01.0000]
all the content is "below the fold"
[03:26:00.0000]
hsivonen: You got new try builds avaliable?
[03:27:00.0000]
jgraham: yes. https://build.mozilla.org/tryserver-builds/2009-03-30_05:34-hsivonen⊙if/
[03:27:01.0000]
jgraham: it doesn't have all the latest namespace fixes
[03:27:02.0000]
XPath in text/html is probably broken in that build
[03:33:00.0000]
hsivonen: That build reliably crashes when visiting the live dom viewer
[03:37:00.0000]
hsivonen: identi.ca doesn't have any interesting parse errors
[03:39:00.0000]
jgraham: on which platform? WFM on Mac.
[03:39:01.0000]
oops. not WFM
[03:40:00.0000]
jgraham: crashes here, too. thanks.
[03:40:01.0000]
i still wonder why the parser should imply
[03:40:02.0000]
aargh. the image cache crash is still there
[03:40:03.0000]
and why the content model requires colgroup around col
[03:41:00.0000]
jgraham: do you also see imgCacheEntry stuff on the top of the crash stack?
[03:41:01.0000]
--> Lunch
[03:42:00.0000]
zcorpan: Does col without colgroup make any sense?
[03:45:00.0000]
jgraham: just as much as tr without tbody. lone col is valid in html4 and xhtml1
[03:58:00.0000]
hsivonen: http://bugzilla.validator.nu/attachment.cgi?id=71
[04:00:00.0000]
hsivonen: just tweaked to set boolean hasFor instead of collecting the "for" values
[04:19:00.0000]
http://canvex.lazyilluminati.com/misc/dom-viewer/?%3C!DOCTYPE%20html%3E%3Cp%3E%3Cobject%3E%3Cp%3E%3C%2Fobject%3E%3C%2Fp%3E%3Cscript%3Edocument.write(document.getElementsByTagName('object').length)%3C%2Fscript%3E
[04:19:01.0000]
/me is really confused
[04:19:02.0000]
try that in ie8
[04:21:00.0000]
Hixie, the Live DOM Viewer clipboard is broken
[04:21:01.0000]
clipboard.cgi returns 404
[04:23:00.0000]
oh, no, it's not. That's Philip`s DOM viewer.
[04:24:00.0000]
I didn't realise zcorpan had linked to it instead
[04:24:01.0000]
Lachy: The differencde is somewhat important :)
[04:25:00.0000]
MikeSmith: looks good. You could also break immediately after hasFor = true;
[04:25:01.0000]
hsivonen: OK
[05:16:00.0000]
Darn, we've been shut down
[05:21:00.0000]
Dashiva, do you mean by Mr Last Week?
[05:24:00.0000]
Hixie: have you ran the HTML5 parser on Google's index to see if quirks-mode pages would break if
always closed the paragraph?
[05:25:00.0000]
Lachy: Yes
[05:30:00.0000]
Dashiva, just ignore MLW. He's just an anonymous coward and a troll.
[05:30:01.0000]
Be careful - insults will only make him stronger
[05:31:00.0000]
does anyone else have data on whether it's feasible to adopt the standards mode behavior for
in the quirks mode, too?
[05:31:01.0000]
hsivonen, what sort of data would be useful?
[05:32:00.0000]
WTF? This is weird, yet seemingly per-spec.
[05:32:01.0000]
Lachy: a Web crawl showing that the case doesn't occur on the Web or doesn't occur on the CSS-enabled Web
[05:32:02.0000]
Lachy: or data about Opera having tried dropping the quirk and having gotten bug reports forcing a revert
[05:33:00.0000]
it could theoretically affect both rendering and scripting if scripts or styles depend on the table being within the p
[05:33:01.0000]
true
[05:33:02.0000]
gsnedders: What?
[05:34:00.0000]
it would be sad if the single parsing quirk in HTML5 ended up being Hixie's own making (through Acid2)
[05:34:01.0000]
I don't think we've tried dropping the quirk before
[05:34:02.0000]
what did acid2 require?
[05:34:03.0000]
jgraham: http://software.hixie.ch/utilities/js/live-dom-viewer/?%3C!DOCTYPE%20html%3E%0A%3Cp%20style%3D%22display%3Arun-in%22%3E%3Cspan%3Efoo%3C%2Fspan%3E%2C%20%3Cspan%3Ebar%0A%3Cp%3Efoo
[05:34:04.0000]
that
close the p
[05:34:05.0000]
Lachy: Acid2 requires
to parse like
[05:35:00.0000]
oh, good
[05:35:01.0000]
good???
[05:36:00.0000]
yes, having the table close the p is always how it's worked in good browsers in standards mode, and was what was requried by html4
[05:36:01.0000]
hsivonen: btw, have you found pages breaking because of
?
[05:36:02.0000]
"what was required by html4" is a weak argument
[05:37:00.0000]
zcorpan: I haven't
[05:37:01.0000]
might be worth to research that markup pattern since ie7 and all other browsers except ie8 and html5-enabled gecko don't close the inner ul
[05:38:00.0000]
true. but it makes the most sense cause it allows the to remain optional even when followed by a table
[06:24:00.0000]
hsivonen: If there was a version of the HTML parser library that reported occurrences of that quirk, I could run it on my collection of pages
[06:26:00.0000]
though that wouldn't be too useful if it's less common than ~0.01%
[06:27:00.0000]
Philip`: what kind of reporting mechanism do you need?
[06:29:00.0000]
hsivonen: The easiest would probably be something that prints to stdout in a thread-safe way
[06:29:01.0000]
I doubt that's the only quirk you'll need though it would be nice. Reparsing is another issue :/
[06:29:02.0000]
Actually I suppose that wouldn't quite work since I'd need to print the URI too
[06:30:00.0000]
annevk42: is reparsing so important that it trumps security?
[06:30:01.0000]
Philip`: would a warning with an easily detectable string work?
[06:30:02.0000]
hsivonen, ideally it's not, but the security concerns are rather weak
[06:33:00.0000]
hsivonen: So I would just use setErrorHandler? That sounds easy enough
[06:34:00.0000]
Philip`: ok
[06:34:01.0000]
(Does it matter if I use DOM or SAX?)
[06:34:02.0000]
Philip`: no
[06:39:00.0000]
Philip`: svn head now does warn("A \u201Ctable\u201D start tag caused a paragraph to close implicitly."); when a table closes a para
[06:42:00.0000]
hsivonen: Is there a chance you could convert it into a .jar file? :-)
[06:42:01.0000]
/me doesn't have a version checked out from SVN
[06:42:02.0000]
Philip`: ok
[06:44:00.0000]
Philip`: http://hsivonen.iki.fi/htmlparser-philip-2009-04-01.jar
[06:45:00.0000]
hsivonen: Thanks!
[06:47:00.0000]
hsivonen: Hmm, I get lots of org.w3c.dom.DOMException: INVALID_CHARACTER_ERR: An invalid or illegal XML character is specified.
[06:47:01.0000]
(which I didn't get using 1.0.7)
[06:47:02.0000]
when I'm using HtmlDocumentBuilder(XmlViolationPolicy.ALTER_INFOSET)
[06:48:00.0000]
By "lots" I mean "one", plus many NAMESPACE_ERR: An attempt is made to create or change an object in a way which is incorrect with regard to namespaces.
[06:50:00.0000]
Philip`: hmm. I don't know what I've broken. Do you have a URL that triggers those?
[06:51:00.0000]
http://www.giftology.co.uk gives the NAMESPACE_ERR
[06:51:01.0000]
http://www.villatraining.ca/ gives the INVALID_CHARACTER_ERR
[06:53:00.0000]
netquotevar:=""
[06:55:00.0000]
It also gives NAMESPACE_ERR on e.g. http://www.autobanga.lt/ which doesn't seem to use anything interesting except xml:lang
[06:56:00.0000]
(unless the code changed in the past few months)
[06:56:01.0000]
seems like something is badly wrong in the XML sanity code
[06:57:00.0000]
I hope it's not my fault
[06:58:00.0000]
Whoops, there's more messages sent to ErrorHandler than I expected
[06:58:01.0000]
My output file is 3,483,189,354 bytes
[06:59:00.0000]
I see 8161 URLs (out of ~130K) that trigger the table-closed-p warning
[06:59:01.0000]
Philip`: thanks. that seems bad on the face of it
[07:01:00.0000]
/me wonders if there's any value in uploading the list
[07:02:00.0000]
Philip`: I think there would be. one could pick sites at random and view them with two parsers
[07:02:01.0000]
Anyone from Opera aroud?
[07:02:02.0000]
*around
[07:03:00.0000]
gsnedders: do I count?
[07:03:01.0000]
hsivonen: http://philip.html5.org/data/table-implicitly-closed-p.txt
[07:04:00.0000]
no
[07:04:01.0000]
MikeSmith: Probably enough
[07:04:02.0000]
Opera wants a transcript of grades, which implies something formal and official. I don't have any digital transcript.
[07:04:03.0000]
gsnedders: I'm still part of the conspiracy, according to some people at least
[07:05:00.0000]
Or am I half asleep?
[07:05:01.0000]
Philip`: thanks!
[07:07:00.0000]
gsnedders: i just sent an email when i applied at opera
[07:08:00.0000]
http://www.reddit.com/r/programming/comments/893uo/html5_removes_tag_soup_support_goes_xml_only/ - apparently the commenters like the non-XML syntax
[07:08:01.0000]
zcorpan: http://www.opera.com/company/jobs/opening/211/ does say to apply online, though
[07:08:02.0000]
gsnedders: so?
[07:08:03.0000]
zcorpan: That is not email :P
[07:09:00.0000]
doesn't mean email is rejected
[07:09:01.0000]
i'm not saying you should send an email, though
[07:09:02.0000]
:P
[07:10:00.0000]
waha, http://philip.html5.org/tools/web-apps-tracker?from=2942&to=2943 is funny
[07:10:01.0000]
did someone announce that on blog.whatwg.org ?
[07:10:02.0000]
especially if you follow the diff all the way to the end, lol
[07:10:03.0000]
dt? dimitri?
[07:10:04.0000]
Yeah, right
[07:10:05.0000]
http://twitter.com/WHATWG/status/1427965755
[07:11:00.0000]
Clearly not enough people here read the WHATWG Twitter feed
[07:11:01.0000]
ah sweet
[07:11:02.0000]
I guess the entire digg-like style of reddit is April st too?
[07:11:03.0000]
*1st
[07:12:00.0000]
gsnedders: If you can't get a trasnscript at the moment you should apply first and worry later
[07:14:00.0000]
http://digg.com/programming/Tag_Soup_Support_Removed_From_HTML5
[07:16:00.0000]
It seems much more pointless when it's not blending into the Twitter stream and hiding behind tinyurl
[07:19:00.0000]
yo
[07:20:00.0000]
Hello
[07:21:00.0000]
anyone know if the ie8 beta devs have a channel kicking about?
[07:21:01.0000]
I think kicking the ie8 devs, even over irc, is harsh
[07:21:02.0000]
hmmm
[07:21:03.0000]
Really? 8)
[07:21:04.0000]
Although maybe they deserve it for the way they parse