00:21 | <jgraham> | Hixie: I can't see any obvious errors in the rest of the algorithm, but I didn't rigorously check everything, particularly the places where you changed the constraints so x_current always <= x_width and similarly with y_current |
00:25 | <Hixie> | k |
00:25 | <Hixie> | as always, let me know if you run into problems ever :-) |
00:46 | <gsnedders> | Can someone get me held back in school, just so I can stay in Secondary 5? |
07:46 | <hsivonen> | http://lists.w3.org/Archives/Public/www-tag/2008Mar/0033.html |
08:07 | <hsivonen> | http://www.dehora.net/journal/2008/03/20/html5-obs/ |
08:26 | <hsivonen> | http://lists.w3.org/Archives/Public/www-tag/2008Mar/0058.html |
08:27 | <hsivonen> | http://lists.w3.org/Archives/Public/www-tag/2008Mar/0061.html |
09:09 | <svl> | To see "Aunt Tilly" and "simply deploy RDF documents" in the same sentence... |
09:10 | <hsivonen> | regarding UTF-8 as a token default alongside windows-1252: it seems that people who use UTF-8 have a clue about declaring it |
09:58 | <hsivonen> | Philip`: did you examine the behavior of getProbableCharsets() in jchardet? Is the *last* entry in the array always the most probable? |
10:04 | <hsivonen> | hmm. I guess I should just let the detection fail if UTF-16 is the most probable |
10:04 | <hsivonen> | instead of trying to take the next most probable |
11:50 | <hsivonen> | Hixie, annevk: the discussion in this Mozilla bug might be of interest: https://bugzilla.mozilla.org/show_bug.cgi?id=289938 |
12:29 | <virtuelv> | I believe annevk is on vacation |
12:31 | <hsivonen> | right. I forgot |
12:40 | <hsivonen> | http://wilbur.bytowninternet.com/pipermail/list_html4all.org/2008-March/000660.html |
13:16 | <BenMillard> | hi all, Simon 'Zcorpan' Pieters told me Hixie had gotten to table issues |
13:17 | <BenMillard> | he linked me to the tracker and the recent IRC chat |
13:24 | <MikeSmith> | hsivonen - have you tested install of your conformance checker on OSX? |
13:27 | <MikeSmith> | virtuelv - I really wish there were some way I could easily install the latest opera snapshot/weekly without needing to the goddamn desktop team blog page to check and manually get the link |
13:27 | <virtuelv> | MikeSmith: you want build feeds? |
13:33 | <MikeSmith> | Dashiva - I was going to say to virtuelv that I really don't care what the mechanism is as long as I've got some automated way to install the test build when I want to |
13:35 | <hsivonen> | MikeSmith: I develop on Mac OS X, but I haven't tested a clean install on OS X in a while |
13:37 | <MikeSmith> | hsivonen - OK, just wondering. I've been messing around a bit lately with shell stuff under Darwin on OSX and just wondering if there were any gotchas |
13:37 | <MikeSmith> | one thing I noticed so far is that it's not really obvious what JAVA_HOME is |
13:38 | <MikeSmith> | but eventually managed to figure out it's /System/Library/Frameworks/JavaVM.framework/Versions/CurrentJDK/Home |
13:49 | <BenMillard> | I've told Accessify Forums (as "Cerbera") about the tables section of HTML5 changing. I hope it's a fair summary: http://www.accessifyforum.com/viewtopic.php?p=59125#59125 |
14:16 | csarven | finds http://csseleven.com/ lame on so many levels |
14:38 | <hsivonen> | ouch. the spec has grow past the Validator.nu size limit again |
14:40 | <zcorpan_> | Hixie: while you're working on tables, i think the table processing should look at child <col> elements of <table> |
14:40 | <zcorpan_> | (and that it should be allowed and perhaps that the html parser shouldn't imply <colgroup> -- i've sent email about this if i recall correctly) |
14:41 | <hsivonen> | zcorpan_: re: attributes defaulted from a DTD: If the user chooses DTD processing, Validator.nu processes the DTD. If that sucks for the user, the users shouldn't ask for DTD processing |
14:41 | <zcorpan_> | hsivonen: fair enough :) |
14:41 | <hsivonen> | zcorpan_: however, I have considered cloning Gecko's DTD behavior and offering that as an option |
14:42 | <hsivonen> | but I don't want to do it unless there's agreement that Gecko's DTD behavior should be grandfathered into the Web platform |
14:58 | <Philip`> | hsivonen: I haven't tried using getProbableCharsets at all |
14:59 | <hsivonen> | Philip`: ok |
15:00 | <hsivonen> | csarven: is there any point other than self-promotion with pictures and minibios? |
15:01 | <jgraham_mibbit> | BenMillard: I think tha's a fair summary although I guess Hixie can come up with an entirely novel algorithm |
15:03 | <csarven> | hsivonen My thoughts exactly! |
15:16 | <BenMillard> | jgraham_mibbit, thanks and that's a possibility zcorpan mentioned. |
17:30 | <Hixie> | hsivonen: if you want to let the html4all people know, there will be a gigantic e-mail coming out relatively soon discussing all the table edits in great detail |
17:30 | <Hixie> | hsivonen: i just didn't want to delay the edits until i'd done editing the spec |
17:31 | <Hixie> | er, until i'd done replying to the e-mail, i mean |
18:54 | <Philip`_> | http://wiki.whatwg.org/wiki/Special:Recentchanges - Mpsingh looks like a spammer |
19:24 | <andersca> | Hixie: found a tiny typo in the spec |
19:24 | <andersca> | Explicit entries can also be marked as foreign, which means that they have an manifest attribute but that it doesn't point at this cache's manifest. |
19:24 | <andersca> | an manifest => a manifest |
19:30 | <hsivonen> | Hixie: mentioned as part of replying on html4all |
19:48 | <andersca> | Hixie: also, ping |
19:48 | <andersca> | :) |
19:53 | Pavlov_ | summons hixie too, for good measure |
19:54 | <andersca> | cool |
20:25 | <Hixie> | andersca, Pavlov_: here |
20:26 | <Hixie> | andersca: and sorry to be a pain, but if you could, please mail the typo to me :-) (either the list or ian⊙hc if you don't want to spam the list) |
20:26 | <andersca> | Hixie: I would rather have all those typos exposed on the whatwg list ;) |
20:27 | <Hixie> | sure thing :-) |
20:27 | <andersca> | Hixie: also, I was wondering why null characters in the manifest file should be converted to fffd? |
20:28 | <Hixie> | to avoid any problems with implementors wanting to use null-terminated strings |
20:29 | <Hixie> | it's been a source of many problems in other parts of the web platform |
20:30 | <andersca> | ap: did you see that |
20:31 | <ap> | Hixie: the problems in other parts were fixed without changing decoding of nulls - I think it would be better to have a consistent approach |
20:31 | <Hixie> | the html5 parser handles nulls in much the same way |
20:31 | <ap> | Hixie: hmm, do any browsers follow that? |
20:32 | <ap> | Hixie: for all I know, WebKit just treats nulls as normal characters |
20:32 | <Hixie> | some do. others treat them as nulls, and yet others have security bugs (they just strip out the nulls, or truncate at nulls, both of which can be used for xss or similar behaviours) |
20:33 | <Hixie> | to be honest it really doesn't matter much what we do with nulls here, since none of the magic strings contain nulls, and URIs aren't allowed to contain nulls. |
20:33 | <Hixie> | (or fffds) |
20:33 | <Hixie> | i don't really understand why it's a problem |
20:33 | <Hixie> | so long as the behaviour is indistinguishable from treating nulls as FFFDs |
20:33 | <ap> | Hixie: I'd just prefer to avoid introducing yet another mode in the decoder |
20:34 | <Hixie> | the spec doesn't say anything about how you implement it |
20:34 | <ap> | Hixie: hmm, I think andersca was talking about this phrase from the spec: "All U+0000 NULL characters must be replaced by U+FFFD REPLACEMENT CHARACTERs." |
20:35 | <Hixie> | right, so long as the result is equivalent to this, it doesn't matter how you do it |
20:35 | <Hixie> | in fact i think that line is effectively redundant, so long as you correctly detect whether URIs are valid or not |
20:36 | <othermaciej> | why not just wait for Opera to fix their null character handling bugs |
20:36 | <Hixie> | since both FFFD and 0000 are invalid in URIs |
20:36 | <ap> | Hixie: ok, I guess I didn't have enough context, and thought that this had practical consequences |
20:36 | <andersca> | oh, sorry about that :( |
20:36 | <Hixie> | othermaciej: this isn't about those bugs |
20:36 | <Hixie> | othermaciej: but the null->fffd stuff in the parser is there because nulls have caused far many more problems than just in opera |
20:37 | <Hixie> | e.g. the status bar bug several years back |
20:37 | <Hixie> | and since there really is no use case for U+0000 on the Web, it seems far safer to just remove it altogether |
20:37 | <othermaciej> | nulls are certainly risky when you mix with C APIs |
20:37 | <othermaciej> | but converting them in some places but not others is an annoying burden |
20:37 | <ap> | Hixie: sounds like a warning could have better effect than a somewhat misguiding requirement in a sample algorithm |
20:38 | <othermaciej> | and you really want your code to be safe against nulls in strings anyway |
20:38 | <Hixie> | ap: send mail :-) |
20:38 | <ap> | Hixie: k |
20:38 | <Hixie> | thanks |
20:38 | <othermaciej> | but if the conversion doesn't affect results of the algorithm then it does not matter much in this case |
20:39 | <Hixie> | othermaciej: in the manifest case, i don't think there's any noticeable effect, and we can probably remove it |
20:39 | <Hixie> | othermaciej: in the html parser case, i agree that it is unfortunate that you can still introduce nulls through the DOM APIs |
20:39 | <Hixie> | othermaciej: not much i can do about that, though |
20:40 | othermaciej | blames the Unicode consortium |
20:40 | <Hixie> | othermaciej: still seems far safer and better to make nulls become FFFDs, since you already have to have code to turn a number of other things into FFFDs and have to do newline normalisation anyway |
20:40 | <Hixie> | anyway |
20:40 | <Hixie> | please discuss this on a mailing list :-) |
20:40 | Hixie | is working on tables |
20:42 | <gsnedders> | eww! you can't use tables! they're presentational! divs ftw! |
20:46 | hsivonen | likes \0 to FFFD substitution in the parser |
20:52 | <BenMillard> | hixie, is there anything I can be doing to help your work on tables? |
20:58 | Philip` | sees the last paragraph of http://html4all.org/mailman/archives/list_html4all.org/2008-March/000663.html and conducts some research |
20:58 | <Philip`> | Out of about 130,000 random pages from dmoz.org, the 'axis' attribute is used on http://www.cdsm.co.uk/ and http://centerville.lionwap.org/ and http://gazianteplionsclubtr.lionwap.org/ |
20:58 | Philip` | concludes that it's quite rare, and that he has insufficient data to conclude anything else |
20:59 | <hsivonen> | Philip`: I assume you keep a local cache of 130000 pages? |
20:59 | <Philip`> | hsivonen: Yes |
21:00 | <hsivonen> | how much disk space does the cache take? |
21:00 | <Philip`> | I wouldn't want the cost of downloading ~3GB of content every time I wanted to find a little more information :-) |
21:00 | <Philip`> | hsivonen: ~3GB |
21:00 | <Philip`> | (having clamped each page to 256KB) |
21:01 | <Philip`> | (since the (unclamped) mean size of a page is infinite) |
21:02 | <Philip`> | If disk space was a concern, I imagine compression could help significantly |
21:05 | <Philip`> | (http://triin.net/archive/kool/webstat/figure-8.png suggests almost everything is <=256KB, so that seems a reasonable limit to impose) |
21:05 | <Philip`> | (though actually I picked the value pretty arbitrarily) |
21:07 | <hsivonen> | If I have understood axis correctly (not a safe assuption), http://gazianteplionsclubtr.lionwap.org/ uses axis in a useless way |
21:07 | <hsivonen> | as does http://centerville.lionwap.org/ |
21:08 | <hsivonen> | and http://www.cdsm.co.uk/ |
21:09 | jgraham_ | is pretty sure axis is useless |
21:12 | <Philip`> | At least it's abused far less frequently than many other attributes |
21:24 | BenMillard | agrees with jgraham |
21:29 | <BenMillard> | I'm off for some supper, some TV and then bed. bye all |
21:29 | <hsivonen> | ye |
21:29 | <hsivonen> | bye even |
21:30 | <BenMillard> | cya |
22:11 | <jgraham_> | http://james.html5.org/graph_2.svg -> possibly all the state transitions made in the html5lib treebuilder testcases (assuming I didn't screw up) |
22:12 | jgraham_ | will work out how to distinguish "real" transitions from "reprocess as if in" transitions next (that graph should have both) |
22:13 | <Hixie> | how do you go from AfterBodyPhase to InFramesetPhase? |
22:14 | <Hixie> | oh that's the "process using the rules for" thing |
22:14 | <Hixie> | nevermind |
22:14 | <Hixie> | there are iirc only four insertion modes that can be subject to that, btw |
22:14 | <Hixie> | if that helps optimise code |
22:49 | <Philip`> | jgraham_: That's quite nice :-) |
22:52 | <Philip`> | <text style="font-family:Times New Roman;font-size:14.00;"> - doesn't it need to quote the string? |
22:53 | <Hixie> | no |
22:53 | <Philip`> | Oh |
22:53 | <Hixie> | it does, however, need units on the font-size |
22:54 | <Philip`> | Oh |
22:54 | <hsivonen> | Hixie: in SVG? |
22:54 | <Hixie> | yup |
22:54 | <Hixie> | unless you use the font-size="" attribute |
22:55 | <hsivonen> | in retrospec, it might have been good to make unitless default to px in CSS even in the standards mode |
22:55 | <Hixie> | except that would clash with unitless numbers in line-height, font, and other properties |
22:55 | <hsivonen> | unitless line-height is redundant with % |
22:56 | <Hixie> | nope |
22:56 | <Hixie> | they inherit differently |
22:56 | <hsivonen> | too difficult :-) |
22:56 | <Hixie> | (numbers allow lines to expand to fit children with bigger font sizes, %s don't) |
22:56 | <Hixie> | i'm not defending the design of the language |
22:57 | <Hixie> | just explaining why the svg group got it wrong |
22:59 | <Philip`> | http://www.gerv.net/security/link-fingerprints/ - "half a loaf is better than no bread" - I think that's not true in general - the half loaf might be covered in mould, and it'd be better if you didn't have any bread at all |
23:00 | <Hixie> | i think we already discussed fingerprints on the list once |
23:00 | <hsivonen> | http://mindforks.blogspot.com/2008/03/aria-templateid-explained.html |
23:00 | <Hixie> | and came to the conclusion it wasn't a good idea |
23:00 | <Hixie> | i was hoping someone would find the reference and reply to him before i had to :-) |
23:00 | <hsivonen> | I wonder how aria-templateid scales on the time axis given the update cycle of AT |
23:00 | <hsivonen> | compared to the update cycle of Web 2.0 apps |
23:00 | Hixie | wonders what happens if he sets aria-templateid="google.com/gmail" on his blog |
23:01 | <othermaciej> | what's aria-templateid? |
23:01 | <Hixie> | <hsivonen> http://mindforks.blogspot.com/2008/03/aria-templateid-explained.html |
23:02 | Philip` | tends to download files by copying the URL and pasting it into wget, and then getting bash errors and doing it again with quotes around the URL argument, which would avoid the benefits of link hashing |
23:02 | <othermaciej> | that sounds like an amazingly bad idea |
23:02 | <Hixie> | othermaciej: you hate blind people! |
23:03 | <hsivonen> | othermaciej: you are not the first person with that initial reaction |
23:04 | <takkaria> | I can't imagine AT vendors actually implementing support for it |
23:04 | <Philip`> | Hixie: If you did that, it would make your blog less usable to some people with AT, and there are much easier ways you can cause problems for those people, so I don't see why it matters |
23:05 | <Hixie> | Philip`: it seems like reducing the ways that authors can (accidentally) screw over their users would be a good thing |
23:06 | <Hixie> | Philip`: and, similarly, increasing the number of ways that authors can screw over their users would be a bad thing |
23:06 | <Philip`> | Why would someone accidentally set aria-templateid="google.com/gmail" on their site? |
23:06 | <othermaciej> | the saddest thing about it is that it seems to presume that web sites themselves won't use ARIA to give a good enough accessibility experience, so that AT needs special-case hacks for different sites |
23:06 | <Hixie> | Philip`: why would someone accidentally put <!DOCTYPE PHP PUBLIC "-//W3C//DTD PHP 4.01//EN"> as their doctype? |
23:06 | <Hixie> | othermaciej: firevox shows that this is already happening |
23:07 | <Hixie> | othermaciej: though why a site would care enough to put an aria-templateid attribute but not enough to make their site usable, i dunno |
23:07 | <Dashiva> | I find the AT upgrade cycle a much more persuasive argument :) |
23:08 | <othermaciej> | we have indeed heard many times that the brutally slow AT upgrade cycle makes changing accessibility features fruitless |
23:08 | <Philip`> | Hixie: There's an infinite number of errors like that that people can accidentally introduce; but there's only a (pretty small) finite number of aria-templateid="..." strings that any AT would care about, so there's a vastly smaller chance of that happening by accident |
23:08 | <othermaciej> | for instance that a new table header association algorithm was worthless, since it would be forever until AT supports it, and twice forever until users who need it have that version |
23:09 | <Hixie> | Philip`: except that the sites that would use it are those most likely to be held up as examples to learn from |
23:09 | <Hixie> | othermaciej: way to rain on what i'm doing on my other monitor :-P |
23:10 | <Philip`> | aria-templateid allows competition in (e.g.) the Gmail-usability space, because lots of vendors (and extension developers etc) can all try to make it work as well as possible; otherwise only Google could improve Gmail usability, so they're going to do less well than when there are many competing attempts |
23:11 | <Hixie> | nonsense |
23:11 | <othermaciej> | Hixie: I'm not saying I agree with such sentiments |
23:11 | <Hixie> | Philip`: ATs could easily just hardcode the gmail uris |
23:11 | <Dashiva> | Philip`: Replace templateid with the URL and you have a user script system that works today |
23:12 | <Philip`> | Maybe Google Maps is a better example, since that often gets embedded in other web pages and you can't hardcode the page URLs |
23:12 | <Philip`> | (Yay mashups!) |
23:13 | <Hixie> | that might hypothetically work, if templateid is on a per-subtree level, but that would need extremely careful definition |
23:14 | <hsivonen> | (the visual part of Google maps is very visual though) |
23:14 | <Hixie> | horrah, ms feedback on storage |
23:15 | <Philip`> | I suppose the ARIA spec is a little vague on aria-templateid, but maybe someone will blog about it in an extremely careful detailed way |
23:15 | <Hixie> | man, i should use that model to write html5 |
23:15 | <Hixie> | that would be so much easier |
23:16 | <Philip`> | hsivonen: (There are many disabled people who aren't blind, so AT improvements on primarily visual content would still help some users) |
23:16 | <hsivonen> | Philip`: you are aware of the wiki guide, right? |
23:17 | <Philip`> | hsivonen: I'm not, and Google just tells me about a science fiction manga |
23:17 | <Philip`> | Oh, is that the Mozilla one? |
23:17 | <hsivonen> | Philip`: http://developer.mozilla.org/en/docs/ARIA_UA_Best_Practices |
23:18 | <Philip`> | "aria-templateid: XXX not sure what to do with this one" |
23:18 | <Philip`> | Hmm |
23:22 | Philip` | just sees http://lists.whatwg.org/pipermail/whatwg-whatwg.org/2006-November/thread.html#7825 about the link fingerprint thing |
23:23 | <Hixie> | that's the one |
23:26 | <Hixie> | thanks Philip` |
23:28 | Philip` | thanks the "Search Mail" button |
23:33 | Hixie | wonders how strange a validator feature request would have to be before hsivonen would just reject it out of hand |
23:33 | <Hixie> | hsivonen: so i'm tempted to ask for support for an equivalent of the #line preprocessor directive... |
23:33 | Philip` | wonders what happens if the target of checksummed link is replaced by an HTML file containing a malicious script followed by enough stuff to make a browser want to incrementally render before it's finished downloading and can check the checksum |
23:33 | <Hixie> | (i edit a file that is then concatenated to another one before validation, so the line numbers are all 82 lines off for me) |
23:34 | <Hixie> | crap, gotta go |
23:34 | <Hixie> | if people want to check it out, i just regenned the spec with the new algorithm for tables |
23:34 | <Hixie> | bbl |
23:36 | <csarven> | Are there any UAs that support inline pseudo-classes? |
23:38 | <andersca> | Hixie: cool |