00:12
Philip`
tries implementing a very extremely primitive version of cInputStream, which only returns the string "Hello world" (repeating infinitely) and whose C code calls the class "Noddy" because it's copied from the Python documentation
00:12
<Philip`>
and it seems to run char() about three times faster than a pure Python version, which makes it seem actually worthwhile
00:16
<Philip`>
I believe cInputStream only needs to implement three functions, so it should be pretty straightforward...
00:32
<annevk>
only three times faster?
00:32
<annevk>
still quite a lot I suppose
00:33
<annevk>
I guess all the other Python bits impact the perf as well then. I would expect a C version to be a 100 times faster or so.
00:33
<Philip`>
All the method's doing is returning the next character from a string and increasing the offset counter, so there isn't a huge scope for improvements
00:36
<Philip`>
http://krijnhoetmer.nl/irc-logs/html-wg/20070710#l-239
00:37
<Philip`>
(100x seems about right)
00:37
<annevk>
(I got that number from a collegue who did some comparative testing.)
01:13
<Hixie>
most uses of alt=<...> seem to be pretty serious errors (people putting markup in alt="")
01:13
<Hixie>
so we could probably turn alt=<...> into the magic "not a description but a type of image" mode
01:14
<Hixie>
(as in, instead of <img important alt=...>, we would have <img alt="<...>">)
01:16
<Hixie>
or we could use alt={...}, the only use of that seems to be for cases where alt=&...; would have been better
01:16
<Hixie>
we could even say that in {...} if the ... matches an entity name then it's treated as an entity, otherwise it's treated as an important image...
01:19
<Philip`>
I'm not sure what you mean about alt=&...;
01:19
<Philip`>
ALT="{short description of image}" seems the most common value in my data
01:20
<Philip`>
(<img alt="<...>"> is nasty because everyone will write it as <img alt="&lt;...&gt;"> and it'll be ugly)
01:21
<Hixie>
yeah i agree with <>
01:21
<Hixie>
alt={alpha} is the most common {...} value i see
01:22
<Philip`>
I don't see that at all
01:22
<Hixie>
which would probably be best as alt=&alpha;
01:22
<Hixie>
i expect my sample has a lot more scientific documents
01:25
<Philip`>
http://philip.html5.org/data/alt-in-braces.txt
01:27
<Philip`>
It'd be kind of annoying for people who do <img src="rendered-latex.gif" alt="latex source">, since that'd occasionally be {...}
01:29
<Hixie>
i dropped anything that had fewer than 10,000 pages, and my list was: http://damowmow.com/temp/alt-in-braces.txt
01:29
<Philip`>
Might all the alphas come from one site with lots of pages?
01:30
<Hixie>
quite possible
01:31
<Hixie>
(or from one tool)
01:33
<Philip`>
http://www.cmaj.ca/cgi/content/full/173/12/1441 has alt="{dagger}"
01:33
<Philip`>
http://jcem.endojournals.org/cgi/content/abstract/87/4/1687 has alt="{chi}"
01:33
<Philip`>
Those seem to be the only entity-like things I got
01:34
Philip`
notices some similarity in their URLs
01:34
<Hixie>
here are 15 pages that used alt={alpha}, selected at random (so if there's one site drowning the results, you should see a lot of that site):
01:35
<Hixie>
unlinked alt={alpha},http://agron.scijournals.org/cgi/content-nw/full/91/6/928/FIG6?ck=nck
01:35
<Hixie>
unlinked alt={alpha},http://www.good-sa.com.tw/97.html
01:35
<Hixie>
unlinked alt={alpha},http://hyper.ahajournals.org/cgi/collection/other_vasc_bio?notjournal=ahajournals&page=156
01:35
<Hixie>
unlinked alt={alpha},http://www.pnas.org/papbysection.shtml
01:35
<Hixie>
unlinked alt={alpha},http://toxsci.oxfordjournals.org/cgi/content/full/69/2/354/F2
01:35
<Hixie>
unlinked alt={alpha},http://journals.asm.org/cgi/figsearch?FIRSTINDEX=1540&SEARCHID=1&hits=10&RESULTFORMAT=&FULLTEXT=embryos&andorexactfulltext=&resourcetype=HWFIG
01:35
<Hixie>
unlinked alt={alpha},http://atvb.ahajournals.org/cgi/content/full/26/1/143
01:35
<Hixie>
unlinked alt={alpha},http://jb.asm.org/cgi/content/full/185/1/89?maxtoshow=&HITS=10&hits=10&RESULTFORMAT=&fulltext=lacz&searchid=1&FIRSTINDEX=1400&resourcetype=HWFIG
01:35
<Hixie>
unlinked alt={alpha},http://www.jimmunol.org/cgi/content/full/174/1/205?ck=nck
01:35
<Hixie>
unlinked alt={alpha},http://ajpheart.physiology.org/papbyrecent.shtml
01:35
<Hixie>
unlinked alt={alpha},http://bloodjournal.hematologylibrary.org/cgi/content/abstract/103/4/1286
01:35
<Hixie>
unlinked alt={alpha},http://circ.ahajournals.org/cgi/collection/endo_vastype_no?notjournal=ahajournals&page=236&ck=nck
01:35
<Hixie>
unlinked alt={alpha},http://ejcts.ctsnetjournals.org/cgi/content/full/25/3/352?ck=nck
01:35
<Hixie>
unlinked alt={alpha},http://stemcellbiology.blogspot.com/2007_09_01_archive.html
01:35
<Hixie>
unlinked alt={alpha},http://bloodjournal.hematologylibrary.org/cgi/content/abstract/106/7/2302
01:35
<Hixie>
looks like ahajournals.org might be overrepresented
01:35
<Hixie>
but that's still a broad selection
01:37
<Philip`>
All but 2 of those are running the same software
01:39
<Philip`>
(e.g. breaking the URLs makes all of them (but 2) show pretty much identical 404 pages)
01:40
<Hixie>
makes sense
03:08
<Philip`>
Hmm... cInputStream reduces spec parse time from 16.8s by 20% to 13.5s
03:08
<Philip`>
which is alright but not fantastic
03:11
<Philip`>
(That code could probably be optimised to get another 5% or so)
03:12
<Philip`>
(mainly by caching the charsUntil patterns)
07:39
gsnedders
yawns
07:42
<gsnedders>
I need test cases for the spec-gen
07:43
<gsnedders>
I don't really want to fix bugs before that
08:20
<gsnedders>
hmm. All the broken xrefs seem to be those with <dfn><code>foo</code></dfn>
08:25
<virtuelv>
shouldn't those really have been <code><dfn>foo</dfn></code>?
08:31
<hsivonen>
Hixie: should id="" be checked for uniqueness?
08:31
<hsivonen>
Hixie: <foo id=""><bar id=""> should that only whine about empty string ids or also about duplicate ids?
08:36
<hsivonen>
othermaciej: Duplicate IDs are now checked. Thanks.
08:37
<hsivonen>
in other news, Validator.nu now checks meta refresh values
08:37
<othermaciej>
hsivonen: cool, glad you added it
08:42
<hsivonen>
Philip`: I rephrased the non-streamability message as you requested
08:42
<hsivonen>
thanks
08:55
<annevk>
http://developers.facebook.com/fbopen/
08:56
<jgraham>
Has anyone ever checked how AT sees <figure><legend>? Apparently screenreaders have some special behaviour for <legend> that might prevent graceful degradation
09:01
<mpt>
hsivonen, do you check uniqueness of, and existence of the elements for, for= values?
09:03
<annevk>
for= also needs to do type checking
09:03
<hsivonen>
mpt: I think I do (for HTML5--not HTML4)
09:03
<hsivonen>
let's see
09:03
<jwalden>
http://svn.facebook.com/svnroot/platform/fbopen/lib/fbml/fbjs.php is the real meat as far as I can tell there
09:03
<annevk>
<label for=x>...</label> <input type=hidden id=x> should not be conforming
09:04
<annevk>
<label for=x>...</label> <div id=x></div> should not be conforming
09:04
<annevk>
etc.
09:04
<hsivonen>
mpt: yeah, I check for it
09:04
<mpt>
cool
09:05
<hsivonen>
the XHTML 1.0 / HTML 4 support in V.nu really sucks compared to the (X)HTML5 support
09:05
<hsivonen>
I wonder if I should just remove XHTML 1.0 / HTML 4 support or keep fixing the breakage
09:05
<hsivonen>
or leave it in a sucking state
09:06
<annevk>
hsivonen, it doesn't catch the type=hidden above
09:06
<annevk>
hsivonen, it does catch the <div> element
09:06
<hsivonen>
annevk: good catch.
09:07
<hsivonen>
annevk: is referring to type=hidden forbidden in the spec? (it probably should be)
09:07
<annevk>
might not be forbidden just yet
09:08
<annevk>
in fact, I don't think WF2 makes additional requirements about for
09:11
<annevk>
a more complicated check might be not allowing referencing a <datalist> child unless the <label> is itself a child of the same <datalist>
09:11
<hsivonen>
annevk: I found another type=hidden bug while I was at it. thanks
09:11
hsivonen
has totally forgotten about datalist
09:12
<hsivonen>
why would labels refer to datalists?
09:12
<annevk>
<label for=x> ... </label> <datalist> <select id=x> </select> </datalist>
09:12
<hsivonen>
eww.
09:13
<annevk>
the conforming case would be <datalist> <label for=x> </label> <select id=x> </select> </datalist>
09:13
<annevk>
it seems kind of nasty to check
09:14
<hsivonen>
consider http://bugzilla.validator.nu/show_bug.cgi?id=232 postponed
09:14
<hsivonen>
(deploying the easier fix now...)
09:14
<annevk>
yeah, fair enough
09:15
<annevk>
othermaciej, any progress on your Forms TF stuff?
09:15
<annevk>
othermaciej, the Forms WG people have stopped radio silence, maybe we should say something back
09:16
<othermaciej>
annevk: yeah, we should
09:16
<othermaciej>
annevk: I think I can get something written in the next few days
09:16
<othermaciej>
though if I don't get it done before next week, then I'll be out of action for a bit due to WWDC
09:17
<annevk>
better get on it then ;)
10:26
<Dashiva>
Just mark it as beta :)
10:26
<Dashiva>
oops. that was one scrollback too little
10:37
<Hixie>
man, some people are whiny
10:39
<annevk>
maybe that's how things worked out for them when they were little :)
10:40
<Lachy>
who's whining?
10:40
<Dashiva>
The editorial comments on xhtml and stuff, maybe
10:41
<Lachy>
ah, I still have hundreds of mails to catch up on from last weekend
10:43
<Dashiva>
I spent most of the weekend wondering why there wasn't any activity on public-html. Then I realized it was no longer May.
10:43
<hsivonen>
I'm quite happy to find less email
10:44
<Hixie>
nah, direct mail to me
10:45
<Hixie>
though i am getting tired of having to deal with the w3c
10:46
<Hixie>
hsivonen: i recommend that you make the validator collapse all the presentational attributes used with the value 0 or equivalent into one error "your document contains obsolete presentational markup"
10:48
<Hixie>
this global <a> thing is a bit irritating too
10:48
<Hixie>
could someone please explain to me what is wrong with onclick="getElementsByTagName('a')[0].click()" ?
10:48
<annevk>
it's more typing than href
10:48
<Dashiva>
It requires scripting
10:48
<Hixie>
waah.
10:48
<Philip`>
It needs CSS to give the right cursor, and JS to update the status bar
10:49
<Philip`>
(Er, do browsers still let you update the status bar?)
10:49
<Dashiva>
Sometimes, some of them
10:49
<annevk>
Hixie, maybe it's a CSS matter
10:49
<annevk>
Hixie, in that CSS could provide a way to make the target area of the link span the entire table row the link is in
10:50
<Hixie>
yeah well
10:50
<Hixie>
would be nice if the csswg was doing anything
10:50
Dashiva
gasps
10:50
<annevk>
i'm doing something
10:50
<annevk>
i moved the namespaces module to CR :)
10:50
<annevk>
next is media queries and cssom view
10:51
<Hixie>
uh huh
10:51
<Hixie>
i mean something useful :-P
10:51
<Hixie>
well, cssom view is useful
10:52
<annevk>
media queries is something html5 depends on too
10:53
<Hixie>
media queries is fine already
10:53
<Hixie>
i mean it has some minor issues
10:53
<Hixie>
but there are bigger fish to fry
10:53
<Hixie>
the other part of cssom, for instance
10:53
<annevk>
hah
10:53
<Hixie>
and the animation proposals from apple
10:54
<Hixie>
the animation stuff is probably the number one priority right now
10:54
<annevk>
dean jackson will be working on that, but the CSS WG is sort of slow setting it all up, not sure what's holding everything back
10:54
<Hixie>
if i wasn't committed to finishing html5 i'd be doing that i expect
10:54
<Hixie>
sadly to do a good job requires a rewrite of css from teh ground up
10:54
<Hixie>
since css2.1 is so not well written
10:54
<annevk>
(media queries parsing was defined in the same sense html4 parsing was defined so it needed some fixup)
10:54
<Hixie>
(and i say that as one of hte editors)
10:55
<annevk>
2012
10:56
<Hixie>
yeah
10:56
<Hixie>
well
10:56
<Hixie>
dom core first
10:56
<Hixie>
then svg
10:57
<Hixie>
and maybe http if gsnedders hasn't done it by then
10:57
<annevk>
dom core is zcorpan
10:57
<Hixie>
well he has til 2012 i guess :-)
10:57
<hsivonen>
Hixie: hmm. I was thinking about lobbying to allow the presentational stuff and offering warnings for them as a checkbox
10:58
<takkaria>
"then svg", heh
10:59
<Hixie>
hsivonen: i don't expect such lobbying to be fruitful
10:59
<Hixie>
hsivonen: the only use case seems to be "silence the validator"
10:59
<Hixie>
hsivonen: and that's easier done in ui
10:59
<Hixie>
annevk: is chaals around?
11:00
<Hixie>
oh man what's this geo wg crap
11:01
<Hixie>
just as we're fixing the waf+webabi mess of two wgs, we're splitting the new wg into two wgs again.
11:01
<Philip`>
You require e.g. frameborder="0" to make content work correctly in some current UAs (i.e. IE), and it'd be nice if valid HTML5 content could work in current UAs
11:01
<Hixie>
is the w3c simply unable to learn from its mistakes?
11:02
takkaria
wonders why "CSS Marquee" is a high priority spec of the csswg
11:02
<annevk>
Hixie, chaals is in Brasil last I heard. You could e-mail him I suppose
11:03
<Philip`>
takkaria: Presumably because it's a very popular feature in China
11:03
<annevk>
Hixie, I don't get the geo stuff either, all browser vendors + google indicated a preference for the new WA WG as venue
11:06
<Philip`>
(Oh, I guess my presumption was wrong, since the thingy says it's mainly for mobile browsers)
11:18
<Hixie>
wtf does this mean:
11:18
<Hixie>
svn: REPORT request failed on '/svn/!svn/vcc/default'
11:18
<Hixie>
svn: Target path does not exist
11:18
<Hixie>
...when i try to "svn up" the html5lib directory
11:21
<Hixie>
ooo, it works if i do it when i'm in the directory at its "real" path instead of a symlinked path
11:21
<Hixie>
weird
11:22
<Philip`>
If it's a symlinked subdirectory, I guess it'd be looking for ../../.svn/ and would get unhappy because that doesn't exist
11:23
<Hixie>
i don't think the real location has one of those either
11:23
<Hixie>
but oh well
11:23
<Hixie>
whatever
11:23
<Hixie>
hey, hsivonen fixed dup id detection
11:23
Hixie
fixes the dup ids in the spec
11:24
<Hixie>
annevk: when are we publishing again?
11:24
<annevk>
the plan is Thursday
11:24
<Hixie>
k
11:24
<annevk>
it largely depends on the W3C getting its act together though, offline-webapps was scheduled for last Friday...
11:25
<Hixie>
so when do i have to have the boilerplate updated?
11:25
<Hixie>
is now ok?
11:26
<annevk>
you mean making it WD-ready? I guess they want it as late as possible. Personally I'd say that now is ok
11:26
<Hixie>
k
11:29
<Hixie>
um
11:29
<Hixie>
looks like the multipage script broke when i updated it
11:29
<Hixie>
guess i'd better look into that
11:29
<Philip`>
Broke in which ways?
11:30
<Philip`>
("Updated" as in "updated to the latest version from SVN"?)
11:32
<Hixie>
ImportError: No module named serializer
11:32
<Hixie>
yes
11:33
<Philip`>
Sounds like an old version of html5lib
11:33
<Hixie>
updated that too
11:33
<Hixie>
svn up
11:34
<Philip`>
Hmm
11:34
<Philip`>
Certain? :-)
11:34
<Hixie>
yes
11:34
<Hixie>
what is python setup.py install going to do?
11:35
<Philip`>
Probably install into /usr/python2.5/etc
11:36
<Philip`>
(Is it using an installed old version of html5lib?)
11:36
<Hixie>
that's not going to work so well here.
11:36
<Hixie>
yeah, looks like it might be
11:36
<Hixie>
oh i see
11:36
<Hixie>
i symlinked to the wrong place
11:37
<Hixie>
for some definition of wrong
11:40
Philip`
goes away for half an hour
11:40
<Philip`>
Hixie: If it fails when calling some xpath method, that's because it needs lxml 2.0
11:41
<Philip`>
Otherwise it ought to work, hopefully :-)
11:41
<Hixie>
`now i get:
11:41
<Hixie>
ImportError: No module named lxml
11:41
<annevk>
dependencies suck
11:41
<Hixie>
yes.
11:41
<Philip`>
Ah, in that case you also need lxml 2.0
11:41
<Hixie>
html5lib used to be much easier to use :-)
11:42
<Hixie>
now i have to install it and add libraries...
11:42
<Philip`>
That's not html5lib's fault :-)
11:42
<Philip`>
It's just because I chose to use the lxml treebuilder (because that was nicer and faster than the DOM-ish one)
11:42
<Philip`>
It should be just "easy_install lxml" except lack of root probably makes that harder
11:42
<Hixie>
i have no idea how to install that dependency
11:43
<Hixie>
i don't have anything resembling a work environment here
11:43
<Hixie>
i have a directory.
11:43
<Hixie>
i'm lucky to have python.
11:44
<Philip`>
Do you have easy_install?
11:44
<Hixie>
no
11:44
<hsivonen>
IIRC, getting easy_install to do the right thing under Debian doesn't exactly qualify as 'easy'
11:45
<Hixie>
i think this is not debian
11:45
<Hixie>
i think it's fedora core 5 but i'm not 100% sure
11:45
<Philip`>
Hmm, that makes it much more of a pain...
11:46
<annevk>
no apt-get ?
11:47
<Hixie>
i don't have root
11:47
<Philip`>
Fedora 7 only has lxml 1.3 packages
11:47
<Hixie>
so even if it was debian, apt-get wouldn't help
11:47
<Philip`>
(though you can --enablerepo=development to get 2.0 on there)
11:47
<Hixie>
also, i don't have libxml2 or libxslt
11:47
<Hixie>
so i can't compile lxml2
11:47
<Philip`>
or python-dev?
11:47
<Hixie>
python-dev?
11:48
<Philip`>
like the CPython header files and stuff
11:48
<Hixie>
$ python setup.py build
11:48
<Hixie>
Building lxml version 2.1.beta3-55506.
11:48
<Hixie>
NOTE: Trying to build without Cython, pre-generated 'src/lxml/lxml.etree.c' needs to be available.
11:48
<Hixie>
ERROR: /bin/sh: xslt-config: command not found
11:48
<Hixie>
** make sure the development packages of libxml2 and libxslt are installed **
11:48
<Hixie>
...
11:48
<Hixie>
gcc: src/lxml/lxml.etree.c: No such file or directory
11:48
<Hixie>
gcc: no input files
11:48
<Hixie>
error: command 'gcc' failed with exit status 1
11:48
<annevk>
maybe use the Web service from Philip` instead?
11:48
<Hixie>
that may be the better idea at this point
11:49
<Philip`>
Maybe me switching to lxml was a bad idea
11:49
<Hixie>
what's the uri for your cgi app?
11:49
<Philip`>
But it's nice when it works :-)
11:49
<annevk>
(the scary thing about installing is that it always leaves a lot of cruft behind; at least, I'm afraid of that)
11:49
<Philip`>
http://krijnhoetmer.nl/irc-logs/html-wg/20080601#l-16
11:50
<Hixie>
can you make it ping http://www.whatwg.org/specs/web-apps/current-work/do-pubrules-update when it's done? i don't trust my end to keep a connection open for two minutes, long running jobs have a tendency to get killed when load gets high.
11:50
Philip`
is too used to being able to install dependencies trivially
11:51
<Philip`>
Hixie: Like sending a GET request?
11:51
<Hixie>
yeah
11:51
Philip`
wonders what'll happen now that Googlebot knows that URL and will keep pinging it :-)
11:51
<Hixie>
i guess the mutlipage url will get refreshed more oten!
11:51
<Hixie>
often
11:52
<annevk>
prolly the referer is checked
11:52
<annevk>
or maybe not :)
11:52
<Philip`>
I could do that in, uh, the next hour or so
11:52
<Hixie>
cool
11:52
<Hixie>
annevk: it's a two line shell script that just downloads the file and unzips it
11:52
<Hixie>
i wonder how i make wget do a post
11:52
<Philip`>
(Is it fine if my script still attempts to keep the connection open for a couple of minutes?)
11:52
<Hixie>
aha, --post-data
11:52
<Hixie>
Philip`: sure
11:52
<Philip`>
Hixie: Ah, good, since I'm not sure how to close it
11:53
<Hixie>
you just run another script in the background to do the real work
11:53
<Hixie>
:-)
11:53
<Philip`>
I've not quite worked out how to do that so the caller returns and the background process doesn't die
11:54
<Philip`>
But anyway I need to go now :-)
11:54
<Hixie>
with apache i've never had a problem just doing "foo.sh&" from within a bash shell script cgi
11:55
<Hixie>
aw man, Philip` used bz2
11:55
<Hixie>
tar doesn't support bz2 last i checked
11:55
<Hixie>
ooh, -j
11:56
<hsivonen>
gnutar support everything
11:56
<hsivonen>
+s
11:56
<hsivonen>
whew. I've now flushed zcorpan's IRC bug reports to bugzilla
12:00
<Hixie>
oh right, Philip`'s script doesn't add all the magic symlinks i had in my version
12:03
<Hixie>
ok
12:03
<Hixie>
all fixed
12:05
<zcorpan>
hsivonen: i guess i could report to bugzilla directly in the future :)
12:05
<hsivonen>
zcorpan: that would be nice
12:06
<hsivonen>
zcorpan: thanks for the reports, btw. Fixing now.
12:06
<zcorpan>
hsivonen: np
12:15
<Hixie>
i guess i should sleep
12:26
<Hixie>
annevk: any chance i can throw scrollIntoView() into an actively maintained spec you're working on?
12:27
<annevk>
i've looked into adding it to CSSOM View
12:27
<annevk>
as that'd make the most sense
12:27
<Hixie>
yeah
12:27
<Hixie>
i have an XXX in the html5 spec about doing that
12:27
<annevk>
i wonder if there are many outstanding issues
12:27
<Hixie>
i'm going through them now
12:27
<annevk>
cause cssom view is more or less done apart from the insane thing that is offset*
12:28
<Hixie>
only cos you took out all the other hard bits and put them into a separate spec :-P
12:28
<annevk>
i'm trying to round up xhr1 stuff now, not sure when cross-site stuff is going to be done but I suppose I should start pushing CSSOM View again as the CSSWG resolved to publish something and it hasn't happened yet
12:29
<Hixie>
the most important part -- how css gets bootstrapped from html5 -- is still undefined, right? or rather, is in the unmaintained other cssom spec?
12:29
<annevk>
Hixie, oh, alternate style sheets etc., yeah
12:29
<Hixie>
(offset* is pretty important too)
12:30
<Hixie>
(and desperately needs a thorough spec)
12:30
<annevk>
i agree and offset* is specced
12:30
<annevk>
the problem is deciding which of the various options we have is the best
12:31
<Hixie>
yeah, i know the feeling
12:31
<annevk>
because offset* is completely broken :)
12:31
<Hixie>
you have to study existing content, read bugs in the various UAs' bug databases, etc
12:31
<Hixie>
it's a lot of work :-(
12:32
<annevk>
yeah, i did all the research, but there's no clear answer
12:32
<Hixie>
the html parser section was the worst so far for me
12:32
<Hixie>
talk about no clear answer :-)
12:32
<annevk>
hehe
12:32
<annevk>
i like the html parser
12:32
<annevk>
it's one of the better parts of html5 despite not being one of the original goals
12:32
<Hixie>
heh
12:33
<Hixie>
it's so complicated it isn't susceptible to bike shed colour discussions
12:33
<Hixie>
which helps
12:35
<virtuelv>
Hixie: don't misunderestimate the tendency people have to look at any building as if it was a bikeshed
12:35
<hsivonen>
I like how my one-sentence spec comments on the parsing section get answered with "Done" without a bikeshed thread in between
12:36
<Hixie>
virtuelv: there have been virtually no bikesheds on the parser section, so i think my statement is true
12:42
hsivonen
thinks it's sad that Adobe keeps breaking their greatest hit (PDF) by mixing Flash with it
12:43
<Hixie>
lol i can't validate the spec using validator.w3.org, it crashes
12:44
<hsivonen>
the spec is a good stress test case
12:47
<Hixie>
ok annevk, i checked in a pubrules-compliant version
12:47
<Philip`>
Hixie: Might it be nice/possible for the 404 page to include link-fixup.js so that e.g. http://www.whatwg.org/specs/web-apps/current-work/multipage/outdated.html#introduction sends you to the right place?
12:48
<Hixie>
i'm about to go to bed
12:48
<Hixie>
mail me the details
12:49
Philip`
wonders why http://www.whatwg.org/specs/web-apps/current-work/multipage/stdout.txt is text/html
12:49
<Philip`>
Hixie: Okay, will do
12:49
<Hixie>
i have a default text/html mime type iirc
12:49
<Hixie>
feel free to include a .htaccess file in the bz2 file
12:50
<annevk>
and stuff that steals cookies :D
12:51
<Hixie>
he could do far worse than that
12:51
<Hixie>
he could put arbitrary binary code in there
12:51
<Philip`>
Hmm, I could use that to make everyone who views the multipage spec unknowingly vote for my favourite issues in http://www.whatwg.org/issues/
12:51
<Hixie>
and then run it
12:52
<Hixie>
and steal my e-mail
12:52
<Philip`>
Stop giving me ideas :-(
12:52
<Hixie>
and anything on any of my sites
12:52
<Hixie>
you could just log into the database and set the vots to whatever you wnat :-)
12:52
<zcorpan>
doesn't <video> work on safari for windows?
12:52
<roc>
it works
12:52
<Hixie>
Philip`: of course if you did any of these things and got caught, it would probably not do your career much good :-)
12:53
<roc>
however, it only integrates with Quicktime
12:53
<Philip`>
Hixie: I'll have to make sure I don't get caught, then
12:53
<Hixie>
:-)
12:53
Hixie
sets up his script to automatically send him diffs of every update
12:58
<jwalden>
the XSS might make someone cross
12:58
<jwalden>
the ensuing spectacle would be quite a site
12:58
jwalden
decides not to try for the second S
12:59
<Hixie>
there
12:59
<Hixie>
philip can't run arbitrary code anymore
12:59
<Hixie>
i force the permissions to non-executable
13:00
<annevk>
so much for trust :p
13:00
<Hixie>
it's not really about trusting him
13:01
<Hixie>
so much as reducing the number of hosts that can be used as attack vectors into my site
13:01
Philip`
doesn't trust himself
13:04
annevk
wanted to thank jgraham for http://meyerweb.com/eric/thoughts/2008/06/02/the-missing-link/?#comment-382177 but he's not around
13:07
<Hixie>
nn
13:26
<hsivonen>
what's the <> tag called in SGML?
13:27
<hasather>
hsivonen: empty start-tag
13:28
<hsivonen>
hasather: thanks
13:28
<annevk>
but why?
13:28
<annevk>
nobody wants to be bothered with SGML phrases :o
13:29
<Lachy>
hsivonen, http://www.is-thought.co.uk/book/sgml-9.htm#SHORTTAG is useful if you ever need to lookup SGML stuff
13:34
<hsivonen>
annevk: I'm trying to improve error messages
13:34
<hsivonen>
annevk: so I thought I should say on <> and </> that SGML features foo and bar aren't permitted in HTML
13:35
<zcorpan>
hsivonen: i'm not sure that's an improvement over saying that < must be written as &lt; in html
13:35
<annevk>
I guess I'm glad I asked since I agree with zcorpan
13:36
<hsivonen>
ok
13:36
<annevk>
pretending that SGML still exists or that HTML is somehow related to it doesn't seem useful
13:36
<hsivonen>
ok
13:36
<hsivonen>
would you agree, though, that <? should say XML processing instructions aren't permitted?
13:38
<annevk>
yeah, that seems ok
13:38
<annevk>
XML exists and people are mixing it and HTML all the time (or at least pretending to do that)
13:38
<zcorpan>
indeed
13:39
<zcorpan>
<?xml ... ?> is common in text/html
13:56
<hsivonen>
I changed various parser error messages
13:56
<hsivonen>
please let me know if it was an improvement
14:20
<gsnedders>
virtuelv: Hixie wrote them, not me! :P
14:21
<virtuelv>
gsnedders: ?
14:21
<gsnedders>
virtuelv: <dfn><code>h1</code></dfn>
14:21
<virtuelv>
ah
14:21
<gsnedders>
virtuelv: The point is it is the defining instance of that code block
14:23
<zcorpan>
hsivonen: is there an easy way to check what the new messages are? :)
14:23
<hsivonen>
zcorpan: I'll run a diff
14:28
<annevk>
gsnedders, in theory specific handling for <hx>... <dfn> ... <dfn> ... </hx> would be nice
14:29
<gsnedders>
annevk: Specific in what way?
14:29
<annevk>
it would be nice if the <dfn> didn't get an id= in that case and references to the <dfn> would instead point to the <hx>
14:32
<gsnedders>
Ah. That would be nice.
14:33
gsnedders
currently just wants it to work right now :P
14:34
<gsnedders>
OK, I've just got running the spec-gen down from 137.302s to 16.986s on HTML 5 :P
14:34
<hsivonen>
zcorpan: http://pastebin.ca/1037720
14:38
<zcorpan>
hsivonen: i think "unescaped logical not" goes over the heads of most authors since most authors don't know programming
14:41
<Philip`>
It seems pretty uncommon for "<>" to mean "logical not", since people don't write web pages in BASIC
14:41
<hsivonen>
should I mention only mistyped start tag?
14:41
<Philip`>
Oh, that's not logical not anyway, it's not-equal
14:42
<hsivonen>
umm right
14:42
<hsivonen>
I'll assume <> and </> are typoed tags
14:43
<annevk>
Probable cause: typo
14:43
<Philip`>
Almost all the <> I can see are in JS regexps
14:43
<annevk>
<> is acceptable within <script>, no?
14:43
<Philip`>
except
14:43
<Philip`>
http://geocities.com/scerez/: <td rowspan="7"><>
14:43
<Philip`>
http://members.aol.com/_ht_a/swinggaits/: <div align=left><font face='Arial,Helvetica,adobe-helvetica,Arial Narrow' size=3 color='#99cc33'><i>&nbsp;&nbsp; </i></font><font face='Arial,Helvetica,adobe-helvetica,Arial Narrow' size=3 color='#99cc33'><b>><><><><><><><><><><><><><><><><><><><><><><><><><><><</b></font><font face='Arial,Helvetica,adobe-helvetica,Arial Narrow' size=3 color='#99cc33'><i> </i></font></div>
14:44
<Philip`>
http://www.hakurai.ne.jp/: <param name="yradius" value="0"><>
14:44
<Philip`>
http://www.angelfire.com/in/HorseAndCarriageSo/: <><br>
14:44
<Philip`>
and so on
14:44
<annevk>
fun
14:45
<Philip`>
http://www.rad.de/: Auf rad.de bestimmt du selbst, was rad.de-Mitglieder lesen. Ob´s um Fahrräder, Touren, Tipps oder News geht <<>> die rad.de-Route ist alles andere als eine Einbahnstraße.</center>
14:45
<annevk>
hsivonen, maybe it should say "Use &lt;> instead."
14:45
<Philip`>
I'm not sure how many are typos and how many are decorative
14:45
<hsivonen>
annevk: that's not good advice if the probable cause is a tag typo
14:46
<gsnedders>
http://hg.gsnedders.com/spec-gen/file/tip/specGen/utils.py#l65 — that gives "" for <dfn><code>foo</code></dfn>
14:47
<annevk>
hsivonen, true
14:47
<zcorpan>
hsivonen: "Either you typoed a tag or you should replace < with &lt;"
14:47
<zcorpan>
(or something)
14:48
<Philip`>
gsnedders: Do you mean repr(Element.getroottree()) gives ""?
14:48
<gsnedders>
Philip`: Line 65
14:48
<gsnedders>
Something is wrong with it, obviously
14:48
<gsnedders>
Wait, I haven't reloaded that in ages
14:49
<gsnedders>
Line 70
14:49
<gsnedders>
My XPath is broken :(
14:49
<hsivonen>
I wonder how I should deal with bugs blocking on spec issues
14:49
<hsivonen>
LATER seems like a bugzilla anti-pattern
14:50
<annevk>
gsnedders, also, the <hx> in question should probably use the first <dfn> child for creating the id in question (unless id is already set)
14:50
<gsnedders>
Oh duh. descendant::text() is what I want
14:50
<annevk>
gsnedders, so <h4>The <dfn><code>em</code></dfn> element</h4> ends up as <h4 id=em>
14:50
<zcorpan>
hsivonen: the text you have for </> would work for <> too, i think (with s/end/start/)
14:50
<Philip`>
gsnedders: child::text() returns text childs, and <dfn> has no text childs
14:50
<gsnedders>
annevk: That's hard to do without forward scanning, which I don't really want to do
14:51
<gsnedders>
Philip`: Yeah, I just realised looking at the spec
14:51
<Philip`>
That's why it's called "child::text" :-)
14:51
<annevk>
gsnedders, else it would be something like <h4 id=the-em> right? what's the big difference?
14:51
<gsnedders>
Philip`: Just the difference between direct and indirect children :)
14:52
<gsnedders>
annevk: I'd rather it just used the textContent of the hx
14:52
<Philip`>
gsnedders: etree.tostring(Elements, method='text') works (at least in lxml 2.0) at finding the concatenated descendant text
14:52
<hsivonen>
zcorpan: ok. tweaking again
14:52
<Philip`>
(or at least I hope it does, since that's what I'm using)
14:54
<gsnedders>
http://stuff.gsnedders.com/html5.html — find broken xrefs that I've introduced (again)!
14:54
<zcorpan>
hsivonen: a likely cause of " in attribute name is that there's a " missing somewhere else, i think
14:55
<hsivonen>
zcorpan: which state?
14:55
<hsivonen>
I mean: at start? surely not inside?
14:56
<zcorpan>
@@ -2187,7 +2187,7 @@
14:56
<zcorpan>
* (') Parse error.
14:56
<zcorpan>
*/
14:56
<zcorpan>
err("Quote \u201C" + c
14:56
<zcorpan>
- + "\u201D in attribute name.");
14:56
<zcorpan>
+ + "\u201D in attribute name. Probable cause: \u201C=\u201D missing immediately before.");
14:57
<zcorpan>
the diff doesn't give much context :(
14:57
<zcorpan>
<p title=my title">
14:58
<hsivonen>
zcorpan: If an attribute name starts with ", isn't the likely cause something like: name "value"?
14:58
<hsivonen>
zcorpan: that's a different state
14:58
<zcorpan>
hsivonen: ah
14:58
<hsivonen>
I don't remember what I wrote
14:59
<hsivonen>
oops. you are right.
14:59
<hsivonen>
my message is bogus
14:59
<hsivonen>
thanks
15:00
<zcorpan>
hsivonen: hmm. <!DOCTYPE html><title></title><embed src ">
15:00
<zcorpan>
gives no errors
15:02
<zcorpan>
or had i reported that before?
15:02
<annevk>
i think you did
15:02
<zcorpan>
yeah. sorry for the noise :)
15:02
<annevk>
it seems that " is not catched on the tokenizing level which is why it is not catched
15:02
<annevk>
(you can figure that out by looking at <p title ">
15:02
<hsivonen>
zcorpan: It's a spec bug
15:03
<annevk>
)
15:05
hsivonen
has still 50 bytes to spare in the tokenizer loop
15:08
<gsnedders>
Hixie: What should the target time to run the spec-gen on HTML 5 be?
15:09
<Philip`>
gsnedders: Zero seconds
15:09
<gsnedders>
Philip`: html5lib takes 12s to parse it alone! :P
15:10
<gsnedders>
Philip`: (thanks for all the work you've done on perf. recently, though)
15:11
<Philip`>
gsnedders: It's not necessarily a reachable target, but it's the best one to aim at :-)
15:11
<takkaria>
I was reading about how python implements dictionaries internally last night, quite interesting from a performance point of view
15:12
<zcorpan>
hsivonen: isn't it better to write out the full doctype in the missing SI warnings?
15:13
<gsnedders>
I so much nicer to work on something when you can actually see the results of what you've done :)
15:13
<zcorpan>
hsivonen: as in "The doctype did not contain the system identifier prescribed by the HTML 4.01 specification. Expected \u201C<!DOCTYPE HTML PUBLIC \"-//W3C//DTD HTML 4.01 Transitional//EN\" \"http://www.w3.org/TR/html4/loose.dtd\">\u201D.";
15:15
<hsivonen>
zcorpan: I suppose. thanks
15:16
<zcorpan>
hsivonen: otherwise it looks good i think
15:17
<hsivonen>
zcorpan: thanks
15:17
<annevk>
gsnedders, less than a second would be nice
15:17
<gsnedders>
annevk: Give me chtml5lib first :)
15:18
<gsnedders>
I think 1.5s is realistically just about as quick as we'll get with current computers with chtml5lib to be fair
15:21
<Philip`>
chtml5lib needs to be multithreaded!
15:22
<takkaria>
I'm not sure there's much to parallelise
15:22
<annevk>
yeah, so it can do tokenizing before having an input stream and building a tree while downloading a resource
15:22
<takkaria>
I think my sense of humour was lacking there. oops
15:23
<hsivonen>
takkaria: if tokens were objects (as opposed to callbacks), the tokenizer and the tree builder could run on different threads
15:23
<hsivonen>
probably not useful
15:24
<takkaria>
(btw, hubbub (incomplete C html parser) takes 1.7s just to tokenise the current spec)
15:24
<hsivonen>
with Java and the V.nu parser it would be relatively easy to try two-threaded performance
15:25
<hsivonen>
zcorpan: fixed
15:25
<jmb>
takkaria: most of that's probably in the input stream handling.
15:25
<hsivonen>
Philip`: fixed image report on fatal error
15:26
<gsnedders>
takkaria: Well, WebKit is quicker than that :P
15:27
Philip`
gets another ~5% from improving cInputStream
15:27
gsnedders
fears the commit when this lands
15:27
<Philip`>
so it now parses the spec in about 12.6s, vs 16.5s for pure Python
15:28
<jmb>
s/probably//
15:28
<takkaria>
heh
15:29
<hsivonen>
Philip`: from what data source? (disk, RAM?)
15:29
<Philip`>
takkaria: Via http://krijnhoetmer.nl/irc-logs/html-wg/20070710#l-239 I got 0.35s for tokenising in C++
15:29
<Philip`>
(on a slightly old version of the spec)
15:29
<Philip`>
(on a P4 3.0GHz, I think)
15:29
annevk
wonders how long Validator.nu takes
15:30
<Philip`>
annevk: That's the "Java" one in those results
15:30
hsivonen
prepares to run a benchmark
15:30
<Philip`>
hsivonen: Disk (but cached)
15:31
<hsivonen>
Philip`: crossing to the native file system seems expensive in Java
15:31
<Philip`>
hsivonen: It's relatively inexpensive in Python, just because the rest of the parser is a hundred times more expensive :-)
15:31
<annevk>
Philip`, how about parsing?
15:32
<annevk>
for validating I get "Total execution time 23573 milliseconds."
15:32
<Philip`>
annevk: I've never had a working C++ parser so I've never had anything interesting to compare
15:33
<Philip`>
cInputStream seems kind of worthwhile for performance - does anyone know what would be the nicest way to try including it in html5lib?
15:34
<Philip`>
(It just defines a class with char() and charsUntil(), and you stick it together with the normal HTMLInputStream (currently via inheritance) and then it goes faster)
15:34
<Philip`>
(or it crashes)
15:35
<gsnedders>
annevk: http://bugs.gsnedders.com/issues/show/5 — happy? :)
15:37
<annevk>
ah, feature request 5, now you have to fix it :p
15:37
<annevk>
(or maybe it's that it will never be finished :D )
15:38
<hsivonen>
Parsing the spec from an in-memory UTF-16 buffer (but using all the Reader cruft in between) takes 0.12 seconds on average in the true streaming mode with a content handler that doesn't do anything
15:38
<gsnedders>
annevk: Yeah, but it's only down for 1.1 :P
15:38
<gsnedders>
annevk: Though there are no open issues on 1.0 :D
15:39
<gsnedders>
(1.0 is mainly just finish it)
15:40
<hsivonen>
I'm too lazy to test how much faster the parser would be without a Reader in there
15:41
<Philip`>
Almost exactly two orders of magnitude faster than Python
15:41
<hsivonen>
oh and this was HotSpot from Java 5 32 bit client on 2.4 GHz Core 2 Duo
15:41
<takkaria>
we should record benchmarks somewhere, probably
15:42
Philip`
could only get another 10% if he made char and charsUntil take zero time
15:44
gsnedders
wonders how to implement the TOC algorithm
15:47
gsnedders
adds something for 1.0
15:49
<hsivonen>
in case anyone feels like running better benchmarks, please use V.nu parser trunk instead of release
16:57
<Philip`>
Manual reference counting in C is not fun
16:59
<annevk>
fun vs fast
16:59
<Philip`>
It's not particularly fast either :-)
16:59
<annevk>
:p
17:00
<Philip`>
The fastest approach is to just leak memory
17:00
<Philip`>
and the second fastest approach is to just leak memory and then scramble around to detect all the leaks once you're about to run out
17:00
<Philip`>
*run out of memory
17:03
<hsivonen>
the autoreleasepool solution is pretty cool (out of solutions that don't involve a proper garbage collector)
17:03
<annevk>
Philip`, OOM-safe should probably be a requirement
17:04
<hsivonen>
(the Mozilla 2 OOM approach looks interesting)
17:06
<Philip`>
if (!(m = malloc(size))) alert("Buy more RAM")
17:27
hsivonen
tries to understand what the W3C AB does
17:28
<hsivonen>
are the examples of past cases where AB has given the W3C an opinion on something?
17:29
<annevk>
see November/December 2007 in http://lists.w3.org/Archives/Member/process-issues/#latest
17:29
<annevk>
(minus #latest)
17:30
<hsivonen>
annevk: thanks
18:40
Philip`
gives up trying to optimise cInputStream, since there's not a lot left to gain :-(
18:40
<zcorpan_>
why is pixelratio='' only on <source>, not on <video>?
20:35
<hsivonen>
whoa! according to Simon Willison, libfbml depends on Firefox
20:37
<hsivonen>
hmm. http://wiki.developers.facebook.com/index.php/FBML_DTD
20:37
<hsivonen>
XSD is the new DTD
21:00
<zcorpan_>
hsivonen: is utf-8 assumed before the encoding decl is found?
21:01
zcorpan_
looks at http://validator.nu/?doc=http%3A%2F%2Fwww.accessify.com%2F&showsource=yes
21:06
<hsivonen>
zcorpan_: windows-1252 is assumed
21:07
<zcorpan_>
hsivonen: ok
21:07
<zcorpan_>
hsivonen: what's up with "Stray end tag noscript." in http://validator.nu/?doc=http%3A%2F%2Fwww.accessify.com%2F&charset=UTF-8&showsource=yes ?
21:08
<hsivonen>
interesting.
21:08
<hsivonen>
I'll have to investigate in the morning
21:08
<hsivonen>
thanks
21:08
hsivonen
notes the alt on the stat single-pixel image on an accessibility site...
21:09
<hsivonen>
nn
21:09
<zcorpan_>
nn
21:22
<Dashiva>
annevk, zcorpan_: around?
21:23
<zcorpan_>
Dashiva: yep
21:24
<Dashiva>
I just noticed <a href> elements tostring as their href, has that always been the case?
21:24
<zcorpan_>
yeah
21:25
<zcorpan_>
dunno if it's specced anywhere
21:25
<Dashiva>
Strange how I've never noticed before
21:25
<Dashiva>
That was my second question :)
21:25
<zcorpan_>
i think html5 is waiting for webidl to get fancy features to support this sort of thing
21:26
<Hixie>
html5 does spec it
21:26
<zcorpan_>
oh
21:26
<Hixie>
btw zcorpan_ regarding your earlier question, it's on <source> only to discourage its use.
21:26
<zcorpan_>
[Stringifies=href] interface HTMLAnchorElement
21:26
<zcorpan_>
Hixie: ok
21:27
<gsnedders>
0.924s for the spec-gen (admittedly, with only xref) with XML source/output of the HTML 5 spec (which is the sort of speed we should be able to get with a C impl. of html5lib)
21:27
<gsnedders>
Hixie: What sort of speed do you want the spec-gen to run in, at the very slowest?
21:28
zcorpan_
'd say 0.924s
21:28
<Hixie>
i WANT it to
21:28
<Hixie>
silly cat
21:28
<gsnedders>
Hixie: You see my latest tweet?
21:28
<Hixie>
i WANT it to run in 500ms or so
21:29
<gsnedders>
"I HAS SPECIFICASHUN GENERATOR! "
21:29
<Hixie>
but i'll put up with whatever i can get :-)
21:29
<gsnedders>
Hixie: How quick is the W3C spec-gen?
21:29
Hixie
checks
21:30
<Hixie>
(and no, i don't follow twitter)
21:30
<gsnedders>
(I expected that)
21:30
<gsnedders>
(Which is why I quoted it here)
21:31
<Hixie>
timing...
21:31
<Hixie>
14 seconds
21:31
<Hixie>
an eternity
21:32
<Hixie>
well, 28 seconds since i do it twice, but i run them in parallel at this point
21:32
<Philip`>
My partly-optimised-in-C html5lib takes, uh, 12.3 seconds to parse the spec :-(
21:32
<gsnedders>
OK, if someone writes a fully-optimised-in-C html5lib, then we can be under 3s, I expect, without much more optimisation of the spec-gen itself
21:34
<Philip`>
I kind of like the approach of rewriting small pieces of the Python functionality into a C extension, but keeping all the complex rarely-called fast-enough bits in Python
21:34
<Philip`>
but I'm not sure how far that approach could be used in html5lib
21:34
gsnedders
wonders whether to switch to doing AH Computing from AH English just so he can use the AH Computing project to learn C and write an html5lib parser :P
21:36
<Philip`>
(Probably the next step would be to replace the entire tokeniser class with a pure C implementation, but the treebuilders are still kind of slow too)
21:38
<gsnedders>
4.4s to serialize, 11.3s to parse here
21:38
<gsnedders>
leaving the other 0.8s for me to waste myself :)
21:40
<gsnedders>
Philip`: "Cue ranges" is missing a </dl> because my Feb 13th source is missing one. Damned Hixie!
21:44
<gsnedders>
Most of the bugs are down to Hixie making my input buggy :(
21:45
<Philip`>
Most of the bugs are down to you not using an up-to-date copy of the spec :-p
21:45
<gsnedders>
Philip`: If someone hadn't broken it in the first place… :P
21:46
<Hixie>
in february i had something like 3000 known bugs with the spec
21:46
<Hixie>
i still have over 2000 now!
21:46
<gsnedders>
whatwg.org is down, seemingly :(
21:46
<Hixie>
yeah
21:46
<Hixie>
not sre why
21:46
Philip`
looks innocent
21:46
<gsnedders>
Hixie: I means bugs like <a href="#appcache-history-1">change</span> (in the source)
21:53
gsnedders
follows Philip`'s suggestion to get rid of the last bit of XPath, making it .3s quicker
21:55
<Philip`>
I never suggested getting rid of the last bit of XPath
21:55
<Philip`>
I actually suggested the exact opposite
21:55
<gsnedders>
pedantic sod :)
21:55
<gsnedders>
http://hg.gsnedders.com/hgwebdir.cgi/spec-gen/rev/1a00bbac777e
21:55
<Philip`>
(keeping it to find all elements with ids)
21:56
<gsnedders>
Did you? I forgot that. But that's already gone. :P
21:59
<gsnedders>
Philip`: 0.718s (for //*[@id]) v. 0.185s (for iter over all elements, and manually checking if they have an @id)
22:07
<gsnedders>
Hixie: can I still `cat header-whatwg source` and just ignore the *.inc files?
22:09
<gsnedders>
Hixie: Can I get you to fix a rather major bug in the spec source, that the real spec-gen silently changes?
22:10
<gsnedders>
'<a href="#appcache-history-1">change</span>' ought to have a closing </a>, not a closing </span>
22:10
<annevk>
oh, whatwg.org is back up
22:10
<gsnedders>
oh wait
22:10
<gsnedders>
I'm being dumb
22:10
annevk
thought it was down from the logs and hesitated to check
22:11
<gsnedders>
I'm just looking at html5.old, my old copy
22:11
<gsnedders>
Hixie: Do nothing.
22:11
<Hixie>
i can do that
22:12
<gsnedders>
Hixie: BTW, you've made it run slower :(
22:12
<gsnedders>
Hixie: Though that was just by making the spec longer.
22:12
<Hixie>
heh
22:13
<gsnedders>
Hixie: Less detail, plz
22:13
<gsnedders>
Oh dear. I don't think this is good. id="prose-content."
22:14
<annevk>
Hixie, hey, we managed to publish something today :)
22:14
<annevk>
Hixie, http://www.w3.org/TR/offline-webapps/
22:14
<gsnedders>
http://validator.nu/?=&doc=http%3A%2F%2Fstuff.gsnedders.com%2Fhtml5.html — they're all down to html5lib's brokenness, as far as I can see
22:15
<annevk>
given that I don't see myself doing anything else useful today I'll write a short blog entry for blog.whatwg.org
22:16
<jgraham_>
Oh, blogging. Yeah I was going to write something about @media. I wonder if Lachy put the slides up yet
22:17
<jgraham_>
gsnedders: That looks simple to fix. Patches welcome :)
22:17
<annevk>
Lachy, you around? Can you make me admin or something on the blog?
22:17
jgraham_
could just not be lazy and fix it himself
22:17
<gsnedders>
jgraham_: Or just commits :)
22:17
<annevk>
Lachy, I can't even add a category
22:18
<jgraham_>
annevk: I can do that, I think
22:18
<annevk>
cool
22:18
<gsnedders>
I think I managed to break stuff, which is odd.
22:19
<jgraham_>
Er, I don't seem to be able to log in at the moment
22:20
<gsnedders>
jgraham_: We can allow < in unquoted attributes too
22:21
<gsnedders>
jgraham_: Can I change a test without being killed?
22:21
<jgraham_>
gsnedders: Which test? And why?
22:21
<gsnedders>
jgraham_: Change: u'<span title="foo<bar">' to u'<span title=foo<bar>'
22:21
<gsnedders>
jgraham_: < is allowed in unquoted attributes
22:22
<jgraham_>
That sounds reasonable
22:22
jgraham_
is not too familiar with the serializer tests
22:23
gsnedders
isn't either.
22:23
<gsnedders>
I just changed to code to match the spec and looked at what tests it caused to fail :)
22:24
<jgraham_>
When I try to log in to the WHATWG blog it tells me that I got my password wrong. When I try to reset my password, it tells me that the link it sent me has an invalid key
22:24
<Hixie>
uh
22:24
<Hixie>
did we get owned or something
22:25
<annevk>
it worked for me, but I don't have an admin account
22:25
gsnedders
commits to html5lib
22:25
<jgraham_>
(it is possible that I really got my password wrong of course)
22:26
<jgraham_>
(but the reset thing should work)
22:27
<Hixie>
man the wordpress ui changed since hte last time i was here
22:27
<Hixie>
annevk: made you admin
22:27
<Hixie>
annevk: can you work how jgraham_'s problem? :-)
22:27
<Hixie>
work out, even
22:28
<annevk>
hehe
22:28
<annevk>
lots of new features
22:28
<gsnedders>
http://stuff.gsnedders.com/html5.html — now valid!
22:28
<annevk>
jgraham_, i'll pm you on w3.org
22:29
<Hixie>
oh i see how to reset the password
22:29
<Hixie>
i can reset it right here if you want jgraham_
22:29
<Hixie>
or anne can do it
22:29
<Hixie>
whichever :-
22:29
<Hixie>
)
22:29
<annevk>
i just did
22:29
<Hixie>
cool
22:30
<Philip`>
gsnedders: When making changes like allowing =, it would be good to add a test that fails in the old code and passes in the new code
22:30
<gsnedders>
Philip`: That is true, actually
22:31
<gsnedders>
Philip`: And fixing the description for the change
22:31
<gsnedders>
jgraham_: Should I change the order to make it more logical, or keep it verbatim?
22:32
<jgraham_>
gsnedders: Of the tests? Change the order. Our tests should really be sorted into some sort of structure so tiny baby steps can' hurt
22:33
gsnedders
commits
22:39
<Hixie>
Philip`: k, the 404 thing should work now, let me know if it is broken
22:40
<Philip`>
Hixie: Oops
22:40
<Philip`>
Hixie: It needs <body onload="fixBrokenLink()">
22:40
Philip`
forgot about that part
22:41
<annevk>
http://blog.whatwg.org/offline-webapps
22:41
<Hixie>
try now?
22:42
<Philip`>
Hixie: Works now - thanks!
22:42
<Hixie>
cool
22:42
<Hixie>
it runs a couple of sed commands on the original whatwg 404 page
22:42
<Hixie>
and creates the .htaccess file to use it
22:42
<Hixie>
so hopefully if i modify the main whatwg 404 page, it'll keep track
22:42
<Philip`>
Hmm, doesn't work in IE because IE replaces the whole error page
22:43
<Hixie>
sucks to be an IE user
22:43
<Hixie>
bbiab
22:44
<Philip`>
Oh, real IE7 (vs Wine IE6) does display the error page
22:44
<Philip`>
but has a "permission denied" JS error while running the script :-/
22:45
<Philip`>
Oh, it worked this time
22:46
<Philip`>
That's good enough for me
22:48
Philip`
will ignore the problem that old links to .../multipage/section-foo.html (with no #) are broken and don't automatically fix themselves
22:53
<annevk>
Philip`, that should be fixable too without too much trouble, no?
22:53
<annevk>
Philip`, given that the file names are based on id= too
22:54
<Philip`>
annevk: Hmm... I suppose if I knew when it was 404, then I could do that
22:55
<Philip`>
Hixie: Would it be possible to change to <body onload="fixBrokenLink(404)"> ?
22:58
<Philip`>
Hixie: (By the way, I've added the do-pubrules-update call now)
23:13
annevk
adds <p> and </p> to his blog entry
23:31
<annevk>
ARIA expressed in CSS? Hmm. I guess I better not reply to that as I might offend someone...