2008-08-01 [17:09:00.0000] maybe i should spend more time fixing the spec and less time responding to sam and insulting the tag. [17:10:00.0000] I must have missed the TAG insult email [17:10:01.0000] just sent it [17:17:00.0000] /me sighs [17:17:01.0000] Is it silly the sub. docs is longer than xref and TOC/numbering docs put together? [17:42:00.0000] http://www.whatwg.org/issues/data.html now shows labels in browsers that support the html5 fillText() api [17:44:00.0000] damn, I have to go and download one now... [18:16:00.0000] /me realises he's completely screwed up 1.0b1 spec-gen docs by having no external links [18:16:01.0000] Nevertheless, go get now! [18:17:00.0000] Also, if there's anyone who should be in the ack but isn't, do email me [18:18:00.0000] http://hg.gsnedders.com/hgwebdir.cgi/spec-gen/rev/fab6bfa129aa (see the bzip/zip/gz links to download) [18:18:01.0000] Oh dear. [18:18:02.0000] The docs say 1.0b1-dev to [18:18:03.0000] *too [18:19:00.0000] I really am too tired :P [18:44:00.0000] sam is rich [18:44:01.0000] "please don't dismiss me" he says, after not replying to almost any of the questions i ask him [18:45:00.0000] like, i write an e-mail "here's how you could help us move forward: X. So far you haven't helped us move forward." [18:45:01.0000] and he replies "You say I haven't helped you move forward! Whine whine whine." [18:45:02.0000] I think that thread is a waste of your time, fwiw, and you should probably stop replying [18:45:03.0000] i'm gonna see if he replies to the questions i asked [18:45:04.0000] if he does, we could make progress [18:46:00.0000] if he doesn't, i'll add him to my filter that labels e-mails as being "AAA IMPORTANT/CRITICAL" [18:53:00.0000] Hixie, I assume you saw the posts about getting WF2 integrated? have you plans to do that soon now that we seem to have some consensus on it? [18:54:00.0000] yeah gonna do that after we publish next month [18:55:00.0000] awesome. :) [20:05:00.0000] is it possible to modify image opacity (after it was rendered on canvas)? [01:35:00.0000] apparently, browsers don't treat a bogus internal encoding decl after a BOM as an error in XML: http://upload.wikimedia.org/wikipedia/commons/3/3a/Bahia_Municip_Itapicuru.svg [01:36:00.0000] nzkoz: you were looking for me? [02:03:00.0000] I had dinner with friends who write software. [02:04:00.0000] it seems to me that when people who have had to deal with Namespaces in XML can talk freely, they never have anecdotes about how Namespaces have helped them [02:04:01.0000] instead, they have negative comments [02:05:00.0000] OTOH, devil's advocate scenarios where Namespaces could help come from people who don't have to deal with Namespaces as part of their work [02:15:00.0000] Namespaces are an example of the Fundamental Software Engineering Error [02:15:01.0000] which is that something too terrible to actually use can be fixed by adding a level of indirection [02:16:00.0000] sometimes that is true but software engineers try to do it even when it clearly is not [02:23:00.0000] othermaciej: do you mean that URI-based extensibility is the too terrible thing in this case? [02:23:01.0000] using URIs as a namespace identifier for tags in a markup language [02:23:02.0000] is the terrible thing [02:24:00.0000] if you had to mention the URI on every tag it would be clearly unusable [02:24:01.0000] but since URIs are *obviously* the one true form of unique identifier, you add a level of indirection instead of rethinking why you are using them [02:25:00.0000] or why URIs that are not meant to be dereferenced should start with http: and have a hostname [02:38:00.0000] Hixie: if Google Translate isn't observing
 now, why would it observe some other "do not translate" marker?

[02:41:00.0000] 
I wonder if Web authors would bother to annotate their stuff for machine translation

[02:42:00.0000] 
If sometimes there are s it ought to translate, it could just default to not translating and have some popup UI when you move the mouse over that text to offer to translate it

[02:46:00.0000] 
hsivonen: good question

[02:47:00.0000] 
hsivonen: though it wouldn't help with the wikipedia example, since that's not marked up right

[02:53:00.0000] 
/me expects a new round of Distributed Extensibility around ITS

[03:15:00.0000] 
In case anyone is wondering about Validator.nu weirdness, the DNS server that Validator.nu use for resolving addresses for outgoing connections is being really slow to respond today

[03:15:01.0000] 
is it being attacked?

[03:15:02.0000] 
i hear there are attacks going on now

[03:15:03.0000] 
I don't know.

[03:19:00.0000] 
are the mozilla devs still trapped in Whistler?

[03:19:01.0000] 
https://bugzilla.mozilla.org/show_bug.cgi?id=448604

[03:21:00.0000] 
hsivonen: I just updated my local validator.nu and now getting "Exception in thread "main" java.lang.NoClassDefFoundError: org/mortbay/jetty/Connector" error

[03:22:00.0000] 
MikeSmith: did you run build.py with target 'all' or 'dldeps'?

[03:22:01.0000] 
with "run"

[03:22:02.0000] 
MikeSmith: try dldeps first and then run again

[03:22:03.0000] 
k

[03:23:00.0000] 
OK, I see it's downloading the new dependencies now

[03:27:00.0000] 
hsivonen: btw, the dldeps can sometimes be a PITA because certain downloads often fail with "Connection reset by peer" messages, and the download doesn't retry, so I have to go back and retry it manually

[03:27:01.0000] 
happening now with the http://download.icu-project.org/files/icu4j/4.0/icu4j-4_0.jar download

[03:28:00.0000] 
MikeSmith: yeah, I'm experiencing problems with DNS right now. It has worked until now, so there hasn't been a need to make it retry before...

[03:28:01.0000] 
OK

[03:28:02.0000] 
but yeah, I should probably make it retry

[03:33:00.0000] 
MikeSmith: the subversion link is so that people can use a svn client to get the complete log, blame, diffs, etc (it's not supposed to be accessed from a browser)

[03:34:00.0000] 
MikeSmith: would it make sense to expose it as a non-hyperlinked url, maybe?

[03:34:01.0000] 
MikeSmith: (other changes look fine)

[03:35:00.0000] 
great

[03:35:01.0000] 
Hixie: yeah, I think a non-hyperlinked "svn checkout http://svn.whatwg.org/webapps/" would be good 

[03:35:02.0000] 
cool, will do that then

[03:35:03.0000] 
thanks

[03:38:00.0000] 
'svn blame' isn't very useful, since it blames Hixie for everything

[03:38:01.0000] 
You mean it isn't all his fault?

[03:38:02.0000] 
i use it a lot to track which version number a line was last edited in

[03:39:00.0000] 
gDashiva: It is, but we know that already

[03:39:01.0000] 
Philip`: I deployed a new XML serializer. Feel free to try to break it.

[03:40:00.0000] 
hsivonen: I might have a look when I have fewer urgent things to work on :-)

[03:41:00.0000] 
MikeSmith: "choice of means" kind of sounds kooky to me so i'm changing that paragraph

[03:42:00.0000] 
now it just reads:   There are various ways to follow the change history for the specification:

[03:42:01.0000] 
Hixie: yeah, sounded funny to me too :) I just couldn't think of better wording..

[03:42:02.0000] 
your revision sounds great to me

[03:45:00.0000] 
i also changed your  construct to just a list of s, since s can have multiple s per 
s

[03:46:00.0000] 
yeah, that's cleaner

[03:47:00.0000] 
hey there's no link to the issues list either

[03:47:01.0000] 
should we add taht?

[03:47:02.0000] 
i guess i forgot to add it when i added it to the whatwg copy when daniel asked

[03:49:00.0000] 
Hixie: yeah, seems like that would definitely be good to have too

[03:51:00.0000] 
ok here's what i have so far: http://www.whatwg.org/specs/web-apps/current-work/.w3c/Overview.html

[03:52:00.0000] 
/me looks now

[03:52:01.0000] 
/me isn't sure he likes the text of the "HTML 5 bug/issue-tracking service" link but doesn't have a better suggestion

[03:52:02.0000] 
too many capitals, numbers, and types of punctuation in short successon

[03:52:03.0000] 
succession

[03:54:00.0000] 
Hixie: yeah, that "HTML 5 bug/issue-tracking service" wording definitely klunky

[03:54:01.0000] 
anyway, revised SOTD overall looks great 

[03:54:02.0000] 
how about just "our public bug tracker"?

[03:54:03.0000] 
or database

[03:55:00.0000] 
"public bug database" sound best, i think

[03:56:00.0000] 
"submit them to our public bug database"

[03:56:01.0000] 
ok: http://www.whatwg.org/specs/web-apps/current-work/.w3c/Overview.html

[03:56:02.0000] 
oh you think s/using/to/? i can do that too if you want

[03:56:03.0000] 
beautiful

[03:57:00.0000] 
nah, "using" is fine

[03:57:01.0000] 
oh i should probably update the link to the bug list to not be the list that i use but hte list that includes all the bugs i am hiding from myself too!

[03:57:02.0000] 
e.g. the ones i reassign to you :-)

[03:57:03.0000] 
heh, yeah

[03:58:00.0000] 
hmm. the resolf.conf on the validator.nu machine is interesting

[04:00:00.0000] 
Hixie: http://www.w3.org/Bugs/Public/buglist.cgi?quicksearch=ALL+product:HTML+-status:RESOLVED+-status:CLOSED

[04:00:01.0000] 
I think

[04:00:02.0000] 
for the "bug database" link

[04:00:03.0000] 
hmm, though I see that picks up the authoring-guide also

[04:01:00.0000] 
anyway, some form that quicksearch feature would seem best

[04:03:00.0000] 
http://www.w3.org/Bugs/Public/buglist.cgi?component=Spec%20bugs&component=Spec%20proposals&product=HTML%20WG&resolution=NEEDSINFO&resolution=LATER&resolution=REMIND&resolution=---&order=bugs.resolution%2Cbugs.priority%2C%20bugs.bug_severity

[04:04:00.0000] 
checked in

[04:39:00.0000] 
wow, so Dean Edridge might be becoming an editor? http://lists.w3.org/Archives/Public/www-archive/2008Aug/0002.html

[04:39:01.0000] 
it'll be interest to see how well he manages

[04:53:00.0000] 
hmm. Jigsaw is pretty seriously vintage Java...

[04:56:00.0000] 
Namespaces (java packages) don't solve the problem of the contents of the namespace being different in 2000 and 2008

[05:01:00.0000] 
hsivonen: I have another question about the http://svn.versiondude.net/whattf/syntax/trunk/relaxng HTML5 schema

[05:03:00.0000] 
does it actually capture the content-model constraints around phrasing prose/phrase/flow content?

[05:03:01.0000] 
MikeSmith: it should

[05:03:02.0000] 
subject to bugs, of course

[05:03:03.0000] 
MikeSmith: however, exclusions are handled in Schematron

[05:04:00.0000] 
and it doesn't capture the new transparent  thing yet

[05:04:01.0000] 
OK. maybe I need to look at the assertions. I haven't much yet.

[05:04:02.0000] 
MikeSmith: do you have a test case that misvalidates?

[05:04:03.0000] 
for a specific for example, where is the constraint that a  can't have a 
 as a child?

[05:05:00.0000] 
/me looks

[05:05:01.0000] 
hsivonen: 
foo
 doesn't misvalidate, but validator.nu doesn't actually seem to get to the point of validating it

[05:06:00.0000] 
because it seems that the parser fixes it before it gets to the validation stage

[05:06:01.0000] 
MikeSmith: in block.rnc, p.inner is defined to be ( common.inner.phrase )

[05:07:00.0000] 
hsivonen: right, and common.inner.phrase = text & common.elem.phrase*

[05:07:01.0000] 
and common.elem.phrase = common.elem.embedded

[05:07:02.0000] 
MikeSmith: yeah, in that case, stuff happens according to the parsing algorithm before it reaches the schema layer

[05:07:03.0000] 
and common.elem.embedded = notAllowed 

[05:07:04.0000] 
hsivonen: right, that's what I meant for that particular case

[05:07:05.0000] 
the  gets implied

[05:08:00.0000] 
before the 

[05:08:01.0000] 
that has nothing to do with the schema

[05:08:02.0000] 
right, I understand that

[05:09:00.0000] 
it seems like with a conformant HTML5 parser, there are many such cases

[05:09:01.0000] 
with one consequence being that the error messages aren't going to be very helpful

[05:09:02.0000] 
but for XHTML5, the restriction is that common.elem.prose |= ul.elem does not end up augmenting common.inner.phrase

[05:10:00.0000] 
OK

[05:10:01.0000] 
MikeSmith: there's a pending feature request to get warnings on implied tags

[05:10:02.0000] 
ah

[05:10:03.0000] 
that would be great to have

[05:11:00.0000] 
ideally I think a user should see a message saying, e.g., "the  element cannot contain a 
 as a child"

[05:11:01.0000] 
or whatever

[05:11:02.0000] 
that makes it very explicit

[05:12:00.0000] 
MikeSmith: the thing is, that omitting 
 is a legitimate way to end the 

[05:12:01.0000] 
hmm, yeah, I realize that now

[05:13:00.0000] 
god, all this stuff must make building a conformance checker a major PITA

[05:13:01.0000] 
:)

[05:15:00.0000] 
right now, the PITA is that Jigsaw doesn't print informative diagnostics when stuff fails :-)

[05:16:00.0000] 
I saw you had mentioned Jigsaw but I'm clueless so far about what you need it for

[05:16:01.0000] 
what problem does it potentially solve for you?

[05:16:02.0000] 
MikeSmith: getting the W3C run an instance of Validator.nu under their preferred container

[05:17:00.0000] 
ah

[05:17:01.0000] 
that would definitely be really nice to have

[05:20:00.0000] 
hsivonen: getting back to the HTML5 schema, am I confused, or is it the case that if you expand the content-model references out, common.inner.phrase just amounts to text & notAllowed

[05:20:01.0000] 
hmm. interesting. when the servlet-relative path is "/", Jigsaw gives it as null

[05:21:00.0000] 
MikeSmith: each phrase-level element definition augments that stub definition

[05:21:01.0000] 
OK

[05:23:00.0000] 
hsivonen: I see now... I just need to quit being lazy and to actually read the schema

[05:34:00.0000] 
do all browsers default to submitting the form to base uri if the action attribute on the form is missing?

[05:34:01.0000] 
hsivonen: do you know of any tools that are able to generate a flattened version of an rng/rnc schema with the combine=choice definitions for a pattern actually combined into a single definition?

[05:35:00.0000] 
MikeSmith: I'm not aware of such a tool, but here's a guess

[05:35:01.0000] 
you might get that result if

[05:35:02.0000] 
you run Trang to convert the schema to RELAX NG XML syntax

[05:36:00.0000] 
and then run Kohsuke Kawaguchi's schema converter to convert the schema from RELAX NG to RELAX NG

[05:37:00.0000] 
but that's just a guess

[05:37:01.0000] 
then you could run Trang againg to compact syntax to make the result human-readable :-)

[05:37:02.0000] 
MikeSmith: Trang preserves the structure of the schema

[05:38:00.0000] 
yeah, tried trang .. doesn't do it, unfortunately -- or fortunately, depending on how you look at it. trang faithfully preserves the RNC structure in RNG output in such a way that is seems like it's actually round-trippable

[05:38:01.0000] 
MikeSmith:  Kohsuke Kawaguchi's converter builds an abstact model and reserializes it without preserving structure

[05:38:02.0000] 
but IIRC, him tool doesn't read compact syntax

[05:39:00.0000] 
I tried Dave Tolpin's incelim and it doesn't combine them either

[05:39:01.0000] 
hence, the need to use Trang, too

[05:39:02.0000] 
hsivonen: OK

[05:39:03.0000] 
will try Kohsuke's tool

[05:40:00.0000] 
/me apologizes again for not actually reading carefullywhat hsivonen wrote above

[05:40:01.0000] 
I'll shut up now :)

[05:40:02.0000] 
for a while at least

[05:46:00.0000] 
w00t. I got Validator.nu to run inside Jigsaw. (without file upload support, without gzip support and without non-ASCII input support)

[05:49:00.0000] 
hsivonen: congats

[05:49:01.0000] 
MikeSmith: thanks. now I need to document what I did. :-)

[09:18:00.0000] 
hsivonen: any clues on getting Kohsuke's rngconv working with the HTML datatype library?

[09:19:00.0000] 
hsivonen: I'm trying to run a conversion, but I'm getting "http://whattf.org/datatype-draft" is not a recognized data type vocabulary"

[09:29:00.0000] 
hsivonen, othermaciej just saw your dialog re: namespaces earlier (last night) http://krijnhoetmer.nl/irc-logs/whatwg/20080801#l-154

[09:30:00.0000] 
feel free to add a new section (or sections), like "implementation experience" and/or "fundamental software engineering error" to http://microformats.org/wiki/namespaces-considered-harmful

[09:34:00.0000] 
MikeSmith: no clue. do you have the library in classpath?

[09:35:00.0000] 
tantek: I recently started a wiki page, too: http://wiki.whatwg.org/wiki/Namespace_confusion

[09:35:01.0000] 
not much there yet

[09:36:00.0000] 
still, a good collection

[09:36:01.0000] 
feel free to link to your page also from http://microformats.org/wiki/namespaces-considered-harmful

[09:37:00.0000] 
tantek: ok. I will. (gotta run now, though)

[09:44:00.0000] 
hsivonen: yeah, I got the dist/html5-datatypes.jar subdir of my http://svn.versiondude.net/whattf/syntax/trunk/relaxng/datatype/java working directory

[09:44:01.0000] 
it's just that one jar file, right?

[12:08:00.0000] 
/me wonders if getting involved with this thread was a good idea after all

[12:35:00.0000] 
it wasn't

[12:36:00.0000] 
neither is the way trackback/pingback have been brought up at all

[12:42:00.0000] 
maybe I should set a cron job to email http://xkcd.com/386/ to me every morning...

[12:45:00.0000] 
the debate on extensibility is fundamentally a religious one, I don't see how either side will ever buckle

[12:46:00.0000] 
I have this suspicion that HTML5 will never become a W3C recommendation as a result of this and other permadiscussions

[14:29:00.0000] 
libxml2's APIs suck a little bit

[15:13:00.0000] 
"And authors want to add metadata. Instead of forcing it into containers that haven't been designed for it (@title, @data-*), let them do it properly." -- http://lists.w3.org/Archives/Public/public-html/2008Aug/0023.html

[15:13:01.0000] 
I don't get what other way would be considered the proper way to embed metadata, beyond the mechanisms designed for adding metadata?!

[15:15:00.0000] 
if, as Julian claims, title and data-* weren't designed for adding some type of metadata, then I must be missing something.


2008-08-02
[18:13:00.0000] 
Hixie, why does it matter if an image is stretched or not, for the purpose of conformance?

[18:13:01.0000] 
why does it matter if a tag is closed or not, for the purpose of conformance?

[18:14:00.0000] 
in almost all cases, stretching an image is a mistake. in the remaining cases where it is intentional, it's dubious practice. this argues for the author being notified of the problem.

[18:15:00.0000] 
same as with anything else that is a conformance error

[20:39:00.0000] 
Could anyone point to an SVG to Canvas converter?

[01:48:00.0000] 
MikeSmith: yeah, it's just that one jar file

[01:48:01.0000] 
hsivonen: yeah, I got it figured out finally

[01:49:00.0000] 
I was using "jar -r" to run it

[01:49:01.0000] 
and that caused it to ignore the datatype library despite the fact I had it in my classpath

[01:50:00.0000] 
MikeSmith: the -jar switch sucks

[01:50:01.0000] 
yeah

[01:50:02.0000] 
I should have known better than to try it 

[01:52:00.0000] 
anyway, I did manage to be able to run it, but not without fatal errors

[01:52:01.0000] 
running against the HTML5 schema, I get java.lang.StackOverflowError

[01:53:00.0000] 
at com.sun.msv.grammar.ExpressionCloner.onChoice(ExpressionCloner.java:37)

[01:54:00.0000] 
anyway, I'm giving up on it

[01:55:00.0000] 
and instead just writing a stylesheet to pre-process the schema and combine all the choice=combine stuff

[02:06:00.0000] 
MikeSmith: msv is no good with the default stack size of hotspot

[02:07:00.0000] 
-XX:ThreadStackSize=2048 

[02:10:00.0000] 
hsivonen: OK, trying now

[02:12:00.0000] 
getting same error even with that switch

[02:13:00.0000] 
but anyway, the idea is kind of a no-go regardless, due to that fact that it's going to restructure the whole schema

[02:14:00.0000] 
the only thing I really want is to just consolidate the @combine=choice stuff, which I think can manage to do with XSLT

[03:18:00.0000] 
oh, I wish what I wrote in IRC didn't get dragged into the thread on the mailing list :-(. I chose not to respond on the list for a reason.

[03:22:00.0000] 
i hope mike is happy: http://www.whatwg.org/specs/web-apps/current-work/#fetching :-)

[03:25:00.0000] 
/me smiles

[03:25:01.0000] 
Hixie: cool

[03:27:00.0000] 
"At a time convenient to the user and the user agent" is an interesting phrase

[03:30:00.0000] 
heh. It makes it sound like my browser should book an appointment with me to do the download :-)

[05:21:00.0000] 
nn

[15:26:00.0000] 
i think i might make alt="" required and say that when you don't know what the image is, you have to say what kind of image it is (e.g. "uploaded image", "photo", "thumbnil", or whatever) and put that in braces in the alt="" attribute, as in alt="{photo}"

[15:26:01.0000] 
and that that is never allowd inside a link

[15:27:00.0000] 
(in a link it should just give the text that is appropriate for the link, e.g. alt="View image" if it's a link to the image)

[15:29:00.0000] 
this seems to handle all the cases that allowing alt="" to be omitted does, while verly mildly improving the accessibility of those images, and being slightly more compatible with legacy UAs

[15:29:01.0000] 
(it affects some legacy pages, though not many, and certainly no more than the missing-alt-altogether case would)

[15:40:00.0000] 
seems like a wise decision

[15:47:00.0000] 
Hixie, why use curly braces instead of square brackets?

[15:48:00.0000] 
Lachy: it's more exotic

[15:48:01.0000] 
and probably less likely to conflict

[15:49:00.0000] 
ok, makes sense

[16:02:00.0000] 
Lachy: [] are used a lot already

[16:02:01.0000] 
Lachy: {} not so much

[16:02:02.0000] 
Lachy: iirc <> is used even less, but that causes problems in xml or something

[16:02:03.0000] 
/me logs on to his vpn to get the numbers

[16:03:00.0000] 
< >, if anyone remembers

[16:05:00.0000] 
ok here are the stats as percentages of total pages scanned

[16:05:01.0000] 
pages that had an img that wasn't in a link and had a value that followed the pattern [...]: 0.45%

[16:05:02.0000] 
pages that had an img that wasn't in a link and had a value that followed the pattern (...): 0.13%

[16:06:00.0000] 
pages that had an img that wasn't in a link and had a value that followed the pattern {...}: 0.035%

[16:06:01.0000] 
pages that had an img that wasn't in a link and had a value that followed the pattern <...>: 0.033%

[16:07:00.0000] 
most common alt={...} value was alt={alpha}, most of which it seems came from pages created by one particular conversion tool

[16:08:00.0000] 
I'm surprised (...) is that low

[16:09:00.0000] 
number of pages with : 94%

[16:09:01.0000] 
number of pages with : 82%

[16:09:02.0000] 
number of pages with  with non-empty alt: 77%

[16:10:00.0000] 
number of pages with  that had at least one without an alt="": 67%

[16:11:00.0000] 
number of pages with  and that had all their  with alt="": 27%

[16:11:01.0000] 
number of pages that had at least one  element but no  elements with alt="": 11%

[16:12:00.0000] 
which comes out to somewhere between 29%-71% of images that don't have alt

[16:12:01.0000] 
ok, well, it nicely solves the problem of distinguishing between legitmate and guessed alt text, so it seems like a reasonable solution

[16:13:00.0000] 
most of the alt=""s with the pattern <...> seemed to be mistakes, e.g. 0.0015% of pages had alt="span style='background-color: #CCFFFF'>Visa"

[16:13:01.0000] 
so what would authoring tools like Dreamweaver be requried to insert by default? Would alt="{image}" be ok?

[16:14:00.0000] 
if the author knows what the image is, then alt="{...}" is never "ok"

[16:14:01.0000] 
I think apache's autoindex uses "[ TXT ]" as alt..

[16:14:02.0000] 
but yeah, i guess that would be a reasonable default for the case where today they just omit alt="" or have alt="" empty without probable cause

[16:15:00.0000] 
yeah, I know that. But by default when a user just drags and drops an image into the WYSIWYG editor, and doesn't enter anything for alt text into the prompt

[16:15:01.0000] 
actually, just "[TXT]"

[16:15:02.0000] 
jcranmer: second most common [...] alt value was [DIR] 0.085% (first was [new] 0.095%, third was [NEW] 0.066%)

[16:16:00.0000] 
[DIR] is used in apache's autoindex

[16:16:01.0000] 
jcranmer: [TXT] was 0.024%

[16:16:02.0000] 
less common than [cpp] and [flash]

[16:16:03.0000] 
and [*]

[16:17:00.0000] 
most common (...) values were (+), (-) and (?)

[16:17:01.0000] 
what would [cpp] be used for?

[16:17:02.0000] 
\[[A-Z]+\] is quite possibly autoindexing

[16:17:03.0000] 
I don't know what IIS uses, if anything

[16:17:04.0000] 
(0.038%, 0.037%, and 0.0095% respectively)

[16:18:00.0000] 
yeah apache's autoindexing was well represented in these results

[16:18:01.0000] 
[spoiler] was common too

[16:18:02.0000] 
what does [] look like if you exclude apache, which is probably a valid use case?

[16:19:00.0000] 
although it would have to be ~30% of all stuff to change the rankings

[16:19:01.0000] 
valid how?

[16:19:02.0000] 
not sure what you mean

[16:20:00.0000] 
Hixie: Hmm what turned out to be wrong with using an attribute to signal the poverty of alt text, rather than resorting to odd syntax inside the alt attribute?

[16:20:01.0000] 
especially given the use-case is automatic insertion rather than hand-authoring

[16:21:00.0000] 
jcranmer: the [...] values were (roughly in order): [new], [DIR], [NEW], [   ], [*], [b], [img], [i], [url], [u], [email], [quote], [flash], [fixed], [spoiler], [cpp], [strike], [TXT], [IMG], [ICO], [M], ...

[16:22:00.0000] 
five of which are definitely apache

[16:22:01.0000] 
webben: it seems more likely to be misused

[16:22:02.0000] 
webben: e.g. through ignorant copy-paste

[16:22:03.0000] 
webben: also, coming up with a name was difficult

[16:22:04.0000] 
another seven of which seem to be BB-code

[16:23:00.0000] 
I'd have thought exactly the opposite was the case.

[16:23:01.0000] 
[img alt="[b]SEE THIS[/b]"] ?

[16:23:02.0000] 
that a weird syntax inside alt is utterly opaque

[16:23:03.0000] 
although the non-existence of [/b] does seem to invalidate that theory...

[16:24:00.0000] 
Hixie: Is there a list of proposed names anywhere?

[16:24:01.0000] 
webben: not a convenient list, no

[16:25:00.0000] 
webben: noalt, important

[16:25:01.0000] 
I can't remember the others

[16:25:02.0000] 
yeah, well, I agree those aren't good names ;)

[16:25:03.0000] 
webben: what kind of proposal did you have in mind?  ?

[16:25:04.0000] 
webben: (with a better name obviously!)

[16:26:00.0000] 
well, you wouldn't need the ="true" (presumably?) but yeah.

[16:27:00.0000] 
that was basically my importantimage="" proposal, but nobody could come up with a good attribute name.

[16:28:00.0000] 
missing-text-equivalent

[16:28:01.0000] 
please-sir-can-i-have-some-more-validation

[16:29:00.0000] 
webben:  vs  seems like a tossup as to which is better

[16:29:01.0000] 
that name's still probably not ideal, but I think the former is easily better.

[16:30:00.0000] 
/me would suggest testing it with some newbie authors and asking them what they think these syntaxes mean

[16:30:01.0000] 
actually that's not a fair test

[16:30:02.0000] 
since that clues them in that {} is a syntax, which is counter-intuitive

[16:31:00.0000] 
heh

[16:32:00.0000] 
i wish i could remember why i had decided to look at alt={...} rather than importantimage=""

[16:32:01.0000] 
there was some more serious problem than the name, iirc, but i don't recall what

[16:33:00.0000] 
(You could use self-documenting decisions, that way you won't have to document anything)

[16:33:01.0000] 
not sure what that would mean here :-)

[16:34:00.0000] 
the decision wasn't written down anywhere, i just did it

[16:35:00.0000] 
if we documented every decision, that would take the fun out of rehashing old arguments!

[16:39:00.0000] 
Hixie: I just felt like taking a jab at self-documenting code.

[16:39:01.0000] 
heh

[16:42:00.0000] 
webben: missing-text-eqivalent doesn't really convey the right message, i'm not sure it actually helps vs {...}

[16:42:01.0000] 
Hixie: is " alt-is-not-actually-a-description-but-a-category" the right message?

[16:42:02.0000] 
Hixie, are you speccing the {...} feature now?

[16:43:00.0000] 
the message is alt-is-not-actually-a-description-but-a-category-because-we-do-not-know-what-the-image-actually-is

[16:43:01.0000] 
Lachy: i'm looking at it. it's one of the folders with the most messages

[16:44:00.0000] 
Hixie: alt-is-category-only ?

[16:44:01.0000] 
webben: i guess the reason i prefer {...} is that if we're going to come up with some mostly opaque syntax, i'd rather pick the most compact

[16:45:00.0000] 
webben: that's pretty long, and still doesn't really help, i mean, what's a category? does that mean i can just do this on all my images? etc.

[16:46:00.0000] 
I think the what's a category question is answered by the alt content itself.

[16:47:00.0000] 
I like the {} syntax better because it looks ugly enough to discourage authors from using it on all their images, yet simple enough to be used where appropriate

[16:47:01.0000] 
would alt="{}" be considered non-conforming?

[16:48:00.0000] 
While {} might discourage authors using {}, it doesn't make it clear the alt is suboptimal. Consequently newbies could easily take away the message that alt="Photo" is a good alt.

[16:49:00.0000] 
missing-text-equivalent at least hints that something's wrong.

[16:49:01.0000] 
and which of these regexes best matches the proposed syntax: /^\{.+\}$/ or /^\{[^\}]+\}$/

[16:51:00.0000] 
webben: the stats indicate that authors already think omitting alt altogther is fine

[16:51:01.0000] 
webben: so i don't think this will make it any worse

[16:51:02.0000] 
Lachy: i was just thinking /^\{.*\}$/

[16:51:03.0000] 
Hixie: I don't see the logical connection between those stats and the effect of any given example.

[16:52:00.0000] 
webben: i'm saying that nothing could make the current authoring practices worse

[16:52:01.0000] 
Hixie, so then alt="{}" would be conforming, even though it's almost completely useless to UAs since it says nothing about what type of image it is?

[16:52:02.0000] 
I don't think not making it any worse is the bar we should be setting. ;)

[16:53:00.0000] 
Lachy: alt="{image}" doesn't say anything about what it is either

[16:53:01.0000] 
I guess those two could be considered equivalent then

[16:53:02.0000] 
fwiw it could easily be worse.

[16:53:03.0000] 
webben: i don't think having a few people stick misteriously named attributes on images is going to improve things either

[16:54:00.0000] 
/me doesn't really see why not.

[16:55:00.0000] 
if the problem is mysteriousness, then go for a big huge long name.

[16:55:01.0000] 
then nobody will use it

[16:55:02.0000] 
why would nobody use it?

[16:55:03.0000] 
nobody would use it /accidentally/ ... which is precisely what you're trying to avoid

[16:56:00.0000] 
it's a psychology thing -- people just don't seem to use long keywords, they shy away from them

[16:56:01.0000] 
i don't know why

[16:56:02.0000] 
We're not talking about hand-authoring here. We're talking about sites like Flickr processing thousands of images; and software like DreamWeaver written by professionals.

[16:56:03.0000] 
i guess i'm not sold on the idea that the advantage of an attribute over just a compact syntax outweigh the disadvantages... the pros and cons on both sides seem pretty minimal

[16:57:00.0000] 
that's the target audience of this attribute as I understand it.

[16:57:01.0000] 
flickr output is hand-authored templates.

[16:57:02.0000] 
it's still hand-authored.

[16:57:03.0000] 
the templates are hand-authored; the output isn't.

[16:58:00.0000] 
likewise someone writes the code for DreamWeaver

[16:58:01.0000] 
s/hand-authoring/small-time authoring/ if you like

[16:58:02.0000] 
my point is it's not like we can just ignore authoring because there's a computer involved -- it's still hand written at some point

[16:59:00.0000] 
webben, even if the attribute were theoretically a better approach (which I'm not convinced it is), then there's still the big problem of finding an appropriate name that accurately represents its meaning for all the vaious use cases


2008-08-03
[17:00:00.0000] 
yeah i haven't yet seen a name that i'd be proud to have in a spec with my name at the top

[17:00:01.0000] 
Lachy: If the meaning is ambiguous, then that's true of {} too. If the meaning can be expressed, then it can be expressed in an attribute name.

[17:00:02.0000] 
if {} is ambiguous, that may hint at a problem with the idea

[17:00:03.0000] 
are there two concepts here that need seperating?

[17:01:00.0000] 
{}'s advatnage is great compactness, its disadvantage is it's meaning is not intuitive. for an attribute to outweigh its corresponding verbosity disadvantage, it has to have a name that is intuitive

[17:02:00.0000] 
s/it's/its/

[17:02:01.0000] 
I think you're underestimating that {} doesn't even look like code, and therefore is likely to be misused.

[17:02:02.0000] 
one can't really dispute an attribute looks like code, even if it's not obvious what it does.

[17:02:03.0000] 
why would it be misused more than now?

[17:03:00.0000] 
it doesn't exist now.

[17:03:01.0000] 
alt=[...] is used a lot

[17:03:02.0000] 
alt={...} is not

[17:03:03.0000] 
yeah, but not as code.

[17:03:04.0000] 
you're saying that alt={...} would become more popular for other uses just because we introduce it as meaning something special for one use?

[17:03:05.0000] 
webben, you're not making sense

[17:04:00.0000] 
Hixie: Given folks don't normally see alt, that's not actually inconceivable.

[17:04:01.0000] 
re what you asked earlier... the concept here is "i don't know what this image is or will be, so i cannot provide a useful equivalent... here's a hint as to what kind of image it is, at least, so that you know it's not meant to be purely decorative"

[17:05:00.0000] 
it's possible that once this syntax starts showing up in poorly written tutorials, authors could misunderstand its purpose and begin using it more commonly, yet wrongly

[17:05:01.0000] 
webben: it seems pretty unlikely to me. We know that attributes that people use get copied and pasted around even without reason, too, so i don't see why it would happen any less to an attribute than to a special syntax in an attribute.

[17:06:00.0000] 
webben: maybe it's in fact more likely that authors who don't know about this would not know that the {...} syntax means anything, and would thus in fact not use it, as they think it's ugly :-)

[17:06:01.0000] 
webben: whereas they would see an attribute and know that it DID mean something, even if they didn't know what, and so WOULD use it (as we have seen happen with other things)

[17:06:02.0000] 
e.g. bits of svg appearing in random places in html documents

[17:07:00.0000] 
I think "bits of svg" are probably a lot more opaque than any of the names suggested for this thing.

[17:07:01.0000] 
svg was just one example, it happens with everything

[17:08:00.0000] 
hm, when we started this discussion i was pretty much neutral on the issue of importantimage="" vs alt={...} but now i'm definitely leaning more towards the latter

[17:08:01.0000] 
Well, yeah, but you need to work out what makes things happen more, not just look at whether they happen at all.

[17:09:00.0000] 
i'm gonna go shopping and will think more on this, but feel free to keep discussing it, i'll read any ideas that come up when i get back

[17:09:01.0000] 
k ; have a good shop :)

[17:09:02.0000] 
/me is probably heading to bed very shortly.

[17:10:00.0000] 
I think the {} syntax would be accepted by the community better than importantimage="", because there were a lot of suggestions for using a similar approach with square brackets, and very little support for using importantimage (it was mostly ignored when it was suggested, depite repeatedly pointing to it)

[17:11:00.0000] 
Hixie, btw, did your Stargate Continuum DVD arrive yet?

[17:11:01.0000] 
if so, what did you think of it?

[17:14:00.0000] 
Lachy: I don't think importantimage was a good name; "" implies unimportant image; and it doesn't convey that something is missing.

[17:15:00.0000] 
webben, please elaborate?

[17:15:01.0000] 
sorry alt="" => decorative (unimportant) image.

[17:16:00.0000] 
alt="something" => important image

[17:16:01.0000] 
importantimage => no additional information

[17:16:02.0000] 
the same is true for alt={}

[17:17:00.0000] 
yes, but I'm suggesting why importantimage wasn't a good name

[17:18:00.0000] 
missing-text-equivalent does at least add some information, though I'm still not precisely happy with it.

[17:18:01.0000] 
oh, ok. I misread your message. I thought you said it was a good name

[17:18:02.0000] 
"alt-is-hint-only" perhaps

[17:19:00.0000] 
it's too long

[17:19:01.0000] 
/me is thoroughly unconvinced that long is bad. It's actually a plus AFAICT.

[17:19:02.0000] 
and there's a possibillity that some authors could inadvertently write alt-is-only-hint instead of alt-is-hint-only

[17:20:00.0000] 
there's also a possibility authors could write {)

[17:20:01.0000] 
so using sentences for attribute names isn't really a good idea, especially when it's possible to transpose words without losing its meaning

[17:20:02.0000] 
or " {

[17:21:00.0000] 
it's easier to spot a syntax error like that, than it is to spot incorrectly transposed words in an attribute name

[17:21:01.0000] 
the former cannot be caught by the validator; the second can.

[17:22:00.0000] 
sorry {) is not definitely invalid, but alt-is-only-hint definitely is.

[17:22:01.0000] 
so at best a validator could issue a warning about the former.

[17:22:02.0000] 
that's what the image analysis tool in the validator is for.

[17:26:00.0000] 
I think having a non-verifiable syntax would not make for a more user-friendly image analysis tool.

[17:26:01.0000] 
since then you're asking people to check syntax as well as equivalents.

[17:27:00.0000] 
whereas you could have them fix the syntax then check equivalents

[17:31:00.0000] 
I'm not really convinced that it's likely for authors to mistype {} as {), because the keys are in different positions on the keyboard, and the braces are right next to each other

[17:32:00.0000] 
{]

[17:32:01.0000] 
and in this whole discussion, I've not seen anyone accidentally mistype {}

[17:32:02.0000] 
on my keyboard at least } ] are the same key

[17:33:00.0000] 
yeah, that's true, but still unlikely

[17:33:01.0000] 
not sure why that would be unlikely

[17:33:02.0000] 
have you ever seen it happen anywhere? If so, how freqently?

[17:34:00.0000] 
i don't have much memory of typos and their frequency full stop

[17:34:01.0000] 
let alone mental data on } ] in particular ;)

[17:35:00.0000] 
so, in other words, you're just speculating and trying to put that forth as strong evidence anyway?

[17:35:01.0000] 
/me thinks the whole discussion is very speculative.

[17:36:00.0000] 
/me doesn't figure "there's also a possibility authors could write {)" is an especially strong statement either

[17:38:00.0000] 
I do know I don't want to have to get time-poor editorial or QA staff to understand the ins and outs of this syntax when I could get it checked with a validator /before/ giving them more useful work of inspecting text that is supposed to be a text equivalent.

[17:39:00.0000] 
or rely on their eyes when this can be handled by code.

[17:40:00.0000] 
so maybe there are ways of at least flagging mismatched braces as a warning in the validator then

[17:40:01.0000] 
that's still a waste of their time.

[17:40:02.0000] 
what?

[17:40:03.0000] 
they'd need to manually inspect the braces

[17:41:00.0000] 
so? Braces are used very infrequently within alt text anyway, so it's hardly a lot of time wasted

[17:42:00.0000] 
that's worse, then they'd need to try and remember what that pesky developer said about braces

[17:42:01.0000] 
and () is just like {} right?

[17:42:02.0000] 
I don't understand your point

[17:42:03.0000] 
well, not worse, but not good either.

[17:43:00.0000] 
Lachy: Basically, it shifts a job that could be done more reliably by machines onto people.

[17:43:01.0000] 
that's impractical.

[17:44:00.0000] 
checking the accuracy of alt text isn't a job that can be reliably done by machines anyway, so having authors manually check those using braces along with all the others isn't that big a deal

[17:44:01.0000] 
this isn't about "checking the accuracy of alt text"

[17:44:02.0000] 
of course it is

[17:44:03.0000] 
what else is it about?

[17:45:00.0000] 
Let's say you have a source of photos coming into a system.

[17:45:01.0000] 
so you have some code which inspects each one to see if it has some text to use as an equivalent

[17:45:02.0000] 
if it does, it inserts the text; if not, it inserts alt="{Photo}"

[17:47:00.0000] 
yeah, and...?

[17:47:01.0000] 
why should humans be checking if the system can spell {Photo} correctly, rather than just getting a total of those with {Photo} and proceeding to check the ones that do have alt text?

[17:49:00.0000] 
I guess the image checker could group them

[17:49:01.0000] 
but that would probably mean you'd end up with false positive

[17:49:02.0000] 
*positives

[17:50:00.0000] 
you seem to be making assumptions about the UI of the image inspector. Given the {} syntax, why couldn't the image inspector group those with {Photo} together and provide a count, or whatever else?

[17:50:01.0000] 
I don't see how you'd end up with any more false positives using alt="{...}" as you would with some attribute

[17:52:00.0000] 
Lachy: Yeah. Maybe. Would get a lot more complicated if alt's were being autogenerated to contain more information

[17:52:01.0000] 
e.g. {Photo tagged with 'cat'}

[17:54:00.0000] 
in fact, the better your auto-generation of alts, the worse it would get.

[17:55:00.0000] 
well, actually, I guess the checker could just group {} and non-{} together for separate checking

[18:01:00.0000] 
webben, yeah, exactly how it would group alt="..." separately from some-special-attribute=""

[18:02:00.0000] 
and if, in the process of inspecting the alt="..." group, the author sees a { in there, it would look like something that needed fixing

[18:08:00.0000] 
maybe

[18:09:00.0000] 
well, it would if the validator also warned about it /and/ if {} weren't common for that set of alt's

[18:09:01.0000] 
meh, this also means you'd need code to inspect provided equivalents for { } and decide what to do with it

[18:10:00.0000] 
e.g. does this mean the provided equivalent is actually not an equivalent but someone who actually knows the {} syntax and is assuming your just going to dump into an alt attribute.

[18:10:01.0000] 
or is this actually the equivalent

[18:15:00.0000] 
also, it's cutting into the strings people use for interpolation (e.g. YAHOO.lang.substitute uses {} ) http://developer.yahoo.com/yui/docs/YAHOO.lang.html

[18:16:00.0000] 
any good pages with