00:13
<Hixie>
Philip`: so i dunno what happened but i now have 3346 e-mails in the database
00:13
<Hixie>
i'm thinking maybe the script died when it last ran
00:14
<Hixie>
do you want a dump again?
00:14
<Hixie>
(if so, what's your e-mail again?)
00:16
<Hixie>
in other news, who's in charge of the wiki again?
00:16
<Hixie>
i think we might want to do something about the spam, as suggested on http://wiki.whatwg.org/wiki/User_talk:Hixie, though i don't know what
00:17
<Philip`>
Hixie: When I download all the folder lists from you, that only gives 1515 messages
00:17
<Hixie>
it should have changed recently
00:17
<Hixie>
i reran the algorithm
00:17
<Hixie>
the script, rather
00:17
<Philip`>
Ah, okay
00:17
<Philip`>
The dump would be helpful so I don't have to download the rest, then
00:18
<Philip`>
(philip⊙zdcu)
00:18
<Hixie>
sent
00:20
<Philip`>
Received
00:20
<Philip`>
(Thanks!)
00:20
Philip`
tries to remember where he put his scripts
00:24
<Philip`>
Seems to be working
00:26
<Philip`>
The list of all emails is 7MB now
00:26
<Philip`>
and I see 3298 messages in it
00:26
<Philip`>
which is close enough to 3346
00:26
<Philip`>
(I guess some messages are not in any folder?)
00:31
<Hixie>
you should have exactly 3346 in that csv dump
00:31
<Hixie>
the folders probably changed since yesterday
00:31
<Hixie>
if that matters
00:33
<Philip`>
I get 3346 from the CSV, but downloading the 'folders' list then each 'folder ...' list gives (as of ten minutes ago) 3298
00:33
<Hixie>
odd
00:34
<Hixie>
i'd have expected the opposite
00:34
<Hixie>
that is, some of the e-mails are in more than one folder
00:40
<Philip`>
The CSV contains some message IDs like "<3019.217.124.88.212.1149790503.squirrel⊙wc"
00:40
<Hixie>
i bet the IDs are truncated to 64 bytes
00:41
<Hixie>
but that's 56
00:41
<Hixie>
not 64
00:41
<Hixie>
so nevermind
00:41
<Hixie>
i'm confused
00:41
<Philip`>
Of the messages that are in the CSV but not in the folders list, almost all are truncated like that
00:42
<Hixie>
weird
00:42
<Philip`>
except for <<<<f0a054a12911d5a153482846f135f669⊙1l>>>>
00:42
<Philip`>
Oh, I think I see that one in the online bit too
00:43
<Philip`>
*online folders list
00:43
<Philip`>
so ignore that
00:43
<Hixie>
it's in the database
00:43
<Hixie>
though why, i don't know
00:44
<Philip`>
I see 54 truncated names in the CSV, which is 3346 - 3298
00:44
<Philip`>
Uh, not it isn't
00:44
<Philip`>
*no
00:45
<Hixie>
do you have a preferred username?
00:45
<Philip`>
I see 54 truncated names, plus 6 messages in two folders, and 3298 = 3346 - 54 + 6
00:45
<Philip`>
Depends on the context :-)
00:46
<Philip`>
(Usually something like "philip" works alright)
00:46
<Philip`>
(except when someone else has stolen that name, and I have to add random punctuation characters)
00:47
<Philip`>
((Actually, 5 messages in two folders and 1 message in three folders))
00:53
<Hixie>
ok so how did these get truncated
00:53
<Hixie>
wtf
00:54
<Hixie>
btw the way i update the tables each night is to lock them, drop them, then recreate them entirely
00:54
<Hixie>
which is why the number can change dramatically (if the script dies while it's updating)
00:54
<Philip`>
When you measured 56 bytes, did you happen to do it with something like perl -e'print length("<3019...⊙w.")' ?
00:55
<Hixie>
yes, did i screw that up? :-)
00:56
<Philip`>
Try the -w flag :-)
00:56
<Hixie>
d'oh
00:56
<Hixie>
i suck
00:56
Philip`
made precisely the same mistake
00:56
<Hixie>
ok so i need to make it longer
00:56
<Hixie>
that means shrinking the other index column
00:58
<Philip`>
Why does it affect indexes?
00:58
<Hixie>
cos i use this as port of one of my primary indexes
00:58
<Hixie>
in the 'folders' table iirc
01:01
Philip`
doesn't see why expanding one would require shrinking another
01:05
<Hixie>
because i'm at the limit of the index key size
01:06
<Philip`>
The index can just be on a short prefix of the column and it should work exactly the same, I thought
01:06
<Philip`>
But maybe that doesn't work so well when it's meant to be a unique index
01:13
<Hixie>
ok regenning the db
01:14
<Hixie>
the reason they were cropped in the dump and not on the site is that the site was using the folders table instead of the emails table and the two tables had different lengths for the ids!
01:14
<Hixie>
<- moron
01:17
<Philip`>
Okay - once it's done, I can just download the 54 extra emails through the API
01:18
<Hixie>
done
01:18
<Hixie>
(you can also dump it straight from the database)
01:23
<Philip`>
Hmm, it looks like I got the same folders list as before
01:24
<Philip`>
(unless I messed up somewhere)
01:24
<Philip`>
with only the 3298 messages
01:29
<Hixie>
there's something really weird going on
01:32
<Hixie>
there are 3,353 entries in the 'emails' table
01:32
<Hixie>
there are 3,359 in the 'folders' table (which includes dupes)
01:33
<Philip`>
There are 3,353 distinct 'message' values in the 'folders' table, and they all correspond to an 'id' in 'emails'
01:33
<Hixie>
right
01:33
<Philip`>
so that all seems to be correct
01:33
<Hixie>
(i just checked)
01:33
<Hixie>
so why do you only get 3298 if you use the api
01:33
<Philip`>
Is there any caching somewhere?
01:33
<Hixie>
oh, maybe, yeah
01:33
<Philip`>
(I deleted my own cache before downloading the folders list again)
01:34
<Hixie>
i just killed my server, try again
01:35
Philip`
tries
01:38
<Philip`>
3359
01:38
<Philip`>
Perfect :-)
01:38
<Hixie>
phew
01:38
<Hixie>
good call on the caching
01:38
<Hixie>
i've added a scary line to my regen script that kills the server
01:40
<Philip`>
Is there not a less drastic way to clear the cache?
01:41
Philip`
's usual approach to caching is "nobody uses my stuff anyway, so it doesn't matter if it's slow and resource-intensive"
01:41
<Hixie>
Philip`: that was my approach, and then you started fetching all this data... ;-)
01:41
<Hixie>
Philip`: there's probably a better way, but *shrug*
01:42
<Hixie>
the server knows how to handle it and the CGI scripts that talk to it know how to handle the server script going away
01:42
<Hixie>
(the server i'm talking about isn't the database, it's a cgi script that talks a little tcp protocol to my other cgi scrits)
01:42
<Hixie>
scripts
01:43
<Philip`>
Hmm, good point - I'll make sure I don't offer any APIs and tempt people to use them :-)
01:44
<Philip`>
(Ah, okay, that's less drastic than killing MySQL and Apache and whatever)
01:44
<Hixie>
yeah
01:44
<Hixie>
i really wish we had TCPConnection
01:44
<Hixie>
it would be so much better than having this mess
01:44
<Hixie>
and then i could do away with everything except the server script
01:50
<Hixie>
hey anyone recall the uris of the studies similar to the webstats one i did?
01:50
<Hixie>
there was one that had a flower on the front page iirc
01:51
<Philip`>
http://triin.net/2006/06/12/Coding_practices_of_web_pages ?
01:51
<Hixie>
yes!
01:51
<Hixie>
thanks!
02:01
<Philip`>
http://triin.net/archive/kool/webstat/figure-33.png - you can tell the data set had a very high proportion of CNN pages
02:02
<Hixie>
you should see the number of <nyt_copyright> elements in the studies i do :-P
02:02
<Hixie>
not enough to come in the top 100 or anything, but still significant
02:02
<Philip`>
cnn.com was 5% of the URLs on dmoz.org when I last counted
02:03
<Hixie>
yeah
02:03
<Hixie>
cnn.com has a lot of pages
02:03
<Philip`>
I've no idea who keeps submitting them all to dmoz.org
02:03
<Hixie>
cnn, maybe
02:03
<Philip`>
Oh, that would make sense
02:04
<Hixie>
comparing http://triin.net/archive/kool/webstat/figure-12.png to http://code.google.com/webstats/2005-12/charts/unique-elements-per-page.svg is interesting
02:05
<Hixie>
it's clearly not a perfect normal distribution
02:05
<Hixie>
there's a very clear bump around 5-10 in both graphs
02:05
<Hixie>
actually i guess i'd be approximately a poisson, not a normal, distribution.
02:07
<Philip`>
It seems everyone has <html>, <head>, <title>, <body>, and then I guess you don't need more than a few of the next ones (<meta>, <a>, <img>, <br>, <table>, etc) to make up the rest of your page
02:07
<Lachy>
Hixie, I'm unable to login to the wiki server using SFTP. It responds with unable to authenticate.
02:07
<Hixie>
oh it was probably one of the accounts that dreamhost changed the passwd on
02:07
<Lachy>
same with the blog
02:07
<Hixie>
let me fix it
02:08
<Lachy>
whatwikiuser and lhunt were the usernames
02:08
<Philip`>
The spike at 6 on triin.net looks odd, since I can't imagine people doing much with just two extra tags
02:08
<Hixie>
Philip`: the spike is a bit later on the google one but yeah
02:08
<Hixie>
there's a prike
02:08
<Hixie>
spike
02:09
Philip`
wonders if he can get that graph from his own data
02:09
<Hixie>
what's the uri to yours again?
02:09
<Philip`>
http://canvex.lazyilluminati.com/survey/2007-07-17/analyse.cgi/index
02:14
<Philip`>
http://canvex.lazyilluminati.com/survey/2007-07-17/tagcount.html
02:15
Lachy
installs the captcha extension
02:16
<Lachy>
can someone with an ordinary user account on the wiki attempt to make an edit to see if it works? (sysops won't see it)
02:16
Philip`
still likes the "[x] Are you a spambot?" approach to blocking spam
02:16
<Lachy>
wiki.whatwg.org
02:16
<Philip`>
(Doesn't work on human spammers, but computers never answer the question right)
02:16
<Hixie>
Philip`: see! see! your data has the same thing!
02:17
<Philip`>
My data comes from exactly the same source as the triin.net stuff, so that's not surprising :-)
02:17
<Philip`>
with the same 5% cnn.com, and quite a lot of weather.com
02:17
<Hixie>
heh
02:17
<Hixie>
pity :-P
02:17
<Philip`>
(except a much smaller sample)
02:18
<Philip`>
Lachy: It let me save a change without asking any hard questions
02:19
<Lachy>
oops, It hadn't uploaded LocalSettings.php
02:19
<Lachy>
try again
02:19
<Lachy>
aargh! fatal error.
02:21
<Hixie>
i assume triin.net is Pene Saarsoo's site
02:21
<Hixie>
Rene
02:21
Philip`
looks at the spikey bits in more detail
02:22
<Lachy>
Philip`, fixed the error, can you try again?
02:26
<Philip`>
Lachy: Is it meant to show on the editing page, or only after saving?
02:26
<Lachy>
I believe it's supposed to be after attempting to save
02:27
<Lachy>
these are the instructions http://www.mediawiki.org/wiki/Extension:ConfirmEdit
02:27
<Lachy>
it doesn't tell me much
02:27
<Philip`>
It let me save again without anything
02:28
<Philip`>
Hixie: Of 114 the pages with 6 tag names, 61 have <frameset> and <frame>
02:28
<Philip`>
(and most have <meta>, <title>, <html>, <head>)
02:28
<Philip`>
which explains the spike at 6
02:28
<Philip`>
(since they're all just frameset pages)
02:29
<Hixie>
oooh
02:29
<Hixie>
interesting
02:29
<Philip`>
Of the 208 pages with 8, 137 have <noframes>, 149 have <frameset> and <frame>, and most have meta/body/title/head/html
02:30
<Philip`>
so it seems they're all just frameset pages again
02:30
<Hixie>
aah
02:30
<Hixie>
interesting stuff!
02:30
<Philip`>
Same for 9 tags (244 pages, 168 have frames, 159 have noframes, 141 have p)
02:31
<Lachy>
Philip`, it will if you attempt to add a URL or attempt to create an account
02:31
<Lachy>
it asks the user a simple math questions like: 44 - 5 = ?
02:32
<Philip`>
Lachy: Aha, that works
02:32
<Philip`>
"69 - 5 ="
02:32
Philip`
thinks a bit
02:32
<Philip`>
Oh, it wasn't 61
02:33
<Philip`>
The "Create account" page says "82 + 6 ="
02:33
<Philip`>
so that seems to be working
02:46
<Philip`>
Hixie: http://canvex.lazyilluminati.com/survey/2007-07-17/tagcount.html
02:46
<Philip`>
now with details about what tags the various groups of pages use
02:46
<Hixie>
neat
02:47
<Philip`>
Hmm, the page using 41 tags looks quite legitimate
02:48
<Hixie>
heh
02:48
<Philip`>
(http://community.webshots.com/user/JulioUU)
02:49
<Hixie>
woot! you have some nyt_copyright elements too!
02:49
<Philip`>
http://canvex.lazyilluminati.com/survey/2007-07-17/analyse.cgi/tag/nyt_copyright
02:49
<Philip`>
Only one :-(
02:50
<Hixie>
i got over a million of those in my sample last year
02:50
<Hixie>
(don't recall what the recent numbers are)
02:50
<Hixie>
er, not last year. year before.
02:50
<Hixie>
whenever my first sample was.
02:53
<Philip`>
11 pages with nothing but <meta> - looks like those are redirects
02:53
<Hixie>
yeah
02:59
<Philip`>
(It's nice having data about eight thousand pages rather than eight billion, because I can write hopelessly inefficient SQL queries and still get information back in a few seconds ;-) )
03:32
<Lachy>
why do people keep overreacting and bringing up the headers issue all the time?! http://lists.w3.org/Archives/Public/public-html/2007Aug/0926.html
04:07
<Hixie>
"headers= are allready counted by our editor as insignificant"
04:07
<Hixie>
they are? i thought i'd not yet looked at them.
04:14
<Lachy>
well, apparently Leif knows something about you that you don't :-)
04:15
<Hixie>
good to know
04:15
<Hixie>
i better update my opinions to think that headers="" is insignificant then
04:31
<Lachy>
I'm trying to think of a way to explain that the significance of a feature isn't dependent upon stats alone and give some examples of other factors that could be considered. any suggestions?
04:32
<othermaciej>
there's a lot of things to consider
04:33
<othermaciej>
- does the feature fulfill a needed use case?
04:33
<othermaciej>
- is it possible to do the same things as well without the feature?
04:33
<othermaciej>
- which existing user agents, if any, implement the feature?
04:33
<othermaciej>
- is the feature used widely in existing content?
04:34
<Hixie>
- have existing user agents invented similar proprietary features to address the use case
04:34
<Hixie>
- have libraries (e.g. dojo) implemented work arounds for the lack of the feature?
04:34
<Lachy>
- benefits to users and authors
04:34
<othermaciej>
- is most existing use (if there is any) such that it would be beneficial or hamful to the goal of the feature to support it?
04:35
<othermaciej>
for example if a feature is not implemented by mainstream browsers but is widely used, it becomes likely that at least some content using it will depend on it being ignored
04:36
<othermaciej>
there's all sorts of things to think about
04:36
<Lachy>
thanks, that's a good list
04:37
<othermaciej>
and another thing to consider is that different criteria may apply to whether a feature is required for implementations, and to whether it is valid for content
04:37
<othermaciej>
a feature that is considered to have no valid use cases and to violate various important principles may nontheless be required for implementations
06:10
<Hixie>
so i'm examining some of my data collected over teh past few months with an eye towards what can be published
06:10
<Hixie>
looking at the longdesc="" data closer, it seems the situation is even more dire than i thought
06:11
<Hixie>
only about 0.6% of <img> elements with a longdesc="" attribute have a useful value, as far as i can tell
06:11
<Hixie>
which make it about 0.0007% of <img> elements
06:11
<othermaciej>
what are some of the kinds of values that are non-useful?
06:12
<Hixie>
all the wikipedia <img> elements have bogus values that point to non-descriptions
06:12
<Hixie>
there are also many <img> elements that have longdesc="" attributes with values that are redundant with an ancestor <a> element's href="" attribute
06:12
<Hixie>
or that point to the root of another domain
06:13
<othermaciej>
wow
06:14
<Hixie>
hm, i didn't check for identity with the alt="" attribute
06:14
<Hixie>
i should have
06:16
<Lachy>
Hixie, there's a bug in your issues list http://www.whatwg.org/issues/top - see the top issue that says 3 votes, yet there's only 1
06:17
<Hixie>
woah, clicking it changes the result
06:17
<Hixie>
freaky
06:17
<Hixie>
noted
06:17
<Hixie>
thanks
06:18
<Lachy>
what evidence was presented in support of longdesc?
06:18
<Hixie>
dunno, haven't studied it beyond looking at this data so far
06:19
Lachy
checks the wiki
06:19
<Lachy>
hmm. some people want longdesc on iframe too
06:20
<othermaciej>
that sounds amazingly redundant
06:20
<othermaciej>
the iframe itself would in general contain markup
06:20
<othermaciej>
why would you need a markup alternative to it?
06:21
<Hixie>
where else can spammers put their links? we've blocked off most of the other places
06:25
<Lachy>
Hixie, can you publish a list of URIs for sites that look like they're using longdesc legitimately?
06:27
<Hixie>
http://junkyard.damowmow.com/292
06:27
<Lachy>
is that all of them, or just a sample?
06:27
<Hixie>
sample of 100
06:28
<Hixie>
there were millions that didn't hit any of my heuristics
06:28
<Hixie>
(about 6% of <img> elements with a longdesc="")
06:28
<Lachy>
This is not legitmate: <a href="http://www.google.co.jp/"><img src="http://blog2.fc2.com/2/20century/file/Logo_20s.gif"; alt="Google" height="75" width="143" longdesc="http://www.google.co.jp/logos.html"; /></a>
06:28
<Hixie>
indeed not
06:29
<Hixie>
there are many that are not
06:29
<Hixie>
about 90% by the sample i looked at
06:29
<Lachy>
I'll let you know if I find one that is
06:32
<Lachy>
it's interesting how the entire list of use cases given in the wiki, doesn't actually include any use cases. Just a list of different disabilities that might benefit from it
06:39
<Hixie>
k well the score doesn't change when you open one anymore
06:39
<Hixie>
but i don't see why the score of the top one is wrong
06:40
<Lachy>
maybe it thinks I'm so important that my vote is worth 2? :-)
06:41
<Hixie>
seems unlikely! ;-)
06:47
<Hixie>
either i don't understand GROUP BY / COUNT() or there's a bug in mysql
06:56
<Hixie>
oh!
06:56
<Hixie>
the message is in two folders!
07:01
<Hixie>
ok fixed
07:01
Hixie
changed COUNT(...) to COUNT(DISTINCT ...)
07:58
<Lachy>
Hixie, one attempted legitimate use of longdesc, but it's still questionable http://www.tcfp.state.tx.us/standards/standards_manual/standards_manual.asp?chapter=431429421 (see the state of texas logo at the bottom)
07:58
<Lachy>
links to http://www.tcfp.state.tx.us/image_description.txt
08:20
<jruderman>
.txt!
08:21
<othermaciej>
that could totally be alt text
08:21
<othermaciej>
the best part is that the plaintext contains a URL
08:34
<Lachy>
I updated the wiki to add some *real* use cases to replace the list of purported beneficiaries that was there http://esw.w3.org/topic/HTML/LongdescRetention
09:06
<Dashiva>
Lachy: Sorry if I double-posted you yesterday, had some trouble finding the right account for w3c posting
09:06
Dashiva
dreams about a mailing list with useful reply-to
09:06
<Lachy>
double posted me? which message?
09:07
<Lachy>
which name do you use in emails?
09:07
<Dashiva>
magnusrk+something@
09:07
<Lachy>
BTW, munging reply-to headers is considered bad practice and I'm glad the W3C doesn't do it
09:09
<Dashiva>
I've yet to see an argument against it that wasn't based on happening 20 years ago
09:09
<Lachy>
ah, right. I did get multiple copies of that one, but it doesn't bother me since I just deleted it along with the other dupes
09:10
<Lachy>
I've yet to see a convincing argument for it that isn't based on a users inability to use their mail client properly
09:10
<takkaria>
given that HTML5 is based in part around users' inability to write valid markup, it seems like that might be a pretty good reason. :)
09:11
<Lachy>
just press Reply All (or equivalent). It's usually right next to the Reply button!
09:11
<Dashiva>
That's the problem
09:11
<takkaria>
Lachy: oh, I agree, I just enjoy the mild irony. :)
09:11
<Dashiva>
I got a mail from Robert Burns today that didn't contain a single word I've said because i've been passed along by reply all for who knows how long
09:12
<Lachy>
people need to learn to trim the recipient list too!
09:12
<Dashiva>
User education doesn't work, you should know that ;)
09:12
<takkaria>
Lachy: btw, rob moved your use cases to "authoring cases" and moved what were "use cases" before back there
09:13
<Lachy>
WTF???
09:13
<Lachy>
They're not use cases!
09:14
<takkaria>
I tried to clean up the <img>fallback</img> section a while ago to remove duplicates, and he just reverted it, so I gave up on it
09:16
<zcorpan>
amusing
09:17
Lachy
reverted rob's change
09:17
<Dashiva>
revert war, I choose you
09:18
zcorpan
escapes the expected bashing
09:21
takkaria
wonders if the Design Principles aren't better changed to "How the HTML WG should work"
09:25
<takkaria>
Lachy: btw, are you going to look through all of Hixie's longdesc urls?
09:25
<Lachy>
not all of them. I looked through about a dozen randomly selected
09:27
<zcorpan>
random selection, eh? in the statistical sense? not biased in some way? ;)
09:28
<Lachy>
well, not entirely random. I just picked a couple from the top, scrolled down a bit and picked a few others, then repeated.
09:28
<Lachy>
it was sort of biased by ignoring the wikipedia entires that I knew were bogus
09:29
<takkaria>
I assume someone's already argued with the wikimedia people about how they're misuing longdesc?
09:30
<Lachy>
don't know, probably.
09:32
takkaria
adds that to his todolist
09:43
<Lachy>
Rob isn't listening again, I give up
09:45
<takkaria>
Lachy: apologies; I've sent you two duplicate replies to your longdesc post, both off-list
09:45
<takkaria>
I confuse myself by having too many email accounts and never hitting reply-to-all
09:47
<Lachy>
takkaria, just resend it to the list, it doesn't matter if I get dupes
09:47
takkaria
has done so
10:29
<zcorpan>
um
10:29
takkaria
sighs at Rob for another contentless post
10:29
<zcorpan>
what should happen if you serve a gif as image/png and use it in an <object>?
10:30
<zcorpan>
browsers just decode it as a gif
10:52
<Lachy>
zcorpan, browsers don't care what image MIME is use, they just invoke the appropriate image library by sniffing the first few bytes of the file, looking for the signature
10:56
<Whiskey_M>
'lo
11:29
<zcorpan>
Lachy: right... so image/png, image/gif, etc, all go through "Content-Type sniffing: image"
12:03
<Lachy>
zcorpan, yeah, something like that
13:42
<zcorpan>
hmm, wonder if browsers look at the file extension in their sniffing code
13:44
<Whiskey_M>
if it helps I have noted that for sites that serve content (normally avi's ) with incorrect headers IE will normally serve it with the application based upon extension, rather than FF which tends to serve based on the header
13:46
<zcorpan>
interesting
13:48
<Philip`>
Is that for content that the browser displays itself, or for content where it either saves to disk or asks the OS to find some external program to open it?
16:26
<Lachy>
http://blog.whatwg.org/omit-alt
16:29
<Lachy>
jgraham, that's a good explanation of the flag example you posted. Thanks, now I don't have to respond. :-)
18:56
<gsnedders>
who actually owns html5.org?
18:57
<Lachy>
gsnedders, anne
18:57
<gsnedders>
ah
22:18
<Hixie>
btw Lachy if you can suggest some page-only heuristics (i.e. not involving the network) for detecting bogus longdesc=""s that would have caught them in the URLs i mentioned, it would be useful
22:18
<Hixie>
i'd be able to rerun the study excluding those
22:33
<othermaciej>
I looked at a few of those and they generally didn't look detectable from the page context alone
22:36
<Hixie>
yeah that was my conclusion too
22:37
<Hixie>
so i haven't yet looked at the headers="" stuff in detail
22:37
<Hixie>
but i wonder
22:38
<Hixie>
if i look at it, and find that imho we shouldn't include headers="" (which could be the case, just like it could be the case that we _should_ include it)
22:38
<Hixie>
will the situation become worse or better?
22:38
<Hixie>
in other words, is it better for the working group for me to address the issue at its proper time, or should i prioritise it given that the risk of that is that the attribute does in fact not get kept?
22:42
<othermaciej>
whether it has a positive effect on attitudes depends on the outcome
22:42
<jgraham>
Hixie: The problem is that we are unlikely to ever reach consensus on a spec that does not include headers irrespective of its technical merit
22:42
<othermaciej>
doing enough work to predict the outcome but not actually make any changes would not really be acting in good faith though, I think
22:43
<Hixie>
jgraham: we'll never reach consensus on the spec anyway
22:43
<jgraham>
Hixie: Fair point.
22:43
<Hixie>
othermaciej: yeah i wouldn't do that
22:43
<othermaciej>
I think there is a sense from some quarters that any feature that is nominally "for accessibility" is absolutely needed, regardless of whether it does anything to help accessibility in practice
22:43
<Hixie>
right
22:44
<othermaciej>
and that even trying to study the question of whether it helps accessibility in practice is somehow illegitimate
22:45
<othermaciej>
notwithstanding what I think of specific features, it is hard to make a good spec when there are sacred cows that may not be examined critically, so I don't know how much we can accomodate that point of view
22:45
<Hixie>
i'm not considering any cows sacred
22:45
<billmason>
I would suggest that if headers got some kind of priority and a determination made, even if the determination was "keep it", there would just be a move to argue the next accessibilty issue in dispute immediately. I would not make it a special priority just to give those evangelizing for it a more immediate answer.
22:46
<Hixie>
i'm not really concerned about appeasing people, i want to make the spec the best html spec possible, including addressing the goal of making documents that use the language universally accessible.
22:46
<othermaciej>
I know you don't - I'm just saying it's hard to collaborate with someone who does have sacred cows, even if you happen to agree with them on one point
22:46
<jgraham>
But it makes everyone's life miserable when some people consider cows to be sacred and they are taken to slaughter (so to speak)
22:46
<Hixie>
othermaciej: yeah
22:47
<Hixie>
billmason: yeah. so that argues for just treating it like any other issue, and going with the FIFO principle, right?
22:47
<billmason>
That would be my take on it, yes.
22:47
<Hixie>
i think that's probably the best course of action
22:47
<billmason>
And for parsing logs later to write emails, I say that as a person with a general accessibility interest/focus/etc.
22:47
<Hixie>
i expect Dan to start telling me to prioritise public-html wg feedback over existing whatwg feedback though.
22:48
<Hixie>
he seems to have interpreted my "a few months" as "3 or so months" not "20 or so months" which is probably closer to realistic.
22:48
<othermaciej>
I do think Chaals made a good point that accessibility features might deserve special status even if only a subset of especially accessible sites use them (properly), since those might be the only sites that someone with a given disability can use at all
22:48
<Hixie>
not sure what to do about that if he does ask me to do so
22:48
<othermaciej>
but I don't know if the evidence bears that out as a justification for headers="" or the like
22:49
<jgraham>
Hixie: Two stacks, one issue from the top of each in turn?
22:49
<othermaciej>
it would require examples of actual particularly accessible sites that consistently use it in a way that is beneficial
22:49
<othermaciej>
not just a theory that there might be some
22:49
<Hixie>
othermaciej: well, there are two things, right, there's the features that are rarely used but when used make the page better, and there are the features that are very widely misused to the point where they actively hurt the user's experience on most sites
22:50
<Hixie>
longdesc clearly falls into the latter category according to all the studies i've done or seen
22:50
<Hixie>
as in, exposing longdesc will mostly expose the user to spam or useless content, even on supposedly accessibility-aware pages
22:51
zcorpan
would think that the way tabindex is implemented, it actively hurts the user experience too. treating all positive numbers as 0 for tabindex would, i think, have better user experience
22:52
<Hixie>
that's possible too, i haven't even looked at tabindex yet other than adding the negative thing
22:53
<zcorpan>
yep. :) i just suddenly came to think of tabindex when reading the above
22:53
<Hixie>
one thing i really don't know how to fix is accesskey=""
22:54
<zcorpan>
dunno either
22:54
<jgraham>
That isn't going to be fun.
22:54
<zcorpan>
although opera's implementation is somewhat useful, or at least not harming the user experience
22:55
<Hixie>
opera's implementation is unintuitive
22:55
<jgraham>
From what I remember Mike Smith saying it sounds like different solutions are appropriate on different devices
22:55
<Hixie>
and doesn't actually solve the problem of how to make it device independent
22:55
<Hixie>
yeah
22:55
<Hixie>
maybe it really is a stylistic thing
22:55
<Hixie>
we do have key-equivalent in CSS iirc
22:56
<Philip`>
Hixie: I would assume users wouldn't see the spam or useless content since they wouldn't be explicitly asking their tool to give them the longdesc, except in the cases where they have a good expectation that it's going to be useful, so misuse wouldn't hurt the user much
22:56
<Hixie>
Philip`: really? how would they know to ask?
22:57
<Hixie>
for longdesc it really seems to me that the only cases i've seen where the longdesc was actually useful and wasn't something you could have just stuffed into alt="", it was actually useful to sighted users too
22:57
<Hixie>
and could have just been included on the page or in a link from the page
22:57
<Philip`>
I'd assume the tool would indicate in some quick way that there is a longdesc attached to the image, similarly to how it must quickly indicate wherever there is a link
22:58
<Philip`>
but I have precisely no experience of how relevant tools work in practice
22:58
<zcorpan>
jaws says "press enter for long description" after reading the image alt, and if you press enter it will open the url in a new window
22:58
<zcorpan>
iirc
22:58
<Hixie>
right but how do you know it's appropriate?
22:59
<Philip`>
(At least from what I've heard about table headers, they're not read out by default - you press some key when you've got the right cell selected, and if you see one cell has rubbish headers then you won't bother checking every single other cell - so the harm caused by misuse is similarly minimised)
23:01
<Philip`>
Hixie: By considering the context, like whether other images on the page have useful longdescs - e.g. you'd know it's worthwhile reading all the longdescs in the CSS spec after you've seen the first few
23:02
<Philip`>
(and on sites which misuse it, you'd quickly realise you should just ignore all the rest)
23:02
<Hixie>
that doesn't seem like the optimal user experience
23:02
<Hixie>
it's like reminding the user continually that the page wasn't designed for them but there's this secondary set of content they can access
23:03
<Hixie>
seems like it would make one bitter
23:04
Hixie
tries to build up a list of requirements for offline web apps
23:06
<Philip`>
It seems a less significant negative point than with e.g. <input usemap> (where if your browser supports it, there are features of some sites that just don't work at all) - it might waste a bit of time to read out useless longdescs, but it's not preventing the user from using the site
23:08
<Philip`>
A closer-to-optimal user experience would be good, though I don't have any ideas for that :-)
23:08
<Hixie>
yeah the main argument for me against longdesc is that it's not useful at all, except in rare cases where frankly sighted users would benefit too, and therefore you're better off putting the content on the page itself
23:08
<othermaciej>
Hixie: one thing that might work is to limit the number of access keys and make it a set that UAs can map in a natural way on each platform to something w/ no conflicts
23:08
<Hixie>
it's hard to say since i've seen so few useful uses of it
23:08
<Hixie>
othermaciej: yeah that's been suggested
23:08
<Hixie>
othermaciej: but then the UI becomes unclear
23:09
<Hixie>
othermaciej: i.e. discoverability drops through the floor
23:12
<kingryan>
Hixie: having CSS-useable hooks could help with styleability
23:12
<Hixie>
yeah
23:12
<kingryan>
s/styleability/discoverability/
23:13
<Hixie>
like ...:after { content: ' (' copy-key ')'; }
23:13
<Hixie>
... { key-equivalent: copy-key; }
23:13
<kingryan>
yeah
23:13
<Hixie>
or ... { key-equivalent: copy-key; } ...::after { content: ' (' key-equivalent ')'; }
23:13
<Hixie>
...to have resilience against the cascade
23:13
<kingryan>
....:access-key('n'):after { content: 'meta-key N'}
23:14
<kingryan>
yeah
23:14
<Hixie>
i was thinking of also having othermaciej's idea since that solves the device problem too
23:14
<Philip`>
Subtitles on TV shows would benefit sighted users too (e.g. when they get distracted and miss a couple of words, or can't understand someone's accent), but usually they aren't displayed along with the content since they're ugly and distracting and not sufficiently useful to be shown to all users; people may have similar reasons for not wanting to put image descriptions in the normal page content, and hiding it being [D] links or londesc
23:15
<kingryan>
Hixie: is om's idea of having a limited set?
23:15
<Philip`>
s/being/behind/
23:15
<Hixie>
yeah but subtitles aren't only accessible to blind users or hidden behind long properties pages, they're one-button accessible
23:15
<Hixie>
like an <a> link would be
23:15
<Hixie>
kingryan: yeah see above
23:16
<Philip`>
That just seems like a browser UI issue
23:16
<kingryan>
yeah, me wishes browsers couldn't remap stuff that the browser already handles
23:16
<kingryan>
or allow me to re-remap it
23:16
<Hixie>
Philip`: i don't think we should hide longdesc behind a context menu or something. i'm saying the link should be right there in the content just like for everything else.
23:19
<Philip`>
That sounds like D-links - I think the only argument I've heard against them is they don't look very nice (but there are quite possibly other arguments I haven't heard)
23:19
<othermaciej>
Hixie: if the set was 0-9 for instance, it could be unmodified 0-9 on phones with a keypad, Cmd-0 - Cmd-9 on Macs, Ctrl-0 - Ctrl-9 on windows, etc
23:19
<othermaciej>
(assuming those are actually free)
23:20
<Hixie>
Philip`: yeah, though i wouldn't use [D].
23:20
<othermaciej>
but I'm not sure how you address discoverability
23:20
<Philip`>
(http://www.w3.org/TR/WCAG10-HTML-TECHS/#long-descriptions says "Invisible d-links thus provide a (temporary) solution for designers who wish to avoid visible d-links for stylistic reasons.")
23:20
<othermaciej>
I think the page has to provide info about shortcuts
23:20
<Hixie>
othermaciej: i think kingryan's idea helps with that
23:20
<billmason>
I think there's some kind of study that says numeric shortcuts aren't really free, though.
23:20
<othermaciej>
that's how keyboard shortcuts work in native apps
23:20
<othermaciej>
they are listed in the menu bar
23:21
<othermaciej>
having a shortcut list with labels somewhere in the page or accessible from the chrome would be the natural analogy
23:21
<othermaciej>
I'm not sure if styling them to mention they key helps
23:21
<kingryan>
yeah, having it in the chrome seems reasonable
23:21
<othermaciej>
thee UA could do a better job if it is the only thing that knows the concrete key mapping, but you need a label to go next to the shortcut
23:22
<othermaciej>
*the UA
23:22
<othermaciej>
it could even be a menu or submenu in the menu bar
23:22
<Hixie>
hm yeah, for <command>s and other Command elements you could just have the UA create a menu somewhere with the key equivs
23:22
<othermaciej>
"Page Shorcuts"
23:22
<kingryan>
it'd be nice in interactive browsers if you could could style the access-key elements with the modifier key is pressed
23:22
<Hixie>
they could do that now with :active
23:23
<othermaciej>
kingryan: if the modifier is a commonly used one on the OS, that could be distracting
23:23
<kingryan>
othermaciej: indeed it could
23:23
<othermaciej>
something based on <command> seems like a good basic approach
23:23
<Hixie>
oh i misread what kingryan said
23:23
<othermaciej>
the idea is that a shorcut key actually activates a command, for which there might also be one or more UI elements
23:23
<kingryan>
there are some places, like dialogue windows in os x, where this is done already
23:24
<othermaciej>
it makes more sense to associate it with a command, which can then have an appropriate label
23:24
<othermaciej>
kingryan: example?
23:24
<othermaciej>
(I don't know of dialog windows that have keyboard shortcuts beyond the standard tab/enter/esc and such)
23:25
<kingryan>
I remember something adding "cmd-foo" in a button when cmd is pressed down
23:25
<kingryan>
I'm not sure where that is
23:25
<othermaciej>
I don't think that is standard
23:25
<kingryan>
it's not
23:25
<kingryan>
I'm just seeing that I've seen it done before and I helped with discoverability
23:28
<othermaciej>
usually in OS X all the keyboard shorcuts also have a menu item, so you can look in the menu system to see the shortcuts
23:31
<kingryan>
othermaciej: true
23:33
<othermaciej>
Windows is different since it uses the underline system sometimes, and that may apply both to menus and items in a dialog