W3C Validator Update

Collapse
This topic is closed.
X
X
 
  • Time
  • Show
Clear All
new posts
  • Nick Kew

    W3C Validator Update

    There's a new beta of the W3C Markup Validation Service now live at
    <URL:http://validator.w3.or g:8001/>

    Probably the most important change is verbose output, including attempts
    to explain the validator errors. Other changes include improved
    display of error messages, and a choice of parse modes.
    Currently - but probably not for long - it includes an
    "interestin g" default setting!

    Some of the changes have been the subject of much debate. We need
    to widen that to include users: please tell us what you like or
    dislike in the new service! Quick feedback may catch Terje while
    he's still hacking this release:-)

    As always, problem and bug reports are welcome, but please check
    first whether they're already known.

    --
    Nick Kew

    In urgent need of paying work - see http://www.webthing.com/~nick/cv.html
  • Isofarro

    #2
    Re: W3C Validator Update

    Jim Ley wrote:
    [color=blue]
    > On Thu, 28 Aug 2003 19:11:44 +0000 (UTC), "Jukka K. Korpela"
    > <jkorpela@cs.tu t.fi> wrote:
    >[color=green]
    >>nick@fenris.w ebthing.com (Nick Kew) wrote:[/color][/color]

    First of all - well done and thanks for the efforts all of you have put into
    this.

    [color=blue]
    > This beta release has been defaulted to an extended mode as an
    > oversight, that's clearly wrong, do you feel a fussy mode should not
    > exist?[/color]

    There's certainly benefit in a fussy mode. Whether that should still fall
    under the naming of "validator" is a different question. I certainly don't
    feel qualified to answer that question.

    [color=blue]
    > An SGML validator certainly, is the CSS validator also useless? If
    > you don't like the non-technical use of validator, what do you propose
    > such a QA tool be called?[/color]

    Quality Assist. QA Assist. Its tools about improving the Quality of the
    markup, so why not focus on the Quality aspect. Businesses tend to like
    words like Quality.
    [color=blue]
    > Yes the beta is wrong to claim valid document invalid, yes the beta is
    > wrong to default to fussy mode - I think everyone has acknowledged
    > that. Do you see anything else wrong in the beta you could report?[/color]

    Nothing wrong, but confused me a bit:
    <http://validator.w3.or g:8001/check?uri=http% 3A%2F%2Fwww.iso lani.co.uk%2Fbl og%2F&verbose=1 &fussy=1>
    (Validating my blog page)

    I'm a little surprised that the "unescaped" & in the main text went
    unnoticed by validation - but I'm glad the new fussy checker picked it up
    (Multiple occurrances of Marks & Spencer within the text). It is probably
    more my lack of understanding of what a validator checks rather than the
    tool itself - but its good enough to convince me of the benefits of a fussy
    checker.


    Good work!


    --
    Iso.
    FAQs: http://html-faq.com http://alt-html.org http://allmyfaqs.com/
    Recommended Hosting: http://www.affordablehost.com/
    Web Standards: http://www.webstandards.org/

    Comment

    • Jukka K. Korpela

      #3
      Re: W3C Validator Update

      jim@jibbering.c om (Jim Ley) wrote:
      [color=blue]
      > Yes the beta is wrong to claim valid document invalid, yes the beta is
      > wrong to default to fussy mode - I think everyone has acknowledged
      > that.[/color]

      So why hasn't it been fixed? And how _did_ they manage to make such
      elementary errors? If you ask me, it was just the culmination of the
      approach that created "fussy mode" in the first place.
      [color=blue]
      > Do you see anything else wrong in the beta you could report?[/color]

      Should I report something else than the fact that all the announced new
      features are nonsense?

      We would have needed a good tag soup checker years ago. Turning a
      validator to a very one-sided tag soup checker helps nobody.

      --
      Yucca, http://www.cs.tut.fi/~jkorpela/
      Pages about Web authoring: http://www.cs.tut.fi/~jkorpela/www.html

      Comment

      • Jim Ley

        #4
        Re: W3C Validator Update

        On Thu, 28 Aug 2003 20:12:49 +0000 (UTC), "Jukka K. Korpela"
        <jkorpela@cs.tu t.fi> wrote:
        [color=blue]
        >jim@jibbering. com (Jim Ley) wrote:
        >[color=green]
        >> Yes the beta is wrong to claim valid document invalid, yes the beta is
        >> wrong to default to fussy mode - I think everyone has acknowledged
        >> that.[/color]
        >
        >So why hasn't it been fixed?[/color]

        Hmm, QA processes suggest to me that fixes should be tested first,
        before being rolled out onto production machines, the beta validator
        does run on the same machine as the real validator, so hack and patch
        as you go probably isn't particularly wise is it?
        [color=blue]
        >And how _did_ they manage to make such
        >elementary errors?[/color]

        Elementary errors as what, defaulting to a wrong mode, that's pretty
        simple to do surely? The text of the message saying valid - well one
        word can easily be overlooked, it is mostly aesthetic when you know
        what you actually mean, I can at least understand both bugs creeping
        in with my authoring processes, which is why we have betas etc.
        [color=blue][color=green]
        >> Do you see anything else wrong in the beta you could report?[/color]
        >
        >Should I report something else than the fact that all the announced new
        >features are nonsense?[/color]

        Of course not, you very right to report the problems here. I was
        asking if you had seen any other issues.
        [color=blue]
        >We would have needed a good tag soup checker years ago. Turning a
        >validator to a very one-sided tag soup checker helps nobody.[/color]

        What do you mean by "one sided" ?

        Jim.
        --
        comp.lang.javas cript FAQ - http://jibbering.com/faq/

        Comment

        • Jukka K. Korpela

          #5
          Re: W3C Validator Update

          jim@jibbering.c om (Jim Ley) wrote:
          [color=blue][color=green]
          >>We would have needed a good tag soup checker years ago. Turning a
          >>validator to a very one-sided tag soup checker helps nobody.[/color]
          >
          > What do you mean by "one sided" ?[/color]

          It apparently applies a collection of simple syntactic rules, generally
          aimed at tag verbosity in XHTML style, trying to force them upon HTML,
          and generating cascades of "error messages". There's surely much else
          that could and should be checked in HTML markup.

          --
          Yucca, http://www.cs.tut.fi/~jkorpela/
          Pages about Web authoring: http://www.cs.tut.fi/~jkorpela/www.html

          Comment

          • Jim Ley

            #6
            Re: W3C Validator Update

            On Thu, 28 Aug 2003 22:30:12 +0200, "Alan J. Flavell"
            <flavell@mail.c ern.ch> wrote:
            [color=blue]
            >I think the problem here is that - as long as there have been people
            >who took the W3C's assertion "HTML is an application of SGML" in any
            >way seriously - the term "validator" has had a specific technical
            >meaning, namely the meaning which it inherits from SGML.[/color]

            Yes, and within that scope it's fine, but I don't fully agree that the
            W3 has been within that scope for some time, nor is it particularly
            useful for it to be, no-one has been taking the application of SGML
            seriously because there's too many holes in that.
            [color=blue]
            > So, within the scope of its
            >implementati on limitations, it gives an unambiguous answer, and all
            >validators must give the same answer, even if they express it in
            >different words.[/color]

            Certainly, which is all a default v.w3.org should do, having more
            useful QA options is a good thing, I feel and think Jukka is being
            much too negative in his criticism, by suggesting to me that the
            entire process is flawed. I'd love to see some alternative QA
            approach that didn't use Jukka's "one-sided" approach, I believe
            linting of attributes is pencilled in for a 0.7.0 release of v.w3.org.
            These sort of things are useful, if you want a validator you've got
            one, web authors though need QA tools, not pointless pedantry on HTML
            origins.

            Criticise the defaults, criticise the claim that it's invalid, they're
            horrible bugs *, but I'd like to see the idea of web-QA taken
            seriously, SGML validation doesn't do that.
            [color=blue]
            >but I *would* like to see the distinction made more clearly, that's
            >all I'm saying.[/color]

            I agree, much of the supporting commentary isn't there yet.

            Jim.

            * I'm beginning to wonder if the fussy default wasn't a bug but an
            intentional way to get the beta talked about and looked at, but I
            don't actually think Terje is quite that mad.
            --
            comp.lang.javas cript FAQ - http://jibbering.com/faq/

            Comment

            • Matthias Gutfeldt

              #7
              Re: W3C Validator Update

              Andreas Prilop schrieb:[color=blue]
              >
              > Isofarro <spamblock@spam detector.co.uk> wrote:
              >[color=green]
              > > There's certainly benefit in a fussy mode.[/color]
              >
              > What exactly is a "fussy mode"?[/color]

              It's explained in the announcement:
              <http://lists.w3.org/Archives/Public/www-validator/2003Aug/0105.html>


              Matthias

              Comment

              • JAMESICUS

                #8
                Re: W3C Validator Update

                Follow the fun at:



                James Pickering

                (Validates in v0.6.5 [Beta #1] "fussy" mode)

                Comment

                • Nick Kew

                  #9
                  Re: W3C Validator Update

                  In article <msqlib.bn1.ln@ sidious.isolani .co.uk>, one of infinite monkeys
                  at the keyboard of Isofarro <spamblock@spam detector.co.uk> wrote:[color=blue]
                  > Nick Kew wrote:
                  >[color=green]
                  >> There's a new beta of the W3C Markup Validation Service now live at
                  >> <URL:http://validator.w3.or g:8001/>[/color]
                  >
                  > This URL doesn't display any of the form elements in Konqueror 3.0.0 on Suse
                  > 8.0:
                  >
                  > http://www.isofarro.freeserve.co.uk/temp/w3val.png
                  >
                  > Not sure why - possibly the fieldset and legend need to be inside the form
                  > elements?[/color]

                  I've seen Konq 3 on deadrat 7.3 do that with fieldset. I'd say that's a
                  serious bug (Konq 2.2 is better). If only I had the time and kit to hack
                  browsers ....


                  --
                  Nick Kew

                  In urgent need of paying work - see http://www.webthing.com/~nick/cv.html

                  Comment

                  • Shawn K. Quinn

                    #10
                    Re: W3C Validator Update

                    Nick Kew wrote:
                    [color=blue]
                    > In article <msqlib.bn1.ln@ sidious.isolani .co.uk>, one of infinite monkeys
                    > at the keyboard of Isofarro <spamblock@spam detector.co.uk> wrote:[color=green]
                    >> This URL doesn't display any of the form elements in Konqueror 3.0.0 on
                    >> Suse 8.0:
                    >>
                    >> http://www.isofarro.freeserve.co.uk/temp/w3val.png
                    >>
                    >> Not sure why - possibly the fieldset and legend need to be inside the
                    >> form elements?[/color]
                    >
                    > I've seen Konq 3 on deadrat 7.3 do that with fieldset. I'd say that's a
                    > serious bug (Konq 2.2 is better). If only I had the time and kit to hack
                    > browsers ....[/color]

                    FWIW, the same browser on the same OS used to crash on
                    <http://www.w3.org/Style/CSS/> which did not impress me in the least. 3.1.3
                    has been much better except for one rather annoying bug (always-present
                    horizontal scrollbar).

                    --
                    Shawn K. Quinn

                    Comment

                    • Stan Brown

                      #11
                      Re: W3C Validator Update

                      In article <Xns93E5E162656 17jkorpelacstut fi@193.229.0.31 > in
                      comp.infosystem s.www.authoring.html, Jukka K. Korpela
                      <jkorpela@cs.tu t.fi> wrote:[color=blue]
                      >I wrote my immediate comments to the www-validator list, saying much the
                      >same as here, and that list would have been suitable for fixing the
                      >errors. But if the bogosity is announced on a wider forum, I think it
                      >needs to be pointed out what it is.[/color]

                      I am not a subscriber to the www-validator list, but if what you
                      wrote there is much the same as what you wrote here, I don't know
                      how they'll know what you meant. I've read your article twice and I
                      understand that you hate the new validator, but I don't understand
                      why.

                      The one specific complaint I understood was that a lot of the
                      explanations of error messages don't exist and instead have annoying
                      placeholders in small type. (Those would annoy me too.)

                      I can't say whether your other criticisms of the validator are right
                      or wrong, because I have no idea what they are. You say it "invents
                      its own rules" and "intentiona lly claims that a valid page is not
                      valid", but give no examples or explanations.

                      If someone gave you such a criticism for your work, what would you
                      do with it? How would that guide you in making improvements?

                      --
                      Stan Brown, Oak Road Systems, Cortland County, New York, USA

                      HTML 4.01 spec: http://www.w3.org/TR/html401/
                      validator: http://validator.w3.org/
                      CSS 2 spec: http://www.w3.org/TR/REC-CSS2/
                      validator: http://jigsaw.w3.org/css-validator/

                      Comment

                      • JAMESICUS

                        #12
                        Re: W3C Validator Update

                        Stan Brown the_stan_brown@ fastmail.fm wrote (with reference to a Jukka K.
                        Korpela posting):
                        [color=blue]
                        >I am not a subscriber to the www-validator list, but if what you
                        >wrote there is much the same as what you wrote here, I don't know
                        >how they'll know what you meant ..... <snip>[/color]

                        You can follow the interchange at:



                        James Pickering

                        Pickering Pages
                        Computer programming since 1951 (IBM 407)

                        Comment

                        • Alan J. Flavell

                          #13
                          Re: W3C Validator Update

                          On Fri, Aug 29, Nick Kew inscribed on the eternal scroll:
                          [color=blue]
                          > at the keyboard of "Jukka K. Korpela" <jkorpela@cs.tu t.fi> wrote:
                          >[color=green]
                          > > more serious than claiming undefined character references to be markup
                          > > errors).[/color]
                          >
                          > Do you have a reference for that? My reading of it is that undefined
                          > references are indeed errors, and that reporting them as such was in
                          > fact fixing a bug in early versions.[/color]

                          Some years ago (I guess around 1995?) I was assured by an SGML
                          specialist (I can only dimly recollect who it was, and at the moment I
                          cannot find the correspondence, so please excuse me for not trying to
                          name him) that in SGML an undefined numerical character reference is
                          not an error, and it is open to the parties to reach mutual agreement
                          on a meaning for them.

                          Google reveals this discussion document from 1996:


                          This sets the context:

                          The US national body recommends that the following be considered
                          during the ongoing review of ISO 8879:

                          Under 3(b), one can read:

                          There are 3 classes of characters: those that can be entered directly
                          or through character references, those that can be entered only
                          through character references, and those that are prohibited. The
                          first class is declared with a minimum literal or base set character
                          number, the second by "UNUSED", and the third by omitting the
                          ^^^^^^^^^^^^^^^ ^^^^^^^
                          character number from the described character set portions. In
                          addition to clarifying the interpretation of the character set
                          declaration, this approach provides users with a way to prohibit
                          references to undefined characters.

                          I think it's fair to deduce from what it says there, that at the time
                          of writing, references to undefined characters were not ipso facto an
                          error, otherwise there would have been no need to discuss providing
                          users with ways to prohibit them.

                          If you want an authoritative answer, I guess you need to look at any
                          subsequent amendments to the SGML standard. It's beyond my current
                          expertise, and I have too much fire-fighting to do on other fronts to
                          be able to research it further, sorry.

                          good luck

                          Comment

                          • Jim Dabell

                            #14
                            Re: W3C Validator Update

                            Nick Kew wrote:

                            [snip][color=blue]
                            > There are longstanding FAQs concerning SHORTTAGS and NET-enabling tags[1],
                            > and the fact that a strict SGML parser permits them is not helpful to
                            > most users (see the list archives for examples of confusion it causes).
                            > That's why the WDG Validator and Page Valet default to parse modes
                            > that complain about them. Do you consider that wrong?[/color]
                            [snip]

                            It depends on the nature of the complaint. If it claims that the document
                            is _invalid_, then yes, that would be wrong. If, however, it claims that
                            the document is valid but may have serious compatibility issues, I'd
                            consider that to be good behaviour. I see no problem with including
                            linting behaviour in a validator as long as it clearly separates the
                            concept of "compatibil ity issues" from actual mistakes with the markup.

                            --
                            Jim Dabell

                            Comment

                            • Andreas Prilop

                              #15
                              Re: W3C Validator Update

                              Jacqui or (maybe) Pete <porjes@spamcop .net> wrote:
                              [color=blue][color=green]
                              >> <URL:http://validator.w3.or g:8001/>[/color]
                              >
                              > Speaking as a humble html hacker, I like the fussy mode. It picks up
                              > things (eg unclosed elements) that while strictly valid html 4.01 strict
                              > generally indicate that I've cocked something up.[/color]

                              First, I am confused (as a non-native speaker of English) by the
                              term "fussy". There is also the word "fuzzy" in the English language.
                              I suggest to find a better, technical word as description.

                              Second, this "validator" labels one and the same document in
                              <h2>big letters</h2> as "valid" or as "not valid" - just depending
                              on this "fussy" parameter. Now, is the page valid according the
                              HTML specifications or not? Yes or No?

                              This "validator" requires always <tbody> although the HTML 4.01
                              specifications clearly permit tables without <tbody>.
                              Suppose I have a <table> with 100 <tr> but without <tbody>.
                              This "validator" gives me 100 "errors"! But in fact it is only
                              one "error", namely the omission of <tbody>. How silly!

                              --

                              Comment

                              Working...