For anyone interested in writing a conforming HTML5 parser: it’s not as easy as it looks. Here are some useful links:
- Understanding HTML, XML and XHTML article
- Parsing HTML documents: the part of the specification (chapter 12.2) on parsing HTML documents, from the WHATWG community
@josevalim: I think the 2nd link could be helpful for you (if you have time to read it), because it walks through parsing HTML documents step by step (which is much more complicated than a simple datetime parser), so you can check whether building such an advanced parser is doable with your library. I hope it’s a helpful resource.
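
To give a feel for why this is harder than it looks, here is a minimal sketch in Python (not Elixir, and not related to any particular parser library). It uses the standard library's `html.parser`, which only reports the tags it sees and does not run the tree construction stage from the spec. The `TagLogger` class, the `tag_events` helper and the sample markup are made up for illustration. In HTML the `</p>` end tag may be omitted, so a conforming parser has to close the first paragraph on its own when the second `<p>` starts; a plain tokenizer never reports that.

```python
from html.parser import HTMLParser


class TagLogger(HTMLParser):
    """Hypothetical helper: records start and end tags exactly as they appear."""

    def __init__(self):
        super().__init__()
        self.events = []

    def handle_starttag(self, tag, attrs):
        self.events.append(("start", tag))

    def handle_endtag(self, tag):
        self.events.append(("end", tag))


def tag_events(markup):
    """Feed markup to the tokenizer-style parser and return the tag events."""
    logger = TagLogger()
    logger.feed(markup)
    logger.close()
    return logger.events


# Two paragraphs written without end tags, which HTML allows.
print(tag_events("<p>one<p>two"))
# Only two start events are reported and no end events; working out where
# each element closes is the job of the tree construction rules in the spec.
```

So tokenizing is only the first half of the work. The implied end tags, misnested formatting elements and error recovery rules in the parsing chapter are what make a full HTML5 parser so much bigger than a datetime parser.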