╤Є ЮкJc@s&dZddkZddklZlZddklZddk l Z lZddkl Z y eZWnej oeefZnXdefdДГYZd efd ДГYZdДZeddДZeedd ДZeeddДZeddДZeddДZeГZeГZdS(s An interface to html5lib. i N(t HTMLParsertXHTMLParser(tetree(t_contains_block_level_tagtXHTML_NAMESPACE(tTreeBuilderRcBseZdZedДZRS(s*An html5lib HTML parser with lxml as tree.cCsti|d|dtГdS(Ntstrictttree(t_HTMLParsert__init__R(tselfR((s;/usr/lib64/python2.6/site-packages/lxml/html/html5parser.pyR s(t__name__t __module__t__doc__tFalseR (((s;/usr/lib64/python2.6/site-packages/lxml/html/html5parser.pyRsRcBseZdZedДZRS(s+An html5lib XHTML Parser with lxml as tree.cCsti|d|dtГdS(NRR(t_XHTMLParserR R(R R((s;/usr/lib64/python2.6/site-packages/lxml/html/html5parser.pyR s(RRR RR (((s;/usr/lib64/python2.6/site-packages/lxml/html/html5parser.pyRscCs8|i|Г}|dj o|S|idt|fГS(Ns{%s}%s(tfindtNoneR(Rttagtelem((s;/usr/lib64/python2.6/site-packages/lxml/html/html5parser.pyt _find_tag s cCsPt|tГptdГВn|djo t}n|i|d|ГiГS(s%Parse a whole document into a string.sstring requiredt useChardetN(t isinstancet_stringst TypeErrorRthtml_parsertparsetgetroot(thtmlt guess_charsettparser((s;/usr/lib64/python2.6/site-packages/lxml/html/html5parser.pytdocument_fromstring's cCs░t|tГptdГВn|djo t}n|i|dd|Г}|oVt|dtГoB|o7|diГotid|dГВn|d=qмn|S(sФParses several HTML elements, returning a list of elements. The first item in the list may be a string. If no_leading_text is true, then it will be an error if there is leading text, and it will always be a list of only elements. If `guess_charset` is `True` and the text was not unicode but a bytestring, the `chardet` library will perform charset guessing on the string. sstring requiredtdivRisThere is leading text: %rN( RRRRRt parseFragmenttstripRtParserError(Rtno_leading_textRRtchildren((s;/usr/lib64/python2.6/site-packages/lxml/html/html5parser.pytfragments_fromstring2s cCsыt|tГptdГВn|o$|pd}d|||f}nt|t||Г}|ptidГВnt|ГdjotidГВn|d}|io*|ii Гotid|iГВnd |_|S( s Parses a single HTML element; it is an error if there is more than one element, or if anything but whitespace precedes or follows the element. If create_parent is true (or is a tag name) then a parent node will be created to encapsulate the HTML in a single element. sstring requiredR s<%s>%ssNo elements foundisMultiple elements foundisElement followed by text: %rN(RRRR&tTrueRR#tlenttailR"R(Rt create_parentRRt containerR%tresult((s;/usr/lib64/python2.6/site-packages/lxml/html/html5parser.pytfragment_fromstringNs cCs&t|tГptdГВnt|d|d|Г}|d iГiГ}|idГp|idГo|St|dГ}t|Гo|St|dГ}t|Гd joI|i p|i i Гo-|d ip|d ii Гo |dSt|Гo d|_ n d |_ |S(s№Parse the html, returning a single element/document. This tries to minimally parse the chunk of text, without knowing if it is a fragment or a document. base_url will set the document's base_url attribute (and the tree's docinfo.URL) sstring requiredRRi2ss* *