java - Parsing HTML with Jsoup
I'm trying to parse some HTML with Jsoup and I'm stuck. Here's the structure of the HTML:
<ul class="sub-menu">
    <li id="1" class="1"><a href="http://link">some text</a>
        <ul class="sub-menu">
            <li .... ><a ... /></li>
            <li .... ><a ... /></li>
            <li .... ><a ... /></li>
        </ul>
    </li>
    <li id="2" class="2"><a href="http://link2">some other text</a>
        <ul class="sub-menu">
            <li .... ><a ... /></li>
            <li .... ><a ... /></li>
            <li .... ><a ... /></li>
        </ul>
    </li>
</ul>

I need each top-level li (id = 1, 2, and so on), and then the lis inside each of them (<li .... ><a ... /></li>).
Here's how my Java looks:

// ul contains the source above
Elements lis = ul.select("li"); // I know this line is what screws things up, but I can't figure out how to do it correctly
for (Element li : lis) {
    String text = li.select("a").first().text();
    Elements lis2 = li.select("ul[class=sub-menu]").first().getElementsByTag("li");

    for (Element li2 : lis2) {
        Element a = li2.select("a").first();
        // and other stuff with 'a'
    }
}

So can anyone help me solve this problem?
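Edit: the problem is that ul.select("li"); returns every single 'li' in the source I wrote here, including the nested ones. I need only the lis with id 1, 2, and so on, and then, for each of those, the <li .... ><a ... /></li> items inside it. P.S. Sorry for my bad English.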
I'm not sure, but try this:

for (Element element : doc.select("li")) {
    // attr() returns a String, so compare with equals() rather than ==
    String id = element.attr("id");
    if ("1".equals(id) || "2".equals(id)) {
        // this is one of the elements you want: 'element'
        System.out.println(element);
    }
}

Regards, Hugo Pedrosa
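Another option that avoids hard-coding the ids is to walk only the direct children of each list, since select("li") also matches the nested items. The following is a minimal sketch, assuming jsoup is on the classpath; the class name SubMenuWalker, the method describeTopLevel, and the sample markup in main are made up for illustration and are not from the original post.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;

import java.util.ArrayList;
import java.util.List;

// Hypothetical helper: names and sample markup are illustrative assumptions.
class SubMenuWalker {

    // Builds one line per top-level <li>: "<id>: <link text> (<n> sub-items)".
    static List<String> describeTopLevel(String html) {
        Document doc = Jsoup.parse(html);
        // The outermost list is the first match in document order.
        Element ul = doc.select("ul.sub-menu").first();
        List<String> out = new ArrayList<>();
        if (ul == null) {
            return out;
        }
        // children() returns direct child elements only, so nested <li>s are skipped here.
        for (Element li : ul.children()) {
            if (!"li".equals(li.tagName())) {
                continue;
            }
            String text = "";
            int subItems = 0;
            for (Element child : li.children()) {
                if ("a".equals(child.tagName())) {
                    text = child.text();
                } else if ("ul".equals(child.tagName())) {
                    // Second level: again only the direct <li> children of the nested list.
                    for (Element li2 : child.children()) {
                        if ("li".equals(li2.tagName())) {
                            subItems++;
                        }
                    }
                }
            }
            out.add(li.id() + ": " + text + " (" + subItems + " sub-items)");
        }
        return out;
    }

    public static void main(String[] args) {
        String html =
            "<ul class=\"sub-menu\">"
          + "<li id=\"1\" class=\"1\"><a href=\"http://link\">some text</a>"
          + "<ul class=\"sub-menu\"><li><a href=\"#\">a</a></li><li><a href=\"#\">b</a></li></ul></li>"
          + "<li id=\"2\" class=\"2\"><a href=\"http://link2\">some other text</a>"
          + "<ul class=\"sub-menu\"><li><a href=\"#\">c</a></li></ul></li>"
          + "</ul>";
        for (String line : describeTopLevel(html)) {
            System.out.println(line);
        }
    }
}
```

The inner loop is where the "other stuff" with each nested link would go; counting the items here just keeps the sketch short.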