java - Parsing HTML with Jsoup

April 15, 2015

I'm trying to parse some HTML with Jsoup, and I'm stuck. Here's the structure of the HTML:

<ul class="sub-menu">
    <li id="1" class="1"><a href="http://link">some text</a>
        <ul class="sub-menu">
            <li .... ><a ... /></li>
            <li .... ><a ... /></li>
            <li .... ><a ... /></li>
        </ul>
    </li>
    <li id="2" class="2"><a href="http://link2">some other text</a>
        <ul class="sub-menu">
            <li .... ><a ... /></li>
            <li .... ><a ... /></li>
            <li .... ><a ... /></li>
        </ul>
    </li>
</ul>

I need each top-level li (id = 1, 2, and so on), and then the lis inside each of them (<li .... ><a ... /></li>).

Here's how my Java looks:

// ul contains the source above
Elements lis = ul.select("li"); // I know this line screws things up here, but I can't figure out how to do it correctly
for (Element li : lis) {
    String text = li.select("a").first().text();
    Elements lis2 = li.select("ul[class=sub-menu]").first().getElementsByTag("li");

    for (Element li2 : lis2)
    {
        Element a = li2.select("a").first();
        // and do other stuff with 'a'
    }
}

So, can anyone help me solve this problem?

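Edit: the problem is that ul.select("li") returns every single 'li' in the source I wrote here, including the nested ones. I only need the lis with id 1, 2, and so on, and then, for each of those, the <li .... ><a ... /></li> items inside it. P.S. Sorry for my bad English.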

I'm not sure, but try this:

for (Element element : doc.select("li")) {
    // attr("id") returns a String, so compare it with equals() rather than ==
    String id = element.attr("id");
    if (id.equals("1") || id.equals("2")) {
        // 'element' is one of the elements you want
        System.out.println(element);
    }
}

Regards, Hugo Pedrosa
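
Another way to approach the problem described in the edit is to avoid select("li") on the outer list altogether, since it matches descendants at every depth, and to walk only the direct children with Element.children(). The following is a minimal sketch under some assumptions, not a definitive solution: the class name SubMenuParser, the helper firstChild, and the nested link texts and hrefs in the sample HTML are all made up for illustration, because the original markup elides them.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;

import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper class, not part of Jsoup.
class SubMenuParser {

    // Returns the first direct child of 'parent' with the given tag name, or null.
    static Element firstChild(Element parent, String tag) {
        for (Element child : parent.children()) {
            if (child.tagName().equals(tag)) {
                return child;
            }
        }
        return null;
    }

    // Maps each top-level link text to the link texts of its nested items.
    static Map<String, List<String>> parse(String html) {
        Document doc = Jsoup.parse(html);
        Map<String, List<String>> menu = new LinkedHashMap<>();

        // The outermost list comes first in document order.
        Element ul = doc.select("ul.sub-menu").first();
        if (ul == null) {
            return menu;
        }

        // children() yields direct children only, so nested lis are not visited here.
        for (Element li : ul.children()) {
            if (!li.tagName().equals("li")) {
                continue;
            }
            Element link = firstChild(li, "a");
            if (link == null) {
                continue;
            }
            List<String> items = new ArrayList<>();
            Element nested = firstChild(li, "ul");
            if (nested != null) {
                for (Element li2 : nested.children()) {
                    Element a = firstChild(li2, "a");
                    if (a != null) {
                        items.add(a.text()); // do other stuff with 'a' here
                    }
                }
            }
            menu.put(link.text(), items);
        }
        return menu;
    }

    public static void main(String[] args) {
        // Sample markup; the nested hrefs and texts are placeholders.
        String html =
            "<ul class=\"sub-menu\">"
            + "<li id=\"1\" class=\"1\"><a href=\"http://link\">some text</a>"
            + "<ul class=\"sub-menu\">"
            + "<li><a href=\"#a\">first child</a></li>"
            + "<li><a href=\"#b\">second child</a></li>"
            + "</ul></li>"
            + "<li id=\"2\" class=\"2\"><a href=\"http://link2\">some other text</a>"
            + "<ul class=\"sub-menu\">"
            + "<li><a href=\"#c\">third child</a></li>"
            + "</ul></li>"
            + "</ul>";
        // prints {some text=[first child, second child], some other text=[third child]}
        System.out.println(parse(html));
    }
}
```

Because the outer loop never descends past one level, there is no need to filter on the id values 1, 2, and so on; the ids can still be read with li.attr("id") if they are needed.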
