extract source code from webpage

Is it possible to extract information from the source code of a webpage and how do i do that? (from inside a java program ofc).


  • Yes, it is possible. The work is already done. Use HttpUnit library. You will find it at www.sf.net. Also obtain its dependency like js.jar, etc. It is the best library for such kind of work.
  • Thanks a lot!

    Btw can anyone explain the MIT license for me, is it for me free to do whatever i want with it as long as i implement their text in my program or does it make my program free for all to use and do whatever if I use their library?

    since it may be used on a rather big website it's good to know =)
Sign In or Register to comment.

Howdy, Stranger!

It looks like you're new here. If you want to get involved, click one of these buttons!