
Thread: getting text from url?

  1. #1
    Member
    Join Date
    Apr 2013
    Posts
    93
    Thanks
    7
    Thanked 3 Times in 3 Posts

    getting text from url?

    Hi, is there a way to grab all the text from a URL with jsoup? I have seen code for extracting specific things like links and headings, but is there a way to grab all the text from a URL, as if you had copied and pasted the site into a text.doc? I can't find much info on it at all. Thanks.


  2. #2
    Super Moderator pbrockway2
    Join Date
    Jan 2012
    Posts
    989
    Thanks
    7
    Thanked 207 Times in 183 Posts

    Re: getting text from url?

    I don't know jsoup, but I would guess not. When you paste stuff from a web page into MS Word that program takes the contents of the page and parses it (to find paragraph breaks, table rows etc), downloads stuff (images for example), does a whole lot of layout and finally represents the resulting information in the appropriate format for .doc documents. I find it unlikely that jsoup would provide all this functionality for this particular (and outdated) format.

  3. #3
    Member
    Join Date
    Apr 2013
    Posts
    93
    Thanks
    7
    Thanked 3 Times in 3 Posts

    Re: getting text from url?

    Originally Posted by pbrockway2
    I don't know jsoup, but I would guess not. When you paste stuff from a web page into MS Word that program takes the contents of the page and parses it (to find paragraph breaks, table rows etc), downloads stuff (images for example), does a whole lot of layout and finally represents the resulting information in the appropriate format for .doc documents. I find it unlikely that jsoup would provide all this functionality for this particular (and outdated) format.
    Any suggestions on how it could be done?
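
    For the plain-text part of the question (not the Word layout, images, or .doc output described above), a minimal sketch with jsoup might look like the following. It assumes the jsoup jar is on the classpath. The class name PageText, the helper names extractText and fetchText, and the example URL are made up for illustration. As far as I know, Element.text() returns the combined text of an element and its children with whitespace collapsed, and it leaves out script and style content, so the result is one long run of text rather than a laid-out copy of the page.

```java
import java.io.IOException;

import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;

public class PageText {

    // Hypothetical helper: parse an HTML string and return the text of its body.
    // Whitespace is normalized, so paragraph breaks are not preserved.
    static String extractText(String html) {
        Document doc = Jsoup.parse(html);
        return doc.body().text();
    }

    // Hypothetical helper: download the page at the given URL and return its body text.
    static String fetchText(String url) throws IOException {
        Document doc = Jsoup.connect(url).get();
        return doc.body().text();
    }

    public static void main(String[] args) throws IOException {
        // The default URL is only a placeholder; pass a real one as the first argument.
        String url = args.length > 0 ? args[0] : "https://example.com/";
        System.out.println(fetchText(url));
    }
}
```

    If you need the text grouped by paragraph or heading, one option is to select those elements and call text() on each one instead of on the whole body.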
