Welcome to the Java Programming Forums


The professional, friendly Java community. 21,500 members and growing!


The Java Programming Forums are a community of Java programmers from all around the World. Our members have a wide range of skills and they all have one thing in common: A passion to learn and code Java. We invite beginner Java programmers right through to Java professionals to post here and share your knowledge. Become a part of the community, help others, expand your knowledge of Java and enjoy talking with like minded people. Registration is quick and best of all free. We look forward to meeting you.


>> REGISTER NOW TO START POSTING


Members have full access to the forums. Advertisements are removed for registered users.

Results 1 to 3 of 3

Thread: how to read a PDF file?

  1. #1
    Junior Member
    Join Date
    Jul 2010
    Posts
    4
    Thanks
    1
    Thanked 0 Times in 0 Posts

    Default how to read a PDF file?

    hi, I wanna write and application to read a PDF file which is not in English (imagine an Asian or Middle Eastern language), from what kind of packages can I use? how can I do this? can you offer a website with a sample code? thanks in advance


  2. #2
    Super Moderator copeg's Avatar
    Join Date
    Oct 2009
    Location
    US
    Posts
    5,225
    Thanks
    176
    Thanked 817 Times in 760 Posts
    Blog Entries
    5

    Default Re: how to read a PDF file?

    Unfortunately java currently does not have a PDF parser as part of its standard library. I've used Apache's PDFBox in the past and it worked marginally well. Not sure if there are any foreign language constraints to this library however (if it can extract unicode from a PDF file it should be ok)

  3. #3
    Junior Member
    Join Date
    Jul 2010
    Posts
    4
    Thanks
    1
    Thanked 0 Times in 0 Posts

    Default Re: how to read a PDF file?

    Quote Originally Posted by copeg View Post
    Unfortunately java currently does not have a PDF parser as part of its standard library. I've used Apache's PDFBox in the past and it worked marginally well. Not sure if there are any foreign language constraints to this library however (if it can extract unicode from a PDF file it should be ok)
    is there anything to read MS Word docs ?

Similar Threads

  1. Java servlet read file
    By peliukasss in forum Java Servlet
    Replies: 0
    Last Post: April 15th, 2010, 07:02 PM
  2. Read file until EOF from offset each time
    By Pr0ject-Rec0n in forum File I/O & Other I/O Streams
    Replies: 4
    Last Post: April 2nd, 2010, 09:13 PM
  3. New person Just trying to read a file of ints
    By dubois.ford in forum File I/O & Other I/O Streams
    Replies: 1
    Last Post: March 7th, 2010, 10:47 PM
  4. Java program to read last line of a file
    By JavaPF in forum File Input/Output Tutorials
    Replies: 2
    Last Post: September 10th, 2009, 02:26 AM
  5. How to Read a Portion of a File in Java?
    By jazz2k8 in forum File I/O & Other I/O Streams
    Replies: 3
    Last Post: July 7th, 2009, 04:16 PM

Tags for this Thread