I want to generate a report from a PDF file. The PDF has around 800 pages and contains a list of students along with their college codes and roll numbers. The roll number has the format XXXXYYZZZZZZ,
where XXXX is the college code (for example 0821, 0827 or 0831; note that the college code is a 4-digit numeric code),
YY is the branch code, such as CS, ME, IT, EC, etc.; it is a two-letter alphabetic code,
and ZZZZZZ is the roll code of the student; it is a 6-digit numeric code.
So, in short, the roll number consists of:
characters 1-4: college code (numeric)
characters 5-6: branch code (alphabetic)
characters 7-12: roll code (numeric)
An example roll number is 0821cs091021.
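To make the format concrete, here is a minimal sketch of how such a roll number could be matched and split into its three parts. It assumes Python, and it assumes the branch code may appear in either upper or lower case (the example above is lower case), so it is normalized to upper case. The names ROLL_NO and split_roll_no are made up for illustration.

```python
import re

# Assumed layout: 4 digits (college), 2 letters (branch), 6 digits (roll code).
ROLL_NO = re.compile(r"([0-9]{4})([A-Za-z]{2})([0-9]{6})")

def split_roll_no(text):
    """Return (college, branch, roll_code), or None if text is not a roll number."""
    match = ROLL_NO.fullmatch(text.strip())
    if match is None:
        return None
    college, branch, roll_code = match.groups()
    # Assumption: branch codes are case-insensitive, so store them in upper case.
    return college, branch.upper(), roll_code

print(split_roll_no("0821cs091021"))  # ('0821', 'CS', '091021')
```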
The PDF file is sorted by roll number, i.e. college-wise, then branch-wise, in ascending order. It also contains a lot of other information, such as the student's name, father's name, university name and logo, and much more; all of these are irrelevant to me.
The only relevant data is the list of roll numbers.
All I want is to generate separate text files such that each file contains the roll numbers of a single college in ascending order.
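Roughly, the result I am after could be sketched like this. It is only a sketch under assumptions: it assumes the PDF has already been converted to plain text by some other tool (that step is not shown), that roll numbers appear in the text as unbroken 12-character tokens, and that duplicates should be dropped. The function names and the one-file-per-college naming (for example 0821.txt) are my own invention.

```python
import re
from collections import defaultdict
from pathlib import Path

# 4 digits + 2 letters + 6 digits, not embedded in a longer alphanumeric token.
ROLL_NO = re.compile(
    r"(?<![0-9A-Za-z])([0-9]{4})([A-Za-z]{2})([0-9]{6})(?![0-9A-Za-z])"
)

def group_by_college(text):
    """Map each college code to its sorted list of roll numbers found in text."""
    groups = defaultdict(set)  # a set drops repeated roll numbers (assumption)
    for match in ROLL_NO.finditer(text):
        college, branch, roll_code = match.groups()
        groups[college].add(college + branch.upper() + roll_code)
    # All roll numbers have the same width, so plain string sorting orders
    # them by branch and then by roll code within a college.
    return {college: sorted(rolls) for college, rolls in groups.items()}

def write_college_files(groups, out_dir):
    """Write one text file per college, one roll number per line."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    for college, rolls in groups.items():
        (out / f"{college}.txt").write_text("\n".join(rolls) + "\n")
```

With the extracted text in a string called text, calling write_college_files(group_by_college(text), "output") would produce files such as output/0821.txt and output/0827.txt.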
If you need a sample PDF file, send me your email address by private message and I will mail it to you.