f



getting an image off a pdf file

i've got a pdf file that's got one or more
scanned images.

i'd like to get programmatic access to the
image and do nasty stuff to it.

a page explaining the format of pdf file -
specially embedded images should do it.

thanks.


0
Tashkant
1/5/2004 12:29:52 PM
comp.text.pdf 5600 articles. 0 followers. ramon (1518) is leader. Post Follow

7 Replies
723 Views

Similar Articles

[PageSpeed] 43

"Tashkant" <goahead@spamme.com> wrote:

>i've got a pdf file that's got one or more
>scanned images.
>
>i'd like to get programmatic access to the
>image and do nasty stuff to it.
>
>a page explaining the format of pdf file -
>specially embedded images should do it.

"A page" sounds optimistic.  The PDF Reference is over 1000 pages. It
is an Addison Wesley book or find it on http://partners.adobe.com/
----------------------------------------
Aandi Inston  quite@dial.pipex.com http://www.quite.com
Please support usenet! Post replies and follow-ups, don't e-mail them.

0
quite
1/5/2004 12:52:22 PM
What is your budget? There are commercial tools that extract images and
start external applications to proces them, depending on size, color model,
etc. (e.g. Markzware Markscout)

I wouldn't recommend writing your own extractor :-(
Perhaps, there are Open Source PDF libraries available.

Waldo


"Tashkant" <goahead@spamme.com> wrote in message
news:4NcKb.88521$4F2.9295932@twister.nyc.rr.com...
> i've got a pdf file that's got one or more
> scanned images.
>
> i'd like to get programmatic access to the
> image and do nasty stuff to it.
>
> a page explaining the format of pdf file -
> specially embedded images should do it.
>
> thanks.
>
>


0
Waldo
1/5/2004 1:26:55 PM
well i have a software that needs to read a pdf file...and
as i wrote before will contain one or more scanned images...
nothing else.

i'd like to get access to the bits and bytes and be able to
display it in my app.

i will look at the adobe resource.

thanks.

"Waldo" <graaf@vno-ncw.nl> wrote in message
news:1073309191.550797@news-ext.oce.nl...
> What is your budget? There are commercial tools that extract images and
> start external applications to proces them, depending on size, color
model,
> etc. (e.g. Markzware Markscout)
>
> I wouldn't recommend writing your own extractor :-(
> Perhaps, there are Open Source PDF libraries available.
>
> Waldo
>
>
> "Tashkant" <goahead@spamme.com> wrote in message
> news:4NcKb.88521$4F2.9295932@twister.nyc.rr.com...
> > i've got a pdf file that's got one or more
> > scanned images.
> >
> > i'd like to get programmatic access to the
> > image and do nasty stuff to it.
> >
> > a page explaining the format of pdf file -
> > specially embedded images should do it.
> >
> > thanks.
> >
> >
>
>


0
Tashkant
1/5/2004 2:10:04 PM
"Tashkant" <goahead@spamme.com> wrote:

>well i have a software that needs to read a pdf file...and
>as i wrote before will contain one or more scanned images...
>nothing else.

It isn't quite as simple as it sounds. Each document contains pages,
each page contains content (vector instructions and resource
references) and resources. The resources might be scanned images only,
but there are many ways a PDF can be put together with "only" scanned
images.  So the problem is far more complex than, for instance,
finding the images in a multi-page TIFF file.
----------------------------------------
Aandi Inston  quite@dial.pipex.com http://www.quite.com
Please support usenet! Post replies and follow-ups, don't e-mail them.

0
quite
1/5/2004 2:26:35 PM
Tashkant wrote:
> i've got a pdf file that's got one or more
> scanned images.
> 
> i'd like to get programmatic access to the
> image and do nasty stuff to it.
> 
> a page explaining the format of pdf file -
> specially embedded images should do it.

I don't really understand what tou want, but try pdfimage, which is a 
part of xpdf. Just running
$ pdfimage yourpdf.pdf
will extract all images from the .pdf file and put them in yout present 
working directory.

At least this is a way to do it in linux.

greetings,
BBBart

0
BBBart
1/5/2004 3:24:36 PM
where can i get this pdfimage.

thanks.

"BBBart" <bbbart@kotnet.org> wrote in message
news:1073316244.498690@seven.kulnet.kuleuven.ac.be...
> Tashkant wrote:
> > i've got a pdf file that's got one or more
> > scanned images.
> >
> > i'd like to get programmatic access to the
> > image and do nasty stuff to it.
> >
> > a page explaining the format of pdf file -
> > specially embedded images should do it.
>
> I don't really understand what tou want, but try pdfimage, which is a
> part of xpdf. Just running
> $ pdfimage yourpdf.pdf
> will extract all images from the .pdf file and put them in yout present
> working directory.
>
> At least this is a way to do it in linux.
>
> greetings,
> BBBart
>


0
Tashkant
1/5/2004 4:09:07 PM
Tashkant wrote:
> where can i get this pdfimage.

Like I said, it is a part of the xpdf package: http://www.foolabs.com/xpdf/

greetings,
BBBart

0
BBBart
1/5/2004 5:56:30 PM
Reply: