Error in Acrobat 6 Web capture ?

  • Follow


Hi
In a html-file I have a number of hyper-links to different pdf-files.
I am using Web capture to assembly these pdf-files via the html-file
to one pdf-file. The reason for this is to have a sort of partlist of
documents f.ex. for production documentation, and bookmarks are
included. Furthermore it is done automatically.

The pdf-files look perfect before Web capture.
The problem occurs, when saving these web-captured files as one file.
Random text dissapears when saving, somtimes a lot of text and
sometimes less.

Any of You got an idea ?

The problem is not present in Acrobat 4.0.

Soren
0
Reply sas 1/26/2005 10:25:41 AM

Soren Sorensen wrote:

> Hi
> In a html-file I have a number of hyper-links to different pdf-files.
> I am using Web capture to assembly these pdf-files via the html-file
> to one pdf-file. The reason for this is to have a sort of partlist of
> documents f.ex. for production documentation, and bookmarks are
> included. Furthermore it is done automatically.
> 
> The pdf-files look perfect before Web capture.
> The problem occurs, when saving these web-captured files as one file.
> Random text dissapears when saving, somtimes a lot of text and
> sometimes less.
> 
> Any of You got an idea ?
> 
> The problem is not present in Acrobat 4.0.
> 
> Soren

Maybe some of the input PDFs have subset fonts that use the same font 
name but have subset the font differently.  This could confuse the 
viewer.  Acrobat won't allow you to merge such PDFs, as described here:
http://www.adobe.com/support/techdocs/327594.html

Maybe Acrobat doesn't check for this during Web capture?

After subsetting a font, Acrobat prepends a random string to the font 
name to reduce the chances of collision.  This prefix is not shown by 
Acrobat or Reader.  You can use the command-line tool pdffonts to see 
it.  Pdffonts is part of the xpdf project:
http://foolabs.com/xpdf/download.html

First thing I would do is run pdffonts on your output PDF.  The 
command-line tools sort and uniq would come in handy to discover any 
duplicates.  Windows user can get these by installing MSYS or this 
GnuWin32 package:
http://gnuwin32.sourceforge.net/packages/textutils.htm

Learn about sort and uniq by googling "man sort" or "man uniq".

HTH-

Sid Steward
http://www.AccessPDF.com/pdftk/
0
Reply Sid 1/26/2005 6:40:11 PM


Hi Sid

Thanks very much for Your answer.
You were absolutely right, it was the font-substitution, that went wrong.

The only remaining problem I have now, is that the html code contains a font 
Arial, which always is substituted, when web captured.

Do you also have an answer for that ?

In general Arial are used in the other pdf-files and if I use an other font 
but Arial in the html-document, the capture succeeds.

Kind regards
Soren


"Sid Steward" <sid@AccessPDF.com> skrev i en meddelelse 
news:fuRJd.6146$8Z1.4975@newssvr14.news.prodigy.com...
> Soren Sorensen wrote:
>
>> Hi
>> In a html-file I have a number of hyper-links to different pdf-files.
>> I am using Web capture to assembly these pdf-files via the html-file
>> to one pdf-file. The reason for this is to have a sort of partlist of
>> documents f.ex. for production documentation, and bookmarks are
>> included. Furthermore it is done automatically.
>>
>> The pdf-files look perfect before Web capture.
>> The problem occurs, when saving these web-captured files as one file.
>> Random text dissapears when saving, somtimes a lot of text and
>> sometimes less.
>>
>> Any of You got an idea ?
>>
>> The problem is not present in Acrobat 4.0.
>>
>> Soren
>
> Maybe some of the input PDFs have subset fonts that use the same font name 
> but have subset the font differently.  This could confuse the viewer. 
> Acrobat won't allow you to merge such PDFs, as described here:
> http://www.adobe.com/support/techdocs/327594.html
>
> Maybe Acrobat doesn't check for this during Web capture?
>
> After subsetting a font, Acrobat prepends a random string to the font name 
> to reduce the chances of collision.  This prefix is not shown by Acrobat 
> or Reader.  You can use the command-line tool pdffonts to see it. 
> Pdffonts is part of the xpdf project:
> http://foolabs.com/xpdf/download.html
>
> First thing I would do is run pdffonts on your output PDF.  The 
> command-line tools sort and uniq would come in handy to discover any 
> duplicates.  Windows user can get these by installing MSYS or this 
> GnuWin32 package:
> http://gnuwin32.sourceforge.net/packages/textutils.htm
>
> Learn about sort and uniq by googling "man sort" or "man uniq".
>
> HTH-
>
> Sid Steward
> http://www.AccessPDF.com/pdftk/ 


0
Reply sas 1/27/2005 9:15:07 AM

Hi Soren-

Glad I could help.

When I capture a web site using 6 Pro (File > Create PDF > From Web 
Page), the "Create PDF from Web Page" dialog opens.  Click Settings..., 
select HTML under "File Type Settings," and click Settings....  Click 
the Fonts and Encoding tab and you should find an "Embed Platform Fonts 
when Possible" checkbox at the bottom.  Mine is unchecked.  Maybe that 
will help?

Another idea would be to remove the PDF pages that Acrobat converted 
from HTML, after your document is assembled.  That would also remove the 
trouble font.

Kind Regards-

Sid

sas wrote:

> Hi Sid
> 
> Thanks very much for Your answer.
> You were absolutely right, it was the font-substitution, that went wrong.
> 
> The only remaining problem I have now, is that the html code contains a font 
> Arial, which always is substituted, when web captured.
> 
> Do you also have an answer for that ?
> 
> In general Arial are used in the other pdf-files and if I use an other font 
> but Arial in the html-document, the capture succeeds.
> 
> Kind regards
> Soren
> 
> 
> "Sid Steward" <sid@AccessPDF.com> skrev i en meddelelse 
> news:fuRJd.6146$8Z1.4975@newssvr14.news.prodigy.com...
> 
>>Soren Sorensen wrote:
>>
>>
>>>Hi
>>>In a html-file I have a number of hyper-links to different pdf-files.
>>>I am using Web capture to assembly these pdf-files via the html-file
>>>to one pdf-file. The reason for this is to have a sort of partlist of
>>>documents f.ex. for production documentation, and bookmarks are
>>>included. Furthermore it is done automatically.
>>>
>>>The pdf-files look perfect before Web capture.
>>>The problem occurs, when saving these web-captured files as one file.
>>>Random text dissapears when saving, somtimes a lot of text and
>>>sometimes less.
>>>
>>>Any of You got an idea ?
>>>
>>>The problem is not present in Acrobat 4.0.
>>>
>>>Soren
>>
>>Maybe some of the input PDFs have subset fonts that use the same font name 
>>but have subset the font differently.  This could confuse the viewer. 
>>Acrobat won't allow you to merge such PDFs, as described here:
>>http://www.adobe.com/support/techdocs/327594.html
>>
>>Maybe Acrobat doesn't check for this during Web capture?
>>
>>After subsetting a font, Acrobat prepends a random string to the font name 
>>to reduce the chances of collision.  This prefix is not shown by Acrobat 
>>or Reader.  You can use the command-line tool pdffonts to see it. 
>>Pdffonts is part of the xpdf project:
>>http://foolabs.com/xpdf/download.html
>>
>>First thing I would do is run pdffonts on your output PDF.  The 
>>command-line tools sort and uniq would come in handy to discover any 
>>duplicates.  Windows user can get these by installing MSYS or this 
>>GnuWin32 package:
>>http://gnuwin32.sourceforge.net/packages/textutils.htm
>>
>>Learn about sort and uniq by googling "man sort" or "man uniq".
>>
>>HTH-
>>
>>Sid Steward
>>http://www.AccessPDF.com/pdftk/ 
> 
> 
> 
0
Reply Sid 1/27/2005 5:31:20 PM

Hi Sid

You are right again.
There are now no fonts substituded and no text is removed when saving.

Thanks again for Your help.

Kind regards
Soren

"Sid Steward" <sid@AccessPDF.com> skrev i en meddelelse 
news:Iz9Kd.15663$wZ2.3401@newssvr13.news.prodigy.com...
> Hi Soren-
>
> Glad I could help.
>
> When I capture a web site using 6 Pro (File > Create PDF > From Web Page), 
> the "Create PDF from Web Page" dialog opens.  Click Settings..., select 
> HTML under "File Type Settings," and click Settings....  Click the Fonts 
> and Encoding tab and you should find an "Embed Platform Fonts when 
> Possible" checkbox at the bottom.  Mine is unchecked.  Maybe that will 
> help?
>
> Another idea would be to remove the PDF pages that Acrobat converted from 
> HTML, after your document is assembled.  That would also remove the 
> trouble font.
>
> Kind Regards-
>
> Sid
>
> sas wrote:
>
>> Hi Sid
>>
>> Thanks very much for Your answer.
>> You were absolutely right, it was the font-substitution, that went wrong.
>>
>> The only remaining problem I have now, is that the html code contains a 
>> font Arial, which always is substituted, when web captured.
>>
>> Do you also have an answer for that ?
>>
>> In general Arial are used in the other pdf-files and if I use an other 
>> font but Arial in the html-document, the capture succeeds.
>>
>> Kind regards
>> Soren
>>
>>
>> "Sid Steward" <sid@AccessPDF.com> skrev i en meddelelse 
>> news:fuRJd.6146$8Z1.4975@newssvr14.news.prodigy.com...
>>
>>>Soren Sorensen wrote:
>>>
>>>
>>>>Hi
>>>>In a html-file I have a number of hyper-links to different pdf-files.
>>>>I am using Web capture to assembly these pdf-files via the html-file
>>>>to one pdf-file. The reason for this is to have a sort of partlist of
>>>>documents f.ex. for production documentation, and bookmarks are
>>>>included. Furthermore it is done automatically.
>>>>
>>>>The pdf-files look perfect before Web capture.
>>>>The problem occurs, when saving these web-captured files as one file.
>>>>Random text dissapears when saving, somtimes a lot of text and
>>>>sometimes less.
>>>>
>>>>Any of You got an idea ?
>>>>
>>>>The problem is not present in Acrobat 4.0.
>>>>
>>>>Soren
>>>
>>>Maybe some of the input PDFs have subset fonts that use the same font 
>>>name but have subset the font differently.  This could confuse the 
>>>viewer. Acrobat won't allow you to merge such PDFs, as described here:
>>>http://www.adobe.com/support/techdocs/327594.html
>>>
>>>Maybe Acrobat doesn't check for this during Web capture?
>>>
>>>After subsetting a font, Acrobat prepends a random string to the font 
>>>name to reduce the chances of collision.  This prefix is not shown by 
>>>Acrobat or Reader.  You can use the command-line tool pdffonts to see it. 
>>>Pdffonts is part of the xpdf project:
>>>http://foolabs.com/xpdf/download.html
>>>
>>>First thing I would do is run pdffonts on your output PDF.  The 
>>>command-line tools sort and uniq would come in handy to discover any 
>>>duplicates.  Windows user can get these by installing MSYS or this 
>>>GnuWin32 package:
>>>http://gnuwin32.sourceforge.net/packages/textutils.htm
>>>
>>>Learn about sort and uniq by googling "man sort" or "man uniq".
>>>
>>>HTH-
>>>
>>>Sid Steward
>>>http://www.AccessPDF.com/pdftk/
>>
>> 

0
Reply sas 1/28/2005 8:06:22 AM

4 Replies
502 Views

(page loaded in 0.159 seconds)

Similiar Articles:













7/24/2012 9:17:12 PM


Reply: