Hi
In a html-file I have a number of hyper-links to different pdf-files.
I am using Web capture to assembly these pdf-files via the html-file
to one pdf-file. The reason for this is to have a sort of partlist of
documents f.ex. for production documentation, and bookmarks are
included. Furthermore it is done automatically.
The pdf-files look perfect before Web capture.
The problem occurs, when saving these web-captured files as one file.
Random text dissapears when saving, somtimes a lot of text and
sometimes less.
Any of You got an idea ?
The problem is not present in Acrobat 4.0.
Soren
|
|
0
|
|
|
|
Reply
|
sas
|
1/26/2005 10:25:41 AM |
|
Soren Sorensen wrote:
> Hi
> In a html-file I have a number of hyper-links to different pdf-files.
> I am using Web capture to assembly these pdf-files via the html-file
> to one pdf-file. The reason for this is to have a sort of partlist of
> documents f.ex. for production documentation, and bookmarks are
> included. Furthermore it is done automatically.
>
> The pdf-files look perfect before Web capture.
> The problem occurs, when saving these web-captured files as one file.
> Random text dissapears when saving, somtimes a lot of text and
> sometimes less.
>
> Any of You got an idea ?
>
> The problem is not present in Acrobat 4.0.
>
> Soren
Maybe some of the input PDFs have subset fonts that use the same font
name but have subset the font differently. This could confuse the
viewer. Acrobat won't allow you to merge such PDFs, as described here:
http://www.adobe.com/support/techdocs/327594.html
Maybe Acrobat doesn't check for this during Web capture?
After subsetting a font, Acrobat prepends a random string to the font
name to reduce the chances of collision. This prefix is not shown by
Acrobat or Reader. You can use the command-line tool pdffonts to see
it. Pdffonts is part of the xpdf project:
http://foolabs.com/xpdf/download.html
First thing I would do is run pdffonts on your output PDF. The
command-line tools sort and uniq would come in handy to discover any
duplicates. Windows user can get these by installing MSYS or this
GnuWin32 package:
http://gnuwin32.sourceforge.net/packages/textutils.htm
Learn about sort and uniq by googling "man sort" or "man uniq".
HTH-
Sid Steward
http://www.AccessPDF.com/pdftk/
|
|
0
|
|
|
|
Reply
|
Sid
|
1/26/2005 6:40:11 PM
|
|
Hi Sid
Thanks very much for Your answer.
You were absolutely right, it was the font-substitution, that went wrong.
The only remaining problem I have now, is that the html code contains a font
Arial, which always is substituted, when web captured.
Do you also have an answer for that ?
In general Arial are used in the other pdf-files and if I use an other font
but Arial in the html-document, the capture succeeds.
Kind regards
Soren
"Sid Steward" <sid@AccessPDF.com> skrev i en meddelelse
news:fuRJd.6146$8Z1.4975@newssvr14.news.prodigy.com...
> Soren Sorensen wrote:
>
>> Hi
>> In a html-file I have a number of hyper-links to different pdf-files.
>> I am using Web capture to assembly these pdf-files via the html-file
>> to one pdf-file. The reason for this is to have a sort of partlist of
>> documents f.ex. for production documentation, and bookmarks are
>> included. Furthermore it is done automatically.
>>
>> The pdf-files look perfect before Web capture.
>> The problem occurs, when saving these web-captured files as one file.
>> Random text dissapears when saving, somtimes a lot of text and
>> sometimes less.
>>
>> Any of You got an idea ?
>>
>> The problem is not present in Acrobat 4.0.
>>
>> Soren
>
> Maybe some of the input PDFs have subset fonts that use the same font name
> but have subset the font differently. This could confuse the viewer.
> Acrobat won't allow you to merge such PDFs, as described here:
> http://www.adobe.com/support/techdocs/327594.html
>
> Maybe Acrobat doesn't check for this during Web capture?
>
> After subsetting a font, Acrobat prepends a random string to the font name
> to reduce the chances of collision. This prefix is not shown by Acrobat
> or Reader. You can use the command-line tool pdffonts to see it.
> Pdffonts is part of the xpdf project:
> http://foolabs.com/xpdf/download.html
>
> First thing I would do is run pdffonts on your output PDF. The
> command-line tools sort and uniq would come in handy to discover any
> duplicates. Windows user can get these by installing MSYS or this
> GnuWin32 package:
> http://gnuwin32.sourceforge.net/packages/textutils.htm
>
> Learn about sort and uniq by googling "man sort" or "man uniq".
>
> HTH-
>
> Sid Steward
> http://www.AccessPDF.com/pdftk/
|
|
0
|
|
|
|
Reply
|
sas
|
1/27/2005 9:15:07 AM
|
|
Hi Soren-
Glad I could help.
When I capture a web site using 6 Pro (File > Create PDF > From Web
Page), the "Create PDF from Web Page" dialog opens. Click Settings...,
select HTML under "File Type Settings," and click Settings.... Click
the Fonts and Encoding tab and you should find an "Embed Platform Fonts
when Possible" checkbox at the bottom. Mine is unchecked. Maybe that
will help?
Another idea would be to remove the PDF pages that Acrobat converted
from HTML, after your document is assembled. That would also remove the
trouble font.
Kind Regards-
Sid
sas wrote:
> Hi Sid
>
> Thanks very much for Your answer.
> You were absolutely right, it was the font-substitution, that went wrong.
>
> The only remaining problem I have now, is that the html code contains a font
> Arial, which always is substituted, when web captured.
>
> Do you also have an answer for that ?
>
> In general Arial are used in the other pdf-files and if I use an other font
> but Arial in the html-document, the capture succeeds.
>
> Kind regards
> Soren
>
>
> "Sid Steward" <sid@AccessPDF.com> skrev i en meddelelse
> news:fuRJd.6146$8Z1.4975@newssvr14.news.prodigy.com...
>
>>Soren Sorensen wrote:
>>
>>
>>>Hi
>>>In a html-file I have a number of hyper-links to different pdf-files.
>>>I am using Web capture to assembly these pdf-files via the html-file
>>>to one pdf-file. The reason for this is to have a sort of partlist of
>>>documents f.ex. for production documentation, and bookmarks are
>>>included. Furthermore it is done automatically.
>>>
>>>The pdf-files look perfect before Web capture.
>>>The problem occurs, when saving these web-captured files as one file.
>>>Random text dissapears when saving, somtimes a lot of text and
>>>sometimes less.
>>>
>>>Any of You got an idea ?
>>>
>>>The problem is not present in Acrobat 4.0.
>>>
>>>Soren
>>
>>Maybe some of the input PDFs have subset fonts that use the same font name
>>but have subset the font differently. This could confuse the viewer.
>>Acrobat won't allow you to merge such PDFs, as described here:
>>http://www.adobe.com/support/techdocs/327594.html
>>
>>Maybe Acrobat doesn't check for this during Web capture?
>>
>>After subsetting a font, Acrobat prepends a random string to the font name
>>to reduce the chances of collision. This prefix is not shown by Acrobat
>>or Reader. You can use the command-line tool pdffonts to see it.
>>Pdffonts is part of the xpdf project:
>>http://foolabs.com/xpdf/download.html
>>
>>First thing I would do is run pdffonts on your output PDF. The
>>command-line tools sort and uniq would come in handy to discover any
>>duplicates. Windows user can get these by installing MSYS or this
>>GnuWin32 package:
>>http://gnuwin32.sourceforge.net/packages/textutils.htm
>>
>>Learn about sort and uniq by googling "man sort" or "man uniq".
>>
>>HTH-
>>
>>Sid Steward
>>http://www.AccessPDF.com/pdftk/
>
>
>
|
|
0
|
|
|
|
Reply
|
Sid
|
1/27/2005 5:31:20 PM
|
|
Hi Sid
You are right again.
There are now no fonts substituded and no text is removed when saving.
Thanks again for Your help.
Kind regards
Soren
"Sid Steward" <sid@AccessPDF.com> skrev i en meddelelse
news:Iz9Kd.15663$wZ2.3401@newssvr13.news.prodigy.com...
> Hi Soren-
>
> Glad I could help.
>
> When I capture a web site using 6 Pro (File > Create PDF > From Web Page),
> the "Create PDF from Web Page" dialog opens. Click Settings..., select
> HTML under "File Type Settings," and click Settings.... Click the Fonts
> and Encoding tab and you should find an "Embed Platform Fonts when
> Possible" checkbox at the bottom. Mine is unchecked. Maybe that will
> help?
>
> Another idea would be to remove the PDF pages that Acrobat converted from
> HTML, after your document is assembled. That would also remove the
> trouble font.
>
> Kind Regards-
>
> Sid
>
> sas wrote:
>
>> Hi Sid
>>
>> Thanks very much for Your answer.
>> You were absolutely right, it was the font-substitution, that went wrong.
>>
>> The only remaining problem I have now, is that the html code contains a
>> font Arial, which always is substituted, when web captured.
>>
>> Do you also have an answer for that ?
>>
>> In general Arial are used in the other pdf-files and if I use an other
>> font but Arial in the html-document, the capture succeeds.
>>
>> Kind regards
>> Soren
>>
>>
>> "Sid Steward" <sid@AccessPDF.com> skrev i en meddelelse
>> news:fuRJd.6146$8Z1.4975@newssvr14.news.prodigy.com...
>>
>>>Soren Sorensen wrote:
>>>
>>>
>>>>Hi
>>>>In a html-file I have a number of hyper-links to different pdf-files.
>>>>I am using Web capture to assembly these pdf-files via the html-file
>>>>to one pdf-file. The reason for this is to have a sort of partlist of
>>>>documents f.ex. for production documentation, and bookmarks are
>>>>included. Furthermore it is done automatically.
>>>>
>>>>The pdf-files look perfect before Web capture.
>>>>The problem occurs, when saving these web-captured files as one file.
>>>>Random text dissapears when saving, somtimes a lot of text and
>>>>sometimes less.
>>>>
>>>>Any of You got an idea ?
>>>>
>>>>The problem is not present in Acrobat 4.0.
>>>>
>>>>Soren
>>>
>>>Maybe some of the input PDFs have subset fonts that use the same font
>>>name but have subset the font differently. This could confuse the
>>>viewer. Acrobat won't allow you to merge such PDFs, as described here:
>>>http://www.adobe.com/support/techdocs/327594.html
>>>
>>>Maybe Acrobat doesn't check for this during Web capture?
>>>
>>>After subsetting a font, Acrobat prepends a random string to the font
>>>name to reduce the chances of collision. This prefix is not shown by
>>>Acrobat or Reader. You can use the command-line tool pdffonts to see it.
>>>Pdffonts is part of the xpdf project:
>>>http://foolabs.com/xpdf/download.html
>>>
>>>First thing I would do is run pdffonts on your output PDF. The
>>>command-line tools sort and uniq would come in handy to discover any
>>>duplicates. Windows user can get these by installing MSYS or this
>>>GnuWin32 package:
>>>http://gnuwin32.sourceforge.net/packages/textutils.htm
>>>
>>>Learn about sort and uniq by googling "man sort" or "man uniq".
>>>
>>>HTH-
>>>
>>>Sid Steward
>>>http://www.AccessPDF.com/pdftk/
>>
>>
|
|
0
|
|
|
|
Reply
|
sas
|
1/28/2005 8:06:22 AM
|
|
|
4 Replies
502 Views
(page loaded in 0.159 seconds)
|