Render an Image, Thumbnail, or PDF for Ingested Pages

The Web Connector can render an image, a thumbnail image, or a PDF of each Web page that it ingests. When you configure the connector to create one or more of these files, each file is indexed as the document content of a separate document, alongside the indexed Web page.

To render an image, thumbnail, or PDF for each ingested page

  1. Stop the connector and open the configuration file.
  2. Modify your fetch task by adding the following parameters:

    CreateImageRendition: To render an image for each ingested page, set this parameter to true.
    CreateThumbnailRendition: To render a thumbnail image for each ingested page, set this parameter to true.
    CreatePDFRendition: To render a PDF copy of each ingested page, set this parameter to true.
    RenditionImageFormat: The image format to use for rendered images and thumbnail images (for example, png).
    RenditionImageQuality: The image quality for JPEG and PNG images and thumbnail images. Specify an integer from 0 to 100. Lower values apply higher compression, which usually results in a smaller file; higher values give higher quality.
    ThumbnailRenditionWidth: The maximum width for thumbnail images, in pixels.
    ThumbnailRenditionHeight: The maximum height for thumbnail images, in pixels.
  3. Save and close the configuration file.

Example

The following example renders a thumbnail of each page as a PNG image with a maximum width of 350 pixels:

[MyTask]
Url=http://www.autonomy.com
...
CreateThumbnailRendition=true
RenditionImageFormat=png
ThumbnailRenditionWidth=350
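
As a further illustration, the following fragment sketches a fetch task that enables all three renditions at once, using only the parameters described in the procedure. The task name and URL are reused from the example above. The values 80, 200, and 200 are arbitrary choices for illustration, not recommended settings, and the // comment lines assume that your configuration file accepts that comment style.

```ini
[MyTask]
Url=http://www.autonomy.com
...
// Render a full image, a thumbnail, and a PDF copy of each ingested page.
CreateImageRendition=true
CreateThumbnailRendition=true
CreatePDFRendition=true
// Image settings apply to both the image and the thumbnail.
// The quality value 80 is an illustrative assumption (valid range is 0 to 100).
RenditionImageFormat=png
RenditionImageQuality=80
// Illustrative thumbnail limits, in pixels.
ThumbnailRenditionWidth=200
ThumbnailRenditionHeight=200
```

With a configuration like this, each ingested page would be expected to produce up to three additional indexed documents, one for each rendition.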
