Skip to main content

CloudConvert

The CloudConvert plugin extracts text from files uploaded into Brightspot, and editors can search for those files using free text. For example, if you upload a Word file, the plugin extracts and indexes the text, making the file searchable.

The CloudConvert plugin works in conjunction with Digital Asset Management, which provides document, spreadsheet, and presentation content types. These content types support file uploads.

The plugin uses the CloudConvert API to convert document, spreadsheet, and presentation formats to text. The following table lists the formats you can convert.

File formats available with CloudConvert:

File TypeSupported Formats
Documenthtml, doc, docx, odt, pdf, rtf
Presentationppt, pptx
Spreadsheetxls, xlsx, csv

Text extraction and thumbnail generation run as a background task.

Enabling data extraction

You can enable data extraction via CloudConvert. Once configured, uploading a file to a DAM content type triggers indexing of that file's content.

To configure CloudConvert:

  1. Obtain your CloudConvert account's API key and webhook signing secret.
  2. Click > Admin > Sites & Settings > Sites > Global.
  3. Under the CMS tab, expand the DAM Document Data Extraction Settings cluster.
  4. From the Extractor Services list, select Cloud Convert Document Data Extractor.
  5. In the API Key and Webhook Signing Secret fields, enter the values you obtained in step 1.
  6. Click Save.

Viewing tasks

Brightspot manages the extraction and indexing of a file's text as a background task.

To view the status of a CloudConvert task:

  1. Navigate to Dari Standard Tools at http://<brightspot-host>/_debug/.
  2. Click Background Tasks.
  3. Scroll down to DocumentExtractionTask Executor.

Document extraction task

Uploading a file for text extraction

You can upload a file, such as a presentation or spreadsheet, and other editors can search for the file using the file's text. For example, if you upload a single file containing all of the adventures of Sherlock Holmes, other editors can find the file by searching on watson, baker street, or mrs. hudson.

To upload a file for text extraction:

  1. In the header, click .
  2. Create a new Document, Presentation or Spreadsheet.
  3. In the content edit form, set the applicable fields.
  4. From the File field, select New Upload, and click Choose.
  5. Navigate to and select the file.
  6. Click Publish, or click save to save a draft.

Brightspot extracts the file's text. When the extraction is complete, Brightspot displays the text and a thumbnail under the content edit form's METADATA tab. You can modify the thumbnail by clicking Edit. For information about editing images in Brightspot, see Image editing.

When editors search for any of the terms in the extracted text, Brightspot lists the document in the search panel.