The extracted content rendition file is larger than the original Excel file


Description

The extracted content rendition file might be much larger than the original Excel type file, causing potential problems with the graph service. As a result, search-related features might not perform as intended. 
For instance, recent updates or newly imported data might not be immediately accessible within the Index. This can lead to delays in business activity or the inability to access the data via search pages, facets, or full-text search.

Solution

To mitigate the issue, it is suggested to create another Media processing flow dedicated for the xls and xlsx file types with the disabled Extract content flag:

  1. On the menu bar, click Manage .
  2. On the Manage page, click Media processing.
  3. On the Media processing page, click the Content tab.
  4. Open the Settings of the Documents flow.
  5. Remove the xls and xlsx file types from the File types property.
  6. Click Add flow.
  7. In the Flow settings dialog box, fill out the following properties. In the File types property, add xls and xlsx types.
  8. Add tasks to the newly created flow similar to the tasks in the existing Documents flow.
  9. In the Convert document task, disable the Extract content flag.
  10. Click Save.
  11. Publish the changes.

Note that, by default, automatic reprocessing of renditions is disabled. You need to refresh the renditions of the existing assets to apply your updated processing flow.