This bug was discovered in a DSpaceDirect production site.
Discovery allows no means to limit the amount of full text that a given site wishes to index for search/browse. The legacy Lucene search engine supported the "search.maxfieldlength" setting in dspace.cfg:
However, Discovery ignores the "search.maxfieldlength" setting and always indexes the full text of the files in the TEXT bundle:
This means that if a site attempts to store large text-based files in DSpace, the Solr index may grow rapidly. This could cause memory issues if many of these documents are loaded into memory at once (but
DS-2832 solved some of those issues already)
Nonetheless, Discovery should provide a way to limit the size of the full text that is indexed, for sites that need this capability or wish to better control the size of their Solr Index.