Uploaded image for project: 'DSpace'
  1. DSpace
  2. DS-412

Xpdf MediaFilter: generate UTF-8 text, and improve error reporting

    XMLWordPrintable

    Details

    • Type: Improvement
    • Status: Closed (View Workflow)
    • Priority: Trivial
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.6.0
    • Component/s: DSpace API
    • Labels:
      None
    • Attachments:
      1
    • Comments:
      2
    • Documentation Status:
      Not Required

      Description

      This patch makes 2 unrelated improvements:

      1. Add the "-enc UTF-8" option to the pdftotext command to produce Unicode text; the default is Latin-1. This has been tested successfully with Greek text.

      2. Improve error reporting by testing status value. E.g. pdftotext returns 3 when it fails because of a permission problem in the source PDF, which is useful information that leads straight to a diagnosis.

        Attachments

          Activity

            People

            Assignee:
            lcs Larry Stone
            Reporter:
            lcs Larry Stone
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

              Dates

              Created:
              Updated:
              Resolved: