Uploaded image for project: 'AdMax'
  1. AdMax
  2. ADMAX-3031

ADR Report fails to generate after encountering data extraction issues from large crawl file (edition.cnn.com)

    Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: Sustaining
    • Component/s: Cloud Spider
    • Labels:
      None
    • Environment:

      Production

      Description

      A (likely) performance-related issue. This crawl generates approx 3 million rows of issue data to write out to excel report format. Some issue types fail to extract from main crawl file, which causes a downstream issue that prevents a report from being created. Replicated the issue locally, not sure what can be done about the data extraction problem, but probably can make some fixes for the report generation to handle the problem more gracefully, and provide a incomplete report. Estimate 2-4 days effort.

        Attachments

          Activity

            People

            • Assignee:
              jshih Jeff Shih (Inactive)
              Reporter:
              pwynne Patrick Wynne
            • Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

              • Created:
                Updated: