  1. AdMax
  2. ADMAX-2824

Cloud Spider: Spider Fetches URLs in <img> tags

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Trivial
    • Resolution: Fixed
    • Affects Version/s: Cloud Spider 3.01
    • Fix Version/s: Cloud Spider 3.02
    • Component/s: Cloud Spider
    • Labels:
      None

      Description

      By default, Nutch fetches URLs from the following tags: a, area, form, frame, iframe, script, link, img. Some of these, such as img, need to be removed so that the spider no longer fetches the URLs they contain. This will be a change in the config file.
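
      The ticket does not say which config file or property was changed. As a sketch only, assuming the change is made through Nutch's parser.html.outlinks.ignore_tags property (a comma-separated list of tags from which outlinks are not extracted) in a nutch-site.xml override, it might look like this. The value shown covers only img, the one tag the description names; which other tags were removed is not stated here.

      ```xml
      <!-- Hypothetical nutch-site.xml override; the file, property choice,
           and value are assumptions, not taken from the ticket. -->
      <configuration>
        <property>
          <name>parser.html.outlinks.ignore_tags</name>
          <!-- Tags listed here are skipped when outlinks are extracted. -->
          <value>img</value>
        </property>
      </configuration>
      ```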

            People

            • Assignee:
              Antony Rajiv (Inactive)
              Reporter:
              Antony Rajiv (Inactive)