Uploaded image for project: 'AdMax'
  1. AdMax
  2. ADMAX-2438

Spider Report: Some URL's with <H1> tags in their page are shown in black in Spidered Urls_1 sheet

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: Spider
    • Labels:
      None
    • Environment:

      Test Env: client-pc.ri.thesearchagency.com

      Description

      Updated the spider-db by executing the file "11-17-2010-ADRTabRulesChange-TSASpider-2.0-TSASpider.sql" in svn under svn://10.128.128.101/laxsvn/trunk/TSA-Spider/Spider Database Generation Scripts/ .

      Updated the spider application with the below builds and restarted the Spider Queue Manager service

      ADR app Build: Hudson Build # 22 of Spider ADR application project.

      Crawl app build: Hudson Build # 24 of Spider Crawl Application project

      1. Spider a URL, whose web page has <H1> tag, say

      "http://pvwb-of1pvd0010.ri.thesearchagency.com/gurpreet/Spider/long.htm"

      2. Download the generated ADR

      Observed that the URL "http://pvwb-of1pvd0010.ri.thesearchagency.com/gurpreet/Spider/long.htm" appears in Spidered Urls_1 sheet tab in black color but all other URL's are shown in red

      Also observed the same issue with the below URL's (ADR attached)

      -----------------------------------

      http://pvwb-of1pvd0010.ri.thesearchagency.com/gurpreet/Spider/dup1.htm

      http://pvwb-of1pvd0010.ri.thesearchagency.com/gurpreet/Spider/dup2.htm

      http://pvwb-of1pvd0010.ri.thesearchagency.com/gurpreet/Spider/short.htm

      http://pvwb-of1pvd0010.ri.thesearchagency.com/gurpreet/Spider/unclosedno.htm

      http://pvwb-of1pvd0010.ri.thesearchagency.com/gurpreet/Spider/commented_js.htm

      ------------------------------------

        Attachments

          Activity

            People

            • Assignee:
              abhiram Abhiram Bhagwat
              Reporter:
              saravanan.t Saravanan (Inactive)
            • Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: