Details
-
Type: Bug
-
Status: Closed
-
Priority: Minor
-
Resolution: Fixed
-
Affects Version/s: None
-
Fix Version/s: None
-
Component/s: Spider
-
Labels:None
-
Environment:
Test Env: client-pc.ri.thesearchagency.com
Description
Updated the spider-db by executing the file "11-17-2010-ADRTabRulesChange-TSASpider-2.0-TSASpider.sql" in svn under svn://10.128.128.101/laxsvn/trunk/TSA-Spider/Spider Database Generation Scripts/ .
Updated the spider application with the below builds and restarted the Spider Queue Manager service
ADR app Build: Hudson Build # 22 of Spider ADR application project.
Crawl app build: Hudson Build # 24 of Spider Crawl Application project
1. Spider a URL, whose web page has <H1> tag, say
"http://pvwb-of1pvd0010.ri.thesearchagency.com/gurpreet/Spider/long.htm"
2. Download the generated ADR
Observed that the URL "http://pvwb-of1pvd0010.ri.thesearchagency.com/gurpreet/Spider/long.htm" appears in Spidered Urls_1 sheet tab in black color but all other URL's are shown in red
Also observed the same issue with the below URL's (ADR attached)
-----------------------------------
http://pvwb-of1pvd0010.ri.thesearchagency.com/gurpreet/Spider/dup1.htm
http://pvwb-of1pvd0010.ri.thesearchagency.com/gurpreet/Spider/dup2.htm
http://pvwb-of1pvd0010.ri.thesearchagency.com/gurpreet/Spider/short.htm
http://pvwb-of1pvd0010.ri.thesearchagency.com/gurpreet/Spider/unclosedno.htm
http://pvwb-of1pvd0010.ri.thesearchagency.com/gurpreet/Spider/commented_js.htm
------------------------------------