Details
-
Type: New Feature
-
Status: Open
-
Priority: Minor
-
Resolution: Unresolved
-
Affects Version/s: None
-
Fix Version/s: None
-
Component/s: Cloud Spider
-
Labels:None
Description
In most situations, excluding URLs containing odd special characters can help reduce false-positive page level flags (such as missing title tags, long meta data, etc.). However, we've now seen cases where by following these guidelines we exclude primary site content from the ADR.
To address this situation, I suggest we create a user-defined special character crawling inclusion option. This feature would use the same interface as our URL Appended Parameter box, and would allow the user to define special characters which should be considered OK to crawl and would be excluded from our current "failed" flagging:
Example (see attached screen)
Examples of ADRs "failing" URLs with special characters.
Pipe | (URLs excluded)
S-1807
R-2199
Bracket [] (URLs excluded)
S-1543
R-1892