Uploaded image for project: 'Data'
  1. Data
  2. DATA-858

Data: Move feed creating duplicate sources

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: unspecified
    • Fix Version/s: None
    • Component/s: Search Engine Data
    • Labels:
      None
    • Environment:

      Operating System: Linux
      Platform: PC

    • Bugzilla Id:
      2597

      Description

      Here is the email string for this. Last email from therouxj

      That makes sense. BasDataFeed would be the first place to check.

      The following scenario might be possible, which would explain what we're
      seeing.

      1) The Move.com Ask account was synchronized before the sources were
      tagged and creates Source A

      2) The sources were tagged in Ask.com

      3) The datafeed runs in the morning and picks picks up a conversion for
      one of the newly tagged sources and creates Source B

      4) The Move.com Ask account is synchronized, and the null altkey in
      Source A is overwritten with the same altkey as Source B.

      If this is the case, then SearchEngineUpdater needs to be written to
      handle this case going forward.

      I wrote a script to grep the log files and found the following (related
      to the Ask.com example)

      2008-11-18 11:04:40.876 (3) [main]: new source created for altkey
      [aja408442b_loan]

      On Mon, 2009-01-12 at 13:32 -0500, Bill Duffy wrote:
      > Next step? File a bug and assign to Murty?
      >
      >
      > On Mon, 2009-01-12 at 07:32 -0800, Jeff Theroux wrote:
      > > Hmm, looks like you might be right, because the unknown source
      > > actually
      > > has a higher auto increment id.
      > >
      > > On Fri, 2009-01-09 at 13:58 -0500, Bill Duffy wrote:
      > > > Jeff,
      > > >
      > > > I suspect it may be the data feed not finding the original source
      > > not
      > > > vice versa for what its worth.
      > > >
      > > > Bill
      > > >
      > > >
      > > > On Wed, 2009-01-07 at 17:34 -0500, Bill Duffy wrote:
      > > > > Ffej,
      > > > >
      > > > > As I mentioned earlier, it looks like we have 2 records for some
      > > Yahoo
      > > > > refcds - one from an unknown source and one from the search
      > > engine. This
      > > > > issue is also effecting Ask.
      > > > >
      > > > > acctdb02 - Move Finance
      > > > >
      > > > > mysql> select * from sources where altkey='aja408442b_loan'\G
      > > > > *************************** 1. row ***************************
      > > > > id: 121541092
      > > > > altKey: aja408442b_loan
      > > > > accountID: 22
      > > > > siteID: 0
      > > > > searchEngineAccountID: 0
      > > > > keywordID: 0
      > > > > distributionID: 35
      > > > > searchEngineIdentifier: NULL
      > > > > searchEngineGroupIdentifier:
      > > > > searchEngineMatchType: unknown
      > > > > searchEngineGroupID: 0
      > > > > isBiddable: false
      > > > > campaignID: 0
      > > > > description: Unknown Tag "aja408442b_loan"
      > > > > type: 0
      > > > > keyword: aja408442b_loan
      > > > > sourceURL:
      > > > > cost: NULL
      > > > > maxCost: 0
      > > > > waypointID: NULL
      > > > > landingURL:
      > > > > passParams:
      > > > > trackPerformance: true
      > > > > searchEngineStatus: unknown
      > > > > searchEngineStatusText:
      > > > > lastChecked: 0000-00-00 00:00:00
      > > > > *************************** 2. row ***************************
      > > > > id: 114905172
      > > > > altKey: aja408442b_loan
      > > > > accountID: 22
      > > > > siteID: 0
      > > > > searchEngineAccountID: 29439
      > > > > keywordID: 0
      > > > > distributionID: 35
      > > > > searchEngineIdentifier: 227953314
      > > > > searchEngineGroupIdentifier: 594088657
      > > > > searchEngineMatchType: broad
      > > > > searchEngineGroupID: 1113562
      > > > > isBiddable: true
      > > > > campaignID: 0
      > > > > description: Keyword: [loan] broad
      > > > > type: 1
      > > > > keyword: loan
      > > > > sourceURL:
      > > > > cost: 0.333
      > > > > maxCost: 0.37
      > > > > waypointID: NULL
      > > > > landingURL:
      > > > > http://www1.move.com/HomeFinance/Mortgages/Default.
      > > > > asp?source=a12686&refcd=AJa408442b_loan&tsacr=aj

      {ad_id}

      > > > > passParams:
      > > > > trackPerformance: true
      > > > > searchEngineStatus: ok
      > > > > searchEngineStatusText: On
      > > > > lastChecked: 0000-00-00 00:00:00
      > > > > 2 rows in set (0.07 sec)
      > > > >
      > > > > acctdb02 Realtor.com
      > > > >
      > > > > *************************** 1. row ***************************
      > > > > id: 111660032
      > > > > altKey:
      > > > > yhg135721722011a_florence_real_estate_listing
      > > > > accountID: 273
      > > > > siteID: 0
      > > > > searchEngineAccountID: 0
      > > > > keywordID: 0
      > > > > distributionID: 164
      > > > > searchEngineIdentifier: NULL
      > > > > searchEngineGroupIdentifier:
      > > > > searchEngineMatchType: unknown
      > > > > searchEngineGroupID: 0
      > > > > isBiddable: false
      > > > > campaignID: 0
      > > > > description: Unknown Tag
      > > > > "yhg135721722011a_florence_real_estate_listing"
      > > > > type: 0
      > > > > keyword:
      > > > > yhg135721722011a_florence_real_estate_listing
      > > > > sourceURL:
      > > > > cost: NULL
      > > > > maxCost: 0
      > > > > waypointID: NULL
      > > > > landingURL:
      > > > > passParams:
      > > > > trackPerformance: true
      > > > > searchEngineStatus: unknown
      > > > > searchEngineStatusText:
      > > > > lastChecked: 0000-00-00 00:00:00
      > > > > *************************** 2. row ***************************
      > > > > id: 27857830
      > > > > altKey:
      > > > > yhg135721722011a_florence_real_estate_listing
      > > > > accountID: 273
      > > > > siteID: 0
      > > > > searchEngineAccountID: 10286
      > > > > keywordID: 0
      > > > > distributionID: 164
      > > > > searchEngineIdentifier: 135721722011
      > > > > searchEngineGroupIdentifier: 7590477899
      > > > > searchEngineMatchType: advanced
      > > > > searchEngineGroupID: 314408
      > > > > isBiddable: true
      > > > > campaignID: 0
      > > > > description: Keyword: [florence real estate
      > > listing]
      > > > > advanced
      > > > > type: 1
      > > > > keyword: florence real estate listing
      > > > > sourceURL:
      > > > > cost: 0.57
      > > > > maxCost: 0.57
      > > > > waypointID: NULL
      > > > > landingURL:
      > > > >
      > >
      http://www.realtor.com/?source=a15697&refcd=YHg

      {ysmkwid} {ysmmtc:e:a:c}

      _florence_real_estate_listing&tsacr=YH

      {ysmadid}

      &s_kwcid=TC-2100-

      {OVKWID}

      -

      {ovmtc:S:S:C}

      -

      {OVADID}

      > > > > passParams:
      > > > > trackPerformance: true
      > > > > searchEngineStatus: ok
      > > > > searchEngineStatusText: On
      > > > > lastChecked: 0000-00-00 00:00:00
      > > > > 2 rows in set (0.28 sec)
      > > > >
      > > > >
      > > > > Any thoughts on why this may be happening? It seems to be having a
      > > > > negative effect on tracking. Bad enough that Ami has turned off
      > > Ask for
      > > > > Realtor for now - though happened before Christmas.
      > > > >
      > > > > Bill
      > > > >
      > > > >
      > >
      > >

        Attachments

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              bduffy Bill Duffy (Inactive)
            • Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: