{"id":43,"date":"2011-07-18T12:17:18","date_gmt":"2011-07-18T16:17:18","guid":{"rendered":"http:\/\/blogs.bu.edu\/wing\/?p=43"},"modified":"2011-07-18T12:17:18","modified_gmt":"2011-07-18T16:17:18","slug":"googles-locksmith-problem","status":"publish","type":"post","link":"https:\/\/blogs.bu.edu\/wing\/2011\/07\/18\/googles-locksmith-problem\/","title":{"rendered":"Google&#8217;s &#8220;locksmith&#8221; problem"},"content":{"rendered":"<p><a href=\"http:\/\/nyti.ms\/pwauFR\">Here<\/a> is an interesting NYT article about &#8220;Search Engine Optimization&#8221; (SEO) applied to Google. It seems that certain service categories, such as local locksmiths, are getting flooded by bogus websites that are fronts for phone banks. So an unsuspecting customer who searches for <a href=\"http:\/\/www.google.com\/#hl=en&amp;cp=12&amp;gs_id=16&amp;xhr=t&amp;q=boston+locksmith&amp;qe=Ym9zdG9uIGxvY2tz&amp;qesig=I4AOzfUnXW-zj76GGVFZEw&amp;pkc=AFgZ2tkMAeETppo3QyWjb2Bj_zYtvxBQtuh9VB-omVpYGqrqgCu-BwhOW-Ca4iPHujp5OkerwlbHn7bX7Zukhl8Y9g_lNTvJ6g&amp;pf=p&amp;sclient=psy&amp;site=&amp;source=hp&amp;pbx=1&amp;oq=boston+locks&amp;aq=0&amp;aqi=g2g-v3&amp;aql=&amp;gs_sm=&amp;gs_upl=&amp;bav=on.2,or.r_gc.r_pw.&amp;fp=a31f319eb03eef21&amp;biw=1111&amp;bih=1434\">&#8220;locksmith boston&#8221;<\/a> will get a large number of hits, essentially all of which lead to the same service in the end.<\/p>\n<p>There are a number of research questions here, for example:<\/p>\n<ol>\n<li>For how many categories of services is this a problem?<\/li>\n<li>For any given category, how can one sort the &#8220;real&#8221; from the &#8220;fake&#8221; sites?<\/li>\n<\/ol>\n<p>The nice thing about these questions is that you can do the research just by typing Google queries and looking at the results.
The main observation I would start from is that any attempt to overwhelm search results must rely heavily on automation, and must therefore incorporate simple patterns that can be detected.<\/p>\n<p>For example, &#8220;boston locksmith&#8221; yields top hits with the domain names bostonlocksmiths.net, bostonlocksmith.com, bostonlocksmith.org, boston-locksmiths.us, and quite a few more following that pattern. Similarly, a search for &#8220;dc locksmith&#8221; yields domains like dclocksmith.org.<\/p>\n<p>Another example is the HTML content of the web pages themselves. Take a look at <a href=\"http:\/\/www.minneapolis-locksmith.us\/\">http:\/\/www.minneapolis-locksmith.us\/<\/a>, <a href=\"http:\/\/www.bostonlocksmith.us\/\">http:\/\/www.bostonlocksmith.us\/<\/a>, and <a href=\"http:\/\/www.chicagolocksmith.us\/\">http:\/\/www.chicagolocksmith.us\/<\/a>. The similarities here should be easy to detect.<\/p>\n<p>Finally, Google can help you directly. Google image search has come a long way in allowing &#8220;query by example&#8221;.
\u00a0 Searching for the graphic on the left hand side of <a href=\"http:\/\/www.bostonlocksmith.us\/\">http:\/\/www.bostonlocksmith.us\/<\/a> finds the <a href=\"http:\/\/images.google.com\/search?tbs=sbi:AMhZZivUgqS44hcVYTAx6XOq0gHDEq-HVHO_1z54gCzlKIjC_1FGG1iYp8TzJiCvi3ItkORCaY1BZv1NwpaMVUvxWGp8_10zMAh_1kq7X5IUYG1ZBjokZVpuZ7RplKnRGThebJLQ_1i-Nz4TrRfuhJ89I-XuA3IsUt99X9QxzhmqDy-mBmyVRil_1ad4LpaYBHcu9MwiNjmRApFLTcbaBMwvxsogg_1vqX5l5h3nE6WWxMpXXEcVp6HHR_1OSvFRsKUQDIs8T_1T68y-m2KUoVA6Cebf-Fc5_1ZkWryyW02UEm7JCMrNN48Fci5MR3PA8syB9WJFEiCJ8oo2k5ZmL557F1KBiIWdV_12Wv5DbsgtMMIfDwylK3nG3T_1eIu2aB_1gFJANuwcnBXSGc2ewkSTnL5d4s8A46n0GVHDupdL5L2GXSiWxTbXLFN-Gz-q6r0NXjha-ML6dmfqiLSUIfJa5krd4L6TandNGX6MvJrb0jFviFJtG4sr03mcunZHP20gVcUzfMoWqj0yKxudX1cVaVD4zRlYqtXJRnSdDfmbgGVIncKOtTZcSGFeXAMkHbFNn-PbO7wR14F_1qXwfRv4CsLs6DfdTh8iQQYtJUDdkI9gd2SUnGLpaik_1sPOHFvwkIGX-dm9qgT53xggIO1rNq4Bs5gqehFHcBGbO9isljo-BknhYU9jC_18jau0ni5o7qgc15Z6LNfWArNHRqTqp4j_1Xm0L86l_1oX3BzXs9ZJjejmhTjvUiWtKGq-5gxufp6kaLDtA-iXZHQpbu8eeIgaXQouAPNJNTfDsjhKgD1enxrkS23lwszesWc69Yn0jZg_1lRTv3cVaRDTY3fbDbFOFWKUWP819cbql6vBKNf7z_1AHp0XQP9U4PdfJM6gAil6XyEDwx_1JEGipmzAd9DhfQ3PLZV06ud05O4atwXGUzsaJuT5cP6GrYCi-J7mixidVvxuXcUePmio7purzlZur4dx5tXPrf7Tg4EtVWsbWaw8cJBmqjWYrn3ZKulkIt6sv6uIejl5Qkva3E2oo-Nodk9WW5dwFemaUw-XtfiAfOOz26wX7rMzX4A17CQ5z6SVHQRCZmBo0E-As0o8N40_1eoM-FfCW3TYelo2voavz1LXq1Ld3qAIVuMKLQtL3SeM68ULuIaaNYoh--gb4E0DcnzUChnDhP531e7Xu8KlL4YJto-9QWkbvBFSNzFHFU1LGzLsqxACeecWjx3lrbxR-lv84l-9U-96FgCpSIl-VLisCWBkAovT9jDgLH_17SSEL0jmm12IbNsZaBZRUTYduOayhru5iYBJaHaf-4tFqg2-I72P0oi8Jmn5k1US6KYwwAAt6U&amp;btnG=Search&amp;hl=en&amp;bih=1435&amp;biw=1111\">same image used on locksmith sites in about a dozen cities.<\/a> (It also finds the original image which was appropriated for this graphic &#8212; coming from <a href=\"http:\/\/www.unigis.org\/uk\/staff\/d_lambrick.htm\">a professor in Manchester England!<\/a>)<\/p>\n<p>Could be a neat project to &#8220;reverse-engineer&#8221; these SEO 
strategies!<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Here is an interesting NYT article about &#8220;Search Engine Optimization&#8221; (SEO) applied to Google. It seems that certain service categories like local locksmiths are getting flooded by bogus websites that are fronts for phone banks. \u00a0 \u00a0So an unsuspecting customer who searches for &#8220;locksmith boston&#8221; will get a large number of hits that essentially all [&hellip;]<\/p>\n","protected":false},"author":2086,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[1],"tags":[],"_links":{"self":[{"href":"https:\/\/blogs.bu.edu\/wing\/wp-json\/wp\/v2\/posts\/43"}],"collection":[{"href":"https:\/\/blogs.bu.edu\/wing\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blogs.bu.edu\/wing\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blogs.bu.edu\/wing\/wp-json\/wp\/v2\/users\/2086"}],"replies":[{"embeddable":true,"href":"https:\/\/blogs.bu.edu\/wing\/wp-json\/wp\/v2\/comments?post=43"}],"version-history":[{"count":4,"href":"https:\/\/blogs.bu.edu\/wing\/wp-json\/wp\/v2\/posts\/43\/revisions"}],"predecessor-version":[{"id":47,"href":"https:\/\/blogs.bu.edu\/wing\/wp-json\/wp\/v2\/posts\/43\/revisions\/47"}],"wp:attachment":[{"href":"https:\/\/blogs.bu.edu\/wing\/wp-json\/wp\/v2\/media?parent=43"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blogs.bu.edu\/wing\/wp-json\/wp\/v2\/categories?post=43"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blogs.bu.edu\/wing\/wp-json\/wp\/v2\/tags?post=43"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}