
The Dirty Little Secrets of Search

Reader Comment: Anyone who's used Google since its inception, as I have, can't help but be aware of its commercially skewed, dumbed down descent into banal, superficial, and just plain bizarre results. I also use Gmail, for now, and if I send an email with 'cake' as a subject head or in the body of the email, Google blankets me with commercial links to retailers of anything cake related. It's annoying, it's pathetic (why would I click on these links, especially when I resent being bombarded by them), and it's just plain creepy. I'm a Google user who uses Google less and less and less. I get better targeted, more honest, and truly useful search results elsewhere.

Reader Comment: Google sold out its integrity a long, long time ago. That is why the company is so rich. Vint Cerf, one of the so-called "Fathers of the Internet," used to rant and rave that this would never happen to the internet. Then he became Senior VP of Google and a very wealthy man. Google is all about advertising and they have done a great job of building a sales organization that excels at what they do. They see no moral issue or even a sense of responsibility. The truth may be...who cares?

Reader Comment: I'm shocked, shocked to find that gambling is going on in here!

By David Segal

PRETEND for a moment that you are Google's search engine.

Someone types the word "dresses" and hits enter. What will be the very first result?

There are, of course, a lot of possibilities. Macy's comes to mind. Maybe a specialty chain, like J. Crew or the Gap. Perhaps a Wikipedia entry on the history of hemlines.

O.K., how about the word "bedding"? Bed Bath & Beyond seems a candidate. Or Wal-Mart, or perhaps the bedding section of Amazon.com.

"Area rugs"? Crate & Barrel is a possibility. Home Depot, too, and Sears, Pier 1 or any of those Web sites with "area rug" in the name, like arearugs.com.

You could imagine a dozen contenders for each of these searches. But in the last several months, one name turned up, with uncanny regularity, in the No. 1 spot for each and every term:

J. C. Penney.

The company bested millions of sites, and not just in searches for dresses, bedding and area rugs. For months, it was consistently at or near the top in searches for "skinny jeans," "home decor," "comforter sets," "furniture" and dozens of other words and phrases, from the blandly generic ("tablecloths") to the strangely specific ("grommet top curtains").

This striking performance lasted for months, most crucially through the holiday season, when there is a huge spike in online shopping. J. C. Penney even beat out the sites of manufacturers in searches for the products of those manufacturers. Type in "Samsonite carry on luggage," for instance, and Penney for months was first on the list, ahead of Samsonite.com.

With more than 1,100 stores and $17.8 billion in total revenue in 2010, Penney is certainly a major player in American retailing. But Google's stated goal is to sift through every corner of the Internet and find the most important, relevant Web sites.

Does the collective wisdom of the Web really say that Penney has the most essential site when it comes to dresses? And bedding? And area rugs? And dozens of other words and phrases?

The New York Times asked an expert in online search, Doug Pierce of Blue Fountain Media in New York, to study this question, as well as Penney's astoundingly strong search-term performance in recent months. What he found suggests that the digital age's most mundane act, the Google search, often represents layer upon layer of intrigue. And the intrigue starts in the sprawling, subterranean world of "black hat" optimization, the dark art of raising the profile of a Web site with methods that Google considers tantamount to cheating.

Despite the cowboy outlaw connotations, black-hat services are not illegal, but trafficking in them risks the wrath of Google. The company draws a pretty thick line between techniques it considers deceptive and "white hat" approaches, which are offered by hundreds of consulting firms and are legitimate ways to increase a site's visibility. Penney's results were derived from methods on the wrong side of that line, says Mr. Pierce. He described the optimization as the most ambitious attempt to game Google's search results that he has ever seen.

"Actually, it's the most ambitious attempt I've ever heard of," he said. "This whole thing just blew me away. Especially for such a major brand. You'd think they would have people around them that would know better."

TO understand the strategy that kept J. C. Penney in the pole position for so many searches, you need to know how Web sites rise to the top of Google's results. We're talking, to be clear, about the "organic" results (in other words, the ones that are not paid advertisements). In deriving organic results, Google's algorithm takes into account dozens of criteria, many of which the company will not discuss.

But it has described one crucial factor in detail: links from one site to another.

If you own a Web site, for instance, about Chinese cooking, your site's Google ranking will improve as other sites link to it. The more links to your site, especially those from other Chinese cooking-related sites, the higher your ranking. In a way, what Google is measuring is your site's popularity by polling the best-informed online fans of Chinese cooking and counting their links to your site as votes of approval.
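The link-as-vote idea can be sketched in a few lines of Python. This is only a toy version of the publicly described PageRank idea, not Google's actual ranking code, which weighs many criteria the company does not discuss. The site names, the damping value of 0.85 and the function name pagerank are all illustrative assumptions.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Toy PageRank. `links` maps each page to the pages it links to."""
    pages = set(links) | {t for targets in links.values() for t in targets}
    n = len(pages)
    rank = {page: 1.0 / n for page in pages}  # start with equal scores
    for _ in range(iterations):
        # Every page keeps a small baseline score regardless of links.
        new_rank = {page: (1.0 - damping) / n for page in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # A page splits its "vote" evenly among the pages it links to.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # A page with no outgoing links spreads its score evenly.
                share = damping * rank[page] / n
                for other in pages:
                    new_rank[other] += share
        rank = new_rank
    return rank


# Hypothetical sites: three cooking sites link to chinese-cooking.example.
links = {
    "wok-blog.example": ["chinese-cooking.example"],
    "dumpling-fans.example": ["chinese-cooking.example", "wok-blog.example"],
    "noodle-forum.example": ["chinese-cooking.example"],
    "chinese-cooking.example": [],
}
ranks = pagerank(links)
top_page = max(ranks, key=ranks.get)
print(top_page)
```

In this made-up graph, chinese-cooking.example ends up with the highest score because three other sites link to it, which is the "votes of approval" effect in miniature.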

But even links that have nothing to do with Chinese cooking can bolster your profile if your site is barnacled with enough of them. And here's where the strategy that aided Penney comes in. Someone paid to have thousands of links placed on hundreds of sites scattered around the Web, all of which lead directly to JCPenney.com.

Who is that someone? A spokeswoman for J. C. Penney, Darcie Brossart, says it was not Penney.

"J. C. Penney did not authorize, and we were not involved with or aware of, the posting of the links that you sent to us, as it is against our natural search policies," Ms. Brossart wrote in an e-mail. She added, "We are working to have the links taken down."

The links do not bear any fingerprints, but nothing else about them was particularly subtle. Using an online tool called Open Site Explorer, Mr. Pierce found 2,015 pages with phrases like "casual dresses," "evening dresses," "little black dress" or "cocktail dress." Click on any of these phrases on any of these 2,015 pages, and you are bounced directly to the main page for dresses on JCPenney.com.

Some of the 2,015 pages are on sites related, at least nominally, to clothing. But most are not. The phrase "black dresses" and a Penney link were tacked to the bottom of a site called nuclear.engineeringaddict.com. "Evening dresses" appeared on a site called casino-focus.com. "Cocktail dresses" showed up on bulgariapropertyportal.com. "Casual dresses" was on a site called elistofbanks.com. "Semi-formal dresses" was pasted, rather incongruously, on usclettermen.org.

There are links to JCPenney.com's dresses page on sites about diseases, cameras, cars, dogs, aluminum sheets, travel, snoring, diamond drills, bathroom tiles, hotel furniture, online games, commodities, fishing, Adobe Flash, glass shower doors, jokes and dentists, and the list goes on.

Some of these sites seem all but abandoned, except for the links. The greeting at myflhomebuyer.com sounds like the saddest fortune cookie ever: "Sorry, but you are looking for something that isn't here."

When you read the enormous list of sites with Penney links, the landscape of the Internet acquires a whole new topography. It starts to seem like a city with a few familiar, well-kept buildings, surrounded by millions of hovels kept upright for no purpose other than the ads that are painted on their walls.

Exploiting those hovels for links is a Google no-no. The company's guidelines warn against using tricks to improve search engine rankings, including what it refers to as "link schemes." The penalty for getting caught is a pair of virtual concrete shoes: the company sinks in Google's results.

Often drastically. In 2006, Google announced that it had caught BMW using a black-hat strategy to bolster the company's German Web site, BMW.de. That site was temporarily given what the BBC at the time called "the death penalty," stating that it was "removed from search results."

BMW acknowledged that it had set up "doorway pages," which exist just to attract search engines and then redirect traffic to a different site. The company at the time said it had no intention of deceiving users, adding "if Google says all doorway pages are illegal, we have to take this into consideration."

J. C. Penney, it seems, will not suffer the same fate. But starting Wednesday, it was the subject of what Google calls "corrective action."

Last week, The Times sent Google the evidence it had collected about the links to JCPenney.com. Google promptly set up an interview with Matt Cutts, the head of the Webspam team at Google, and a man whose every speech, blog post and Twitter update is parsed like papal encyclicals by players in the search engine world.

"I can confirm that this violates our guidelines," said Mr. Cutts during an hourlong interview on Wednesday, after looking at a list of paid links to JCPenney.com.

He said Google had detected previous guidelines violations related to JCPenney.com on three occasions, most recently last November. Each time, steps were taken that reduced Penney's search results (Mr. Cutts avoids the word "punished"), but Google did not later "circle back" to the company to see if it was still breaking the rules, he said.

He and his team had missed this recent campaign of paid links, which he said had been up and running for the last three to four months.

"Do I wish our system had detected things sooner? I do," he said. "But given the one billion queries that Google handles each day, I think we do an amazing job."

Mr. Cutts sounded remarkably upbeat and unperturbed during this conversation, which was a surprise given that we were discussing a large, sustained effort to snooker his employer. Asked about his zenlike calm, he said the company strives not to act out of anger. You get the sense that Mr. Cutts and his colleagues are acutely aware of the singular power they wield as judge, jury and appeals panel, and they're eager to project an air of maturity and judiciousness.

That said, he added, "I don't think I could do my job well if in some sense I was not offended by things that were bad for Google users."

"Am I happy this happened?" he later asked. "Absolutely not. Is Google going to take strong corrective action? We absolutely will."

And the company did. On Wednesday evening, Google began what it calls a "manual action" against Penney, essentially demotions specifically aimed at the company.

At 7 p.m. Eastern time on Wednesday, J. C. Penney was still the No. 1 result for "Samsonite carry on luggage."

Two hours later, it was at No. 71.

At 7 p.m. on Wednesday, Penney was No. 1 in searches for "living room furniture."

By 9 p.m., it had sunk to No. 68.

In other words, one moment Penney was the most visible online destination for living room furniture in the country.

The next it was essentially buried.

PENNEY reacted to this instant reversal of fortune by, among other things, firing its search engine consulting firm, SearchDex. Executives there did not return e-mail or phone calls.

Penney also issued a statement: "We are disappointed that Google has reduced our rankings due to this matter," Ms. Brossart wrote, "but we will continue to work actively to retain our high natural search position."

She added that while the collection of links surely brought in additional revenue, it was hardly a bonanza. Just 7 percent of JCPenney.com's traffic comes from clicks on organic search results, she wrote. A far bigger source of profits this holiday season, she stated, came from partnerships with companies like Yahoo and Time Warner, from new mobile applications and from in-store kiosks.

Search experts, however, say Penney likely reaped substantial rewards from the paid links. If you think of Google as the entrance to the planet's largest shopping center, the links helped Penney appear as though it was the first and most inviting spot in the mall, to millions and millions of online shoppers.

How valuable was that? A study last May by Daniel Ruby of Chitika, an online advertising network of 100,000 sites, found that, on average, 34 percent of Google's traffic went to the No. 1 result, about twice the percentage that went to No. 2.

The Keyword Estimator at Google puts the number of searches for "dresses" in the United States at 11.1 million a month, an average based on 12 months of data. So for "dresses" alone, Penney may have been attracting roughly 3.8 million visits every month it showed up as No. 1. Exactly how many of those visits translate into sales, and the size of each sale, only Penney would know.
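The arithmetic behind that estimate is simple enough to check. The short Python sketch below multiplies the two figures quoted above; it assumes, as the estimate itself does, that Chitika's average share for a No. 1 result applies to this one keyword. The variable names are illustrative.

```python
# Figures quoted in the article.
monthly_searches = 11_100_000  # U.S. searches for "dresses" per month
top_result_share = 0.34        # average share of traffic going to the No. 1 result

# Assumption: the average share holds for this particular search term.
estimated_visits = monthly_searches * top_result_share

# Prints 3,774,000, which rounds to about 3.8 million visits a month.
print(f"{estimated_visits:,.0f}")
```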

But in January, the company was crowing about its online holiday sales. Kate Coultas, a company spokeswoman, wrote to a reporter in January, "Internet sales through jcp.com posted strong growth in December, with significant increases in traffic and orders for the key holiday shopping periods of the week after Thanksgiving and the week before Christmas."

There was considerable pressure from investors for Penney to deliver strong holiday results. It has been struggling through one of the more trying times of its century of retailing. The $17.8 billion in revenue it reported last year is the exact same figure it reported in 2001. It announced in January that it would close a handful of underperforming stores, as well as two of its five call centers and 19 outlets that sell excess catalog merchandise.

Adding to the company's woes is the demise of its catalog business. Penney has phased out what it called its Big Book and poured money into its Web site. But so far, the loss of the catalog has not been offset by the expansion of the Web site. At its peak, the catalog brought in about $4 billion in revenue. In 2009, the site brought in $1.5 billion.

"For the last 35 years, Penney has tried to be accepted as a department store, and during unusually good times, it does very well," said Bernard Sosnick, an analyst at Gilford Securities. "But in bad times, it gets punished by shoppers who pull back after having spent aspirationally."

MANY owners of Web sites with Penney links seem to relish their unreachability. But there were exceptions, and they included cocaman.ch. ("Geekness - closer to the world" is the cryptic header atop the site.) It turned out to be owned and run by Corsin Camichel, a chatty 25-year-old I.T. security analyst in Switzerland.

The word "dresses" appears in a small collection of links in the middle of a largely blank Cocaman page. Asked about that link, Mr. Camichel said his records show that it turned up on his site last April, though he said it might have been earlier than that.

The link came through a Web site, TNX.net, which pays Mr. Camichel with TNX points, which he then trades for links that drive traffic to his other sites, like cookingutensils.net. He earns money when people visit that site and click on the ads. He could also, he said, get cash from TNX. Currently, Cocaman is home to 403 links, all of them placed there by TNX on behalf of clients.

"You do pretty well," he wrote, referring to income from his links trading. "The thing is, the more you invest (time and money) the better results you get. Right now I get enough to buy myself new test devices for my Android apps (like $150/month) with zero effort. I have to do nothing. Ads just sit there and if people click, I make money."

Efforts to reach TNX itself last week via e-mail were not successful.

Interviewing a purveyor of black-hat services face-to-face was a considerable undertaking. They are a low-profile bunch. But a link-selling specialist named Mark Stevens, who says he had nothing to do with the Penney link effort, agreed to chat. He did so on the condition that his company not be named, a precaution he justified by recounting what happened when the company apparently angered Google a few months ago.

"It was my fault," Mr. Stevens said. "I posted a job opening on a Stanford Engineering alumni mailing list, and mentioned the name of our company and a brief description of what we do. I think some Google employees saw it."

In a matter of days, the company could not be found in a Google search.

"Literally, you typed the name of the company into the search box and we did not turn up. Anywhere. You'd find us if you knew our Web address. But in terms of search, we just disappeared."

The company now operates under a new name and with a profile that is low even in the building where it claims to have an office. The landlord at the building, a gleaming, glassy midrise next to Route 101 in Redwood City, Calif., said she had never heard of the company.

Mr. Stevens agreed to meet in mid-January for a dinner paid for by The Times. Asked to pick a "fine restaurant" in his neighborhood, he rather cheekily selected a modern French bistro in Palo Alto offering an eight-course prix fixe meal for $118. Liquid nitrogen and "fairy tale pumpkin" were two of the featured ingredients.

Mr. Stevens turned out to be a boyish-looking 31-year-old native of Singapore. (Stevens is the name he uses for work; he says he has a Chinese last name, which he did not share.) He speaks with a slight accent and in an animated hush, like a man worried about eavesdroppers. He describes his work with the delighted, mischievous grin of a sophomore who just hid a stink bomb.

"The key is to roll the campaign out slowly," he said as he nibbled at seared duck foie gras. "A lot of companies are in a rush. They want as many links as we can get them as fast as possible. But Google will spot that. It will flag a Web site that goes from zero links to a few hundred in a week."

The hardest part about the link-selling business, he explained, is signing up deep-pocketed mainstream clients. Lots of them, it seems, are afraid they'll get caught. Another difficulty is finding quality sites to post links. Whoever set up the JCPenney.com campaign, he said, relied on some really low-rent, spammy sites, the kind with low PageRanks, as Google calls its patented measure of a site's quality. The higher the PageRank, the more "Google juice" a site offers others to which it is linked.

"The sites that TNX uses mostly have low PageRanks," Mr. Stevens said.

Mr. Stevens said that Web site owners, or publishers, as he calls them, get a small fee for each link, and the transaction is handled entirely over the Web.

Publishers can reject certain keywords and links (Mr. Stevens said some balked at a lingerie link), but for the most part the system is on a kind of autopilot. A client pays Mr. Stevens and his colleagues for links, which are then farmed out to Web sites. Payment to publishers is handled via PayPal.

You might expect Mr. Stevens to have a certain amount of contempt for Google, given that he spends his professional life finding ways to subvert it. But through the evening he mentioned a few times that he's in awe of the company, and the quality of its search engine.

So how does he justify all his efforts to undermine that engine?

"I think we need to make a distinction between two different kinds of searches: informational and commercial," he said. "If you search 'cancer,' that's an informational search and on those, Google is amazing. But in commercial searches, Google's results are really polluted. My own personal experience says that the guy with the biggest S.E.O. budget always ranks the highest."

To Mr. Stevens, S.E.O. is a game, and if you're not paying black hats, you are losing to rivals with fewer compunctions.

WHY did Google fail to catch a campaign that had been under way for months? One, no less, that benefited a company that Google had already taken action against three times? And one that relied on a collection of Web sites that were not exactly hiding their spamminess?

Mr. Cutts emphasized that there are 200 million domain names and a mere 24,000 employees at Google.

âSpammers never stop,â he said. Battling those spammers is a never-ending job, and one that he believes Google keeps getting better and better at.

Here's another hypothesis, this one for the conspiracy-minded. Last year, Advertising Age obtained a Google document that listed some of its largest advertisers, including AT&T, eBay and yes, J. C. Penney. The company, this document said, spent $2.46 million a month on paid Google search ads, the kind you see next to organic results.

Is it possible that Google was willing to countenance an extensive black-hat campaign because it helped one of its larger advertisers? It's the sort of question that European Union officials are now studying in an investigation of possible antitrust abuses by Google.

Investigators have been asking advertisers in Europe questions like this: "Please explain whether and, if yes, to what extent your advertising spending with Google has ever had an influence on your ranking in Google's natural search." And: "Has Google ever mentioned to you that increasing your advertising spending could improve your ranking in Google's natural search?"

Asked if Penney received any breaks because of the money it has spent on ads, Mr. Cutts said, "I'll give a categorical denial." He then made an impassioned case for Google's commitment to separating the money side of the business from the search side. The former has zero influence on the latter, he said.

"If you asked me for the names of five people in advertising engineering, I don't think I could give you the names," he said. "There is a very long history at Google of saying 'We are not going to worry about short-term revenue.'" He added: "We rely on the trust of our users. We realize the responsibility that we have to our users."

He noted, too, that before The Times presented evidence of the paid links to JCPenney.com, Google had just begun to roll out an algorithm change that had a negative effect on Penney's search results. (The tweak affected "how we trust links," Mr. Cutts said, declining to elaborate.)

True, JCPenney.com's showing in Google searches had declined slightly by Feb. 8, as the algorithm change began to take effect. In "comforter sets," Penney went from No. 1 to No. 7. In "sweater dresses," from No. 1 to No. 10.

But the real damage to Penney's results began when Google started that "manual action." The decline can be charted: On Feb. 1, the average Penney position for 59 search terms was 1.3.

On Feb. 8, when the algorithm was changing, it was 4.

By Feb. 10, it was 52.

MR. CUTTS said he did not plan to write about Penney's situation, as he did with BMW in 2006. Rarely, he explained, does he single out a company publicly, because Google's goal is to preserve the integrity of results, not to embarrass people.

"But just because we don't talk about it," he said, "doesn't mean we won't take strong action."

— David Segal
New York Times




