Search Engine Privacy
- EPIC, Privacy Groups, Urge Court to Reject Proposed Google Settlement: EPIC, joined by several leading privacy and consumer protection organizations, submitted a letter to the Northern District of California regarding a proposed settlement in a class-action lawsuit against Google. The settlement was proposed by class action lawyers on behalf of Google users in a case concerning the unlawful disclosure of search terms by Google to third parties. Under the terms of the proposed settlement, Google would be allowed to continue to disclose user search terms to third parties. The letter explains that the proposed settlement "provides no benefit to Class members" because it does not require Google to change its business practices. "Furthermore," the letter states, "the proposed cy pres allocation is not aligned with the interests of the purported Class members." "Cy press" ("as near as possible") is a legal doctrine that allows courts to allocate funds to protect the interests of individuals when there is a class action settlement. Under Ninth Circuit precedent, cy pres funds must be used to advance the interests of the class members. EPIC previously highlighted the dangers of improper cy pres distributions in settlements. For more information, see EPIC: Fraley v. Facebook, EPIC: Lane v. Facebook, and EPIC: Search Engine Privacy and EPIC: Google Buzz. (Aug. 22, 2013)
- IPv6, New Internet Protocol, Launches with Privacy Questions: The Internet Society has announced the world launch of IPv6, which will dramatically expand the number of Internet addresses. IPv6 creates fixed IP addresses, allowing routine tracking of Internet-connected devices, such as laptops, cellphones, and soon many consumer appliances. This will make it easier for law enforcement agencies and advertisers to track users of Internet-based services. A Privacy Extension allows the use of IPv6 without persistent identifiers, though it is not clear how widely it will be be adopted. In 2008, EPIC testified before the European Parliament on IP addresses and privacy, and said that companies that use IPv6 linked to identifiable users should be subject to data privacy requirements. The EU classifies IP addresses as personal information. For more information: See EPIC: Search Engine Privacy. (Jun. 7, 2012)
- Pew Study: Search Engine Users Anxious About Collection of Personal Information: A Pew study found that users of search engines were pleased with the quality of search results but opposed targeted advertising and search results, and were generally anxious about the collection of personal information by search engines. Specifically, 73 percent of those surveyed were opposed to search engines tracking their searches, and 68 percent opposed behavioral advertising. 83 percent of respondents reported using Google to conduct searches. Recently, Google began combining user data gathered from more than sixty Google products and services—including Google search--to create a single, comprehensive profile for each user. For more information, see EPIC: Search Engine Privacy and EPIC: EPIC v. FTC. (Mar. 9, 2012)
- Congress Seeks Answers on Google's Plans for Data Consolidation: Eight members of Congress wrote to Google asking the company to explain the "steps [that] are being taken to ensure the protection of consumers' privacy rights." The letter follows Google's announcement that it would begin combining data gathered on consumers of over 60 Google products and services, including Gmail, Google+, Youtube, and the Android mobile operating system. The members' letter includes 11 specific questions ranging from the ways in which Google collects information to the specific consequences for Android phone users. In 2010, EPIC, along with other privacy groups, wrote a letter to Google about the company's decision to combine user data among 12 Google services. The groups warned that the practical effect would be to reduce privacy protection for users of Google services. For more information, see EPIC: In re: Google Buzz and EPIC: Google search. (Jan. 27, 2012)
- Privacy International Launches System to Shed Light on Controversial Technologies: International watchdog Privacy International has announced the launch of a new website for bringing transparency to "technical mysteries" behind controversial systems. Cracking the Black Box identifies key questions regarding mysterious technologies and asks experts, whistleblowers, and other concerned parties to "help crack the box" by anonymously contributing ideas and input. The organization responsible for the technology in question is then invited to provide an official response. The first two issues addressed on the PI site are the Google Wi-Fi controversy and the EU proposal to retain search data. (Jun. 16, 2010)
- Congress Pursues Investigation of Google and Facebook's Business Practices: Following similar letters from other Congressional leaders, the head of the House Judiciary Committee has asked Google Inc. and Facebook to cooperate with government inquiries into privacy practices at both companies. Rep. Conyers (D-MI) noted that Google's collection of user data "may be the subject of federal and state investigations" and asked Google to retain the data until "such time as review of this matter is complete." Rep. Conyers also asked Facebook to provide a detailed explanation regarding its collection and sharing of user information. The House Judiciary Committee is expected to hold hearings on electronic privacy later this year. For more information, see EPIC: Facebook Privacy, EPIC: In re Facebook II, and EPIC: Search Engine Privacy. (Jun. 1, 2010)
- Microsoft to Delete Search Data after Six Months, Following Recommendation by European Privacy Officials: In order to comply with European privacy law, Microsoft announced that it will delete user search data, including IP addresses, after six months. In 2008 the Article 29 Working Group, which includes data protection officials across the European Union, met with Microsoft, Google, and Yahoo to discuss their data retention practices. Following a determination that records are subject to European privacy law, the Article 29 Working Group asked the search engine companies to eliminate online user data, including IP addresses and search queries, after six months. Microsoft will redesign its new Bing search engine to comply with the request. It is unclear at this point what Google and Yahoo will do. In early 2008, EPIC urged the European Parliament to protect the privacy of search histories. For more information, see EPIC: Search Engine Privacy. (Jan. 20, 2010)
- Change in Yahoo Search Retention Leaves Privacy Questions Unresolved. Yahoo announced that, after 90 days, it will obscure some elements in the records that it keeps about all Internet users who use the company's services. The search company will continue to keep modified record locators, time/date stamps, web pages viewed, and a persistent user identifier, known as a "cookie" for an indefinite period. Yahoo is also retaining much of the IP address, which typically identifies a user's device, such as a laptop or a mobile phone. Privacy rules classify IP addresses as "personal data." Experts have criticized the partial deletion of IP address data as insufficient to protect consumers, and called for complete deletion. For more information, see EPIC's Search Engine Privacy page. (Dec. 18, 2008)
- Google "Flu Trends" Raises Privacy Concerns. Google announced this week a new web tool that may make it possible to detect flu outbreaks before they might otherwise be reported. Google Flu Trends relies on individual search terms, such as "flu symptoms," provided by Internet users. Google has said that it will only reveal aggregate data, but there are no clear legal or technological privacy safeguards to prevent the disclosure of individual search histories concerning the flu, or related medical concerns, such as "AIDS symptoms," "ritalin," or "Paxil." Privacy and medical groups have urged Google to be more transparent and publish the algorithm on which Flu Trends data is based so that the public can determine whether the privacy safeguards are adequate. (Nov. 12, 2008)
- European Privacy Officials: Privacy Rules Apply to Search Engines. European privacy officials have established "a clear set of responsibilities" on search engine companies regarding their handling of user data. The opinion, issued by the Article 29 Working Group, states that the European Union Data Protection Directive requires search engines to "delete or irreversibly anonymise personal data once they no longer serve the specified and legitimate purpose" for which they were collected. This requirement has particular significance for search engines, because European privacy rules classify Internet Protocol (IP) addresses as "personal data." The opinion further holds that European privacy laws generally apply to search engines "even when their headquarters are outside [Europe]," and requires that search engines must delete personal data within six months of collection. (Apr. 7, 2008)
- Search Histories Subject to European Privacy Rules. European privacy officials determined this week that companies operating search engines will be subject to European privacy rules that limit the collection, use, and disclosure of personal information. The privacy officials who make up the Article 29 Working Group stated that "The protection of the users' privacy and the guaranteeing of their rights, such as the right to access to their data and the right to information as provided for by the applicable data protection regulations, remain the core issues of the ongoing debate." Earlier this year, EPIC urged the European Parliament to protect the privacy of search histories. A report from the Article 29 Working Group on Search Engines and Privacy is expected in April. (Feb. 22, 2008)
Internet search engines are the primary means by which individuals access Internet content. In January 2008, Americans used search engines to conduct more than 10 billion searches. Typically, search engines collect detailed information that is personally identifiable or can be made personally identifiable. This information includes the search terms submitted to the search engine, as well as the time, date, and location of the computer submitting the search. This information is generally collected for marketing and consumer profiling purposes. It is also used by search engines to carry out research and generate statistical usage data.
In 2008, EPIC urged the European Parliament to protect the privacy of search histories.
An IP address is a device's (typically a computer's) numerical address as expressed in the format specified in the Internet Protocol. In IPv4, the current addressing format, an IP address is a 32-bit sequence divided into four groups of decimal numbers separated by periods. In some circumstances, the IP address identifies a unique computer. In other circumstances, such as when a network of computers connects to the Internet via a single Internet connection, it may not. An IP address for a computer is similar to a telephone number for a telephone.
The emergence of targeted Internet advertising has led to "behavioral marketing." In the course of recording users' viewing habits and monitoring their search terms, companies collect information about user interests and tastes, including the things they buy, the stories they read, and the websites they visit, in addition to very sensitive personal information. Search terms entered into search engines may reveal a plethora of personal information such as an individual's medical issues, religious beliefs, political preferences, sexual orientation, and investments. The expansion of the behavioral marketing industry, as well as its ability and incentive to monitor online search behavior, has produced significant privacy problems and substantial risks to Internet users. Opaque industry practices result in consumers remaining largely unaware of the monitoring of their online behavior, the security of this information and the extent to which this information is kept confidential. Industry practices, in the absence of strong privacy principles, also prevent users from exercising any meaningful control over their personal data that is obtained.
Public Disclosure of Search Engine Information
In 2006, America Online (AOL) published three months of search records for 658,000 Americans. AOL attempted to "anonymize" the records, and intended for academics and technologists to use the data for research purposes. The records did not link searches to IP addresses or user names, but did group searches by individual users via randomly-assigned numerical IDs. Subsequent events demonstrated that AOL's storage of numerical IDs as opposed to usernames or IP addresses does not necessarily prevent search data from being linked back to individuals. Though the search logs released by AOL had been "anonymized," identifying the user by only a number, quick research by New York Times reporters matched some user numbers with the correct individuals. Other sources identified sensitive and occasionally disturbing personal information in the AOL search data, including user searches for "how to kill your wife" "anti psychotic drugs," and "aftermath of incest." In response, several privacy groups filed complaints with the Federal Trade Commission.
IP Address Privacy in the United States
In the United States, federal law does not provide uniform privacy protections for personal data submitted to search engines or for IP addresses. Some federal regulations (i.e. 45 C.F.R. § 164.514(b)(O)) treat IP addresses as "individually identifiable" information for specific purposes, but such treatment is not comprehensive.
IP Address Privacy in the European Union
The European Commission classifies IP addresses as personal data. Search engine data falls under the relevant EU data protection directives, and EU regulations generally apply to search engine companies even when they are headquartered outside Europe. Search engines must comply with European privacy provisions if they maintain an establishment in one of the EU Member States, or if they use automated equipment based in one of the Member States for the purposes of processing personal data. European privacy rules limit the collection, use, and disclosure of personal information. The privacy officials who make up the EU Article 29 Working Group have stated that "the protection of the users' privacy and the guaranteeing of their rights, such as the right to access to their data and the right to information as provided for by the applicable data protection regulations, remain the core issues of the ongoing debate."
The European Union Data Protection Directive requires search engines to "delete or irreversibly anonymise personal data once they no longer serve the specified and legitimate purpose" for which they were collected. Retention of personal data by search engines for more than six months is presumed to be unnecessary. Search engines that retain personal data for longer periods must "demonstrate comprehensively that it is strictly necessary for the service." This requirement applies to IP address data, which virtually all search engines collect each time a user runs a search. The EU also imposes limits on the lifetime of search engines' cookies - small computer files that can track users between multiple sessions and web sites. As a technical matter, every cookie expires eventually, and web sites can easily select the expiration dates for their cookies. EU guidelines prohibit search engines from setting expiration dates farther in the future than necessary to provide search services.
The Article 29 Working Group's April 4, 2008 report issued a set of obligations to search engine firms, including:
- Search engines should get informed consent from users if they correlate personal data across different services, such as desktop search;
- Search engine providers must delete or anonymise (in an irreversible and efficient way) personal data once they are no longer necessary for the purpose for which they were collected;
- Personal data should not be held by search engines for longer than six months;
- In case search engine providers retain personal data longer than six months, they must demonstrate comprehensively that it is strictly necessary for the service;
- It is not necessary to collect additional personal data from individual users in order to be able to perform the service of delivering search results and advertisements;
- Search engine providers must give users clear and intelligible information about their identity and location and about the data they intend to collect, store, or transmit, as well as the purpose for which they are collected
Corporate Policies Regarding IP Address Privacy
Google, the leading Internet search engine, automatically collects its users' search terms in connection with their IP addresses. Google states that, after collection, it retains the personally identifiable information for 18 months, and then "anonymizes" the data linking search terms to specific IP addresses by erasing the last octect of the IP address.
Ixquick states that it deletes users' search data (including IP addresses) within 48 hours. Ixquick further states that it does not set any uniquely identifying cookies, and that it shares data with 3rd parties only in limited circumstances.
- EPIC Testimony on Search Engine Privacy in European Parliament.
- Article 29 Data Protection Working Party Opinion on data protection issues related to search engines, April 4, 2008.
- Article 29 Data Protection Working Party Statement, February 19, 2008.
- Article 29 Working Group - Main Page.
News Stories and Blog Items
- Yahoo to purge user data after 90 days, San Francisco Chronicle, December 18, 2008
- Yahoo to anonymize user data after three months, Computerworld, December 18, 2008
- Yahoo to purge user data after 90 days, Los Angeles Times, December 18, 2008
- Yahoo Limits Retention of Search Data, New York Times, December 17, 2008
- Yahoo to Anonymize User Data After 90 Days, Wired, December 17, 2008
- Yahoo Changes Data-Retention Policy, Washington Post, December 17, 2008
- Yahoo! Sets New Industry Privacy Standard with Data Retention Policy, Yahoo (Press Release), December 17, 2008
- Leading article: In search of online privacy, The Independent, UK, April 9, 2008
- EU To Restrict Time Companies Can Hold Online Search Data, Dow Jones, April 7, 2008
- Search engines warned over data, BBC News, April 7, 2008
- European Groups Says Search Engines Must Delete Search Data Within Six-Months, Search Engine Land, April 7, 2008.
- EU: 18 Months Too Long To Keep Search Data, SecurityProNews, April 7, 2008.
- Google, Yahoo Keep User Data Too Long, EU Group Says, Bloomberg, April 4, 2008.
- Google scrambles to avoid EU privacy regulators, CNET, February 25, 2008.
- I.P. Address: Partially Personal Information, The New York Times, February 24, 2008.
- Google mounts Chewbacca defense in EU privacy debate, The Register, February 23. 2008.
- Google Says I.P. Addresses Aren't Personal, The New York Times, February 22, 2008.
- Google argues against calling IP addresses "personal data," Ars Technica, February 22, 2008.
- Are IP addresses personal?, Google Public Policy Blog, February 22, 2008.
- EU: Search Engines Under EU Rules, Associated Press, February 22, 2008.
- EU data guardians: search engines must obey our rules, The Register, February 22, 2008.
- Search Engines Must Comply With Strict EU Privacy Rules, Mashable, February 22, 2008.
- Google, Yahoo, Microsoft & Other Search Engines Must Comply With EU Privacy Rules, Search Engine Land, February 22, 2008.
- European privacy advocates to issue report in April, International Herald Tribune, February 20, 2008.
- EU Ponders Privacy of Internet Addresses, PC World, January 27, 2008.
- IP addresses could become "personal information" in Europe, Ars Technica, January 22, 2008.
- EU official says IP address is personal, MSNBC, January 21, 2008.
- EU: IP Addresses Are Personal Information, CBS News, January 21, 2008.