{ "@context": "https://schema.org", "@graph": [ { "@type": "Organization", "@id": "https://compareinternet.com/#organization", "name": "Blog", "logo": { "@type": "ImageObject", "@id": "https://compareinternet.com/#logo", "url": "", "contentUrl": "", "caption": "Blog", "inLanguage": "en-US", "width": "144", "height": "63" } }, { "@type": "WebSite", "@id": "https://compareinternet.com/#website", "url": "https://compareinternet.com/", "name": "https://compareinternet.com/", "publisher": { "@id": "https://compareinternet.com/#organization" }, "inLanguage": "en-US" }, { "@type": "ImageObject", "@id": "https://content.isg.us/wp-content/uploads/2024/07/RedditBlock1.jpg", "url": "https://content.isg.us/wp-content/uploads/2024/07/RedditBlock1.jpg", "width": "1200", "height": "750", "caption": "
Reddit Blocks Major Search Engines Except Google
", "inLanguage": "en-US" }, { "@type": "BreadcrumbList", "@id": "https://compareinternet.com/news/artificial-intelligence/reddit-blocks-major-search-engines-except-google/#breadcrumb", "itemListElement": [ { "@type": "ListItem", "position": "1", "item": { "@id": "https://compareinternet.com/", "name": "Home" } }, { "@type": "ListItem", "position": "2", "item": { "@id": "https://compareinternet.com/news/", "name": "News" } }, { "@type": "ListItem", "position": "3", "item": { "@id": "https://compareinternet.com/news/artificial-intelligence/reddit-blocks-major-search-engines-except-google/", "name": "Reddit Blocks Major Search Engines Except Google" } } ] }, { "@type": "WebPage", "@id": "https://compareinternet.com/news/artificial-intelligence/reddit-blocks-major-search-engines-except-google/#webpage", "url": "https://compareinternet.com/news/artificial-intelligence/reddit-blocks-major-search-engines-except-google/", "name": "DISH vs Cable | Infinity DISH", "datePublished": "2024-07-30T16:30:45+00:00", "dateModified": "2024-07-30T16:32:22+00:00", "isPartOf": { "@id": "https://compareinternet.com//#website" }, "primaryImageOfPage": { "@id": "https://content.isg.us/wp-content/uploads/2024/07/RedditBlock1.jpg" }, "inLanguage": "en-US", "breadcrumb": { "@id": "https://compareinternet.com/news/artificial-intelligence/reddit-blocks-major-search-engines-except-google/#breadcrumb" } }, { "@type": "Person", "@id": "https://compareinternet.com/rosslyn-elliott/", "name": "Rosslyn Elliott", "url": "https://compareinternet.com/rosslyn-elliott/", "image": { "@type": "ImageObject", "@id": "https://content.isg.us/wp-content/uploads/2023/02/Roz-Elliot.jpeg", "url": "https://content.isg.us/wp-content/uploads/2023/02/Roz-Elliot.jpeg", "caption": "Rosslyn Elliott", "inLanguage": "en-US" }, "worksFor": { "@id": "https://compareinternet.com/" } }, { "@context": "https://schema.org", "@type": "NewsArticle", "articleBody": "<p>Last week,<a 
href="https://www.404media.co/google-is-the-only-search-engine-that-works-on-reddit-now-thanks-to-ai-deal/" target="_blank" rel="noopener noreferrer"> 404 Media</a> broke the news that Reddit has blocked most major search engines from indexing its recent content.</p><p>The single exception is Google, as a result of the company’s<a href="https://www.searchenginejournal.com/reddit-limits-search-engine-access-google-remains-exception/523131/" target="_blank" rel="noopener noreferrer"> agreement earlier this year</a> to pay for Reddit’s content.</p><p>Though Google’s payment to Reddit may seem like a logical reason for its continuing access, the arrangement gives Google yet another major advantage over competitors.</p><p>For a company already facing antitrust lawsuits over its market dominance, the outcome may invite further government scrutiny in the long run.</p><h2>How Did Reddit Block Search Engines?</h2><p>Reddit recently updated its robots.txt file, a standard plain-text file that tells web crawlers which parts of a website they may crawl and index. 
This change prevents web crawlers from accessing Reddit’s latest posts and comments, affecting a wide range of popular search engines.</p><p>Google apparently is using an authorized manual override to avoid the block.</p><h2>Search Engines That Can’t Access New Reddit Content</h2><p>The following<a href="https://www.ccn.com/news/technology/reddit-blocks-search-engines-ai-bots-google/" target="_blank" rel="noopener noreferrer"> search engines</a> have been affected by Reddit’s new policy:</p><p>Bing</p><p>DuckDuckGo</p><p>Mojeek</p><p>Qwant</p><p>Baidu</p><p>Yandex</p><p>The search engine Kagi still has access to new Reddit data because of its previous agreement to purchase content from Google.</p><p> </p><div id="attachment_6631" style="width: 1034px" class="wp-caption alignnone"><img aria-describedby="caption-attachment-6631" decoding="async" loading="lazy" class="size-large wp-image-6631 cap_b cap_cv c_color_w" src="https://content.isg.us/wp-content/uploads/2024/07/RedditBlock2-1024x512.jpg" alt="Woman looks frustrated while using laptop at home" width="1024" height="512" srcset="https://content.isg.us/wp-content/uploads/2024/07/RedditBlock2-1024x512.jpg 1024w, https://content.isg.us/wp-content/uploads/2024/07/RedditBlock2-300x150.jpg 300w, https://content.isg.us/wp-content/uploads/2024/07/RedditBlock2-768x384.jpg 768w, https://content.isg.us/wp-content/uploads/2024/07/RedditBlock2.jpg 1200w" sizes="(max-width: 1024px) 100vw, 1024px" /><p id="caption-attachment-6631" class="wp-caption-text">Users may find Reddit blocked</p></div><h2></h2><h2>Google’s Deal to Pay for Reddit Content Preserves Search Access</h2><p>Google’s continued access to Reddit’s content stems from a $60 million deal struck earlier this year. This agreement allows Google to use Reddit’s data for AI training purposes, setting it apart from other search engines. 
Reddit forced the issue of compensation for its content by<a href="https://www.techradar.com/computing/artificial-intelligence/reddit-is-now-blocking-big-search-engines-and-their-ai-web-crawlers-from-bringing-up-relevant-posts-unless-they-pay-up-and-google-already-has" target="_blank" rel="noopener noreferrer"> blacking out</a> Google’s access in 2023 in protest of API changes.</p><p>While users and other search engine companies were quick to claim that the lock-out occurred because of the Google deal, Reddit denies that connection. “We block all crawlers that are unwilling to commit to not using crawl data for AI training, which is in line with enforcing our Public Content Policy and updated robots.txt file,” a company spokesperson said to<a href="https://www.engadget.com/search-engines-that-dont-pay-up-cant-index-reddit-content-172949170.html" target="_blank" rel="noopener noreferrer"> Engadget.</a></p><h2>How to Tell If Your Search Engine is Blocked</h2><p>Users can easily check if their preferred search engine is affected by entering “site:reddit.com” in a search box followed by a date range or sorting by recent results.</p><p>For blocked search engines, users will notice:</p><p>· No results from the past week</p><p>· Empty search result pages</p><p>· Outdated content (several years old)</p><p>· Messages stating the site won’t allow descriptions</p><h2>Impact on Users and Search Engines</h2><p>For many internet users, adding “Reddit” to search queries has become a common way to find human-generated answers on topics ranging from tech support to personal advice. 
With the<a href="https://www.compareinternet.com/blog/is-ai-dangerous-on-the-internet/" target="_blank" rel="noopener noreferrer"> flood of AI information</a> online, Reddit provides a valuable source of human input.</p><p>With this change, users who are looking for recent Reddit content will be limited to Google or search engines that pull from Google’s index.</p><p> </p><div id="attachment_6632" style="width: 1034px" class="wp-caption alignnone"><img aria-describedby="caption-attachment-6632" decoding="async" loading="lazy" class="size-large wp-image-6632 cap_b cap_cv c_color_w" src="https://content.isg.us/wp-content/uploads/2024/07/RedditBlock3-1024x512.jpg" alt="robot finger scraping on keyboard" width="1024" height="512" srcset="https://content.isg.us/wp-content/uploads/2024/07/RedditBlock3-1024x512.jpg 1024w, https://content.isg.us/wp-content/uploads/2024/07/RedditBlock3-300x150.jpg 300w, https://content.isg.us/wp-content/uploads/2024/07/RedditBlock3-768x384.jpg 768w, https://content.isg.us/wp-content/uploads/2024/07/RedditBlock3.jpg 1200w" sizes="(max-width: 1024px) 100vw, 1024px" /><p id="caption-attachment-6632" class="wp-caption-text">AI scraping causes conflict</p></div><h2></h2><h2>Understanding Web Scraping and the AI Controversy</h2><h3>What is Web Scraping?</h3><p>Web scraping is the automated process of extracting data from websites. Reddit has a<a href="https://www.theverge.com/2024/7/24/24205244/reddit-blocking-search-engine-crawlers-ai-bot-google" target="_blank" rel="noopener noreferrer"> no-scraping policy</a> that forbids companies from scraping its data without compensation.</p><p>AI companies are just one type of organization that scrapes the web. Others include:</p><p>1. Search engines indexing content</p><p>2. Researchers gathering data</p><p>3. Businesses monitoring competitors</p><p>AI companies scrape data to train models that generate new material from it. 
This use has created unprecedented controversy.</p><p>The aggressive scraping from AI has also caused concern for individuals, as more people wish to<a href="https://www.compareinternet.com/blog/stop-data-being-used-to-train-ai/" target="_blank" rel="noopener noreferrer"> prevent their data</a> from being used by AI. General anxiety about exposed<a href="https://www.compareinternet.com/blog/remove-your-personal-information-from-the-internet/" target="_blank" rel="noopener noreferrer"> personal information</a> in the cloud is increasing with recent<a href="https://about.att.com/story/2024/addressing-data-set-released-on-dark-web.html" target="_blank" rel="noopener noreferrer"> major hacks</a>.</p><h3>The AI Data Controversy</h3><p>The use of online data for AI training has become a source of public debate and lawsuits due to several issues:</p><p>1. Copyright concerns: Many content creators argue that using their work to train AI models without permission or compensation infringes on their<a href="https://hbr.org/2023/04/generative-ai-has-an-intellectual-property-problem" target="_blank" rel="noopener noreferrer"> intellectual property</a> rights.</p><p>2. Privacy issues: Privacy advocates have concerns about personal information being included in training data without consent.</p><p>3. Bias and representation: The data used to train AI can perpetuate or amplify existing biases in online content.</p><p>4. 
Economic impact: As AI models become more sophisticated, there are fears they could<a href="https://www.nexford.edu/insights/how-will-ai-affect-jobs" target="_blank" rel="noopener noreferrer"> replace human workers</a>, especially in customer service, retail, bookkeeping, and banking.</p><h2>Reddit’s Statement on Scraping and Compensation for Content</h2><p>Reddit spokesperson<a href="https://www.theverge.com/2024/7/24/24205244/reddit-blocking-search-engine-crawlers-ai-bot-google" target="_blank" rel="noopener noreferrer"> Tim Rathschmidt</a> stated to the Verge:</p><p>“We have been in discussions with multiple search engines. We have been unable to reach agreements with all of them, since some are unable or unwilling to make enforceable promises regarding their use of Reddit content, including their use for AI.”</p><p>This statement suggests that Reddit’s primary concern is not just about compensation, but also about controlling how its content is used, particularly in AI applications.</p><p> </p><div id="attachment_6633" style="width: 1034px" class="wp-caption alignnone"><img aria-describedby="caption-attachment-6633" decoding="async" loading="lazy" class="size-large wp-image-6633 cap_b cap_cv c_color_w" src="https://content.isg.us/wp-content/uploads/2024/07/RedditBlock4-1024x512.jpg" alt="content creator sits at desk to record and publish" width="1024" height="512" srcset="https://content.isg.us/wp-content/uploads/2024/07/RedditBlock4-1024x512.jpg 1024w, https://content.isg.us/wp-content/uploads/2024/07/RedditBlock4-300x150.jpg 300w, https://content.isg.us/wp-content/uploads/2024/07/RedditBlock4-768x384.jpg 768w, https://content.isg.us/wp-content/uploads/2024/07/RedditBlock4.jpg 1200w" sizes="(max-width: 1024px) 100vw, 1024px" /><p id="caption-attachment-6633" class="wp-caption-text">Who owns web content?</p></div><h2></h2><h2>Search Block is the Latest Volley in a Larger Battle</h2><h3>Content Monetization and AI Training</h3><p>Reddit’s move aligns with a 
growing trend of content creators and platforms seeking compensation for the use of their data in AI training.</p><p>Many web publishers feel that<a href="https://www.theatlantic.com/technology/archive/2024/04/generative-ai-search-llmo/678154/" target="_blank" rel="noopener noreferrer"> their survival</a> depends on not allowing AI to take their content without payment.</p><p><a href="https://www.linkedin.com/posts/brentcsutoras_reddit-is-now-blocking-major-search-engines-activity-7222229354318061569-yfl9/?utm_source=share&utm_medium=member_ios" target="_blank" rel="noopener noreferrer">Brent Csutoras</a>, founder of Search Engine Journal, commented on the battle between AI companies and content platforms. “Publications, artists, and entertainers have been suing OpenAI and other AI companies, blocking AI companies, and fighting to avoid using public content for AI training,” Csutoras said in a LinkedIn post.</p><h3>Search Engine Choice at Risk</h3><p>If Google remains the only major search engine able to index recent Reddit content, other search engines will find it even harder to compete.</p><p>Google’s market dominance has long threatened user choice. The fact that an entire industry (SEO marketing) depends on Google’s algorithms shows the lack of a truly competitive landscape in the search engine market.</p><h3>Implications for the Open Web</h3><p>Reddit’s decision raises questions about the future of the open web. As more platforms restrict access to their content, the internet could become more fragmented, with information siloed within specific ecosystems.</p><h3>Questions Raised for Regulation and Search in the Future</h3><p>As the situation unfolds, several questions remain:</p><p>1. Will other search engines eventually strike deals similar to Google’s?</p><p>2. How will this standoff affect Reddit’s valuation following its March 2024 IPO?</p><p>3. 
Could this lead to more<a href="https://www.justice.gov/opa/pr/justice-department-sues-google-monopolizing-digital-advertising-technologies" target="_blank" rel="noopener noreferrer"> anti-trust lawsuits</a> over Google’s growing influence?</p><p>4. Will other major websites follow Reddit’s lead in restricting access?</p><p> </p><div id="attachment_6634" style="width: 1034px" class="wp-caption alignnone"><img aria-describedby="caption-attachment-6634" decoding="async" loading="lazy" class="size-large wp-image-6634 cap_c cap_l c_color_w" src="https://content.isg.us/wp-content/uploads/2024/07/RedditBlock5-1024x512.jpg" alt="Reddit buttons on light background" width="1024" height="512" srcset="https://content.isg.us/wp-content/uploads/2024/07/RedditBlock5-1024x512.jpg 1024w, https://content.isg.us/wp-content/uploads/2024/07/RedditBlock5-300x150.jpg 300w, https://content.isg.us/wp-content/uploads/2024/07/RedditBlock5-768x384.jpg 768w, https://content.isg.us/wp-content/uploads/2024/07/RedditBlock5.jpg 1200w" sizes="(max-width: 1024px) 100vw, 1024px" /><p id="caption-attachment-6634" class="wp-caption-text">Creators and publishing platforms</p></div><h2>The Future of Web Crawling and Content Access</h2><p>Reddit’s actions may set a precedent for other major websites and platforms. 
As the value of data continues to rise, we may see more content providers implementing similar restrictions on web crawlers and AI training data access.</p><h3>Potential Outcomes</h3><p>Legal and regulatory changes: Governments might step in to regulate the use of online data for AI training.</p><p>New business models: We might see the emergence of new monetization strategies for online content.</p><p>Technological adaptations: Search engines and AI companies may develop new ways to access and use online data ethically.</p><p>User behavior shifts: Internet users may change how they search for and consume online content.</p><p> </p><div id="attachment_6635" style="width: 1034px" class="wp-caption alignnone"><img aria-describedby="caption-attachment-6635" decoding="async" loading="lazy" class="size-large wp-image-6635 cap_c cap_l c_color_w" src="https://content.isg.us/wp-content/uploads/2024/07/RedditBlock6-1024x512.jpg" alt="legal scales on digital technology background" width="1024" height="512" srcset="https://content.isg.us/wp-content/uploads/2024/07/RedditBlock6-1024x512.jpg 1024w, https://content.isg.us/wp-content/uploads/2024/07/RedditBlock6-300x150.jpg 300w, https://content.isg.us/wp-content/uploads/2024/07/RedditBlock6-768x384.jpg 768w, https://content.isg.us/wp-content/uploads/2024/07/RedditBlock6.jpg 1200w" sizes="(max-width: 1024px) 100vw, 1024px" /><p id="caption-attachment-6635" class="wp-caption-text">The law must control technology</p></div><h2></h2><h2>FAQ: AI and Why Reddit Blocked Major Search Engines</h2><h3>What is web scraping?</h3><p>Web scraping is the automated process of extracting data from websites. It’s used by search engines, researchers, businesses, and AI companies to gather online information.</p><h3>How does AI potentially violate intellectual property laws?</h3><p>AI companies may use copyrighted content to train their models without permission or compensation. 
This can infringe on creators’ intellectual property rights.</p><h3>Which major search engines are blocked from accessing new Reddit content?</h3><p>Bing, DuckDuckGo, Mojeek, Qwant, Baidu, and Yandex are blocked from accessing new Reddit content. Google is the main exception due to a payment agreement.</p><h3>What is a robots.txt file?</h3><p>A robots.txt file is a standard plain-text file that tells web crawlers which parts of a website they may crawl and index. Reddit updated its file to block most search engines from recent content.</p><h3>How much did Google agree to pay Reddit for content access?</h3><p>Google agreed to pay Reddit $60 million for access to its content. This deal allows Google to use Reddit’s data for AI training purposes.</p><p> </p>", "headline": "Reddit Blocks Major Search Engines Except Google", "articleSection": "Artificial Intelligence", "datePublished": "2024-07-30T16:30:45+00:00", "dateModified": "2024-07-30T16:32:22+00:00", "publisher": [{ "@type": "Organization", "name": "Compare Internet", "logo": { "@type": "ImageObject", "url": "https://www.compareinternet.com/wp-content/uploads/2023/02/Compare-Internet-white.png", "width": 1350, "height": 360 }, "alternateName": "Compare Internet" }], "author": [{ "@type": "Person", "name": "Rosslyn Elliott", "url": "https://compareinternet.com/authors/rosslyn-elliott/", "jobTitle": "Rosslyn Elliott", "image": { "@type": "ImageObject", "url": "https://content.isg.us/wp-content/uploads/2023/02/Roz-Elliot.jpeg", "height": 337, "width": 337 } }], "image": [{ "@type": "ImageObject", "url": "https://content.isg.us/wp-content/uploads/2024/07/RedditBlock1.jpg", "height": 750, "width": 1200 }], "description": "Reddit Blocks Major Search Engines Except Google", "wordCount": "2318", "mainEntityOfPage": "https://compareinternet.com/news/artificial-intelligence/reddit-blocks-major-search-engines-except-google/" } ] }
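
As a rough illustration of how a robots.txt file steers crawlers, the sketch below runs Python's standard `urllib.robotparser` module against a made-up rule set. The rules, the crawler names `ExampleAllowedBot` and `ExampleBlockedBot`, and the `example.com` URL are assumptions for illustration only. They are not Reddit's actual robots.txt, which the article does not reproduce.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules, NOT Reddit's real file: one named crawler may fetch
# everything, while every other crawler is asked to stay out entirely.
SAMPLE_ROBOTS_TXT = """\
User-agent: ExampleAllowedBot
Allow: /

User-agent: *
Disallow: /
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text that is already in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def can_crawl(parser: RobotFileParser, user_agent: str, url: str) -> bool:
    """Return True if the parsed rules permit user_agent to fetch url."""
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    rules = build_parser(SAMPLE_ROBOTS_TXT)
    page = "https://example.com/r/example/new"  # hypothetical URL
    for agent in ("ExampleAllowedBot", "ExampleBlockedBot"):
        verdict = "allowed" if can_crawl(rules, agent, page) else "blocked"
        print(f"{agent}: {verdict}")
```

Keep in mind that robots.txt is advisory: it states the site owner's wishes, and whether a given crawler honors them is up to that crawler.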