首页 News 正文

Hard Google and Microsoft Reddit insist on data charges or block search engine crawlers

红花少年压
1272 0 0

According to a report by the Washington Post on Friday, the aggregating news website Reddit is in discussions with AI giants regarding data payment. If the two parties cannot reach an agreement, Reddit may cut off services for Google and Bing, prohibiting crawlers from search engines such as Google and Bing from obtaining content from the platform.
This will force users to log in to their Reddit account to obtain the information they want. That is to say, Reddit's content will not be displayed in Google and Bing's searches.
In response, the Washington Post's subsequent correction report, as well as The Verge's latest report, pointed out that Reddit denied the claim in the above report that "users must log in to the platform to view content", and as for "blocking search engine crawlers", the official did not deny it. The source also said, "Without searching (the website), Reddit can survive
Reddit is the most frequently visited news website by Americans, where users can create and share content. It is known as the "American version of Baidu Tieba" and currently has over 130000 active communities. According to the company's data at the end of 2020, it has over 1.5 billion registered users, 430 million monthly active users, and 52 million daily active users.
Training AIGC tools requires massive amounts of data, and Reddit has accumulated a large amount of user generated content, all of which are high-quality AI training data, which has enabled this company to find business opportunities.
In April, Reddit announced that it would charge data usage fees to companies using its API to train AI chat robots, including Microsoft, Google, OpenAI, and others; In June, its higher than industry average fee standard was exposed - $12000 per 50 million API requests.
If the massive data assets provide Reddit with the possibility of charging, its listing plan highlights the necessity of Reddit charging.
Previously, insiders said that Reddit's goal was to eventually go public later this year - possibly in the second half of the year. Reddit and other companies, including Instaart, are updating their IPO documents to prepare for potential IPOs when market conditions improve.
Multi party pressure on AI giants to end the era of free data?
At present, the AIGC wave is sweeping through companies with data assets, such as Reddit and X (formerly Twitter), waiting to be sold. It is understood that X's pricing is higher than Reddit. According to previous reports by WIRED, the cheapest package provided by X is: paying $42000 per month to access 50 million tweets.
Companies represented by newspaper publishers choose to build high walls. The Washington Post reported that since August, at least 535 news organizations (including The New York Times, Reuters, and The Washington Post) have installed interceptors to prevent their content from being captured by companies such as OpenAI and used to train products such as ChatGPT.
The purpose is the same - hoping to gain a share in the AIGC market. According to Semafor's July report, the media group IAC, which owns The Daily Beast, is attempting to establish a publisher alliance aimed at winning billions of dollars from AI companies through litigation or legislative action. In August, NPR reported that the New York Times was also considering filing a lawsuit against OpenAI.
In addition to the fee requirements of large companies, large AI companies also face personal pressure, with a large number of authors, artists, and software programmers filing copyright lawsuits, demanding compensation for infringement losses and sharing profits. According to previous reports from Reuters, former Arkansas Governor Mike Huckabee has joined the class action lawsuit against Meta, Microsoft, and Bloomberg as plaintiffs, accusing them of using pirated books to train AI.
Bloomberg stated that by 2032, this market (data fee market) is expected to reach $1.3 trillion.
Of course, behind the fees, it's not just a matter of money. Many companies view data usage as a survival issue and worry that AI will learn from their own data and instead poach their own users. Prashanth Chandrasekar, CEO of Stack Overflow, a question and answer platform for programmers, stated that one month after OpenAI launched GPT-4, as programmers turned to AI to seek answers to coding questions, the traffic in the coding community Stack Overflow decreased by 15%. He believes that artificial intelligence has been trained in Stack Overflow data.
The latest news shows that Stack Overflow has laid off 28% of employees.
At present, whether it is media groups or mainstream social platforms, they are still in a tug of war with AI giants. Whether or not they need to pay, how to charge, and companies with different discourse rights will receive different results.
CandyLake.com 系信息发布平台,仅提供信息存储空间服务。
声明:该文观点仅代表作者本人,本文不代表CandyLake.com立场,且不构成建议,请谨慎对待。
您需要登录后才可以回帖 登录 | 立即注册

本版积分规则

  •   知名做空机构香橼研究(Citron Research)周四(11月21日)在社交媒体平台X上发布消息称,该公司已决定做空“比特币大户”微策略(Microstrategy)这家公司,并认为该公司已经将自己变身成为一家比特币投资基金 ...
    caffycat
    7 小时前
    支持
    反对
    回复
    收藏
  •   每经AI快讯,11月20日,文远知行宣布旗下自动驾驶环卫车S6与无人扫路机S1分别在新加坡滨海湾海岸大道与滨海艺术中心正式投入运营。据介绍,这是新加坡首个商业化运营的自动驾驶环卫项目。 ...
    star8699
    前天 19:48
    支持
    反对
    回复
    收藏
  •   上证报中国证券网讯(记者王子霖)11月20日,斗鱼发布2024年第三季度未经审计的财务报告。本季度斗鱼依托丰富的游戏内容生态,充分发挥主播资源和新业务潜力,持续为用户提供高质量的直播内容及游戏服务,进一步 ...
    goodfriendboy
    前天 20:09
    支持
    反对
    回复
    收藏
  •   人民网北京11月22日电 (记者栗翘楚、任妍)2024广州车展,在新能源汽车占据“半壁江山”的同时,正加速向智能网联新能源汽车全面过渡,随着“端到端”成为新宠,智能驾驶解决方案成为本届广州车展各大车企竞 ...
    3233340
    1 小时前
    支持
    反对
    回复
    收藏
红花少年压 新手上路
  • 粉丝

    0

  • 关注

    0

  • 主题

    0