2025-10-31 Update From: SLTechnology News&Howtos
Shulou(Shulou.com)11/24 Report--
CTOnews.com, June 26: A research team at the Massachusetts Institute of Technology (MIT) recently published a paper arguing that existing third-party tools for automatically detecting Twitter bot accounts are not accurate, because the datasets they rely on are too simple and lack generality.
Earlier reports said that the large number of bot accounts was one of the obstacles to Musk's acquisition of Twitter. Twitter claimed at the time that 5% of its daily active users were bot accounts, but Musk said the real figure was much higher than 5%.
Twitter has its own bot account identification system, but it has not been made public. For the general public, third-party tools are therefore the more feasible detection method. These tools use machine learning models trained on datasets collected from Twitter to detect telltale signs of bots. Many such tools and models have been used to study bot activity on social media, and thousands of related papers have been published.
▲ Public benchmark datasets for Twitter bot detection
Most of the benchmark datasets in these papers were collected from different sets of tweets, many of them from narrowly defined ones (such as tweets carrying a specific hashtag), with each account manually labeled as a bot or a human. However, a bot detection model trained on such data performs well only within its own narrow domain and does not cover other domains. It relies heavily on the specifics of its dataset rather than on fundamental differences between bots and humans.
When these models are tested on datasets from other domains, their accuracy is very poor, close to that of random guessing. Meanwhile, on many datasets even relatively simple models are as accurate as state-of-the-art (SOTA) machine learning models.
▲ Performance comparison of simple models and SOTA models on the benchmark datasets
In other words, a model trained on one dataset does not generalize to other datasets, and the existing bot detection datasets themselves generalize poorly because they were collected in overly simple ways.
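The cross-dataset effect described above can be illustrated with a small, purely synthetic sketch. It does not use the paper's data, models, or features: the two datasets, the two numeric features, and the one-threshold "simple model" below are all assumptions made up for illustration. In dataset A only the first feature distinguishes bots, in dataset B only the second does, so a rule learned on A scores well on held-out A data but roughly at chance on B.

```python
import random

def make_dataset(n, signal_feature, seed):
    """Synthetic accounts with 2 features; only `signal_feature` depends on the label."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        label = rng.randint(0, 1)  # 1 = bot, 0 = human (hypothetical labels)
        features = [rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)]
        features[signal_feature] += 3.0 * label  # bots are shifted on one feature only
        rows.append((features, label))
    return rows

def accuracy(stump, rows):
    """Fraction of rows where 'feature > threshold' matches the bot label."""
    feature, threshold = stump
    correct = sum((x[feature] > threshold) == bool(y) for x, y in rows)
    return correct / len(rows)

def fit_stump(rows):
    """Simple model: choose the single (feature, threshold) rule with best training accuracy."""
    best_acc, best_stump = -1.0, (0, 0.0)
    for feature in range(2):
        for x, _ in rows:
            candidate = (feature, x[feature])
            acc = accuracy(candidate, rows)
            if acc > best_acc:
                best_acc, best_stump = acc, candidate
    return best_stump

# Dataset A: feature 0 carries the signal. Dataset B: feature 1 carries it.
train_a = make_dataset(400, signal_feature=0, seed=1)
test_a = make_dataset(400, signal_feature=0, seed=2)
test_b = make_dataset(400, signal_feature=1, seed=3)

stump = fit_stump(train_a)
in_domain = accuracy(stump, test_a)     # same collection process as training
cross_domain = accuracy(stump, test_b)  # different collection process

print(f"in-domain accuracy:    {in_domain:.2f}")
print(f"cross-domain accuracy: {cross_domain:.2f}")
```

The learned rule is a single threshold on one feature, which also mirrors the article's other observation: when a dataset is separable by such a shortcut, a very simple model can match far more complex ones on that dataset while learning nothing that transfers.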
Finally, the researchers warn that anyone using existing bot detection datasets should carefully consider what kinds of bias they may contain. They believe a fundamental solution is for social media platforms such as Twitter to provide researchers with rich, reliable data and high-quality ground-truth labels.
CTOnews.com encloses the address of the paper: click here to