
How Many Bots are on Twitter? The Question is Difficult to Answer

Kai-Cheng Yang

June 6, 2022

Twitter reports that fewer than 5 percent of accounts are fakes or spammers, commonly referred to as “bots.” Since his offer to buy Twitter was accepted, Elon Musk has repeatedly questioned these estimates, even dismissing Chief Executive Officer Parag Agrawal’s public response.

Later, Musk put the deal on hold and demanded more proof.

So why are people arguing about the percentage of bot accounts on Twitter?

As the creators of Botometer, a widely used bot detection tool, our group at the Indiana University Observatory on Social Media has been studying inauthentic accounts and manipulation on social media for over a decade. We brought the concept of the “social bot” to the foreground and first estimated their prevalence on Twitter in 2017.

Based on our knowledge and experience, we believe that estimating the percentage of bots on Twitter has become a very difficult task, and debating the accuracy of the estimate might be missing the point. Here is why.

What, Exactly, is a Bot?

To measure the prevalence of problematic accounts on Twitter, a clear definition of the targets is necessary. Common terms such as “fake accounts,” “spam accounts” and “bots” are used interchangeably, but they have different meanings. Fake or false accounts are those that impersonate people. Accounts that mass-produce unsolicited promotional content are defined as spammers. Bots, on the other hand, are accounts controlled in part by software; they may post content or carry out simple interactions, like retweeting, automatically.

These types of accounts often overlap. For instance, you can create a bot that impersonates a human to post spam automatically. Such an account is simultaneously a bot, a spammer and a fake. But not every fake account is a bot or a spammer, and vice versa. Coming up with an estimate without a clear definition only yields misleading results.
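To make the overlap concrete, here is a minimal sketch using Python sets. The account IDs, category memberships and total are invented for illustration and are not real Twitter data; the point is only that the "prevalence" figure changes with the definition being counted.

```python
# Hypothetical toy data: invented account IDs, not real Twitter accounts.
bots = {"a1", "a2", "a3", "a4"}      # controlled in part by software
spammers = {"a2", "a3", "a5"}        # mass-produce unsolicited promotional content
fakes = {"a3", "a6"}                 # impersonate people
total_accounts = 20                  # assumed size of the toy population


def prevalence(accounts, total):
    """Share of the population that falls in the given set of accounts."""
    return len(accounts) / total


# An account can belong to several categories at once.
bot_spammer_fake = bots & spammers & fakes   # in all three categories
any_category = bots | spammers | fakes       # in at least one category

# The same population yields different estimates under different definitions.
estimates = {
    "bots": prevalence(bots, total_accounts),
    "spammers": prevalence(spammers, total_accounts),
    "fakes": prevalence(fakes, total_accounts),
    "any of the three": prevalence(any_category, total_accounts),
}

for definition, share in estimates.items():
    print(f"{definition}: {share:.0%}")
print("in all three categories:", sorted(bot_spammer_fake))
```

In this toy population the estimate ranges from 10 percent to 30 percent depending solely on which definition is counted.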

Defining and distinguishing account types can also inform proper interventions. Fake and spam accounts degrade the online environment and violate platform policy. Malicious bots are used to spread misinformation, inflate popularity, exacerbate conflict through negative and inflammatory content, manipulate opinions, influence elections, conduct financial fraud and disrupt communication. However, some bots can be harmless or even useful, for example by helping disseminate news, delivering disaster alerts and conducting research.

Simply banning all bots is not in the best interest of social media users.

For simplicity, researchers use the term “inauthentic accounts” to refer to the collection of fake accounts, spammers and malicious bots. This is also the definition Twitter appears to be using. However, it is unclear what Musk has in mind.

Hard to Count

Even when a consensus is reached on a definition, there are still technical challenges to estimating prevalence.

[Figure: a network graph showing a circle composed of groups of colored dots, with lines connecting some of the dots.] Networks of coordinated accounts spreading COVID-19 information from low-credibility sources on Twitter in 2020. Credit: Pik-Mai Hui

External researchers do not have access to the same data as Twitter, such as IP addresses and phone numbers. This hinders the public’s ability to identify inauthentic accounts. But even Twitter acknowledges that the actual number of inauthentic accounts could be higher than it has estimated, because detection is challenging.

Inauthentic accounts evolve and develop new tactics to evade detection. For example, some fake accounts use AI-generated faces as their profiles. These faces can be indistinguishable from real ones, even to humans. Identifying such accounts is hard and requires new technologies.

Another difficulty is posed by coordinated accounts that appear to be normal individually but act so similarly to each other that they are almost certainly controlled by a single entity. Yet they are like needles in the haystack of hundreds of millions of daily tweets.
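One way to picture "acting so similarly" is to compare what pairs of accounts share. The sketch below is an illustrative assumption, not the method used by our group or by Twitter: it uses invented accounts and tweet IDs, measures overlap in retweeted content with Jaccard similarity, and flags pairs above an arbitrary threshold.

```python
from itertools import combinations

# Hypothetical toy data: which tweet IDs each account retweeted.
retweets = {
    "user_a": {"t1", "t2", "t3", "t4"},
    "user_b": {"t1", "t2", "t3", "t5"},
    "user_c": {"t7", "t8"},
    "user_d": {"t1", "t2", "t3", "t4"},
}


def jaccard(a, b):
    """Size of the intersection divided by size of the union (0.0 if both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def similar_pairs(activity, threshold=0.5):
    """Return account pairs whose shared activity meets the (assumed) threshold."""
    return [
        (u, v)
        for u, v in combinations(sorted(activity), 2)
        if jaccard(activity[u], activity[v]) >= threshold
    ]


for u, v in similar_pairs(retweets):
    print(u, v, round(jaccard(retweets[u], retweets[v]), 2))
```

Each account here looks unremarkable alone; only the pairwise comparison surfaces the pattern. The comparison also grows quadratically with the number of accounts, which hints at why the haystack is a problem at Twitter's scale.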

Finally, inauthentic accounts can evade detection through techniques such as swapping handles or automatically posting and deleting large volumes of content.

The distinction between inauthentic and genuine accounts is growing ever blurrier. Accounts can be hacked, bought or rented, and some users “donate” their credentials to organizations that post on their behalf. As a result, so-called “cyborg” accounts are controlled by both algorithms and humans. Similarly, spammers sometimes post legitimate content to obscure their activity.

We have observed a broad spectrum of behaviors mixing the characteristics of bots and people. Estimating the prevalence of inauthentic accounts requires applying a simplistic binary classification: authentic or inauthentic account. No matter where the line is drawn, mistakes are inevitable.
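The effect of drawing a line can be sketched with a toy example. The scores and labels below are invented for illustration (they are not Botometer output); the sketch assumes each account has a score between 0 and 1 and that an account is called inauthentic when its score meets a chosen threshold.

```python
# Hypothetical toy data: (score, truly_inauthentic) pairs, invented for illustration.
accounts = [
    (0.05, False), (0.20, False), (0.35, False), (0.45, True),
    (0.55, False), (0.60, True), (0.80, True), (0.95, True),
]


def classification_errors(accounts, threshold):
    """Return (false_positives, false_negatives) for a given cutoff."""
    false_pos = sum(1 for score, truth in accounts if score >= threshold and not truth)
    false_neg = sum(1 for score, truth in accounts if score < threshold and truth)
    return false_pos, false_neg


def estimated_prevalence(accounts, threshold):
    """Share of accounts flagged as inauthentic at the given cutoff."""
    flagged = sum(1 for score, _ in accounts if score >= threshold)
    return flagged / len(accounts)


for threshold in (0.3, 0.5, 0.7):
    fp, fn = classification_errors(accounts, threshold)
    share = estimated_prevalence(accounts, threshold)
    print(f"threshold={threshold}: flagged={share:.0%}, "
          f"false positives={fp}, false negatives={fn}")
```

Because one genuine account in this toy set scores higher than one inauthentic account, no threshold is error-free: a low cutoff flags genuine users, a high cutoff misses inauthentic ones, and the estimated prevalence swings from 25 percent to 75 percent.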

Missing the Big Picture

By focusing on the number of Twitter bots, the recent debate oversimplifies the issue and misses the real point: quantifying the harm of online abuse and manipulation by inauthentic accounts.

[Figure: screenshot of a web form.] Screenshot of the BotAmp application comparing likely bot activity around two topics on Twitter. Credit: Kaicheng Yang

Through BotAmp, a new tool from the Botometer family that anyone with a Twitter account can use, we have found that the presence of automated activity is not evenly distributed. For instance, the discussion about cryptocurrencies tends to show more bot activity than the discussion about cats. Therefore, whether the overall prevalence is 5 percent or 20 percent makes little difference to individual users; their experiences with these accounts depend on whom they follow and the topics they care about.
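A small arithmetic sketch shows why an overall figure can hide what individual users see. The counts below are invented for illustration and are not BotAmp measurements; they simply assume two topic communities of different sizes.

```python
# Hypothetical toy counts per topic, invented for illustration.
topics = {
    "cryptocurrency": {"accounts": 1_000, "likely_bots": 300},
    "cats": {"accounts": 9_000, "likely_bots": 200},
}

# Share of likely bots within each topic's discussion.
per_topic = {
    name: counts["likely_bots"] / counts["accounts"]
    for name, counts in topics.items()
}

# Share of likely bots across both topics combined.
total_accounts = sum(c["accounts"] for c in topics.values())
total_bots = sum(c["likely_bots"] for c in topics.values())
overall = total_bots / total_accounts

print(f"overall: {overall:.1%}")
for name, share in per_topic.items():
    print(f"{name}: {share:.1%}")
```

In this toy case the overall share is 5 percent, yet someone following the cryptocurrency discussion meets likely bots in 30 percent of accounts while someone following cats meets them in about 2 percent.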

Recent evidence suggests that inauthentic accounts might not be the only culprits responsible for the spread of misinformation, hate speech, polarization and radicalization. These issues typically involve many human users. For instance, our analysis shows that misinformation about COVID-19 was disseminated overtly on both Twitter and Facebook by verified, high-profile accounts.

Even if it were possible to precisely estimate the prevalence of inauthentic accounts, this would do little to solve these problems. A meaningful first step would be to acknowledge the complex nature of these issues. Doing so would help social media platforms and policymakers develop effective responses.

This article was first published on The Conversation, a global media resource that provides cutting edge ideas and people who know what they are talking about.
