I'm confused by how this data is being generated. For example, India is tagged with Punjabi, but somehow not Bengali, even though the latter is the second most widely spoken language in India, with about three times as many speakers.
Ironically, Punjabi would be way more useful in Pakistan, where it's actually the most common first language, but Pakistan is tagged only with Urdu (and no other language).
I'm also skeptical of the "physical safety" tags; they seem inconsistent as well in a way that's difficult to reconcile.
OP does not list all the major languages either. I have a feeling that, just the four main southern languages & its dialects will outrank all northern European languages combined (except English technically)
I'm confused by how this data is being generated. For example, India is tagged with Punjabi, but somehow not Bengali, even though the latter is the second most widely spoken language in India, with about three times as many speakers.
Ironically, Punjabi would be way more useful in Pakistan, where it's actually the most common first language, but Pakistan is tagged only with Urdu (and no other language).
I'm also skeptical of the "physical safety" tags; they seem inconsistent as well in a way that's difficult to reconcile.