This is a very interesting question, and I doubt that there is a way to get a solid single number. But since this question is asking what possible data points can be used, I feel that bringing up ELPA and MELPA is useful.
I think it is accurate to say that at some point in their career with Emacs, a user is going to download a tool that is hosted on ELPA
or MELPA
. So if it is possible to gather numbers regarding users from those repositories, that could serve as a way to get data.
I have not been able to find a way to gather user numbers from ELPA
. However, MELPA
seems to list the number of downloads next to the packages that it lists. If we sort by number of DLs per package, we can see that the dash
package has 4,264,755 DLs at time of writing. The next most popular package is magit
at 3,452,140 DLs. MELPA
claims that 291,308,600 package downloads to date have occurred.
Obviously these numbers have problems:
- I don't know what time period these DLs for specific packages represent. I believe they are lifetime DLs because the total downloads says it is to-date. However, I do not know if that claim extends to the individual packages.
- While these popular packages are well used in the emacs community, we cannot guarantee all users of emacs have used these packages. Nevertheless, it should be noted that
dash
is a dependency of many other packages which can be seen on the package summary. So an emacs user does not need to intentionally download dash
to be listed as a DL, and would be listed if they were using other popular packages, such as flycheck
, magit
, org-super-agenda
, or treemacs
.
- We cannot use these numbers to account for people who have never
interacted with a repo.
- These DLs do not represent individual users. At the very least, I know I use emacs on multiple computers and have downloaded magit for both of them.
- We have no idea how many of these DLs are active emacs users. Only that the package had been downloaded that many times.
- The numbers of DLs on each individual package should not be viewed in isolation. Taking the two packages mentioned above should not be read to imply that there are 7.7 million unique users of those packages because that is the sum of DLs currently -- These figures include users downloading multiple packages.
I think it could be a viable project for a savvy user to view the rate of increase for the DLs and use that rate to estimate active or new users. If someone were to do such a project, their findings could offer a unique perspective to this question.