The last weeks I was mainly busy with learning python. I wrote a web spider that starts at one particular page, from there on grabs all external hyper links from the index page and then grabs the main index page from these pages to recursively crawl through the web and find as many hosts as possible. After understanding pythons basics and having a smoothly running web crawler I was curious what Content Management System these pages are using or if they are using a CMS at all.
I’ve found 70’186 hosts and 17’002 of them responded they have a CMS running. That makes about 24.4%.
From these 17’002 systems the following CMS types were identified :
| WordPress | 72% | 12’268 |
| Drupal | 11.7% | 1’985 |
| Joomla | 8.7% | 1’476 |
| Blogger | 6.7 % | 1’135 |
| DotNetNuke | 0.8% | 133 |
| Textpattern | 0.03% | 5 |
Among all these CMSs there were also outdated versions. Quite outdated like WordPress 2.1 and even below 2.0.