-- Leo's gemini proxy

-- Connecting to gemini.bortzmeyer.org:1965...

-- Connected

-- Sending request

-- Meta line: 20 text/gemini; lang=en

Statistics on the Gemini space


This page presents some statistics on the current state of the Gemini space. It has been updated on 2022-11-01 01:04:02Z.


It cannot claim to represent the entire space. The real number of URIs is certainly higher. There are several reasons why many URIs are not in the database:


the capsule may forbid retrieval, through robots.txt,

we do not know all the URIs and some cannot be found from the ones we know,

Lupa has a maximum number of URIs per capsule, to save resources (currently 10000).


On this page, "working" means there was a successful connection recently. "recently" means "less than 31 days". "Dead" URLs and capsules are removed after 46 days and no longer appear in any statistics.




Currently, our database includes 481,925 URIs, 377,684 of them having been checked successfully (status code 20) and recently. Among the recently accessed, 291,614 URIs serve a Gemini content.



Resources


The average size of the resources is 35,120 bytes.


Quantiles


10% of the resources are 194 bytes or less,

20% of the resources are 489 bytes or less,

30% of the resources are 792 bytes or less,

40% of the resources are 1,273 bytes or less,

50% of the resources are 2,367 bytes or less, MEDIAN

60% of the resources are 4,033 bytes or less,

70% of the resources are 6,139 bytes or less,

80% of the resources are 12,019 bytes or less,

90% of the resources are 45,562 bytes or less,

100% of the resources are 4,156,230 bytes or less.


Quantiles only for Gemini pages


10% of the resources are 143 bytes or less,

20% of the resources are 378 bytes or less,

30% of the resources are 653 bytes or less,

40% of the resources are 894 bytes or less,

50% of the resources are 1,382 bytes or less, MEDIAN

60% of the resources are 2,415 bytes or less,

70% of the resources are 3,935 bytes or less,

80% of the resources are 5,561 bytes or less,

90% of the resources are 9,973 bytes or less,

100% of the resources are 4,156,230 bytes or less.


Ranges


Less than 10 bytes: 1190 URLs (0.32 %)

10 to 100 bytes: 25902 URLs (6.9 %)

100 to 1000 bytes: 106787 URLs (28.3 %)

1 to 10 kbytes: 159304 URLs (42.2 %)

10 to 100 kbytes: 62255 URLs (16.5 %)

100 to 1000 kbytes: 16570 URLs (4.4 %)

More than 1000 kbytes: 5676 URLs (1.50 %)




Most common media (MIME) types


text/gemini: 291,616 URLs

text/plain: 31,531 URLs

image/jpeg: 17,378 URLs

image/png: 14,190 URLs

application/octet-stream: 3,982 URLs

application/pdf: 3,462 URLs

image/gif: 2,415 URLs

octet/stream: 2,216 URLs

text/html: 1,801 URLs

application/zip: 1,330 URLs

audio/mpeg: 1,210 URLs

application/x-mscardfile: 1,198 URLs

text/x-diff: 750 URLs

application/json: 691 URLs

image/webp: 301 URLs

application/gzip: 216 URLs

audio/ogg: 210 URLs

application/atom+xml: 208 URLs

text/markdown: 202 URLs

application/lagrange-fontpack+zip: 200 URLs


Most common languages



Unspecified: 312,686 URLs

en: 45,126 URLs

de: 11,072 URLs

fr: 5,695 URLs

fi: 1,162 URLs

es: 428 URLs

es_ar: 397 URLs

ru: 249 URLs

en_us: 145 URLs

it: 123 URLs

pl: 99 URLs

ca: 86 URLs

ko: 72 URLs

gl: 54 URLs

sco,gd,it,en: 39 URLs

sv: 37 URLs

pl,en: 26 URLs

eo: 24 URLs

hu: 22 URLs

pl_pl: 18 URLs


Most common language tags



Unspecified: 312,644 URLs

en: 21,623 URLs

en-gb: 12,561 URLs

de: 11,047 URLs

en-us: 10,273 URLs

fr: 5,140 URLs

fi: 1,162 URLs

fr-fr: 555 URLs

es-es: 417 URLs

en-ie: 412 URLs

es_ar: 397 URLs

en-au: 215 URLs

en_us: 145 URLs

ru: 125 URLs

ru-ru: 124 URLs

it: 123 URLs

pl: 97 URLs

ca-es: 84 URLs

ko: 72 URLs

gl-es: 54 URLs


Most common encodings ("charsets") for all files


(Remember there exists testing capsules, with very exotic encodings, so don't be surprised by some strange ones.)



Unspecified: 342,825 URLs

utf-8: 25,346 URLs

us-ascii: 9,480 URLs

binary: 20 URLs

gzip: 7 URLs

bzip2: 2 URLs

u: 2 URLs

windows-1252: 1 URLs

cp437: 1 URLs

iso-8859-1: 1 URLs

utf-16: 1 URLs


Most common encodings for gemtext files only



Unspecified: 274,037 URLs

utf-8: 17,575 URLs

cp437: 1 URLs

iso-8859-1: 1 URLs

utf-16: 1 URLs

windows-1252: 1 URLs


By the way, 1,922 of recently tested URLs (0.410 %) have a wrong encoding (it does not match the actual content).



Status codes


(Remember there are test capsules with funny status codes, to exercice Gemini clients.)



20 (Success): 377,686 occurrences (86.86 %)

51 (Not found): 18,967 occurrences (4.36 %)

44 (Slow down): 7,277 occurrences (1.67 %)

40 (Temporary failure): 6,951 occurrences (1.60 %)

50 (Permanent failure): 5,609 occurrences (1.29 %)

60 (Client certificate request): 5,512 occurrences (1.27 %)

30 (Temporary redirect): 4,751 occurrences (1.09 %)

42 (CGI error): 4,025 occurrences (0.93 %)

10 (Input request): 3,254 occurrences (0.75 %)

31 (Permanent redirect): 617 occurrences (0.14 %)

41 (Server unavailable): 43 occurrences (0.01 %)

59 (Bad request): 41 occurrences (0.01 %)


Links



(We count only backlinks from external capsules, and at most one link per capsule. Also, we exclude links from capsules like search engines or directories.)


Maximum number of incoming links: 227


Average number of incoming links: 0.18



Capsules


There are 2842 capsules. We successfully connected recently to 2176 of them.


Most common capsules by number of working URLs


We have a limit of 10000 URLs per capsule.



blitter.com: 10000 URLs

gemini.thebackupbox.net: 10000 URLs

midnight.pub: 10000 URLs

gemini.conman.org: 10000 URLs

hoagie.space: 9993 URLs

jsreed5.org: 9991 URLs

taz.de: 9989 URLs

gemini.spam.works: 9985 URLs

gemini.techrights.org: 9982 URLs

wikipedia.geminet.org:1966: 9946 URLs

jpfox.fr: 9929 URLs

auragem.space: 9838 URLs

gemini.omarpolo.com: 9111 URLs

tilde.team: 8619 URLs

gemini.autonomy.earth: 8255 URLs

vps01.rdelaage.ovh: 8098 URLs

ecs.d2evs.net: 7354 URLs

rawtext.club: 7302 URLs

gemini.knusbaum.com: 7044 URLs

mastogem.picasoft.net: 7030 URLs


Most common capsules by number of bytes in working URLs


We have a limit of bytes per URL.

Not properly documented yet



jpfox.fr: 1069.4 megabytes

uscoffings.net: 911.9 megabytes

blitter.com: 824.2 megabytes

nytpu.com: 664.5 megabytes

yam655.com: 595.1 megabytes

gael.mooo.com: 555.1 megabytes

hoagie.space: 515.2 megabytes

snowcode.ovh: 300.7 megabytes

ecs.d2evs.net: 286.5 megabytes

multiverse.thruhere.net: 233.6 megabytes

si3t.ch: 217.7 megabytes

skyjake.fi: 213.7 megabytes

tweek.zyxxyz.eu: 207.3 megabytes

mikelynch.org: 202.6 megabytes

gemini.spam.works: 197.3 megabytes

shit.cx: 182.3 megabytes

gemini.techrights.org: 174.2 megabytes

tilde.team: 168.8 megabytes

gemini.conman.org: 151.7 megabytes

wikipedia.geminet.org:1966: 144.2 megabytes

kota.nz: 120.1 megabytes



All working capsules:


As a text file

As a gemtext, with links




Certificates


1942 (89.2 %) capsules are self-signed, 192 (8.8 %) use the Certificate Authority Let's Encrypt, 42 (1.9 %) are signed by another CA (may be not a trusted one).



59 capsules (2.74 %) have an expired certificate.



Algorithms:


ecdsa-with-SHA256: 1442 capsules

sha256WithRSAEncryption: 722 capsules

ED25519: 12 capsules

ecdsa-with-SHA512: 3 capsules

sha512WithRSAEncryption: 3 capsules

ecdsa-with-SHA384: 1 capsules


Key types:


ECDSA: 1466 capsules

RSA: 706 capsules

ED25519: 11 capsules


Key sizes for RSA:


2048: 413 capsules

4096: 284 capsules

3072: 6 capsules

1024: 2 capsules

3584: 1 capsules


Key sizes for ECDSA:


256: 1376 capsules

384: 88 capsules

521: 2 capsules


TLS


95 % of the capsules use TLS 1.3, 5 % use TLS 1.2.



robots.txt


230 (11 %) the capsules have a robots.txt exclusion file.



Ports


11 working capsules (0.5 %) use an alternative port



Addresses


1150 IP addresses used. 16 % are IPv6.




Addresses with most virtual hosts



173.230.145.243: 685 vhosts

68.133.1.71: 190 vhosts

213.219.38.200: 189 vhosts

173.195.146.139: 92 vhosts

86.221.250.139: 28 vhosts

86.194.173.37: 27 vhosts

109.237.26.252: 19 vhosts

90.65.170.44: 18 vhosts

45.56.93.217: 17 vhosts

216.238.66.109: 15 vhosts

104.245.33.223: 8 vhosts

52.51.189.88: 8 vhosts

2a01:4f9:c010:e919::1: 7 vhosts

135.181.153.189: 7 vhosts

85.208.51.149: 7 vhosts

139.162.187.208: 6 vhosts

75.90.46.88: 6 vhosts

173.187.191.21: 6 vhosts

89.234.140.141: 6 vhosts

173.187.230.187: 6 vhosts


TLDs


There are 238 TLDs in the capsule's names, and 1468 registered domains.


Most common TLDs




By number of registered domains



com: 223 domains

net: 130 domains

org: 122 domains

xyz: 108 domains

space: 66 domains

de: 45 domains

me: 41 domains

dev: 38 domains

site: 37 domains

eu: 30 domains

info: 25 domains

uk: 24 domains

fr: 22 domains

io: 21 domains

club: 20 domains

online: 13 domains

ca: 12 domains

se: 12 domains

ch: 12 domains

us: 11 domains



By number of capsules


(There's a strong bias towards TLDs which have hosting services such as flounder.online, which has many capsules in subdomains. See before the TLDs per registered domains, which are probably more useful.)



online: 694 capsules

org: 349 capsules

com: 262 capsules

pub: 192 capsules

net: 154 capsules

xyz: 118 capsules

space: 85 capsules

de: 54 capsules

club: 44 capsules

me: 42 capsules

dev: 40 capsules

site: 40 capsules

eu: 37 capsules

info: 31 capsules

casa: 29 capsules

uk: 27 capsules

io: 26 capsules

fr: 23 capsules

us: 19 capsules

ch: 16 capsules



Other statistics on the geminispace


At the search engine geminispace.info

At the search engine TLGS

By Nervuri (specially for certificates)


Contact


Maintained by Stéphane Bortzmeyer (email <stephane+gemini@bortzmeyer.org>). Comments and criticisms are welcome.


Home page of the crawler

Source code of the crawler


My capsule



-- Response ended

-- Page fetched on Tue Jun 18 21:05:01 2024