-- Leo's gemini proxy

-- Connecting to gemini.bortzmeyer.org:1965...

-- Connected

-- Sending request

-- Meta line: 20 text/gemini; lang=en

Statistics on the Gemini space


This page presents some statistics on the current state of the Gemini space. It has been updated on 2023-10-01 03:04:01Z.


It cannot claim to represent the entire space. The real number of URIs is certainly higher. There are several reasons why many URIs are not in the database:


the capsule may forbid retrieval, through robots.txt,

we do not know all the URIs and some cannot be found from the ones we know,

Lupa has a maximum number of URIs per capsule, to save resources (currently 10000).


On this page, "working" means there was a successful connection recently. "recently" means "less than 31 days". "Dead" URLs and capsules are removed after 46 days and no longer appear in any statistics.




Currently, our database includes 591,535 URIs, 475,970 of them having been checked successfully (status code 20) and recently. Among the recently accessed, 367,640 URIs serve a Gemini content.



Resources


The average size of the resources is 40,199 bytes.


Quantiles


10% of the resources are 277 bytes or less,

20% of the resources are 588 bytes or less,

30% of the resources are 948 bytes or less,

40% of the resources are 1,624 bytes or less,

50% of the resources are 2,793 bytes or less, MEDIAN

60% of the resources are 5,178 bytes or less,

70% of the resources are 7,628 bytes or less,

80% of the resources are 16,935 bytes or less,

90% of the resources are 58,683 bytes or less,

100% of the resources are 4,156,230 bytes or less.


Quantiles only for Gemini pages


10% of the resources are 246 bytes or less,

20% of the resources are 471 bytes or less,

30% of the resources are 784 bytes or less,

40% of the resources are 1,143 bytes or less,

50% of the resources are 1,932 bytes or less, MEDIAN

60% of the resources are 3,176 bytes or less,

70% of the resources are 5,463 bytes or less,

80% of the resources are 7,476 bytes or less,

90% of the resources are 16,521 bytes or less,

100% of the resources are 4,156,230 bytes or less.


Ranges


Less than 10 bytes: 3257 URLs (0.68 %)

10 to 100 bytes: 11846 URLs (2.5 %)

100 to 1000 bytes: 132858 URLs (27.9 %)

1 to 10 kbytes: 202659 URLs (42.6 %)

10 to 100 kbytes: 92057 URLs (19.3 %)

100 to 1000 kbytes: 25525 URLs (5.4 %)

More than 1000 kbytes: 7768 URLs (1.63 %)




Most common media (MIME) types


text/gemini: 367,640 URLs

text/plain: 29,430 URLs

image/png: 19,191 URLs

image/jpeg: 18,292 URLs

application/octet-stream: 16,295 URLs

application/pdf: 4,675 URLs

application/zip: 3,143 URLs

octet/stream: 2,214 URLs

text/html: 2,004 URLs

image/gif: 2,003 URLs

audio/mpeg: 1,271 URLs

application/x-mscardfile: 1,199 URLs

MIME: 1,061 URLs

text/x-diff: 978 URLs

text/markdown: 696 URLs

application/json: 694 URLs

application/atom+xml: 566 URLs

text/xml: 545 URLs

audio/ogg: 403 URLs

application/xml: 309 URLs


Most common languages



Unspecified: 370,791 URLs

en: 75,924 URLs

de: 11,062 URLs

it: 7,129 URLs

fr: 5,341 URLs

es: 1,505 URLs

es_ar: 1,171 URLs

ja: 1,128 URLs

ru: 519 URLs

en_gb: 369 URLs

en_us: 223 URLs

pl: 153 URLs

ko: 97 URLs

ca: 86 URLs

sv: 69 URLs

eo: 54 URLs

gl: 54 URLs

sco,gd,it,en: 38 URLs

pl,en: 31 URLs

en,he: 27 URLs


Most common language tags



Unspecified: 370,746 URLs

en: 35,638 URLs

en-us: 20,515 URLs

en-gb: 19,033 URLs

de: 11,001 URLs

it: 7,129 URLs

fr: 4,389 URLs

es-es: 1,479 URLs

es_ar: 1,171 URLs

ja: 1,128 URLs

fr-fr: 952 URLs

en-ie: 473 URLs

ru-ru: 399 URLs

en_gb: 369 URLs

en_us: 223 URLs

ru: 120 URLs

pl: 105 URLs

en-au: 104 URLs

ko: 97 URLs

ca-es: 84 URLs


Most common encodings ("charsets") for all files


(Remember there exists testing capsules, with very exotic encodings, so don't be surprised by some strange ones.)



Unspecified: 402,518 URLs

utf-8: 63,930 URLs

us-ascii: 9,474 URLs

gzip: 24 URLs

binary: 20 URLs

bzip2: 2 URLs

xz: 1 URLs

iso-8859-1: 1 URLs


Most common encodings for gemtext files only



Unspecified: 313,489 URLs

utf-8: 54,150 URLs

iso-8859-1: 1 URLs


By the way, 1,255 of recently tested URLs (0.222 %) have a wrong encoding (it does not match the actual content).



Status codes


(Remember there are test capsules with funny status codes, to exercice Gemini clients.)



20 (Success): 475,970 occurrences (87.13 %)

51 (Not found): 21,446 occurrences (3.93 %)

50 (Permanent failure): 18,540 occurrences (3.39 %)

40 (Temporary failure): 12,615 occurrences (2.31 %)

60 (Client certificate request): 5,588 occurrences (1.02 %)

53 (Proxy request refused): 4,938 occurrences (0.90 %)

10 (Input request): 3,898 occurrences (0.71 %)

42 (CGI error): 1,586 occurrences (0.29 %)

30 (Temporary redirect): 1,160 occurrences (0.21 %)

59 (Bad request): 185 occurrences (0.03 %)

44 (Slow down): 180 occurrences (0.03 %)

43 (Proxy error): 71 occurrences (0.01 %)


Links



(We count only backlinks from external capsules, and at most one link per capsule. Also, we exclude links from capsules like search engines or directories.)


Maximum number of incoming links: 281


Average number of incoming links: 0.18



Capsules


There are 3523 capsules. We successfully connected recently to 2558 of them.


Most common capsules by number of working URLs


We have a limit of 10000 URLs per capsule.



midnight.pub: 10000 URLs

gemini.conman.org: 10000 URLs

mirrors.apple2.org.za: 9997 URLs

rwv.io: 9996 URLs

jsreed5.org: 9995 URLs

hoagie.space: 9994 URLs

blitter.com: 9993 URLs

library.inu.red: 9991 URLs

gemlog.stargrave.org: 9990 URLs

caiofior.pollux.casa: 9987 URLs

spam.works: 9983 URLs

gemini.omarpolo.com: 9979 URLs

taz.de: 9975 URLs

tjp.lol: 9940 URLs

gemini.techrights.org: 9796 URLs

gemini.unlimited.pizza: 9785 URLs

gemini.knusbaum.com: 9742 URLs

mastogem.remorse.us: 9641 URLs

auragem.letz.dev: 9416 URLs

gemini.autonomy.earth: 9363 URLs


Most common capsules by number of bytes in working URLs


We have a limit of bytes per URL.

Not properly documented yet



mirrors.apple2.org.za: 2801.8 megabytes

nytpu.com: 1082.2 megabytes

uscoffings.net: 926.0 megabytes

blitter.com: 823.0 megabytes

gael.mooo.com: 752.3 megabytes

gem.librehacker.com: 750.8 megabytes

yam655.com: 597.2 megabytes

hoagie.space: 495.7 megabytes

skyjake.fi: 440.6 megabytes

gemini.omarpolo.com: 390.4 megabytes

gemini.zachdecook.com: 340.3 megabytes

library.inu.red: 324.7 megabytes

ecs.d2evs.net: 304.2 megabytes

jpfox.fr: 255.7 megabytes

tweek.zyxxyz.eu: 255.5 megabytes

shit.cx: 182.3 megabytes

phreedom.club: 178.2 megabytes

higeki.jp: 177.0 megabytes

gemini.techrights.org: 171.0 megabytes

rwv.io: 168.5 megabytes

going-flying.com: 167.7 megabytes



All working capsules:


As a text file

As a gemtext, with links




Certificates


2284 (89.3 %) capsules are self-signed, 212 (8.3 %) use the Certificate Authority Let's Encrypt, 62 (2.4 %) are signed by another CA (may be not a trusted one).



63 capsules (2.48 %) have an expired certificate.



Algorithms:


ecdsa-with-SHA256: 1651 capsules

sha256WithRSAEncryption: 887 capsules

ED25519: 14 capsules

ecdsa-with-SHA512: 4 capsules

sha512WithRSAEncryption: 2 capsules

ecdsa-with-SHA384: 1 capsules

sha384WithRSAEncryption: 1 capsules


Key types:


ECDSA: 1690 capsules

RSA: 856 capsules

ED25519: 14 capsules


Key sizes for RSA:


2048: 591 capsules

4096: 249 capsules

3072: 12 capsules

1024: 3 capsules

3584: 1 capsules


Key sizes for ECDSA:


256: 1616 capsules

384: 72 capsules

521: 2 capsules


TLS


98 % of the capsules use TLS 1.3, 2 % use TLS 1.2.



robots.txt


253 (10 %) the capsules have a robots.txt exclusion file.



Ports


11 working capsules (0.4 %) use an alternative port



Addresses


1192 IP addresses used. 17 % are IPv6.




Addresses with most virtual hosts



173.230.145.243: 815 vhosts

68.133.1.71: 336 vhosts

213.219.38.200: 221 vhosts

173.195.146.139: 103 vhosts

90.65.170.44: 29 vhosts

109.237.26.252: 24 vhosts

45.56.93.217: 17 vhosts

216.238.66.109: 13 vhosts

128.140.115.191: 11 vhosts

51.222.161.16: 8 vhosts

174.138.124.169: 7 vhosts

85.208.51.149: 7 vhosts

104.245.33.223: 6 vhosts

89.234.140.141: 6 vhosts

2a00:5881:4008:d00::: 6 vhosts

139.162.187.208: 6 vhosts

2a01:7e01::f03c:93ff:fedf:bffe: 6 vhosts

68.183.213.240: 5 vhosts

172.105.4.126: 5 vhosts

89.253.220.199: 5 vhosts


TLDs


There are 261 TLDs in the capsule's names, and 1747 registered domains.


Most common TLDs




By number of registered domains



com: 277 domains

net: 148 domains

org: 142 domains

xyz: 119 domains

space: 79 domains

de: 55 domains

dev: 52 domains

me: 46 domains

site: 46 domains

eu: 33 domains

fr: 31 domains

uk: 29 domains

io: 25 domains

club: 23 domains

info: 23 domains

online: 15 domains

se: 15 domains

ch: 15 domains

ru: 14 domains

ca: 13 domains



By number of capsules


(There's a strong bias towards TLDs which have hosting services such as flounder.online, which has many capsules in subdomains. See before the TLDs per registered domains, which are probably more useful.)



online: 833 capsules

org: 525 capsules

com: 336 capsules

pub: 235 capsules

net: 174 capsules

xyz: 129 capsules

space: 102 capsules

de: 62 capsules

dev: 55 capsules

site: 49 capsules

me: 47 capsules

club: 46 capsules

eu: 41 capsules

casa: 37 capsules

io: 34 capsules

fr: 33 capsules

uk: 32 capsules

info: 30 capsules

us: 19 capsules

ru: 19 capsules



Other statistics on the geminispace


At the search engine geminispace.info

At the search engine TLGS

By Nervuri (specially for certificates)


Contact


Maintained by Stéphane Bortzmeyer (email <stephane+gemini@bortzmeyer.org>). Comments and criticisms are welcome.


Home page of the crawler

Source code of the crawler


My capsule



-- Response ended

-- Page fetched on Sun Jun 2 00:46:51 2024