-- Leo's gemini proxy

-- Connecting to gemini.dimakrasner.com:1965...

-- Connected

-- Sending request

-- Meta line: 20 text/gemini

The Guppy Protocol Specification v0.2


(I don't know if anyone is interested in this, but let's give it a try. It started as a thought experiment, when I looked for 'lighter' protocols I can implement on the Pico W. I see Spartan mentioned here and there, and I wonder if there's any interest in going even more ... spartan. If you find this interesting, useful or fun, and have ideas how to improve this protocol, I'd love to hear from you at my-first-name@dimakrasner.com!)


Overview


Guppy is a simple unencrypted client-to-server protocol, for download of text and text-based interfaces that require upload of short input. It uses UDP and inspired by TFTP, DNS and Spartan. The goal is to design a simple, text-based protocol that is easy to implement and can be used to host a "guplog" on a microcontroller (like a Raspberry Pi Pico W, ESP32 or ESP8266) without the overhead of cryptography and TCP sockets.


Requests are always sent as a single packet, while responses can be chunked and each chunk must be acknowledged by the client. The protocol is designed for short-lived sessions that transfer small text files, therefore it doesn't allow failed downloads to be resumed, and doesn't allow upload of big chunks of data.


Changelog


v0.2:

(Response to feedback from slondr)

Packet sizes can go >= 512 bytes

Success, continuation and EOF packets now have an increasing sequence number in the status line

Servers are allowed to re-transmit packets

Clients must ignore duplicate packets

Clients must acknowledge EOF

Typo and silly mistake fixes

Terminology section

Sample client and server


v0.1:

First version


Sample Implementation


git clone -b guppy --recursive https://github.com/dimkr/gplaces
cd gplaces
make PREFIX=/tmp/guppy CONFDIR=/tmp/guppy/etc install
/tmp/guppy/bin/gplaces guppy://hd.206267.xyz

(server source code will be published soon)


Terminology


"Must" means a strict requirement, a rule all conformant Guppy client or server must obey.


"Should" means a recommendation, something minimal clients or servers should do.


"May" means a soft recommendation, something good clients or servers should do.


URLs


If no port is specified in a guppy:// URL, clients and servers must fall back to 6775 ('gu').


The fragment component in a guppy:// URL has no special meaning, but clients must not remove it when following links.


MIME Types


The client must support at least text/plain and text/gemini (without the Spartan := type) documents.


If encoding is unspecified, the client may assume it's UTF-8 or ASCII.


Download vs. Upload


In Guppy, all URLs can be (theoretically) accompanied by user-provided input. The client must provide the user with means for sending a request with user-provided input, to any link line.


Server and content authors should inform users when input is required and describe what kind of input, using the link's user-friendly description.


If input is expected but not provided by the user, the server must respond with an error packet.


Clients may cache the returned error and present it to the user on attempt to access the same page without providing input.


Security and Privacy


The protocol is unencrypted, and these concerns are beyond the scope of this document.


Implementers should use encrypted file formats for input and downloaded content, and arbitrary or even randomized paths (for example, /a.gmi rather than /my-post-about-chess.gmi), if eavesdropping is a concern.


Implementers should use external means, like digitally-signed documents or PGP, as proof of authenticity.


Limitations


Clients and servers may restrict packet size, to allow slower but more reliable transfer. However, the packet size limit must be at least 512 bytes.


Client and servers should handle timeouts gracefully and must distinguish between timeout and proper termination of the session.


Packet Types


There are 7 packet types:

Request

Success

Continuation

End-of-file

Acknowledgement

Redirect

Error


All packets begin with a "status line", followed by \r\n.


TL;DR -

The server responds to a request (a URL) by sending a success, redirect or error packet

A success packet consist of a sequence number, a MIME type and data

A continuation packet is a success packet without a MIME type

The end of the response is marked by a continuation packet without data (an end-of-file packet)

The client acknowledgemes every packet by sending its sequence number back to the server


Requests


	url\r\n

The query part specifies user-provided input, percent-encoded.


The server must respond with a success, redirect or error packet.


Success


	seq type\r\n
	$data

The sequence number is an arbitrary number between 2 and 2147483647 minus the number of packets needed to transmit the response.


The type field specifies the response MIME type and must not be empty.


The server may send a chunked response, by sending continuation packets after the success packet.


Clients should start displaying the response as soon as possible.


Continuation


	seq\r\n
	$data

The sequence number must increase 1 in every continuation packet.


The client must ignore packets where the sequence number is not the sequence number of the previous packet plus 1.


If the client begins to display a textual response before the entire response has been received, it must not assume that the response is split on a line boundary: a long line may be sent in multiple response packets.


End-of-file


	seq\r\n

The server must mark the end of the transmission by sending a continuation packet without any data, even if the response fits in a single packet.


Clients must wait for the "end of file" packet, to differentiate between timeout, a partially received response and a successfully received response.


Acknowledgement


	seq\r\n

The client must acknowledge every success, continuation or EOF packet by echoing its sequence number back to the server.


The server should wait for the client to acknowledge the previous chunk of the response (the success packet or the previous continuation packet) before sending the next continuation packet, to avoid waste of network bandwidth.


The server may attempt re-transmission of a success, continuation or EOF packet after a while, if not acknowledgement by the client.


The client may attempt re-transmission of an acknowledgement packet and the server must ignore acknowledgement packets it's not waiting for.


Redirect


	0 url\r\n

The URL may be relative.


The client must inform the user of the redirection.


The client may remember redirected URLs. The server must not assume clients don't do this.


The client should limit the number of redirects during the handling of a single request.


The client may forbid a series of redirects, and may prompt the user to confirm each redirect.


Error


	1 error\r\n

Clients must display the error to the user.


Examples


Success - Single Packet


If the URL is guppy://localhost/a and the response is "# Title 1\n":


	> guppy://localhost/a\r\n
	< 566837578 text/gemini\r\n# Title 1\n
	> 566837578 text/gemini\r\n
	< 566837579\r\n
	> 566837579\r\n

Success - Single Packet with User Input


If the URL is guppy://localhost/a and input is "b c":


	> guppy://localhost/a?b%20c\r\n
	< 566837578 text/gemini\r\n# Title 1\n
	> 566837578 text/gemini\r\n
	< 566837579\r\n
	> 566837579\r\n

Success - Multiple Packets


If the URL is guppy://localhost/a and the response is "# Title 1\nParagraph 1\n":


	> guppy://localhost/a\r\n
	< 566837578 text/gemini\r\n# Title 1\n
	> 566837578 text/gemini\r\n
	< 566837579\r\nParagraph 1
	> 566837579\r\n\r\n
	< 566837580\r\n\n
	> 566837580\r\n
	< 566837581\r\n
	> 566837581\r\n

Success - Multiple Packets With Unreliable Network


If the URL is guppy://localhost/a and the response is "# Title 1\nParagraph 1\n":


	> guppy://localhost/a\r\n
	< 566837578 text/gemini\r\n# Title 1\n
	> 566837578 text/gemini\r\n
	< 566837578 text/gemini\r\n# Title 1\n (acknowledgement arrived after the server re-transmitted the success packet)
	< 566837579\r\nParagraph 1
	< 566837579\r\nParagraph 1 (first continuation packet was lost)
	> 566837579\r\n
	< 566837580\r\n\n
	> 566837580\r\n
	> 566837580\r\n (first acknowledgement packet was lost and the client re-transmitted it while waiting for a continuation or EOF packet)
	< 566837581\r\n (server sends EOF after receiving the re-transmitted acknowledgement packet)
	< 566837581\r\n (first EOF packet was lost while server waits for client to acknowledge EOF)
	> 566837581\r\n

Redirect - Absolute URL


	> guppy://localhost/a\r\n
	< 0 guppy://localhost/b\r\n

Redirect - Relative URL


	> guppy://localhost/a\r\n
	< 0 /b\r\n

Error


	> guppy://localhost/search\r\n
	< 1 No search keywords specified\r\n

-- Response ended

-- Page fetched on Fri May 10 14:42:14 2024