TL;DR: The current implementation uses a 32K buffer size, for a total of 64K of buffers per connection, but each read/write is less than 2K according to my measurements.

# Background

The Snowflake proxy uses a particularly hot function, `copyLoop` (proxy/lib/snowflake.go), to proxy data between a Tor relay and a connected client. This is currently done using the `io.Copy` function to write all incoming data both ways. Looking at the `io.Copy` implementation, it internally uses `io.CopyBuffer`, which in turn defaults to a buffer of size 32K for copying data (I checked, and the current implementation allocates 32K every time).

Since `snowflake-proxy` is intended to be run in a very distributed manner, on as many machines as possible, minimizing the CPU and memory footprint of each proxied connection would be ideal, as would maximizing throughput for clients.

# Hypothesis

There might exist a buffer size `X` that is more suitable for use in `copyLoop` than 32K.

# Testing

## Using tcpdump

Assuming you use `-ephemeral-ports-range 50000:51000` for `snowflake-proxy`, you can capture the UDP packets being proxied using

```sh
sudo tcpdump -i <interface> udp portrange 50000-51000
```

which will provide a `length` value for each packet captured. One good starting value for `X` could then be slightly larger than the largest captured packet, assuming one packet is copied at a time. Experimentally, I found this value to be 1265 bytes, which would make `X = 2K` a possible starting point.

## Printing actual read sizes

The following snippet was added in `proxy/lib/snowflake.go`:

```go
// Taken straight from the standard library's io.copyBuffer,
// with a log line added on each read.
func copyBuffer(dst io.Writer, src io.Reader, buf []byte) (written int64, err error) {
	// If the reader has a WriteTo method, use it to do the copy.
	// Avoids an allocation and a copy.
	if wt, ok := src.(io.WriterTo); ok {
		return wt.WriteTo(dst)
	}
	// Similarly, if the writer has a ReadFrom method, use it to do the copy.
	if rt, ok := dst.(io.ReaderFrom); ok {
		return rt.ReadFrom(src)
	}
	if buf == nil {
		size := 32 * 1024
		if l, ok := src.(*io.LimitedReader); ok && int64(size) > l.N {
			if l.N < 1 {
				size = 1
			} else {
				size = int(l.N)
			}
		}
		buf = make([]byte, size)
	}
	for {
		nr, er := src.Read(buf)
		if nr > 0 {
			log.Printf("Read: %d", nr) // THIS IS THE ONLY DIFFERENCE FROM io.copyBuffer
			nw, ew := dst.Write(buf[0:nr])
			if nw < 0 || nr < nw {
				nw = 0
				if ew == nil {
					ew = errors.New("invalid write result")
				}
			}
			written += int64(nw)
			if ew != nil {
				err = ew
				break
			}
			if nr != nw {
				err = io.ErrShortWrite
				break
			}
		}
		if er != nil {
			if er != io.EOF {
				err = er
			}
			break
		}
	}
	return written, err
}
```

and `copyLoop` was amended to use this instead of `io.Copy`. The `Read: BYTES` lines were saved to a file using this command:

```sh
./proxy -verbose -ephemeral-ports-range 50000:50010 2>&1 >/dev/null | awk '/Read: / { print $4 }' | tee read_sizes.txt
```

I got the result:

- min: 8
- max: 1402
- median: 1402
- average: 910.305

Suggested buffer size: 2K. Current buffer size: 32768 (32K, experimentally verified).

## Using a Snowflake proxy in Tor Browser with Wireshark

I also captured the traffic with Wireshark and concluded that all packets sent were < 2K.

# Conclusion

As per the commit, I suggest changing the buffer size to 2K. Some things I have not been able to answer:

1. Does this make a big impact on performance?
2. Are there any unforeseen consequences? What happens if a packet is > 2K? (I think the Go standard library just splits the packet, but someone please confirm.)
# Snowflake

Pluggable Transport using WebRTC, inspired by Flashproxy.
## Structure of this Repository

- `broker/` contains code for the Snowflake broker
- `doc/` contains Snowflake documentation and manpages
- `client/` contains the Tor pluggable transport client and client library code
- `common/` contains generic libraries used by multiple pieces of Snowflake
- `proxy/` contains code for the Go standalone Snowflake proxy
- `probetest/` contains code for a NAT probetesting service
- `server/` contains the Tor pluggable transport server and server library code
## Usage
Snowflake is currently deployed as a pluggable transport for Tor.
### Using Snowflake with Tor
To use the Snowflake client with Tor, you will need to add the appropriate `Bridge` and `ClientTransportPlugin` lines to your torrc file. See the client README for more information on building and running the Snowflake client.
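As a rough illustration, such a torrc configuration might look like the following. The binary path, address, port, and fingerprint below are placeholders, not real values; consult the client README for the actual bridge lines.

```
# Hypothetical example -- all values are placeholders.
UseBridges 1
ClientTransportPlugin snowflake exec /path/to/snowflake-client
Bridge snowflake 192.0.2.3:80 <bridge-fingerprint>
```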
### Running a Snowflake Proxy

You can contribute to Snowflake by running a Snowflake proxy. You have the option of running a proxy in your browser or as a standalone Go program. See our community documentation for more details.
### Using the Snowflake Library with Other Applications
Snowflake can be used as a Go API, and adheres to the v2.1 pluggable transports specification. For more information on using the Snowflake Go library, see the Snowflake library documentation.
## Test Environment
There is a Docker-based test environment at https://github.com/cohosh/snowbox.
## FAQ
Q: How does it work?
In the Tor use-case:
- Volunteers visit websites which host the "snowflake" proxy. (just like flashproxy)
- Tor clients automatically find available browser proxies via the Broker (the domain fronted signaling channel).
- Tor client and browser proxy establish a WebRTC peer connection.
- Proxy connects to some relay.
- Tor occurs.
More detailed information about how clients, Snowflake proxies, and the Broker fit together is on the way...
Q: What are the benefits of this PT compared with other PTs?
Snowflake combines the advantages of flashproxy and meek. Primarily:
- It has the convenience of Meek, but can support magnitudes more users with negligible CDN costs. (Domain fronting is only used for brief signaling / NAT-piercing to set up the P2P WebRTC DataChannels which handle the actual traffic.)
- Arbitrarily high numbers of volunteer proxies are possible, as in flashproxy, but NATs are no longer a usability barrier - no need for manual port forwarding!
Q: Why is this called Snowflake?
It utilizes the "ICE" negotiation via WebRTC, and also involves a great abundance of ephemeral and short-lived (and special!) volunteer proxies...
## More info and links
We have more documentation in the Snowflake wiki and at https://snowflake.torproject.org/.
# Android AAR Reproducible Build Setup
Using `gomobile`, it is possible to build Snowflake as shared libraries for all the architectures supported by Android. This build is defined in .gitlab-ci.yml and runs in GitLab CI. It is also possible to run this setup in a virtual machine using Vagrant: run `vagrant up` to create and provision the VM, then `vagrant ssh` to get into the VM and use it as a development environment.
## uTLS Settings

Snowflake communicates with the broker, which serves as a signaling server, over a TLS-based domain-fronted connection. This connection may be identified by its use of the Go TLS stack.

uTLS is a software library designed to imitate the TLS Client Hello fingerprints of browsers and other popular software, in order to evade censorship based on TLS Client Hello fingerprinting. Enable it with `-utls-imitate`; you can use `-version` to see a list of supported values.

Depending on client and server configuration, it may not always work as expected, as not all extensions are correctly implemented.

You can also remove the SNI (Server Name Indication) from the Client Hello with `-utls-nosni` to evade censorship, though not all servers support this.