notgull

Explaining the internals of async-task from the ground up

2024-03-30T00:00:00+00:00

async-task is one of the most complicated crates in the smol ecosystem. But, fundamentally, it’s just a future on the heap.

I pride myself on smol packages being very easy to parse for anyone with a beginner’s level of experience in Rust. By that I mean, if you want to know how smol works, it should be very easy to pick up the source code, read through it, and understand how each individual part works.

Dependency Dog: Wait, do people normally read source code for fun?

notgull: No, I think that’s just a “me” thing.

There’s a few crates that are a little harder to take as bathroom reading. There’s polling, which does a lot of low-level system interaction to make asynchronous I/O work. I’ve done my best to make it interesting, but there’s not a whole lot to say about a crate that’s basically following the OS’s instruction manual.

Then there’s async-task. async-task’s philosophy runs counter to the rest of smol. When it comes to optimization, smol generally tries to go for safety and reasonability over crazy optimizations with diminishing returns. For async-task however, we take the gloves off. We go all out to make sure tasks are as small and use as few resources as possible.

notgull: This is actually because async-task predates smol! It was originally used as the task implementation for async-std.

I’d like to provide this series of blogposts as a reference for how async-task works, how you might arrive to an implementation like async-task organically, and how it was optimized into its current state.

notgull: As a heads up: most posts for this blog assume an intermediate knowledge of Rust. However, this post is intended for readers who may not already be familiar with concepts like executors or dynamic dispatch.

Of course, it may be a good idea to review the basics, even if you’re an expert.

Background Basics

Let’s say you have two Futures; blocks of asynchronous code that can be ran concurrently. You want to run both of them at once.

// Future #1
let foo = async {
    let x = my_function().await;
    do_something(x).await;
};

// Future #2
let bar = async {
    for _ in 0..50 {
        respond_to_user().await;
    }
};

Dependency Dog: Wait… Future? async? await? What’s that?

notgull: They’re Rust’s user-space concurrency building blocks! If you need a refresher on what these mean, it may be worth it to read the async book.

Running two futures at a time can be done very easily. First, we bring in the futures-lite crate:

$ cargo add futures-lite
    Updating crates.io index
      Adding futures-lite v2.3.0 to dependencies
             Features:
             + alloc
             + fastrand
             + futures-io
             + parking
             + race
             + std
             - memchr
    Updating crates.io index

Then, we can use the zip combinator to run both Futures at the same time. Finally, we can use block_on to poll the resulting Future until it completes. It looks like this:

use futures_lite::future;

// Run the two futures in parallel.
let combined = future::zip(foo, bar);

// Block on the combined future until it completes.
future::block_on(combined);

How the zip combinator works is as follows:

It tries to poll the first Future. If it is ready, it takes the result and saves it the memory. It remembers not to poll the first Future again.
It does the same thing for the second Future. It polls it if it hasn’t finished. If it has, it saves the result.
Once both Futures are finished, it returns a tuple of the result.

Using this strategy, we can poll two Futures at the same time. The following diagram shows what this looks like in practice:

Note that, even though only one thread of execution is used, it appears as though the futures are run at the same time.

Scalability Solutions

The zip combinator works for very simple cases of concurrency, but falls apart for higher-level scenarios. Let’s say you want to poll four futures at once. (The horror!)

let baz = async { /* ... */ };
let cap = async { /* ... */ };

Then, you would need to call zip three times!

let combined = future::zip(
    future::zip(foo, bar),
    future::zip(baz, cap)
);

You run into the some problems too, like:

You can only run a fixed number of futures at once. If you might run a variable number of futures, you’re out of luck.
What if you want to cancel one of the futures halfway through?
Each future is polled every time zip is woken up. This means polling zip is an O(n) operation. This is sometimes known as the “thundering herd” problem.

Let’s try to solve these problems. Without any prior art, I mean. We can solve the “fixed number of futures” problem pretty easily. Consider the [slab] crate, which lets us set up an indexed list of objects. It’s similar to an arena. We can fit our futures in there.

$ cargo add slab
    Updating crates.io index
      Adding slab v0.4.9 to dependencies
             Features:
             + std
             - serde
    Updating crates.io index

Let’s also box the Futures, so we can use multiple different implementors of Future in our same zip.

use slab::Slab;
use std::future::Future;
use std::pin::Pin;

struct GiantZip {
    // Completed futures are represented by `None`.
    futures: Slab<Option<Pin<Box<dyn Future<Output = ()>>>>>
}

impl GiantZip {
    fn new() -> Self {
        Self {
            futures: Slab::new()
        }
    }
}

Let’s have an insert method that can be used to add a new Future to this new zip-equivalent. It will return a key that can be used to look up the future in our list.

impl GiantZip {
    fn insert<F: Future<Output = ()> + 'static>(&mut self, future: F) -> usize {
        self.futures.insert(Some(Box::pin(future)))
    }
}

Dependency Dog: The Future needs to be 'static because it’s being boxed and pinned on the heap without a lifetime. It’s possible to work around this by adding a lifetime to GiantZip here, but let’s keep it simple for now.

Finally, let’s make it so polling the GiantZip tries to resolve every one of the futures contained within.

impl Future for GiantZip {
    type Output = ();

    fn poll(mut self: Pin<&mut Self>, cx: &mut Context<'_>) -> Poll<()> {
        let mut unfinished = false;

        for (_, future_slot) in self.futures.iter_mut() {
            if let Some(future) = future_slot.as_mut() {
                // Try to poll this future.
                match future.as_mut().poll(cx) {
                    Poll::Ready(()) => {
                        // Set the future to `None`.
                        *future_slot = None;
                    },

                    Poll::Pending => {
                        // We are unfinished; return Pending.
                        unfinished = true; 
                    }
                }
            }
        }

        if unfinished {
            Poll::Pending
        } else {
            Poll::Ready(())
        }
    }
}

Finally, we can test this out on futures that actually do something.

$ cargo add async-channel
    Updating crates.io index
      Adding async-channel v2.2.0 to dependencies
             Features:
             + std
    Updating crates.io index

// Create a channel with a capacity of 1.
let (sender, recv) = async_channel::bounded(1);

// This is basically an `async fn` that sends a number over the channel.
let our_future = |i: i32| {
    let sender = sender.clone();
    async move { sender.send(i).await.ok(); }
};

// Create a future that reads from the channel.
let reader = async move {
    for _ in 0..3 {
        println!("{}", recv.recv().await.unwrap());
    }
};

// Use the GiantZip to poll all of these at once.
let mut zipper = GiantZip::new();
zipper.insert(our_future(1));
zipper.insert(our_future(2));
zipper.insert(our_future(3));
zipper.insert(reader);

// Wait for them to finish.
future::block_on(zipper);

When we run it, we see this:

$ time cargo run -q
1
2
3
cargo run -q  0.03s user 0.03s system 93% cpu 0.064 total

That’s pretty fast, but that’s only because we have a low number of futures. If we have 10,000,000 futures (not an unrealistic number for a web server!), it will run much slower.

$ time cargo run -q
0
1
2
cargo run -q  9.16s user 0.70s system 100% cpu 9.803 total

notgull: It’s hard to express in the textual format, but each line had a few seconds’ delay between each of them. So it’s taking a while to get to the future that actually prints the line.

In addition to being inefficient, we’ve also stumbled upon another issue: GiantZip is unfair. The reader Future is at the very end of the futures list, which means it’s processed last when polling. Since a lot of the futures end up being blocked on reader, it means polling the GiantZip takes a lot longer than it normally should.

Thankfully, we can solve the O(n) problem and also (kind of) solve the fairness problem in one fell swoop. Instead of polling every future every time we poll, we should only poll the ones whose Wakers have been woken up. Since we know those ones are ready, we should only poll those.

Let’s add a queue structure to the GiantZip that contains the indexes of the futures that are ready to be woken. I’m wrapping it in an Arc and a Mutex for reasons that will become obvious later.

use std::collections::VecDeque;
use std::sync::{Arc, Mutex};

struct GiantZip {
    // Completed futures are represented by `None`.
    futures: Slab<Option<Pin<Box<dyn Future<Output = ()>>>>>,

    // NEW: Queue of futures that are waiting to be woken up.
    queue: Arc<Mutex<VecDeque<usize>>>,
}

impl GiantZip {
    fn new() -> Self {
        Self {
            futures: Slab::new(),
            queue: Arc::new(Mutex::new(VecDeque::new()))
        }
    }
}

Now, when we first insert the Future into the GiantZip, we have to mark it as ready. This is done by just pushing the index of the future into the queue.

impl GiantZip {
    fn insert<F: Future<Output = ()> + 'static>(&mut self, future: F) -> usize {
        // NEW: Save the index and push it to the back of the queue before returning.
        let index = self.futures.insert(Some(Box::pin(future)));
        self.queue.lock().unwrap().push_back(index);
        index
    }
}

We also need to have a way for the Future to mark itself as ready. The Future calls the Waker when it is ready to be woken up, so we can just create a Waker that wraps around the top-level Waker, but also marks the current future as ready.

We’ll bring in waker-fn to make this easier. A Waker is just a glorified callback, so we can easily represent it as one.

$ cargo add waker-fn
    Updating crates.io index
      Adding waker-fn v1.1.1 to dependencies
    Updating crates.io index

Let’s make creating the Waker a helper function on GiantZip, to keep things clean.

use waker_fn::waker_fn;
use std::task::Waker;

impl GiantZip {
    /// Create a waker that wakes the future in the provided slot.
    fn waker_for_slot(&self, index: usize, toplevel: &Waker) -> Waker {
        // Clone shared resources.
        // *This* is why we made `queue` wrapped in an `Arc`, by the way.
        let queue = self.queue.clone();
        let toplevel = toplevel.clone();

        // Create a waker.
        waker_fn(move || {
            // Mark the future as ready.
            queue.lock().unwrap().push_back(index);

            // Wake the toplevel `block_on` waker, so the GiantZip poll()
            // implementation is ran again.
            toplevel.wake_by_ref();
        })
    }
}

Finally, we can adjust the poll() implementation for GiantZip such that it pops from the queue instead of polling each and every future in the list.

impl GiantZip {
    /// Get the next index in the list.
    fn next_index(&self) -> Option<usize> {
        self.queue.lock().unwrap().pop_front()
    }
}

impl Future for GiantZip {
    type Output = ();

    fn poll(self: Pin<&mut Self>, cx: &mut Context<'_>) -> Poll<()> {
        // Get around Rust's pinning rules.
        let this = self.get_mut();

        // NEW: We drain the "queue" instead of iterating over every future.
        // Make sure not to hold the lock while polling; if a future is woken by another future,
        // it would deadlock otherwise.
        while let Some(index) = this.next_index() {
            // NEW: Create a waker to poll this future with.
            let waker = this.waker_for_slot(index, cx.waker());
            let mut slot_context = Context::from_waker(&waker);

            let future_slot = match this.futures.get_mut(index) {
                Some(slot) => slot,
                None => continue
            };
            if let Some(future) = future_slot.as_mut() {
                // Try to poll this future.
                match future.as_mut().poll(&mut slot_context) {
                    Poll::Ready(()) => {
                        // Set the future to `None`.
                        *future_slot = None;
                    },

                    Poll::Pending => {}
                }
            }
        }

        if this.futures.iter().any(|(_, fut)| fut.is_some()) {
            Poll::Pending
        } else {
            Poll::Ready(())
        }
    }
}

When we run the program, we see the following output:

$ time cargo run -q
0
1
2
cargo run -q  5.03s user 0.62s system 101% cpu 5.583 total

There is still quite a bit of contention caused by the initial burst of futures as well as the last burst of futures. But, there is no more delay between the printing of the numbers. This indicates that the runtime is being used more efficiently.

This also solves our cancellation problem; we can just add a remove method to the GiantZip to remove a keyed future from the list.

impl GiantZip {
    fn remove(&mut self, index: usize) {
        self.futures.remove(index);
    }
}

let key = zipper.insert(async { panic!() });
// Actually, might not be the best idea to run that task.
zipper.remove(key);

Now we’ve solved all of our problems… and introduced a million new ones.

Persistent Problems

I’ve deliberately made some mistakes in the above example, in order to illustrate how fixing those mistakes can lead to a very important data pattern. So let’s discuss those mistakes.

Dependency Dog: “Deliberately”, you say?

notgull: Hey, I’ll have you know, I’m just following the natural evolution of the async/await pattern.

Dependency Dog: Did the “natural evolution” force you to re-allocate a new Waker every time you polled a future?

The first issue is that all of this is very inefficient. Ignoring our suboptimal queueing structure, we have three main allocations here:

We have a Vec to hold our futures inside of.
Each individual Future requires its own Box.
Every time the GiantZip is polled we have to create a Waker to poll it with. The [waker_fn] crate allocates this inside of an Arc.

Specifically we should be concerned about the Waker allocation, since it occurs on the hot path. We should try our best to make sure that we can create a Waker without allocating.

There are some other persistent problems, like:

It’s very easy to misuse the API. usize isn’t a great type to index by, for a collection of hotly-polled futures.
You can’t get the result of a Future after it completes… or use a Future that returns anything other than (), for that matter.
It is very difficult (albeit not impossible) to remove a Future from the list while it is running.

We’ll begin to address these problems in the next blog post, when we start to build a real task abstraction.

Why choose async/await over threads?

2024-03-24T00:00:00+00:00

A common refrain is that threads can do everything that async/await can, but simpler. So why would anyone choose async/await?

This is a common question that I’ve seen a lot in the Rust community. Frankly, I completely understand where it’s coming from.

Rust is a low-level language that doesn’t hide the complexity of coroutines from you. This is in opposition to languages like Go, where async happens by default, without the programmer needing to even consider it.

Smart programmers try to avoid complexity. So, they see the extra complexity in async/await and question why it is needed. This question is especially pertinent when considering that a reasonable alternative exists in OS threads.

Let’s take a mind-journey through async and see how it stacks up.

Background Blitz

Rust is a low-level language. Normally, code is linear; one thing runs after another. It looks like this:

fn main() {
    foo();
    bar();
    baz();
}

Nice and simple, right?

However, sometimes you will want to run many things at once. The canonical example for this is a web server. Consider the following written in linear code:

fn main() -> io::Result<()> {
    let socket = TcpListener::bind("0.0.0.0:80")?;

    loop {
        let (client, _) = socket.accept()?;
        handle_client(client)?;
    }
}

Imagine if handle_client takes a few milliseconds, and two clients try to connect to your webserver at the same time. You’ll run into a serious problem!

Client #1 connects to the webserver, and is accepted by the accept() function. It starts running handle_client().
Client #2 connects to the webserver. However, since accept() is not currently running, we have to wait for handle_client() for Client #1 to finish running.
After waiting a few milliseconds, we get back to accept(). Client #2 can connect.

Now imagine that instead of two clients, there are two million simultaneous clients. At the end of the queue, you’ll have to wait several minutes before the web server can help you. It becomes un-scalable very quickly.

Obviously, the embryonic web tried to solve this problem. The original solution was to introduce threading. By saving the value of some registers and the program’s stack into memory, the operating system can stop a program, run another program in its place, then resume running that program later. Essentially, it allows for multiple routines (or “threads”, or “processes”) to run on the same CPU.

Using threads, we can rewrite the above code as follows:

fn main() -> io::Result<()> {
    let socket = TcpListener::bind("0.0.0.0:80")?;

    loop {
        let (client, _) = socket.accept()?;
        thread::spawn(move || handle_client(client));
    }
}

Now, the client is being handled by a separate thread than the one handling waiting for new connections. Great! This avoids the problem by allowing concurrent thread access.

Client #1 is accepted by the server. The server spawns a thread that calls handle_client.
Client #2 tries to connect to the server.
Eventually, handle_client blocks on something. The OS saves the thread handling Client #1 and brings back the main thread.
The main thread accepts Client #2. It spawns a separate thread to handle Client #2. With only a few microseconds of delay, Client #1 and Client #2 are run in parallel.

Threads work especially well when you consider that production-grade web servers have dozens of CPU cores. It’s not just that the OS can give the illusion that all of these threads run at the same time; it’s that the OS can actually make them all run at once.

Eventually, for reasons I’ll elaborate later, programmers wanted to bring this concurrency out of the OS space and into the user space. There are many different models for userspace concurrency. There is event-driven programming, actors, and coroutines. The one Rust settled on is async/await.

To oversimplify, you compile the program as a grab-bag of state machines that can all be run independently of another. Rust itself provides a mechanism for creating state machines; the mechanism of async and await. The above program in terms of async/await would look like this, written using smol:

#[apply(smol_macros::main!)]
async fn main(ex: &smol::Executor) -> io::Result<()> {
    let socket = TcpListener::bind("0.0.0.0:80").await?;

    loop {
        let (client, _) = socket.accept().await?;
        ex.spawn(async move {
            handle_client(client).await;
        }).detach();
    }
}

The main function is preceded with the async keyword. This means that it is not a traditional function, but one that returns a state machine. Roughly, the function’s contents correspond to that state machine.
await includes another state machine as a part of the currently running state machine. For accept(), it means that the state machine will include it as a step.
Eventually, one of the inner functions will yield, or give up control. For example, when accept() waits for a new connection. At this point the entire state machine will yield its execution to the higher-level executor. For us, that is smol::Executor.
Once execution is yielded, the Executor will replace the current state machine with another one that is running concurrently, spawned through the spawn function.
We pass an async block to the spawn function. This block represents an entire new state machine, independent of the one created by the main function. All this state machine does is run the handle_client function.
Once main yields, one of the clients is selected to run in its place. Once that client yields, the cycle repeats.
You can now handle millions of simultaneous clients.

Of course, user-space concurrency like this introduces an uptick in complexity. When you’re using threads, you don’t have to deal with executors and tasks and state machines and all.

If you’re a reasonable person, you might be asking “why do we need to do all of this? Threads work well; for 99% of programs, we don’t need to involve any kind of user-space concurrency. Introducing new complexity is technical debt, and technical debt costs us time and money.

“So why wouldn’t we use threads?”

Timeout Trouble

Perhaps one of Rust’s biggest strengths is composability. It provides a set of abstractions that can be nested, built upon, put together, and expanded upon.

I recall that the thing that made me stick with Rust is the Iterator trait. It blew my mind that you could make something an Iterator, apply a handful of different combinators, then pass the resulting Iterator into any function that took an Iterator.

It continues to impress me how powerful it is. Let’s say you want to receive a list of integers from another thread, only take the ones that are immediately available, discard any integers that aren’t even, add one to all of them, then push them onto a new list.

That would be fifty lines and a helper function in some other languages. In Rust it can be done in five:

let (send, recv) = mpsc::channel();
my_list.extend(
    recv.try_iter()
        .filter(|x| x & 1 == 0)
        .map(|x| x + 1)
);

The best thing about async/await is that it lets you apply this composability to I/O-bound functions. Let’s say you have a new client requirement; you want to add a timeout to your above function. Assume that our handle_client above function looks like this:

async fn handle_client(client: TcpStream) -> io::Result<()> {
    let mut data = vec![];
    client.read_to_end(&mut data).await?;
    
    let response = do_something_with_data(data).await?
    client.write_all(&response).await?;

    Ok(())
}

If we want to add, say, a three-second timeout, we can combine two combinators to do that:

The race function takes two futures and runs them at the same time.
The Timer future waits for some time before returning.

Here is what the final code looks like:

async fn handle_client(client: TcpStream) -> io::Result<()> {
    // Future that handles the actual connection.
    let driver = async move {
        let mut data = vec![];
        client.read_to_end(&mut data).await?;
        
        let response = do_something_with_data(data).await?
        client.write_all(&response).await?;

        Ok(())
    };

    // Future that handles waiting for a timeout.
    let timeout = async {
        Timer::after(Duration::from_secs(3)).await;

        // We just hit a timeout! Return an error.
        Err(io::ErrorKind::TimedOut.into())
    };

    // Run both in parallel.
    driver.race(timeout).await
}

I find this to be a very easy process. All you have to do is wrap your existing code in an async block and race it against another future.

An added bonus of this approach is that it works with any kind of stream. Here, we use a TcpStream. However we can easily replace it with anything that implements impl AsyncRead + AsyncWrite. It could be a GZIP stream on top of the normal stream, or a Unix socket, or a file. async just slides into whatever pattern you need from it.

Thematic Threads

What if we wanted to implement this in our threaded example above?

fn handle_client(client: TcpStream) -> io::Result<()> {
    let mut data = vec![];
    client.read_to_end(&mut data)?;
    
    let response = do_something_with_data(data)?
    client.write_all(&response)?;

    Ok(())
}

Well, it’s not easy. Generally, you can’t interrupt the read or write system calls in blocking code, without doing something catastrophic like closing the file descriptor (which can’t be done in Rust).

Thankfully, TcpStream has two functions set_read_timeout and set_write_timeout that can be used to set the timeouts for reading and writing, respectively. However, we can’t just use it naively. Imagine a client that sends one byte every 2.9 seconds, just to reset the timeout.

So we have to program a little defensively here. Due to the power of Rust combinators, we can write our own type wrapping around the TcpStream to program the timeout.

// Deadline-aware wrapper around `TcpStream.
struct DeadlineStream {
    tcp: TcpStream,
    deadline: Instant
}

impl DeadlineStream {
    /// Create a new `DeadlineStream` that expires after some time.
    fn new(tcp: TcpStream, timeout: Duration) -> Self {
        Self {
            tcp,
            deadline: Instant::now() + timeout,
        }
    }
}

impl io::Read for DeadlineStream {
    fn read(&mut self, buf: &mut [u8]) -> io::Result<usize> {
        // Set the deadline.
        let time_left = self.deadline.saturating_duration_since(Instant::now());
        self.tcp.set_read_timeout(Some(time_left))?;

        // Read from the stream.
        self.tcp.read(buf)
    }
}

impl io::Write for DeadlineStream {
    fn write(&mut self, buf: &[u8]) -> io::Result<usize> {
        // Set the deadline.
        let time_left = self.deadline.saturating_duration_since(Instant::now());
        self.tcp.set_write_timeout(Some(time_left))?;

        // Read from the stream.
        self.tcp.write(buf)
    }
}

// Create the wrapper.
let client = DeadlineStream::new(client, Duration::from_secs(3));

let mut data = vec![];
client.read_to_end(&mut data)?;

let response = do_something_with_data(data)?
client.write_all(&response)?;

Ok(())

On one hand, it could be argued that this is elegant. We used Rust’s capabilities to solve the problem with a relatively simple combinator. I’m sure it would work well enough.

On the other hand, it’s definitely hacky.

We’ve locked ourselves into using TcpStream. There’s no trait in Rust to abstract over using the set_read_timeout and set_write_timeout types. So it would take a lot of additional work to make it use any kind of writer.
It involves an extra system call for setting the timeout.
I imagine this type is much more unwieldy to use for the kinds of actual logic that web servers demand.

If I saw this code in production, I would ask the author why they avoided using async/await to solve this problem. This is the phenomenon I was describing in my post “Why you might actually want async in your project”. Quite frequently I encounter a pattern where synchronous code can’t be used without contortion, so I have to rewrite it in async.

Async Success Stories

There’s a reason why the HTTP ecosystem has adopted async/await as its primary runtime mechanism, even for clients. You can take any function that makes an HTTP call, and make it fit whatever hole or use case you want it to.

tower is probably the best example of this phenomenon I can think of, and it’s really the thing that made me realize how powerful async/await can be. If you implement your service as an async function, you get timeouts, rate limiting, load balancing, hedging and back-pressure handling. All of that for free.

It doesn’t matter what runtime you used, or what you’re actually doing in your service. You can throw tower at it to make it more robust.

macroquad is a miniature Rust game engine that aims to make game development as easy as possible. Its main function uses async/await in order to run its engine. This is because async/await is really the best way in Rust to express a linear function that needs to be stopped in order to wait for something else.

In practice, this can be extremely powerful. Imagine simultaneously polling a network connection to your game server and your GUI framework, on the same thread. The possibilities are endless.

Improving Async’s Image

I don’t think the issue is that some people think threads are better than async. I think the issue is that the benefits of async aren’t widely broadcast. This leads some people to be misinformed about the benefits of async.

If this is an educational problem, I think it’s worth taking a look at the educational material. Here’s what the Rust Async Book says when comparing async/await to operating system threads.

OS threads don’t require any changes to the programming model, which makes it very easy to express concurrency. However, synchronizing between threads can be difficult, and the performance overhead is large. Thread pools can mitigate some of these costs, but not enough to support massive IO-bound workloads.

- Rust Async Book, various authors

I think this is a consistent problem throughout the async community. When someone asks the question of “why do we want to use this over OS threads”, people have a tendency to kind of wave their hand and say “async has less overhead. Other than that, everything’s the same.”

This is the reason why web server authors switched to async/await. It’s how they solved the C10k problem. But, it’s not going to be the reason why everyone else switches to async/await.

Performance gains are fickle and can disappear in the wrong circumstances. There are plenty of cases where a threaded workflow can be faster than an equivalent async workflow (mostly, in the case of CPU bound tasks). I think that we, as a community, have over-emphasized the ephemeral performance benefits of async Rust while downplaying its semantic benefits.

In the worst case, it leads to people shrugging off async/await as “a weird thing that you resort to for niche use cases”. It should be seen as a powerful programming model that lets you succinctly express patterns that can’t be expressed in synchronous Rust without dozens of threads and channels.

I also think there’s a tendency to try to make async Rust “just like sync Rust” in a way that encourages negative comparison. By “tendency”, I mean that it’s the stated roadmap for the Rust project, saying that “that writing async Rust code should be as easy as writing sync code, apart from the occasional async and await keyword.”.

I reject this framing because it’s fundamentally impossible. It’s like trying to host a pizza party on a ski slope. Sure, you can probably get 99% of the way there, especially if you’re really talented. But there are differences that the average bear will notice, no matter how good you are.

We shouldn’t be trying to force our model into unfriendly idioms to appease programmers who refuse to adopt another type of pattern. We should be trying to highlight the strengths of Rust’s async/await ecosystem; its composability and its power. We should be trying to make it so async/await is the default choice whenever a programmer reaches for concurrency. Rather than trying to make sync Rust and async Rust the same, we should embrace the differences.

In short, we shouldn’t be using technical reasons to argue for a semantic model. We should be using semantic reasons.

Announcing smol-macros, smol-hyper and smol-axum

2024-01-01T00:00:00+00:00

smol just became a much easier choice to build web servers.

smol is a small and fast asynchronous runtime written in Rust. It serves as an alternative to crates like tokio with a new architecture and greater user flexibility.

However, tokio and crates like it are already well established in the async ecosystem, which makes migrating to smol hard. Even projects like wezterm depend on both smol and tokio because of this problem.

My goal is to make it so smol is much easier to make as an organizational choice for a stable runtime. Therefore, I have spent the past month or so writing a few utility crates that make smol easier to use with other crates.

smol-macros, a couple of macros for smol to make scaffolding easier.
smol-hyper, an integration layer between smol’s types and hyper’s types.
smol-axum, an integration layer between smol’s runtime and the axum web framework.

`smol-macros`

One of the great things about smol is that you set up your own executor and runtime. In tokio, you get handed a runtime that you either have to drive the program with or sequester it to another thread.

In smol, you can drive the Executor just about anywhere. You can even run an Executor inside of another Executor if you really want to. This property makes smol much easier to integrate with other runtime-like constructs, like winit.

However, the trade-off for this trick is that it takes some scaffolding to set up the runtime. You need to call block_on first, then run the Executor inside of that. If you want to run the Executor on a multi-threaded runtime, you need to spawn all of the threads and run the Executor on top of that. Then, if you want graceful shutdown, you also need to set up some kind of communication mechanism to tell the Executor to stop.

Overall, it’s between ten and two-hundred extra lines of code depending on how complex your application is. As a maintainer, it’s easy to write it off as typical application boilerplate. However, I’ve heard from some organizations that they are hesitant to adopt smol because of this additional boilerplate cost, where tokio doesn’t have any. Therefore, I’ve elected to create a solution for this problem.

smol-macros provides a handful of macros to make scaffolding a smol application much easier. The most important one for our case is main!, which wraps a typical main function and makes it async.

use smol_macros::{main, Executor};

main! {
    async fn main(ex: &Executor<'_>) {
        ex.spawn(async { println!("Hello world!"); }).await;
    }
}

Just like that, you already have a multithreaded runtime with a work stealing executor running the full smol runtime. The Executor can be wrapped in an Arc to easily enable it to be shared among different tasks and threads.

Since we are using the multithreaded Executor, it automatically spawns a set of threads that poll the Executor for as long as the program runs. These threads are configured to automatically drain tasks and drop the Executor once the main function exits.

Consider that we are using declarative macros in this case, instead of the more familiar procedural macro attributes. This design choice is intentional; declarative macros are built into the language, use zero dependencies and are generally faster to execute. If you miss the tokio style of attribute macros, you can use the macro_rules_attribute::apply macro in its place.

use macro_rules_attribute::apply;
use smol_macros::{main, Executor};

#[apply(main!)]
async fn main(ex: &Executor<'_>) {
    ex.spawn(async { println!("Hello world!"); }).await;
}

My goal with this crate is to make scaffolding easier. If the thing preventing you from switching to smol was the initial buy-in, I hope this makes it easier. Please give me feedback on how the API works for you.

`smol-hyper`

Another problem with smol is its lack of HTTP support. hyper, the most popular HTTP implementation in the Rust ecosystem, it pretty clearly tailored to support tokio. There are other async HTTP implementations like async-h1, but they don’t hold a candle to the stability and support offered by hyper.

However, hyper does have capabilities for working in different runtimes. Therefore, I’ve created smol-hyper, an integration layer between smol and hyper.

Recently, hyper released version 1.0.0, which removed a lot of the extra support needed to be implemented for tokio in favor of a more simple trait system. smol-hyper, therefore, just implements the runtime traits for smol’s types.

This crate is very deliberately simple and constrained, thanks to hyper’s new design. It’s mostly intended to act as plumbing for higher level crates, such as…

`smol-axum`

smol-axum is an integration layer between axum and smol. Like hyper, axum is generally designed with tokio in mind. smol-axum aims to reverse this status quo.

Rather than listening on a tokio TcpListener, you listen on a smol TcpListener. Rather than using the axum::serve, you use smol_axum::serve.

That’s where the differences end. I’ve intentionally made it easy to port axum applications to smol-axum. For instance, take the following standard axum application:

use axum::{response::Html, routing::get, Router};

#[tokio::main]
async fn main() {
    // build our application with a route
    let app = Router::new().route("/", get(handler));

    // run it
    let listener = tokio::net::TcpListener::bind("127.0.0.1:3000")
        .await
        .unwrap();
    println!("listening on {}", listener.local_addr().unwrap());
    axum::serve(listener, app).await.unwrap();
}

async fn handler() -> Html<&'static str> {
    Html("Hello, World!")
}

Shamelessly taken from axum’s examples.

You create a Router, start listening on a TCP socket, and then call serve to start handling web requests. With smol-axum, you do this:

use async_io::Async;
use axum::{response::Html, routing::get, Router};
use macro_rules_attribute::apply;

use std::io;
use std::net::TcpListener;
use std::sync::Arc;

#[apply(smol_macros::main!)]
async fn main(ex: &Arc<smol_macros::Executor<'_>>) -> io::Result<()> {
    // Build our application with a route.
    let app = Router::new().route("/", get(handler));

    // Create a `smol`-based TCP listener.
    let listener = Async::<TcpListener>::bind(([127, 0, 0, 1], 3000)).unwrap();
    println!("listening on {}", listener.get_ref().local_addr().unwrap());

    // Run it using `smol_axum`
    smol_axum::serve(ex.clone(), listener, app).await
}

async fn handler() -> Html<&'static str> {
    Html("Hello, World!")
}

There’s hardly a difference, by my reckoning.

`tokio` Turnstile

The main trade-off here is that hyper still brings in tokio as a dependency, even with its usual tokio features disabled. Granted, it only enables tokio’s synchronization features, which avoids bringing it its multi-threaded runtime or its mio-based reactor. This is a win in my book, but should still be kept in mind for the future.

This could be resolved by porting the underlying logic to another HTTP implementation. axum is not intrinsically tied to hyper. However, it is tied to the http crate. The only other async HTTP implementation I’m aware of, async-h1, uses a separate set of underlying traits than http. It also brings in async-std, which comes with its own problems.

A problem like this could be resolved by writing my own HTTP implementation based on http that doesn’t bring in tokio. However, this would be a lot of work for minimal benefit aside from ideological clout. hyper is fast, correct and has an active team of maintainers, so it works very well for now.

If this new crate is something your organization is interested in, pay me and I’ll put it higher on my priorities list.

Edit for 2024-03-10: It has been brought to my attention that trillium_http is an HTTP v1.1 implementation that doesn’t involve tokio. So if you’re looking for one it might be worth checking out.

`smol` v2.0.0

In addition, I’ve released the new breaking changes for smol.

Most of the subcrates already had their second versions released, so this is more of a formality than anything else. Still, after my last post about smol v2.0.0, it’s nice to finally get it out there almost five months later.

Most of my predictions from earlier came true, save for !Unpin futures, as there was some unsoundness that had to be addressed. This optimization may be reintroduced in the future.

There’s also a couple of other things, like not needing to spawn an entire separate thread to poll for child processes. Still, by and large it should be the same smol you’ve known and loved for the past three years.

Parting Shots

I hope these crates find their place in the increasingly fast-paced Rust ecosystem. Please open GitHub issues if you have feedback or, better, open a PR for any changes you’d like to see!

Creating a time machine for my blog

2023-12-30T00:00:00+00:00

We can’t let the Internet Archive do all of the work.

I’ve recently been reading a lot about the Small Web lately. While I don’t think that we can get there 100 percent, I think it has some good ideas.

One of those ideas is that we should set our sites up in a way to prevent link rot. The suggestion proposed by the Small Web folks involves proxying, but since my blog is just a list of HTML files I’ve found a better solution: just copy the files to a location on the disk and point a new domain there.

I’m announcing 2023.notgull.net. It is an exact copy of this site as of this morning, down to every individual HTML file. From this point on, it will not be modified. It will be an exact time capsule of this site as it was in 2023.

In addition, I am also announcing 2024.notgull.net. At this point, it just redirects to notgull.net. However, at this time next year, I will do the copy again. 2024.notgull.net will become a time capsule, and 2025.notgull.net will become the redirect. As above, so below.

The goal is to prevent one of the main causes of link rot: stuff being moved around. When you link to my blog, make sure to link to .notgull.net instead of just notgull.net. At this point, the link will never rot, and the context will never* change.

Of course, there’s no guarantee of this from my side. I could easily delete articles, change things and make a fool out of anyone who links to it. But I won’t. Scout’s promise. Even with tools like the Wayback Machine, you have to trust that they don’t modify it on their end.

Unless, of course, there’s some urgent correction that needs to be made in order to prevent the spread of misinformation. Keep in mind that it’s just files. They can be edited, hopefully for better rather than for worse.

I’m aware that this measure doesn’t prevent the more extreme causes of link rot. I could get hit by a bus tomorrow, and cloud costs could go unpaid, and then this server goes down along with this history system. That’s a bridge I’ll burn when I get there. I’m relatively healthy, I’ll live for another sixty years give or take a few.

This measure doesn’t apply to src.notgull.net, by the way. That’s Gitea, not a scattershot collection of HTML files I can easily copy. So that’s not affected for now.

That’s all I wanted to say, just to announce this measure. I’ll also take the opportunity to say thank you for your continued support. Since I started this blog early this year, more and more people have been reading it. It’s gotten to the point where people I know in real life have commented on my articles.

It’s honestly a little surreal to see my blog posts making the rounds on HackerNews, Mastodon, Reddit, and probably other places I’m not aware of. It’s also quite exciting. Here’s to another year of Rust, async runtimes, and webfiction!

SCP-093 is a Timeless Masterpiece

2023-12-23T00:00:00+00:00

SCP-093 is an absolute masterwork. Here’s why.

This essay is adapted from this comment on the SCP-093 discussion page, by me, in 2019. So, the audience of the post of more “people who are familiar with the SCP wiki” than the usual audience of this blog.

Some time ago, I was trying to write a long, exploration-log based SCP. However, I found myself struggling with keeping pace with the logs over a long period of time. So I decided to consult one of my favorite skips: SCP-093.

I consider SCP-093 to be the second best skip of all time, second only to SCP-2000, which probably deserves its own essay. I wanted to go in depth and figure out what made 093 different from all of the other 4000+ skips that have shown up since its writing in early 2009.

Context

SCP-093 is about a disc that showed up on the Red Sea. The disc acts as a portal to another world, where everybody’s dead and monsters roam the Earth. The SCP Foundation eventually discovers that this was the work of a singular entity, “Him”, who convinced the world it was their god.

Out of universe, SCP-093 used to just be a disc that glowed when people held it. The entry dated back to the EditThis wiki, before Wikidot, and its original author remains anonymous to this day. Eventually, an enterprising user named NekoChris completely rewrote SCP-093, adding a mechanic where it rolled towards mirrors, as well as the new test logs. Aside from some edits made by other users for grammar and tone, this is the version we’re reading today.

This section is probably unnecessary, but I feel like it’s important to understand these things in the context of their creation. Without further ado, let’s move onto the skip itself.

SCProcedures

I wanted to do a line-by-line analysis of this. So let’s skip the ubiquitous Item # and Object Class and skip to the Special Containment Procedures.

Special Containment Procedures: See testing document SCP-093-T1 for outline of testing conditions. SCP-093 must remain on a mirror at all times and under video surveillance. Admittance into the area of SCP-093’s containment must be authorized only with proper video recording and subject retrieval procedures in place. Any attempt to use SCP-093 outside of an approved test will be dealt with severely, up to and including termination.

Short and to the point. Keep it on a mirror, make sure it’s under surveillance, and don’t let anyone in without approval. This foreshadows what’s coming ahead, which is what any good SCProcedures should do.

There are, however, some points here I’d like to point out.

Any attempt to use SCP-093 outside of an approved test will be dealt with severely, up to and including termination.

This is kinda a recurring thing that I dislike about older SCPs. The SCP Foundation hires highly trained, valuable scientists and agents, so why would they just kill them? I think “termination of employment” would work better here.

See testing document SCP-093-T1 for outline of testing conditions.

I had to take a moment and think about not only why this line was in the SCProcedures, but why it was the first thing that it said in there. If this SCP-093 document was thrust into the hands of a containment agent, why would they want them to read “look at this document for testing conditions” first?

I don’t know, but I have a few ideas:

NekoChris wanted to emphasize that this is a testing log skip.
In universe, the testing procedures are really important to make sure He doesn’t breach containment (more on Him later)
My personal favorite: notice that it says “testing document T1” (where the mirror testing is) rather than “testing document T2” (where the exploring is). I think that this is outlined at the top of the file because they want to make sure people are testing 093 with the mirrors correctly, so they don’t accidentally enter the alternate reality. This foreshadows that the Foundation doesn’t want people entering the alternate reality, even by accident. I’ll bring this up later in this essay.

Description

Description: SCP-093 is a primarily red disc carved from a stone composite resembling cinnabar, with circular engravings and unknown symbols carved at 0.5 cm depth around the entire object. Deeper cuts are present on SCP-093 with a depth of 1 to 1.5 cm. SCP-093 is 7.62 cm in diameter and fits comfortably into most palms without abrasion. SCP-093 will change hue when held by a living individual. The colors taken by SCP-093 are still being researched to establish a link. Current belief holds that the changes depend upon regrets carried by the holder.

This is fairly straightforwards. The first paragraph describes the appearance of the object, that it can be held in your hand without issue, and that it changes color when it’s held by different people. Nothing big.

By the way, remember how I said that this was a rewrite of a previous low-quality skip? Well, NekoChris pretty much summed up that entire skip in a single paragraph. Which is, number one, an amazing feat of writing and, number two, I feel is done intentionally. He’s getting the old skip out of the way, so he can focus on the new stuff.

If SCP-093 is removed from a mirror and not held by a person, it will seek out the nearest mirror-like surface. SCP-093 has been observed to travel in the largest possible circle while rolling, building up phenomenal speed.

Again, pretty straightforwards. 093 wants to be in front of a mirror, so it’ll roll towards the mirror and break any obstacle in its way to do so.

Additional Notes: No records exist to clarify the nature of SCP-093’s discovery or presence in the Foundation. See SCP-093-OD. Since no records exist explaining SCP-093’s method of containment, a test procedure was initiated to establish why mirrors must be used to contain it. The results of SCP-093-T1 lead to the discovery of living beings holding SCP-093 being able to move through mirrors and the series of tests in SCP-093-T2 to ascertain the destination reached through this travel.

I’ll come back to this in a minute, but they’ve lost the records for SCP-093, so they have to do some tests to figure out what it does. This leads into the mirror tests, which lead into the exploration logs.

Original Documentation

Before we get to the tests, we get to see the original documentation of SCP-093. This is done in a very interesting way; the SCP-093-OD is actually the pre-rewrite version of SCP-093. This is interesting because, as far as I know, no other rewrites include their pre-rewrite versions wholesale.

However, the way that it’s included, it’s not just a callback to the original. NekoChris used it in such a way that, just by being there, it progresses the story. Let’s look at two points in this section that I feel are important:

SCP-093 resembled the documented blue for 54:34 at 1:23 on 26 April 1986 coincidentally when the body of 194-9834 was discovered in Research Facility █████.

Ties between 194-9834 and SCP-093 remain inconclusive and effects of prolonged exposure to 093 remain unknown except for infrequent reports of periods of calmness and in the case of 242-0049 as periodic waves of depression, loss of balance and thoughts of suicide.

Okay, in this, it asserts that SCP-093 causes calmness and, in some cases, suicidal thoughts. Remember that this is never mentioned for the rest of the skip. I think that there are two options here:

NekoChris either accidentally forgot to make a callback to this property, or deliberately did not call back to it to avoid mucking with his narrative.
In between 1985 and whatever date the current document takes place at, SCP-093 somehow lost those anomalous properties.

Additional Notes: Origins of 093 remain unknown and documents of recovery of 093 have since been destroyed in a fire in Research Facility █████, 09 December 1989.

At first glance, this may seem inconsequential. But, remember this line:

Additional Notes: No records exist to clarify the nature of SCP-093’s discovery or presence in the Foundation. See SCP-093-OD. Since no records exist explaining SCP-093’s method of containment, a test procedure was initiated to establish why mirrors must be used to contain it.

You see, these two relatively small blocks of text, written by two different authors, years apart from each other, instantly create a storyline involving the Foundation in less words and in a much more compelling way than most skips do today.

Let me explain; according to the “Additional Notes” section of the old documentation, there was a fire that destroyed most of the documentation for SCP-093, leaving them not knowing why they contained it on a mirror, or what the colors of SCP-093 meant. First off, this is actually kind of meta; it explains why a lot of Series I skips (especially the older ones) are so short: a fire destroyed their documentation.

However, NekoChris took advantage of this line in his rewrite by saying that the Foundation didn’t know about why it was laying on a mirror, and that testing was beginning to find out why.

This paints the following picture for me: a researcher, or bureaucrat, or whatever, is going through the old Foundation files, when they find the file for SCP-093. From reading it, they find out that we’ve had some red mood-ring disc lying on a (very expensive) mirror since 1968 for seemingly no rhyme or reason. Of course, they say something along the lines of “why?” followed by “we need some tests.”

This not only establishes the beginning of a narrative in only a few words, but sets up something grand that I don’t see in a lot of skips, new or old, do: it sets up the Foundation as a main character, and sets it up for characterization.

What I mean by this: in most skips, you learn information from a certain point of view, whether that of a researcher, the SCP itself, an MTF team, or just none at all. However, in SCP-093, the Foundation itself takes this role. The reader learns new information at the rate at which the Foundation discovers it, and almost gets to feel the role of the Foundation as they figure out the anomalous properties of SCP-093 and explore the world beyond. In addition, this also allows for some pretty cool characterization of the Foundation; we see the Foundation’s motive change from simple testing, to exploration, to full-on planetary defense. I’m not saying that 093 is the only skip to do this; many other skips characterize the Foundation’s goals, history, and parts as well. But they often use paragraphs upon paragraphs to do it, while 093 does it and such a wonderfully subtle way that it’s hardly noticeable without paying attention.

Just a reminder, all of this is set up in just around five lines. Five. Lines. This is economy of language at its finest, and is really genius level writing.

Anyways, now that the main character is set and the plot is ready to go, let’s start with the part where SCP-093 really gets interesting.

Test Log 1

Mirrored surface, brass frame, retail-grade mirror: SCP-093 rests without activity when placed on the mirror. This test alone removes the need for costly silver or wooden containment systems.

This is actually kind of funny, because it feels like something of a roast directed at the original author.

This test log really just restates what we’ve already learned from the Description, and principally serves to build up to the final log:

A person holding SCP-093 placing it on a mirror: This test was accidental, the result of one of the staff tripping another after some debate about who would be covering the lunch tab. As a result of the behavior of the researchers, it was discovered that a person holding SCP-093 and placing it against a mirror will in fact move into the mirror.

First, a complaint: along with the “termination punishment” thing mentioned above, this is how you can tell that this was written in early 2009; the era of lolFoundation, Keter Duty, and Omega-7. Why would a researcher bring SCP-093, a dangerous object that can punch through walls and has been known to cause suicides, to lunch? Why would they have it out while walking? Why is there a mirror in the hallway? Why would a researcher trip another researcher while he was carrying aforementioned object that punches through walls while it’s not in the palm of someone’s hand?

To be honest, I headcanon this as some kind of cover-up for some O5-eyes-only secret. Although I do admit that this line has personality, I think a simple “a D-class tripped and accidentally fell through a mirror” would work as well.

Alright, rant over. What this does well: it kickstarts the upcoming plot. We already knew that 093 rolls towards mirrors, but nobody knew why 093 rolls towards mirrors. In this, it’s implied that the whole reason why 093 rolls towards mirrors is to show us to this alternate realm. Reminds me of that one movie where there’s that bird that shows the main character to the gateway to the alternate dimensions where she’s supposed to go.

Now we know that SCP-093 + Mirror = Gateway to an Alternate Dimension. And now our main character, the Foundation, turns its motive from containment to exploration.

Test Log 2

(By the way, I’d like to interrupt this by saying that I really like this table-header format. It’s sad that, as far as I know, there aren’t any skips outside of 093 that use it.)

We transition here into the exploration logs, which is where the meat of this skip is. We get an idea of what our subjects are taking into the alternate dimension, and then we’re thrown into the Blue Test.

Blue Test

Camera activates, flickers to view. Subject is looking out over the same field reported by technicians. Looks like typical lowland plains, everything has a heavy blue tinge overlapping the normal colors. No discernible landmarks visible as subject pans view left to right, only grass, weeds, and a breeze moving the taller grass. No trees. No living beings visible.

The first paragraph asserts that we’re in some kind of plains. However, there are no trees, and no living beings. Right off the bat, it is asserted that we’re either in some kind of remote part of this alternate universe, or that everything is dead. This is a post-apocalyptic universe.

Eventually, the subject enters a hole in the ground, which leads to a tunnel, which leads to some kind of bunker.

Room is bare, no contents, but walls are filthy. Subject states material on walls isn’t dirt, but he can’t identify it, seems to resemble melted plastic but is brown in color rather than black.

This is where we’re first introduced to what I’ll be calling “mystery goo”. It smells bad, and it’s everywhere. It serves as sort of a “recurring mystery”, where it appears everywhere, and we’re slowly clued into its origin.

Walls of room are clean as is floor, ceiling is coated in the same strange brown material as the third room. In this room there is a makeshift cot made from aged blankets with a pillow, a wooden crate containing open boxes of what appears to have been food stuffs, language appears on video as squiggles however subject states they simply read ‘Cereal’. A second crate in the room contains what appear to be empty water bottles that have dried out. A book lays next to the cot, closed, no title or identifying marks.

Okay, this is definitely a survival bunker. This confirms the belief that this is some kind of post-apocalypse, and that there used to be people here. But now, there is just the weird mystery goo, collected on the ceiling.

Careful review of the following ten seconds of tape shows that as the camera pans, a figure is visible at the end of the tunnel where the seventh door is. The door is open only enough for a face to be seen through a crack just before the door silently closes. No details can be seen.

This is another one of those “recurring mysteries” to help keep readers hooked: these mysterious shadow people that show up everywhere. After all, it’s made clear that this is some kind of post-apocalypse, so why are there people?

This is almost glossed over- which is good. It’s kind of a “wait, what the hell?” moment, and the fact that it’s almost entirely unmentioned after the fact solidifies that moment.

Then there’s some cable trouble, then the subject gets pulled out of the hole by some unseen force, and the camera sees 37 creepy ghost people. They’re unidentifiable, and they’re watching the subject for some reason.

Control requests subject return following cable path and screams are caught on the audio with panic from subject. Five shots fired as subject aims pistol at something not visible on camera. Control reports being able to see subject returning toward point of origin while camera shows wire disappearing into a point floating in the air. As subject passes this point all cable is now in the pulley system and camera films only the floor. Control reports that the mirror took approximately five seconds to return to a reflection and SCP-093 remained blue in color until one hour after being recovered from subject.

A vile smelling fluid was present on subject’s clothes around his hands when firearm was recovered. This fluid dried quickly and was deemed insignificant of study due to lack of quality sample. Control personnel monitoring the mirror state having seen a massive human being, crawling on the ground, easily fifty times the size of a normal person with no facial features and a very short arm reach, pulling itself toward the mirror before it returned to a reflection. Due to proximity fine details could not be made out but at least one observer noted the being appeared to have been shot from the marks in the otherwise smooth featureless face.

This is where we get a look at the “antagonist” of SCP-093, for lack of a better word: the Unclean. In the first test log, the reader only gets a glimpse of them; they’re big, they’re scary, and they’re chasing the Subject.

Field Test Kit recovered from subject containing a news paper article that reads: [DATA EXPUNGED] and was filed as item [DATA EXPUNGED].

These expungements may seem unnecessary at first glance; however, this is some story information that is best saved ‘til later, and (IMO) actually convinces the reader to read on out of curiosity as to what’s behind the expungements.

All in all, a pretty good opening log for this skip. It introduces the parallel universe, the fact that it’s a post-apocalypse, the mystery goo, the ghost people, and the Unclean in just the right amount for it to be spooky, but also to leave the reader wanting more. I do think it could be improved by adding some foreshadowing to the religious apocalypse (I.e. maybe a cross in one of the bunker rooms?) and I think that the Unclean were described a bit too much for what should’ve been a vague hint at a monster (I.e. instead, maybe say “technicians saw a glimpse of a large humanoid figure crawling on its hand before the mirror returned to a true reflection” and leave it at that), but for what it is, it excellently introduces the concepts and players which are built upon in the remaining logs.

Green Test

No landmarks from Test 1 are discernable as subject pans camera over area.

Present is a field, long abandoned, in the middle of which stands the remains of a scarecrow of unknown design, fragments left are rotted and torn. Nothing grows in the tilled land. A farm house is visible to the right of the field, large, two stories, a basement shelter entrance is visible at one end.

Alright, the first few lines tell the audience the following:

This is in a different place than Blue Test.
We are on a farm.
Everything is dead.
There’s shelters everywhere, which implies that people knew about whatever apocalypse happened here beforehand.

As subject pans the area a metal hatch is visible in the ground, similar to a bulkhead on a submarine with a turn handle. Subject remarks that the smell is at its worst around the hatch and the dirt around the hatch is noted as being clumped and claylike.

This is repeating the whole “stench” and “mystery goo” thing from Blue Test. What this does is make the audience anticipate the inevitable “mystery goo” and perhaps even the ghost people from Blue Test.

On the beds are two skeletons and on the floor is a third, lying next to which is a simple six shooter revolver containing no ammunition. Three spent casings are across the floor near the gun.

I haven’t been commenting a lot for a reason I’ll bring up later, but this gives the audience new information: there was a struggle. They didn’t just keel over, they were killed by something.

In the distance, approximately 700 m from the farm, two massive, humanoid beings are crawling across the landscape.

This further introduces us to the Unclean, which were only slightly glimpsed in the last test.

There is a recliner, a couch, and a television all of 1950s style design. In the recliner is a laptop whose case also resembles 1950s decor and is coated in heavy dust.

This is some further world building. When this civilization ended, it was in the 50’s, or some analogous point. This helps to further paint a picture of this world in the reader’s mind.

To conclude, at first glance, this seems to be a retread of the first log. In fact, the reason why I didn’t do as many quotes was because a lot of the structure is almost identical to the first. However, along with some decent world building, it gives us new information. Whatever apocalypse happened here wasn’t instant. There was a struggle, against something. Something that managed to get into the bunkers and kill an entire population. And if you’re like me, you’re beginning to suspect that it has something to do with the large human like things that we’ve seen twice now.

This all helps to build the mystery, along with the creep factor NekoChris has been setting up: the eerie feel of an empty world, roamed by these eldritch horrors.

Now onto the third test…

Violet Test

Subject is in what appears to be a modern downtown district similar to a city like New York. The streets are mostly bare except for a few cars of unknown make or model. These cars look highly advanced and streamlined.

Instead of another rural area like the last two tests, we instead start with a city area, which, if we had any doubts about whether or not this was an apocalypse or just a bunch of people leaving the countryside, this silenced those doubts.

In addition, this gives us a glimpse into this civilization’s tech level with the advanced cars, which we haven’t yet seen. This is some further world building.

Subject attempts to look into the car windows without being instructed to but backs away remarking there is a ‘rank ass stank’ coming from the areas around most of them.

Subject is persuaded to move closer to one car and does so with coughing, wiping off a window which is covered in dirt. The inside of the car appears to be completely filled with a strange brown matter, there is nothing at all visible other than the brown matter.

Control debates this issue while subject stares around the cityscape from the car. During one pan a face is clearly seen staring into the car, eyes watching the subject; however, this was not noticed until post-test footage review.

Now we also have the stench, the mystery goo, and the ghosts. These are familiar elements, and they’re all gotten out of the way before we really start the exploration.

A team of four armed personnel is sent through the mirror and proceeds to subject’s location.

This time around, we have an entire team of soldiers traveling with the subject. I feel like this is done to change things up from the last log.

The view of the city is astonishing. This building is one of the tallest visible but certainly not alone in its stature. The city below is gray and silent, no evidence of life at this altitude. Some buildings in the city have a strange brown growth that appears to have been splashed against them as if a gelatinous mass was flung and then seeped down before hardening.

Again, some more world building. They have some pretty good tech, but whatever they did they still weren’t able to fend off the apocalypse. “Gray and silent” in particular get to me in emphasizing that this is a ghost town, minus the Unclean.

On the desk is a notepad titled ‘From the desk of Dr. Borisizki, Blessed Purificationist’.

Yeah, this was implied with the whole “FaithfulOS” thing earlier, but this tells us that this world has something to do with the wide adoption and fundamentalism of some religion. This is further asserted by the test tubes.

The camera pans to a section of raised expressway across which one of the large torsos is crawling slowly. The entity turns its featureless head to look at the escort team, raises its head to the sky, and emits a bellowing sound.

This is an “oh, shit” moment. So far, all we’ve seen involving these entities is them either walking around, or vague glimpses of them. Now, we’re fighting one head on.

When a matching color is displayed and applied to the mirror the video receiver is visible and all individuals report a horrific smell.

Later review of the recovered camera shows escort member ██████ grasping at the air where entry point should be and then turning to look up at the oversized torso. A brown gel seems to drip off the creature as it moves that disappears shortly after being dislodged as if evaporating. Several shots are fired at the creature’s face with the automatic weapon carried by ██████ that land in the ‘face’ of the creature, causing a spray of less viscous brown liquid to pour forth from the ‘wounds’.

I’ll comment on these two parts later, but this reveals two things:

The stench, and therefore, the mystery goo, are both associated with the Unclean.
Since they’re associated, and given the goo and the corpses in the bunker, the Unclean were the cause of this apocalypse.

Also, a side note: I think NekoChris went out of his way here to not say “SCP-093-1” and instead use descriptions to describe the Unclean. I think this is to evoke more imagery with the techniques that they have.

This test actually reveals quite a bit about the world SCP-093 leads to. We see the high tech cities, the people in the vials, the religious connections, where the mystery goo comes from, and we experience the first real encounter with the Unclean. While Blue Test and Green Test paint a picture of a post-apocalyptic world ravaged by the Unclean, the Violet Test paints a picture of the world as it was before apocalypse. As much as I do detest the fact that the religious aspect is hardly touched upon in the first two logs, I feel like it makes the payoff in this log much more satisfying.

Yellow Test

D-class subjects no longer authorized for testing. Testing focus has been shifted to data collection after analyzing the articles brought back from the previous three tests to better understand the fate of the world accessed by SCP-093 and determine if safeguards or practices are required for our own world.

Can I just remark on how epic this line is? After painting a good enough picture of the world in 093, the Foundation shifts its focus from exploration to full-on planetary defense. Holy crap.

Just a reminder that this line comes after the test log where we, the reader, receive a definite linking between the apocalypse and the Unclean. Remember what I said earlier about the first parts of the skip setting up the Foundation as a “perspective character,” a character where we learn information at its pace? This is that setup in action, and it puts the audience is the Foundation’s shoes as it calls for planetary defense. This is one of those genius writing moments that make me love this skip.

The descent down the elevator is long, consuming 15 minutes, during this time the camera experiences one malfunction where the image jerks and turns to snow, restoring to show 14 other figures in the elevator with Dr. █████ as video pans around, all of whom move as he moves to allow him space. They remain for 35 seconds then the camera flickers to snow and returns, Dr. █████ is now alone in the elevator dancing as is assumed by the ducks and sways of the video feed.

Just wanted to remark about how imagery-inspiring this is. For all the complaints about the tone of these logs, they have some pretty good imagery and characterization.

The third view is facing the opposite direction as a camera glides vertically checking each observation station. A total of 10 can be counted and Dr. █████ is visible as the camera passes by his own station. Looking up, a hovering camera unit with no visible means of propulsion glides up past him. The fourth view shows the ground floor below the observation deck where a single astonishingly large torso being is crawling in circles, bumping into walls and changing directions. From the camera feed the creature’s estimated size is six stories.

This is actually setting up for the conclusion, and it does so in an unusual way: contradicting itself. NekoChris has spent the past three logs asserting that everything is dead, and the world has fallen apart. Now, we’re seeing rows upon rows of function PCs, and the containment of one of the Unclean. I’ll comment on this more later.

Analysis of ███-███ and the ammunition for it postponed for reason that it would require deconstruction of one of the rounds and they may be beneficial until testing of SCP-093 is resolved. Video ends.

Two things about this line:

███-███ is just a poor man’s SCP.-███
“they may be beneficial until testing of SCP-093 is resolved.” This is one of those sentences that really pop out at me. Remember how it was asserted earlier that the Foundation’s now focusing on defending the world from whatever happened in the 093-world? Yeah, they’re taking every weapon they can get to make sure they’re defensive.

All in all, the yellow log is really a continuation of the violet log, and helps to set up for the finale with its tonal shifts and feats of working technology.

Without further ado, let’s get to that finale.

Red Test

Service Technician ██████ was able to cause SCP-093 take on a fierce red hue and glow, much brighter than the object’s normal color.

This is one of those moments where you, the reader, know that shit’s about to go down. It’s when your heart starts beating, and you end up glued to the chair as you scroll downwards. This almost perfect buildup for what’s about to happen.

Video flickers to life and Technician ██████, known hereafter as Subject, is viewing a large cylindrical pillar that is rotating on its own. Object is of unknown height and appears to be 1.8 m (6 ft) in width. Holes are distributed throughout the object at seemingly random intervals. On occasion a beam of white light is emitted from these holes. Turning of the camera finds that the beams are connected to a multitude of objects similar to SCP-093 that are part of the room’s wall. The room turns out to also be cylindrical in shape with countless copies of SCP-093.

This makes two assertions. The first is more obvious: SCP-093 is a dimension-travel technology. However, the second, the fact that this is some kind of hub for SCP-093, doesn’t guarantee that we’re in the same universe.

Let me explain this theory of mine. For the past four tests, we’ve been thrust into an alternate dimension, which appears to be SCP-093’s intended destination. Now, we’re in some kind of 093 hub. Maybe the service technician somehow activated a “return to sender” feature on SCP-093 that brought him back to the source? I’ll discuss this one more later, once we’ve finished the SCP and we have all of the facts in one basket.

The ladder exits into a large clean room full of computer equipment that appears antiquated compared to previously encountered equipment. Large computers running on reel-to-reels are clicking and spinning at various locations, a light bulb of unknown meaning turns on for ten seconds then turns off. A large CRT monitor is displaying single words in 8 colors at roughly 5 second intervals. While observed the words ‘Clean’ ‘Unclean’ ‘Clean’ ‘Clean’ ‘Lost’ ‘Unclean’ flash on the screen.

Some more evidence to my “not the same dimension” theory: it’s obviously not the same level of technology. I theorize that the meaning of the words on the screen are referring to different universes accessible through the 093’s we saw in the hub. Whether they’re not infected by the Unclean, are infected, or if their SCP-093 instance is broken or lost.

All further SCP-093 tests have been discontinued while review of materials recovered is in effect.

I’ll comment on this line later.

All in all, the red test is a pretty good finale. It takes an “oh, crap” moment and extends it throughout the entire log. Now, let’s move on to the last log.

Recovered Materials

Any employee reading past this point who does not have proper classification should consider themselves to be terminated from employment and now subject to disciplinary actions up to and including: Forced administration of Class A Amnesic, immediate transfer to Keter class security, and death.

Another one of those “Series-1-esque” bits I talked about earlier. It’s charming, but I feel like punishing employees with death is the wrong way to go.

Most Holy Father Announces Progress, Unclean Being Cleansed!

Alright, this segment, I feel, is the result of a “Catch-22.” NekoChris couldn’t put it explicitly in the Blue Test, as it reveals lore that should be saved for later. But that means that they had to put it here. At this point, the payoff is good, but could be done in another way, I feel.

But if you see this symbol, if you see it.. you run boy, you run fast, you run far, and you hide, and you never go back where you saw it. That’s all I know. - I remember the symbol, was on the rock he kept on his neck under his shirt. Next day, pap was gone, nowhere to be found, dad weren’t sad, said he knew it’d happen one day, pap went home. See you soon dad, pap..

[DATA EXPUNGED] Symbol matched symbol found on SCP-093’s surface as one of the deeper engravings. Also matches symbols noticed on video feed of final test on SCP-093 duplicates.

This is some creepy, good shit.

Also, why would the symbol be expunged, especially in a file where all of the recovered, unexpunged information is? Given what we learn later about the origin of SCP-093, I’d guess that the symbol is this.

During preparation for the Tears subject went into a rage and the attending Hand went to recover a sedative. Jennifer tore her clothes off and screamed impure words at me so I locked the door and instructed the Hand to wait outside. I am half shameful to admit I laid with Jennifer a total of seven times before putting her to the Tears.

This line introduces us to the “Tears,” as well as emphasizes the corruption of whatever system was in place before the apocalypse.

The Eyes have dated the sample, it is older than myself, older than my elders. It is over 200 cycles in ages. 200!

I do not think this Unclean is alone. I have seen how they can get into places, between places. Between places! Is that where they have been, all this time? Between places?

The smell, it is so strong, it comes from all directions. It is not a smell of the dead, it is a smell that comes from something that should be dead but does not know how to die

This is some Kalinin-tier prose, and the note it’s in is great. It shows the perspective of the corruption and the Unclean from somebody who’s not on the top, but knows quite a bit. It’s great insight into this world, and it does it without being too over-explaining.

In the event of any Emergency requiring the Facility to be evacuated, all Clear-4 staff should report to Train Station 3 and use their Vial to call the Evacuation Train. Only one Vial is required to call the train and may contain any amount of Tears. An Empty Vial will not call the train. Clear 2 and 1 staff should remain at their posts until either 10 minutes after the departure of Clear-4 persons or until authorized by Clear-4 staff. Clear-3 staff should utilize the Protective Garments at their stations and weapon lockers before proceeding to designated Crisis Areas as dictated by Clear-4 staff.

This makes it clear that this is some 093-infected Foundation analogue. But remember the multidimensional hub? Yeah, NekoChris invented the multidimensional, all-intrusive Foundation before Scantron codified the concept in their 001 proposal.

I actually found a scientific report written by someone who stumbled here with a SCP-093 copy. These creatures are the result of exposure to a very pure form of His Tears resulting in a genetic apocalypse occurring within the exposed.

“Genetic apocalypse?”
This is where it all starts coming together for the reader. And it does so beautifully.

I think someone is in this facility, or someones, I keep hearing voices and requests coming from areas under the floor. I want to leave this before I explore the facility any further. I have sent SCP-093 back through the entry mirror to seal that gate. These things can’t be let into our world nor should we have anything to do with this one, we’re simply not smart enough to understand it all I feel.

I’ll comment on this more later, but this is one of those moments where it calls back to something earlier: specifically, the “bad grammar” entry just before this one. And something happened to that person.

They’re.. in my head I think.. I didn’t notice it till just now but, equipment in this room is starting to react to me, words on the screen, begging for help.

I’ve seen the faces, of the people, the Unclean. They show up on the pictures cast by the machine, in the room with me, watching me. I think, they’re everywhere on this world, only seen by machines now.

And it also explains the ghost people. Nice.

they showed me things when i touched them and its not quite like the records say. the unclean remember it all, every person they touch becomes part of them, safe inside them, but dead to us. every mind, every feeling, every terror, its eternal to them. i kind of want to join them but.. too much to do.. they want me to.. find him, kill him.

Oh no.

there was no war it was him him him him him IT. IT. it came from between the folds of time and space and worlds and light and dark something that is but should not be slipped in and called out to them as their god and they believed it and they tasted it and touched it and layed with it and became its property and did its will and IT IS STILL HERE the scp-093 it brought with it pulled forcefully with it built it i don’t know they don’t know but it belongs to him it lets him move between places between worlds so i BROKE IT ha ha ha i threw pieces of it away and through holes so those doors are closed just like ours is closed and i can’t go home so what else can i do

Oh no.

it calls out through the rock, somehow, it knows where they are but can’t touch them, but if you hide the rock he can’t call out and he’s stuck too i got you you son of a bitch I GOT YOU BANG BANG ha ha

i touched him. with my fist. and my gun. and he fell down. but he’ll get back up. soon. i’m sorry, i did all i could, let me sleep now, please… let… me… slee

Oh, no!

Despite the over-explaining in the note, the end of it is a fitting conclusion to the adventure. It’s cryptic, but not too much, and leaves the reader wanting more.

Speculation

I’m gonna try here to take educated guesses as to what happened in SCP-093. I know it’s not really analysis, but I’ll use it to prove a point later.

So, first of all, what is Him? All we really know about Him is from the last log, so let’s put all that together:

He’s obviously not human, as the description of his origin (“it came from between the folds of time and space and worlds and light and dark something that is but should not be”) kinda makes me think of “Lovecraftian Monstrosity.”
He is able to impersonate, or is, a god.
He created several extremely high-end technologies, including SCP-093.

Now that we know what He is, what happened to him? My two theories are:

He killed and impersonated the service technician to get into our reality, and when he was killed He was killed (boring ending).
He is still alive, and is somehow attached to SCP-093 (see below).

I find the second one more interesting, and I think it actually makes a couple of writing quirks that I’ve commented on so far seem justified.

Let me use an example. For the purposes of the example, let’s give names to the worlds involved in the story:

World 1 - The world accessed in the Red Test, and the “Hub World” for SCP-093.
World 2 - The world accessed in the Blue-Yellow Tests, and has fallen to Him and the Unclean.
World 3 - The world where Agent ██████████ from the recovered documents is from.
World 4 - The world where our Foundation is in.

Let’s establish a timeline. In World 1, He appears, possibly just a natural anomaly, like how SCP-173 is an anomaly native to our world. He takes over World 1, builds the 093 hub, and sets out to send 093’s to the other dimensions to bring Him to the other worlds.

World 2 is a world that got successfully invaded by Him and taken over.

World 3 is a world that got ahold of SCP-093 and apparently went on a thorough investigation of either World 2 or another Unclean world. However, the agent eventually realized what SCP-093 really did, and then broke it to He couldn’t get to World 3.

In World 4, AKA our world, we see that SCP-093 is really just a vehicle for Him. We now know that its usage could lead to the end of the world for all of us. So the Foundation does what it does best: it locks it up in a box, to make sure it doesn’t see the light of day.

This explains the narrative quirk of: “why didn’t the Foundation do more explorations of SCP-093?” Why didn’t they pass around 093 some more to see if they could get it to glow orange, or something? Why didn’t they send an MTF back into the Blue Test so that they could break down some of those stuck doors? Why didn’t they even test out the Unclean-killing revolver? The answer is simple: to open up SCP-093 again would be to risk the same thing happening to us.

This also explains the missing sadness property from the original documentation: it was there originally, but maybe He fixed it so people would be more likely to pick it up. This also explains why SCP-093 spins towards mirrors: it’s Him, trying to show us the way to let him into our world.

This also explains the very first line of the SCP:

See testing document SCP-093-T1 for outline of testing conditions.

They’re saying “if we’re testing this thing again, whatever you do, don’t open up a mirror portal.” As the first thing we see in the documentation, this is driving home the fact that the Foundation wants SCP-093 to stay buried.

I love these SCPs that you can speculate on, and SCP-093 does a fantastic job of facilitating this speculation. It’s close-ended enough to be a satisfying story, while open-ended enough to the reader to have a sense of wonder.

In short, SCP-093 is a timeless masterpiece. It not only presents a compelling story and a well-built world at a time when these things were rare, but also develops it all through subtle, masterful writing, and introduces some ambitious concepts that make the SCP truly shine. Really, this could’ve been a lot worse; NekoChris could’ve just said “let’s just make it roll towards mirrors now” during his rewrite and left it at that. Who knows where this skip would be if that happened? But instead, NekoChris made something dazzling. NekoChris made a classic.

Parting Shots

Of course, this isn’t perfect. Some problems I noted:

There’s some Series 1 bits that I mentioned above. These slow the skip’s momentum and could be executed better.
The tone of the exploration logs is all over the place, and definitely could use a run-through for clinicality.
The last document comes across as an info-dump of sorts. I think that it could be spread more across the logs.

However, what I came here to do is to figure out what SCP-093 does right, and how we, as writers, can learn from it. Here’s some key takeaways:

Don’t Show Your Entire Monster - This is a weakness of the SCP Format: it often forces you to show the entirety of your monster up-front. But when you can, gradually reveal your monster to keep the reader guessing as to what it is.
Create Threading Mysteries - Stuff like the mystery goo and the Unclean that recur are what trigger reactions out of the audience and keep them hooked.
Imagery - With clinical tone, it’s hard to pull off imagery. But when you can, the pictures and worlds you can create are fabulous.
Don’t Overwhelm the Reader - Using a “perspective character” can help to keep information flowing at an appropriate, but exciting pace.
Leave the story Open-Ended - I see too many skips today that don’t leave anything up to speculation. This can be a great strength and helps to keep up the mystery.

If you want to see a skip that takes these lessons from SCP-093 and follows them to a T, look no further than djkaktus’ excellent SCP-2935. It barely even lets the reader get a picture of the anomaly at first, it has a ton of recurring mysteries, the world is very fleshed out through imagery, the information is well-paced, and the ending is solid enough to be impactful but open enough to leave room for speculation.

There are some other similarities too, like the log structure, the apocalyptic theme and how the ending is put together. I don’t know if this was an intentional homage to SCP-093, or if it was coincidental (the number 2935 points me towards the former conclusion), but it’s a good example of something that follows SCP-093’s example.

In conclusion, when we write today, instead of looking at older skips through our new-author-eyes, I think we should instead look at our newer skips through the eyes of the old authors. We can learn a lot from the tried-and-true older skips.

The rabbit hole of unsafe Rust bugs

2023-12-16T00:00:00+00:00

unsafe code is not only tricky to get right. It’s also tricky to track down when things go wrong.

Here’s a question to ponder: do you think you are experienced enough at Rust to use unsafe code properly? If you asked me 24 hours ago, I would confidently say “yes, I am.” I would describe my Rust experience level as at least intermediate. On top of that, I have a little bit of an ego.

In fact, I fully support using unsafe in libraries! Many times, there’s no other way to do something!

You’re communicating with a system API or C library. Although, for the former, I would recommend seeing if using rustix or another system interface is an option.
You’re doing something that the borrow checker or type system can’t fully check your work on, even with workarounds like RefCell or Any.
There’s just some performance boost that’s simply not possible without an unsafe workaround. I would strongly recommend benchmarking this claim before implementing it in a publicly used crate.

While I’m not against unsafe, I am for really checking and validating that unsafe. You need to be making sure it’s doing what it needs to be doing, and that it’s doing that thing right. The hazard with unsafe isn’t just that it can break your crate. It’s that it can break other crates, in a way that’s very, very difficult to trace back to your original crate.

You know how I said I had a chip on my shoulder? In the past week, I’ve had two soundness errors reported. For the same crate, no less!

One of those soundness errors is a fairly routine linked list rigamarole with an associated discussions about public API. The other one is a frantic chase through several different crates, where things are never as they seem. This blog post focuses on the second one, as I find it more entertaining and maybe we can even shoehorn a moral in there somewhere.

So let’s ask ourselves: what do we open ourselves up to when we add unchecked unsafe to our code.

I mention crates that are created by and maintained by other people in this post. They have done nothing wrong: all of the bugs were introduced by me, notgull. It should go without saying, but please do not seek out or harass these other people.

Event Listener Escapade

Let’s introduce our cast of characters.

event-listener is a fairly primitive crate in the smol ecosystem. It’s main responsibility is communications between tasks; it essentially handles the “if one thing happens in a task, wake up another task” use case. It’s used to implement locks and channels, among other things.

event-listener also has a lot of unsafe code. In my opinion, this is a good thing: it means that other crates that depend on it don’t need to have that unsafe code for themselves. I’ll also be the first to admit that a lot of the unsafe code is unnecessary. Some of it exists because we’ve benchmarked it to be faster, and some of it exists because there isn’t really a better way to implement, say, a limited-contention spinlock in Safe Rust.

A few days ago, one of our maintainers reported a bug. Most of the time, soundness bugs have a specific flavor, like bitter licorice. This bug didn’t have that flavor, at first glance. It was like biting into the sweet apple of a simple off-by-one error, with an aftertaste of smooth, smooth WebAssembly. One fateful day, GitHub Actions started failing with a strange error.

$ wasm-pack test --node --no-default-features --features portable-atomic
Running tests/notify.rs (target/wasm32-unknown-unknown/debug/deps/notify-0ef5209d8abd9615.wasm)
Set timeout to 20 seconds...
Executing bindgen...                              
                                                  
running 9 tests

panicked at /home/runner/work/event-listener/event-listener/src/no_std.rs:639:48:
index out of bounds: the len is 0 but the index is 4

Here is the full log, in case you want to try to take a crack at this bug for yourself.

Debugging Demonology

At first glance, I ran this code on my laptop and wasn’t able to replicate the issue.

$ wasm-pack test --node --no-default-features --features portable-atomic

I took one glance at the WASM bits and the portable-atomic feature and decided that it must be some kind of strange one-in-one-thousand race condition. It was a Thursday and I had work to do, so I decided to revisit it on the weekend. The weekend came and, after re-running the Actions workflow, the bugs were still there.

Weird, I thought to myself. If it was a race condition, it shouldn’t be happening consistently. So I needed to take a closer look at what was happening here.

While reading through the logs, I realized that the list that the out-of-bounds error was happening on was initialized like this:

/// Create a new, empty list.
pub(crate) fn new() -> Self {
    Self {
        listeners: alloc::vec![Entry::Sentinel],
        <snip: irrelevant fields>
    }
}

This list never has elements removed from it. It works a lot like the slab crate does; it leaves those slots open for future entries to fill. Remember that “out of bounds” error?

index out of bounds: the len is 0 but the index is 4

“len is 0”? The length of this list can only ever be one or more. So unless I’m accidentally calling remove() or truncate() on that list at some point, the length should never be zero.

On top of that, I noticed this little detail in the logs as well, for another failed test:

called `Result::unwrap()` on an `Err` value: Full(Notify { count: 1, is_additional: true })

This operation results from a concurrent queue push operation, where the queue is initialized like this:

pub(super) fn new() -> List<T> {
    List {
        queue: concurrent_queue::ConcurrentQueue::unbounded(),
        <snip: irrelevant fields>
    }
}

Unbounded queues can never be full. It doesn’t happen; they just allocate more memory to make room. At this point I curled my nose as the bitter aroma of memory corruption started to waft my way.

“Maybe it’s a rustc bug” I say to myself, still in the “denial” phase. I had to keep digging deeper; hopefully I can find the actual cause of the error and set up a minimal reproduction of it.

Damaged Dependencies

At this point, it occurred to me that one of the differences between my machine and the Actions virtual machine is that I have a Cargo.lock with some older dependencies in it. In most of my projects I don’t commit Cargo.lock to Git, even though it’s the default now.

So I back up my current Cargo.lock as a “last good state” (foreshadowing for later) and try the tests again.

$ wasm-pack test --node --no-default-features --features portable-atomic

$ cargo update
    Updating crates.io index
    Updating concurrent-queue v2.3.0 -> v2.4.0
    Updating crossbeam-utils v0.8.16 -> v0.8.17
    Updating futures-lite v2.0.1 -> v2.1.0
    Updating itoa v1.0.9 -> v1.0.10
    Updating js-sys v0.3.65 -> v0.3.66
    Updating libc v0.2.150 -> v0.2.151
    Updating once_cell v1.18.0 -> v1.19.0
    Updating portable-atomic v1.5.1 -> v1.6.0
    Updating proc-macro2 v1.0.69 -> v1.0.70
    Updating ryu v1.0.15 -> v1.0.16
    Updating serde v1.0.192 -> v1.0.193
    Updating serde_derive v1.0.192 -> v1.0.193
    Updating syn v2.0.39 -> v2.0.41
    Updating wasm-bindgen v0.2.88 -> v0.2.89
    Updating wasm-bindgen-backend v0.2.88 -> v0.2.89
    Updating wasm-bindgen-futures v0.4.38 -> v0.4.39
    Updating wasm-bindgen-macro v0.2.88 -> v0.2.89
    Updating wasm-bindgen-macro-support v0.2.88 -> v0.2.89
    Updating wasm-bindgen-shared v0.2.88 -> v0.2.89
    Updating wasm-bindgen-test v0.3.38 -> v0.3.39
    Updating wasm-bindgen-test-macro v0.3.38 -> v0.3.39
    Updating web-sys v0.3.65 -> v0.3.66
$ wasm-pack test --node --no-default-features --features portable-atomic

“Aha! It’s one of the dependencies!” I shout aloud to myself, fully aware no one is listening. “It’s not my fault!”

Put yourself in my shoes for a second. It’s probably portable-atomic, right? I mean, the error only happens when portable-atomic support is enabled. It’s a crate that adds a polyfill for the core::sync::atomic module that works on embedded platforms without atomics. It’s very thorough and has support for many different architectures and platforms. Since it has a lot of unsafe, that makes it very easy to blame.

So, let’s update it and see what happens.

$ cp Cargo.lock.old Cargo.lock
$ cargo update portable-atomic
    Updating crates.io index
    Updating portable-atomic v1.5.1 to v1.6.0
$ wasm-pack test --node --no-default-features --features portable-atomic

The tests passed… I’m sorry, what?

So it’s a dependency update… but not in portable-atomic? Weird, I guess I can run through the other dependencies we updated and see which one is the squeaky wheel. Let’s start with concurrent-queue, it seems innocent enough.

$ cp Cargo.lock.old Cargo.lock
$ cargo update concurrent-queue
    Updating crates.io index
    Updating concurrent-queue v2.3.0 -> v2.4.0
$ wasm-pack test --node --no-default-features --features portable-atomic

At this point, I did a double take at my monitor. What?

concurrent-queue is a smol crate that provides a queue that can be accessed concurrently. It’s significantly more efficient than having a Mutex around a non-synchronous queue, especially when multiple threads are pushing to or reading from the queue. It also has a lot of unsafe code which, under normal circumstances, I would be happy to blame.

But, I was the one to push that update from v2.3.0 to v2.4.0. The only real change was getting rid of a heap allocation. This didn’t add or remove any unsafe code; it just moved some fields onto the stack instead of the heap… actually, it was still in the heap, since in event-listener the queue was wrapped with an Arc, which allocates on the heap.

Still, if you’re looking for someone to blame, concurrent-queue seemed like it was caught red-handed. The build broke when I updated it, that was the smoking gun!

Goose Chase Gallery

My attention was piqued when I noticed that I’d forgotten to add WebAssembly tests for concurrent-queue. If this was a concurrent-queue issue, it would be hidden by these lack of tests. My suspicions were confirmed when I ran tests using wasm-pack and saw that, yes, they were failing with portable-atomic being enabled.

I began to settle in for the grueling task of reviewing atomics to make sure that all of the operations were correct. Especially for a crate as intricate as concurrent-queue, this could take weeks. Except… let me check those tests again.

Oh wait, the only tests that are failing are the ones that try to spawn threads. Of course they fail, you can’t spawn threads on WASM! Let’s mask those tests out, and..

Wait, the tests work? Hey, how’s that possible?

Maybe… maybe the leftover tests aren’t good enough. Yes, maybe the bug I’m looking for isn’t covered by these tests, so I need to write more tests. So I need to finally figure out how web workers work. Then I’ll need to rewrite my tests, nay, my test harness, to properly run all of these test cases?

Alternatively, maybe we have the wrong guy.

`miri` Magic

Have you ever watched a police procedural? Usually, they play a trick on you in the middle of the episode. A crime happens, and after some detective work they find a guy who looks like they did it. They have evidence! The smoking gun! But our protagonist, a rookie cop who’s flexible about his position on torturing civilians, doesn’t think they did it. But their completely reasonable qualms that might come up during trial are ignored by the rest of the police force. Then the protagonist confronts the actual perpetrator, who’s usually some poorly developed background character, and brings him to justice.

In this metaphor I’m the police force, and I thought I’d caught concurrent-queue red-handed. But something didn’t sit right.

Before I got too deep into concurrent-queue’s innards, I realized something. In my CI, I wasn’t running miri tests for the case where portable-atomic was enabled. This meant that, if this was a memory corruption error, it wasn’t being caught by miri.

This introduced a thought into my head for the first time, a thought I should’ve had three days ago. Maybe it isn’t strictly a WASM issue?

So I ran miri with portable-atomic support, and sure enough:

$ cargo miri test --no-default-features --features portable-atomic --tests



test drop_non_notified ... error: Undefined Behavior: constructing invalid value: encountered an unaligned reference (required 128 byte alignment but found 16)
   --> /home/jtnunley/.rustup/toolchains/nightly-x86_64-unknown-linux-musl/lib/rustlib/src/rust/library/core/src/ptr/mut_ptr.rs:367:57
    |
367 |         if self.is_null() { None } else { unsafe { Some(&*self) } }
    |                                                         ^^^^^^ constructing invalid value: encountered an unaligned reference (required 128 byte alignment but found 16)
    |
    = help: this indicates a bug in the program: it performed an invalid operation, and caused Undefined Behavior
    = help: see https://doc.rust-lang.org/nightly/reference/behavior-considered-undefined.html for further information
    = note: BACKTRACE:
    = note: inside `std::ptr::mut_ptr::*mut event_listener::Inner<()>>::as_ref::<'_>` at /home/jtnunley/.rustup/toolchains/nightly-x86_64-unknown-linux-musl/lib/rustlib/src/rust/library/core/src/ptr/mut_ptr.rs:367:57: 367:63
note: inside `event_listener::Event::try_inner`
   --> /home/jtnunley/Projects/smol-rs/event-listener/src/lib.rs:443:18
    |
443 |         unsafe { inner.as_ref() }
    |                  ^^^^^^^^^^^^^^

Specifically, I noticed this error:

required 128 byte alignment but found 16

What this is telling me is that we are trying to dereference a pointer that isn’t properly aligned. So, where is this pointer coming from?

/// Return a reference to the inner state if it has been initialized.
#[inline]
fn try_inner(&self) -> Option<&Inner<T>> {
    let inner = self.inner.load(Ordering::Acquire);
    unsafe { inner.as_ref() }
}

Okay, this is just the pointer we’re storing in the self.inner atomic variable. How are we calculating that pointer?

// Allocate the state on the heap.
let new = Arc::new(Inner::<T>::new());

// Convert the state to a raw pointer.
let new = Arc::into_raw(new) as *mut Inner<T>;

// Replace the null pointer with the new state pointer.
inner = self
    .inner
    .compare_exchange(inner, new, Ordering::AcqRel, Ordering::Acquire)
    .unwrap_or_else(|x| x);

So, inner is just the return value of an Arc allocation? Weird. The standard library’s types are usually pretty well-tested. Not to mention, I’m sure that other programs make use of dereferencing the into_raw too, and they would have seen this error before me right now. Let me check my imports…

Oh no.

#[cfg(feature = "portable-atomic")]
use portable_atomic_util::Arc;

Oh no.

Arc Apocalypse

portable-atomic-util is a sort of “side crate” to portable-atomic. It’s where you put things that are too specific to be put in portable-atomic, but are probably still useful.

One of the tools in portable-atomic-util is an implementation of Arc. It’s similar to the one in the standard library, but instead of using atomics from the standard library it uses portable-atomic. Unfortunately it’s not a 1:1 copy of the one from the standard library, as the standard library makes use of unstable features that can’t be used from normal library crates.

So, let’s take a look at its into_raw implementation. It looks a little like this:

// The inner heap allocation of an `Arc`.
#[repr(C)]
struct Shared<T: ?Sized> {
    /// The reference count of the `Arc`.
    header: Header,

    /// The value that is being reference counted.
    value: T,
}

impl<T> Arc<T> {
    #[must_use]
    pub fn as_ptr(&self) -> *const T {
        // Get the raw pointer.
        let ptr = self.shared.as_ptr() as *mut u8;

        // Add the size of the header so that it points to the value.
        let new_ptr = strict::map_addr(ptr, |addr| addr + mem::size_of::<Header>());

        // Cast the pointer to the correct type.
        strict::with_metadata_of(new_ptr, self.shared.as_ptr() as *mut T)
    }
}

It’s pretty simple. Our shared pointer points to an instance of the Shared structure on the heap, and we want it to point to the value field. That way, when the user dereferences the pointer, they get the underlying T.

Unfortunately we can’t use addr_of_mut like the standard library yet. It’s stable, but portable-atomic-util’s MSRV is lower than the version where it was introduced. So we do the math manually. The value will be after the Header, so just add the size of the Header to get the pointer. Right?

Wrong.

What the author of this code failed to consider is alignment. A lot of structures need to have their address be a multiple of a certain number, usually the size of their largest field. The reasons for this are related to the underlying CPU architectures and I can’t really get into that. This article is looking pretty long as it is.

However, the ConcurrentQueue type has an alignment of 128. This is related to how cache lines are implemented on the x86 architecture. When you have frequently changed atomic data, you want to put in a separate cache line than other data. That way, the processor won’t need to reload that other data too frequently. No, we really don’t have time.

So, it is possible for Shared above to be laid out with the Header at the start, then have some number of padding bytes before the actual value field. This means that the pointer addition from above is now pointing into padding, rather than the actual T value. That explains the memory corruption.

It also explains why it’s only started to cause undefined behavior now. In v2.3.0 of ConcurrentQueue, the data with an alignment of 128 was stored inside of a heap allocation. Since heap allocations have an alignment of the platform word size (in the case of 64-bit architectures, 8), there was no padding between the Header and the value here. It was only when that heap allocation was removed and the alignment was increased that this problem appeared.

Say, who wrote this code? I’ll need to find them and give them a piece of my mind. git blame can tell us who, exactly, is responsible for this mess.

Egads! It was me all along!

This bug is no longer in portable-atomic-util. I filed a PR which has now been merged, and the versions of portable-atomic-util with that bug have been yanked from crates.io.

To come full-circle, let’s see if our tests work with the newest version of portable-atomic-util.

$ wasm-pack test --node --no-default-features --features portable-atomic

Awesome!

Parting Shots

So, what’s my conclusion? What did we learn from this little escapade?

I’d like to bring your attention to the source of the original soundness bug.

#[must_use]
pub fn as_ptr(&self) -> *const T {
    // Get the raw pointer.
    let ptr = self.shared.as_ptr() as *mut u8;

    // Add the size of the header so that it points to the value.
    let new_ptr = strict::map_addr(ptr, |addr| addr + mem::size_of::<Header>());

    // Cast the pointer to the correct type.
    strict::with_metadata_of(new_ptr, self.shared.as_ptr() as *mut T)
}

Eagle-eyed readers might have noticed that this code has no unsafe. Not even a line. Sure, it does some risky pointer math, but that’s considered safe in Rust.

Also, keep in mind that concurrent-queue had no bugs introduced either. It just changed the alignment of a structure from 8 to 128. That can be done in safe Rust; hell, it isn’t even a minor change according to SemVer.

Here is the actual unsafe code that caused this unsoundness.

#[inline]
fn try_inner(&self) -> Option<&Inner<T>> {
    let inner = self.inner.load(Ordering::Acquire);
    unsafe { inner.as_ref() }
}

It’s not unsound; it’s not even incorrect. This code was checked, audited and did everything that it was supposed to do. It was a combination of two pieces of 100% safe code from another crate that caused this bug to happen.

When I begun this article, I talked about how you need to check your unsafe code. What I wanted to prove is that you can’t just check your unsafe code. You need to check each and every line of safe code too. Safety is non-local, and a bug in safe code can easily cause unsound behavior in your unsafe code if you’re not careful.

For me, I’m going to be extra careful when I write new unsafe logic. I advise that you do the same.

Evaluating new software forges

2023-12-15T00:00:00+00:00

What options are there other than GitHub?

Update: src.notgull.net is no more. I’ve moved my code to Codeberg at this point, although I still self-host my CI system.

Oh boy, I sure do love contributing to open source software on the largest software forge in the world! I hope they haven’t started down the slow and painful process of enshittification by following vague, ill-defined industry trends!

Wait, what’s that? Computer! Enhance!

Well, I guess I’m ready to find a new forge!

Software Host Hellscape

In all seriousness, I’ve been looking to move off of GitHub for a while now. Let me be clear, GitHub is still far and away the best website for open source discovery. Not to mention, its CI offerings are very nice, especially for something free. Yes, there are better paid CI offerings, but for something that costs zero dollars I’ve found it incredibly useful.

However, one thing has made me skeptical of GitHub is its “Copilot” offering. I’ll admit, I was in the beta program for Copilot, and found it really neat. Being able to write large amounts of code from small comments was very nice, even if it was really bad practice.

Then I found out it was training on GPL-licensed data, which left a pretty bad taste in my mouth. In addition to the fact that I’m increasingly uncomfortable with hosting my free software on a closed source forge, run by Microsoft.

Let’s take a look at everybody else.

GitLab Gauss

GitLab is the original GitHub competitor. The Linux to Microsoft’s Windows, or the MariaDB to Oracle’s MySQL. This has made it the most popular GitHub competitor by far, by virtue of people vocally quitting GitHub in favor of [GitLab].

Unfortunately, I didn’t really consider GitLab when I was finding a place to move. First of all, they aren’t actually open source. They’re “open core”, which, I admit, is better than closed source. However, like I said, I’m uncomfortable building free software on infrastructure that isn’t.

I know I can download GitLab and set it up on my own server. However, I’m a software developer, not a sysadmin. I want to spend my time developing software, not putting out fires and paying AWS bills for the rest of time.

Also, GitLab has adopted the unfortunate strategy of “following along with whatever GitHub does”. They’ve tried to jump onto the bandwagon so frequently, they’ve gotten splinters. For instance, what happens when we check their homepage?

Computer! Do the thing again!

Good golly, it’s even the same wording! Yeah, I’ll pass.

SourceHut Scramble

sr.ht takes the opposite approach as GitLab. Instead of trying to follow along with GitHub’s trends, it’s elected to do go in the other direction. Whenever GitHub does something, SourceHut does the exact opposite.

Pull requests? Too centralized, let’s construct a suitable code contribution system around email. Discussion? Why not IRC, it’s been around since the Bronze Age. Get rid of Mercurial support? Not interested.

I really like SourceHut. When you go to their homepage, they’re not showing off their fancy CSS effects or telling you about their AI offerings. They give you a simple user interface and some of the projects they host.

There was also much to impress me. Their CI offerings are better than GitHub, which alone justified me paying the humble $2/month price tag. Rather than needing a complicated YAML file to run a CI system, it’s just cloning Git repos and running commands. It’s delightfully simple yet powerful. Having native BSD and Plan9 runners doesn’t quite make up for its inability to run Windows, but I’m sure I can work around that.

Not to mention, SourceHut has the second best repository discovery system. When I go to sr.ht’s “explore” tab, I’m immediately greeted by a slew of interesting projects. Whether it’s a powerful Forth dialect that brings a lot of genuinely exciting ideas to the table, or a tiny C11 compiler written in simple ANSI C, I’m always amazed whenever I open up that tab.

I liked it so much that I announced that I was moving my personal projects to SourceHut. However, after moving my theo project to SourceHut, I found myself dissatisfied with a few things.

For one, the email-based workflow was a lot clunkier than I expected. In theory, building code contribution on top of a standard protocol that’s been around since the 60’s sounds like a good idea. In practice, it’s a lot clunkier than you’d expect, especially since most modern email clients are simply not built to read and write code.

After trying it for myself I can see that it might turn off a lot of people from contributing. I’m already losing a lot of potential contributors by moving off of GitHub. I don’t need those remaining contributors to also be turned off by a workflow completely different than what they’re probably used to.

Still, I can imagine this workflow working for many people, especially ones who already have a decent setup for email-based projects like Linux.

Codeberg

Codeberg is a public instance of Forgejo, which is in turn a fork of Gitea. It’s got a pretty nice interface similar enough to GitHub’s. It’s got the familiar pull-request-based contribution interface. The CI is good enough, I suppose. Docker containers aren’t the best CI environment, but I can certainly think of worse.

So what’s not to like?

My main problem is that Codeberg has a very limited CI capacity, and I have a lot of projects that require significant testing. theo, for instance, requires these things to be tested:

Make sure it compiles on Windows, Mac, Linux, WASM, Redox, and whatever oblique platform people run Rust on nowadays.
Check the various different backends that theo supports: pure software rendering, wgpu and OpenGL. Not to mention all of the different interfaces to OpenGL, so wgl, GLX, EGL…
Check formatting and linting.

…which doesn’t even cover testing. For rendering frameworks like theo, you want to have some pre-defined rendering programs that render your code to images. That way, you can regenerate these images and compare against an existing set of images in order to check for regressions.

This isn’t a practical concern, although it really should be. It’s a moral concern. You have an organization like Codeberg, donating a significant amount of time and resources to try to make a positive difference in the world of software. Now, here I am, sucking up all of those compute resources for my insignificant little projects.

Of course, while pondering this moral concern, I realized that I’ve locked myself into a Catch-22. I can’t use any independent project’s CI because of my concerns that I would drain too many of their resources. On the other hand, I can’t use any large company’s CI because I don’t want to host my project with a large company. I can’t self host, because that would be a pain.

Would it?

Self-Hosting Gitea

I said I didn’t want to self-host. I worked in IT for two years, so I’ve already gotten my fill of fighting with both servers and people.

However, in a Discord I’m in, an acquaintance of mine talked about how they set up Gitea and Drone CI on a school Kubernetes cluster they had access to. I mentioned my predicaments in finding a forge service, and they said that it was only two configuration files.

That tempted me. Not enough to deal with the absolute nightmare that is Kubernetes, but enough for me to rent out a couple of Lightsail servers to experiment.

I’d like to say that I set up the entire thing in a weekend, but it wasn’t that simple. Sure, it was easy enough to install Gitea and Drone in Docker containers. Sure, it wasn’t too hard after that to set up my DNS records to forward src.notgull.net to that new serer. Yeah, it’s probably hacky to set up my CI system on a public cloud, but as long as I keep people from abusing it that aren’t me it shouldn’t be too bad, right?

Of course, I forget my crucial weakness: my perfectionism. Sure, my site looks pretty good… but it looks a bit ugly. Let’s play around with themes for three days until I find one I like. Oh, the logo doesn’t look good with my new theme. Let’s convert the MS paint drawing I call my avatar into SVG and then set it up to be this site’s logo. Oh, Gitea has a weird templating system; let’s compile it from scratch!

At the end of this week(!)long process, I had a somewhat functional Git forge, with the following features:

A decent Linux Drone CI setup.
A Dependabot clone for automatic updates.
A system that allows everyone to create an account and open issues, but doesn’t allow people to create repos or mess with the CI.
- If you want to create a repo for PR’s sake, email me at dev at notgull dot net and I can set you up.
A mirror to GitHub, so I can still take advantage of their code discovery.
- Yeah yeah, I know, but keeping an arm’s length from GitHub is a win in my book.

I’ve uploaded a lot of my code to there, and it seems to be working well so far. I have to admit, it is somewhat liberating to have full control of how your code is forged.

Parting Shots

I still consider this to be in the “experimental” stage. If it ends up being too inconvenient or too expensive, I’ll probably move it somewhere else. Still, having my own space for code that I can do whatever I want with is very nice. Let’s hope it keeps working well and this blog post doesn’t age like milk.

All photos are from public websites and fall under free use, as this is a review.

My new, remote-access Rust development setup

2023-11-12T00:00:00+00:00

I’ve set up a new system for Rust development work.

I work on quite a few crates in the Rust ecosystem. Since I started taking Rust seriously back in 2021, I’ve been using the same laptop that I’ve used since high school.

It’s actually a pretty beefy laptop, all things considered. It has an i5 CPU and 16 GB of RAM. It ran Ubuntu for the majority of its life but now runs Alpine Linux. These stats were definitely overkill for a high schooler playing around with Python. Even now, it still works with smaller codebases. Honestly, it’s an exceptionally first world problem to even be complaining about.

However, I’ve started to hit the limits of this computer. 16 gigabytes of RAM is a lot when you’re dealing with codebases like smol. Even if you put all of smol’s subcrates together, rust-analyzer barely uses up four gigabytes.

However, lately I’ve been working on codebases that overflow my limits. winit is a medium-sized codebase at around 25KB lines, but with all of the dependencies it makes rust-analyzer slow to a crawl. x11rb has a lot of automatically generated code; I’ve actually experienced a kernel panic because rust-analyzer took up all of my memory processing x11rb, including an additional 16 gigabytes of swap space.

Not to mention, once my laptop touches swap space, it might as well be over. Memory access slows to a crawl, and the latency between my thoughts and my applications might as well be dial-up.

Time for an Upgrade

Since I started this blog in March of this year, I’ve managed to trick a company into giving me money in exchange for writing code for them. Around Prime Day, I decided to burn a paycheck on an upgrade.

My goal was to build a server that I can access remotely. I move around a lot; I like to write code at various hackerspaces and coffee shops around the city. It’s a nice, social environment where I can talk to (or at least be around) people while I code. I’m already an introvert, so I don’t want to be cooped up inside if I want access to serious computing power.

The remaining goal was absolute overkill. I don’t want to have to upgrade this machine for the forseeable future. Therefore, I’m going all-out on hardware and software.

The Hardware Specs Section

I built this computer around the AMD Ryzen 9 5950x processor. It was half-off for Prime Day, which is what prompted this entire process. It has 32 cores and is reasonably fast. I’m aware I could go higher, but I don’t want to break my bank account for something I’m not being paid for.

Don’t get me wrong; my laptop’s CPU was already good enough for my use case. With Rust’s incremental builds, I very rarely spend more than a few seconds waiting for feedback from rust-analyzer. Still, from-scratch compiles took a few minutes, especially for larger crates.

With 32 cores, I can easily parallelize crate builds. cargo install ripgrep completes in less than a minute. Combined with an absolutely overkill 128 gigabytes of RAM, I can build any crate I want to in record time.

My workload isn’t GPU intensive, so I grabbed an AMD Radeon 570x to use as a GPU to tie the build together. I combined this with an ASRock motherboard, a CoolerMaster chassis and liquid-cooling system, and a 4TB SSD to tie the rest of the computer together. The end result is a system that I don’t intend to use directly, but will be very nice to use remotely.

Putting It All Together

I’d like to muse briefly on putting this computer together. I’ve never actually built a computer before. Usually, when I’m in need of a new computer, I don’t have time to go through the process of figuring out which parts go where. So I’ve only used prebuilt systems until now.

I’ve heard from some people that putting together a PC is like building Legos. While this is true in some respects, keep in mind that these are 2000 dollar legos, where in some cases it might be unclear what part goes where. This leads to a lot of stress; especially around installing the CPU. One bent pin means that you have to throw out the CPU.

Thankfully, on attempt number three, my friend and I successfully managed to put the CPU in the socket and install the liquid-cooling system. Everything else screwed into place relatively easily. The motherboard goes into the chassis, and everything hooks into the motherboard from there.

However, I made two crucial mistakes. First, I ordered a 3U chassis where I actually needed a 4U chassis. I also needed a 750W power supply instead of the 500 W one I originally calculated I needed. Amazon has a decent return policy, so I was able to return those parts and get the ones I actually need.

Finally, after putting it all together, it refused to boot! We thought we’d installed the CPU wrong. After crawling over the manuals and forums for the motherboard a few times, we figured out that e needed to plug in a keybard. Weird. Anyways, one Alpine Linux install later, I had a working PC.

Utilizing the System

I’ve set up this PC in the attic. Thankfully, the liquid cooling is quiet enough so that I don’t hear it when I’m trying to sleep.

Now, I want to be able to access this from anywhere. Like I said, I want to be able to use this computer like I’m sitting down at it, even if I might be across the country. Originally, I was going to set up an SSH tunnel between my laptop and this server, using an AWS Lightsail instance as an intermediary.

However, then I found out about Tailscale (not sponsored), which fit my use case much better.

So, I’ve scaled my tail. I’ve hooked up my new server, my phone and my laptop to a tailnet. SSH is a breeze and setting up other services is a cinch as well.

I’ve been getting used to code-server. In fact, I’m writing this in there right now! I’ve tried to get used to Neovim and Emacs and other editors like that. However, I’m just too used to the VS Code workflow at this point, especially since using it for work. Generally, I don’t notice a difference between code-server’s interface and VS-Code.

Desktop Dismay

However, there is one cinch. When testing out winit, I need direct GUI access. Remote X11 just won’t cut it, especially for rendering.

I’ve installed a VNC server on my server, and I can just tab over to my VNC client whenever I need to run a GUI application. Still, it’s a somewhat awkward workflow. I’m open to new ideas.

Conclusions

There isn’t a moral to this story, I just wanted to talk about my new remote workflow. I hope this overview inspires similar remote flows in the future.

Recreating concurrent futures combinators in smol

2023-10-22T00:00:00+00:00

futures comes with many additional combinators that smol doesn’t have out of the box. We can rebuild them, better.

Whew, it’s almost been a month since my last blogpost here. This was because I was spending time doing research and testing, and not because I lost the PGP key that allows me to upload to this site. No sir, how could anyone be that irresponsible?

…or maybe I was just using the PGP key as an excuse not to write? It’s not like I’m being paid to psychoanalyze myself in front of you people.

It doesn’t matter, we’re back! Let’s talk about smol.

The Problem with `futures`

futures was originally released in 2016 to provide an implementation of asynchronous programming for Rust. In the time since, it’s accumulated a lot of baggage. Many of its combinators have been superseded by the async/await syntax, meaning that a large amount of its API surface isn’t relevant anymore.

For instance, take the Map combinator. It takes the value of some Future and maps the return value through some closure.

use futures::prelude::*;

let fut = async { 1 };
let mapped_fut = fut.map(|x| x * 2);

In the pre-2018-edition days, these combinators were the only way to manipulate the value of a Future. They were completely necessary for using these asynchronous values back in the day. Nowadays, we can just wrap the original futures using async/await and treat it more like a normal expression.

let fut = async { 1 };
let mapped_fut = async { fut.await * 2 };

Therefore, in this brave new post-async world, many of these combinators became unnecessary. futures-lite, one of the core components of smol, was created to address this new reality.

futures-lite explicitly ignores all combinators that can be implemented using async/await or features that have already been moved into the standard library. This leaves behind a small, clean subset of the API that builds fast.

A semi-frequently asked question I see goes along the lines of: “I was porting my application over to smol, but I noticed that it doesn’t have for_each_concurrent or buffered. Is this API excluded purposely?”

This is a reasonable question. The short answer is “yes”, and the medium answer is “these concurrent functions have small but frustrating problems that futures-lite avoids by not implementing them”. This article is the longer answer.

Concurrency Conundrum

I would argue that the concurrent Stream stream combinators mentioned above are a code smell. Well-formed production-ready code should not use for_each_concurrent or buffered. If I knew how compilers worked, I would suggest a Clippy lint that would flag these functions as a warning.

The for_each_concurrent function is called like this:

let my_stream = /* ... */;
my_stream.for_each_concurrent(
    None, // Limiter parameter, not important for now.
    |x| async move { do_something(x).await } // Closure to run for each element.
).await;

To massively oversimplify, for_each_concurrent does this:

use futures::prelude::*;
use futures::stream::FuturesUnordered;

let my_stream = /* ... */;

// Create a list to store all of the futures.
let mut futures_list = FuturesUnordered::new();

// Get all of the values from our stream.
while let Some(x) = my_stream.next().await {
    // Push the future to the list.
    futures_list.push(async move { x + 5 });
}

// Wait for all of the futures to complete. FuturesUnordered polls each future
// in order and returns their results.
futures_list.for_each(|()| {}).await;

FuturesUnordered is sort of an unholy combination of an executor and a Stream. It collects a bunch of Futures into a list, then maintains a queue of which Futures are ready to be polled. Once a Future returns that it is Ready, it returns that Future’s value.

This means that, every time you call for_each_concurrent, it creates an entire new executor, runs the Stream to completion on it, then discards that executor entirely. buffered is implemented in a similar way.

It’s bad for a couple of reasons. Most async runtimes already provide an executor. tokio provides one out of the box, and smol encourages you to create and optimize your own. By using for_each_concurrent or buffered, you are essentially ignoring your previous executor in order to spawn a short-lived temporary executor.

In addition to the resources that are wasted on the new executor, it’s often less efficient than async runtime executors. tokio and smol support options to let you offload tasks on other threads or handle contention more efficiently. In contrast, FuturesUnordered is a relatively naive executor that is completely unaware of its surrounding runtime.

Not to mention, FuturesUnordered comes with a few footguns that make it impractical for common use cases.

Make your own, better combinator

In smol, you can emulate these use cases somewhat easily. First, you need to create an Executor and execute your features in its context.

let ex = smol::Executor::new();

ex.run(async {
    // The code written below will take place in this context.
}).await;

You can emulate for_each_concurrent by turning every future in the stream into a task, then awaiting all of those tasks. Here’s how it looks if you don’t have a task limit:

use smol::prelude::*;

let my_stream = smol::stream::iter(vec![1u32, 2, 3]);

// Spawn the set of futures on an executor.
let handles: Vec<smol::Task<()>> = my_stream
    .map(|item| {
        // Spawn the future on the executor.
        ex.spawn(do_something(item))
    }).collect().await;

// Wait for all of the handles to complete.
for handle in handles {
    handle.await;
}

Here, we spawn every future involved onto the executor. We then take all of the task handles and collect them. Since we are running inside of the executor, all of these tasks will be run in parallel. Finally, we just .await on each handle to wait for all of the tasks to complete.

The best part is that the allocation, the Vec>, isn’t even necessary. It could be one-time allocation that is just extended to hold the tasks.

Generally, it doesn’t matter how many tasks are spawned onto the global executor. In contrast to the mini-executor that for_each_concurrent spawns, the global executor is designed to handle large numbers of tasks. However, if you still want to impose a resource limit, you can use a Semaphore.

use smol::prelude::*;
use std::sync::Arc;

let my_stream = smol::stream::iter(vec![1u32, 2, 3]);
let my_limit = 5;

// Semaphore for limiting the number of tasks.
let semaphore = Arc::new(smol::lock::Semaphore::new(my_limit));

// Spawn the set of futures on an executor.
let handles: Vec<smol::Task<()>> = my_stream
    // We use using `then` now, since we need to `.await` for the 
    // semaphore to have a permit available.
    .then(|item| {
        // Borrow the semaphore and executor.
        let (ex, semaphore) = (&ex, &semaphore);
        async move {
            // Wait for a semaphore permit.
            let permit = semaphore.acquire_arc().await;

            // Spawn the future on the executor.
            ex.spawn(async move {
                // Run our future.
                do_something(item).await;

                // Drop the permit to let another task run.
                drop(permit);
            })
        }
    })
    .collect()
    .await;

// Wait for the remaining handles to complete.
for handle in handles {
    handle.await;
}

This works by having each task borrow a Semaphore permit. The semaphore is sort of like a Mutex that can be locked by multiple parties at once, up to a certain limit. Once it runs out of permits, this code doesn’t spawn a task until one of the tasks completes. The permit is moved into the task and is dropped once the computation completes.

The then-stream above is also practically the equivalent of buffered. It yields tasks that can then be awaited to get their results.

// snip: semaphore and stream setup

// This time, we do something else that maps the value to another.
async fn do_something_else(x: u32) -> u32 { x + 1 }

// Get a `Stream` of tasks that can be `await`ed to get their value.
let buffered_stream = my_stream
    .then(|item| {
        // Borrow the semaphore and executor.
        let (ex, semaphore) = (&ex, &semaphore);
        async move {
            // Wait for a semaphore permit.
            let permit = semaphore.acquire_arc().await;

            // Spawn the future on the executor.
            ex.spawn(async move {
                // Run our future.
                // NEW: This now returns a value.
                let result = do_something_else(item).await;

                // Drop the permit to let another task run.
                drop(permit);

                // NEW: Return the result of the inner future.
                result
            })
        }
    });

// NEW: Because the stream uses an unpinned async closure,
// we have to pin it.
smol::pin!(buffered_stream);

// NEW: We can now wait on this stream for its values.
while let Some(task) = buffered_stream.next().await {
    println!("Value: {}", task.await);
}

This is all practically more efficient than buffered while giving you much greater control over how it runs.

Parting Shots

Unfortunately there’s not much documentation for the fact that for_each_concurrent and buffered spawn their own separate executors. Raising awareness of proper async code is powerful, in my opinion, as it unlocks a whole new world of computations for intermediately experienced Rustaceans. I hope this makes it clearer what should be happening in well-formed code.

Eyra is an interesting Rust project

2023-09-25T00:00:00+00:00

In the eternal quest to rewrite everything in Rust, even the C standard library isn’t safe from carcinisation.

Modern Rust programs are, for the most part, written mostly in Rust. For networking applications, the entire asynchronous stack is Rusty; no libuv in sight, only mio and polling. There is a robust rendering stack based on tiny-skia and cosmic-text. Even if you need FFI, the story is still pretty good. x11rb provides a robust wrapper around libxcb with a fully Rust-based alternative, and wayland-rs is the same with Wayland.

Still, if you want to write pure Rust programs, there is one annoying dependency that nearly every Rust program has. Let’s take a basic smol-based program, written top-to-bottom in Rust. Or so I think. Let’s see what ldd says.

$ ldd linux-timerfd
        linux-vdso.so.1 (0x00007fff58fae000)
        libgcc_s.so.1 => /lib/x86_64-linux-gnu/libgcc_s.so.1 (0x00007fe7681bb000)
        libc.so.6 => /lib/x86_64-linux-gnu/libc.so.6 (0x00007fe767e00000)
        /lib64/ld-linux-x86-64.so.2 (0x00007fe7682d4000)

Blech, disgusting! Let’ go over each of those libraries:

linux-vdso is the vDSO, which is used to implement certain system calls that can be reasonably implemented in user space. This object is used to prevent needing to incur syscall overhead unless it’s needed.
libgcc provides implementations for certain operations, like floating point operations and exception handling. The “_s” stands for “shared”, since it is a shared library.
ld-linux-x86-64 is the dynamic linker runtime. It’s what holds everything together.
libc is the C standard library, which contains wrappers around every relevant system call as well as a handful of C-oriented routines.

We’re going to spend most of our time talking about libc.

What is libc?

libc wears a lot of hats. It provides an extensive library of functions that are useful for C programmers, like fopen and memchr. It also contains more platform-specific wrappers around OS-specific functionality, like kqueue. In addition, for many operating systems, it’s the only stable interface between the user space and the kernel.

This part is important. Usually system calls, special interrupt instructions, are used to tell the kernel to do something important. However, these system call interfaces are usually unstable and prone to rapid, undocumented changes. This means that anyone trying to access kernel functionality has to go through libc, even if they aren’t actually C. This isn’t a suggestion; Go tried to use direct system calls on macOS a while back and got burned by it. It turns out, when they say “unstable”, they actually do mean “will change in inconsistent, backwards-incompatible ways”.

There are two important exceptions to this. The first is Windows, which has its own Win32 API that its libc is just a thin-ish wrapper over. This wrapper is how C programs written for Linux can sometimes still be used on Windows if it just uses the portable parts of libc. For our purposes it isn’t important. The second exception is Linux, which actually does have a stable system-call interface. You can call it from anywhere without going through libc, and you don’t have to worry about the actual calls changing out from under you.

Musl Melee

Because the userspace interface for Linux is the system calls and not the libc, you actually have a choice in what implementation of libc to use aside from “whatever the OS developer wants you to use”. There are two prominent implementations:

The GNU C Library (glibc), which is the battle-tested full-featured implementation.
Musl libc, which aims to be a simpler implementation focused on static linking.

In fact, if you were wondering what the “gnu” at the end of “x86_64-unknown-linux-gnu” means, it stands for the GNU C Library that Rust is using as an interface to the system. If we have the x86_64-unknown-linux-musl target installed, we can switch that out for Musl pretty easily.

$ rustup target add x86_64-unknown-linux-musl
$ cargo build --target x86_64-unknown-linux-musl

I wonder what ldd says instead now?

$ ldd linux-timerfd
        statically linked

Well, look at that! By default, the *-musl option automatically statically links the binary. No more dependencies! Everything is good, forever!

Rustix Revelation

Hmm, there’s something gnawing at the edge of my mind, like there’s something still wrong with this program. I just can’t put my finger on it.

It must be that there’s still C in there. Even though it’s statically linked, Musl is still a massive blob of unsafe, unsound, filthy C code.

Well, actually, Musl, and glibc for that matter, are extraordinarily well tested. Being that it’s a C standard library, it’s actually held to a very high standard for security and soundness.

But after my exposure to the Rustonomicon, my sanity has begun to decay like a flower wilting in winter. So let’s leave the realm of rational thought and imagine what can be. A veil lifts in my mind, revealing the Platonic ideal of a perfect program:

All Rust. Down to the very last bit. Perfect, clean Rust.

If only there was some way that we could tear that C code out by the teeth and leave it rotting at the wayside. But alas, our program needs that little bit of C code to run. Even if we were to rewrite our Rust code to use only syscalls, there would still need to be some glue code for program initialization, signal handling and threading. All written in Assembly and dirty, dirty C.

No, I see something wondrous. The answer to my prayers. What will purify my unclean executables and let them ascend into His Light!

The answer is eyra.

eyra is a set of libraries that aim to replace the role of the traditional libc in modern programs. It is written entirely in Rust, not counting the bits of Assembly necessary to tie the entire thing together. Not even a trace of C.

eyra was written by Dan Gohman, who is also the primary author of rustix. rustix is a safe wrapper around either raw system calls on Linux or libc on other platforms, and is a very fascinating piece of software that deserves its own blogpost. The point is, eyra is rustix taken to its logical conclusion: a complete replacement for libc.

The main drawback of eyra is that the process of integrating it into your program is more involved than just setting --target. But, it’s not so bad. Let’s write an eyra example program.

Enabling Eyra

First things first, let’s let cargo take care of scaffolding for us.

$ cargo new --bin eyra-example
     Created binary (application) `eyra-example` package

Let’s make it a little bit more complex than a “Hello, world!” program. Say, a smol-based TCP server that tells bad jokes.

$ cargo add smol fastrand eyre  
    Updating crates.io index
      Adding smol v1.3.0 to dependencies.
      Adding fastrand v2.0.1 to dependencies.
             Features:
             + alloc
             + std
             - getrandom
             - js
      Adding eyre v0.6.8 to dependencies.
             Features:
             + auto-install
             + track-caller
             - pyo3
    Updating crates.io index

In src/main.rs, we write a simple TCP server:

// in src/main.rs
use eyre::Result;
use smol::io::BufReader;
use smol::net::{TcpListener, TcpStream};
use smol::prelude::*;

const BAD_JOKES: &[&str] = &[
    "What do you call a fly without wings? A walk.",
    "Did you hear about the dull pencil? It was pointless.",
    "Why did the golfer bring two pairs of pants? In case he got a hole in one."
];

/// Handle an incoming connection.
async fn handle_connection(stream: TcpStream) -> Result<()> {
    // Wrap the stream in a BufReader to ease reading lines.
    let mut stream = BufReader::new(stream);

    // Read a line from the stream.
    let mut command = String::new();
    stream.read_line(&mut command).await?;

    // Remove the newline at the end if there is one.
    if command.ends_with('\n') {
        command.pop();
    }

    // Send a joke if the user asked for one.
    command.make_ascii_lowercase();
    if command == "tell me a joke" {
        // Choose a joke and send it.
        let joke = format!("{}\n", fastrand::choice(BAD_JOKES).unwrap());
        stream.get_mut().write_all(joke.as_bytes()).await?;
    } else {
        // Otherwise, send an error message.
        let message = "I only know how to tell jokes.\n";
        stream.get_mut().write_all(message.as_bytes()).await?;
    }

    Ok(())
}

fn main() -> eyre::Result<()> {
    smol::block_on(async {
        // Listen on a random port.
        let listener = TcpListener::bind("127.0.0.1:0").await?;
        let addr = listener.local_addr()?;
        println!("Listening at address {:?}", addr);

        // Start running an executor.
        let ex = smol::Executor::new(); 
        ex.run(async {
            loop {
                // Accept a new connection.
                let (stream, _) = listener.accept().await?;

                // Spawn a task to handle the connection.
                ex.spawn(async move {
                    // If an error occurs while running the task, print it.
                    if let Err(e) = handle_connection(stream).await {
                        eprintln!("An error occurred: {}", e);
                    }
                }).detach();
            } 
        }).await
    })
}

See the comments for a breakdown of how the program works, for those unfamiliar with smol’s API.

When we run the program, it tells us the IP address that it’s listening on:

$ cargo run
   Compiling eyra-example v0.1.0 (/home/jtnunley/Programming/eyra-example)
    Finished dev [unoptimized + debuginfo] target(s) in 0.64s
     Running `/home/jtnunley/Programming/CargoTarget/debug/eyra-example`
Listening at address 127.0.0.1:44439

In lieu of a dedicated client, we can use netcat to test out the server.

$ echo "tell me a joke" | nc 127.0.0.1 44439
Why did the golfer bring two pairs of pants? In case he got a hole in one.

By checking with ldd, we can see that we’ve compiled this program against glibc.

$ ldd /home/jtnunley/Programming/CargoTarget/debug/eyra-example
        linux-vdso.so.1 (0x00007ffe8334d000)
        libgcc_s.so.1 => /lib/x86_64-linux-gnu/libgcc_s.so.1 (0x00007f4bb8376000)
        libc.so.6 => /lib/x86_64-linux-gnu/libc.so.6 (0x00007f4bb8000000)
        /lib64/ld-linux-x86-64.so.2 (0x00007f4bb850a000)

Let’s try to integrate with eyra! (Not eyre, which I use to simplify error handling.) First, we need to add the latest version of eyra to our project. Let’s also add logging so we can see what eyra is doing under the hood.

$ cargo add eyra -F log,env_logger
    Updating crates.io index
      Adding eyra v0.15.2 to dependencies.
             Features:
             + env_logger
             + log
             - experimental-relocate
             - max_level_off

Using cargo tree, we can see that this pulls in c-gull, c-scape and a bunch of other things.

$ cargo tree
eyra-example v0.1.0 (/home/jtnunley/Programming/eyra-example)
├── eyra v0.15.2
│   └── c-gull v0.15.3
│       ├── c-scape v0.15.3
│       │   ├── < like, a lot of packages >
< snip rest out the output >

Then, we add extern crate eyra to the top of the main.rs file so that Rust knows to link to eyra, even if we don’t directly use anything from it.

// in src/main.rs
extern crate eyra;

// 

Finally, we have to add a build.rs file, which is a little build script that runs before your Rust crate is compiled. We ue this to tell Rust to link using the -nostartfiles argument, which tells Rust not to bring in any of the C runtime. This is because eyra has its own initializing runtime, written in Rust.

// in build.rs
fn main() {
    println!("cargo:rustc-link-arg=-nostartfiles");
}

Now, we can run cargo build, which builds a significantly greater number of dependencies. Afterwards, we still have the eyra-example executable. Let’s see what’s inside.

$ ldd /home/jtnunley/Programming/CargoTarget/debug/eyra-example
        statically linked

Nice! It’s been statically linked, hopefully with 100% Rust code. Let’s run the executable with RUST_LOG=trace and see how it works.

$ RUST_LOG=trace cargo run
    Finished dev [unoptimized + debuginfo] target(s) in 0.03s
     Running `/home/jtnunley/Programming/CargoTarget/debug/eyra-example`
[TRACE origin::program] Program started
[TRACE origin::thread] Main Thread[Pid(89539)] initialized
[TRACE origin::program] Calling `.init_array`-registered function `0x563e1d8a5600(1, 0x7ffc4fff5d78, 0x7ffc4fff5d88)`
[TRACE origin::program] Calling `origin_main(1, 0x7ffc4fff5d78, 0x7ffc4fff5d88)`
[TRACE async_io::driver] block_on()
[TRACE origin::thread] Thread[Pid(89539)] launched thread Thread[89541] with stack_size=2097152 and guard_size=16384
[TRACE origin::thread] Thread[89541] marked as detached by Thread[Pid(89539)]
[TRACE polling::epoll] add: epoll_fd=4, fd=6, ev=Event { key: 18446744073709551615, readable: false, writable: false }
[TRACE polling::epoll] add: epoll_fd=4, fd=5, ev=Event { key: 18446744073709551615, readable: true, writable: false }
< snip: lots of logs from smol being initialized >

Let’s break it down:

[TRACE origin::program] Program started
[TRACE origin::thread] Main Thread[Pid(89539)] initialized
[TRACE origin::program] Calling `.init_array`-registered function `0x563e1d8a5600(1, 0x7ffc4fff5d78, 0x7ffc4fff5d88)`
[TRACE origin::program] Calling `origin_main(1, 0x7ffc4fff5d78, 0x7ffc4fff5d88)`

These logs come from the program starting up and setting everything up. It initializes the main threads, calls the program constructors (see the ctor crate if you want to know more about that), and launches the program’s entry point, origin_main.

[TRACE async_io::driver] block_on()
[TRACE origin::thread] Thread[Pid(89539)] launched thread Thread[89541] with stack_size=2097152 and guard_size=16384
[TRACE origin::thread] Thread[89541] marked as detached by Thread[Pid(89539)]

async-io, smol’s I/O driver, works by spawning a thread and then running epoll from that. This is used to deliver events throughout the program. Here we can see the driver being started, then a thread being launched to run epoll on.

As we can see, our program is now running on pure-Rust (and a little assembly) software. Does it work?

$ echo "tell me a joke" | nc 127.0.0.1 37279
What do you call a fly without wings? A walk.

Works like a charm!

Final Tally

Although it’s certainly a neat project that’s treading a lot of new ground, I probably wouldn’t recommend using eyra for a production grade project. It’s still wet behind the ears and it doesn’t add much practical value to projects. Still, it’s cool to be able to say that my project is 100% Rust.

The setup is somewhat convoluted. It would be nice if there was some subcommand that set up eyra for a project temporarily, like cargo-hack does.

Also, eyra still doesn’t support every libc function. It’s a slow uphill battle. They are open for contributions if you’re missing something important.

Until then, I’m very excited for what eyra will bring for Rust programs in the future.

notgull

Explaining the internals of async-task from the ground up

Background Basics

Scalability Solutions

Persistent Problems

Why choose async/await over threads?

Background Blitz

Timeout Trouble

Thematic Threads

Async Success Stories

Improving Async’s Image

Announcing smol-macros, smol-hyper and smol-axum

smol-macros

smol-hyper

smol-axum

Hello, World!

Hello, World!

tokio Turnstile

smol v2.0.0

Parting Shots

Creating a time machine for my blog

SCP-093 is a Timeless Masterpiece

Context

SCProcedures

Description

Original Documentation

Test Log 1

Test Log 2

Blue Test

Green Test

Violet Test

Yellow Test

Red Test

Recovered Materials

Speculation

Parting Shots

The rabbit hole of unsafe Rust bugs

Event Listener Escapade

Debugging Demonology

Damaged Dependencies

Goose Chase Gallery

miri Magic

Arc Apocalypse

Parting Shots

Evaluating new software forges

Software Host Hellscape

GitLab Gauss

SourceHut Scramble

Codeberg

Self-Hosting Gitea

Parting Shots

My new, remote-access Rust development setup

Time for an Upgrade

The Hardware Specs Section

Putting It All Together

Utilizing the System

Desktop Dismay

Conclusions

Recreating concurrent futures combinators in smol

The Problem with futures

Concurrency Conundrum

Make your own, better combinator

Parting Shots

Eyra is an interesting Rust project

What is libc?

Musl Melee

Rustix Revelation

Enabling Eyra

Final Tally

`smol-macros`

`smol-hyper`

`smol-axum`

`tokio` Turnstile

`smol` v2.0.0

`miri` Magic

The Problem with `futures`