Base64-encoding a large value

jgaskins · May 18, 2024, 5:01pm

Is there a good way to base64-encode a large file without loading the whole thing into memory? Thinking of something along the lines of def Base64.encode(source : IO, destination : IO). It looks like the current stdlib implementation only supports objects that respond to to_slice as the source, which seems to imply that all of the data be loaded into RAM.

Sandra · May 21, 2024, 2:24pm

You can base64-encode any stream as long a you can read three bytes at a time. Three input bytes become four base64 characters.

Sandra · May 21, 2024, 2:26pm

In other words, grab subvectors of length divisible by 3 and then run them through the stdlib base64 encoder.

jgaskins · May 21, 2024, 7:09pm

This is what I’m currently doing as a workaround. :-) I don’t want to keep it as a workaround, though. I’m looking for a first-class solution and I was hoping someone knew of something.

Since I implemented it by monkeypatching Base64.encode (and a couple methods downstream from it) in my app and it actually fits pretty well into the Base64 module, I’m considering pushing it up as a PR to Crystal. This isn’t the first time I’ve needed this and I assume others that also need it have simply been accepting that they have to load an entire file into memory to base64-encode it over the wire.

straight-shoota · May 21, 2024, 7:36pm

Sounds like a good idea to upstream this

zw963 · June 18, 2025, 2:17pm

Hi, is there a link for the code? thanks

jgaskins · June 19, 2025, 12:11am

I posted my implementation as a PR to the Crystal stdlib.

I closed it because someone had some strong opinions on it and opened their own PR. They haven’t touched it since 3 days after that, though, so I’ll either reopen mine or release it as a shard. The important thing is that the functionality is supported.

Topic		Replies	Views
Do ascii/binary strings exist? Help & Support	25	514	March 21, 2022
A blog article on performant vs idiomatic code (using Crystal examples) News blog	15	733	February 11, 2024
Proposal: Support for arbitrary sized integers Crystal Contrib	5	572	February 25, 2021
Is there a way to Digest large files? Help & Support	9	620	September 2, 2020
Working with large bin files, proper way to do it? Help & Support	1	368	March 17, 2019

Base64-encoding a large value

Related topics