Json WebService Benchmark

fat · February 6, 2020, 3:35am

Hi all,

i and my friends are in a team of application developers, mainly developing json web services for web and mobile app.
previously, each of us has various language experiences : scala, perl, python
and we want to discuss the language for the next projects.

so i tried to make some benchmarks, as performance is one of the factor for the decision
several frameworks are used in this : http4s (scala), spring(java), actix (rust), spider-gazelle (crystal).
i use ab with this command : $ ab -n 1000 -c 4 http://127.0.0.1:555XX/a-url-path

in this benchmark, the thing that is measured is how fast a web service can deliver a json response, as a result of querying list of records from postgresql database
that is a very common scenario for web based or mobile app : querying for records in database, and deliver those records in json format as http response :

[{"id":1,"code":"FFFFFF","name":"White"},{"id":2,"code":"C0C0C0","name":"Silver"},{"id":3,"code":"000000","name":"Black"},{"id":4,"code":"0000A0","name":"Dark Blue"},{"id":5,"code":"FF0000","name":"Red"}]

here are the results so far (request per seconds, and memory usage in KB/MB) :

the full source code is here :

i ran the benchmarks in a vps (1 cpu core) having this spec : model name : Intel® Xeon® Gold 6140 CPU @ 2.30GHz

i would like to have some suggestions :

are there anything missing in actix (rust) based web service that i use (because the performance is lower than crystal)

a note : for crystal based web services, i dont use any ORM, because CRUD operation without ORM is very common

thanks everyone

vlazar · February 6, 2020, 5:43am

You might wanna look at these benchmark implementations https://www.techempower.com/benchmarks/

There are multiple endpoints showing various aspects of performance like just plain text response or JSON response or random data from DB, etc.

Rust (Actix) is close to best performers which is no surprise. Keep in mind the benchmark implementations can be changed (optimized) by anyone. For example I know Crystal implementations are now tweaked not for the max RPS but kind of a balance between best RPS and good latency.

The code is at https://github.com/TechEmpower/FrameworkBenchmarks

jgaskins · February 6, 2020, 5:44pm

I would take the TechEmpower benchmarks with a grain of salt. Even their most realistic benchmarks are pretty far from real-world code.

As with all benchmarks, running your own code is the best indicator of how something will perform for your use case rather than trusting contrived benchmarks. If Crystal is the highest performer by that large a margin and performance is one of the top criteria, then it sounds like Crystal is the best choice for this service.

asterite · February 6, 2020, 6:37pm

You could try profiling each app to see where the bottlenecks are.

asterite · February 6, 2020, 6:47pm

Wow, spider-gazelle is really fast, and the docs look very complete too. Plus the last commit was two days ago!

fat · February 7, 2020, 6:51am

thanks for your valuable inputs & suggestions
techempower framework benchmark (TFB) has very complete benchmark data
there are things that i found so far, regarding my simple benchmark :

actix based web service implementation

it’s very possibly that i havent found the correct way to get the maximum performance in using actix
so, i think i’ll consult with rust users for the correct best practice in using actix
and/or adding a go-lang framework for comparison, as go-lang seems easier to learn than rust

multi core cpu

TFB uses Intel Xeon Gold 5120 CPU : 14 Cores, 28 threads
what i’ve tested so far is using vps 1 core, so multi-core cpu should also be used to get additional data

thanks

stakach · February 10, 2020, 10:43am

Techempower is running an old version of spider-gazelle - I need to update it
This benchmark is more up to date

vlazar · February 10, 2020, 2:29pm

Please do! Router performance is interesting, but Techempower’s benchmarks are more real-world and useful.

stakach · February 14, 2020, 3:41am

Sort of, I mean I could switch to Redis and win all the benchmarks.
Kemel and Raze are not using ORMs whereas Amber and SG are.
So it’s kind of an apples to oranges comparison and properly misleading to the casual observer.

vlazar · February 14, 2020, 5:53am

There are different types of Techempower’s benchmarks. Some of them are meant to test specifically the typical use of different web frameworks (Classification: Fullstack). E.g. you would use say ECR templates in framework x but plain methods in framework y. There will be many other differences and such benchmarks would measure the default (recommended) way of coding in particular framework.

For example why the Rails performs much worse? Part of it is Ruby of course, but it also the fact that Rails has a long middleware stack doing lots of things. If you recreate such stack doing all the same things in Crystal the Crystal benchmarks can be slowed down. Or you can remove some default handlers from Rails middleware list and improve Ruby results. But that’s probably not what you’ll do in real life. Most likely you’ll chose some framework and follow it’s way.

ComputerMage · February 14, 2020, 12:04pm

If you want to beat the benchmarks then Redis is not an option as it is single threaded.
Look at Aerospike (aerospike.com) as it is truly multithreaded and have much better features than Redis.

I had big issues with Redis on one of my big projects (I recommended Aerospike but CTO was stuck with Redis, even though I did explain where and how Redis will break)

fat · February 15, 2020, 2:31pm

thanks for your valuable inputs,

here i just want to share some additional info, that might be useful

regarding actix, i’ve asked for suggestions in rust forum, here :

AFAIK,
it’s very possibly for a newbie in rust, to follow a simple example in tutorials in internet,
to write an implementation of web service using actix, like in this article (that can be easily found from the result of googling over these words : “rest api with actix web postgresql”) : https://turreta.com/2019/09/21/rest-api-with-rust-actix-web-and-postgresql-part-1/

but if we read more on actix, which is basically async (which is different from other rust web frameworks which is sync : nickel, iron, rocket, etc.), then we can find that the example in that article is sync or blocking.
in another word, it’s the wrong way of using actix, as the performance will be very low.

one of the best practice in using actix with database access is written in the official doc/example :

github.com

actix/examples/blob/master/r2d2/src/main.rs

//! Actix web r2d2 example
use std::io;

use actix_web::{middleware, web, App, Error, HttpResponse, HttpServer};
use r2d2::Pool;
use r2d2_sqlite::SqliteConnectionManager;

/// Async request handler. Ddb pool is stored in application state.
async fn index(
    path: web::Path<String>,
    db: web::Data<Pool<SqliteConnectionManager>>,
) -> Result<HttpResponse, Error> {
    // execute sync code in threadpool
    let res = web::block(move || {
        let conn = db.get().unwrap();

        let uuid = format!("{}", uuid::Uuid::new_v4());
        conn.execute(
            "INSERT INTO users (id, name) VALUES ($1, $2)",
            &[&uuid, &path.into_inner()],

This file has been truncated. show original

blocking parts of code must be put inside web::block(move || { ... })
one of an async implementation that i’ve tried, is more than 10 times faster than the sync version.

regarding the multi core cpu, i’ve made the same benchmark in 4 core cpu (Xeon CPU E3-1225 v5 @ 3.30GHz).
when using 4 concurrent client connections, crystal based frameworks are still relatively fast.
but when the concurrency gets increased, the results are different.
and in 64 concurrent connections, actix diesel is faster :

results : https://github.com/sharing-lab/ws-benchmark/blob/b4f9419b52871b96d049770ae42ef1ce61c15fee/results/xeon_4_core-max100-localhost.xlsx

the OS and crystal version which are used :

$ rpm -qa centos*
centos-release-6-10.el6.centos.12.3.x86_64
$ crystal -v
Crystal 0.30.1 [5e6a1b672] (2019-08-12)

LLVM: 4.0.0
Default target: x86_64-unknown-linux-gnu

jgaskins · February 19, 2020, 6:23am

Out of curiosity, what is the CPU usage of the various processes in this benchmark? Is it possible that Crystal is serving more requests per second per CPU core consumed and that building with -Dpreview_mt would improve its throughput?

fat · February 19, 2020, 10:40am

ok, thanks,
i’ve tried “-Dpreview_mt” options, and made some tests on another machine/box (8 core i7)

the results (ab test numbers) are copy-pasted in here : https://github.com/sharing-lab/ws-benchmark/blob/ca8c792acdb36569425a9024f427685bb27441f9/results/i7_8_core-max100-localhost.txt

the source code is here : https://github.com/sharing-lab/ws-benchmark/tree/ca8c792acdb36569425a9024f427685bb27441f9

implementations which are tested in the benchmark :

kemal : compiled without -Dpreview_mt
kemal-mt-x : with -Dpreview_mt, and using env.variable CRYSTAL_WORKERS=x

the chart :

detailed server environments :

$ cat /proc/cpuinfo |tail -n 30|head -n 10
cache_alignment	: 64
address sizes	: 39 bits physical, 48 bits virtual
power management:

processor	: 7
vendor_id	: GenuineIntel
cpu family	: 6
model		: 60
model name	: Intel(R) Core(TM) i7-4790 CPU @ 3.60GHz
stepping	: 3

$ rpm -qa centos*|grep release-7
centos-release-7-7.1908.0.el7.centos.x86_64

$ crystal -v
Crystal 0.33.0 [612825a53] (2020-02-14)

LLVM: 8.0.0
Default target: x86_64-unknown-linux-gnu

$ shards install
Fetching https://github.com/kemalcr/kemal.git
Fetching https://github.com/luislavena/radix.git
Fetching https://github.com/jeromegn/kilt.git
Fetching https://github.com/crystal-loot/exception_page.git
Fetching https://github.com/will/crystal-pg.git
Fetching https://github.com/crystal-lang/crystal-db.git
Installing kemal (0.26.1)
Installing radix (0.3.9)
Installing kilt (0.4.0)
Installing exception_page (0.1.2)
Installing pg (0.20.0)
Installing db (0.8.0)

$ crystal build --release -Dpreview_mt  -o bin/ws-rel-pmt src/ws.cr

jgaskins · February 20, 2020, 10:24pm

I’ve seen Crystal outperform Go on a lot of things, but seeing it outperform Rust is fantastic

fat · February 22, 2020, 5:03am

yes, about go, i’ve tried some go web framework yesterday (gorilla, fasthttp-router,…), to get some more comparisons.

for rust, to develop web service in rust, it’s very common to use ORM (mostly used: diesel).
so in my previous tests, i used diesel as the db connection layer.
today, i try not to use ORM, i use tokio-postgres (an async library to access postgress)

source code and results (in results directory) is here : https://github.com/sharing-lab/ws-benchmark/tree/26168797257b0ff773dbe21fb77afcde65ab1b9c

in short, here is the result using ab -n 1000 -c 4 http://127.0.0.1/555XX/color :

asterite · February 22, 2020, 11:52am

Nice! Crystal is doing pretty well.

Could you try doing the same. but with wrk instead of ab? Someone told me in the past that wrk is much more reliable and gives more consistent results.

bcardiff · February 22, 2020, 3:38pm

@fat some months ago we created and used Benchy: A benchmark tool . You might find it useful to play around different benchmarks. Let me know if you play with it and if there is any feedback.

fat · February 22, 2020, 11:47pm

thanks for your suggestions,
yes, i agree that sometime we need some scripts to help us automate the benchmark tasks.
i’ll take a look at the docs first.

rogerdpack · February 23, 2020, 12:39am

Wonder if somebody should make a go “fasthttp” equivalent for crystal…though I admit I don’t know much about it and “acing microbenchmarks” is often unuseful in real life LOL.

Topic		Replies	Views
Crystal, Rust, .NET, Swift, and JS relative performance News	0	747	May 28, 2024
WoW! I finally found a way to show the differentiation in speed between JSON::Any and static types	20	1459	April 15, 2019
Crystal vs Go web service comparison News	6	1665	March 6, 2020
WSL vs Native Linux Benchmarks	9	2097	January 31, 2019
Performance issues with the JSON parser Help & Support	24	642	March 21, 2024

Json WebService Benchmark

Related topics