Performance issues with the JSON parser

Has anyone ever run into performance issues with the JSON parser?
Thought I’d ask here whether this is a known problem before investigating further (and I don’t know if time permits).

I need to parse a rather large payload of JSON data.
At first I tried JSON.parse(io). Then I switched to MyModel.from_json(io) using structs. I also tried with records and classes.
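
Roughly, the two approaches looked like this (a minimal sketch; MyModel here is a stand-in with made-up fields, not my real type):

require "json"

# Stand-in model for illustration; the real one has different fields.
struct MyModel
  include JSON::Serializable

  property name : String
  property values : Array(Int32)
end

# Untyped: builds a JSON::Any tree
any = File.open("payload.json") { |io| JSON.parse(io) }

# Typed: deserializes straight into the struct
model = File.open("payload.json") { |io| MyModel.from_json(io) }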

In the default build, it was always in the range of 2.5 to 3 seconds, give or take.
It was admittedly faster in a “--release” build, but still felt slow.

Going by what Time Profiler in Xcode Instruments (a 25 GB install, what the heck) showed me, the bottleneck seemed to be somewhere in JSON reading from an IO, or in IO reading chars. (I don’t really know how to interpret the profile.)

Turns out, the parsing is indeed faster when parsing from a string.
Here’s what I found (built with “--release”):

  • from a File: 0.54 s
  • from an IO::Memory: 0.45 s
  • from a String: 0.20 s

By comparison, parsing the same thing in Ruby using the built-in JSON parser takes some 0.11 s (totally unscientific benchmark, for now). And that’s not even using one of the faster parsers, such as Oj.

I’d be curious to know:

  1. How big is the file?
  2. What is the timing when using JSON.parse vs. MyType.from_json?

How large is your JSON payload? 200 ms to parse JSON from a string and 450 ms to parse from an IO sounds like a whole lot of JSON. On my laptop, reading an 11 MB JSON payload from a string takes ~45 ms.

Is it feasible to use a different serialization format for what you’re trying to do or are you stuck with JSON? Using MessagePack, I’m seeing parse times as low as 2ms for the equivalent payload.
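
For reference, a round-trip with the msgpack-crystal shard looks roughly like this (a sketch; it assumes the msgpack shard is in your shard.yml, and the Point type is made up for illustration):

require "msgpack"

struct Point
  include MessagePack::Serializable

  property x : Int32
  property y : Int32

  def initialize(@x, @y)
  end
end

bytes = Point.new(1, 2).to_msgpack # Bytes
point = Point.from_msgpack(bytes)  # back to a Point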

The JSON data is some 34 MB.

Just checked: There’s basically no difference between JSON.parse(json_str) and MyType.from_json(json_str). They both take around 0.21 s.

Crystal 1.11.2, LLVM 17.0.6, on aarch64-apple-darwin, FWIW.

And I’m stuck with JSON.

> There’s basically no difference between JSON.parse(json_str) and MyType.from_json(json_str). They both take around 0.21 s.

With the laptop on battery, that is (for all numbers so far).
With the power connected, both take around 0.16 or 0.17 s.
And then in Ruby it’s 0.09 s.

How did you parse the JSON from a file?

Like this?

MyJson.from_json File.read(path)

Or like this?

File.open(path, "r") { |io| MyJson.from_json(io) }

  • From a File, it was basically like this:

    file = File.open(path, "rb")
    
    t0 = Time.utc
    result = MyType.from_json(file)
    t1 = Time.utc
    puts "Parsing JSON took #{ t1 - t0 }."
    
  • From a String:

    file = File.open(path, "rb")
    bytes = Bytes.new(file.size)
    file.read_fully?(bytes)
    str = String.new(bytes)
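    # (equivalent one-liner: str = File.read(path))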
    
    t0 = Time.utc
    result = MyType.from_json(str)
    t1 = Time.utc
    puts "Parsing JSON took #{ t1 - t0 }."
    
1 Like

Tip: better to use Time.monotonic or Time.measure(&) for an accurate duration measurement.
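
For example, reusing file and MyType from the snippets above (a minimal sketch):

elapsed = Time.measure do
  result = MyType.from_json(file)
end
puts "Parsing JSON took #{elapsed.total_seconds}s"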

2 Likes

I did some testing on my machine, and it certainly seems like the parsing is slower than I remember when testing GeoJSON parsing a while back. I could be misremembering, though, and I can’t find the benchmarks I made (on pretty large files) at the moment.

Here’s my code:

require "json"
require "random"
require "file"
require "time"

struct Inner
  include JSON::Serializable

  property inner_name : String
  property numbers : Array(Int32)

  def initialize(
    @inner_name : String,
    @numbers : Array(Int32) = Array(Int32).new
  )
  end
end

struct Middle
  include JSON::Serializable

  property middle_name : String
  property inner_values : Array(Inner)

  def initialize(
    @middle_name : String,
    @inner_values : Array(Inner) = Array(Inner).new
  )
  end
end

struct Outer
  include JSON::Serializable

  property outer_name : String
  property middle_values : Array(Middle)

  def initialize(
    @outer_name : String,
    @middle_values : Array(Middle) = Array(Middle).new
  )
  end
end

def create_structure(scale_factor : Int32, rng : Random, numbers_range : Range(Int32, Int32)) : Array(Outer)
  Array(Outer).new(scale_factor) {
    Outer.new(
      rng.base64,
      Array(Middle).new(scale_factor) {
        Middle.new(
          rng.base64,
          Array(Inner).new(scale_factor) {
            Inner.new(
              rng.base64,
              Array(Int32).new(scale_factor) { rng.rand(numbers_range) }
            )
          }
        )
      }
    )
  }
end

def count(structure : Array(Outer))
  structure.sum { |outer|
    outer.middle_values.sum { |middle|
      middle.inner_values.sum { |inner|
        0_u64 + inner.numbers.size
      }
    }
  }
end

def puts_seconds_elapsed(label, start_time, end_time)
  puts "#{label}: #{(end_time - start_time).total_seconds}s"
end

scale_factor = 100
if ARGV.size > 0 && (first_arg_int = ARGV.first.to_i?)
  scale_factor = first_arg_int
end

filename = "big_json.json"
rng = Random.new(seed: scale_factor)
numbers_range = (1000..9999)

do_write = true

if do_write
  structure = create_structure scale_factor, rng, numbers_range

  begin
    start_time = Time.monotonic
    structure_json = structure.to_json
    end_time = Time.monotonic

    puts_seconds_elapsed "serialization to string", start_time, end_time

    file = File.open filename, "w"
    start_time = Time.monotonic
    file << structure_json
    end_time = Time.monotonic
    file.close

    puts_seconds_elapsed "file write from string", start_time, end_time
  end

  file = File.open filename, "w"
  start_time = Time.monotonic
  structure.to_json file
  end_time = Time.monotonic
  file.close

  puts_seconds_elapsed "file write with serialization", start_time, end_time
end

begin
  start_time = Time.monotonic
  file_contents = File.read filename
  end_time = Time.monotonic

  puts_seconds_elapsed "file read to string", start_time, end_time

  start_time = Time.monotonic
  structure_from_file = Array(Outer).from_json file_contents
  end_time = Time.monotonic

  puts_seconds_elapsed "parsing from string", start_time, end_time

  # just to make sure the compiler doesn't elide anything
  File.write File::NULL, count(structure_from_file)
end

file = File.open(filename, "r")
start_time = Time.monotonic
structure_from_file = Array(Outer).from_json file
end_time = Time.monotonic
file.close

puts_seconds_elapsed "parsing from file", start_time, end_time

# just to make sure the compiler doesn't elide anything
File.write File::NULL, count(structure_from_file)

Notes on the Code
  • I tried out different File buffering settings, but it didn’t seem to make any difference, even in the “write with serialization” and “parsing from file” cases.
  • The begin...end blocks are an attempt to create variable scopes to help manage memory usage, but I don’t know if that actually works.
  • I tried to make the serializable structures as simple as possible (to make it easier to review) while still exhibiting nesting, since real-world JSON tends to be heavily nested.
  • I made basically no attempt to optimize create_structure or count because they’re not what I was trying to benchmark.

Example produced JSON, with scale factor 2, after formatting with jq:
[
  {
    "outer_name": "qetdD4TLe9Ijt+J9Z+dlYg==",
    "middle_values": [
      {
        "middle_name": "T81VaTBy7EJ+2r4G2fATSA==",
        "inner_values": [
          {
            "inner_name": "epJ0N7wZSPdM/UZJzuTAvA==",
            "numbers": [
              9016,
              9814
            ]
          },
          {
            "inner_name": "gC1J8zsb6sXhl9i6A67Apw==",
            "numbers": [
              2739,
              3830
            ]
          }
        ]
      },
      {
        "middle_name": "0I9taSfFVEFJNkUbOPnJxA==",
        "inner_values": [
          {
            "inner_name": "bBlzJ6IPbI53SC+4LLIjAg==",
            "numbers": [
              1986,
              5623
            ]
          },
          {
            "inner_name": "x9Z5bWIal4qRClJfeMw2fg==",
            "numbers": [
              8853,
              7967
            ]
          }
        ]
      }
    ]
  },
  {
    "outer_name": "siItIL1Wb72iq3N/bqYoYQ==",
    "middle_values": [
      {
        "middle_name": "K/j4cgIgOXpV1juImq15uQ==",
        "inner_values": [
          {
            "inner_name": "os64AVLIAuYGuhKhBaxZDw==",
            "numbers": [
              9189,
              1888
            ]
          },
          {
            "inner_name": "CguKhvwLFKCG8WkAtlTUWA==",
            "numbers": [
              9455,
              9214
            ]
          }
        ]
      },
      {
        "middle_name": "uIYmymvfO2Y2k8wQXjCB6Q==",
        "inner_values": [
          {
            "inner_name": "RUZapun49A2gzOHArkubNA==",
            "numbers": [
              6706,
              3441
            ]
          },
          {
            "inner_name": "Fc+DBmjHNtxcevNweLKyQQ==",
            "numbers": [
              5703,
              9299
            ]
          }
        ]
      }
    ]
  }
]

And here’s the output I’m getting on my machine:

Scale Factor 10 (112 kB file)

serialization to string: 0.003119035s
file write from string: 0.0001233s
file write with serialization: 0.001344965s
file read to string: 0.000161006s
parsing from string: 0.002256683s
parsing from file: 0.005329564s

Scale Factor 50 (37 MB file)

serialization to string: 0.407213263s
file write from string: 0.036222296s
file write with serialization: 0.428914678s
file read to string: 0.029467402s
parsing from string: 0.910451311s
parsing from file: 1.886663833s

Scale Factor 100 (529 MB file)

serialization to string: 6.845525082s
file write from string: 0.533993881s
file write with serialization: 7.428256482s
file read to string: 0.199061933s
parsing from string: 15.233927115s
parsing from file: 29.497250194s
1 Like

FWIW: I get a 6 % improvement by simplifying how codepoints are counted in a String:

require "string"

class String
  def size : Int32
    if @length > 0 || @bytesize == 0
      return @length
    end
    # original:
    #@length = each_byte_index_and_char_index { }
    # new:
    @length = utf8_len
  end
  
  protected def utf8_len : Int32
    # 0b10...... is a continuation byte.
    # Counting continuation bytes is faster than counting leading bytes
    # if the string has more leading bytes than continuation bytes, i.e.
    # mostly ASCII.
    count_continuation_bytes = to_slice.count { |byte|
      byte & 0b11000000 == 0b10000000
    }
    bytesize - count_continuation_bytes
  end
end

I’m also wondering if .from_json could be improved by operating on bytes instead of chars.

Or if it’s possible not to care about UTF-8/UTF-16/… initially, i.e. first just locate the hierarchy of objects (basically ‘{’/‘}’, unless inside a string), then parallelize the parsing depth-first. Not exactly low-hanging fruit, though.
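
A rough sketch of that first structure-locating pass (my own illustration, not stdlib code; it assumes well-formed JSON and only records top-level object/array boundaries):

def top_level_spans(bytes : Bytes) : Array(Range(Int32, Int32))
  spans = [] of Range(Int32, Int32)
  depth = 0
  in_string = false
  escaped = false
  start = 0
  bytes.each_with_index do |b, i|
    if in_string
      # Skip over string contents, minding escapes and the closing quote.
      if escaped
        escaped = false
      elsif b == '\\'.ord
        escaped = true
      elsif b == '"'.ord
        in_string = false
      end
      next
    end
    case b
    when '"'.ord
      in_string = true
    when '{'.ord, '['.ord
      start = i if depth == 0
      depth += 1
    when '}'.ord, ']'.ord
      depth -= 1
      spans << (start..i) if depth == 0
    end
  end
  spans
end

Each top-level span could then, in principle, be handed to its own fiber for the actual parsing.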

Counting continuation bytes like this only works on the premise that the string is valid UTF-8. Crystal’s String type expects to be UTF-8 encoded, but it does not enforce it. String data read from an external source may contain bytes that are not valid UTF-8 encodings, and those need to be handled properly.

2 Likes

Yeah, that could be an easy optimization. The JSON structure itself is entirely ASCII, so there’s no need to handle multi-byte encodings; they can just pass through transparently as payload. Some other parsing algorithms in the stdlib already work on this principle.

1 Like

Parallelizing sounds interesting. I fear it would be quite complex, though :thinking:

I’m glad to hear I’m not alone. :grinning:

I see.
For JSON I would probably happily assume that the data is valid UTF-8.
If it isn’t and something explodes: :person_shrugging::grinning:
Or maybe only then fall back to the more robust implementation. OK, probably not a good idea.
And while 6 % is nice, it’s far from 50 %.

IME the lowest-hanging fruit for performance optimization in Crystal is usually reducing heap allocations. I have no doubt that the JSON parser could make fewer allocations.

In a few cursory checks (I don’t have the energy to do much more than that atm), it currently allocates about 3-4x the JSON payload size — even more if your JSON::Serializable types’ properties have union types or use_json_discriminator. But I just had a quick scan through the lexer and parser code and nothing’s jumping out at me.
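
(If you want to eyeball that yourself, here’s a crude sketch using the GC’s cumulative allocation counter; it overcounts slightly if a collection runs mid-parse:)

before = GC.stats.total_bytes
result = MyType.from_json(json_str)
after = GC.stats.total_bytes
puts "allocated ~#{(after - before) / (1024.0 * 1024.0)} MiB"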

I found a way to improve the performance and the memory used: Optimize JSON parsing a bit by asterite · Pull Request #14366 · crystal-lang/crystal · GitHub

10 Likes

Nice! I ran your benchmark against an empty JSON document {} just to make sure that was OK, and it looks like parsing from an IO gets faster there too!

# Before
JSON.parse (string)   9.43M (106.02ns) (± 0.40%)  640B/op        fastest
    JSON.parse (IO) 588.70k (  1.70µs) (± 0.25%)  608B/op  16.02× slower

# After
JSON.parse (string)   9.38M (106.64ns) (± 2.78%)  625B/op        fastest
    JSON.parse (IO) 764.24k (  1.31µs) (± 0.61%)  625B/op  12.27× slower
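
(For the curious, a minimal version of that kind of Benchmark.ips comparison; the actual benchmark from the PR may differ:)

require "benchmark"
require "json"

json = "{}"

Benchmark.ips do |x|
  x.report("JSON.parse (string)") { JSON.parse(json) }
  x.report("JSON.parse (IO)") { JSON.parse(IO::Memory.new(json)) }
end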

Nice work :clap:

2 Likes

Wait, what? Shouldn’t Crystal at least be on par with Ruby, unless Ruby’s making some unacceptable shortcuts?

I didn’t look at what the JSON lexer/parser does in Ruby, but here are my benchmarks: