๐ Blazing-fast JSON encoding and decoding for PHP, powered by the simdjson project.
This is a fork of crazyxman/simdjson_php with new optimisations and encoding support.
Operation | PHP Built-in | simdjson_php | Speedup |
---|---|---|---|
Decode to array | 1.48 ms | 0.46 ms | 3.2ร |
Decode to object | 1.56 ms | 0.54 ms | 2.9ร |
Encode | 0.67 ms | 0.26 ms | 2.5ร |
Encode (pretty print) | 0.83 ms | 0.31 ms | 2.6ร |
Validate | 1.37 ms | 0.22 ms | 6.2ร |
Count items | 1.51 ms | 0.16 ms | 9.4ร |
Tests were conducted using PHP 8.3 on an Apple M1 Max. For test specification see TwitterDecodeBench.php
and TwitterEncoderBench.php
.
Additionally, simdjson_php reduces memory usage compared to json_decode()
. For example, when decoding twitter.json, memory consumption drops from 3.01 MB to 2.47 MB due to efficient array key deduplication.
- PHP 8.0+ (PHP 8.2+ recommended for maximum performance)
- g++ (version 7 or better) or clang++ (version 6 or better)
- A 64-bit system with a command-line shell (e.g., Linux, macOS, FreeBSD)
To compile simdjson_php, run the following commands:
phpize
./configure
make
make test
make install
Once installed, add this line to your php.ini
file:
extension=simdjson.so
$jsonString = <<<'JSON'
{
"Image": {
"Width": 800,
"Height": 600,
"Title": "View from 15th Floor",
"Thumbnail": {
"Url": "http://www.example.com/image/481989943",
"Height": 125,
"Width": 100
},
"Animated" : false,
"IDs": [116, 943, 234, 38793, {"p": "30"}]
}
}
JSON;
// Check if a JSON string is valid:
$isValid = simdjson_validate($jsonString); //return bool
var_dump($isValid); // true
// Parsing a JSON string. Similar to the json_decode() function but without the fourth argument
try {
// returns array|stdClass|string|float|int|bool|null.
$parsedJSON = simdjson_decode($jsonString, true, 512);
var_dump($parsedJSON); // PHP array
} catch (RuntimeException $e) {
echo "Failed to parse $jsonString: {$e->getMessage()}\n";
}
// Encode to JSON string
var_dump(simdjson_encode($parsedJSON));
// note. "/" is a separator. Can be used as the "key" of the object and the "index" of the array
// E.g. "/Image/Thumbnail/Url" is recommended starting in simdjson 4.0.0,
// but "Image/Thumbnail/Url" is accepted for now.
// get the value of a "key" in a json string
// (before simdjson 4.0.0, the recommended leading "/" had to be omitted)
$value = simdjson_key_value($jsonString, "/Image/Thumbnail/Url");
var_dump($value); // string(38) "http://www.example.com/image/481989943"
$value = simdjson_key_value($jsonString, "/Image/IDs/4", true);
var_dump($value);
/*
array(1) {
["p"]=>
string(2) "30"
}
*/
// check if the key exists. return true|false|null. "true" exists, "false" does not exist,
// throws for invalid JSON.
$res = simdjson_key_exists($jsonString, "/Image/IDs/1");
var_dump($res) //bool(true)
// count the values
$res = simdjson_key_count($jsonString, "/Image/IDs");
var_dump($res) //int(5)
Most of available options of default json_encode()
method are not supported by simdjson_encode()
as they are usually useless.
simdjson_encode($value)
method has similar behaviour as json_encode($value, JSON_UNESCAPED_SLASHES | JSON_UNESCAPED_UNICODE | JSON_THROW_ON_ERROR)
Supported options are:
SIMDJSON_PRETTY_PRINT
- use whitespace in returned data to format itSIMDJSON_INVALID_UTF8_SUBSTITUTE
- convert invalid UTF-8 characters to\0xfffd
(Unicode Character 'REPLACEMENT CHARACTER' ๏ฟฝ)SIMDJSON_INVALID_UTF8_IGNORE
- ignore invalid UTF-8 charactersSIMDJSON_APPEND_NEWLINE
- append new line character (\n
) to end of encoded string. This is useful when encoding data to JSONL format as PHP strings are immutable.
Differences are:
- uses different algorithm to convert floating-point number to string, so string format can be slightly different
- even when
JSON_UNESCAPED_UNICODE
is enabled, PHPjson_encode()
escapes some Unicode chars that do not need to be escaped.simdjson_encode()
escape just Unicode chars that needs to be escaped by JSON spec. - simdjson will throw
SimdJsonEncoderException
exception in case of error
JSON format do not support binary data. Common way how to transfer binary data in JSON encoding is using base64 encoding.
If you need to include base64 encoded value into JSON, you can use SimdJsonBase64Encode
class that offers optimised converting to base64 value into JSON and use less memory.
As creating new object in PHP is relatively slow, this approach make sense for string longer than 1 kB.
$fileContent = file_get_contents("example.jpg");
$fileContentEncoded = new SimdJsonBase64Encode($fileContent);
simdjson_encode(['image' => $fileContentEncoded]); // returns {"image":"TWFueSBoYW5kcyBtYWtlIGxpZ2h0IHdvcmsu..."}
You can also use base64url encoding (RFC 4648 ยง5) by setting second argument to true: new SimdJsonBase64Encode($fileContent, true);
For large data sets, simdjson_php provides the simdjson_encode_to_stream()
function to save data directly to a file or output buffer.
$bigStructure = [...];
simdjson_encode_to_stream($bigStructure, fopen("file.json", "w")); // save to file.json
simdjson_encode_to_stream($bigStructure, fopen("php://output", "w")); // send to output buffer
There are some differences from json_decode()
due to the implementation of the underlying simdjson library. This will throw a SimdJsonDecoderException
if simdjson rejects the JSON.
Note that the simdjson PECL is using a fork of the simdjson C library to imitate php's handling of integers and floats in JSON.
-
The maximum string length that can be passed to
simdjson_decode()
is 4GiB (4294967295 bytes).json_decode()
can decode longer strings. -
The handling of max depth is counted slightly differently for empty vs non-empty objects/arrays. In
json_decode
, an array with a scalar has the same depth as an array with no elements. Insimdjson_decode
, an array with a scalar is one level deeper than an array with no elements. For typical use cases, this shouldn't matter. (e.g.simdjson_decode('[[]]', true, 2)
will succeed butjson_decode('[[]]', true, 2)
andsimdjson_decode('[[1]]', true, 2)
will fail.)
If you need to decode a big file from JSON format that you want to save to a file or send to a user, you can use the simdjson_decode_from_stream
method.
simdjson_decode_from_stream(fopen("file.json", "r")); // load from file.json
simdjson_decode_from_stream(fopen("php://input", "r")); // send by user
See the benchmark folder for more benchmarks.