PHP Judy - Extension for creating and accessing dynamic arrays
php-judy is an extension by Nicolas Brousse for the Judy C library. It is compatible with PHP 8.0 and newer.
- PECL Package: http://pecl.php.net/package/Judy
- Packagist Package: https://packagist.org/packages/orieg/judy
- GitHub Repository: http://github.com/orieg/php-judy
A Judy array is a complex but very fast associative array data structure for storing and looking up values using integer or string keys. Unlike normal arrays, Judy arrays may be sparse; that is, they may have large ranges of unassigned indices.
The PHP extension is based on the Judy C library that implements a dynamic array. A Judy array consumes memory only when populated yet can grow to take advantage of all available memory. Judy's key benefits are: scalability, performance, memory efficiency, and ease of use. Judy arrays are designed to grow without tuning into the peta-element range, scaling near O(log-base-256) -- 1 more RAM access at 256 X population.
For a detailed performance comparison with native PHP arrays, please see the BENCHMARK.md file.
README.md This file
API.md Complete API reference
BENCHMARK.md Performance benchmarks and analysis
MIGRATION_2.2.0.md Migration guide for version 2.2.0
LICENSE The PHP License used by this project
tests/ Unit tests (176 tests)
examples/ Benchmark and example scripts
libjudy/ Bundled libJudy
*.c, *.h C source and header files
Judy.stub.php PHP stub for IDE autocompletion
PHP PIE (PHP Extension Installer) is the easiest way to install PHP Judy on supported platforms:
# Install PHP PIE if you don't have it
curl -sSL https://pie.dev/installer | php
# Install PHP Judy using PIE
pie install judyNote: PHP PIE automatically handles dependencies and builds the extension for your specific PHP version and platform.
You can also install PHP Judy using PECL:
# Install the extension with pecl
pecl install judyNote: You may need to install the Judy C library first on some systems.
From the PHP Judy sources:
phpize
./configure --with-judy[=DIR]
make
make test
make installIf you are using Ubuntu or Debian, you can install libJudy with apt:
apt-get install libjudydebian1 libjudy-dev
phpize
./configure --with-judy=/usr
make
make test
make installOn Windows, you will need to build LibJudy yourself.
Download the sources at http://sourceforge.net/projects/judy/
Extract the sources, and open the Visual Studio command prompt and navigate to the source directory. Then execute:
build
This creates "Judy.lib", copy this into the php-sdk library folder and name it libJudy.lib
Then copy the include file "judy.h" into the php-sdk includes folder. Now it's time to build pecl/judy, extract the pecl/judy into your build folder where the build scripts will be able to pick it up, e.g.:
C:\php\pecl\judy\
If your source of PHP is located in:
C:\php\src\
The rest of the steps is pretty straightforward, like any other external extension:
buildconf
configure --with-judy=shared
nmakeThe recommended way to install php-judy on Mac OS X is by using pie or pecl. You will need to have the Judy C library installed first, which can be done easily with Homebrew.
# Install PHP PIE if you don't have it
curl -sSL https://pie.dev/installer | php
# Install PHP Judy using PIE
pie install judy# First, install the Judy C library
brew install judy
# Then, install the extension with pecl
pecl install judyIf you prefer to compile from source, you will need to install the libJudy first. Download the sources at http://sourceforge.net/projects/judy/
Extract the sources, then cd into the source directory and execute:
./configure
make
make installJudy arrays can be used like usual PHP arrays. The difference will be in the type of key/values that you can use. Judy arrays are optimized for memory usage but it forces some limitations in the PHP API.
There are 10 types of PHP Judy Arrays, organized into three families:
A Judy array with only 1 bit per index. It can be used to store boolean values.
$judy = new Judy(Judy::BITSET);
$judy[100] = true;
$judy[200] = true;
$judy[300] = false;
if ($judy[100]) {
echo "Index 100 is set\n";
}A Judy array with integer keys and integer values.
$judy = new Judy(Judy::INT_TO_INT);
$judy[1] = 100;
$judy[2] = 200;
$judy[3] = 300;
echo $judy[2]; // Outputs: 200A Judy array with integer keys and mixed values (strings, integers, etc.).
$judy = new Judy(Judy::INT_TO_MIXED);
$judy[1] = "Hello";
$judy[2] = 42;
$judy[3] = [1, 2, 3];
echo $judy[1]; // Outputs: HelloA Judy array with integer keys and serialized ("packed") values. Values are stored as opaque byte buffers outside PHP's garbage collector using php_var_serialize/php_var_unserialize. This trades serialize/deserialize CPU cost for reduced GC pressure, making it suitable for large datasets where GC pauses are a concern.
Supports any serializable PHP value (strings, integers, floats, arrays, objects). Closures and generators cannot be stored.
$judy = new Judy(Judy::INT_TO_PACKED);
$judy[0] = "Hello";
$judy[1] = 42;
$judy[2] = [1, 2, 3];
$judy[3] = new DateTimeImmutable();
echo $judy[0]; // Outputs: Hello
// Values are fully reconstructed on read
$arr = $judy[2]; // Returns [1, 2, 3]When to use INT_TO_PACKED vs INT_TO_MIXED:
- Use
INT_TO_MIXEDfor small-to-medium arrays or when read/write speed is critical - Use
INT_TO_PACKEDfor large arrays (100K+ elements) where GC pause reduction matters more than individual read/write latency
Trie-based types use JudySL internally. Keys are stored in sorted lexicographic order, making iteration ordered and range queries efficient. Lookup is O(key-length).
A Judy array with string keys and integer values.
$judy = new Judy(Judy::STRING_TO_INT);
$judy["apple"] = 1;
$judy["banana"] = 2;
$judy["cherry"] = 3;
echo $judy["banana"]; // Outputs: 2A Judy array with string keys and mixed values.
$judy = new Judy(Judy::STRING_TO_MIXED);
$judy["name"] = "John Doe";
$judy["age"] = 30;
$judy["scores"] = [85, 92, 78];
echo $judy["name"]; // Outputs: John DoeHash-based types use JudyHS for O(1) average-case lookups, with a parallel JudySL key index that maintains sorted iteration order. Best for workloads dominated by random key access where you still need ordered iteration.
A hash-backed Judy array with string keys and integer values.
$judy = new Judy(Judy::STRING_TO_INT_HASH);
$judy["session_abc"] = 1;
$judy["session_xyz"] = 2;
echo $judy["session_abc"]; // Outputs: 1
// Iteration is still sorted (via the key index)
foreach ($judy as $key => $value) {
echo "$key => $value\n";
}A hash-backed Judy array with string keys and mixed values.
$judy = new Judy(Judy::STRING_TO_MIXED_HASH);
$judy["config_a"] = ["enabled" => true];
$judy["config_b"] = 42;Adaptive types use Short-String Optimization (SSO): keys of 7 bytes or fewer are packed into a 64-bit integer and stored in a JudyL array, avoiding hashing overhead entirely. Longer keys fall back to JudyHS. A JudySL key index maintains sorted iteration. Best for mixed-length key workloads with many short keys.
An adaptive Judy array with string keys and integer values.
$judy = new Judy(Judy::STRING_TO_INT_ADAPTIVE);
$judy["us"] = 1; // SSO: packed into JudyL (2 bytes)
$judy["uk"] = 2; // SSO: packed into JudyL (2 bytes)
$judy["a_very_long_country_name"] = 3; // Falls back to JudyHS
echo $judy["us"]; // Outputs: 1An adaptive Judy array with string keys and mixed values.
$judy = new Judy(Judy::STRING_TO_MIXED_ADAPTIVE);
$judy["id"] = 12345;
$judy["name"] = "Alice";
$judy["metadata"] = ["role" => "admin"];Judy arrays implement the PHP Iterator interface, allowing you to use them in foreach loops:
$judy = new Judy(Judy::INT_TO_MIXED);
$judy[1] = "First";
$judy[5] = "Fifth";
$judy[10] = "Tenth";
// Iterate through all elements
foreach ($judy as $key => $value) {
echo "Key: $key, Value: $value\n";
}
// Manual iteration
$judy->rewind();
while ($judy->valid()) {
$key = $judy->key();
$value = $judy->current();
echo "Key: $key, Value: $value\n";
$judy->next();
}- Memory Efficiency: Judy arrays use 2-4x less memory than PHP arrays
- Sequential Access: Excellent performance for ordered iteration
- Range Queries: Native support via
slice(),deleteRange(), andpopulationCount() - Random Access: Trie types are slower than PHP arrays (O(log n) vs O(1)); Hash types offer O(1) average-case lookups for string keys
- String Lookups: Use
STRING_TO_*_HASHorSTRING_TO_*_ADAPTIVEtypes for faster string key access when sorted traversal is not the primary use case
Judy arrays provide batch methods for efficient bulk operations:
// Convert a PHP array to a Judy array
$judy = Judy::fromArray(Judy::INT_TO_INT, [0 => 100, 5 => 200, 10 => 300]);
// Convert a Judy array back to a PHP array
$arr = $judy->toArray(); // [0 => 100, 5 => 200, 10 => 300]
// Bulk-insert from an existing array
$judy->putAll([20 => 400, 30 => 500]);
// Retrieve multiple values at once (missing keys return null)
$values = $judy->getAll([0, 5, 99]); // [0 => 100, 5 => 200, 99 => null]For INT_TO_INT, STRING_TO_INT, and STRING_TO_INT_HASH types, increment() performs an efficient counter update:
$counters = new Judy(Judy::STRING_TO_INT);
// Increment creates the key with the given amount if it doesn't exist
$counters->increment("page_views"); // returns 1
$counters->increment("page_views"); // returns 2
$counters->increment("page_views", 10); // returns 12
$counters->increment("page_views", -3); // returns 9For detailed performance analysis, see BENCHMARK.md.
Beyond basic array access, Judy provides a rich API including:
- Set operations:
union(),intersect(),diff(),xor(),mergeWith() - Functional iteration:
forEach(),filter(),map()(C-level, bypasses Iterator overhead) - Range operations:
slice(),deleteRange(),populationCount() - Aggregation:
sumValues(),averageValues() - Batch operations:
putAll(),getAll(),keys(),values(),toArray(),fromArray() - Serialization:
serialize()/unserialize(),json_encode() - Comparison:
equals()
For complete method signatures, parameter details, and type compatibility, see API.md.
Please report bugs and issues on the GitHub repository:
https://github.com/orieg/php-judy/issues
- Eliminate redundant JLG+JLI double traversal in write hot paths for MIXED/PACKED types
- C-level
forEach()/filter()/map()performance tuning (vtable dispatch) - Binary serialization format for faster
__serialize/__unserialize - Extend set operations (
union/intersect/diff/xor) to adaptive types - Extend
increment()to adaptive types
This project is licensed under the PHP License - see the LICENSE file for details.
Contributions are welcome! Please feel free to submit a Pull Request.
- API Reference: API.md for complete method documentation
- Benchmarks: BENCHMARK.md for performance analysis
- Migration Guide: MIGRATION_2.2.0.md for version 2.2.0 changes
- Examples: Check the
examples/directory for usage examples