Learned Static Functions

Static function data structures associate a static set of keys with values while allowing arbitrary output values for queries involving keys outside the set. This enables them to use significantly less memory. Several techniques are known, with compressed static functions approaching the zero-order empirical entropy of the value sequence. Learned static functions use machine learning to capture correlations between keys and values. For each key, a model predicts a probability distribution over the values, from which we derive a key-specific prefix code to compactly encode the true value. The resulting codeword is stored in a classic static function data structure. This design allows learned static functions to break the zero-order entropy barrier while still supporting point queries.

In this repository, we give the first implementation of a learned static function based on a modified version of BuRR that we also provide. You can find the original here: Code, paper.

File structure

train folder: Code for training the ML model using TensorFlow
BuRR-VLR repository: Implementation of our variable-length BuRR adaption
include folder: Implementation of our learned data structure
csf folder: Benchmark code for the GOV competitor

Reproducibility

For convenience, we provide a Docker image that can be used to reproduce our experiments. To run it, clone our repo and run Docker (as superuser).

git clone --recursive https://github.com/gvinciguerra/LearnedStaticFunction.git
docker build --pull --rm -t lsf .
docker run -it -v $(pwd)/lrdata:/lrdata -v $(pwd)/data_sux4j:/data_sux4j -v $(pwd)/out:/out lsf

It will run the training and the benchmarks for all competitors. It will automatically generate the paper based on those results. The paper can be found in /out/main.pdf together with other raw benchmark outputs. Note that the runtime is several hours.

License

This code is licensed under the GPLv3.

Name		Name	Last commit message	Last commit date
Latest commit History 96 Commits
csf		csf
include/lsf		include/lsf
lib		lib
paper		paper
train		train
.dockerignore		.dockerignore
.gitignore		.gitignore
.gitmodules		.gitmodules
CMakeLists.txt		CMakeLists.txt
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
filter_tuner.cpp		filter_tuner.cpp
plot_model_calibration.cpp		plot_model_calibration.cpp
ribbon_learned_bench.cpp		ribbon_learned_bench.cpp

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Learned Static Functions

File structure

Reproducibility

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Learned Static Functions

File structure

Reproducibility

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages