YTsaurus is a scalable and fault-tolerant open-source big data platform.
-
Updated
May 21, 2026 - C++
YTsaurus is a scalable and fault-tolerant open-source big data platform.
Build scalable data pipelines on YTsaurus with automatic stage management, local development simulation, and more.
Demo pipeline on YT Framework and YTsaurus: list MSVD videos in S3, join captions, extract frames, and export ChatML training data at scale.
Запуск PySpark-задания в Yandex Managed Service for YTsaurus.
Add a description, image, and links to the ytsaurus topic page so that developers can more easily learn about it.
To associate your repository with the ytsaurus topic, visit your repo's landing page and select "manage topics."