Crate

datafusion

Browse Rust repositories that depend on datafusion.

38
Repositories
243k
Total stars
53
Active
46
Owners
Browse 57 repositories using datafusion in Repos →
Related topics
Often used with
Used by these organizations
50 of 57 repositories · ranked by stars
rustfs29kactive

🚀2.3x faster than MinIO for 4KB object payloads. RustFS is an open-source, S3-compatible high-performance object storage system supporting migration and coexistence with other S3-compatible platforms such as MinIO and Ceph.

nautilus_trader23kactive

Production-grade Rust-native trading engine with deterministic event-driven architecture

windmill17kactive

Open-source developer platform to power your entire infra and turn scripts into webhooks, workflows and UIs. Fastest workflow engine (13x vs Airflow). Open-source alternative to Retool and Temporal.

risingwave9.1kactive

Event streaming platform for agentic AI. Continuously ingest, transform, and serve event streams in real time, at scale.

lance6.6kactive

Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming..

readyset5.2kactive

Readyset is a MySQL and Postgres wire-compatible caching layer that sits in front of existing databases to speed up queries and horizontally scale read throughput. Under the hood, ReadySet caches the results of cached select statements and incrementally updates these results over time as the underlying data changes.

restate4.0kactive

Restate is the platform for building resilient applications that tolerate all infrastructure faults w/o the need for a PhD.

vortex3.0kactive

An extensible, state-of-the-art framework for columnar compression, and the fastest FOSS columnar file format. Formerly at @spiraldb, now an Incubation Stage project at LFAI&Data, part of the Linux Foundation.

spiceai3.0kactive

A portable accelerated SQL query, search, and LLM-inference engine, written in Rust, for data-grounded AI apps and agents.

sail2.9kactive

Drop-in Apache Spark replacement written in Rust, unifying batch processing, stream processing, and compute-intensive AI workloads.

auron1.8kactive

The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query processing

⑂ 226133
cnosdb1.8kdormant

A cloud-native open source distributed time series database with high performance, high compression ratio and high availability.

rocky266active

A SQL transformation engine that type-checks your whole pipeline and catches breaking changes before they run — branches, replay, column-level lineage, compile-time contracts, per-model cost. Adapters: Databricks, Snowflake, BigQuery, DuckDB. Single static Rust binary. Apache 2.0.

timefusion170active

A timeseries database created for events, logs, traces and metrics. Speaks the postgres dialect, and stores data in s3 via delta lake protocol

datafusion-functions-json57active

JSON support for DataFusion (unofficial)

⑂ 2911
← Browse all repos