Submit repository
Discover trends that matter
Trending repositories
Daily
Weekly
Monthly
Yearly
Live mentions
Topics
GitHub trending
Repositories
Developers
Insights
Stats
Eventual-Inc/Daft — GitHub trending stats & insights | Trendshift
Sponsor spot open
·
promote your product
Embed Badge
Visit GitHub
Eventual-Inc/Daft
#
AI infrastructure
High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale
Rust
5.6k
492
172 contributors
Apache License 2.0
website
Social mentions
Recent discussions about this repository across the web
Flight shuffle now compresses with LZ4 by default. Frames are compressed once on the map side and stay compressed across disk and the wire. 1 TB TPC-H repartition sweep: ~10% faster on local NVMe,…
@everettkleven · x.com
🚢 Daft v0.7.15 just shipped. try_cast() converts types without crashing your pipeline — invalid values become null instead of throwing a runtime error. Also in this release: LZ4 flight shuffle…
@everettkleven · x.com
Flight shuffle now compresses with LZ4 by default. Frames are compressed once on the map side and stay compressed across disk and the wire. 1 TB TPC-H repartition sweep: ~10% faster on local NVMe…
@everettkleven · x.com
做 RAG、训练数据清洗或多模态检索时,数据经常散在 Parquet、图片、音频、视频、HuggingFace 数据集里。预处理脚本越堆越乱,最后很难复用。Daft 是一个面向 AI 和多模态工作负载的数据引擎。 GitHub: 它把这些数据处理任务放进统一的 DataFrame / ETL 管线里,支持 Parquet、Iceberg、Ray 等生态,也能处理非结构化媒体。适合把…
@wsl8297 · x.com
Processes structured data alongside images, audio, and video
@tom_doerr · x.com
🚢 Daft v0.7.14 has shipped Parquet reader rewrite — up to 17x faster remote reads Streaming distributed limits Native UUIDv7 generation JSON array/object functions
@everettkleven · x.com
Distributed limits in v0.7.14 are now streaming. Ray actor holds atomic (skip, take, done) state. Workers claim per morsel, slice in place. When budget is exhausted, scheduler cancels remaining…
@everettkleven · x.com
Arrow PyCapsule Interface in v0.7.11. Zero-copy DataFrame exchange between Daft, PyArrow, Polars, cuDF — direct pointers via the C Data Interface. No more serializing through the Python heap. via
@everettkleven · x.com
Daft now has bidirectional ASOF joins. Backward: "what happened before this?" >>> latest right row where right.key <= left.key Forward: "what happens next?" >>> earliest right row where right.key >=…
@everettkleven · x.com
Repository activities
repository's daily and monthly activities across stars, forks, merged PRs, issues, and closed issues
GitHub trending history
Shows when the repository has appeared on GitHub Trending across any language
all language ranking
rust ranking