A Guide to Uploading Lance Datasets on the Hugging Face Hub
Build a multimodal Lance dataset, publish it to the Hub, and query the precomputed vector + FTS indexes in LanceDB, without needing to download the dataset locally.
Build a multimodal Lance dataset, publish it to the Hub, and query the precomputed vector + FTS indexes in LanceDB, without needing to download the dataset locally.
OpenClaw and similar personal autonomous agents need a local-first long-term memory layer. LanceDB fits that role with embedded deployment, filesystem-native storage, and multimodal retrieval.
Announcing native read support for Lance format on Hugging Face Hub. You can now distribute your large multimodal datasets as a single, searchable artifact (including blobs, embeddings and indexes) all in one place!
Store a multimodal dataset of recipes in LanceDB, a multimodal lakehouse for AI, and keep it fresh with CocoIndex, a declarative data transformation framework for AI with incremental processing capabilities.
A practical definition of multimodal complexity, and how LanceDB’s Multimodal Lakehouse is built to address these challenges.
A comparison of where Iceberg and Lance sit in the modern lakehouse stack. We highlight emerging architectures that are bridging the worlds of analytics and AI/ML workloads using these two formats, while being built on the same data foundation.