Data Sharing & Collaboration

Share AI Tables Seamlessly

Collaborate on multimodal AI datasets with complete reproducibility. Share tables between local and cloud with read-only replicas, complete versioning, and full data lineage.

Documentation
MULTIMODAL

Handle all media types

AI-READY

Support AI workloads

VERSIONED

Track versions, stay synced

REPRODUCIBLE

Same data, same results

Key Features

Complete Multimodal API

Work with video, audio, images, and documents using one API

  • Publishers share multimodal data in simple tables
  • Recipients access all media types through one interface

Complete Sharing System

Eliminate manual file transfers and disconnected script sharing

  • Publishers bundle data and all generated outputs in tables
  • Recipients get embeddings and model outputs automatically

Complete Control & Flexibility

Maintain complete reproducibility, versioning, and lineage

  • Publishers push updates and control access with public or private sharing
  • Recipients maintain local development workflows with on-demand sync

How It Works

01

Publish

Publish your local table to Pixeltable Cloud with a single command. Control access with public or private visibility.

02

Replicate

Team members replicate the shared table to their local environment as a read-only replica. All data, schema, and computed columns are copied.

03

Sync

Push and pull updates seamlessly. Changes are automatically tracked with full versioning and lineage.

Explore Public Datasets

Discover multimodal datasets shared by the community

Code Examples

Three simple workflows for complete data sharing and collaboration

Note: Publishing requires a Pixeltable Cloud account and API key. Replicating requires no account.
publish_example.py
1import pixeltable as pxt
2from pixeltable.functions import openai
3import os
4
5# Set your Pixeltable Cloud API key
6os.environ['PIXELTABLE_API_KEY'] = '<YOUR_API_KEY>'
7
8# Create and populate your table
9t = pxt.create_table('images', {'input_image': pxt.Image})
10
11# Add AI-powered computed columns
12t.add_computed_column(
13 description=openai.vision(
14 prompt="Describe this image in detail.",
15 image=t.input_image,
16 model='gpt-4o-mini'
17 )
18)
19
20# Insert data
21t.insert(input_image='https://example.com/photo.jpg')
22
23# Publish to cloud (make it shareable)
24pxt.publish(
25 source=t,
26 destination_uri='pxt://org_name/shared_images',
27 access='private' # or 'public' for open access
28)

Use Cases

Perfect for teams collaborating on AI and ML projects

Team Collaboration

Share training datasets, models, and results across DS/AI/ML teams with complete versioning and reproducibility.

Data Science Projects

Collaborate on data analysis with shared cleaned datasets, computed embeddings, and version-tracked reproducibility.

Production Deployment

Develop locally, then publish to cloud for production. Push updates when ready.

Dataset Distribution

Share public datasets and benchmark results with the research community for reproducible science.

Start sharing today

Join developers and researchers collaborating on multimodal AI datasets.

Documentation