How Pixeltable Works

Store multimodal data in tables, define AI workflows as computed columns, and query everything together. Pixeltable handles orchestration, caching, and model execution automatically.

Build Your First AI Workflow

Four steps to create powerful multimodal AI applications with Pixeltable's declarative approach.

Step 1 / 4

Create Tables with Multimodal Types

The Declarative Foundation

Define tables for any data type — images, videos, documents, structured data — in a single schema. Add computed columns that transform your data using Python expressions. Pixeltable orchestrates computation automatically for all existing and future rows.

  • Unified schema for structured, unstructured, and multimodal data
  • Computed columns run automatically — define once, never re-run manually
  • Pixeltable manages the dependency graph and incremental updates
step_1_tables.py
import pixeltable as pxt

# Create a table with typed columns
t = pxt.create_table('films', {
    'name': pxt.String,
    'revenue': pxt.Float,
    'budget': pxt.Float,
}, if_exists='replace')

t.insert([
    {'name': 'Inside Out', 'revenue': 800.5, 'budget': 200.0},
    {'name': 'Toy Story', 'revenue': 1073.4, 'budget': 200.0},
])

# Computed column — auto-calculated for every row
t.add_computed_column(profit=(t.revenue - t.budget))

# +------------+--------+
# | name       | profit |
# +------------+--------+
# | Inside Out |  600.5 |
# | Toy Story  |  873.4 |
# +------------+--------+
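The "define once, runs for all existing and future rows" behavior can be illustrated with a toy table in plain Python — an analogy only, not Pixeltable's internals:

```python
# Toy analogy: a computed column is a stored formula that is backfilled
# over existing rows when added, and applied to every row inserted later.

class ToyTable:
    def __init__(self):
        self.rows = []
        self.computed = {}  # column name -> formula over a row

    def add_computed_column(self, **formulas):
        for name, formula in formulas.items():
            self.computed[name] = formula
            for row in self.rows:          # backfill existing rows once
                row[name] = formula(row)

    def insert(self, rows):
        for row in rows:
            for name, formula in self.computed.items():
                row[name] = formula(row)   # new rows computed on insert
            self.rows.append(row)

toy = ToyTable()
toy.insert([
    {'name': 'Inside Out', 'revenue': 800.5, 'budget': 200.0},
    {'name': 'Toy Story', 'revenue': 1073.4, 'budget': 200.0},
])
toy.add_computed_column(profit=lambda r: r['revenue'] - r['budget'])
toy.insert([{'name': 'Up', 'revenue': 735.1, 'budget': 175.0}])
# every row, old and new, now carries a 'profit' value
```

The real system persists the formula with the table's schema, so there is never a "recompute" script to run by hand.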
Step 2 / 4

Add AI Models as Computed Columns

Bring Your Own Model

Wrap any Python function — data cleaning, model inference, API calls — with the @pxt.udf decorator to turn it into a reusable pipeline component. Pixeltable handles parallelization, caching, and dependency resolution automatically.

  • @pxt.udf turns any Python function into a pipeline component
  • Integrate any model: HuggingFace, OpenAI, custom PyTorch, etc.
  • Pixeltable caches results and only recomputes when inputs change
step_2_ai_models.py
import PIL.Image
import pixeltable as pxt

# Any Python function becomes a pipeline component
@pxt.udf
def detect(image: PIL.Image.Image) -> list[str]:
    from yolox.models import Yolox
    from yolox.data.datasets import COCO_CLASSES
    model = Yolox.from_pretrained('yolox_s')
    result = model([image])
    return [COCO_CLASSES[label] for label in result[0]['labels']]

# Apply as a computed column — runs for every row
# ('t' here is a table with an image column named 'image')
t.add_computed_column(classification=detect(t.image))

# +----------------------+------------------+
# | image                | classification   |
# +----------------------+------------------+
# | <Image: cat.jpg>     | ['cat', 'couch'] |
# | <Image: birds.png>   | ['bird']         |
# +----------------------+------------------+
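The "only recomputes when inputs change" rule is essentially input-keyed memoization. A minimal plain-Python sketch of the idea (a simplification, not Pixeltable's actual cache):

```python
# Sketch: cache a computed value keyed by its input, so repeated
# evaluation over unchanged inputs never re-invokes the model.

calls = []

def expensive_model(x):
    calls.append(x)          # track real invocations
    return x * 2

cache = {}

def cached(x):
    if x not in cache:       # unseen input -> compute and remember
        cache[x] = expensive_model(x)
    return cache[x]          # unchanged input -> served from cache

assert cached(3) == 6   # computed
assert cached(3) == 6   # cache hit, model not called again
assert cached(4) == 8   # new input -> recomputed
```

In Pixeltable the "key" is the row's input values plus the column's definition: editing the UDF or the inputs triggers recomputation, nothing else does.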
Step 3 / 4

Search Across All Data Types

Built-in Vector Search

Add embedding indexes with one line — no separate vector database needed. Pixeltable generates embeddings, stores them co-located with your data, and keeps them automatically in sync. Search by text, image, or any modality.

  • No separate vector DB: embeddings live next to your data
  • Automatic sync: embeddings update when source data changes
  • Cross-modal search: text-to-image, image-to-image, and more
step_3_vector_search.py
import pixeltable as pxt
from pixeltable.functions.huggingface import clip

# Add a CLIP embedding index — one line
# ('images' is an existing table with an image column named 'img')
images.add_embedding_index(
    'img',
    embedding=clip.using(model_id='openai/clip-vit-base-patch32')
)

# Text-to-image similarity search
query_text = 'a dog playing fetch'
sim = images.img.similarity(query_text)
results = images.order_by(sim, asc=False).limit(5).collect()

# Image-to-image search works the same way
query_image = 'https://example.com/dog.jpg'
sim = images.img.similarity(query_image)
results = images.order_by(sim, asc=False).limit(5).collect()
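Under the hood, a similarity search embeds the query and ranks stored embeddings by a distance score. A plain-Python sketch with made-up 3-dimensional vectors standing in for CLIP embeddings:

```python
# Sketch: score every stored embedding against the query embedding
# with cosine similarity, then return rows ranked best-first.
import math

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(x * x for x in b))
    return dot / norm

# Hypothetical embedding index: image name -> stored vector
index = {
    'dog.jpg':  [0.9, 0.1, 0.0],
    'cat.jpg':  [0.1, 0.9, 0.0],
    'bird.png': [0.0, 0.1, 0.9],
}
query = [1.0, 0.0, 0.0]  # made-up embedding of "a dog playing fetch"

ranked = sorted(index, key=lambda k: cosine(index[k], query), reverse=True)
# ranked[0] is the closest match: 'dog.jpg'
```

Because CLIP maps text and images into the same vector space, the identical ranking step serves both text-to-image and image-to-image queries.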
Step 4 / 4

Build Incremental RAG Pipelines

Incremental & Automated

Define the entire RAG workflow — chunking, embedding, retrieval, generation — declaratively. Pixeltable orchestrates it end-to-end. Because the pipeline is incremental, only new or updated documents get processed. Your RAG system stays efficient and always up-to-date.

  • Declarative RAG: chunking → embedding → retrieval → LLM in a few lines
  • Incremental: only new/updated documents get processed
  • Everything co-located: source docs, chunks, embeddings, Q&A — all in sync
step_4_rag.py
import pixeltable as pxt
from pixeltable.functions import openai, huggingface
from pixeltable.iterators import DocumentSplitter

# 1. Source documents
docs = pxt.create_table('docs', {'doc': pxt.Document})
docs.insert([{'doc': 's3://my-data/annual-report.pdf'}])

# 2. Auto-chunk into sentences
chunks = pxt.create_view('chunks', docs,
    iterator=DocumentSplitter.create(
        document=docs.doc, separators='sentence'))

# 3. Embedding index
embed = huggingface.sentence_transformer.using(model_id='all-MiniLM-L6-v2')
chunks.add_embedding_index('text', string_embed=embed)

# 4. Retrieval function
@pxt.query
def get_context(query_text: str):
    sim = chunks.text.similarity(query_text)
    return chunks.order_by(sim, asc=False).limit(5)

# 5. RAG pipeline — ask a question, get an answer
qa = pxt.create_table('qa', {'prompt': pxt.String})
qa.add_computed_column(context=get_context(qa.prompt))
qa.add_computed_column(
    answer=openai.chat_completions(
        model='gpt-4o-mini',
        messages=[{
            'role': 'user',
            'content': qa.context.text + '\nQuestion: ' + qa.prompt
        }]
    ).choices[0].message.content
)

# Insert a question — pipeline runs automatically
qa.insert([{'prompt': 'Key takeaways from the report?'}])
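The incremental property can be sketched in plain Python: each stage records what it has already processed and touches only the delta. This is a simplification of how Pixeltable maintains views and indexes, not its actual implementation:

```python
# Sketch: an incremental stage processes only documents it has not
# seen before, so re-running over a grown corpus costs O(new docs).

processed = set()
processed_chunks = []

def process_new(doc_list):
    new = [d for d in doc_list if d not in processed]   # the delta
    for doc in new:
        processed_chunks.append(doc + ':chunk')  # stand-in for chunk/embed
        processed.add(doc)
    return len(new)

doc_list = ['report.pdf']
assert process_new(doc_list) == 1   # first run processes everything
doc_list.append('memo.pdf')
assert process_new(doc_list) == 1   # only the new document is processed
assert process_new(doc_list) == 0   # nothing new -> nothing recomputed
```

In the pipeline above the same logic applies at every stage: inserting one document re-chunks and re-embeds only that document, and inserting one prompt runs retrieval and generation only for that row.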

Ready to build?

Install Pixeltable and create your first multimodal AI workflow in minutes.

$ pip install pixeltable