Dashboard Multi Dataset Support #244

ritch · 2025-08-13T18:56:19Z

No description provided.

brimoor · 2025-08-15T05:10:22Z

plugins/dashboard/__init__.py


        return {}

+    def get_union_view(self, dataset_names):


Queries across multiple datasets should probably be strictly limited to "full dataset" plots.

This implementation combines the current ctx.view with the entire contents of the other datasets, which is weird behavior. I think "multiple dataset" queries should do one of two things:

1. Be strictly limited to "full dataset" queries, i.e. use ctx.dataset.add_stage() instead of ctx.view.add_stage().

2. Apply the current view's filters to all datasets. For some views, this would be as simple as injecting the Mongo() stage as the first stage in ctx.view. However, for views that involve things like limit/skip/take, we'd need a version of the concat() stage that allows combining multiple datasets instead.
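
One rough way to choose between the two options is to check whether the current view contains any order- or count-dependent stages. The sketch below is plain Python with no FiftyOne dependency. `POSITIONAL_STAGES`, `can_replay_view`, and `choose_strategy` are hypothetical names, and it assumes the view's stages are available as a list of stage class names. Falling back to a full-dataset query is only one possible policy. A multi-dataset concat() would be the alternative.

```python
# Hypothetical helper, not part of the PR: decides whether a view's stages
# could be re-applied to other datasets as-is.

# Stage names assumed to depend on sample order or count, following the
# limit/skip/take examples above. This set is an assumption, not exhaustive.
POSITIONAL_STAGES = frozenset({"Limit", "Skip", "Take"})


def can_replay_view(stage_names):
    """Return True if no stage in the view is order- or count-dependent.

    `stage_names` is a list of stage class names, e.g. ["Match", "Limit"].
    """
    return not any(name in POSITIONAL_STAGES for name in stage_names)


def choose_strategy(stage_names):
    """Pick between replaying the view's filters and a full-dataset query."""
    if can_replay_view(stage_names):
        # Second option: apply the same filters to every dataset
        return "replay_filters"
    # First option: ignore the view and query whole datasets
    return "full_dataset"
```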

Querying across multiple datasets is a bit dubious because there is no guarantee that other datasets will have the correct field names/types to be queried by whatever filter you've built based on the current dataset's schema.

TL;DR: should we add guardrails here to ensure the user doesn't define an invalid plot? 🤔

Option 2 is tirc
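
As a minimal sketch of what such a guardrail could look like, the function below checks that every selected dataset exposes the fields a plot needs, with matching types. It works on plain dicts so it runs without FiftyOne. `find_incompatible_datasets` and the schema layout (field path mapped to a type name) are assumptions for illustration, and how the schemas are collected from each dataset is left out.

```python
# Hypothetical guardrail, not part of the PR: reports datasets whose schema
# does not match the fields a plot needs.


def find_incompatible_datasets(required_fields, schemas):
    """Return {dataset_name: [problem, ...]} for datasets that fail the check.

    required_fields: {field_path: type_name} expected by the plot
    schemas: {dataset_name: {field_path: type_name}} for each dataset
    """
    problems = {}
    for name, schema in schemas.items():
        issues = []
        for path, expected in required_fields.items():
            actual = schema.get(path)
            if actual is None:
                issues.append(f"missing field '{path}'")
            elif actual != expected:
                issues.append(
                    f"field '{path}' is {actual}, expected {expected}"
                )
        if issues:
            problems[name] = issues
    return problems
```

A plot definition could be rejected, or the offending datasets skipped with a warning, whenever the returned dict is non-empty.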

brimoor · 2025-08-15T05:16:21Z

plugins/dashboard/__init__.py

+            return [ctx.dataset]
+
+        dataset_names = item.selected_datasets
+        if "all" in item.selected_datasets:


ALL datasets??? 🤯🤯🤯

These plots will surely take a loooong time to generate when the user has many datasets. Are we sure we can recommend this as a usable option?

Another consideration: how likely is it that any given aggregation would actually be valid across all datasets? Even if you are plotting a default field like metadata, image and video datasets have different attributes (e.g., metadata.width for image datasets and metadata.frame_width for video datasets).
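
Both concerns could be softened with small checks before any aggregation runs. The sketch below is plain Python under stated assumptions: `resolve_dataset_names`, `width_field_for`, and the `max_datasets` cap are hypothetical and not part of the PR, and the cap value is arbitrary.

```python
# Hypothetical helpers, not part of the PR.

# Width field per media type, following the image vs video example above.
WIDTH_FIELD_BY_MEDIA_TYPE = {
    "image": "metadata.width",
    "video": "metadata.frame_width",
}


def resolve_dataset_names(selected, available, max_datasets=10):
    """Expand "all" into concrete names and refuse overly large selections.

    max_datasets is an arbitrary illustrative cap, not a recommended value.
    """
    names = list(available) if "all" in selected else list(selected)
    if len(names) > max_datasets:
        raise ValueError(
            f"{len(names)} datasets selected; limit is {max_datasets}"
        )
    return names


def width_field_for(media_type):
    """Return the metadata width path for a media type, or None if unknown."""
    return WIDTH_FIELD_BY_MEDIA_TYPE.get(media_type)
```

Raising on an oversized selection keeps the failure visible to the user instead of silently launching a very slow plot.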

brimoor · 2025-08-15T05:17:23Z

plugins/dashboard/__init__.py

        inputs = types.Object()
+
+        # Dataset selection tabs
+        dataset_mode_choices = types.TabsView()


I'd need to see it IRL to confirm, but I do think using tabs is a reasonable UX here 👍

ritch added 4 commits August 7, 2025 14:57

feat(dashboard): multi dataset support

cbf741d

chore(dashboard): bump version to 1.1.0

27491a5

feat(dashboard): cache dataset listing

ebb52cd

fix: dashbaord multi dataset loading

a09da46

brimoor reviewed Aug 15, 2025


brimoor marked this pull request as draft September 29, 2025 17:05

3 participants
