
@jveitchmichaelis (Collaborator) commented Aug 21, 2025

This PR adds support for a basic DinoV3 backbone for RetinaNet.
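
For orientation, the general mechanics look roughly like this (a minimal sketch, not the PR's implementation: `ViTBackbone` is a hypothetical wrapper, and `vit` is assumed to map a (B, 3, H, W) image batch to a (B, C, H/16, W/16) feature map; DinoV3 weight loading is omitted):

```python
import torch
from torch import nn
from torchvision.models.detection import RetinaNet
from torchvision.models.detection.anchor_utils import AnchorGenerator


class ViTBackbone(nn.Module):
    """Hypothetical wrapper: exposes `out_channels` so torchvision's
    RetinaNet can consume a single-scale ViT feature map."""

    def __init__(self, vit: nn.Module, out_channels: int):
        super().__init__()
        self.vit = vit
        self.out_channels = out_channels  # RetinaNet reads this attribute

    def forward(self, x: torch.Tensor) -> dict:
        # One stride-16 feature level keeps the wiring minimal.
        return {"0": self.vit(x)}


# One anchor-size tuple per feature level (a single level here).
anchor_gen = AnchorGenerator(
    sizes=((32, 64, 128),), aspect_ratios=((0.5, 1.0, 2.0),)
)
# model = RetinaNet(ViTBackbone(vit, out_channels=1024), num_classes=2,
#                   anchor_generator=anchor_gen)
```

torchvision's RetinaNet only requires that the backbone expose an `out_channels` attribute, and the anchor generator must supply exactly one size tuple per feature level.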

As this is a WIP, I've added a few improvements to the CLI for debugging and logging; some of these I'd like to PR separately. There is also a minor fix to the dataset so that it actually uses root_dir for CSVs with full image paths, and a new config option for the log folder.

To use Comet, make COMET_API_KEY and COMET_WORKSPACE available in your environment.
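
For reference, a minimal sketch of how those variables are typically consumed (this assumes the CLI wires up a Lightning `CometLogger`; keyword names can vary between Lightning versions):

```python
# Minimal sketch, assuming a Lightning CometLogger is used; exact keyword
# names may differ slightly between Lightning versions.
import os

from lightning.pytorch.loggers import CometLogger

logger = CometLogger(
    api_key=os.environ["COMET_API_KEY"],
    workspace=os.environ["COMET_WORKSPACE"],
    project_name="deepforest",  # illustrative project name
)
```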

Train with:

```bash
[uv run] deepforest --config-name dinov3 train
```

Please try to use the CLI as much as possible so we can test the user experience.

For development, I'd suggest making another config file with the train/val directories set up:

```yaml
defaults:
  - dinov3
  - _self_

train:
  csv_file:
  root_dir:

validation:
  csv_file:
  root_dir:
```

This will probably fail CI because we need to add a secret to pull the weights for testing. Locally the sanity checks pass (inference + train forward).

@jveitchmichaelis marked this pull request as draft August 21, 2025 21:38
@bw4sz self-requested a review August 21, 2025 21:49
@jveitchmichaelis force-pushed the dinov3 branch 2 times, most recently from 6639118 to 4996ef0 on August 22, 2025 20:34
@jveitchmichaelis (Collaborator, Author) commented Aug 22, 2025

I think these are roughly the different paths we're comparing (except we would always start from a COCO-pretrained ResNet):

```mermaid
flowchart TD

    %% Datasets -> Backbones
    ImageNet([ImageNet]) --> ResNet[ResNet Backbone]
    ImageNet -.-> MSCOCO([MS-COCO])
    MSCOCO --> ResNet

    Sat493M([Sat-493M]) --> Dinov3[Dinov3 Backbone]
    LVD1689M([LVD-1689M]) --> Dinov3

    %% Backbones -> Pretrained RetinaNet
    ResNet --> Baseline[Pre-Trained RetinaNet]
    Dinov3 --> Baseline

    %% Fine-tuning paths
    Baseline --> FineTuned([Hand Annotations])
    Baseline -.-> LIDAR([Weak LIDAR Supervision])
    LIDAR --> FineTuned

    %% Merge paths into evaluation
    FineTuned --> NeonTree([NeonTreeEvaluation])
```

@jveitchmichaelis (Collaborator, Author) commented Aug 22, 2025

In-progress training logs can be found here: https://www.comet.com/jveitchmichaelis/deepforest/view/new/panels

To dos:

  • Evaluation for DinoV3-ViT-L (300M params)
  • Evaluation for DinoV3-ViT-7B (7B params)
  • Evaluation for ResNet50 (25M params) to confirm reproducibility of existing pipeline

Currently performing cross-evaluation on the training dataset, followed by a "holdout" run on all train + NeonTreeEval. All Dino backbones are frozen for now (see the sketch after the list), but generally we fine-tune ResNet. Previous hyper-parameters for ResNet:

  • 40 epochs
  • lr 1e-4

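As a point of reference, "frozen" here means roughly the following (a minimal sketch assuming a torchvision-style detector with a `backbone` attribute; the helper name and optimizer choice are illustrative, not DeepForest's actual API):

```python
# Minimal sketch, assuming a torchvision-style detector with a `backbone`
# attribute. The helper name and optimizer choice are illustrative.
import torch


def freeze_backbone(model: torch.nn.Module) -> torch.optim.Optimizer:
    # Disable gradients for every backbone parameter.
    for param in model.backbone.parameters():
        param.requires_grad = False
    # Optimize only what still requires grad (FPN, heads, etc.),
    # using the previous ResNet learning rate quoted above.
    trainable = [p for p in model.parameters() if p.requires_grad]
    return torch.optim.Adam(trainable, lr=1e-4)
```
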
We may also want different hyper-parameters for feature pooling, following the conventions in ViTDet: https://arxiv.org/abs/2203.16527
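
For context, ViTDet builds a "simple feature pyramid" from the ViT's single stride-16 output rather than running an FPN over multiple backbone stages. A rough sketch of the idea (channel sizes illustrative; the paper additionally applies norm and 1x1/3x3 convs per level):

```python
import torch
from torch import nn


class SimpleFeaturePyramid(nn.Module):
    """Sketch of a ViTDet-style pyramid: multi-scale maps derived from
    the single stride-16 ViT feature map via deconvolution and pooling."""

    def __init__(self, dim: int = 1024, out_dim: int = 256):
        super().__init__()
        self.up4 = nn.Sequential(  # stride 16 -> 4
            nn.ConvTranspose2d(dim, dim // 2, 2, stride=2),
            nn.GELU(),
            nn.ConvTranspose2d(dim // 2, out_dim, 2, stride=2),
        )
        self.up2 = nn.ConvTranspose2d(dim, out_dim, 2, stride=2)  # 16 -> 8
        self.same = nn.Conv2d(dim, out_dim, 1)                    # 16 -> 16
        self.down = nn.Sequential(nn.MaxPool2d(2),                # 16 -> 32
                                  nn.Conv2d(dim, out_dim, 1))

    def forward(self, x: torch.Tensor) -> dict:
        return {"p2": self.up4(x), "p3": self.up2(x),
                "p4": self.same(x), "p5": self.down(x)}
```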

@bw4sz (Collaborator) commented Sep 9, 2025

Related to our recent conversation: is this PR WIP, or is it ready for review? I'm requested as a reviewer, but it's still marked WIP.

@jveitchmichaelis (Collaborator, Author) commented Sep 9, 2025

You requested your own review 😅 I don't anticipate many code changes here, but I would make some of the CLI improvements optional (I don't want to force Comet on people, for example). I'd welcome a review of the model aspects at least.

As for taking it out of WIP: do we want to go ahead and support this as a backbone now, or wait for the pretraining results to see whether it makes sense to add it as an option?

The only other thing would be the relative pathing we discussed, to support huge datasets that may be organised in subfolders.

@bw4sz (Collaborator) commented Sep 10, 2025

> You requested your own review 😅 I don't anticipate many code changes here, but I would make some of the CLI improvements optional (I don't want to force Comet on people, for example). I'd welcome a review of the model aspects at least.
>
> As for taking it out of WIP: do we want to go ahead and support this as a backbone now, or wait for the pretraining results to see whether it makes sense to add it as an option?
>
> The only other thing would be the relative pathing we discussed, to support huge datasets that may be organised in subfolders.

Do you think the improvements you've made are broader than just this one backbone? I think so, so they should go in regardless of the pretraining adventure. What do you think? I haven't reviewed yet; I forgot I requested it, so I'll remove myself. It's up to you when to take the WIP label off. I'll wait, there are plenty of other PRs.

@jveitchmichaelis (Collaborator, Author) commented Sep 10, 2025

I would probably include the CLI improvements. I've been trying to use the CLI as my only training command to see if there are things it's missing, and I added some sensible defaults for loggers/callbacks, the output folder, etc.

But otherwise the main contribution is the backbone definition and a tweak to how backbones are selected for fine-tuning (e.g. ImageNet / COCO).
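
For the ImageNet / COCO distinction, the underlying torchvision mechanics look roughly like this (a sketch of the idea, not the PR's code; the function name is illustrative):

```python
# Illustrative sketch of selecting pretrained weights with torchvision;
# not the PR's actual code.
from torchvision.models import ResNet50_Weights
from torchvision.models.detection import (
    RetinaNet_ResNet50_FPN_Weights,
    retinanet_resnet50_fpn,
)


def build_retinanet(pretrain: str = "coco"):
    if pretrain == "coco":
        # Full COCO-pretrained detector, backbone included.
        return retinanet_resnet50_fpn(
            weights=RetinaNet_ResNet50_FPN_Weights.COCO_V1
        )
    # ImageNet-pretrained backbone only; detection head from scratch.
    return retinanet_resnet50_fpn(
        weights=None, weights_backbone=ResNet50_Weights.IMAGENET1K_V1
    )
```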

@jveitchmichaelis force-pushed the dinov3 branch 8 times, most recently from 8864087 to 0ed2528 on September 13, 2025 00:40
@jveitchmichaelis (Collaborator, Author) commented
Going to start moving out-of-scope changes into other PRs. The core of this one should just be the model backbone, IMO.

@jveitchmichaelis force-pushed the dinov3 branch 7 times, most recently from 6ceb099 to d856c2d on September 13, 2025 16:22
@bw4sz removed their request for review October 9, 2025 17:58