-
Notifications
You must be signed in to change notification settings - Fork 385
Animalkingdom pr #833
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Animalkingdom pr #833
Conversation
…nition on wildlife videos - Implements animal, action, and activity recognition subtasks for the AnimalKingdom dataset - Includes dataset builder, YAML configs, and strict Jaccard metric evaluation - Utilities for prompt/answer extraction, result processing, and HuggingFace video download - See animalkingdom/README.md for details and usage instructions
try: | ||
from huggingface_hub import hf_hub_download | ||
|
||
# Download the video file from the HuggingFace dataset repository | ||
local_path = hf_hub_download(repo_id="luciehmct/animalkingdom-test", filename=clip_path, repo_type="dataset") | ||
|
||
if os.path.exists(local_path): | ||
return [local_path] | ||
else: | ||
eval_logger.error(f"Downloaded file does not exist: {local_path}") | ||
return [clip_path] | ||
|
||
except Exception as e: | ||
eval_logger.error(f"Failed to download video {clip_path}: {str(e)}") | ||
return [clip_path] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This part is a bit hacky. Should not really get clip path like this
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hi, I scan through some part of the PR. The question remains as most of your previous PR.
I think the major problem is the path and the configuration though. Maybe for most of the PR, you can again check what is the recommended way to pass the video path through doc_to_visual. |
Summary
_cot.jsonl
annotations into Hugging Face–style datasets via the same dataset_builder.py used for MammAlps (--dataset animalkingdom
).doc_to_visual
,doc_to_text
,doc_to_target
,process_results
) and strict Jaccard scoring/aggregation pipeline introduced for MammAlps.num_frame=32
,use_temporal_context=True
).Details
luciehmct/animalkingdom-test
, covering diverse YouTube wildlife clips. While MammAlps comes from alpine camera-trap footage, both datasets are standardized by the same builder, ensuring consistent JSON records and folder layouts.animalkingdom_animal.yaml
,animalkingdom_action.yaml
,animalkingdom_activity.yaml
) are direct analogs of the MammAlps configs.results/<model>_<timestamp>/animalkingdom_<subtask>.jsonl
with prompt, response, parsed predictions, ground truth, and per-example Jaccard score.lmms_eval/api/metrics.py
, identical to MammAlps.use_temporal_context=True
flag enriches prompts with frame timestamps, exactly as documented for MammAlps.Testing