Skip to content

wey-gu/grpo-graph-extraction

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Qwen GRPO Graph Extraction RL Finetune

We initially explored GRPO based on the Reasoning model synthetic CoT Graph Extraction data, with LLM involved in the reward function.

.
├── LICENSE
├── ground_truth_gen # data gen via DeepSeek R1
│   ├── polished_rl_training_data.csv
│   └── r1_distill_reasoning_graph_extraction.ipynb
└── train
    └── Qwen_GRPO_Graph_Extraction.ipynb # training process

update: Seems the training notebook doesnt render properly in github, check from colab instead:

Data Gen Training
Open In Colab Open In Colab

Credits

training

About

Qwen GRPO Graph Extraction RL Finetune

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published