Skip to content

BODF/Data-Modeling

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 

Repository files navigation

Data-Modeling

Scripts for data modeling

Grasshopper Algorithm for automated feature selection in ML pipelines:

  • Partly experimental, introduces fairness and balance metrics not in the original authors' work (Saremi et al 2017)
  • Essentially brute forces finding an optimal parameter set for feeding into another ML algorithm
  • Uses KNN, so it will get bogged down by massive datasets and is not parellelizable in Spark
  • KNN was chosen by the original authors because it is particularly sensitive to poor parameter selections

About

Scripts for data modeling

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages